aryve Vision

What is it

The aryve Vision component is a camera-based positioning module that enables high-precision indoor localization using visual input from a device.

The component processes live camera data, matches it against a pre-mapped environment, and returns the device’s precise position and orientation within the aryve coordinate system.

What is it used for

The Vision component is designed for:

High-accuracy indoor positioning in pre-mapped spaces
Visual localization without hardware installations
Navigation in any indoor multi floor space
Augmented Reality (AR) navigation experiences
Robotics or device guidance indoors

How it integrates into the aryve ecosystem

The aryve Vision component integrates as a positioning service within the aryve ecosystem, providing real-time, camera-based localization to client applications and other aryve services.

The Vision component relies on precomputed visual maps generated within the aryve ecosystem. At runtime, the Vision component continuously matches incoming camera frames against this map data to determine the device pose and centimeter precise position. The system also keeps the information of the device's session to increase the precision based on the navigation context of the device.

aryve Vision communicates with the aryve Studio and aryve Guide components via REST API.

Input requirements

What is needed to use aryve Vision is a device with a camera and access to internet, that can send the image to the server to be localized and receive back the position of the device.

The camera requirements are:

RGB camera
Minimum resolution: 1024x768 pixels
Camera intrinsics

Device requirements:

Image processing (jpeg encoding)
Access to camera stream
Network connection

Physical requirements recommendations

At a height of 1,5 meters the camera should be pointing horizontally
If the camera is lower/higher then it has to be adjusted to not point at the floor or ceiling but at the surrounding space
While we can work with some motion blur, fast rotation and shakiness should be avoided

Output data

The Vision component returns real-time localization data to the client device.

Position

x, y, z coordinates
Defined in the map global system (every map has an origin point and every position in the map is referencing to it)
Unit: meters
With orientation

Accuracy

Position accuracy estimate
Localization status (failed, not failed)

Important to note

Performance depends on lighting conditions, clarity of the image, visual features richness (Note:standard smartphones are better at handling light conditions)
Map quality might degrate over time in a changing environment
On devices not using the aryve Guide app, people have to be blurred on the device to ensure you are GDPR compliant

What is it​

What is it used for​

How it integrates into the aryve ecosystem​

Input requirements​

Physical requirements recommendations​

Output data​

Important to note​