aryve Vision
What is it
The aryve Vision component is a camera-based positioning module that enables high-precision indoor localization using visual input from a device.
The component processes live camera data, matches it against a pre-mapped environment, and returns the device’s precise position and orientation within the aryve coordinate system.
What is it used for
The Vision component is designed for:
- High-accuracy indoor positioning in pre-mapped spaces
- Visual localization without hardware installations
- Navigation in any indoor multi floor space
- Augmented Reality (AR) navigation experiences
- Robotics or device guidance indoors
How it integrates into the aryve ecosystem
The aryve Vision component integrates as a positioning service within the aryve ecosystem, providing real-time, camera-based localization to client applications and other aryve services.
The Vision component relies on precomputed visual maps generated within the aryve ecosystem. At runtime, the Vision component continuously matches incoming camera frames against this map data to determine the device pose and centimeter precise position. The system also keeps the information of the device's session to increase the precision based on the navigation context of the device.
aryve Vision communicates with the aryve Studio and aryve Guide components via REST API.
Input requirements
What is needed to use aryve Vision is a device with a camera and access to internet, that can send the image to the server to be localized and receive back the position of the device.
The camera requirements are:
- RGB camera
- Minimum resolution: 1024x768 pixels
- Camera intrinsics
Device requirements:
- Image processing (jpeg encoding)
- Access to camera stream
- Network connection
Physical requirements recommendations
- At a height of 1,5 meters the camera should be pointing horizontally
- If the camera is lower/higher then it has to be adjusted to not point at the floor or ceiling but at the surrounding space
- While we can work with some motion blur, fast rotation and shakiness should be avoided
Output data
The Vision component returns real-time localization data to the client device.
- Position
- x, y, z coordinates
- Defined in the map global system (every map has an origin point and every position in the map is referencing to it)
- Unit: meters
- With orientation
- Accuracy
- Position accuracy estimate
- Localization status (failed, not failed)
Important to note
- Performance depends on lighting conditions, clarity of the image, visual features richness (Note:standard smartphones are better at handling light conditions)
- Map quality might degrate over time in a changing environment
- On devices not using the aryve Guide app, people have to be blurred on the device to ensure you are GDPR compliant