This repository contains the Unity app code for the AV-ALOHA project for VR passthrough + teleoperation. You can find more details about the project in the following resources:
- AV-ALOHA Code: AV-ALOHA
- Project Page: AV-ALOHA Project
- Paper: "Active Vision Might Be All You Need: Exploring Active Vision in Bimanual Robotic Manipulation"
This repository includes the full Unity project and the APK for Meta Quest 2 and Meta Quest 3. While the code may not be perfectly organized, the key scripts are limited and straightforward to understand. For WebRTC communication and video streaming, refer to the following script:
- Main WebRTC Streaming Script:
Assets/Scripts/PassthroughScene/WebRTCStreamer.cs
This script renders two separate video streams for each eye (left and right), which minimizes compression issues compared to combining the streams into one. The project also includes options to use a TURN server, although this feature has not been tested yet.
- APK:
/TwoStreamGuidedVision.apk
If you wish to modify the Unity project, follow these steps:
- Clone the repository and open the project located at
/Guided-Vision
in Unity. - Ensure that you are using Unity Editor version 2022.3.20f1.
- All settings and dependencies should load automatically, but verify that the settings below are configured correctly.
- Build the project for Android.
These settings should already be configured, but it’s important to check them:
-
Graphics API:
Go toPlayer
>Android
>Graphics API
, and make sure thatOpenGLES3
is selected (not Vulkan). Using Vulkan may cause crashes in the WebRTC package. -
Stereo Rendering Mode:
Go toProject Settings
>XR Plug-in Management
>Oculus
>Android
and ensure that Stereo Rendering Mode is set toMulti Pass
. This allows rendering two separate images to the left and right eye.