Unity Editor version: 2022.3.42f1
ML2 OS version: 1.8.0
Unity SDK version: 2.5.0
Host OS: macOS
Hi, I am working on a project where I would like to take the 2D coordinates from my object detection model and convert them to world space using the depth camera. But I'm a bit confused about where to start, as the other posts here seem to be using a different API than the one in the current documentation. Could you please point me in the right direction? Thank you!
I think the other link was in reference to how another device aligns the data from the two sensors, not that it is specific to the device. You are correct to use the Pixel Sensor API.
We do not provide SDK functionality to align or synchronize sensors, so this would need to be done with custom logic in your application. Note that the RGB camera and the Depth camera are not located at the same place on the Magic Leap headset, so you will need to use the intrinsic and extrinsic data to align the RGB and depth images.
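As a rough illustration of that alignment (this is not official SDK code), the usual approach is to back-project each depth pixel into a 3D point using the depth camera intrinsics, transform that point into the RGB camera's frame using the relative extrinsics, and then project it with the RGB intrinsics. In the sketch below, the CameraIntrinsics struct and the depthToRgb matrix are placeholders for the values you read from the sensor APIs, and lens distortion and image-coordinate conventions are ignored.
Alignment example (a sketch under the assumptions above)
// Placeholder container for the intrinsics you read from the sensor APIs.
public struct CameraIntrinsics
{
    public float fx, fy, cx, cy;
}
// Back-projects a depth pixel (u, v) with a depth value in meters into a 3D point
// in the depth camera's local frame (simple pinhole model, no distortion).
public static Vector3 DepthPixelToPoint(CameraIntrinsics depth, float u, float v, float depthMeters)
{
    float x = (u - depth.cx) / depth.fx * depthMeters;
    float y = (v - depth.cy) / depth.fy * depthMeters;
    return new Vector3(x, y, depthMeters);
}
// Projects a 3D point already expressed in the RGB camera's frame to RGB pixel coordinates.
public static Vector2 PointToRgbPixel(CameraIntrinsics rgb, Vector3 pointInRgbFrame)
{
    float u = rgb.fx * pointInRgbFrame.x / pointInRgbFrame.z + rgb.cx;
    float v = rgb.fy * pointInRgbFrame.y / pointInRgbFrame.z + rgb.cy;
    return new Vector2(u, v);
}
// depthToRgb is the relative extrinsic transform (depth camera frame -> RGB camera frame),
// built from the two sensor poses reported by the SDK.
public static Vector2 AlignDepthPixelToRgb(CameraIntrinsics depth, CameraIntrinsics rgb, Matrix4x4 depthToRgb, float u, float v, float depthMeters)
{
    Vector3 pointInDepthFrame = DepthPixelToPoint(depth, u, v, depthMeters);
    Vector3 pointInRgbFrame = depthToRgb.MultiplyPoint3x4(pointInDepthFrame);
    return PointToRgbPixel(rgb, pointInRgbFrame);
}
For your object detection use case, you can work in the other direction as well: take the depth value behind a detected pixel, back-project it to a 3D point, and then transform that point from the sensor's frame into Unity world space using the sensor pose.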
Thank you! That helped me make a good start. However, I'm getting stuck when getting the pose of the frame. I'm doing it as soon as the camera callback is triggered, as suggested in the other post, but I'm getting the same MLCVCameraGetFramePose error and I'm really confused as to why. Here is my function:
If you are using OpenXR, you will need to make sure Perception Snapshots is enabled in the OpenXR Magic Leap Support Feature in your project settings.
You will need to make sure to get the pose of the camera within 500 ms of the callback. This means that you need to maintain a high frame rate; otherwise, the timestamp might become invalid.
You can also use the OpenXR Pixel Sensor logic to obtain the camera pose. This way you don't have to convert between OpenXR and MLSDK poses. Note that if you are using the MLCamera, you can use the following example without additional configuration; just call GetSensorPose in the callback.
Callback example
private MagicLeapPixelSensorFeature _pixelSensorFeature;
private PixelSensorId _sensorType;
private XROrigin _xrOrigin;
...
void RawVideoFrameAvailable(MLCamera.CameraOutput output, MLCamera.ResultExtras extras, MLCameraBase.Metadata metadataHandle)
{
    if (output.Format == MLCamera.OutputFormat.RGBA_8888)
    {
        // Flips the frame vertically so it does not appear upside down.
        MLCamera.FlipFrameVertically(ref output);
        UpdateRGBTexture(ref _videoTextureRgb, output.Planes[0], _screenRendererRGB);
    }

    // Lazily obtain the Pixel Sensor feature and create the sensor the first time the callback fires.
    if (_pixelSensorFeature == null)
    {
        _pixelSensorFeature = OpenXRSettings.Instance.GetFeature<MagicLeapPixelSensorFeature>();
        _sensorType = _pixelSensorFeature.GetSupportedSensors().Find(x => x.SensorName == "Picture Center"); // Simplified for example
        bool wasCreated = _pixelSensorFeature.CreatePixelSensor(_sensorType);
    }

    Pose sensorPose = _pixelSensorFeature.GetSensorPose(_sensorType);

    // Updates the sensor pose to be relative to the XR Origin.
    if (_xrOrigin == null)
    {
        _xrOrigin = FindAnyObjectByType<XROrigin>();
    }
    if (_xrOrigin != null)
    {
        Vector3 worldPosition = _xrOrigin.CameraFloorOffsetObject.transform.TransformPoint(sensorPose.position);
        Quaternion worldRotation = _xrOrigin.transform.rotation * sensorPose.rotation;
        // Update the existing pose
        sensorPose = new Pose(worldPosition, worldRotation);
    }

    Debug.Log("Sensor Position:" + sensorPose.position);
    Debug.Log("Sensor Rotation:" + sensorPose.rotation);
}
Hi, thank you for this. What does it mean that the sensor pose is relative to the XR origin?
I am using the MLDepthCamera like this; I'm assuming there's no difference in what the Pixel Sensor and the MLDepthCamera return?
public override void Update()
{
    //Debug.Log("Depth camera update");
    if (!permissionGranted || !MLDepthCamera.IsConnected) return;

    var result = MLDepthCamera.GetLatestDepthData(0, out MLDepthCamera.Data data);
    isFrameAvailable = result.IsOk;
    if (isFrameAvailable)
    {
        lastData = data;
        DepthCameraIntrinsics.fx = data.Intrinsics.FocalLength.x;
        DepthCameraIntrinsics.fy = data.Intrinsics.FocalLength.y;
        DepthCameraIntrinsics.cx = data.Intrinsics.PrincipalPoint.x;
        DepthCameraIntrinsics.cy = data.Intrinsics.PrincipalPoint.y;
        DepthCameraIntrinsics.width = data.Intrinsics.Width;
        DepthCameraIntrinsics.height = data.Intrinsics.Height;
        DepthCameraExtrinsics.position = data.Position;
        DepthCameraExtrinsics.rotation = Matrix4x4.Rotate(data.Rotation);
        DepthCameraExtrinsics.timestamp = data.FrameTimestamp;
    }
}
Also, do you have any advice on aligning the images from the CV Camera and the depth camera? I am struggling with that.
I was also wondering if you could explain how to access the depth information from the MLDepthCamera.FrameBuffer depthFrame. I’m currently converting it into an array to get the depth value at each pixel, but my depth values seem to be too high.
This is because the MLSDK returns a Pose in a different space, which is the equivalent of OpenXR's unbounded reference space.
Regarding the Depth Camera API:
Please note that the MLSDK API (MLDepthCamera) has been deprecated in favor of the OpenXR Magic Leap Pixel Sensor API. This means that the API will not receive updates and is no longer supported. That said, here is an old post regarding the ML Depth Camera:
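As a starting point while you look at that, here is a rough sketch of reading a per-pixel depth value out of the frame buffer. It assumes the buffer's Data is a managed byte array containing one 32-bit float per pixel (in meters), and that Stride is the row size in bytes and BytesPerUnit is 4; please verify those field names and the pixel format against the API reference for your SDK version.
Depth read example (a sketch under the assumptions above)
// Reads the depth value (assumed to be in meters) at pixel (x, y).
// Assumes frame.Data is a byte[] with one 32-bit float per pixel,
// frame.Stride is the row pitch in bytes, and frame.BytesPerUnit is 4.
public static float GetDepthAt(MLDepthCamera.FrameBuffer frame, int x, int y)
{
    int byteIndex = y * (int)frame.Stride + x * (int)frame.BytesPerUnit;
    return System.BitConverter.ToSingle(frame.Data, byteIndex);
}
If the values still look too large, it is also worth double-checking that you are sampling the depth image buffer itself rather than one of the other buffers in the frame (which share the same layout but do not contain distances).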