How to send out 32bit ToF Depthmap through spout

jakaskerl · 9 Jan

BonkoKaradjov
Thanks, can you also send a left camera frame and ToF depth frame so we can test your calibration on our side?
edit: images should be taken at the same time of course
edit2: i see I'm mixing the threads a bit, sorry for that, let's first fix alignment and the move on to the depth streaming

Thanks,
Jaka

BonkoKaradjov · 9 Jan

Dear @jakaskerl,

thank you very much for your help.
To get a better quality of the maps I could record the images in Touchdesigner through sending ndi for example of the third additional stream (cam_a, cam_b, and cam_c).
Or record it directly from hard disk, this would probably give you the most exact output.
But I am not shure how to implement the third camera synced to the other two to produce the output.

Is there any script example out there that I can use for this with the ToF sensor to record or send out the
unaligned 3 videostreams through syphon, ndi or record on disk?
I am slow in coding such things and it would be really helpful if there is a code example instead of implementing this into the two scripts above and probably sending you the wrong data.

I would also switch the space where I record the video so that you get more angles and data in the depth.

Thank you very much for your help.
With kind regards
Bonko

jakaskerl · 13 Jan

BonkoKaradjov
Should be good:

import os
import time
import cv2
import depthai as dai
import numpy as np
from datetime import timedelta

def create_pipeline():
    pipeline = dai.Pipeline()

    # Create ToF node
    tof = pipeline.create(dai.node.ToF)
    tof.setNumShaves(4)

    # Configure ToF
    tofConfig = tof.initialConfig.get()
    tofConfig.enableFPPNCorrection = True
    tofConfig.enableOpticalCorrection = True
    tofConfig.enableWiggleCorrection = True
    tofConfig.enableTemperatureCorrection = True
    tofConfig.phaseUnwrappingLevel = 4
    tof.initialConfig.set(tofConfig)

    # Create ToF camera node
    cam_tof = pipeline.create(dai.node.Camera)
    cam_tof.setFps(20)
    cam_tof.setImageOrientation(dai.CameraImageOrientation.ROTATE_180_DEG)
    cam_tof.setBoardSocket(dai.CameraBoardSocket.CAM_A)
    cam_tof.raw.link(tof.input)

    # Create RGB camera nodes
    colorLeft = pipeline.create(dai.node.ColorCamera)
    colorRight = pipeline.create(dai.node.ColorCamera)

    # Configure RGB cameras
    colorLeft.setBoardSocket(dai.CameraBoardSocket.CAM_B)
    colorRight.setBoardSocket(dai.CameraBoardSocket.CAM_C)
    colorLeft.setResolution(dai.ColorCameraProperties.SensorResolution.THE_800_P)
    colorRight.setResolution(dai.ColorCameraProperties.SensorResolution.THE_800_P)
    colorLeft.setFps(20)
    colorRight.setFps(20)
    colorLeft.setInterleaved(False)
    colorRight.setInterleaved(False)

    # Create Sync node
    sync = pipeline.create(dai.node.Sync)
    sync.setSyncThreshold(timedelta(milliseconds=50))

    # Link outputs to Sync
    tof.depth.link(sync.inputs["tof"])
    colorLeft.isp.link(sync.inputs["left"])
    colorRight.isp.link(sync.inputs["right"])

    # Create output
    xout = pipeline.create(dai.node.XLinkOut)
    xout.setStreamName("sync_out")
    sync.out.link(xout.input)

    return pipeline

def colorize_depth_for_display(depth_frame):
    depth_normalized = cv2.normalize(depth_frame, None, 0, 255, cv2.NORM_MINMAX)
    depth_colored = cv2.applyColorMap(np.uint8(depth_normalized), cv2.COLORMAP_JET)
    return depth_colored

if __name__ == '__main__':
    pipeline = create_pipeline()

    with dai.Device(pipeline) as device:
        q_sync = device.getOutputQueue(name="sync_out", maxSize=4, blocking=False)
        frame_counter = 0
        save_dir = None

        while True:
            msgGrp = q_sync.get()
            
            frames = {}
            for name, msg in msgGrp:
                frames[name] = msg.getCvFrame()

            if len(frames) == 3:  # We expect 3 frames (ToF, left RGB, right RGB)
                # Get raw frames
                tof_depth = frames['tof']
                left_rgb = frames['left']
                right_rgb = frames['right']

                # Display frames (colorize depth only for display)
                cv2.imshow("ToF Depth", colorize_depth_for_display(tof_depth))
                cv2.imshow("Left RGB", left_rgb)
                cv2.imshow("Right RGB", right_rgb)

                key = cv2.waitKey(1)
                if key == ord('q'):
                    break
                elif key == ord('c'):
                    # Create new directory for saving if it doesn't exist
                    if save_dir is None:
                        timestamp = time.strftime("%Y%m%d_%H%M%S")
                        save_dir = os.path.join("data", timestamp)
                        os.makedirs(save_dir, exist_ok=True)
                        print(f"Created directory: {save_dir}")

                    # Save raw frames
                    tof_filename = os.path.join(save_dir, f"tof_{frame_counter:06d}.npy")
                    left_filename = os.path.join(save_dir, f"left_{frame_counter:06d}.png")
                    right_filename = os.path.join(save_dir, f"right_{frame_counter:06d}.png")

                    # Save ToF as raw numpy array, RGB images as PNG
                    np.save(tof_filename, tof_depth)
                    cv2.imwrite(left_filename, left_rgb)
                    cv2.imwrite(right_filename, right_rgb)
                    
                    print(f"Saved frame {frame_counter}")
                    frame_counter += 1

    cv2.destroyAllWindows()
    print("Program finished")

Thanks,
Jaka

BonkoKaradjov · 14 Jan

Dear @jakaskerl ,

Thank you very much for your help.
Here are the files I got in a bigger room:
Link to Photofiles

As mentioned I am trying to achive a sendout of color, depth and pointcloud with as less latency as possible and as good Rgb Quality as possible by keeping my osc rgb and tof camera control I implemented in my last script.
Link to last script

Here in parallel unfortunately I had to pay the price, becaus my production is already running => my approach with Orbbec Femto Mega.

To clear out what I am trying. If quality and latentcy is ok and the alignment as prefect as those from the Femto Mega I would slice the pointcloud as in the video example and use the combination of depthmap and sliced pointcloud as alpha channel to cutout the parts that have to become visible.

Link to my Femto Mega approach

Thank you again very much for your help.
As an information:

Luxonis could become serious competitor to the Femto Mega or even the Femto Mega 1 with the 12MP Tof sensor or even with the Oak 4 Pro (with ToF)
because Femtos Color Cameras have not such an easy live control + not such a good quality with their color cameras.

They are only more practical at the moment because of their direct connection through the Ip adress in Touchdesigner and their pretty exact point cloud.

For visual arts a good Oak 4 Pro with Tof option that is well aligned could beat Orbbec in speed (2.5 Gbs) and quality, especially if you bind in another ToF sensor.

Thats why I am not shure, that if I don't achieve the quality with my custom build, if it would be better to exchange my sensor with another camera from OaK.

Anyway I am very thankul for all your help and please be honest If you think if I could reach Femto Mega with my custom Oak Tof or if I should redirect my money probably to something new like the Oak 4 Pro (wide) or the interesting Oak Thermal sensor.

jakaskerl · 18 Jan

Hi BonkoKaradjov
Team has had a chance to look at your scripts.
Issue was that your calibration was done using rotated TOF frame not RGB, so the alignment has to also be done for rotated TOF and not rotated RGB frame, otherwise the rot matrix is incorrect.

Code for properly aligning the streams:

import os 
import pathlib
import numpy as np
import cv2
import depthai as dai
from numba import jit, prange

@jit(nopython=True, parallel=True)
def reprojection(depth_image, depth_camera_intrinsics, camera_extrinsics, color_camera_intrinsics, depth_image_show = None):
    height = len(depth_image)
    width = len(depth_image[0])
    if depth_image_show is not None:
        image = np.zeros((height, width), np.uint8)
    else:
        image = np.zeros((height, width), np.uint16)
    if(camera_extrinsics[0][3] > 0):
        sign = 1
    else:
        sign = -1
    for i in prange(0, height):
        for j in prange(0, width):
            if sign == 1:
                # Reverse the order of the pixels
                j = width - j - 1
            d = depth_image[i][j]
            if(d == 0):
                continue
            # Convert pixel to 3d point
            x = (j - depth_camera_intrinsics[0][2]) * d / depth_camera_intrinsics[0][0]
            y = (i - depth_camera_intrinsics[1][2]) * d / depth_camera_intrinsics[1][1]
            z = d

            # Move the point to the camera frame
            x1 = camera_extrinsics[0][0] * x + camera_extrinsics[0][1] * y + camera_extrinsics[0][2] * z + camera_extrinsics[0][3]
            y1 = camera_extrinsics[1][0] * x + camera_extrinsics[1][1] * y + camera_extrinsics[1][2] * z + camera_extrinsics[1][3]
            z1 = camera_extrinsics[2][0] * x + camera_extrinsics[2][1] * y + camera_extrinsics[2][2] * z + camera_extrinsics[2][3]

            u = color_camera_intrinsics[0][0] * (x1  / z1) + color_camera_intrinsics[0][2]
            v = color_camera_intrinsics[1][1] * (y1  / z1) + color_camera_intrinsics[1][2]
            int_u = round(u)
            int_v = round(v)
            if int_u >= 0 and int_u < (len(image[0]) - 1) and int_v >= 0 and int_v < len(image):
                if depth_image_show is not None:
                    image[int_v][int_u] = depth_image_show[i][j][0]
                    image[int_v][int_u + sign] = depth_image_show[i][j][0]
                else:
                    image[int_v][int_u] = z1
                    image[int_v][int_u + sign] = z1
    return image

def colorizeDepth(frameDepth):
    invalidMask = frameDepth == 0
    # Log the depth, minDepth and maxDepth
    try:
        minDepth = np.percentile(frameDepth[frameDepth != 0], 3)
        maxDepth = np.percentile(frameDepth[frameDepth != 0], 95)
        logDepth = np.log(frameDepth, where=frameDepth != 0)
        logMinDepth = np.log(minDepth)
        logMaxDepth = np.log(maxDepth)
        np.nan_to_num(logDepth, copy=False, nan=logMinDepth)
        # Clip the values to be in the 0-255 range
        logDepth = np.clip(logDepth, logMinDepth, logMaxDepth)

        # Interpolate only valid logDepth values, setting the rest based on the mask
        depthFrameColor = np.interp(logDepth, (logMinDepth, logMaxDepth), (0, 255))
        depthFrameColor = np.nan_to_num(depthFrameColor)
        depthFrameColor = depthFrameColor.astype(np.uint8)
        depthFrameColor = cv2.applyColorMap(depthFrameColor, cv2.COLORMAP_JET)
        # Set invalid depth pixels to black
        depthFrameColor[invalidMask] = 0
    except IndexError:
        # Frame is likely empty
        depthFrameColor = np.zeros((frameDepth.shape[0], frameDepth.shape[1], 3), dtype=np.uint8)
    except Exception as e:
        raise e
    return depthFrameColor

def get_Calibration_Data(calibData: dai.CalibrationHandler, TOF_SOCKET, ALIGN_SOCKET, depthSize = (640, 480), rgbSize = (1920, 1080)):
    M1 = np.array(calibData.getCameraIntrinsics(TOF_SOCKET, *depthSize))
    D1 = np.array(calibData.getDistortionCoefficients(TOF_SOCKET))
    M2 = np.array(calibData.getCameraIntrinsics(ALIGN_SOCKET, *rgbSize))
    D2 = np.array(calibData.getDistortionCoefficients(ALIGN_SOCKET))
    T = (
        np.array(calibData.getCameraTranslationVector(TOF_SOCKET, ALIGN_SOCKET, False))
        * 10
    )  # to mm for matching the depth
    R = np.array(calibData.getCameraExtrinsics(TOF_SOCKET, ALIGN_SOCKET, False))[
        0:3, 0:3
    ]
    TARGET_MATRIX = M1
    return M1, D1, M2, D2, TARGET_MATRIX, R, T

def getAlignedDepth(frameDepth, TARGET_MATRIX, M2, R, T, rgbSize = (1920, 1080)):
    R_180 = np.array([[-1., 0., 0.],
                      [0., -1., 0.],
                      [0., 0., 1.]])
    combinedExtrinsics = np.eye(4)
    combinedExtrinsics[0:3, 0:3] =  R
    combinedExtrinsics[0:3, 3] =  T
    depthAligned = reprojection(frameDepth, TARGET_MATRIX, combinedExtrinsics, TARGET_MATRIX)
    mapX, mapY = cv2.initUndistortRectifyMap(TARGET_MATRIX, None, np.eye(3), M2, rgbSize, cv2.CV_32FC1)
    outputAligned = cv2.remap(depthAligned, mapX, mapY, cv2.INTER_NEAREST)
    return outputAligned
import argparse
parser = argparse.ArgumentParser()
parser.add_argument("-path", help="Path to the dataset, were all files are.")
args = parser.parse_args()

### Write down, which sockets you wanna align to ####
ALIGN_SOCKET = dai.CameraBoardSocket.CAM_C
TOF_SOCKET = dai.CameraBoardSocket.CAM_A

path = args.path
calibData = dai.CalibrationHandler(path + "/calib.json")
for files in os.listdir(path):
    ### LOADING ALL images which are used ###
    if files.split(".")[-1] == "json":
        continue
    index = files.split(".")[0].split("_")[1]
    depth = np.load(path + f"/tof_{index}.npy")
    left = cv2.imread(path + f"/left_{index}.png")
    right = cv2.imread(path + f"/right_{index}.png")


    ### LOGIC ####
    # Logic is following:

    # Since device was calibrated with the rotated depth, the depth frame MUST be rotated before aligned
    depth = cv2.rotate(depth, cv2.ROTATE_180)

    # From data you have, get all needed matrices
    M1, D1, M2, D2, TARGET_MATRIX, R, T = get_Calibration_Data(calibData, TOF_SOCKET, ALIGN_SOCKET, (depth.shape[1], depth.shape[0]), (right.shape[1], right.shape[0]))

    #Align rotated depth to the rotated RGB image
    alignedDepth = getAlignedDepth(depth, TARGET_MATRIX, M2, R, T, (right.shape[1], right.shape[0]))

    #Rotate back both depth and RGB
    right = cv2.rotate(right, cv2.ROTATE_180)
    alignedDepth = cv2.rotate(alignedDepth, cv2.ROTATE_180)


    blended = cv2.addWeighted(right, 0.5, colorizeDepth(alignedDepth), 0.5, 0)
    cv2.imshow("Blended", blended)
    cv2.waitKey(0)

Thanks,
Jaka

BonkoKaradjov · 9 Feb

Dear Jaka, thank you very much for your help,

To be honest, I lost the job, the camera was planned in, and all the money and Luxonis does not want take back this custom build brick, even ignores my last email..but ok.
At least I payed extra 300 € for a calculated sensor I asked for and received a ......

But I don't want to feel so shitty that I posses a 800 € brick working as the most expensive 12MP camera ever selled!!

I am afraid I will never calibrate that sensor right. Your script, if I understood it correctly needs the calibration pictures with a charuco board first, which I tried.....

After following the calibration instructions here:
LINK Luxonis Website

I first completely bricked my pycharm virtual environment. Now it either says "no sensor detected" or does not recognize certain parts such as the FPPN_ENABLE etc. in none of my written scripts even the examples from Luxonis don't work... so now the sensor is on niveau even more expensive paperweight.

My calibration problems:

I constantly get different errors.
The newest one is (I tryed a clean environment in Python 3.9)
==> self.aruco_dictionary = cv2.aruco.Dictionary_get(
AttributeError: module 'cv2.aruco' has no attribute 'Dictionary_get'

A Forum tells that the aruco Api had changed for certain versions.
So I should rewrite the Luxonis calibrate.py ???

I am afraid that if I solve this issue the calibration will not start again and new errors will appear.

Is it right from Luxonis that when receiving a wrong calibrated sensor and I payed for that I have to invest a huge amount of time in finding out how to calibrate it myself and living with the problem that all future upcoming scripts for it will not work and cost me more time and therefore paying extra 300 €?

I was at least hoping that they will take back the order and send me a standard, well calibrated version (that has the same components...I even can not think why they think this sensor is unsellable again - it has the same components ) + give me back my 300 € for the custom build.

To be honest: Buying the 12MP camera from somewhere else and soldering it to the board by a friend would have been much cheaper...and was not the reason why I send the sensor back to Luxonis for the "custom build"

I was very thankful in the beginning with their great support but now I am seriously disappointed, now that I know that the support was not aware of the fact that my sensor was wrongly calibrated and I wasted soo much time, my time and the time of the support.
Which I am very sorry for.

I wish I could have used the support and this forum for more productive questions than, "please help me how to use my brocken device and repair it with me......"

I will of course have to write this into my test review for ToF Sensors for Artists to never send Luxonis a sensor for custom build, even if it has the exact same components as an industrial build from them if you work on a job, even for a job in the future.

But thank you very, very much for your help.

BonkoKaradjov · 13 Feb

Any suggestions @jakaskerl @erik what is wrong to try at least myself to calibrate that sensor anyhow?

I was getting errors before downgrading to numpy1, for whatever reason there is a requirements installer.

I am only trying to get the pictures to use the script from @jakaskerl above to get the correct calibration.

BonkoKaradjov · 13 Feb

If I write it as the first example I get this beautiful error:

jakaskerl · 14 Feb

BonkoKaradjov
Why are you trying to recalibrate the device? Everything should be calibrated already.

Thanks,
Jaka

BonkoKaradjov · 15 Feb

jakaskerl Thank you very much for your help.

I maybe misunderstood the workflow.
I get errors because your script is searching for a calib.json and the photo files.

That brought me to the misunderstanding that I first have to get the images from a charuco board from different angles so that this script makes the alignment?

BonkoKaradjov · 15 Feb

So for example: Could I use this script of mine and modify it?

[https://www.dropbox.com/scl/fi/gdlxmh68vattwt6uvusv9/test_reversedAlign_old_NDI.py?rlkey=3ehmn3oz5yl1dd3d8bvbb260o&dl=0](https://)

@jakaskerl Look at my final approach with the Femto Mega and Touchdesigner:

If I could get Color Stream + Depth Stream + Pointcloud Stream through NDI or Spout perfectly aligned out
in combination with my scrip that controls everything through OSC this could beat the Femto Mega immediately because Femto Megas image quality is way less worse than the image quality from your 12MP ToF sensor.

Here what I get in Touchdesigner from the Orbbec Top through network:
[https://gyazo.com/96490f9625ddef7833c3e4dbb4352896](https://)

jakaskerl · 17 Feb

BonkoKaradjov
Camera should already calibrated, you shouldn't need further calibration.

You can use your script, but make sure to use alignment from https://discuss.luxonis.com/d/5692-how-to-send-out-32bit-tof-depthmap-through-spout/11 in order to achieve proper alignment.

LMK what the main issue is with NDIlib if you run into any.

Thanks,
Jaka

BonkoKaradjov · 18 Feb

Dear @jakaskerl , Dear @erik , Thank you very much for all your help.

I want to apologize for my emotional words above. Somehow I was totaly confused and misunderstood completely and thought that the sensor not had been calibrated and was upset that because my lack of knowledge (so my fault) I could not prepare the sensor for the one arts job I planned.

Now the custom Oak ToF is on stack and I have time to research its possibilities and hope to reach the goal one day.

I just wanted to say thank you for all your help and patience.
I really appreciate what Luxonis support does and hope that more people could know about your company and buy your sensors.

With kind regards
Bonko

AidalLori · 2 May

My uncle used to say if you chase chickens, you get feathers. If you chase strategy, you get crypto. The thing about gaming on blockchain-powered portals is that you need speed, stealth, and a seriously smart approach. Sites like https://bgpochivnidni.eu/ list platforms like CoinCasino, where crypto deposits are instant, and you’ve got thousands of titles to test your tactical acumen Stick to games with transparent odds, watch your bankroll like it’s your last pizza slice, and skip the emotional drama. Math is your best buddy in this business.