Poor Depth Quality on OAK FFC 4P with OV9782 Stereo Pair

ShreyasK

I am using a Luxonis OAK-FFC-4P module with OV9782 W sensors connected at B and C ports. My goal is to generate a depth map from this stereo pair. I will be using the default un-distortion method through enableUndistortion to reduce the FOV and get undistorted image feed. Hoping this extra layer of processing will not bother a much for the depth calculations.

I followed the official documentation on Configuring Stereo Depth and came up with the code attached.

Additional Information:

Stereo baseline: 5 cm (same as Intel RealSense baseline)
Distance from object: ~45 cm
Use cases: VIO, obstacle avoidance, and other drone applications
Observed issue: Disparity map from OAK is noisy and inaccurate, unlike the D435i output.

However, the depth output quality is far from expected. To validate this, I ran a comparison against an Intel RealSense D435i connected to the same PC, with both devices placed at the same distance (approx. 45 cm from a water bottle used as the target object). For a fair comparison, I disabled the infrared emitter on the D435i.

The depth results from the RealSense are significantly more reliable, at least visually and stable, than those from the OAK module. I’ve attached images of the depth maps from both devices for reference.

Depth map from OAK Module:

Infra Feed from OAK Module:

Realsense Depth Map:

Code:

#!/usr/bin/env python3

import cv2

import depthai as dai

import numpy as np

pipeline = dai.Pipeline()

monoLeft = pipeline.create(dai.node.Camera).build(dai.CameraBoardSocket.CAM_B)

monoRight = pipeline.create(dai.node.Camera).build(dai.CameraBoardSocket.CAM_C)

stereo = pipeline.create(dai.node.StereoDepth)

# Linking

monoLeft.initialControl.setSharpness(2)

monoLeft.initialControl.setLumaDenoise(1)

monoLeft.initialControl.setChromaDenoise(4)

monoLeft.initialControl.setAntiBandingMode(dai.CameraControl.AntiBandingMode.MAINS_50_HZ)

monoRight.initialControl.setSharpness(2)

monoRight.initialControl.setLumaDenoise(1)

monoRight.initialControl.setChromaDenoise(4)

monoRight.initialControl.setAntiBandingMode(dai.CameraControl.AntiBandingMode.MAINS_50_HZ)

monoLeftOut = monoLeft.requestOutput(

size=(640,400),

type=dai.ImgFrame.Type.NV12,

resizeMode=dai.ImgResizeMode.CROP,

enableUndistortion=True,

fps=30

)

monoRightOut = monoRight.requestOutput(

size=(640,400),

type=dai.ImgFrame.Type.NV12,

resizeMode=dai.ImgResizeMode.CROP,

enableUndistortion=True,

fps=30

)

monoLeftOut.link(stereo.left)

monoRightOut.link(stereo.right)

stereo.setRectification(True)

stereo.setExtendedDisparity(True)

stereo.setLeftRightCheck(True)

stereo.setSubpixel(False)

stereo.setSubpixelFractionalBits(4)

# stereo.initialConfig.setConfidenceThreshold(0)

stereo.PresetMode(5)

stereo.setBaseline(5)

stereo.setPostProcessingHardwareResources(3, 3)

stereo.initialConfig.setMedianFilter(dai.StereoDepthConfig.MedianFilter.KERNEL_3x3)

stereo.initialConfig.postProcessing.temporalFilter.enable = True

stereo.initialConfig.postProcessing.temporalFilter.alpha = 0.5

stereo.initialConfig.postProcessing.temporalFilter.delta = 3

stereo.initialConfig.postProcessing.speckleFilter.enable = True

stereo.initialConfig.postProcessing.speckleFilter.speckleRange = 12

stereo.initialConfig.postProcessing.spatialFilter.enable = True

stereo.initialConfig.postProcessing.spatialFilter.alpha = 0.5

stereo.initialConfig.postProcessing.spatialFilter.delta = 3

stereo.initialConfig.postProcessing.spatialFilter.holeFillingRadius = 2

stereo.initialConfig.postProcessing.spatialFilter.numIterations = 1

stereo.initialConfig.postProcessing.decimationFilter.decimationFactor = 2

# stereo.initialConfig.postProcessing.thresholdFilter.minRange = 400

# stereo.initialConfig.postProcessing.thresholdFilter.maxRange = 15000

syncedLeftQueue = stereo.syncedLeft.createOutputQueue()

syncedRightQueue = stereo.syncedRight.createOutputQueue()

disparityQueue = stereo.disparity.createOutputQueue()

colorMap = cv2.applyColorMap(np.arange(256, dtype=np.uint8), cv2.COLORMAP_JET)

colorMap[0] = [0, 0, 0] # to make zero-disparity pixels black

with pipeline:

pipeline.start()

maxDisparity = 1

while pipeline.isRunning():

leftSynced = syncedLeftQueue.get()

rightSynced = syncedRightQueue.get()

disparity = disparityQueue.get()

assert isinstance(leftSynced, dai.ImgFrame)

assert isinstance(rightSynced, dai.ImgFrame)

assert isinstance(disparity, dai.ImgFrame)

cv2.imshow("left", leftSynced.getCvFrame())

cv2.imshow("right", rightSynced.getCvFrame())

npDisparity = disparity.getFrame()

maxDisparity = max(maxDisparity, np.max(npDisparity))

# Display disparity as greyscale instead of color

greyscaleDisparity = ((npDisparity / maxDisparity) * 255).astype(np.uint8)

cv2.imshow("disparity", greyscaleDisparity)

key = cv2.waitKey(1)

if key == ord('q'):

pipeline.stop()

break

Questions for the Luxonis Team

Is there any sensor-specific tuning required for OV9782 W modules to improve stereo depth quality?
Is the 5 cm baseline setup fully supported for stereo depth on the OAK FFC 4P?
Are there additional calibration or parameter tweaks (e.g., Subpixel mode, confidence threshold, median filter, or other advanced configs) that I should try?
Is there a recommended pipeline template for getting the best stereo performance from OV9782 sensors on OAK FFC 4P?
Is the current performance limitation hardware-related (sensor choice, baseline, etc.) or software/tuning-related?
When subpixel is turned with median filter, it ends up with below errors:

[18443010313351F500] [3.4] [4.993] [StereoDepth(2)] [error] Maximum disparity value '3040' exceeds the maximum supported '1024' by median filter. Disabling median filter!

[18443010313351F500] [3.4] [5.046] [StereoDepth(2)] [error] Maximum disparity value '3040' exceeds the maximum supported '1024' by median filter. Disabling median filter!

This issue is opened to understand if I’ve missed any configuration steps or if there are known limitations with OV9782 stereo modules compared to Realsense.

Looking forward to any guidance, best practices, or sample configurations from the Luxonis team.

OskarSonc

Hi @ShreyasK thanks for the detailed report!

Main issue I see is that you’re undistorting in the Camera node and letting StereoDepth rectify, which double-warps the images and degrades matching. I’d suggest starting from our v3 StereoDepth example and adjusting from there:

https://docs.luxonis.com/software-v3/depthai/examples/stereo_depth/stereo_depth/

Also, did you calibrate the cameras? Guide here:
https://docs.luxonis.com/hardware/platform/depth/calibration/

Hope it helps,
Oskar

ShreyasK

Thank you for reply @OskarSonc

I will attach few images for your kind reference:

"Bottle" is placed at a distance of 44cm approx.

This the default stereo.py
This default stereo.py with resolution of 640,400.
This actual camera feed:

Calibration is done this is the output:

After Calibration:

640,400 with default code:
default code(FUllresolution output)
Camera Feed(640,400):

Based on these, I felt calibration improved the stuff majorly. And undistorted feed cannot be used for Stereo.

So few question:

do u have any suggestions on setting etc (I have read configuring stereo depth page but need little info from personal experience with Wide-angle lens and oak FFC module).
So as I am working on a project where it needs undistorted feed for VIO and stereo depth for obstacle avoidance, how can I get two feed at a time. Because undistortion setting can be turned on only in requestOutput?
Or any trick to use undistorted feed with stereo depth?
Left camera feels more blurry at corners (I do not mean extreme corner). Tried cleaning etc but didn't able to fix the same.

jakaskerl

ShreyasK
Looks like like a bad calibration.. Can you send the configuration.json you used during calibration and the command you used to run it?

Thanks
Jaka

ShreyasK

Sure @jakaskerl.

configuration.json file:

https://drive.google.com/file/d/12YLIk5qbIWdq_8dVGJ23lpssJwDdt74F/view?usp=sharing

I used 32 inch charuco pdf on my 27 inch.

Distance between cameras is 5cm.

And this is the command I used:
python3 calibrate.py -s 3.284 -nx 17 -ny 9 -brd OAK-FFC-4P.json

jakaskerl

ShreyasK
Try specifying this for both stereo cameras:

monoRight = pipeline.create(dai.node.Camera).build(dai.CameraBoardSocket.CAM_C)
monoRight.setSensorType(dai.CameraSensorType.COLOR)

This should force the board to treat OV9782 as a COLOR sensor so it doesn't 2x2 bin it to 400p and break the image.

Thanks,
Jaka

ShreyasK

@jakaskerl Will try the same and check performance.

Is the calibration and stuff correct?

jakaskerl

ShreyasK
Yeah looks fine. Only issue I can see would be the wrong placement of sensors (mix-up of L and R).

Thanks,
Jaka

ShreyasK

Do u mean that right should be left and left should be right?
The left and right configuration is correct, I have rechecked it.
This is the output of using this: monoRight.setSensorType(dai.CameraSensorType.COLOR)

Compared to default mono:
With few filters like and also updated to newer stable release 3.0.0:
stereo.setSubpixel(True)

stereo.setSubpixelFractionalBits(4)

stereo.PresetMode(dai.node.StereoDepth.PresetMode.ROBOTICS)

stereo.setPostProcessingHardwareResources(3, 3)

stereo.initialConfig.postProcessing.temporalFilter.enable = True

stereo.initialConfig.postProcessing.temporalFilter.alpha = 0.5

stereo.initialConfig.postProcessing.temporalFilter.delta = 3

stereo.initialConfig.postProcessing.speckleFilter.enable = True

stereo.initialConfig.postProcessing.speckleFilter.speckleRange = 5

stereo.initialConfig.postProcessing.spatialFilter.enable = True

stereo.initialConfig.postProcessing.spatialFilter.alpha = 0.5

stereo.initialConfig.postProcessing.spatialFilter.delta = 3

stereo.initialConfig.postProcessing.spatialFilter.holeFillingRadius = 2

stereo.initialConfig.postProcessing.spatialFilter.numIterations = 1

stereo.initialConfig.postProcessing.decimationFilter.decimationFactor = 2

Output
This is the output of real sense from same spot the oak camera was kept:
Is there any way we can get output from decimation filter in desired resolution than that short output?

ShreyasK

Hi @jakaskerl
Any inputs from your end?

jakaskerl

ShreyasK
Was on vacation, apologies for late reply. The image looks good, I was mainly concerned with the 400p image since binning is done when downsizing from 800p.

If you want better depth, you need to improve the texture of the surfaces you are looking at. The bottle looks completely black so the algorithm can not perform depthai.

Thanks,
Jaka

ShreyasK

jakaskerl Thank you for your reply.

With regard to binning, as I need grayscale than color, will there be any way on this regard?
Because if I set sensorType to color it give color output which will affect the other systems which need the greyscale image feed.

jakaskerl

Hi ShreyasK
The OV9728 sensor is a color sensor. If it's shows a mono stream then it's already been converted. You can manually convert it to mono using imageManip if you really need to - otherwise, there shouldn't be any issues just using the color stream.

Thanks,
Jaka