StereoDepth

Supported on:RVC2 RVC4

StereoDepth node calculates disparity and/or depth from a stereo pair of Camera nodes.

Configure Stereo Depth

Learn how to achieve the best depth results with step-by-step configuration guide.

Open tutorial

To view the accuracy measurements, see Depth Accuracy.

How to place it

Python

C++

Python

1pipeline = dai.Pipeline()
2stereo = pipeline.create(dai.node.StereoDepth)

Inputs and Outputs

React Flow

Debug outputs:
- debugDispLrCheckIt1: ImgFrame
- debugDispLrCheckIt2: ImgFrame
- debugExtDispLrCheckIt1: ImgFrame
- debugExtDispLrCheckIt2: ImgFrame
- debugDispCostDump: ImgFrame
- confidenceMap: ImgFrame

Usage

Python

C++

Python

1pipeline = dai.Pipeline()
2stereo = pipeline.create(dai.node.StereoDepth)
3
4# Set profile preset to ROBOTICS
5stereo.setDefaultProfilePreset(dai.node.StereoDepth.PresetMode.ROBOTICS)
6# Better handling for occlusions:
7stereo.setLeftRightCheck(True)
8# Closer-in minimum depth, disparity range is doubled:
9stereo.setExtendedDisparity(True)
10# Better accuracy for longer distance, fractional disparity 32-levels:
11stereo.setSubpixel(True)
12
13# Define and configure MonoCamera nodes beforehand
14left.out.link(stereo.left)
15right.out.link(stereo.right)

Examples of functionality

Stereo Depth - Visualize disparity depth from stereo camera.
Stereo Depth Remap - Remap geometry from disparity depth <frame> to color frame.
Stereo Depth From Host - Runs stereo disparity on-device using pre-rectified stereo images from the host.
Stereo Depth Filters - Apply various filters to the stereo depth output.
Stereo Depth Calibration Update - Update stereo calibration parameters on-the-fly.

Depth Alignment

Image alignment is used for aligning depth map to a specific stream.For Depthai v3, the default Stereo Depth alignment is RECTIFIED_LEFT.

Python

1pipeline = dai.Pipeline()
2stereo = pipeline.create(dai.node.StereoDepth)
3 # alignment to RECTIFIED_LEFT or RECTIFIED_RIGHT
4stereo.setDepthAlign(dai.StereoDepthConfig.AlgorithmControl.DepthAlign.RECTIFIED_LEFT)

Or you can use the ImageAlign node:

Python

1align = pipeline.create(dai.node.ImageAlign)
2stereo.depth.link(align.input)
3rgbOut.link(align.inputAlignTo)  # Align depth to RGB
4align.outputAligned.link(sync.inputs["depth_aligned"])

Configure Camera for High FPS

The performance of the Stereo node is directly constrained by the MonoCamera configuration. To achieve high frame rates (e.g., 60 FPS) with StereoDepth, the MonoCameras must be configured to deliver frames at the required rate.

Python

1with dai.Pipeline(device) as pipeline:
2  stereo = pipeline.create(dai.node.StereoDepth)
3  mono_left = pipeline.create(dai.node.Camera).build(dai.CameraBoardSocket.CAM_B)
4  mono_right = pipeline.create(dai.node.Camera).build(dai.CameraBoardSocket.CAM_C)
5
6  def configure_cam(cam, size_x: int, size_y: int, fps: float):
7    cap = dai.ImgFrameCapability()
8    cap.size.fixed((size_x, size_y))
9    cap.fps.fixed(fps)
10  return cam.requestOutput(cap, True)
11
12  # Configure MonoCameras for 60 FPS at 640x400
13  mono_left_out = configure_cam(mono_left, 640, 400, 60)
14  mono_right_out = configure_cam(mono_right, 640, 400, 60)

From a processing standpoint, and in the absence of MonoCamera input constraints, the Stereo node itself is capable of processing image pairs at rates of up to 400 FPS when operating on 256×256 pixel inputs. By default, however, the MonoCameras operate at 30 FPS, which limits the overall StereoDepth output accordingly.

Changing camera calibration during runtime

Changing the camera calibration during runtime is possible. The calibration can be changed using the device.setCalibration() method and can be used in combination with dynamic calibration if device stereo quality degrades over time due to environmental factors. The same data can be obtained using the device.getCalibration().

Platform-Specific Configuration

Select your platform (RVC2 or RVC4) to view platform-specific documentation, presets, configuration options, and limitations.

RVC2

RVC4

Stereo depth preset	Use case	FPS @800P	FPS @400P	Output resolution	Motion Blur	Range (meters)
Default	General purpose	16	30	1/2 Input Resolution	True	0–10
Face	Face tracking, biometrics	16	30	1/2 Input Resolution	True	0–3
High Detail	3D scanning, object details	16	30	1/2 Input Resolution	True	0–15
Robotics	Navigation, obstacle detection	16	34	1/2 Input Resolution	False	0–15
Fast Accuracy	General purpose	30	90	Input Resolution	False	0–65
Fast Density	General purpose	30	90	Input Resolution	False	0–65
Fast Density No Subpixel	General purpose	50	110	Input Resolution	False	0–65

Subpixel Fractional Bits	Number of Unique Values
3	754
4	1506
5	3010

Internal block diagram of StereoDepth node

Depth Presets

We have made a few depth presets that can be used to quickly set up the StereoDepth node for different scenarios, without having to manually set all the parameters. For detailed preset specifications including all parameter values, see Configuring stereo depth.

Stereo depth preset	Use case	FPS @800P	FPS @400P	Output resolution	Motion Blur	Range (meters)
Fast Accuracy	General purpose	60	120	Input Resolution	False	0–65
Fast Density	General purpose	36	72	Input Resolution	False	0–65

Depth Configuration

Extended Disparity

Extended disparity mode allows detecting closer/nearer objects for the given baseline. This increases the maximum disparity search from 64 to 128, meaning the range is now: [0..127].

Computes disparity on the original size images (e.g. 1280x800).
Computes disparity on 2x downscaled images (e.g. 640x400).
Combines the two level disparities.

Usage:

Python

1stereo.setExtendedDisparity(True)

Subpixel Disparity

Subpixel mode improves disparity precision and is especially useful for long-range depth measurements. It also enables more accurate estimation of surface normals.On RVC4, subpixel processing is enabled by default and cannot be disabled, as it is fixed in hardware. The implementation uses 4 fractional bits.

LR Check

Left-Right Check or LR-Check is used to remove incorrectly calculated disparity pixels due to occlusions at object borders (Left and Right camera views are slightly different).On RVC4, LR Check runs purely in software on CPU.Left Right Check Threshold: Maximum allowed disparity difference between pixels. Smaller values lead to sparser depth (more pixels invalidated).

Usage:

Python

1stereo.initialConfig.algorithmControl.enableSwLeftRightCheck = True
2stereo.initialConfig.algorithmControl.leftRightCheckThreshold = 10  # Set threshold value

Note: RVC4 has a different inner mechanism that basically performs the LR check functionality - this is the Occlusion Confidence Weight (see the Confidence Metrics section below). The occlusion confidence weight compares the difference in disparity from doing left to right and right to left search, which is essentially what LR check does.

Software Confidence

Software Confidence filter takes disparity and confidence map (per-pixel 8‑bit confidence from the stereo hardware block) and applies a threshold value to filter out low-confidence pixels. This filtration is done on CPU and is applied after the hardware block generates the disparity. This filtering is performed on the CPU and is applied after the hardware block generates the disparity. When left–right (LR) check is enabled, both L→R and R→L disparities (with their corresponding confidence maps) are invalidated using the same threshold.Values:

Limited filtration: 50-100
Medium filtration: 200
Heavy filtration: >200 (very scene specific)

Usage:

Python

1stereo.initialConfig.costMatching.enableSwConfidenceThresholding = True
2stereo.initialConfig.costMatching.confidenceThreshold = 200  # Adjust based on desired filtration level

Confidence Metrics (Occlusion, Motion Vector, Flatness)

Three different metrics are used to calculate confidence maps, which get added together into one final confidence map.

Occlusion Confidence Weight
- Basically performs LR check. Compares the difference in disparity from doing left to right and right to left search (forward/backward motion vector consistency).
Motion Vector (MV) Confidence Weight
- MV Variance calculates variance in a local edge-aware window around each pixel; given a constraint to ensure depth is smooth and consistent within the same object.
- MV Confidence Threshold: The threshold offset for MV variance. Valid range [0,3]. Value = 0 allows most variance.
Flatness Confidence Weight
- Flatness considers the input image texture and masks out low texture regions with too similar neighboring features. The minimum amount of texture required can be adjusted with the flatness threshold.
- Flatness Threshold: The threshold value for flat region check. The higher the value, the more likely the pixel is to be detected as flat region based on census feature comparison. Used for low texture regions which do not have adequate features for matching.
- Flatness Override: If pixel is detected as flat, set confidence to zero.

Usage:

Python

1# Confidence metrics weights (must sum to 32 for RVC4)
2stereo.initialConfig.confidenceMetrics.occlusionConfidenceWeight = 12
3stereo.initialConfig.confidenceMetrics.motionVectorConfidenceWeight = 10
4stereo.initialConfig.confidenceMetrics.flatnessConfidenceWeight = 10
5
6# Thresholds
7stereo.initialConfig.confidenceMetrics.motionVectorConfidenceThreshold = 1  # Valid range [0,3]
8stereo.initialConfig.confidenceMetrics.flatnessConfidenceThreshold = 5
9stereo.initialConfig.confidenceMetrics.flatnessOverride = True  # Set confidence to zero if flat

Note: Confidence metrics weights must sum to 32 for RVC4.

Adaptive Median Filter

Adaptive median filter applies the filter only to low-confidence pixels below a specified threshold value.Adaptive median filter threshold: Applies filter only to low-confidence pixels below this value. This threshold should be less than Fill confidence threshold in Hole filling.

Usage:

Python

1stereo.initialConfig.postProcessing.adaptiveMedianFilter.enable = True
2stereo.initialConfig.postProcessing.adaptiveMedianFilter.confidenceThreshold = 200  # Should be < hole filling fillConfidenceThreshold

Hole Filling

Hole Filling is a post-processing filter that fills gaps in the depth map by grouping pixels into superpixels (hexagonal regions) and computing disparity values based on high-confidence pixels within those regions. This filter modifies the confidence map to mark filled regions.The filter works by:

Grouping pixels into superpixels (hexagonal regions)
Using pixels with confidence above the high confidence threshold to calculate disparity values per superpixel
Filling holes based on the fill confidence threshold and minimum valid disparity requirements

Parameters:

High confidence threshold - Pixels with confidence above this threshold are used to calculate disparity value per superpixel. If set to maximum value (255), no superpixels should be formed.
Fill confidence threshold - Pixels with confidence below this value are filtered out (if invalidateDisparities is set to True) or become part of the superpixel calculation.
Min valid disparity - Decides what proportion of pixels in a region need to have confidence above the high confidence threshold to be included in computing the disparity for the superpixel. Values: 1 (50%), 2 (25%), or 3 (12.5%). Higher min valid disparity leads to bigger superpixels (more hexagons).
Invalidate disparities - If set to True, pixels below the fill confidence threshold are filtered out from the final disparity map.

Usage:

Python

1stereo.initialConfig.postProcessing.holeFilling.enable = True
2stereo.initialConfig.postProcessing.holeFilling.highConfidenceThreshold = 200
3stereo.initialConfig.postProcessing.holeFilling.fillConfidenceThreshold = 210
4stereo.initialConfig.postProcessing.holeFilling.minValidDisparity = 1  # 1, 2, or 3
5stereo.initialConfig.postProcessing.holeFilling.invalidateDisparities = True

Median Filter

This is a non-edge preserving Median filter, which can be used to reduce noise and smoothen the depth map.On RVC4: Median filter is implemented in software (CPU), and only supports 3x3 or 5x5 kernel sizes. If 7x7 is requested, it is automatically downgraded to 5x5.

Usage:

Python

1stereo.initialConfig.setMedianFilter(dai.MedianFilter.KERNEL_3x3)  # Options: MEDIAN_OFF, KERNEL_3x3, KERNEL_5x5

Speckle Filter

Speckle Filter is used to reduce the speckle noise. Speckle noise is a region with huge variance between neighboring disparity/depth pixels, and speckle filter tries to filter this region.

Usage:

Python

1stereo.initialConfig.postProcessing.speckleFilter.enable = True
2stereo.initialConfig.postProcessing.speckleFilter.speckleRange = 48

Temporal Filter

Temporal Filter is intended to improve the depth data persistency by manipulating per-pixel values based on previous frames. The filter performs a single pass on the data, adjusting the depth values while also updating the tracking history.In cases where the pixel data is missing or invalid, the filter uses a user-defined persistency mode to decide whether the missing value should be improved with stored data. Note that due to its reliance on historic data, the filter may introduce visible motion blurring/smearing artifacts, and therefore is best-suited for static scenes.

Usage:

Python

1stereo.initialConfig.postProcessing.temporalFilter.enable = True
2stereo.initialConfig.postProcessing.temporalFilter.alpha = 0.4  # Weight of current frame (0-1)
3stereo.initialConfig.postProcessing.temporalFilter.delta = 3  # Threshold for valid depth change

Spatial Filter

Spatial Edge-Preserving Filter will fill invalid depth pixels with valid neighboring depth pixels. It performs a series of 1D horizontal and vertical passes or iterations, to enhance the smoothness of the reconstructed data. It is based on this research paper.

Usage:

Python

1stereo.initialConfig.postProcessing.spatialFilter.enable = True
2stereo.initialConfig.postProcessing.spatialFilter.alpha = 0.5  # Edge-preserving strength
3stereo.initialConfig.postProcessing.spatialFilter.delta = 8  # Threshold for valid depth change
4stereo.initialConfig.postProcessing.spatialFilter.holeFillingRadius = 2  # Radius for hole filling
5stereo.initialConfig.postProcessing.spatialFilter.numIterations = 1  # Number of iterations

Brightness Filter

Brightness filter will filter out (invalidate, by setting to 0) all depth pixels for which input stereo camera image pixels are outside the configured min/max brightness threshold values. This filter is useful when you have a high dynamic range scene, like outside on a bright day, or in general whenever stereo camera pair can directly see a light source.It also helps with rectification "artifacts", especially when you have Wide FOV lenses and you apply alpha param.

Usage:

Python

1stereo.initialConfig.postProcessing.brightnessFilter.enable = True
2stereo.initialConfig.postProcessing.brightnessFilter.minBrightness = 0  # Minimum brightness threshold
3stereo.initialConfig.postProcessing.brightnessFilter.maxBrightness = 255  # Maximum brightness threshold

Threshold Filter

Threshold filter will filter out all depth pixels outside the configured min/max threshold values. In a controlled environment, where you know exactly how far the scene can be (eg. 30cm - 2m) it's advised to use this filter.

Usage:

Python

1stereo.initialConfig.postProcessing.thresholdFilter.minRange = 0  # Minimum depth in cm
2stereo.initialConfig.postProcessing.thresholdFilter.maxRange = 200  # Maximum depth in cm

Decimation Filter

Decimation Filter will sub-sample the depth map, which means it reduces the depth scene complexity and allows other filters to run faster. Setting decimationFactor to 2 will downscale 1280x800 depth map to 640x400. We can either select pixel skipping, median, or mean decimation mode, and the latter two modes help with filtering as well. decimationFactor 1 disables the filter.

Usage:

Python

1stereo.initialConfig.postProcessing.decimationFilter.decimationFactor = 2  # 1 = disabled, 2 = 2x downscale, etc.

Filtering Order

The order of the filters is important, as the output of one filter is the input of the next filter. The order of the filters is customizable:

Usage:

Python

1config.postProcessing.filteringOrder = [
2    dai.RawStereoDepthConfig.PostProcessing.Filter.TEMPORAL,
3    dai.RawStereoDepthConfig.PostProcessing.Filter.SPECKLE,
4    dai.RawStereoDepthConfig.PostProcessing.Filter.SPATIAL,
5    dai.RawStereoDepthConfig.PostProcessing.Filter.MEDIAN,
6    dai.RawStereoDepthConfig.PostProcessing.Filter.DECIMATION
7]
8stereo.initialConfig.set(config)

Limitations

Median filter: On RVC4, median filter is implemented in software (CPU) and only supports 3x3 or 5x5 kernel sizes. If 7x7 is requested, it is automatically downgraded to 5x5.
Subpixel: Fixed to 4 bits on RVC4 (cannot be changed to 3 or 5 bits).
LR Check: Runs purely in software on CPU (not hardware-accelerated like on RVC2).

Reference

class

dai::node::StereoDepth

#include StereoDepth.hpp

variable

std::shared_ptr< StereoDepthConfig > initialConfig

Initial config to use for StereoDepth.

variable

Input inputConfig

Input StereoDepthConfig message with ability to modify parameters in runtime.

variable

Input inputAlignTo

Input align to message. Default queue is non-blocking with size 1.

variable

Input left

Input for left ImgFrame of left-right pair

variable

Input right

Input for right ImgFrame of left-right pair

variable

Output depth

Outputs ImgFrame message that carries RAW16 encoded (0..65535) depth data in depth units (millimeter by default).Non-determined / invalid depth values are set to 0

variable

Output disparity

Outputs ImgFrame message that carries RAW8 / RAW16 encoded disparity data: RAW8 encoded (0..95) for standard mode; RAW8 encoded (0..190) for extended disparity mode; RAW16 encoded for subpixel disparity mode:

0..760 for 3 fractional bits (by default)
0..1520 for 4 fractional bits
0..3040 for 5 fractional bits

variable

Output syncedLeft

Passthrough ImgFrame message from 'left' Input.

variable

Output syncedRight

Passthrough ImgFrame message from 'right' Input.

variable

Output rectifiedLeft

Outputs ImgFrame message that carries RAW8 encoded (grayscale) rectified frame data.

variable

Output rectifiedRight

Outputs ImgFrame message that carries RAW8 encoded (grayscale) rectified frame data.

variable

Output outConfig

Outputs StereoDepthConfig message that contains current stereo configuration.

variable

Output debugDispLrCheckIt1

Outputs ImgFrame message that carries left-right check first iteration (before combining with second iteration) disparity map. Useful for debugging/fine tuning.

variable

Output debugDispLrCheckIt2

Outputs ImgFrame message that carries left-right check second iteration (before combining with first iteration) disparity map. Useful for debugging/fine tuning.

variable

Output debugExtDispLrCheckIt1

Outputs ImgFrame message that carries extended left-right check first iteration (downscaled frame, before combining with second iteration) disparity map. Useful for debugging/fine tuning.

variable

Output debugExtDispLrCheckIt2

Outputs ImgFrame message that carries extended left-right check second iteration (downscaled frame, before combining with first iteration) disparity map. Useful for debugging/fine tuning.

variable

Output debugDispCostDump

Outputs ImgFrame message that carries cost dump of disparity map. Useful for debugging/fine tuning.

variable

Output confidenceMap

Outputs ImgFrame message that carries RAW8 confidence map. Lower values mean lower confidence of the calculated disparity value. RGB alignment, left-right check or any postprocessing (e.g., median filter) is not performed on confidence map.

function

StereoDepth()

inline function

std::shared_ptr< StereoDepth > build(Node::Output & left, Node::Output & right, PresetMode presetMode)

function

std::shared_ptr< StereoDepth > build(bool autoCreateCameras, PresetMode presetMode, std::pair< int, int > size, std::optional< float > fps)

Create StereoDepth node. Note that this API is global and if used autocreated cameras can't be reused.

Parameters

autoCreateCameras: If true, will create left and right nodes if they don't exist
presetMode: Preset mode for stereo depth

function

void loadMeshFiles(const std::filesystem::path & pathLeft, const std::filesystem::path & pathRight)

Specify local filesystem paths to the mesh calibration files for 'left' and 'right' inputs.When a mesh calibration is set, it overrides the camera intrinsics/extrinsics matrices. Overrides useHomographyRectification behavior. Mesh format: a sequence of (y,x) points as 'float' with coordinates from the input image to be mapped in the output. The mesh can be subsampled, configured by With a 1280x800 resolution and the default (16,16) step, the required mesh size is:width: 1280 / 16 + 1 = 81height: 800 / 16 + 1 = 51

function

void loadMeshData(const std::vector< std::uint8_t > & dataLeft, const std::vector< std::uint8_t > & dataRight)

Specify mesh calibration data for 'left' and 'right' inputs, as vectors of bytes. Overrides useHomographyRectification behavior. See

function

void setMeshStep(int width, int height)

Set the distance between mesh points. Default: (16, 16)

function

void setInputResolution(int width, int height)

Specify input resolution sizeOptional if MonoCamera exists, otherwise necessary

function

void setInputResolution(std::tuple< int, int > resolution)

Specify input resolution sizeOptional if MonoCamera exists, otherwise necessary

function

void setOutputSize(int width, int height)

Specify disparity/depth output resolution size, implemented by scaling.Currently only applicable when aligning to RGB camera

function

void setOutputKeepAspectRatio(bool keep)

Specifies whether the frames resized by

function

void setDepthAlign(Properties::DepthAlign align)

Parameters

align: Set the disparity/depth alignment: centered (between the 'left' and 'right' inputs), or from the perspective of a rectified output stream

function

void setDepthAlign(CameraBoardSocket camera)

Parameters

camera: Set the camera from whose perspective the disparity/depth will be aligned

function

void setRectification(bool enable)

Rectify input images or not.

function

void setLeftRightCheck(bool enable)

Computes and combines disparities in both L-R and R-L directions, and combine them.For better occlusion handling, discarding invalid disparity values

function

void setSubpixel(bool enable)

Computes disparity with sub-pixel interpolation (3 fractional bits by default).Suitable for long range. Currently incompatible with extended disparity

function

void setSubpixelFractionalBits(int subpixelFractionalBits)

Number of fractional bits for subpixel mode. Default value: 3. Valid values: 3,4,5. Defines the number of fractional disparities: 2^x. Median filter postprocessing is supported only for 3 fractional bits.

function

void setExtendedDisparity(bool enable)

Disparity range increased from 0-95 to 0-190, combined from full resolution and downscaled images.Suitable for short range objects. Currently incompatible with sub-pixel disparity

function

void setRectifyEdgeFillColor(int color)

Fill color for missing data at frame edges

Parameters

color: Grayscale 0..255, or -1 to replicate pixels

function

void setRuntimeModeSwitch(bool enable)

Enable runtime stereo mode switch, e.g. from standard to LR-check. Note: when enabled resources allocated for worst case to enable switching to any mode.

function

void setNumFramesPool(int numFramesPool)

Specify number of frames in pool.

Parameters

numFramesPool: How many frames should the pool have

function

void setPostProcessingHardwareResources(int numShaves, int numMemorySlices)

Specify allocated hardware resources for stereo depth. Suitable only to increase post processing runtime.

Parameters

numShaves: Number of shaves.
numMemorySlices: Number of memory slices.

function

void setDefaultProfilePreset(PresetMode mode)

Sets a default preset based on specified option.

Parameters

mode: Stereo depth preset mode

function

void useHomographyRectification(bool useHomographyRectification)

Use 3x3 homography matrix for stereo rectification instead of sparse mesh generated on device. Default behaviour is AUTO, for lenses with FOV over 85 degrees sparse mesh is used, otherwise 3x3 homography. If custom mesh data is provided through loadMeshData or loadMeshFiles this option is ignored.

Parameters

useHomographyRectification: true: 3x3 homography matrix generated from calibration data is used for stereo rectification, can't correct lens distortion. false: sparse mesh is generated on-device from calibration data with mesh step specified with setMeshStep (Default: (16, 16)), can correct lens distortion. Implementation for generating the mesh is same as opencv's initUndistortRectifyMap function. Only the first 8 distortion coefficients are used from calibration data.

function

void enableDistortionCorrection(bool enableDistortionCorrection)

Equivalent to useHomographyRectification(!enableDistortionCorrection)

function

void setFrameSync(bool enableFrameSync)

Whether to enable frame syncing inside stereo node or not. Suitable if inputs are known to be synced.

function

void setBaseline(float baseline)

Override baseline from calibration. Used only in disparity to depth conversion. Units are centimeters.

function

void setFocalLength(float focalLength)

Override focal length from calibration. Used only in disparity to depth conversion. Units are pixels.

function

void setDisparityToDepthUseSpecTranslation(bool specTranslation)

Use baseline information for disparity to depth conversion from specs (design data) or from calibration. Default: true

function

void setRectificationUseSpecTranslation(bool specTranslation)

Obtain rectification matrices using spec translation (design data) or from calibration in calculations. Should be used only for debugging. Default: false

function

void setDepthAlignmentUseSpecTranslation(bool specTranslation)

Use baseline information for depth alignment from specs (design data) or from calibration. Default: true

function

void setAlphaScaling(float alpha)

Free scaling parameter between 0 (when all the pixels in the undistorted image are valid) and 1 (when all the source image pixels are retained in the undistorted image). On some high distortion lenses, and/or due to rectification (image rotated) invalid areas may appear even with alpha=0, in these cases alpha < 0.0 helps removing invalid areas. See getOptimalNewCameraMatrix from opencv for more details.

enum

std::uint32_t PresetMode

Preset modes for stereo depth.

enumerator

FAST_ACCURACY

enumerator

FAST_DENSITY

enumerator

DEFAULT

enumerator

FACE

enumerator

HIGH_DETAIL

enumerator

ROBOTICS

enumerator

DENSITY

enumerator

ACCURACY

enum

dai::StereoDepthConfig::MedianFilter MedianFilter

class

dai::StereoDepthConfig

#include StereoDepthConfig.hpp

variable

AlgorithmControl algorithmControl

Controls the flow of stereo algorithm - left-right check, subpixel etc.

variable

PostProcessing postProcessing

Controls the postprocessing of disparity and/or depth map.

variable

CensusTransform censusTransform

Census transform settings.

variable

CostMatching costMatching

Cost matching settings.

variable

CostAggregation costAggregation

Cost aggregation settings.

variable

ConfidenceMetrics confidenceMetrics

Confidence metrics settings.

variable

dai::ProcessorType filtersBackend

function

StereoDepthConfig()

Construct StereoDepthConfig message.

function

~StereoDepthConfig()

function

StereoDepthConfig & setDepthAlign(AlgorithmControl::DepthAlign align)

Parameters

align: Set the disparity/depth alignment: centered (between the 'left' and 'right' inputs), or from the perspective of a rectified output stream

function

StereoDepthConfig & setConfidenceThreshold(int confThr)

Confidence threshold for disparity calculation

Parameters

confThr: Confidence threshold value 0..255

function

int getConfidenceThreshold()

Get confidence threshold for disparity calculation

function

StereoDepthConfig & setMedianFilter(MedianFilter median)

Parameters

median: Set kernel size for disparity/depth median filtering, or disable

function

MedianFilter getMedianFilter()

Get median filter setting

function

StereoDepthConfig & setBilateralFilterSigma(uint16_t sigma)

A larger value of the parameter means that farther colors within the pixel neighborhood will be mixed together, resulting in larger areas of semi-equal color.

Parameters

sigma: Set sigma value for 5x5 bilateral filter. 0..65535

function

uint16_t getBilateralFilterSigma()

Get sigma value for 5x5 bilateral filter

function

StereoDepthConfig & setLeftRightCheckThreshold(int threshold)

Parameters

threshold: Set threshold for left-right, right-left disparity map combine, 0..255

function

int getLeftRightCheckThreshold()

Get threshold for left-right check combine

function

StereoDepthConfig & setLeftRightCheck(bool enable)

Computes and combines disparities in both L-R and R-L directions, and combine them.For better occlusion handling, discarding invalid disparity values

function

bool getLeftRightCheck()

Get left-right check setting

function

StereoDepthConfig & setExtendedDisparity(bool enable)

Disparity range increased from 95 to 190, combined from full resolution and downscaled images. Suitable for short range objects

function

bool getExtendedDisparity()

Get extended disparity setting

function

StereoDepthConfig & setSubpixel(bool enable)

Computes disparity with sub-pixel interpolation (3 fractional bits by default).Suitable for long range. Currently incompatible with extended disparity

function

bool getSubpixel()

Get subpixel setting

function

StereoDepthConfig & setSubpixelFractionalBits(int subpixelFractionalBits)

function

int getSubpixelFractionalBits()

Get number of fractional bits for subpixel mode

function

StereoDepthConfig & setDepthUnit(AlgorithmControl::DepthUnit depthUnit)

Set depth unit of depth map.Meter, centimeter, millimeter, inch, foot or custom unit is available.

function

AlgorithmControl::DepthUnit getDepthUnit()

Get depth unit of depth map.

function

StereoDepthConfig & setCustomDepthUnitMultiplier(float multiplier)

Set custom depth unit multiplier relative to 1 meter.

function

float getCustomDepthUnitMultiplier()

Get custom depth unit multiplier relative to 1 meter.

function

StereoDepthConfig & setDisparityShift(int disparityShift)

Shift input frame by a number of pixels to increase minimum depth. For example shifting by 48 will change effective disparity search range from (0,95] to [48,143]. An alternative approach to reducing the minZ. We normally only recommend doing this when it is known that there will be no objects farther away than MaxZ, such as having a depth camera mounted above a table pointing down at the table surface.

function

StereoDepthConfig & setNumInvalidateEdgePixels(int32_t numInvalidateEdgePixels)

Invalidate X amount of pixels at the edge of disparity frame. For right and center alignment X pixels will be invalidated from the right edge, for left alignment from the left edge.

function

StereoDepthConfig & setFiltersComputeBackend(dai::ProcessorType filtersBackend)

Set filters compute backend

function

dai::ProcessorType getFiltersComputeBackend()

Get filters compute backend

function

float getMaxDisparity()

Useful for normalization of the disparity map.

Returns

Maximum disparity value that the node can return

function

void serialize(std::vector< std::uint8_t > & metadata, DatatypeEnum & datatype)

inline function

DatatypeEnum getDatatype()

function

 DEPTHAI_SERIALIZE(StereoDepthConfig, algorithmControl, postProcessing, censusTransform, costMatching, costAggregation, confidenceMetrics, filtersBackend)

struct

dai::StereoDepthConfig::AlgorithmControl

variable

DepthAlign depthAlign

Set the disparity/depth alignment to the perspective of a rectified output, or center it

variable

DepthUnit depthUnit

Measurement unit for depth data. Depth data is integer value, multiple of depth unit.

variable

float customDepthUnitMultiplier

Custom depth unit multiplier, if custom depth unit is enabled, relative to 1 meter. A multiplier of 1000 effectively means depth unit in millimeter.

variable

bool enableLeftRightCheck

Computes and combines disparities in both L-R and R-L directions, and combine them. For better occlusion handling

variable

bool enableSwLeftRightCheck

Enables software left right check. Applicable to RVC4 only.

variable

bool enableExtended

Disparity range increased from 95 to 190, combined from full resolution and downscaled images. Suitable for short range objects

variable

bool enableSubpixel

Computes disparity with sub-pixel interpolation (5 fractional bits), suitable for long range

variable

std::int32_t leftRightCheckThreshold

Left-right check threshold for left-right, right-left disparity map combine, 0..128 Used only when left-right check mode is enabled. Defines the maximum difference between the confidence of pixels from left-right and right-left confidence maps

variable

std::int32_t subpixelFractionalBits

Number of fractional bits for subpixel modeValid values: 3,4,5Defines the number of fractional disparities: 2^xMedian filter postprocessing is supported only for 3 fractional bits

variable

std::int32_t disparityShift

variable

std::optional< float > centerAlignmentShiftFactor

variable

std::int32_t numInvalidateEdgePixels

Invalidate X amount of pixels at the edge of disparity frame. For right and center alignment X pixels will be invalidated from the right edge, for left alignment from the left edge.

function

 DEPTHAI_SERIALIZE(AlgorithmControl, depthAlign, depthUnit, customDepthUnitMultiplier, enableLeftRightCheck, enableSwLeftRightCheck, enableExtended, enableSubpixel, leftRightCheckThreshold, subpixelFractionalBits, disparityShift, centerAlignmentShiftFactor, numInvalidateEdgePixels)

enum

int32_t DepthAlign

Align the disparity/depth to the perspective of a rectified output, or center it

enumerator

RECTIFIED_RIGHT

enumerator

RECTIFIED_LEFT

enumerator

CENTER

enum

dai::DepthUnit DepthUnit

struct

dai::StereoDepthConfig::CensusTransform

#include StereoDepthConfig.hpp

variable

KernelSize kernelSize

Census transform kernel size.

variable

uint64_t kernelMask

Census transform mask, default - auto, mask is set based on resolution and kernel size. Disabled for 400p input resolution. Enabled for 720p. 0XA82415 for 5x5 census transform kernel. 0XAA02A8154055 for 7x7 census transform kernel. 0X2AA00AA805540155 for 7x9 census transform kernel. Empirical values.

variable

bool enableMeanMode

If enabled, each pixel in the window is compared with the mean window value instead of the central pixel.

variable

uint32_t threshold

Census transform comparison threshold value.

variable

int8_t noiseThresholdOffset

Used to reduce small fixed levels of noise across all luminance values in the current image. Valid range is [0,127]. Default value is 0.

variable

int8_t noiseThresholdScale

Used to reduce noise values that increase with luminance in the current image. Valid range is [-128,127]. Default value is 0.

function

 DEPTHAI_SERIALIZE(CensusTransform, kernelSize, kernelMask, enableMeanMode, threshold, noiseThresholdOffset, noiseThresholdScale)

enum

std::int32_t KernelSize

Census transform kernel size possible values.

enumerator

AUTO

enumerator

KERNEL_5x5

enumerator

KERNEL_7x7

enumerator

KERNEL_7x9

struct

dai::StereoDepthConfig::ConfidenceMetrics

variable

uint8_t occlusionConfidenceWeight

Weight used with occlusion estimation to generate final confidence map. Valid range is [0,32]

variable

uint8_t motionVectorConfidenceWeight

Weight used with local neighborhood motion vector variance estimation to generate final confidence map. Valid range is [0,32].

variable

uint8_t motionVectorConfidenceThreshold

Threshold offset for MV variance in confidence generation. A value of 0 allows most variance. Valid range is [0,3].

variable

uint8_t flatnessConfidenceWeight

Weight used with flatness estimation to generate final confidence map. Valid range is [0,32].

variable

uint8_t flatnessConfidenceThreshold

Threshold for flatness check in SGM block. Valid range is [1,7].

variable

bool flatnessOverride

Flag to indicate whether final confidence value will be overidden by flatness value. Valid range is {true,false}.

function

 DEPTHAI_SERIALIZE(ConfidenceMetrics, occlusionConfidenceWeight, motionVectorConfidenceWeight, motionVectorConfidenceThreshold, flatnessConfidenceWeight, flatnessConfidenceThreshold, flatnessOverride)

struct

dai::StereoDepthConfig::CostAggregation

#include StereoDepthConfig.hpp

variable

uint8_t divisionFactor

Cost calculation linear equation parameters.

variable

uint16_t horizontalPenaltyCostP1

Horizontal P1 penalty cost parameter.

variable

uint16_t horizontalPenaltyCostP2

Horizontal P2 penalty cost parameter.

variable

uint16_t verticalPenaltyCostP1

Vertical P1 penalty cost parameter.

variable

uint16_t verticalPenaltyCostP2

Vertical P2 penalty cost parameter.

variable

P1Config p1Config

variable

P2Config p2Config

function

 DEPTHAI_SERIALIZE(CostAggregation, divisionFactor, horizontalPenaltyCostP1, horizontalPenaltyCostP2, verticalPenaltyCostP1, verticalPenaltyCostP2, p1Config, p2Config)

struct

dai::StereoDepthConfig::CostAggregation::P1Config

#include StereoDepthConfig.hpp

variable

bool enableAdaptive

Used to disable/enable adaptive penalty.

variable

uint8_t defaultValue

Used as the default penalty value when nAdapEnable is disabled. A bigger value enforces higher smoothness and reduced noise at the cost of lower edge accuracy. This value must be smaller than P2 default penalty. Valid range is [10,50].

variable

uint8_t edgeValue

Penalty value on edges when nAdapEnable is enabled. A smaller penalty value permits higher change in disparity. This value must be smaller than or equal to P2 edge penalty. Valid range is [10,50].

variable

uint8_t smoothValue

Penalty value on low texture regions when nAdapEnable is enabled. A smaller penalty value permits higher change in disparity. This value must be smaller than or equal to P2 smoothness penalty. Valid range is [10,50].

variable

uint8_t edgeThreshold

Threshold value on edges when nAdapEnable is enabled. A bigger value permits higher neighboring feature dissimilarity tolerance. This value is shared with P2 penalty configuration. Valid range is [8,16].

variable

uint8_t smoothThreshold

Threshold value on low texture regions when nAdapEnable is enabled. A bigger value permits higher neighboring feature dissimilarity tolerance. This value is shared with P2 penalty configuration. Valid range is [2,12].

function

DEPTHAI_SERIALIZE(P1Config, enableAdaptive, defaultValue, edgeValue, smoothValue, edgeThreshold, smoothThreshold)

struct

dai::StereoDepthConfig::CostAggregation::P2Config

#include StereoDepthConfig.hpp

variable

bool enableAdaptive

Used to disable/enable adaptive penalty.

variable

uint8_t defaultValue

Used as the default penalty value when nAdapEnable is disabled. A bigger value enforces higher smoothness and reduced noise at the cost of lower edge accuracy. This value must be larger than P1 default penalty. Valid range is [20,100].

variable

uint8_t edgeValue

Penalty value on edges when nAdapEnable is enabled. A smaller penalty value permits higher change in disparity. This value must be larger than or equal to P1 edge penalty. Valid range is [20,100].

variable

uint8_t smoothValue

Penalty value on low texture regions when nAdapEnable is enabled. A smaller penalty value permits higher change in disparity. This value must be larger than or equal to P1 smoothness penalty. Valid range is [20,100].

function

DEPTHAI_SERIALIZE(P2Config, enableAdaptive, defaultValue, edgeValue, smoothValue)

struct

dai::StereoDepthConfig::CostMatching

#include StereoDepthConfig.hpp

variable

DisparityWidth disparityWidth

Disparity search range, default 96 pixels.

variable

bool enableCompanding

Disparity companding using sparse matching. Matching pixel by pixel for N disparities. Matching every 2nd pixel for M disparitites. Matching every 4th pixel for T disparities. In case of 96 disparities: N=48, M=32, T=16. This way the search range is extended to 176 disparities, by sparse matching. Note: when enabling this flag only depth map will be affected, disparity map is not.

variable

uint8_t invalidDisparityValue

Used only for debug purposes, SW postprocessing handled only invalid value of 0 properly.

variable

uint8_t confidenceThreshold

Disparities with confidence value over this threshold are accepted.

variable

bool enableSwConfidenceThresholding

Enable software confidence thresholding. Applicable to RVC4 only.

variable

LinearEquationParameters linearEquationParameters

Cost calculation linear equation parameters.

function

 DEPTHAI_SERIALIZE(CostMatching, disparityWidth, enableCompanding, invalidDisparityValue, confidenceThreshold, enableSwConfidenceThresholding, linearEquationParameters)

struct

dai::StereoDepthConfig::CostMatching::LinearEquationParameters

#include StereoDepthConfig.hpp

variable

uint8_t alpha

variable

uint8_t beta

variable

uint8_t threshold

function

DEPTHAI_SERIALIZE(LinearEquationParameters, alpha, beta, threshold)

enum

std::uint32_t DisparityWidth

Disparity search range: 64 or 96 pixels are supported by the HW.

enumerator

DISPARITY_64

enumerator

DISPARITY_96

struct

dai::StereoDepthConfig::PostProcessing

#include StereoDepthConfig.hpp

variable

std::array< Filter, 5 > filteringOrder

Order of filters to be applied if filtering is enabled.

variable

MedianFilter median

Set kernel size for disparity/depth median filtering, or disable

variable

std::int16_t bilateralSigmaValue

Sigma value for bilateral filter. 0 means disabled. A larger value of the parameter means that farther colors within the pixel neighborhood will be mixed together.

variable

SpatialFilter spatialFilter

Edge-preserving filtering: This type of filter will smooth the depth noise while attempting to preserve edges.

variable

TemporalFilter temporalFilter

Temporal filtering with optional persistence.

variable

ThresholdFilter thresholdFilter

Threshold filtering. Filters out distances outside of a given interval.

variable

BrightnessFilter brightnessFilter

Brightness filtering. If input frame pixel is too dark or too bright, disparity will be invalidated. The idea is that for too dark/too bright pixels we have low confidence, since that area was under/over exposed and details were lost.

variable

SpeckleFilter speckleFilter

Speckle filtering. Removes speckle noise.

variable

DecimationFilter decimationFilter

Decimation filter. Reduces disparity/depth map x/y complexity, reducing runtime complexity for other filters.

variable

HoleFilling holeFilling

variable

AdaptiveMedianFilter adaptiveMedianFilter

function

 DEPTHAI_SERIALIZE(PostProcessing, filteringOrder, median, bilateralSigmaValue, spatialFilter, temporalFilter, thresholdFilter, brightnessFilter, speckleFilter, decimationFilter, holeFilling, adaptiveMedianFilter)

struct

dai::StereoDepthConfig::PostProcessing::AdaptiveMedianFilter

variable

bool enable

Flag to enable adaptive median filtering for a final pass of filtering on low confidence pixels.

variable

uint8_t confidenceThreshold

Confidence threshold for adaptive median filtering. Should be less than nFillConfThresh value used in evaDfsHoleFillConfig. Valid range is [0,255].

function

DEPTHAI_SERIALIZE(AdaptiveMedianFilter, enable, confidenceThreshold)

struct

dai::StereoDepthConfig::PostProcessing::BrightnessFilter

#include StereoDepthConfig.hpp

variable

std::int32_t minBrightness

Minimum pixel brightness. If input pixel is less or equal than this value the depth value is invalidated.

variable

std::int32_t maxBrightness

Maximum range in depth units. If input pixel is less or equal than this value the depth value is invalidated.

function

DEPTHAI_SERIALIZE(BrightnessFilter, minBrightness, maxBrightness)

struct

dai::StereoDepthConfig::PostProcessing::DecimationFilter

#include StereoDepthConfig.hpp

variable

std::uint32_t decimationFactor

Decimation factor. Valid values are 1,2,3,4. Disparity/depth map x/y resolution will be decimated with this value.

variable

DecimationMode decimationMode

Decimation algorithm type.

function

DEPTHAI_SERIALIZE(DecimationFilter, decimationFactor, decimationMode)

enum

int32_t DecimationMode

Decimation algorithm type.

enumerator

PIXEL_SKIPPING

enumerator

NON_ZERO_MEDIAN

enumerator

NON_ZERO_MEAN

struct

dai::StereoDepthConfig::PostProcessing::HoleFilling

variable

bool enable

Flag to enable post-processing hole-filling.

variable

uint8_t highConfidenceThreshold

Pixels with confidence higher than this value are used to calculate an average disparity per superpixel. Valid range is [1,255]

variable

uint8_t fillConfidenceThreshold

Pixels with confidence below this value will be filled with the average disparity of their corresponding superpixel. Valid range is [1,255].

variable

uint8_t minValidDisparity

Represents the required percentange of pixels with confidence value above nHighConfThresh that are used to calculate average disparity per superpixel, where 1 means 50% or half, 2 means 25% or a quarter and 3 means 12.5% or an eighth. If the required number of pixels are not found, the holes will not be filled.

variable

bool invalidateDisparities

If enabled, sets to 0 the disparity of pixels with confidence below nFillConfThresh, which did not pass nMinValidPixels criteria. Valid range is {true, false}.

function

 DEPTHAI_SERIALIZE(HoleFilling, enable, highConfidenceThreshold, fillConfidenceThreshold, minValidDisparity, invalidateDisparities)

struct

dai::StereoDepthConfig::PostProcessing::ThresholdFilter

#include StereoDepthConfig.hpp

variable

std::int32_t minRange

Minimum range in depth units. Depth values under this value are invalidated.

variable

std::int32_t maxRange

Maximum range in depth units. Depth values over this value are invalidated.

function

DEPTHAI_SERIALIZE(ThresholdFilter, minRange, maxRange)

enum

int32_t Filter

enumerator

NONE

enumerator

DECIMATION

enumerator

SPECKLE

enumerator

MEDIAN

enumerator

SPATIAL

enumerator

TEMPORAL

enumerator

FILTER_COUNT

enum

filters::params::SpatialFilter SpatialFilter

enum

filters::params::TemporalFilter TemporalFilter

enum

filters::params::SpeckleFilter SpeckleFilter

enum

filters::params::MedianFilter MedianFilter

Need assistance?

Head over to Discussion Forum for technical support or any other questions you might have.

ON THIS PAGE

StereoDepth

Configure Stereo Depth

How to place it

Python

C++

Inputs and Outputs

Usage

Python

C++

Examples of functionality

Depth Alignment

Configure Camera for High FPS

Changing camera calibration during runtime

Platform-Specific Configuration

RVC2

Internal block diagram of StereoDepth node

Depth Presets

Depth Configuration

Extended Disparity

Usage:

Subpixel Disparity

Usage:

LR Check

Usage:

Confidence Threshold

Usage:

Median Filter

Usage:

Speckle Filter

Usage:

Temporal Filter

Usage:

Spatial Filter

Usage:

Brightness Filter

Usage:

Threshold Filter

Usage:

Decimation Filter

Usage:

Filtering Order

Limitations

RVC4

Internal block diagram of StereoDepth node

Depth Presets

Depth Configuration

Extended Disparity

Usage:

Subpixel Disparity

LR Check

Usage:

Software Confidence

Usage:

Confidence Metrics (Occlusion, Motion Vector, Flatness)

Usage:

Adaptive Median Filter

Usage:

Hole Filling

Usage:

Median Filter

Usage:

Speckle Filter

Usage:

Temporal Filter

Usage:

Spatial Filter

Usage:

Brightness Filter

Usage:

Threshold Filter

Usage:

Decimation Filter

Usage:

Filtering Order

Usage:

Limitations

Reference

dai::node::StereoDepth

std::uint32_t PresetMode