Compared to StereoLabs ZED cameras, OAK cameras feature a ton of on-device features (stereo depth estimation/disparity matching, custom AI models, object tracking, scripting, encoding etc.).
Overview
Here's a quick comparison of on-device capabilities between OAK cameras and ZED cameras. More details can be found at at On-device feature comparison.
On-device capability
OAK Cameras
ZED cameras
Camera ISP
✔️
✔️
Stereo matching
✔️
-
Stereo postprocessing
✔️
-
AI processing
✔️
-
CV processing
✔️
-
Video encoding
✔️
-
Essentially, ZED cameras require a powerful host computer with beefy NVIDIA GPU to process the stereo data and run AI models. OAK cameras can do all the processing on-device, eliminating the need for a powerful host (see Host requirements for details).
Depth accuracy comparison
From our own evaluation (details at Stereo cameras accuracy comparison blog post), we found that among the tested cameras, the OAK-D Long Range delivers the most impressive long-range depth accuracy. Its 15cm baseline significantly reduces depth errors, making it ideal for applications requiring extended-range perception.Camera Performance Overview
OAK-D Long Range: Excels in long-range performance with a 15cm baseline, ensuring great depth accuracy and minimal errors at greater distances.
ZED 2i: Performs well at long ranges, benefiting from a 12cm baseline and high 2K resolution.
OAK-D Pro: Provides a balanced active stereo depth estimation performance across various distances with its 7.5cm baseline.
Our platform was built from the ground up with the idea of being customizable. All of our products based on OAK-SoM are open-source so you can easily redesign the board (see Integrating DepthAI into products), for example to change the stereo baseline distance or use a different image sensor (we support a bunch of different sensors).OAK FFC line is great for prototyping, as it allows users to use different camera sensors/optics and place them at an ideal stereo baseline distance for their application.Below is a long-range disparity depth visualized over a color frame. This customer used narrow FOV M12 lenses with wide stereo baseline distance (25cm) to achieve such results with our platform.See stereo depth documentation on max depth perception calculations based on camera intrinsics/baseline distance.
On-device feature comparison
OAK cameras integrate a wide range of advanced processing capabilities directly on-device, eliminating the need for a powerful external host. In contrast, StereoLabs™ ZED cameras rely entirely on host-based processing. Here's a snapshot of what Luxonis offers:
Custom AI models - You can run any AI/NN model(s) on the device, as long as all layers are supported. You can also choose from 200+ pretrained NN models from Open Model Zoo and DepthAI Model Zoo.
Object detection - Most popular object detectors have been converted and run on our devices. DepthAI supports onboard decoding of Yolo and MobileNet based NN models.
Object tracking - ObjectTracker node comes with 4 tracker types, and it also supports tracking of objects in 3D space.
On-device scripting - Script node enables users to run custom Python 3.9 scripts that will run on the device, used for managing the flow of the pipeline (business logic).
Video/Image encoding - VideoEncoder node allows encoding into MJPEG, H265, or H264 formats.
Image Manipulation - ImageManip node allows users to resize, warp, crop, flip, and thumbnail image frames and do type conversions (YUV420, NV12, RGB, etc.)
Skeleton/Hand Tracking - Detect and track key points of a hand or human pose. Geaxgx's demos: Hand tracker, Blazepose, Movenet.
3D Semantic segmentation - Perceive the world with semantically-labeled pixels. DeeplabV3 demo here.
3D Object Pose Estimation - MediaPipe's Objectron has been converted to run on OAK cameras. Video here.
3D Edge Detection - EdgeDetector node uses Sobel filter to detect edges. With depth information, you can get physical position of these edges.