• OAK vs RealSense™
  • Depth comparison
  • 3rd party evaluation
  • Custom stereo depth perception
  • On-device feature comparison
  • Features described
  • Camera specification
  • Embedded use-case

OAK vs RealSense™

Compared to RealSense™ stereo cameras, the DepthAI platform adds many on-device features to OAK cameras (custom AI models, tracking, scripting, encoding, etc.) and can be used in embedded applications (see Embedded use-case).

Depth comparison

In our own evaluation, we found that the OAK-D Pro camera has slightly better depth accuracy than the RealSense™ D435i / D455. For more information, please see:
  1. Accuracy comparison docs
  2. Accuracy comparison blog post

3rd party evaluation

A third party (customer) sent us their OAK evaluation results, comparing OAK-D Pro with RealSense™ D435i.
  • Laser dot projector disabled (passive stereo)
  • Laser dot projector enabled (active stereo)

Custom stereo depth perception

Our platform was built from the ground up with the idea of being customizable. All of our products based on OAK-SoM are open-source, so you can easily redesign the board (see Integrating DepthAI into products), for example to change the stereo baseline distance or use a different image sensor (we support a bunch of different sensors).

The OAK FFC line is great for prototyping, as it allows users to use different camera sensors/optics and place them at an ideal stereo baseline distance for their application.

Below is a long-range disparity depth visualized over a color frame. This customer used narrow FOV M12 lenses with a wide stereo baseline distance (25 cm) to achieve such results with our platform.

See the stereo depth documentation on max depth perception calculations based on camera intrinsics/baseline distance.
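The effect of baseline and field of view on depth range can be sketched with the standard pinhole stereo relation, depth = focal_length_px × baseline / disparity. The snippet below is a minimal illustration under stated assumptions, not the DepthAI implementation: the function names are hypothetical, and the 1280-pixel image width, 72° and 40° HFOV values, and 7.5 cm baseline are assumed example numbers. Only the 25 cm baseline comes from the text above.

```python
import math


def focal_length_px(width_px: int, hfov_deg: float) -> float:
    """Focal length in pixels from image width and horizontal FOV (pinhole model)."""
    return width_px * 0.5 / math.tan(math.radians(hfov_deg) * 0.5)


def depth_m(focal_px: float, baseline_m: float, disparity_px: float) -> float:
    """Depth in meters for a given disparity, using depth = f * B / d."""
    if disparity_px <= 0:
        raise ValueError("disparity must be positive")
    return focal_px * baseline_m / disparity_px


WIDTH_PX = 1280  # assumed stereo frame width

# Assumed "default" setup: wide lens, short baseline.
default_focal = focal_length_px(WIDTH_PX, 72.0)
# Assumed long-range setup: narrow lens (40 deg is a guess), 25 cm baseline.
long_range_focal = focal_length_px(WIDTH_PX, 40.0)

# The farthest measurable depth corresponds to the smallest disparity (1 pixel here).
default_max_m = depth_m(default_focal, 0.075, 1.0)
long_range_max_m = depth_m(long_range_focal, 0.25, 1.0)

print(f"default setup:    max depth ~ {default_max_m:.1f} m")
print(f"long-range setup: max depth ~ {long_range_max_m:.1f} m")
```

Both a narrower FOV (larger focal length in pixels) and a wider baseline push the theoretical maximum depth out, which is why the long-range setup combines the two. Subpixel disparity and real-world noise change the practical limits, so treat these numbers as rough upper bounds.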

On-device feature comparison

On-Device Capabilities | OAK-D Pro | OAK-D S2 | OAK-D Lite | L515 | D415 | D430-D435 | D450-D455 | F455 | F450 | T261-T265
Custom AI models✔️✔️✔️
Object detection✔️✔️✔️
Object tracking✔️✔️✔️
On-device scripting✔️✔️✔️
Video/Image Encoding✔️✔️✔️
Image Manipulation✔️✔️✔️
Skeleton/Hand Tracking✔️✔️✔️✔️✔️✔️
3D Semantic Segmentation✔️✔️✔️
3D Object Pose Estimation✔️✔️✔️
3D Edge Detection✔️✔️✔️
Feature Tracking✔️✔️✔️✔️
3D Feature Tracking✔️✔️✔️
Face Recognition✔️✔️✔️✔️✔️

Features described

  • Custom AI models - You can run any AI/NN model(s) on the device, as long as all layers are supported. You can also choose from 200+ pretrained NN models from Open Model Zoo and DepthAI Model Zoo.
  • Object detection - Most popular object detectors have been converted and run on our devices. DepthAI supports onboard decoding of YOLO- and MobileNet-based NN models.
  • Object tracking - ObjectTracker node comes with 4 tracker types, and it also supports tracking of objects in 3D space.
  • On-device scripting - Script node enables users to run custom Python 3.9 scripts on the device, typically for managing the flow of the pipeline (business logic).
  • Video/Image encoding - VideoEncoder node allows encoding into MJPEG, H265, or H264 formats.
  • Image Manipulation - ImageManip node allows users to resize, warp, crop, flip, and thumbnail image frames and do type conversions (YUV420, NV12, RGB, etc.).
  • Skeleton/Hand Tracking - Detect and track key points of a hand or human pose. Geaxgx's demos: Hand tracker, Blazepose, Movenet.
  • 3D Semantic segmentation - Perceive the world with semantically-labeled pixels. DeeplabV3 demo here.
  • 3D Object Pose Estimation - MediaPipe's Objectron has been converted to run on OAK cameras. Video here.
  • 3D Edge Detection - EdgeDetector node uses a Sobel filter to detect edges. With depth information, you can get the physical position of these edges.
  • Feature Tracking - FeatureTracker node detects and tracks key points (features).
  • 3D Feature Tracking - With depth information, you can track these features in physical space.
  • OCR - Optical character recognition, demo here.
  • Face Recognition - Demo here, which runs face detection, alignment, and face recognition (3 different NN models) on the device simultaneously.
  • Encryption - Not yet addressed.
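
Several of the 3D entries above (3D Edge Detection, 3D Feature Tracking) come down to combining a 2D image location with its depth value to get a position in physical space. A minimal sketch of that step, using standard pinhole back-projection, is shown below. It is plain Python rather than DepthAI API code; the `Intrinsics` type, the `pixel_to_point` helper, and all numeric values are hypothetical and chosen only for illustration.

```python
import math
from typing import NamedTuple, Tuple


class Intrinsics(NamedTuple):
    """Pinhole camera intrinsics in pixels (hypothetical example type)."""
    fx: float
    fy: float
    cx: float
    cy: float


def pixel_to_point(u: float, v: float, depth_m: float,
                   k: Intrinsics) -> Tuple[float, float, float]:
    """Back-project pixel (u, v) with known depth into camera-frame XYZ (meters)."""
    x = (u - k.cx) * depth_m / k.fx
    y = (v - k.cy) * depth_m / k.fy
    return (x, y, depth_m)


# Assumed intrinsics for a 1280x800 frame; real values come from calibration.
K = Intrinsics(fx=800.0, fy=800.0, cx=640.0, cy=400.0)

# The same tracked feature seen in two frames: (u, v, depth in meters).
previous = pixel_to_point(700.0, 420.0, 2.0, K)
current = pixel_to_point(760.0, 410.0, 1.8, K)

# Physical displacement of the feature between the two frames.
moved_m = math.dist(previous, current)
print(f"feature moved {moved_m:.3f} m")
```

The same back-projection applies to edge pixels: each edge location plus its depth gives a 3D point, so a set of edge pixels becomes a set of points in physical space.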

Camera specification

Specification | OAK-D Pro / -W | OAK-D S2 / -W | OAK-D Lite | OAK-D SR PoE | L515 | D415 | D430-D435 | D450-D455 | F455 | F450 | T261-T265
RGB HFOV (°) | 69 / 109 | 69 / 109 | 73.6 | 80 | 80 | 69 | 69 | 90 | N/A | N/A | N/A
RGB Shutter | Rolling / Global | Rolling | Rolling | Global | Rolling | Rolling | Rolling | Global | N/A | N/A | N/A
RGB resolution | 12MP | 12MP | 13MP | 1MP | 2MP | 2MP | 2MP | 1MP | N/A | N/A | N/A
Depth Type | Active Stereo | Passive Stereo | Passive Stereo | Active Stereo + ToF | ToF | Active Stereo | Active Stereo | Active Stereo | Active Stereo | Active Stereo | N/A
Depth sensor | OV9282 | OV9282 | OV7251 | OV9782 + ToF | - | OV2740 | OV9282 | OV9782 | - | - | -
Depth HFOV (°) | 72 / 127 | 72 / 127 | 72 | 70 | N/A | 70 | 87 | 87 | 56 | 56 | 173
Min Depth | 20 cm | 20 cm | 20 cm | 20 cm | 25 cm | 45 cm | 28 cm | 52 cm | 30 cm | 30 cm | N/A

Rows with empty cells, values in column order:
Stereo Shutter: Global, Global, Global, Global, Rolling, Global, Global, Global
Depth resolution: 1280x800, 1280x800, 640x480, 1280x800, N/A, 1024x768, 1280x720, 1280x720, 848x800
IR LED: ✔️ ✔️ ✔️ ✔️ ✔️

Embedded use-case

Unlike RealSense™, our platform supports booting from flash (standalone mode) and features two-way SPI communication (SPIOut, SPIIn nodes). In standalone (on-the-edge) mode, you flash your application to the device, so the device does not need to be connected to a host (RPi, PC, laptop, ...); more information here. This allows users to build small, low-power embedded devices and integrate the OAK SoM to upgrade their products with the power of Spatial AI.
Intel, Intel RealSense and Intel Movidius Myriad are trademarks of Intel Corporation or its subsidiaries.