AI Capabilities

AI building blocks
- Run generic networks with NeuralNetwork and hardware-accelerated DetectionNetwork/SpatialDetectionNetwork.
- Source models from Model Zoo, custom training, or integrations; convert with Model Conversion.
- Package once with NNArchive; measure performance using Benchmarking.
Typical use cases
- Object detection & tracking: run YOLO via DetectionNetwork on RGB and feed IDs into ObjectTracker for persistent tracklets (optionally spatial with depth).
- Pose/landmarks & overlays: use ParsingNeuralNetwork which automatically parses the outputs in standard message types.
- Segmentation & fusion: run pixelwise models, then fuse with depth or spatial perception to measure volume, occupancy, or free space.
Code-less quick start with App Store
Data Collection App
On-device open-vocabulary detection (YOLOE via DepthAI) auto-collects snaps and lets the UI pick labels, set confidence, and enable snap conditions.
Open Data Collection App
Guides and examples
Generic NN Example
Run any neural model on OAK camera
Detection Network
Run Model Zoo YOLO on RGB
Detection remap
Align detections between streams
Detection replay
Track detections on recorded video
Model Zoo example
Download and unpack HubAI models in-pipeline
AI Inference Docs
Read more about inference in DepthAI
Need assistance?
Head over to Discussion Forum for technical support or any other questions you might have.