# Roboflow RF-DETR

## Overview

[RF-DETR](https://github.com/roboflow/rf-detr) is Roboflow's transformer-based object detection and instance segmentation model
family. On Luxonis devices, the current deployment workflow targets RVC4.

This integration shows the main path from an official RF-DETR checkpoint to on-device inference:

 * export the model to `ONNX`,
 * package it as an [NN Archive](https://docs.luxonis.com/software-v3/ai-inference/nn-archive.md),
 * convert it to an RVC4 archive with
   [HubAI](https://docs.luxonis.com/software-v3/ai-inference/conversion/rvc-conversion/online/hubai.md), and
 * run it with [DepthAI Nodes](https://docs.luxonis.com/software-v3/ai-inference/inference/depthai-nodes.md).

The linked tutorial is validated with RF-DETR `1.7.1` and RF-DETR Nano, but the same general approach can be used for other
RF-DETR variants as long as the relevant modules are present.

If you only need a ready-to-run model, RF-DETR Nano is already available in the [Model
Zoo](https://docs.luxonis.com/software-v3/ai-inference/model-source/zoo.md): [RF-DETR Nano model
card](https://models.luxonis.com/luxonis/rf-detr-nano/aim_XhzwWA4gawZ1a5g5ayirrb) and [RF-DETR Nano Instance Segmentation model
card](https://models.luxonis.com/luxonis/rf-detr-nano-instance-segmentation/aim_VWNN4569rNtANS9owXTJCJ). The steps below focus
primarily on the detection model, but the same process can also be applied to the instance segmentation variant, as demonstrated
in the linked notebook.

## Usage

> **Note**
> For the full end-to-end notebook, see the [RF-DETR RVC4 conversion tutorial](https://github.com/luxonis/ai-tutorials/blob/main/conversion/rfdetr_rvc4_conversion.ipynb).

You can run the export, archive creation, and HubAI conversion steps in Colab. The final inference step must run on a machine that
can access an RVC4 device.

### Main steps

 1. Install the required packages and clone the official `rf-detr` repository at a known-good version.
 2. Apply a small runtime class injection before export so the generated ONNX graph is compatible with the RVC4 conversion flow,
    without modifying the upstream RF-DETR source tree.
 3. Export the model to `ONNX` with a fixed `384x384` input and `opset 17`.
 4. Create an [NN Archive](https://docs.luxonis.com/software-v3/ai-inference/nn-archive.md) that describes the model inputs,
    outputs, preprocessing, and the `RFDETRParser` head used at runtime.
 5. Convert the archive to RVC4 with the [HubAI SDK](https://docs.luxonis.com/cloud/hubai/model-registry/hubai-sdk.md).
 6. Run the resulting `.rvc4.tar.xz` archive with a [DepthAI inference
    pipeline](https://docs.luxonis.com/software-v3/ai-inference/inference.md).

### Export compatibility changes

Before export, RF-DETR needs a small compatibility patch so the generated ONNX graph can be converted cleanly for RVC4.

The tutorial applies these changes at runtime through class injection rather than by editing the upstream RF-DETR source:

 * the windowed DINOv2 backbone is adjusted where image windows are merged back into a full spatial feature map,
 * the `MSDeformAttn` module is adjusted where multi-scale deformable attention computes sampling locations and attention weights.

These changes do not retrain the model and do not modify the weights. They only express the same computation in a form that is
more export- and conversion-friendly.

### Export to ONNX

The export step starts from the official RF-DETR model, applies the runtime compatibility patch, and writes the ONNX model:

```python
from rfdetr import RFDETRNano
from rvc4_class_injection import apply_rvc4_class_injection

model = RFDETRNano()
apply_rvc4_class_injection(model, require=True, verbose=True)

model.export(
    output_dir="onnx_model",
    shape=(384, 384),
    batch_size=1,
    dynamic_batch=False,
    opset_version=17,
    format="onnx",
)
```

### Create the NN Archive

The generated archive should describe the `input` tensor, the `dets` and `labels` outputs, and an `RFDETRParser` head so runtime
parsing can be configured automatically:

```python
from luxonis_ml.nn_archive.archive_generator import ArchiveGenerator

generator = ArchiveGenerator(
    archive_name="rfdetr-nano-onnx",
    save_path=".",
    cfg_dict=cfg_dict,
    executables_paths=["onnx_model/rfdetr-nano.onnx"],
)

archive_path = generator.make_archive()
```

For a full example of the archive configuration, including preprocessing values and COCO class metadata, refer to the tutorial
notebook.

### Convert the NN Archive to RVC4

Once the ONNX NNArchive is ready, convert it with HubAI:

```python
import os
from hubai_sdk import HubAIClient

client = HubAIClient(api_key=os.environ["HUBAI_API_KEY"])

response = client.convert.RVC4(
    path="rfdetr-nano-onnx.tar.xz",
    name="rfdetr-nano-rvc4",
    quantization_mode="FP16_STANDARD",
    tool_version="2.41.0",
    output_dir=".",
)

print(response.downloaded_path)
```

This produces the deployable RVC4 archive, typically named `rfdetr-nano-onnx.rvc4.tar.xz`.

### Small inference example

Because the [NN Archive](https://docs.luxonis.com/software-v3/ai-inference/nn-archive.md) already declares `RFDETRParser`,
[`ParsingNeuralNetwork`](https://docs.luxonis.com/software-v3/ai-inference/inference/depthai-nodes.md) can wire the parser
automatically:

```python
from pathlib import Path

import depthai as dai
from depthai_nodes.node.parsing_neural_network import ParsingNeuralNetwork

MODEL_PATH = Path("rfdetr-nano-onnx.rvc4.tar.xz")
nn_archive = dai.NNArchive(str(MODEL_PATH))
visualizer = dai.RemoteConnection(httpPort=8082)

with dai.Pipeline() as pipeline:
    cam = pipeline.create(dai.node.Camera).build(sensorFps=10)
    cam_out = cam.requestOutput(
        size=(384, 384),
        fps=10,
        type=dai.ImgFrame.Type.BGR888i,
    )

    nn_with_parser = pipeline.create(ParsingNeuralNetwork).build(
        input=cam_out,
        nn_source=nn_archive,
    )

    visualizer.addTopic(topicName="rgb", output=nn_with_parser.passthrough)
    visualizer.addTopic(topicName="detections", output=nn_with_parser.out)

    pipeline.start()
    visualizer.registerPipeline(pipeline)

    while pipeline.isRunning():
        pipeline.processTasks()
```

For a complete script with device checks, model validation, and a ready-to-use local visualizer flow, use the tutorial linked
above.