Roboflow
Overview
Usage
Dataset Selection
- Annotation type must match your prediction target: For object detection models, the dataset must include bounding box annotations. For instance segmentation, annotations must also include per-object masks in addition to bounding boxes. And similarly then for other type of tasks.
- Dataset quality matters: Ensure that annotations are accurate, the image content is diverse, and the dataset is sufficiently large. We generally recommend a minimum of 500–1000 images for achieving robust performance - but more is better.
- Image domain should match your deployment environment: The training images should resemble what your model will see in production. For example, if your factory only uses blue PCBs but the dataset contains green ones, it may not generalize well. You can try using color augmentation to help with this domain shift or seach for a dataset that is more suitable for your deployment.
Training
dataset_dir under the params section of loader.We use a special URI format for Roboflow datasets:Markdown
1roboflow://<TEAM_NAME>/<DATASET_NAME>/<DATASET_VERSION>/cocoMarkdown
1https://universe.roboflow.com/learn-uzoux/pcb-defect-0i1a7dataset_dir becomes:Markdown
1roboflow://learn-uzoux/pcb-defect-0i1a7/2/cocoIf the dataset is private to your Roboflow team, be sure to export your Roboflow API key before starting the training process:
export ROBOFLOW_API_KEY="<ADD_YOUR_ROBOFLOW_API_KEY_HERE>"