VisDrone Detection
Dataset
| Dataset Name | Training Images | Validation Images | Class Labels | License |
|---|---|---|---|---|
| VisDrone 2019 dataset | 6,471 | 548 | 10 classes | CC BY-NC-SA 3.0 |
An example of each class is given from left to right.
| Class ID | Label | Description |
|---|---|---|
| 0 | Pedestrian | A person who is standing or walking. |
| 1 | People | A person who is not standing or walking. |
| 2 | Bicycle | A bicycle. |
| 3 | Car | A car around the size of a sedan. |
| 4 | Van | A van. |
| 5 | Truck | A vehicle with an open load bed. |
| 6 | Tricycle | A three-wheeled vehicle, pedal- or motor-operated. |
| 7 | Awning Tricycle | A three-wheeled vehicle with a roof or awning. |
| 8 | Bus | A bus. |
| 9 | Motor | A moped or motorcycle. |
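
When post-processing detections, it can help to keep the class table in code. The following is a minimal sketch that copies the IDs and labels from the table above; the names `VISDRONE_LABELS` and `count_by_label` are hypothetical and not part of any DeGirum or Ultralytics API, and the input is assumed to be a plain list of integer class IDs.

```python
from collections import Counter

# Class IDs and labels copied from the class table above.
VISDRONE_LABELS = {
    0: "Pedestrian",
    1: "People",
    2: "Bicycle",
    3: "Car",
    4: "Van",
    5: "Truck",
    6: "Tricycle",
    7: "Awning Tricycle",
    8: "Bus",
    9: "Motor",
}


def count_by_label(class_ids):
    """Count detections per label (hypothetical helper).

    Raises KeyError for any ID outside 0-9.
    """
    return Counter(VISDRONE_LABELS[i] for i in class_ids)


# Example: class IDs as they might come out of a detector for one frame.
counts = count_by_label([3, 3, 0, 9, 3])
print(counts.most_common())
```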

Model Zoo
DeGirum's VisDrone detection model zoo consists of a variety of models gathered and trained using the Ultralytics repository.
Here is the link to DeGirum's VisDrone detection zoo.
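
The table below trades accuracy against throughput, so a common question is which model gives the best mAP 50-95 at a required frame rate. The sketch below shows one way to answer it, using only the ORCA rows that report an FPS value in the table. The names `ZooEntry`, `ORCA_ROWS`, and `best_model` are hypothetical helpers for illustration, not part of any DeGirum API.

```python
from typing import NamedTuple, Optional


class ZooEntry(NamedTuple):
    model: str
    input_size: str
    map_50_95: float
    fps: float


# ORCA (N2X, INT8) rows copied from the model zoo table.
ORCA_ROWS = [
    ZooEntry("yolov8n_relu6_visdrone", "640x384", 0.1430, 154.6),
    ZooEntry("yolov8n_relu6_visdrone", "960x544", 0.1963, 86.6),
    ZooEntry("yolov8n_relu6_visdrone", "1280x736", 0.2328, 58.6),
    ZooEntry("yolov8s_relu6_visdrone", "640x384", 0.1874, 56.8),
    ZooEntry("yolov8s_relu6_visdrone", "960x544", 0.2490, 28.3),
    ZooEntry("yolov8s_relu6_visdrone", "1280x736", 0.2863, 17.8),
    ZooEntry("yolov8m_relu6_visdrone", "640x384", 0.2111, 24.5),
    ZooEntry("yolov8m_relu6_visdrone", "960x544", 0.2812, 2.1),
    ZooEntry("yolov8m_relu6_visdrone", "1280x736", 0.3048, 0.7),
    ZooEntry("yolov8l_relu6_visdrone", "640x384", 0.2260, 13.0),
]


def best_model(min_fps: float) -> Optional[ZooEntry]:
    """Return the highest-mAP entry that meets min_fps, or None if none does."""
    candidates = [row for row in ORCA_ROWS if row.fps >= min_fps]
    return max(candidates, key=lambda row: row.map_50_95, default=None)


# Compare the pick at two hypothetical frame-rate targets.
for target_fps in (30.0, 15.0):
    print(target_fps, best_model(target_fps))
```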
| Model Architecture | Input Size | Precision | Runtime | Device Type | mAP 50-95 | mAP 50 | mAP 50-95 small | mAP 50-95 medium | mAP 50-95 large | FPS |
|---|---|---|---|---|---|---|---|---|---|---|
| yolov8n_relu6_visdrone | 640x384 | INT8 | N2X | ORCA | 0.143 | 0.2521 | 0.3226 | 0.2287 | 0.3226 | 154.6 |
| yolov8n_relu6_visdrone | 640x384 | INT8 | TFLITE | EDGETPU | 0.1427 | 0.2516 | 0.3262 | 0.2279 | 0.3261 | -- |
| yolov8n_relu6_visdrone | 640x384 | INT8 | OPENVINO | CPU | 0.1442 | 0.2531 | 0.3255 | 0.2284 | 0.3255 | -- |
| yolov8n_relu6_visdrone | 640x384 | FP32 | OPENVINO | CPU | 0.1454 | 0.2552 | 0.3312 | 0.2295 | 0.3312 | -- |
| yolov8n_relu6_visdrone | 960x544 | INT8 | N2X | ORCA | 0.1963 | 0.3349 | 0.3575 | 0.2925 | 0.3575 | 86.6 |
| yolov8n_relu6_visdrone | 960x544 | INT8 | TFLITE | EDGETPU | 0.1968 | 0.3363 | 0.3554 | 0.2935 | 0.3553 | -- |
| yolov8n_relu6_visdrone | 960x544 | INT8 | OPENVINO | CPU | 0.2015 | 0.3388 | 0.3606 | 0.3018 | 0.3606 | -- |
| yolov8n_relu6_visdrone | 960x544 | FP32 | OPENVINO | CPU | 0.2027 | 0.3405 | 0.3606 | 0.3036 | 0.3606 | -- |
| yolov8n_relu6_visdrone | 1280x736 | INT8 | N2X | ORCA | 0.2328 | 0.3948 | 0.3787 | 0.3294 | 0.3786 | 58.6 |
| yolov8n_relu6_visdrone | 1280x736 | INT8 | TFLITE | EDGETPU | 0.2331 | 0.395 | 0.3771 | 0.3306 | 0.3770 | -- |
| yolov8n_relu6_visdrone | 1280x736 | INT8 | OPENVINO | CPU | 0.2393 | 0.3981 | 0.3838 | 0.3424 | 0.3838 | -- |
| yolov8n_relu6_visdrone | 1280x736 | FP32 | OPENVINO | CPU | 0.2406 | 0.3999 | 0.3884 | 0.3425 | 0.3883 | -- |
| yolov8s_relu6_visdrone | 640x384 | INT8 | N2X | ORCA | 0.1874 | 0.3193 | 0.3938 | 0.2912 | 0.3938 | 56.8 |
| yolov8s_relu6_visdrone | 640x384 | INT8 | TFLITE | EDGETPU | 0.1874 | 0.3187 | 0.3884 | 0.2918 | 0.3883 | -- |
| yolov8s_relu6_visdrone | 640x384 | INT8 | OPENVINO | CPU | 0.1903 | 0.3219 | 0.3847 | 0.2955 | 0.3846 | -- |
| yolov8s_relu6_visdrone | 640x384 | FP32 | OPENVINO | CPU | 0.1904 | 0.3223 | 0.3889 | 0.2950 | 0.3888 | -- |
| yolov8s_relu6_visdrone | 960x544 | INT8 | N2X | ORCA | 0.249 | 0.417 | 0.4409 | 0.3647 | 0.4408 | 28.3 |
| yolov8s_relu6_visdrone | 960x544 | INT8 | TFLITE | EDGETPU | 0.2496 | 0.4172 | 0.4402 | 0.3658 | 0.4401 | -- |
| yolov8s_relu6_visdrone | 960x544 | INT8 | OPENVINO | CPU | 0.2508 | 0.4188 | 0.4371 | 0.3683 | 0.4371 | -- |
| yolov8s_relu6_visdrone | 960x544 | FP32 | OPENVINO | CPU | 0.2515 | 0.4188 | 0.4394 | 0.3692 | 0.4393 | -- |
| yolov8s_relu6_visdrone | 1280x736 | INT8 | N2X | ORCA | 0.2863 | 0.4735 | 0.4572 | 0.3946 | 0.4572 | 17.8 |
| yolov8s_relu6_visdrone | 1280x736 | INT8 | TFLITE | EDGETPU | 0.2867 | 0.4737 | 0.457 | 0.3942 | 0.4569 | -- |
| yolov8s_relu6_visdrone | 1280x736 | INT8 | OPENVINO | CPU | 0.293 | 0.4765 | 0.4639 | 0.4071 | 0.4638 | -- |
| yolov8s_relu6_visdrone | 1280x736 | FP32 | OPENVINO | CPU | 0.2942 | 0.4776 | 0.4696 | 0.4076 | 0.4696 | -- |
| yolov8m_relu6_visdrone | 640x384 | INT8 | N2X | ORCA | 0.2111 | 0.3557 | 0.4453 | 0.3249 | 0.4453 | 24.5 |
| yolov8m_relu6_visdrone | 640x384 | INT8 | OPENVINO | CPU | 0.2156 | 0.3595 | 0.4415 | 0.3316 | 0.4415 | -- |
| yolov8m_relu6_visdrone | 640x384 | FP32 | OPENVINO | CPU | 0.2155 | 0.3589 | 0.4483 | 0.3310 | 0.4483 | -- |
| yolov8m_relu6_visdrone | 960x544 | INT8 | N2X | ORCA | 0.2812 | 0.4602 | 0.4731 | 0.4006 | 0.4730 | 2.1 |
| yolov8m_relu6_visdrone | 960x544 | INT8 | OPENVINO | CPU | 0.2864 | 0.4634 | 0.4813 | 0.4091 | 0.4812 | -- |
| yolov8m_relu6_visdrone | 960x544 | FP32 | OPENVINO | CPU | 0.2876 | 0.4639 | 0.4877 | 0.4095 | 0.4877 | -- |
| yolov8m_relu6_visdrone | 1280x736 | INT8 | N2X | ORCA | 0.3048 | 0.5078 | 0.4639 | 0.4141 | 0.4639 | 0.7 |
| yolov8m_relu6_visdrone | 1280x736 | INT8 | OPENVINO | CPU | 0.3212 | 0.518 | 0.4662 | 0.4383 | 0.4662 | -- |
| yolov8m_relu6_visdrone | 1280x736 | FP32 | OPENVINO | CPU | 0.3219 | 0.5189 | 0.4681 | 0.4393 | 0.4680 | -- |
| yolov8l_relu6_visdrone | 640x384 | INT8 | N2X | ORCA | 0.226 | 0.3775 | 0.4664 | 0.3475 | 0.4663 | 13.0 |
| yolov8l_relu6_visdrone | 640x384 | INT8 | OPENVINO | CPU | 0.232 | 0.3824 | 0.4812 | 0.3556 | 0.4811 | -- |