The sample training script was made to train object detection models on PASCAL VOC 2012.
Ensure that you have holocron installed
git clone https://github.com/frgfm/Holocron.git
pip install -e "Holocron/.[training]"
No need to download the dataset, torchvision will handle this for you! From there, you can run your training with the following command
python train.py VOC2012 --arch yolov2 --lr 1e-5 -b 32 -j 16 --epochs 20 --opt radam --sched onecycle
Performances are evaluated on the validation set of the dataset. Since the mAP does not allow easy interpretation by humans, the performance metrics have been changed here.
A prediction is considered as correct if it checks two criteria:
- Localization: it is the best acceptable localization candidate (highest IoU among predictions with the GT, and IoU >= 0.5)
- Classification: the top predicted probabilities is for the class label of the matched ground truth object.
Then we define:
- Localization error rate: with loc_recall being the matching rate of ground truth boxes, and loc_precision being the matching rate of predicted boxes, we define the localization error as 1 - (harmonic mean of localization loc_recall & loc_precision)
- Classification error rate: classification error rate of matched predictions.
- Detection error rate: with det_recall being the correctness rate of ground truth boxes, and det_precision being the correctness rate of predicted boxes, we define the localization error as 1 - (harmonic mean of localization det_recall & det_precision)
Here, the recall being the ratio of correctly predicted ground truth predictions by the total number of ground truth objects, and the precision being the ratio of correctly predicted ground truth predictions by the total number of predicted boxes.
Size (px) | Epochs | args | [email protected] | [email protected] | [email protected] | # Runs |
---|---|---|---|---|---|---|
416 | 40 | VOC2012 --arch yolov2 --img-size 416 --lr 5e-4 -b 64 -j 16 --epochs 40 --opt tadam --freeze-backbone --sched onecycle | 83.09 | 52.82 | 92.02 | 1 |
Model | [email protected] | [email protected] | [email protected] | Param # | MACs | Interpolation | Image size |
---|---|---|---|---|---|---|---|
yolov2 | 83.09 | 52.82 | 92.02 | 50.65M | bilinear | 416 |