Training param tuning
There are two data augmentation methods, each suited to a different application
https://github.com/eric612/MobileNet-YOLO/issues/29
For example, I would choose the adaptive aspect ratio method for fisheye video, which exhibits pixel-level geometric distortion
- Set the preprocessing resize mode to "FIT_LARGE_SIZE_AND_PAD"
- Remove all expand params
- Use "FIT_LARGE_SIZE_AND_PAD" resize at inference
This method may break the k-means anchor rule; in my test it affected accuracy by about 1%
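As a rough illustration of what an aspect-preserving resize with padding does, here is a minimal Python sketch. It assumes the mode scales the image until it fits inside the network input without changing its aspect ratio, then pads the leftover area evenly on both sides. The function name `fit_large_size_and_pad` and the centered padding are assumptions for illustration, not code taken from this repository.

```python
def fit_large_size_and_pad(img_w, img_h, net_w, net_h):
    """Return (new_w, new_h, pad_left, pad_top, pad_right, pad_bottom).

    Hypothetical helper: scales an img_w x img_h image to fit inside
    net_w x net_h while keeping its aspect ratio, then splits the
    remaining space into padding (assumed to be centered).
    """
    # Compare aspect ratios with integer cross-multiplication to avoid
    # floating-point rounding surprises.
    if img_w * net_h > net_w * img_h:
        # Image is relatively wider than the input: width fills the input.
        new_w = net_w
        new_h = net_w * img_h // img_w
    else:
        # Image is relatively taller (or equal): height fills the input.
        new_h = net_h
        new_w = net_h * img_w // img_h
    pad_left = (net_w - new_w) // 2
    pad_top = (net_h - new_h) // 2
    pad_right = net_w - new_w - pad_left
    pad_bottom = net_h - new_h - pad_top
    return new_w, new_h, pad_left, pad_top, pad_right, pad_bottom
```

For a 1280x720 frame and a 416x416 input, this sketch keeps the full width and pads the top and bottom, so objects keep their original proportions. That is the property that matters for distorted sources such as fisheye video.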
- Set the preprocessing resize mode to "WARP"
- Set the expand param to {VOC: 4.0, COCO: 1.5, ...}
- Use "WARP" resize at inference
- For advanced use, modify the jitter code: uncomment
caffe_rng_uniform(1, 1.0f - jitter, 1.0f, &img_h)
and comment out
img_h = img_w;
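The effect of that jitter change can be sketched in Python. This is a minimal illustration, assuming `img_w` and `img_h` are relative scale factors drawn uniformly from [1 - jitter, 1], as the `caffe_rng_uniform` call suggests. The function `sample_jitter_scale` and its `independent_height` flag are hypothetical names, not part of the repository.

```python
import random


def sample_jitter_scale(jitter, independent_height, rng=random):
    """Return (img_w, img_h) scale factors, each in [1 - jitter, 1]."""
    img_w = rng.uniform(1.0 - jitter, 1.0)
    if independent_height:
        # Mirrors enabling the caffe_rng_uniform(...) line for img_h:
        # width and height are scaled separately, so the aspect ratio varies.
        img_h = rng.uniform(1.0 - jitter, 1.0)
    else:
        # Mirrors "img_h = img_w;": both sides share one scale factor,
        # so the aspect ratio is preserved.
        img_h = img_w
    return img_w, img_h
```

With `independent_height=True` the crop's aspect ratio changes from sample to sample, which fits the "WARP" pipeline, where the image is stretched to the network input anyway.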
If the solver type is set to "SGD", you may need to set the learning rate policy like this
If using pre-trained weights from a
- Classification model (such as ImageNet): set the batch size to at least 64
- Detection model (such as MS-COCO)
total batch size = iter_size * batch_size
Set the batch size to at least 16 when training on "PASCAL-VOC"
Set the batch size to at least 32 when training on "MS-COCO"; 64 is recommended
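When GPU memory limits the per-iteration batch size, the relation above can be used to pick `iter_size`. The following is a small sketch of that arithmetic. The helper name `required_iter_size` is hypothetical, and it assumes `batch_size` is the per-iteration batch size from the data layer and `iter_size` is the solver's gradient accumulation count.

```python
def required_iter_size(target_total_batch, batch_size):
    """Smallest iter_size with iter_size * batch_size >= target_total_batch."""
    if batch_size <= 0 or target_total_batch <= 0:
        raise ValueError("batch sizes must be positive")
    # Integer ceiling division, avoiding floating-point rounding.
    return -(-target_total_batch // batch_size)
```

For example, reaching a total batch size of 64 with a batch size of 8 needs `required_iter_size(64, 8)`, which is 8 accumulation steps.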