Fine-tuned fast_base detection model is picking up noise #1732
Replies: 3 comments 2 replies
-
Hi @hanshupe007 👋, If the only problem are such super small detections i would suggest to play a bit around with the binarization threshold and box threshold values before continue any training. The default value for A sec option would be to plug a hook into the pipeline where you could filter such boxes by it's area before passing it to the Could you describe a bit more how you trained the model (fine tuned with Best, |
Beta Was this translation helpful? Give feedback.
-
Thanks that's useful. Can the binarization threshold only be set for inference, or also during training (in train_pytorch.py), so that it impacts the loss? What's the difference between box and binary threshold? I fine-tuned it with a few thousand samples and --pretrained, also tried freeze-backbone and from scratch, but was slightly worse. Hyperparameter search was done by hand, increasing batch size up to 16 made it usually worse, tried a wide range of learning rates, and epochs, but couldn't get rid of the tiny boxes. As mentioned once before, I don't see the tiny false positive boxes reflected in any metric (including the loss), training runs with false positive or false negative boxes result in better metrics than the ones without (but slightly misaligned boxes), so I do currently a visual validation. |
Beta Was this translation helpful? Give feedback.
-
You can but both are parameters for the post-processing so it would show only an impact on recall, precision and miou Could you share the logs from your "best" run ? I say several times that Adam as optimizer seems to be a bit to "aggresive" in some cases maybe RMSProp would be an alternative and if the loss starts to stuck after ~3 epochs i would try to switch to a StepLR-Scheduler or do a test with a constant lr without a scheduler |
Beta Was this translation helpful? Give feedback.
-
After fine-tuning the fast_base detection model to overcome some minor detection issues, it starts to detect very tiny pieces of noise, like dots as text, typically within another box. I already tried already to tune the synthetic data or tune hyper-parameters, but can't really solve the problem. Any suggestions I can try?
Beta Was this translation helpful? Give feedback.
All reactions