Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to tune hyperparameters for custom datasets? #49

Open
krumo opened this issue Aug 11, 2023 · 1 comment
Open

How to tune hyperparameters for custom datasets? #49

krumo opened this issue Aug 11, 2023 · 1 comment

Comments

@krumo
Copy link

krumo commented Aug 11, 2023

Thanks for your excellent work! I am trying to run SIGMA and SIGMA++ on a new UDA benchmark for adapting Cityscapes to ACDC dataset. I find for different UDA tasks, you adopt different GA_DIS_LAMBDA, GRL_WEIGHT_{P3-P7}, MATCHING_LOSS_WEIGHT and BG_RATIO in configs. Could you share some insights about how to tune these hyperparameters for a new UDA task? Looking forward to your response!

@wymanCV
Copy link
Contributor

wymanCV commented Aug 11, 2023

I am so glad that you can pay attention to our work! Honestly, compared with your excellent work DAFRCNN, this FCOS-based (or EPM-based) framework has so many imperfections.

We find that GRL_WEIGHT_{P3-P7} are sensitive to the dataset. Except for City-to-Foggy, it seems that a relatively larger value is required for all the other settings. Otherwise, the discriminator may beat the backbone and give a small adv loss, as shown here. For example. in KITTI and Pascal settings, I tried GRL_WEIGHT_ in the range of 0.01 to 1.0 and found the results would make a huge difference, which also gave me a headache (adv almost does not work in KITTI with 0.01). The larger domain gap (especially for the style-based gap) and the imperfect discriminator design for dense detectors may be the reason.

For GA_DIS_LAMBDA , if the adv loss is normal but the performance is significantly lower than other detector baselines, it may need to enlarge GA_DIS_LAMBDA to improve the adv intensity.

For BG_RATIO, we set it as (num_cls+1), which will sample 1/(num_cls+1) background points as the negative node. The MATCHING_LOSS_WEIGHT is robust for different datasets, which can be set as default 0.1 for other datasets.

I am also looking forward to seeing new works that can address these imperfections since I like the FCOS simple architecture. However, it seems that the current trend in this field is turning to Mean-Teacher and DETR-based architecture. Haha~

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants