Label smoothing in training #261
Conversation
Codecov Report: All modified and coverable lines are covered by tests ✅

@@ Coverage Diff @@ (dev → #261)

|          | dev    | #261   | +/-    |
|----------|--------|--------|--------|
| Coverage | 89.43% | 89.47% | +0.03% |
| Files    | 12     | 12     |        |
| Lines    | 909    | 912    | +3     |
| Hits     | 813    | 816    | +3     |
| Misses   | 96     | 96     |        |

View full report in Codecov by Sentry.
This looks good to me aside from needing a unit test.
This looks good to me!
After encountering NaN outputs from the model halfway through training in a few runs, I experimented with minimal label smoothing in the training loss as a mitigation strategy. I was able to redo the same training runs with the same setup for significantly longer without encountering NaNs, and with performance metrics similar to the original runs.
Smoothing only affects the loss calculation during training steps, not validation, and I tentatively added the minimal label smoothing factor as the default option.
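For reference, a minimal sketch of cross-entropy with label smoothing (this is an illustrative formulation where the off-target mass is spread as ε/(K−1) over the other classes; it is not necessarily the exact implementation in this PR, and the `smoothing=0.01` default is an assumption):

```python
import math

def smoothed_cross_entropy(logits, target, smoothing=0.01):
    """Cross-entropy where the target distribution puts (1 - smoothing)
    on the true class and smoothing / (K - 1) on each other class.
    With smoothing=0.0 this reduces to standard cross-entropy."""
    k = len(logits)
    # Log-softmax computed stably by subtracting the max logit.
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    log_probs = [x - log_z for x in logits]
    off = smoothing / (k - 1) if k > 1 else 0.0
    return -sum(
        ((1.0 - smoothing) if i == target else off) * lp
        for i, lp in enumerate(log_probs)
    )

# Training step uses the smoothed loss; validation uses smoothing=0.0.
train_loss = smoothed_cross_entropy([2.0, 0.0, -1.0], target=0, smoothing=0.01)
val_loss = smoothed_cross_entropy([2.0, 0.0, -1.0], target=0, smoothing=0.0)
```

Because the smoothed target never assigns probability 1 to a single class, the loss stays bounded away from zero and the gradients are less extreme for very confident predictions, which is a plausible reason it mitigates the NaN issue.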