ClashLuke opened this issue on May 17, 2022 · 0 comments
Labels: core (Improves core model while keeping core idea intact), ML (Requires machine-learning knowledge (can be built up on the fly)), research (Creative project that might fail but could give high returns)
Prior work has suggested that adding noise to the gradients helps deep models converge and generalise. Other works, such as DDPG, showed that the same holds even for shallow networks in a different domain. It could therefore be worth exploring gradient noise as a way to improve generalisation, and with it convergence, by reducing overfitting and helping training escape poor local minima.
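For concreteness, below is a minimal sketch of what injecting gradient noise could look like in a PyTorch-style training loop. The annealed schedule std² = eta / (1 + step)^gamma and the default values for `eta` and `gamma` are illustrative assumptions, not settings from this repository.

```python
import torch


def add_gradient_noise(parameters, step, eta=0.01, gamma=0.55):
    """Add annealed zero-mean Gaussian noise to each parameter's gradient in place.

    The noise variance eta / (1 + step) ** gamma shrinks as training progresses,
    so early steps are perturbed more strongly than late ones (assumed schedule).
    """
    std = (eta / (1 + step) ** gamma) ** 0.5
    for p in parameters:
        if p.grad is not None:
            p.grad.add_(torch.randn_like(p.grad) * std)


# Usage in a training loop, after backward() and before the optimiser step:
#     loss.backward()
#     add_gradient_noise(model.parameters(), step)
#     optimizer.step()
```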
One option to take gradient noise further would be to combine it with #35 by adding different noise for each optimiser. This would allow combinations like Adam#Adam, where each optimiser sees slightly different noise at every step; a sketch of that idea follows below.
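The following is a hypothetical illustration of that combination. It assumes #35 runs several optimiser instances over the same parameters (the chaining mechanism itself is not shown); the only point demonstrated is that each optimiser can be given its own noise sample on top of a shared base gradient. All names and defaults here are placeholders.

```python
import torch


def step_with_per_optimizer_noise(optimizers, parameters, step, eta=0.01, gamma=0.55):
    """Step each optimiser with the same base gradient but independent noise."""
    std = (eta / (1 + step) ** gamma) ** 0.5
    # Snapshot the unperturbed gradients once, so every optimiser starts from them.
    base_grads = [p.grad.clone() if p.grad is not None else None for p in parameters]
    for opt in optimizers:
        for p, g in zip(parameters, base_grads):
            if g is None:
                continue
            # Fresh noise per optimiser; a per-optimiser torch.Generator could be
            # used instead for reproducible, distinct noise streams.
            p.grad.copy_(g + torch.randn_like(g) * std)
        opt.step()


# Usage with an Adam#Adam-style pair (parameters must be a list, not a generator):
#     params = list(model.parameters())
#     opts = [torch.optim.Adam(params), torch.optim.Adam(params)]
#     loss.backward()
#     step_with_per_optimizer_noise(opts, params, step)
```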
This issue tracks the progress of such a scheme.
ClashLuke added the research, ML, and core labels on May 17, 2022.