You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
ClashLuke opened this issue
Apr 30, 2022
· 0 comments
Labels
coreImproves core model while keeping core idea intactMLRequires machine-learning knowledge (can be built up on the fly)researchCreative project that might fail but could give high returns
Like WideNet proposed, we could combine a MoE-architecture with weight sharing. Incorporating a WideNet-style architecture should increase performance, decrease training time, and reduce the number of parameters needed.
This issue is about implementing such a weight-sharing protocol and benchmarking its performance.
The text was updated successfully, but these errors were encountered:
ClashLuke
added
research
Creative project that might fail but could give high returns
engineering
Software-engineering problems that don't require ML-Expertise
ML
Requires machine-learning knowledge (can be built up on the fly)
and removed
engineering
Software-engineering problems that don't require ML-Expertise
labels
Apr 30, 2022
coreImproves core model while keeping core idea intactMLRequires machine-learning knowledge (can be built up on the fly)researchCreative project that might fail but could give high returns
Like WideNet proposed, we could combine a MoE-architecture with weight sharing. Incorporating a WideNet-style architecture should increase performance, decrease training time, and reduce the number of parameters needed.
This issue is about implementing such a weight-sharing protocol and benchmarking its performance.
The text was updated successfully, but these errors were encountered: