Image Classification #11
Labels
- downstream — Changes code wrapping the core model
- ML — Requires machine-learning knowledge (can be built up on the fly)
- research — Creative project that might fail but could give high returns
At the moment, we have a novel architecture that is very powerful in language modelling. However, we don't know whether it transfers to other domains as well as the transformer does. That's why it would be interesting to test its versatility by training it on ImageNet.
This issue is about implementing the input projection for image tokens (as in ViT), building the necessary data pipelines, and testing the model on this new modality.
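As a starting point, the ViT-style input projection could look like the sketch below: split each image into non-overlapping patches, flatten them, and apply a shared linear projection to get one token per patch. The function name, shapes, and dimensions here are illustrative assumptions, not part of the existing codebase.

```python
import numpy as np

def patch_embed(images, patch_size, weight, bias):
    """Project image patches to token embeddings (ViT-style sketch).

    images: (B, H, W, C) array; H and W must be divisible by patch_size.
    weight: (patch_size * patch_size * C, embed_dim) projection matrix.
    bias:   (embed_dim,) bias vector.
    Returns: (B, num_patches, embed_dim) token embeddings.
    """
    B, H, W, C = images.shape
    P = patch_size
    # Cut the image into a grid of P x P patches.
    patches = images.reshape(B, H // P, P, W // P, P, C)
    # Reorder so each patch's pixels are contiguous, then flatten each patch.
    patches = patches.transpose(0, 1, 3, 2, 4, 5)
    patches = patches.reshape(B, (H // P) * (W // P), P * P * C)
    # Shared linear projection: one embedding vector per patch.
    return patches @ weight + bias

# Example with assumed dimensions: 32x32 RGB images, 8x8 patches, width 64.
rng = np.random.default_rng(0)
images = rng.normal(size=(2, 32, 32, 3))
weight = rng.normal(size=(8 * 8 * 3, 64))
bias = np.zeros(64)
tokens = patch_embed(images, 8, weight, bias)
print(tokens.shape)  # (2, 16, 64): 16 patch tokens per image
```

In the actual model this projection would likely be a learned layer (e.g. a strided convolution, which is equivalent to this reshape-plus-matmul), followed by position embeddings before the tokens enter the architecture.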