Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pretraining the nucleotide-transformer #81

Open
mpage21 opened this issue Nov 19, 2024 · 2 comments
Open

pretraining the nucleotide-transformer #81

mpage21 opened this issue Nov 19, 2024 · 2 comments

Comments

@mpage21
Copy link

mpage21 commented Nov 19, 2024

Hi, I would like to pre-train the model myself to gain a better understanding of machine learning models.

Specifically, could you provide the code that was used to pre-train the v2 500m multi-species model?

I am new to this so any and all help is appreciated.

thank you!

@mpage21
Copy link
Author

mpage21 commented Nov 21, 2024

Okay, I think I might have found what I'm looking for. It looks like the code in the nucleotide_transformer directory is what was used to build the models (maybe I just need to tweak it a bit?) and then I think the configurations are found here: https://huggingface.co/InstaDeepAI/nucleotide-transformer-v2-500m-multi-species/tree/main

Is this correct?

@mpage21
Copy link
Author

mpage21 commented Nov 21, 2024

Actually, I'm not sure I see a training script. Is it named something else?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant