Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to train the tinymodel(Like 300M or 150M) #759

Open
yongding-tao opened this issue Dec 3, 2024 · 2 comments
Open

How to train the tinymodel(Like 300M or 150M) #759

yongding-tao opened this issue Dec 3, 2024 · 2 comments
Labels
type/question An issue that's a question

Comments

@yongding-tao
Copy link

❓ The question

Hello,

Thank you for this project. I encountered an issue while trying to train a TinyModel for my research. The data paths in the config files under ./configs/tiny/ point to S3 locations, for example:
s3://ai2-llm/preprocessed/olmo-mix/v1_6-decontaminated/books/gpt-neox-olmo-dolma-v1_5/part-0-00000.npy.

I tried replacing s3:// with https:// based on the previous "Reproducibility" section mentioned in the README (which seems to have been removed), but this approach did not work.

Could you clarify the correct way to access these files or provide updated data paths or instructions? Furthermore, can you give a guidance on how to train a TinyModel using Olmo?

Thank you for your help!

@yongding-tao yongding-tao added the type/question An issue that's a question label Dec 3, 2024
@SimonSuster
Copy link

You should replace s3://ai2-llm/ with https://olmo-data.org/.

So, your example would be: https://olmo-data.org/preprocessed/olmo-mix/v1_6-decontaminated/books/gpt-neox-olmo-dolma-v1_5/part-0-00000.npy

@yongding-tao
Copy link
Author

@SimonSuster Thank you for your advice. It actually helps me solve the problem.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type/question An issue that's a question
Projects
None yet
Development

No branches or pull requests

2 participants