allenai / OLMo Public

Notifications You must be signed in to change notification settings
Fork 502
Star 4.9k

Code
Issues 54
Pull requests 54
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Issues: allenai/OLMo

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

54 Open 152 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

tokenizer.encode function`s param add_special_tokens=False not work. type/bug

An issue about a bug

#765 opened Dec 12, 2024 by xiaohan2909

How to inspect training data in a particular batch? type/question

An issue that's a question

#763 opened Dec 9, 2024 by explanare

Difference Between DDP and FSDP Modes type/question

An issue that's a question

#762 opened Dec 6, 2024 by lllabmaster

How to train the tinymodel(Like 300M or 150M) type/question

An issue that's a question

#759 opened Dec 3, 2024 by yongding-tao

Question about the OLMo2 Stage 2 training procedures: was the optimizer state from Stage 1 used during the training of Stage 2? type/question

An issue that's a question

#758 opened Nov 29, 2024 by Taoer1996

About eos_token_id in config file (20M, 1B) type/question

An issue that's a question

#757 opened Nov 29, 2024 by lllabmaster

Difference between 0724 and 0424 7B models type/documentation

An issue or pull request related to documentation

#746 opened Nov 13, 2024 by jiahai-feng

TypeError - running example code type/bug

An issue about a bug

#743 opened Nov 3, 2024 by KPK101

Fail to load tokenizer for checkpoints type/bug

An issue about a bug

#741 opened Oct 24, 2024 by tresiwald

Error Encountered During Multi-Node Pretraining with Torchrun type/bug

An issue about a bug

#737 opened Oct 21, 2024 by Zehui127

Missing OLMo checkpoints

#726 opened Oct 3, 2024 by mirandrom

8-bit allgather support type/question

An issue that's a question

#722 opened Sep 19, 2024 by yaroslavvb

Expected Data Format type/question

An issue that's a question

#715 opened Aug 27, 2024 by aflah02

Which mmlu validation setting is recommend? type/question

An issue that's a question

#714 opened Aug 27, 2024 by mathfinder

[Quick question]: How do I turn off FSDP? type/question

An issue that's a question

#703 opened Aug 15, 2024 by candygocandy

RuntimeError: Triton Error [CUDA]: invalid device context type/bug

An issue about a bug

#700 opened Aug 13, 2024 by andymvp2018

slurm script for: configs/official/OLMo-7B.yaml type/question

An issue that's a question

#699 opened Aug 13, 2024 by andymvp2018

Gflops computation is faulty for FSDP due to bug in OLMo.num_params()

#695 opened Aug 7, 2024 by AkshitaB

why CrossEntropyLoss is zero,i type/question

An issue that's a question

#692 opened Aug 6, 2024 by aizhweiwei

Olmo 0724 -hf checkpoints don't load the proper config when instantiating with OLMoForCausalLM type/bug

An issue about a bug

#689 opened Aug 5, 2024 by sarahwie

Model ladder has no documentation type/documentation

An issue or pull request related to documentation

#683 opened Jul 31, 2024 by IanMagnusson

mlp_ratio not adjusted in config if mlp_hidden_size is set type/bug

An issue about a bug

#673 opened Jul 21, 2024 by Muennighoff

Does global_train_batch_size support gradient accumulation? type/question

An issue that's a question

#672 opened Jul 21, 2024 by jinzhuoran

Is there explicitly instruction-following data in the version of Dolma used to train v1? type/question

An issue that's a question

#658 opened Jul 15, 2024 by john-hewitt

Can long text be splitted into short texts? type/question

An issue that's a question

#655 opened Jul 12, 2024 by CoinCheung

Previous 1 2 3 Next

Previous Next

ProTip! no:milestone will show everything without a milestone.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly