-
Notifications
You must be signed in to change notification settings - Fork 502
Issues: allenai/OLMo
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
tokenizer.encode function`s param add_special_tokens=False not work.
type/bug
An issue about a bug
#765
opened Dec 12, 2024 by
xiaohan2909
How to inspect training data in a particular batch?
type/question
An issue that's a question
#763
opened Dec 9, 2024 by
explanare
Difference Between DDP and FSDP Modes
type/question
An issue that's a question
#762
opened Dec 6, 2024 by
lllabmaster
How to train the tinymodel(Like 300M or 150M)
type/question
An issue that's a question
#759
opened Dec 3, 2024 by
yongding-tao
Question about the OLMo2 Stage 2 training procedures: was the optimizer state from Stage 1 used during the training of Stage 2?
type/question
An issue that's a question
#758
opened Nov 29, 2024 by
Taoer1996
About eos_token_id in config file (20M, 1B)
type/question
An issue that's a question
#757
opened Nov 29, 2024 by
lllabmaster
Difference between 0724 and 0424 7B models
type/documentation
An issue or pull request related to documentation
#746
opened Nov 13, 2024 by
jiahai-feng
Fail to load tokenizer for checkpoints
type/bug
An issue about a bug
#741
opened Oct 24, 2024 by
tresiwald
Error Encountered During Multi-Node Pretraining with Torchrun
type/bug
An issue about a bug
#737
opened Oct 21, 2024 by
Zehui127
8-bit allgather support
type/question
An issue that's a question
#722
opened Sep 19, 2024 by
yaroslavvb
Which mmlu validation setting is recommend?
type/question
An issue that's a question
#714
opened Aug 27, 2024 by
mathfinder
[Quick question]: How do I turn off FSDP?
type/question
An issue that's a question
#703
opened Aug 15, 2024 by
candygocandy
RuntimeError: Triton Error [CUDA]: invalid device context
type/bug
An issue about a bug
#700
opened Aug 13, 2024 by
andymvp2018
slurm script for: configs/official/OLMo-7B.yaml
type/question
An issue that's a question
#699
opened Aug 13, 2024 by
andymvp2018
Gflops computation is faulty for FSDP due to bug in
OLMo.num_params()
#695
opened Aug 7, 2024 by
AkshitaB
why CrossEntropyLoss is zero,i
type/question
An issue that's a question
#692
opened Aug 6, 2024 by
aizhweiwei
Olmo 0724 An issue about a bug
-hf
checkpoints don't load the proper config when instantiating with OLMoForCausalLM
type/bug
#689
opened Aug 5, 2024 by
sarahwie
Model ladder has no documentation
type/documentation
An issue or pull request related to documentation
#683
opened Jul 31, 2024 by
IanMagnusson
mlp_ratio not adjusted in config if mlp_hidden_size is set
type/bug
An issue about a bug
#673
opened Jul 21, 2024 by
Muennighoff
Does global_train_batch_size support gradient accumulation?
type/question
An issue that's a question
#672
opened Jul 21, 2024 by
jinzhuoran
Is there explicitly instruction-following data in the version of Dolma used to train v1?
type/question
An issue that's a question
#658
opened Jul 15, 2024 by
john-hewitt
Can long text be splitted into short texts?
type/question
An issue that's a question
#655
opened Jul 12, 2024 by
CoinCheung
Previous Next
ProTip!
no:milestone will show everything without a milestone.