-
Notifications
You must be signed in to change notification settings - Fork 470
Issues: allenai/OLMo
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Fail to load tokenizer for checkpoints
type/bug
An issue about a bug
#741
opened Oct 24, 2024 by
tresiwald
Error Encountered During Multi-Node Pretraining with Torchrun
type/bug
An issue about a bug
#737
opened Oct 21, 2024 by
Zehui127
8-bit allgather support
type/question
An issue that's a question
#722
opened Sep 19, 2024 by
yaroslavvb
Which mmlu validation setting is recommend?
type/question
An issue that's a question
#714
opened Aug 27, 2024 by
mathfinder
[Quick question]: How do I turn off FSDP?
type/question
An issue that's a question
#703
opened Aug 15, 2024 by
candygocandy
RuntimeError: Triton Error [CUDA]: invalid device context
type/bug
An issue about a bug
#700
opened Aug 13, 2024 by
andymvp2018
slurm script for: configs/official/OLMo-7B.yaml
type/question
An issue that's a question
#699
opened Aug 13, 2024 by
andymvp2018
Number of tokens Olmo-1B was trained: 2T or 3T?
type/question
An issue that's a question
#697
opened Aug 9, 2024 by
jiyeonkimd
Gflops computation is faulty for FSDP due to bug in
OLMo.num_params()
#695
opened Aug 7, 2024 by
AkshitaB
why CrossEntropyLoss is zero,i
type/question
An issue that's a question
#692
opened Aug 6, 2024 by
aizhweiwei
Olmo 0724 An issue about a bug
-hf
checkpoints don't load the proper config when instantiating with OLMoForCausalLM
type/bug
#689
opened Aug 5, 2024 by
sarahwie
Model ladder has no documentation
type/documentation
An issue or pull request related to documentation
#683
opened Jul 31, 2024 by
IanMagnusson
mlp_ratio not adjusted in config if mlp_hidden_size is set
type/bug
An issue about a bug
#673
opened Jul 21, 2024 by
Muennighoff
Does global_train_batch_size support gradient accumulation?
type/question
An issue that's a question
#672
opened Jul 21, 2024 by
jinzhuoran
Is there explicitly instruction-following data in the version of Dolma used to train v1?
type/question
An issue that's a question
#658
opened Jul 15, 2024 by
john-hewitt
Can long text be splitted into short texts?
type/question
An issue that's a question
#655
opened Jul 12, 2024 by
CoinCheung
Cannot convert internal OLMo checkpoint to HF
type/bug
An issue about a bug
#654
opened Jul 11, 2024 by
viking-sudo-rm
start_index not getting reset in data loader when moving to new epoch
type/bug
An issue about a bug
#650
opened Jul 10, 2024 by
leon-g-xu
Issue with tokenizer wrapper
type/question
An issue that's a question
#644
opened Jul 8, 2024 by
davidbrandfonbrener
What did OLMo 1B converge to?
type/question
An issue that's a question
#642
opened Jul 4, 2024 by
sidereior
Resuming training on unsharded checkpoint
type/bug
An issue about a bug
#641
opened Jul 4, 2024 by
lecifire
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.