Skip to content

Commit

Permalink
addressing Anna Shor's comment
Browse files Browse the repository at this point in the history
Signed-off-by: Lifu Zhang <[email protected]>
  • Loading branch information
tomlifu committed Dec 12, 2024
1 parent 097266a commit 9c526ad
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion scripts/nlp_language_modeling/prepare_packed_ft_dataset.py
Original file line number Diff line number Diff line change
Expand Up @@ -55,7 +55,7 @@
python scripts/nlp_language_modeling/prepare_packed_ft_dataset.py \
model.data.train_ds.file_names=[/path/to/training.jsonl] \
model.data.train_ds.max_seq_length=4096 \
+model.context_parallel_size=2 \
++model.context_parallel_size=2 \
+tokenizer_path=<see note 1 below> \
+output_dir=/path/to/output_folder \
+pack_sizes=[4096]
Expand Down

0 comments on commit 9c526ad

Please sign in to comment.