-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Context Parallel SFT Support for dataset in THD format #10688
base: main
Are you sure you want to change the base?
Conversation
nemo/collections/nlp/models/language_modeling/megatron_gpt_model.py
Outdated
Show resolved
Hide resolved
nemo/collections/nlp/data/language_modeling/megatron/gpt_sft_dataset.py
Outdated
Show resolved
Hide resolved
nemo/collections/nlp/models/language_modeling/megatron_gpt_model.py
Outdated
Show resolved
Hide resolved
nemo/collections/nlp/data/language_modeling/megatron/gpt_sft_dataset.py
Outdated
Show resolved
Hide resolved
Signed-off-by: Lifu Zhang <[email protected]>
… thd_cp_support
Signed-off-by: Lifu Zhang <[email protected]>
Signed-off-by: Lifu Zhang <[email protected]>
Signed-off-by: tomlifu <[email protected]>
nemo/collections/nlp/models/language_modeling/megatron_gpt_model.py
Fixed
Show resolved
Hide resolved
nemo/collections/nlp/models/language_modeling/megatron_gpt_model.py
Fixed
Show resolved
Hide resolved
nemo/collections/nlp/data/language_modeling/megatron/gpt_sft_dataset.py
Outdated
Show resolved
Hide resolved
nemo/collections/nlp/data/language_modeling/megatron/gpt_sft_dataset.py
Outdated
Show resolved
Hide resolved
Signed-off-by: Lifu Zhang <[email protected]>
Signed-off-by: Lifu Zhang <[email protected]>
Please fix DCO also. |
Signed-off-by: Lifu Zhang <[email protected]>
0339a8f
to
90feb4a
Compare
Signed-off-by: tomlifu <[email protected]>
Signed-off-by: Lifu Zhang <[email protected]>
Signed-off-by: Lifu Zhang <[email protected]>
Signed-off-by: tomlifu <[email protected]>
Signed-off-by: Lifu Zhang <[email protected]>
… thd_cp_support
Signed-off-by: Lifu Zhang <[email protected]>
Signed-off-by: Lifu Zhang <[email protected]>
Signed-off-by: tomlifu <[email protected]>
6e8ac3b
to
463a478
Compare
Could you let me know when it will be completed? I’ve been really looking forward to this feature. It works in pretrain, but it’s really strange that it doesn’t work in SFT. |
Signed-off-by: root <[email protected]>
Signed-off-by: Lifu Zhang <[email protected]>
Signed-off-by: tomlifu <[email protected]>
Signed-off-by: Lifu Zhang <[email protected]>
beep boop 🤖: 🙏 The following files have warnings. In case you are familiar with these, please try helping us to improve the code base. Your code was analyzed with PyLint. The following annotations have been identified:
Thank you for improving NeMo's documentation! |
What does this PR do ?
This PR adds CP support for THD format and is compatible with cu_seqlen_padded in the latest CUDNN fused attention.
Steps to run SFT + CP + THD format:
scripts/nlp_language_modeling/prepare_packed_ft_dataset.py
to pack the dataset into THD format in desired sequence length. For example:PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.