Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
enables default data step in megatron parallel to operate on a wider …
…variety of tensors - second try (NVIDIA#9671) * enables default data step in megatron parallel to operate on a wider variety of tensors coming out of the dataloader Signed-off-by: Jonathan Mitchell <[email protected]> * handles the case where a batch is empty Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> Signed-off-by: Jonathan Mitchell <[email protected]> * Allows the default data step to operate on more types than just dictionaries Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> --------- Signed-off-by: Jonathan Mitchell <[email protected]> Signed-off-by: jomitchellnv <[email protected]> Co-authored-by: jomitchellnv <[email protected]> Co-authored-by: John St. John <[email protected]>
- Loading branch information