Skip to content

Pull requests: microsoft/DeepSpeed

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Domino news update on readme.md
#6815 opened Dec 3, 2024 by GuanhuaWang Loading…
Ulyssess offload blog
#6814 opened Dec 2, 2024 by samadejacobs Loading…
add FPDT tutorial
#6813 opened Dec 2, 2024 by samadejacobs Loading…
Support pure meta model lm_head tp
#6812 opened Dec 2, 2024 by Yejing-Lai Loading…
Fix uneven head sequence parallelism bug (#6774)
#6797 opened Nov 27, 2024 by Eugene29 Loading…
Fix type error in ZeROOrderedDict
#6794 opened Nov 27, 2024 by oraluben Loading…
Fix zero checkpoint
#6792 opened Nov 26, 2024 by xu-song Loading…
Remove warnings from autodoc and sphinx
#6788 opened Nov 26, 2024 by loadams Loading…
Stage3: Use new torch grad accumulation hooks API
#6773 opened Nov 21, 2024 by deepcharm Loading…
Check transformers version in BLOOM for inference v1
#6766 opened Nov 19, 2024 by lekurile Loading…
BLOOM fixes for DS Legacy Inference
#6765 opened Nov 19, 2024 by lekurile Draft
Flops profiler support einops.einsum
#6755 opened Nov 17, 2024 by lvhoaa Loading…
Fix building on Windows with presence of Triton
#6749 opened Nov 14, 2024 by woct0rdho Loading…
Update flake8 version
#6740 opened Nov 11, 2024 by loadams Loading…
Update formatting workflow
#6738 opened Nov 11, 2024 by loadams Loading…
Merge LoCo with Zero++
#6730 opened Nov 8, 2024 by XingyuXie Loading…
Support latest transformers with DSChat
#6711 opened Nov 4, 2024 by loadams Loading…
ProTip! Follow long discussions with comments:>50.