Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Does tensor_parallel support multi-node tensor parallel training? #84

Open
liguodongiot opened this issue Jun 7, 2023 · 6 comments
Open

Comments

@liguodongiot
Copy link

No description provided.

@liguodongiot liguodongiot changed the title Does tensor_parallel support multi-node training? Does tensor_parallel support multi-node tensor parallel training? Jun 7, 2023
@zhangjunyi111
Copy link

I want to konw too.

@longday1102
Copy link

@BlackSamorez Hope you can answer this question 😄😄

@longday1102
Copy link

@BlackSamorez I have 2 servers with a total of 16 GPUs, so I would love to be able to use multi-nodes tensor-parallel to train a large language model, for example Bloom 176B. So I hope you can answer how to use multi-nodes tensor-parallel.
Thank you very much

@PieterZanders
Copy link

Is this solved? if so, how?

@deema-A
Copy link

deema-A commented Aug 14, 2024

same question.

@Tezcan98
Copy link

Tezcan98 commented Oct 8, 2024

ahaha everybody have same problem but I think there is no feature like this but we absolutely need it.
Recently I tried DeepSpeed which is developing from microsoft, maybe it has but Microsoft's code doesn't suppport Windows 😄

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants