
Distributed Training with PyTorch on Multi-GPU Setup #5

Open
ballin0105 opened this issue Dec 25, 2023 · 3 comments

Comments

@ballin0105

Hi there,

I've been following your work with great interest and appreciate all the effort you've put into it. I encountered an issue when running your code on a remote server with 8 V100 GPUs. After switching the launcher to PyTorch, I ran into an "address already in use" error that seems to prevent multi-GPU utilization, restricting training to a single GPU.

Is there any chance that a distributed training update compatible with PyTorch might be on the horizon? It would greatly benefit those of us working with similar hardware configurations.

Thanks for your continued contributions to the field!

@Jingkang50
Collaborator

Thank you for your interest in our work! The distributed training framework is based on MMDet, so you may be able to find a solution in the MMDet repo or community. If you still cannot solve it, please let me know.
Thanks!
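
For reference, the "address already in use" error in PyTorch distributed training typically means the rendezvous port (29500 by default) is already occupied on the node, e.g. by another job or a leftover process. Below is a minimal sketch of initializing the process group on an explicitly chosen free port, assuming a torchrun / `torch.distributed.launch` style launcher that exports `RANK`, `WORLD_SIZE`, and `LOCAL_RANK`; this is an illustration, not this repository's actual code:

```python
# Minimal sketch, not this repo's code: bind the rendezvous to a port that is
# known to be free instead of the default 29500.
import os

import torch
import torch.distributed as dist


def init_distributed(port: int = 29501, backend: str = "nccl") -> int:
    """Initialize one process of a single-node multi-GPU job.

    Assumes the launcher exports RANK, WORLD_SIZE and LOCAL_RANK for each
    spawned process (torchrun does this).
    """
    rank = int(os.environ["RANK"])
    world_size = int(os.environ["WORLD_SIZE"])
    local_rank = int(os.environ["LOCAL_RANK"])

    torch.cuda.set_device(local_rank)
    dist.init_process_group(
        backend=backend,
        # Explicit TCP rendezvous on a free port; 29501 is just an example.
        init_method=f"tcp://127.0.0.1:{port}",
        rank=rank,
        world_size=world_size,
    )
    return local_rank
```

If the training scripts here follow MMDetection's `tools/dist_train.sh` convention, it may be enough to pick a different port at launch time (e.g. `PORT=29501` before the launch command), or to pass `--master_port` to `torch.distributed.launch`.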

@ballin0105
Author

Thank you so much for your helpful response! :)

@leaozhun

Hello, have you resolved the issue with PyTorch distributed training?
