How to use LADE in single-node multi-process way? #57

sjrrr13 · 2024-04-15T02:56:16Z

I've tried to load LADE distributively with

CUDA_VISIBLE_DEVICES=0,1,2,3 \
USE_LADE=1 LOAD_LADE=1 DIST_WORKERS=4 \
python -m torch.distributed.launch minimal.py

However, when I try to monitor GPU usage with watch nvidia-smi, I've found that only gpu:0 was used. I want to use Llama-2-70b-hf and it can't be loaded in only one GPU. What can I do to use all the GPUs? Is there any problem in my launch command?

The text was updated successfully, but these errors were encountered:

Viol2000 · 2024-05-29T07:31:37Z

minimal.py does not support single-node- multi-process
Please check applications/run_mtbench.sh for examples, thank you!

Viol2000 · 2024-05-29T07:34:17Z

Maybe minimal.py can also support. Please set torch_device="auto" in the code and not changing DIST_WORKERS=4 and just use python minimal.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to use LADE in single-node multi-process way? #57

How to use LADE in single-node multi-process way? #57

sjrrr13 commented Apr 15, 2024

Viol2000 commented May 29, 2024

Viol2000 commented May 29, 2024

How to use LADE in single-node multi-process way? #57

How to use LADE in single-node multi-process way? #57

Comments

sjrrr13 commented Apr 15, 2024

Viol2000 commented May 29, 2024

Viol2000 commented May 29, 2024