1. One build error that should be fixed

When installing Apex, the build fails with four errors about "conversion from unsigned long to long". You need to edit apex_22.01_pp/csrc/mlp.cpp:

(1) Line 65:
auto reserved_space = at::empty({reserved_size}, inputs[0].type());
change to:
auto reserved_space = at::empty({static_cast<long>(reserved_size)}, inputs[0].type());

(2) Line 138:
auto work_space = at::empty({work_size / sizeof(scalar_t)}, inputs[0].type());
change to:
auto work_space = at::empty({static_cast<long>(work_size / sizeof(scalar_t))}, inputs[0].type());

Alternatively, change the compile options so that this conversion is not treated as an error.
2. An improvement that reduces CUDA memory usage

When launching owl_demo.py on a GPU with 16 GB of memory, I ran into a CUDA out-of-memory error. I edited lines 33 and 34 in interface.py:

model = model.to(device)
model = model.to(dtype)

change to:

model = model.to(dtype)
model = model.to(device)

After the demo starts, memory usage is about 14 GB, and it runs well on a 16 GB GPU.
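The ordering matters because calling .to(dtype) while the model is still in CPU RAM means only the smaller, already-converted weights are ever copied to the GPU; in the original order, the full-precision weights land on the GPU first. A back-of-envelope sketch with illustrative numbers (the parameter count and dtypes are assumptions, not measurements of owl_demo.py):

```python
# Why dtype-first lowers peak GPU memory: compare the bytes that must
# reside on the GPU under each ordering of .to(device) and .to(dtype).
PARAMS = 7_000_000_000          # assumed parameter count, for illustration only
BYTES_FP32, BYTES_FP16 = 4, 2   # bytes per parameter in float32 vs float16

# Order A: model.to(device) then model.to(dtype)
# fp32 weights are transferred first, then converted on the GPU.
peak_a_gib = PARAMS * BYTES_FP32 / 2**30

# Order B: model.to(dtype) then model.to(device)
# the cast happens in CPU RAM, so only fp16 weights ever touch the GPU.
peak_b_gib = PARAMS * BYTES_FP16 / 2**30

print(f"device-first peak: ~{peak_a_gib:.0f} GiB; dtype-first peak: ~{peak_b_gib:.0f} GiB")
```

Under these assumed numbers, dtype-first stays under a 16 GiB budget while device-first does not, matching the behavior reported above.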