
GPU memory requirement for inference #4

Open
caoandong opened this issue Jul 29, 2024 · 4 comments
@caoandong

Hi, thank you so much for open sourcing this amazing work!

I'm wondering what the memory requirement is to run the inference script. I tested the script verbatim on an A100 40G machine and it went OOM. Curious if we need to use an 80G machine instead, or is there something obvious that I'm missing?

Thanks!

@JC1DA

JC1DA commented Jul 29, 2024

It requires more than 40 GB for 2 seconds of 720p video in my early experiments; a 3-second video needs ~71 GB of VRAM without upscaling (up_scale = 1).

Another question: can we use something like FlashAttention to reduce VRAM usage?

@hejingwenhejingwen
Collaborator

> I'm wondering what the memory requirement is to run the inference script. I tested the script verbatim on an A100 40G machine and it went OOM. Curious if we need to use an 80G machine instead, or is there something obvious that I'm missing?

At this time, an A100 80G machine is required for high-resolution (~2K) and high-frame-rate (fps >= 24) video generation. You can lower `up_scale` or `target_fps` to avoid OOM, but the visual results will show an obvious drop.
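
For reference, here is a sketch of dialing those two knobs down. Only the parameter names `up_scale` and `target_fps` come from this thread; the script name and the remaining flags are assumptions for illustration, so check the repo's actual CLI:

```python
# Hypothetical invocation -- only up_scale and target_fps are named in
# this thread; the entry point and the other flags are placeholders.
import subprocess

subprocess.run(
    [
        "python", "enhance_a_video.py",   # assumed entry point
        "--input_path", "inputs/demo.mp4",
        "--up_scale", "2",      # lower upscale factor -> smaller output resolution
        "--target_fps", "16",   # fps < 24 -> fewer frames to denoise at once
    ],
    check=True,
)
```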

@hejingwenhejingwen
Collaborator

> It requires more than 40 GB for 2 seconds of 720p video in my early experiments; a 3-second video needs ~71 GB of VRAM without upscaling (up_scale = 1).
>
> Another question: can we use something like FlashAttention to reduce VRAM usage?

You are correct, this algorithm is expensive. We have already incorporated xFormers for the attention computation.
You can reduce the number of sampling steps to achieve faster inference, but performance will undoubtedly drop. We will design more efficient sampling strategies in the future.
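
For anyone curious what the xFormers path looks like, here is a minimal, self-contained sketch of the memory-efficient attention op. The tensor shapes are made up for illustration; this shows the general pattern, not VEnhancer's actual code:

```python
# Minimal sketch of xFormers memory-efficient attention (illustrative
# shapes, not this repo's actual code). Requires a CUDA GPU.
import torch
import xformers.ops as xops

# (batch, sequence length, heads, head dim) -- arbitrary example sizes
q = torch.randn(1, 4096, 8, 64, device="cuda", dtype=torch.float16)
k = torch.randn(1, 4096, 8, 64, device="cuda", dtype=torch.float16)
v = torch.randn(1, 4096, 8, 64, device="cuda", dtype=torch.float16)

# Computes softmax(QK^T / sqrt(d)) V in tiles, without materializing the
# full (seq x seq) attention matrix -- that is where the VRAM saving comes from.
out = xops.memory_efficient_attention(q, k, v)
print(out.shape)  # torch.Size([1, 4096, 8, 64])
```

FlashAttention achieves a similar saving through the same idea (tiled attention that never stores the full attention matrix), so swapping one for the other mostly changes speed rather than the peak-memory picture.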

@O-O1024

O-O1024 commented Jul 29, 2024

How long did inference take for the 2-second 720p video? @JC1DA
