
Performance on Streaming Example #68

Open
ehossai2 opened this issue Jan 4, 2024 · 7 comments
Labels
enhancement New feature or request

Comments


ehossai2 commented Jan 4, 2024

You did great work, thanks for this. I want to share one experience with the streaming version of the system. The transcription quality seems poor; I think it is due to not using VAD. I might be wrong, but just sharing.

I was running it on CPU, do you think that could be the reason?

I tried the following models:

  1. ggml-tiny: fast, but transcription quality is really poor.
  2. ggml-small.en: fast enough, though quality is still bad.
  3. ggml-base and ggml-medium: really slow (expected, since I am running on CPU), but also not very accurate. That is why I felt using VAD might improve the results.

thanks.


Macoron commented Jan 5, 2024

If I understand you right, by performance you mean transcription quality.

> I want to share one experience with the Streaming version of the system. The performance seems poor, I think it is due not using VAD.

Yes, Whisper tends to severely hallucinate on silent segments, especially when prompted by the previous transcription. VAD helps skip silent segments and improves overall quality. I highly recommend using streaming with VAD.

Try to enable VAD and see if it improves transcription quality.
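To illustrate why VAD helps: the idea is to gate out silent chunks before they ever reach Whisper, so the model is never asked to transcribe silence. A minimal sketch of an energy-based gate in Python (this is purely illustrative — function names and thresholds are hypothetical, not whisper.unity's actual VAD implementation):

```python
import numpy as np

def is_speech(frame, threshold=0.01):
    # Root-mean-square energy of a mono float frame with samples in [-1, 1].
    # Frames whose energy exceeds the threshold are assumed to contain speech.
    rms = np.sqrt(np.mean(np.square(frame, dtype=np.float64)))
    return rms > threshold

def drop_silence(audio, frame_len=1600, threshold=0.01):
    # Split into 100 ms frames (at 16 kHz) and keep only the voiced ones,
    # so silent stretches never reach the transcription model.
    frames = [audio[i:i + frame_len] for i in range(0, len(audio), frame_len)]
    voiced = [f for f in frames if is_speech(f, threshold)]
    return np.concatenate(voiced) if voiced else np.empty(0, dtype=audio.dtype)
```

Real VAD implementations (e.g. Silero, WebRTC VAD) are more robust than a plain energy gate, but the effect on Whisper is the same: fewer silent windows, fewer hallucinated segments.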

@Macoron Macoron added the enhancement New feature or request label Jan 5, 2024
i-s-t-e-m-i commented Jul 14, 2024

> I was running it on CPU, do you think that could be the reason?

> as I am running it on CPU, that is why it is slow, but it is also not so accurate.

A noob question: How do you choose between using the CPU or the GPU? I mean is there a flag for this somewhere? Is the "Enable CUDA" option in the Project Settings what enables/disables the use of the GPU?


Macoron commented Jul 14, 2024

> > I was running it on CPU, do you think that could be the reason?

> > as I am running it on CPU, that is why it is slow, but it is also not so accurate.

> A noob question: How do you choose between using the CPU or the GPU? I mean is there a flag for this somewhere? Is the "Enable CUDA" option in the Project Settings what enables/disables the use of the GPU?

Yes, there is a flag in Project Settings. Check the readme for more info.

@i-s-t-e-m-i

Thank you Macoron, I’ve already checked the readme and tried the “Enable CUDA” option. I suppose this is the flag you mentioned. 

So I’ll assume checking “Enable CUDA” makes it run on the GPU, while unchecking makes it run on the CPU. 

When I run with CUDA enabled, though, the app crashes at the inference stage. Do you think this might be due to my GTX 960M?


Macoron commented Jul 16, 2024

> Thank you Macoron, I’ve already checked the readme and tried the “Enable CUDA” option. I suppose this is the flag you mentioned.

> So I’ll assume checking “Enable CUDA” makes it run on the GPU, while unchecking makes it run on the CPU.

> When I run with CUDA enabled though, the app crashes at the inference stage. Do you think this might be due to my GTX960M?

Hard to say for sure. What version of the CUDA Toolkit do you have installed? Do the original whisper.cpp builds work on your PC?

@i-s-t-e-m-i

I have CUDA Toolkit version 12.2:

    Cuda compilation tools, release 12.2, V12.2.91
    Build cuda_12.2.r12.2

It is not exactly "12.2.0", but this is what gets installed through the link for 12.2.0.

I tried the original whisper.cpp build now. It works OK with the default settings, but when I set GGML_CUDA to 1, I get errors.

The error is "A single input file is required for a non-link phase when an output file is specified".

This error has been discussed in one of the issues, and removing a few lines from CMakeLists.txt was suggested. However, I don't see those lines in my CMakeLists.txt file. There are several of these CMakeLists.txt files in different folders, though.
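For reference, a CUDA-enabled build of a recent whisper.cpp checkout is typically configured like this (a sketch, assuming the current CMake-based build; older releases used the `WHISPER_CUBLAS` option instead of `GGML_CUDA`):

```shell
# Configure whisper.cpp with CUDA support, then build in Release mode.
cmake -B build -DGGML_CUDA=1
cmake --build build --config Release -j
```

If configuration succeeds but compilation of the CUDA sources fails, the installed CUDA Toolkit version and the GPU's compute capability are the usual suspects.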


Macoron commented Jul 17, 2024

> I tried the original whisper.cpp build now. It works OK with the default settings, but when I change the GGML_CUDA to 1, I get errors.
> The errors are "A single input file is required for a non-link phase when an outputfile is specified".

I see. Unfortunately, I have no idea what might be wrong. Maybe the GPU is indeed just too old and doesn't support instructions needed by whisper.cpp.
