MacWhisper, native macOS app for Whisper #420
Replies: 49 comments 46 replies
-
Added a bunch more features such as editing and deleting segments, as well as language selection. Love this framework, thanks again! |
Beta Was this translation helpful? Give feedback.
-
Love it! Working well. Here are a few thoughts from some initial use:
Definitely a good start! Any way to see when updates are released? |
Beta Was this translation helpful? Give feedback.
-
This is a really good app! I remember last time I tried something similar and the AI with Japanese was a bit shitty… I need to review the extracted text but it looks really good (at least what I've saw on the first mins). I have two ideas that could be good to have:
About the first point For the 2nd point Also, I would like to ask why MacWhisper isn't on Github or Gitlab so all of us can collaborate too. |
Beta Was this translation helpful? Give feedback.
-
Love it! And works really well. |
Beta Was this translation helpful? Give feedback.
-
Nice job ! Looking further for live transcript… |
Beta Was this translation helpful? Give feedback.
-
would be nice if i could choose the language |
Beta Was this translation helpful? Give feedback.
-
Nice app. Please add support for *.ogg files (WhatsApp, Telegram voice). |
Beta Was this translation helpful? Give feedback.
-
Fantastic work. Question: I initially downloaded the non-pro version and chose only the english dataset. After testing I want to try the multi-language dataset, but I see no way in the app or website to go back and get that file. Pointer would be appreciated. |
Beta Was this translation helpful? Give feedback.
-
I purchased MacWhisper Pro and love the application. I primarily use this for podcasts and hour long recordings. I'd like to request a way to export a transcript and its accompanying audio or video and have it be interactive like click on three paragraphs down and there's where the media will jump to. Or A program that could do that as I'd happily pay for something that like that. Would also love to see macwhisper come to iOS as well. Gladly pay for this again just to have it on mobile as well. |
Beta Was this translation helpful? Give feedback.
-
Also, once I identify a speaker, it would be fantastic for the AI to then label all instances of that voice appropriately!! |
Beta Was this translation helpful? Give feedback.
-
Batch mode does not work in the Pro version (only reason why I bought it). |
Beta Was this translation helpful? Give feedback.
-
I used Mac Whisper Pro Medium to transcribe an interview - stereo file. I wish I could get MW to identify each track with a name. It's too much work to do that manually. I see I can add people but I didn't figure out how that worked. Would be nice if there were YouTube videos showing what it can do and how to do it. Maybe there are but I haven't found them yet. |
Beta Was this translation helpful? Give feedback.
-
I got Mac Whisper for my Macbook Air M1 Ventura and got the pro. It then gave me size options so I chose medium. Can I have Pro large? There was no option to do that with the Pro. I do podcasts with a different person each time and am in no hurry so the best I can get is what I want. |
Beta Was this translation helpful? Give feedback.
-
Just found a little bug: Looks like it only listens to the left channel of a stereo file. I kept getting only "[BLANK_AUDIO]" for an mp3 file that clearly had voices in it. Mystery was solved when I opened it in audacity and saw that the speaking was all in the right channel. |
Beta Was this translation helpful? Give feedback.
-
you should add a settings section to tweak num of cpus / threads weights etc. |
Beta Was this translation helpful? Give feedback.
-
Macwhisper version 6.11 - no longer lets me edit the lines in the transcript to assign speakers! I'm not a comp sci guy, I am just a simple user. If anyone can provide help with this, or can suggest something like a new procedure - please - let me know, ok? Without the ability to assign speakers - it's essentially lost 50% of its utility. yikes! |
Beta Was this translation helpful? Give feedback.
-
Anyone knows something similar for Windows? |
Beta Was this translation helpful? Give feedback.
-
This app is absolutely phenomenal, I've bought it 3 times now for myself and two friends. Is there any chance there's an equivalent for Android? I'm looking for something that allows at the very least a long, uninterrupted mic recording where I can just put the phone down and let it pick everything up and transcribe it later. I realize I could just do an audio recording and move the file and transcribe it via the MacWhisperer app later, but if I could do it all on-device that would be incredible. Thanks! |
Beta Was this translation helpful? Give feedback.
-
I am using a paid Pro version of the app. Is this thread the only community for the app? I see a feedback email address for the app but no community links. I really think the app would benefit from either a forum/discord. Even a github discussion is fine. |
Beta Was this translation helpful? Give feedback.
-
Now that I have marked speakers for the transcription, I see now output format which preserves this information. Exporting segments to pdf or html seems to just export the original timestamp based segments. Is there a way I can add this speaker information to audio segments anywhere? |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
Any suggestions for optimizing speed on an M2 Max MBP 96 GB? It'd be sweet if theres any benchmarks or you have any advice on models (distilled, turbo, normal?), audio pipeline and encoding/decoding format to use (assuming whisperkit is recommended), and effects of pipeline and encoding/decoding compute units on flash attention, greedy vs beam search, etc... I saw some notes on CoreML + GPU processing in discussions back in march but have not been following repo closely enough to know whether this has been implemented (at which point I assume whisperkit is no longer best option? although tbh idk what the difference between whisperkit and .cpp models are other than swift support). |
Beta Was this translation helpful? Give feedback.
-
Will ollama be supported as an alternative to openai and anthropic at one point ? |
Beta Was this translation helpful? Give feedback.
-
Hi, I've been using the new version of Macwhisper (Paid account) for a few weeks. Yesterday I went to launch it on my Macbook M2 and got a crash report. Here is a GTP summary: Any one run into this? Would appreciate any guidance. Thanks. From the crash report you provided, the key issue seems to be a segmentation fault (EXC_BAD_ACCESS) with SIGSEGV, which occurs when the program attempts to access memory that it shouldn't. In your case, it looks like the crash is happening in Thread 4, where a function in the MacWhisper app is attempting to read from an invalid address (far: 0x0000000000000004), which isn't mapped to any memory region. Key Indicators: Potential Causes:
Recommended Actions:
|
Beta Was this translation helpful? Give feedback.
-
Has anybody figured out any way to automate transcriptions with Mac Whisper? It appears to not have AppleScript or Shortcuts support, but I need to do weekly transcriptions and it would be great to be able to control Mac Whisper programatically so it can be part of an automatic workflow. |
Beta Was this translation helpful? Give feedback.
-
I absolutely love the dictation feature it is much better than the Apple dictation but at present the custom shortcut button doesn't work and I also noticed sometimes it makes a dictation noise but the microphone won't pop up and it does not work for instance right now so I have to use Apple dictation which is way less accurate. Any suggestions? |
Beta Was this translation helpful? Give feedback.
-
I absolutely love the app, and that it does the transcription fully offline. @jordibruin, thank you and to everyone who's contributed and help make this app happen. I get a lot of voice notes which are many times long, and end up feeding them through MacWhisper to transcribe. It would be nice to be able to control the translation feature... I've noticed however that with transcriptions that are bi-lingual (e.g. English-Arabic), some voice notes get transcribed without translation (i.e. English gets transcribed in English, and Arabic in Arabic), however in most cases MacWhisper will translate to one of the languages and only transcribe in that language. Is there any way to disable translation for a particular transcription, but keep the multi-language detections and transcriptions? Thanks! |
Beta Was this translation helpful? Give feedback.
-
First of all, a massive thanks to @ggerganov for making all this! Most of the low level stuff is voodoo to me, but I was able to get a native macOS app up and running thanks to all your hard work!
MacWhisper lets you run Whisper locally on your Mac without having to install anything else.
Features
MacWhisper is very basic right now, so please let me know if you run into anything. You can download it for free here:
http://goodsnooze.gumroad.com/l/macwhisper
Beta Was this translation helpful? Give feedback.
All reactions