- Nice! I really like how many variations on this idea are coming out. MacWhisper used to be great, but is kinda of a buggy mess now.
I'm making my own, for personal use. I did a survey of many and they all (that I could find) skip the fundamentals.
The major issues that I've run into:
- Crash recovery. Most of these apps are incredibly buggy and crash all the time, taking the recorded audio with them. Macwhisper is incredibly bad at this.
- Disk space. Many of these apps save wav files to disk. After a few hours of meetings, you may end up with gigabytes eaten.
- Microphone bleed. People don't always use headphones, the system mic will pick up the speaker sounds, causing duplicate (approximately) transcriptions.
I've yet to find a solution that handles all these correctly, let alone having high quality transcriptions.
Anyway, most of these apps are built around https://github.com/FluidInference/FluidAudio, if anyone is curious. Their readme has a big list of similar apps as well.
- The App looks really interesting and I’d love to try it out. How well does it work in other languages than English? For me, German would be important.
Due to audio quality, transcription sometimes produces garbled output or understands something wrong. FluidVoice offers the option to use a LLM to „interpret“ the text to rescue garbled audio through context. Do you also plan to support something like this? This would be a great feature!
- I'd love to have a purchase option not tied to the App Store if possible. I don't use an Apple account with my Mac, but I would love to try Trace.
by watchlight
1 subcomments
- Agreed with JohnBiz, the moment flagging is interesting and unusual, and a nice contrast to passive transcription. I only recently learned about MacWhisper (I'm Windows primarily) and was floored to learn how expensive the Pro option is. Nowadays it's not so hard to have some-level of DIY transcription, so crazy that it's priced with a premium.
What's your diarization pipeline? Pyannote?
I'd taken a different approach that used a LLM clean-up pass to summarize and progressively compress the transcript for ultra-long content, but I like the idea of targeted "pay attention here" flags.
by tillcarlos
1 subcomments
- Had the same idea, but have to focus on my main business. This comes at the right time!
I just purchased it. What's the best way to give you feedback? (Do you want any?)
From the top of my head:
- will the mic switch automatically when I am at my office? Or do I have to change settings every time? Maybe a preference of what's available + auto switch would be good.
- I personally don't need the hot key. Menu bar icon would be fine.
- Download the model is a long process. Put it into the installer, not into the bar on the bottom
- Speaker correction would be amazing. If it could "Learn" the speakers based on voice.
- Overall neat app. Good animations and UX
**Speaker 1** [00:00] What if I fell to the floor?
**Microphone** [00:02] Yes, this is Phil, I'm just speaking, this should be my voice, and there's music in the
**Speaker 1** [00:05] Couldn't tell this anymore
- I've tried so many of these and paid for a lot of them and I still can't find what I want. It sounds like this is closer than most:
- record and separate two sides of the conversation
- save meetings in a simple transcription format in a local folder
- connect with my calendar (Outlook, Google Calendar) and name meeting transcripts accordingly
- for recurring meetings, append rather than create a new transcript
- let me label speaker voices and recognise those voices across different meetings
A tool that did all this and then ALSO built a knowledge base to let me RAG query my meetings would be the holy grail for me.
- The key moments feat is neat. Been working on a free opensource offline transcriber that runs fast on CPU and does diarization too
https://github.com/kouhxp/yapsnap
by addozhang
1 subcomments
- This is an excellent product and exactly what I've been looking for. But most of my meetings are done on my company Mac, and they definitely won't let me install this kind of software, even though I'd be willing to pay for it myself.
by robertkarl
3 subcomments
- This looks sick. I was going to download it but for $10 I am more willing to attempt asking Claude to implement something like it, than to purchase.
I would be more willing to purchase if it was open source and I could build from source to try it first.
by littlecranky67
0 subcomment
- Would love to use this app, I recently thought about coding something similar myself. I would need to only record my own voice due to privacy laws (here in Germany, you can record yourself without consent). With overthe-ear headset, the microphone only captures my voice. Would need to store the original audio plus the transcription. Ideally, you can configure it to start as soon as it detects a new window with a given title (i.e. Webex launches meetings in a new window named "Meeting ....").
- Which Speech-to-Text is used? Is it possible to configure it? This might be crucial for supporting languages other than English - the model that comes built-in with macOS fails completely for German.
- This looks like a good approach, though I would expect this to be a native macOs feature within 12 months -- this seems totally like it fits into their product roadmap.
by usernametaken29
0 subcomment
- I don’t have this particular use case right now but if anything it feels like LLMs and their distilled on prem models are starting to kill SaaS simply because it becomes more and more tenable to build a “complete software” in a short time frame. That’s freaking awesome. Good idea and love the return of the good old you buy, you own it mentality
- I tried this but it took over my mic and people couldnt hear me on teams call until I turned it off. Nice idea but needs to share the mic w teams to be useful for me. Not sure if it’s teams fault or trace fault but either way…
- It works well so far, but what's up with the weird non-standard menubar menu? It's very odd, and it doesn't respect system light/dark mode preferences.
by nightpool
2 subcomments
- Is this legal to use in 2 party consent states? Might vary from state to state, which is probably why both zoom and Meet require users to click through consent screens when meetings are bring transcribed. Might be useful to have that on the FAQ page
- I've just bought it based on the description.
Minor complaint is that it steals Cmd-Shift-P (Firefox Private Browsing shortcut) by default.
Easy to change in the UI though, so no big deal.
by Myrmornis
1 subcomments
- I will be happy to spend £10 on this. One feature question though -- does it continue transcribing the meeting even if I've turned my volume down / muted it?
- Really nice, I will buy now and test out. How did you make the video on your website? And are you based in UK?
by transitKnox
1 subcomments
- Would it be hard to add a summarization feature? Or put an executive overview at the top of the transcript?
by yilugurlu
1 subcomments
- Looks awesome, I just bought. I'd love to see Office/Teams Calendar integration.
Good luck!
- Super interesting! How accurate is the local model to transcribe audio compared to other cloud services? E.g. Google Meet, Otter, Granola, etc.
- looks great! and good that you decided on the one-time fee instead of a subscription. side question: what did you use to create these videos/gifs on the homepage that shows the app? They look really good!
- Those transcription times are fast fast. What model/library do you use?
- I thought it was really interesting
- I've been looking for this exact thing!
- Will this be available for iOS?
- Does it support multiple languages?
by triyambakam
0 subcomment
- My stack has been QuickTime and Assembly AI lol
by Zhite_Panther
0 subcomment
- [flagged]
- [flagged]
- [flagged]
- [dead]
by _onecookie
0 subcomment
- [dead]
by beefmumbai
0 subcomment
- [dead]
by JohnBizBiz
1 subcomments
- [flagged]
by satvikpendem
2 subcomments
- I don't see how this is different to literally the dozens of other offline transcription apps, many open source even unlike this one.