That sounds nice. I’ve been looking at serenade.ai and thought about extending their STT with an option to use another third-party STT engine. I would then like to extend their command engine with LLM command recognition. In my experience, maybe also with my pronunciation as a non-english speaker, their STT and command recognition really doesn’t work that well.
That sounds nice. I’ve been looking at serenade.ai and thought about extending their STT with an option to use another third-party STT engine. I would then like to extend their command engine with LLM command recognition. In my experience, maybe also with my pronunciation as a non-english speaker, their STT and command recognition really doesn’t work that well.
Have you tried Whisper from OpenAI? It’s the best I’ve ever seen. I’m curious how it would handle accents.
No, not yet. But thanks for the tip!