Hacker News with Generative AI: Voice Recognition

Ichigo: Local real-time voice AI (github.com/homebrewltd)
🍓 Ichigo is an open, ongoing research experiment to extend a text-based LLM to have native "listening" ability. Think of it as an open data, open weight, on device Siri.
Show HN: Open source framework OpenAI uses for Advanced Voice (github.com/livekit)
The Agents framework allows you to build AI-driven server programs that can see, hear, and speak in realtime.
OpenAI Announces Realtime Voice API (twitter.com)
Show HN: Wispr Flow – A new voice dictation experience (ycombinator.com)
Hey HN! We're Tanay and Sahaj from the Wispr Flow team (https://flowvoice.ai). Today, we're officially launching Flow: a Mac dictation app that lets you speak naturally and writes in your style, in every application — with auto-edits, command mode, and over 100 languages.
OpenAI rolls out Advanced Voice Mode with more voices and a new look (techcrunch.com)
OpenAI announced it is rolling out Advanced Voice Mode (AVM) to an expanded set of ChatGPT’s paying customers on Tuesday. The audio feature, which makes ChatGPT more natural to speak with, will initially roll out to customers in ChatGPT’s Plus and Teams tiers. Enterprise and Edu customers will start receiving access next week.
Show HN: Bot or Not? AI voices vs. humans (unison.fm)
Bot or Not?
Show HN: Turn voice messages to beautiful journals (voicejournal.live)
Kyutai unveils today the first voice-enabled AI openly accessible to all [pdf] (kyutai.org)
200ms Voice LLM (github.com/fixie-ai)