Hacker News with Generative AI: Voice Recognition

Noise cancellation improves turn-taking for AI Voice Agents (krisp.ai)
AI Voice Agents are rapidly evolving, powering critical use-cases such as customer support automation, virtual assistants, gaming, and remote collaboration platforms. For these voice-driven interactions to feel natural and practical, the underlying audio pipeline must be resilient to noise, responsive, and accurate—especially in real-time scenarios.
AI-Generated Voice Evidence Poses Dangers in Court (lawfaremedia.org)
In the age of AI, listener authentication of voice evidence should be permissive, not mandatory.
Microsoft Launches Dragon Copilot for Healthcare (microsoft.com)
REDMOND, Wash. — March 3, 2025 — On Monday, Microsoft Corp. is unveiling Microsoft Dragon Copilot, the first AI assistant for clinical workflow that brings together the trusted natural language voice dictation capabilities of DMO with the ambient listening capabilities of DAX, fine-tuned generative AI and healthcare-adapted safeguards.
OpenAI announces Advanced Voice with Vision [video] (youtube.com)
Aqua Voice (YC W24) for Desktop (ycombinator.com)
Hey, this is Finn from Aqua Voice. Today we're releasing Aqua Voice for Desktop (https://withaqua.com). Aqua is a voice-driven text editor that lets you dictate using natural language commands.
Ichigo: Local real-time voice AI (github.com/homebrewltd)
🍓 Ichigo is an open, ongoing research experiment to extend a text-based LLM to have native "listening" ability. Think of it as an open-data, open-weight, on-device Siri.
Show HN: Open source framework OpenAI uses for Advanced Voice (github.com/livekit)
The Agents framework allows you to build AI-driven server programs that can see, hear, and speak in realtime.
OpenAI Announces Realtime Voice API (twitter.com)
Show HN: Wispr Flow – A new voice dictation experience (ycombinator.com)
Hey HN! We're Tanay and Sahaj from the Wispr Flow team (https://flowvoice.ai). Today, we're officially launching Flow: a Mac dictation app that lets you speak naturally and writes in your style, in every application — with auto-edits, command mode, and over 100 languages.
OpenAI rolls out Advanced Voice Mode with more voices and a new look (techcrunch.com)
OpenAI announced on Tuesday that it is rolling out Advanced Voice Mode (AVM) to an expanded set of ChatGPT’s paying customers. The audio feature, which makes ChatGPT more natural to speak with, will initially roll out to customers in ChatGPT’s Plus and Team tiers. Enterprise and Edu customers will start receiving access next week.
Show HN: Bot or Not? AI voices vs. humans (unison.fm)
Show HN: Turn voice messages to beautiful journals (voicejournal.live)
Kyutai unveils today the first voice-enabled AI openly accessible to all [pdf] (kyutai.org)
200ms Voice LLM (github.com/fixie-ai)