Hacker News with Generative AI: Voice Recognition

Show HN: Real-time AI Voice Chat at ~500ms Latency (github.com/KoljaB)
Have a natural, spoken conversation with AI!

AI, Real-time Chat, Voice Recognition, Latency, Software

524 points by koljab 79 days ago | 227 comments

Noise cancellation improves turn-taking for AI Voice Agents (krisp.ai)
AI Voice Agents are rapidly evolving, powering critical use-cases such as customer support automation, virtual assistants, gaming, and remote collaboration platforms. For these voice-driven interactions to feel natural and practical, the underlying audio pipeline must be resilient to noise, responsive, and accurate—especially in real-time scenarios.

AI, Voice Recognition, Customer Service, Technology

113 points by davitb 120 days ago | 45 comments

AI-Generated Voice Evidence Poses Dangers in Court (lawfaremedia.org)
In the age of AI, listener authentication of voice evidence should be permissive, not mandatory.

Artificial Intelligence, Law, Voice Recognition

206 points by hn_acker 134 days ago | 163 comments

Microsoft Launches Dragon Copilot for Healthcare (microsoft.com)
REDMOND, Wash. — March 3, 2025 — On Monday, Microsoft Corp. is unveiling Microsoft Dragon Copilot, the first AI assistant for clinical workflow that brings together the trusted natural language voice dictation capabilities of DMO with the ambient listening capabilities of DAX, fine-tuned generative AI and healthcare-adapted safeguards.

Healthcare, Artificial Intelligence, Microsoft, Voice Recognition, Generative AI

8 points by stupeo 142 days ago | 3 comments

OpenAI announces Advanced Voice with Vision [video] (youtube.com)

OpenAI, Artificial Intelligence, Voice Recognition, Computer Vision

13 points by BoorishBears 223 days ago | 1 comments

Aqua Voice (YC W24) for Desktop (ycombinator.com)
Hey, this is Finn from Aqua Voice. Today we're releasing Aqua Voice for Desktop (https://withaqua.com). Aqua is a voice-driven text editor that lets you dictate using natural language commands.

Software, Text Editors, Artificial Intelligence, Voice Recognition

12 points by the_king 224 days ago | 9 comments

Ichigo: Local real-time voice AI (github.com/homebrewltd)
🍓 Ichigo is an open, ongoing research experiment to extend a text-based LLM to have native "listening" ability. Think of it as an open data, open weight, on device Siri.

Artificial Intelligence, Voice Recognition, Open Source, Research

217 points by egnehots 282 days ago | 40 comments

Show HN: Open source framework OpenAI uses for Advanced Voice (github.com/livekit)
The Agents framework allows you to build AI-driven server programs that can see, hear, and speak in realtime.

Open Source, AI, Voice Recognition, Realtime Communication

266 points by russ 292 days ago | 61 comments

OpenAI Announces Realtime Voice API (twitter.com)

OpenAI, Artificial Intelligence, Voice Recognition, APIs

16 points by swyx 295 days ago | 2 comments

Show HN: Wispr Flow – A new voice dictation experience (ycombinator.com)
Hey HN! We're Tanay and Sahaj from the Wispr Flow team (https://flowvoice.ai). Today, we're officially launching Flow: a Mac dictation app that lets you speak naturally and writes in your style, in every application — with auto-edits, command mode, and over 100 languages.

Software, Voice Recognition, Mac Apps, Productivity, Artificial Intelligence

10 points by tankots42 296 days ago | 4 comments

OpenAI rolls out Advanced Voice Mode with more voices and a new look (techcrunch.com)
OpenAI announced it is rolling out Advanced Voice Mode (AVM) to an expanded set of ChatGPT’s paying customers on Tuesday. The audio feature, which makes ChatGPT more natural to speak with, will initially roll out to customers in ChatGPT’s Plus and Teams tiers. Enterprise and Edu customers will start receiving access next week.

OpenAI, ChatGPT, Artificial Intelligence, Voice Recognition, User Experience

62 points by XavierShaw 301 days ago | 78 comments

Show HN: Bot or Not? AI voices vs. humans (unison.fm)
Bot or Not?

Artificial Intelligence, Voice Recognition, Music, Audio, Software

13 points by smock 308 days ago | 6 comments

Show HN: Turn voice messages to beautiful journals (voicejournal.live)

Show HN, Voice Recognition, Journaling, Software, Productivity

20 points by akshaynathr 376 days ago | 10 comments

Kyutai unveils today the first voice-enabled AI openly accessible to all [pdf] (kyutai.org)