Hacker News with Generative AI: Transcription

Show HN: Gemini LLM corrects ASR YouTube transcripts (ldenoue.github.io)
Getting summary...
Researchers claim that an AI-powered transcription tool invents things (apnews.com)
Tech behemoth OpenAI has touted its artificial intelligence-powered transcription tool Whisper as having near “human level robustness and accuracy.”
AI-powered transcription tool used in hospitals invents things no one ever said (apnews.com)
Tech behemoth OpenAI has touted its artificial intelligence-powered transcription tool Whisper as having near “human level robustness and accuracy.”
Improving Whisper Transcriptions with GPT-4o (github.com/orcaman)
I was watching the latest news episode from Whisky.com (where fine spirits meet ™) the other day on YouTube, and noticed that the transcription was really off.
Ask HN: How to transcribe a couple thousand calls per day? (ycombinator.com)
We have tried Microsoft Speech Service and found it to be way too complicated.
Show HN: LLM Aided Transcription Improvement (github.com/Dicklesworthstone)
OTranscribe: A free and open tool for transcribing audio interviews (otranscribe.com)
Show HN: Transcripto (transcripto.xyz)
Ask HN: How to transcribe 1000s of handwritten notes (ycombinator.com)
Self-hosted offline transcription and diarization service with LLM summary (github.com/transcriptionstream)