Hacker News with Generative AI: Audio Generation

Show HN: PlayNote – NotebookLM but with custom voices and API (play.ai)
Turn your files and data into captivating audio creations. Enjoy cutting-edge AI voice synthesis.
Pushing the frontiers of audio generation (deepmind.google)
Our pioneering speech generation technologies are helping people around the world interact with more natural, conversational and intuitive digital assistants and AI tools.
Amphion: An open-source audio, music, and speech generation toolkit (github.com/open-mmlab)
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer (haidog-yaqub.github.io)
EzAudio is an advanced text-to-audio (T2A) generation model that creates high-quality audio from text prompts. It sets a new standard for open-source T2A models by delivering fast, efficient, and realistic sound effects generation.
Generating audio for video: using video and text prompts to generate soundtracks (deepmind.google)