Hacker News with Generative AI: Language Modeling

A CC-BY Open-Source TTS Model with Voice Cloning (huggingface.co)
OuteTTS-0.1-350M is a novel text-to-speech synthesis model that leverages pure language modeling without external adapters or complex architectures. Built upon the LLaMa architecture using our Oute3-350M-DEV base model, it demonstrates that high-quality speech synthesis is achievable through a straightforward approach using crafted prompts and audio tokens.
Have we stopped to think about what LLMs model? (theregister.com)