Hacker News with Generative AI: Synthetic Data

Nvidia Bets Big on Synthetic Data (wired.com)
Nvidia has acquired synthetic data firm Gretel for nine figures, according to two people with direct knowledge of the deal.
Show HN: Curator – an open-source library for synthetic data generation (github.com/bespokelabsai)
Bespoke Curator makes it easy to create synthetic data pipelines. Whether you are training a model or extracting structure, Curator will prepare high-quality data quickly and robustly.
Model Collapse (wikipedia.org)
Model collapse is a phenomenon where machine learning models gradually degrade due to errors coming from uncurated training on the outputs of another model, including prior versions of itself.[1][2][3][4] Such outputs are known as synthetic data.
'Nemotron-4 340B' model redefines synthetic data generation, rivals GPT-4 (venturebeat.com)