Hacker News with Generative AI: Fine-Tuning

All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning (arxiv.org)
From a first-principles perspective, it may seem odd that the strongest results in foundation model fine-tuning (FT) are achieved via a relatively complex, two-stage training procedure.

Machine Learning, Reinforcement Learning, Fine-Tuning, Foundation Models

3 points by gkswamy98 495 days ago | 0 comments

Machine Learning, Foundation Models, Fine-Tuning

12 points by PaulHoule 723 days ago | 0 comments

Fine-Tuning, Embeddings, Machine Learning, Google Colab

8 points by esleightholm 724 days ago | 1 comments

Machine Learning, Fine-Tuning, Catastrophic Forgetting

62 points by mnoorfawi 733 days ago | 9 comments