Hacker News with Generative AI: Fine-Tuning

All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning (arxiv.org)
From a first-principles perspective, it may seem odd that the strongest results in foundation model fine-tuning (FT) are achieved via a relatively complex, two-stage training procedure.
Domain-Aware Fine-Tuning of Foundation Models (arxiv.org)
Fine-Tune Embeddings in Google Colab (research.google.com)
CURLoRA: Stable LLM Fine-Tuning and Catastrophic Forgetting Mitigation (zenodo.org)