Hacker News with Generative AI: Pre-training

Exploring Impact of Code in Pre-Training (arxiv.org)
Electra: Pre-Training Text Encoders as Discriminators Rather Than Generators (2020) (arxiv.org)
CatLIP: Clip Vision Accuracy with 2.7x Faster Pre-Training on Web-Scale Data (arxiv.org)