Hacker News with Generative AI: Olympiads

Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad (arxiv.org)
Recent math benchmarks for large language models (LLMs) such as MathArena indicate that state-of-the-art reasoning models achieve impressive performance on mathematical competitions like AIME, with the leading model, o3-mini, achieving scores comparable to top human competitors.

Gold-Medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 (arxiv.org)
We present AlphaGeometry2, a significantly improved version of AlphaGeometry introduced in Trinh et al. (2024), which has now surpassed an average gold medalist in solving Olympiad geometry problems.

Where have the IMO gold medallists ended up? Part three of three (xquant.substack.com)