Hacker News with Generative AI: Olympiads

Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad (arxiv.org)
Recent math benchmarks for large language models (LLMs) such as MathArena indicate that state-of-the-art reasoning models achieve impressive performance on mathematical competitions like AIME, with the leading model, o3-mini, achieving scores comparable to top human competitors.

Mathematics, Benchmarking, AI, Olympiads

6 points by mauriziocalo 480 days ago | 1 comment

Gold-Medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 (arxiv.org)
We present AlphaGeometry2, a significantly improved version of AlphaGeometry introduced in Trinh et al. (2024), which has now surpassed an average gold medalist in solving Olympiad geometry problems.

Artificial Intelligence, Mathematics, Geometry, Olympiads

64 points by hnhn34 533 days ago | 5 comments

Where have the IMO gold medallists ended up? Part three of three (xquant.substack.com)

Mathematics, Olympiads, Education

29 points by nb_quant 718 days ago | 7 comments