MathNet: 30,676 olympiad problems from 47 countries, SOTA models still fall short
An MIT team published MathNet, a multimodal benchmark with 30,676 olympiad math problems from 47 countries and 17 languages. Gemini-3.1-Pro achieves 78.4%, GPT-5 69.3%, and embedding models have significant difficulty finding mathematically equivalent problems.