arXiv:2605.22763: AI agent with Lean verification solves 9 open Erdős problems and 44 OEIS conjectures
A team of 20 researchers from DeepMind and MIT CSAIL published the first large-scale evaluation of LLMs for autonomous generation of formal proofs in the Lean theorem prover. The agent combines LLM generation with Lean symbolic verification and autonomously solves 9 of 353 open Erdős problems and proves 44 of 492 OEIS conjectures.