Fwd: AI at the Maths Olympiad

44 views

Skip to first unread message

Siddhartha Gadgil

unread,

Jul 26, 2024, 2:33:49 AM7/26/24

to Automated Mathematics India

---------- Forwarded message ---------
From: Siddhartha Gadgil <gad...@iisc.ac.in>
Date: Fri, 26 Jul 2024 at 09:29
Subject: AI at the Maths Olympiad
To: ALL @ MATH <all....@iisc.ac.in>
Cc: Rajesh Sundaresan <raj...@iisc.ac.in>

Dear All,

Another step for AI in mathematics from Google-Deepmind: https://deepmind.google/discover/blog/ai-solves-imo-problems-at-silver-medal-level/

Here are some of my comments:

The system ran for 2 days (so beyond Olympiad time limits). It got 4 out of 6 problems, getting 7/7 in each of them. This meant its score was at the top end of the silver medal range (taking more time is not a big deal as the system will surely get faster).
In addition to "AlphaGeometry" for solving the geometry problems, the system included "AlphaProof", which uses generative AI with the Lean Theorem Prover.
Indeed, one of the top experts in Lean (Eric Weiser) was part of the team and there was deep integration with Lean including "metaprogramming"
Crucially, the actual problems were solved. A lot of false hype with AI solving Olympiad problems has been where a problem says, "find the number n such that ..." and the people running the system change this to "show that 1324 is the number such that ..." , which often makes it a much easier problem. This was not done here - the system guessed answers and then tried to prove the guesses in Lean, accepting an answer if it was proved.
That Generative AI paired with Interactive Theorem Provers is what is needed for mathematical reasoning (in a broad sense) is something a large number of people have concluded independently. This seems to show that this is correct and that such a system has promise.

AI achieves silver-medal standard solving International Mathematical Olympiad problems

Breakthrough models AlphaProof and AlphaGeometry 2 solve advanced reasoning problems in mathematics

deepmind.google

regards,

Siddhartha

P.S. My opinion: the mathematics community (in a broad sense, including applied mathematics that is theorem-proof based at least) has two choices,

Work on and think about tools and workflows that will allow us to use AI (probably coupled with Lean) in our research (and teaching).
Continue as usual and let AI in mathematics develop in parallel. Essentially this means betting that a system that can autonomously do mathematics at a level equal to or better than us (come up with questions from the literature, answer some, decide which are non-trivial, write a paper and submit it) is sufficiently far in the future (i.e., many decades rather than, say, 10 years).
Whether or not a system can be a Gromov or Deligne, if it can do work better than most papers of most mathematicians it will already be disruptive.

Reply all

Reply to author

Forward

0 new messages