Several years before ChatGPT began jibber-jabbering away, Google developed a very different kind of artificial intelligence program called AlphaGo that learned to play the board game Go with superhuman skill through tireless practice.
Researchers at the company have now published research that combines the abilities of a large language model (the AI behind today's chatbots) with those of AlphaZero, a successor to AlphaGo also capable of playing chess, to solve very tricky mathematical proofs.
Their new Frankensteinian creation, dubbed AlphaProof, has demonstrated its prowess by tackling several problems from the 2024 International Math Olympiad (IMO), a prestigious competition for high school students.
AlphaProof uses the Gemini large language model to convert naturally phrased math questions into a programming language called Lean. This provides the training fodder for a second algorithm to learn, through trial and error, how to find proofs that can be confirmed as correct.
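For a sense of what a formal statement in Lean looks like, here is a toy illustration in Lean 4. It is not drawn from AlphaProof or the IMO: the theorem name `sum_commutes` is made up for this example, and the claim is far simpler than any competition problem. The plain-English version sits in the comment, the formal version follows, and Lean accepts the file only if the proof actually checks.

```lean
-- Informal statement: "for all natural numbers a and b, a + b = b + a."
-- `sum_commutes` is a hypothetical name chosen for illustration.
theorem sum_commutes (a b : Nat) : a + b = b + a :=
  Nat.add_comm a b  -- appeals to a lemma from Lean's core library
```

Because the checker either accepts or rejects a proof, a system searching by trial and error gets an unambiguous signal about whether an attempt succeeded.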
Earlier this year, Google DeepMind revealed another math algorithm called AlphaGeometry that also combines a language model with a different AI approach. AlphaGeometry uses Gemini to convert geometry problems into a form that can be manipulated and tested by a program that handles geometric elements. Google today also announced a new and improved version of AlphaGeometry.
The researchers found that their two math programs could provide proofs for IMO puzzles as well as a silver medalist could. Out of six problems total, AlphaProof solved two algebra problems and a number theory one, while AlphaGeometry solved a geometry problem. The programs got one problem in minutes but took up to several days to figure out others. Google DeepMind has not disclosed how much computer power it threw at the problems.
Google DeepMind calls the approach used for both AlphaProof and AlphaGeometry "neuro-symbolic" because they combine the pure machine learning of an artificial neural network, the technology that underpins most progress in AI of late, with the language of conventional programming.
"What we've seen here is that you can combine the approach that was so successful, and things like AlphaGo, with large language models and produce something that is extremely capable," says David Silver, the Google DeepMind researcher who led work on AlphaZero. Silver says the techniques demonstrated with AlphaProof should, in theory, extend to other areas of mathematics.
Indeed, the research raises the prospect of addressing the worst tendencies of large language models by applying logic and reasoning in a more grounded fashion. As miraculous as large language models can be, they often struggle to grasp even basic math or to reason through problems logically.
In the future, the neural-symbolic method could provide a means for AI systems to turn questions or tasks into a form that can be reasoned over in a way that produces reliable results. OpenAI is also rumored to be working on such a system, codenamed "Strawberry."
There is, however, a key limitation with the systems revealed today, as Silver acknowledges. Math solutions are either correct or incorrect, allowing AlphaProof and AlphaGeometry to work their way toward the right answer. Many real-world problems (coming up with the ideal itinerary for a trip, for instance) have many possible solutions, and which one is ideal may be unclear. Silver says the solution for more ambiguous questions may be for a language model to try to determine what constitutes a "right" answer during training. "There's a spectrum of different things that can be tried," he says.
Silver is also careful to note that Google DeepMind won't be putting human mathematicians out of jobs. "We are aiming to provide a system that can prove anything, but that's not the end of what mathematicians do," he says. "A big part of mathematics is to pose problems and find what are the interesting questions to ask. You might think of this as another tool along the lines of a slide rule or calculator or computational tools."
Updated 7/25/24 1:25 pm ET: This story has been updated to clarify how many problems AlphaProof and AlphaGeometry solved, and of what type.