What are the limits of the AI mathematician?


The writer is a theoretical cosmologist at the University of Cambridge and director of the Infosys-Cambridge AI Centre

Mathematics was once assumed to be relatively safe from the incoming juggernaut of artificial intelligence automation. Chatbots might be able to generate text, code and images on demand, but the deep reasoning required for mathematics was supposedly out of reach. The gold medals that OpenAI and DeepMind recently achieved at the International Mathematical Olympiad have therefore left maths professors like me feeling suddenly a little less safe.

Is AI about to do to mathematical proofs what it’s already doing to coding? After all, the two have clear similarities: both are highly structured “languages” with clear conventions and restricted “dictionaries”. Both have large corpora of examples on which AI can be trained with known solutions.

Yet while the results from cutting-edge AI maths models are impressive, there is another class of maths that generative AI still struggles with: simple computation. Ask “what is 5.11 minus 5.9?” and the answers vary. This morning, OpenAI’s latest GPT5 model gave me the correct answer of -0.79. But phrase the question as part of a calculation and you may receive a different answer.

What should we make of AI models that can outperform high school-age Olympiad competitors but cannot always add or subtract to primary school level? To understand this, it’s helpful to think about what it means to be good at maths.

The way maths is taught is by showing students a problem, demonstrating the method required to solve it and then assigning examples. Weaker students require numerous examples and sometimes end up simply memorising the method without understanding it. The strongest students need only one or two examples to master the concept and apply it to new problems.

The ability to conceptualise and generalise distinguishes the best mathematicians. Good mathematicians solve hard problems; great ones find ways to make the hard problems easy.

The strengths of AI models lie in their speed and ability to “practise” at extremely high volumes. This means they can solve very difficult problems that bear some resemblance to things they have been shown before but may struggle when given something new. This is particularly a problem for theoretical maths. The number of examples available for training drops as you move towards more advanced problems.

These are well-known issues with neural networks. They are great at interpolation (generating answers that are “between” things they’ve seen before) and bad at extrapolation (generating answers that fall outside their training set).

In maths, this is made extra difficult by problems that sound similar. Consider: “What is the maximum number of cubes of volume 1 that you can fit in a cube of volume 64?” and “What is the maximum number of spheres of volume 1 that you can fit in a sphere of volume 64?”. They sound alike but one is simple to solve (cubes fit together neatly in a 4x4x4 block), while the other is fiendish (spheres do not stack nicely).

What this means is that AI use in applied mathematics and cosmology is still limited. We can take things we already know how to do and use AI to automate them. But so far, calculation has seen little advancement.

It is possible, however, that more training will solve the problem without extrapolation ever being required. If AI models can be fed enough complex calculations they could perhaps solve problems that have so far eluded us without the need for any human-level inspiration.

The question being asked in my field is: “How powerful is an extremely fast, extremely well-trained, unthinking mathematician?” We are in the process of finding out.
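
For readers who want to check the two pieces of simple arithmetic quoted in the piece, the following is a minimal Python sketch. It is only an illustration: the function names are hypothetical, chosen for this example, and nothing here reflects how any AI model computes its answers. It uses exact decimal arithmetic for “5.11 minus 5.9” and confirms that a cube of volume 64 has an edge of 4, so unit cubes fill it as a 4x4x4 block.

```python
from decimal import Decimal


def exact_difference(a: str, b: str) -> Decimal:
    """Subtract two decimal numbers given as strings, avoiding binary float rounding."""
    return Decimal(a) - Decimal(b)


def cube_edge(volume: int) -> int:
    """Return the integer edge length of a cube with the given volume.

    Raises ValueError if the volume is not a perfect cube.
    """
    edge = round(volume ** (1 / 3))  # float cube root, rounded to the nearest integer
    if edge ** 3 != volume:
        raise ValueError("volume is not a perfect cube")
    return edge


# The subtraction quoted in the article.
print(exact_difference("5.11", "5.9"))

# Unit cubes in a cube of volume 64: edge length, then edge cubed.
edge = cube_edge(64)
print(edge, edge ** 3)
```

The sphere version of the question has no comparably short check, which is the point of the comparison in the text.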


