3rd Edition of Mathematics for and by Large Language Models

Collection 3rd Edition of Mathematics for and by Large Language Models

Organizer(s) Michael Douglas, Amaury Hayat, Julio Parra-Martinez and Yiannis Vlassopoulos
Date(s) 28/05/2026 - 28/05/2026
linked URL https://indico.math.cnrs.fr/event/16396/
00:00:00 / 00:00:00
4 4

Why AI Needs Formal Mathematics

By Edward Lockhart

Current reinforcement learning methods train Large Language Models to generate outputs that satisfy an automated judge. While this drives impressive feats of reasoning, it inadvertently incentivises the superficial appearance of correctness. Models may learn to "reward hack" by glossing over logical flaws or confidently making false claims. In this talk, I will explore how some AI researchers are turning to formal verification to solve this illusion of competence. By pairing LLMs with proof assistants, we can shift AI training from adversarial reward-maximisation to a cooperative process where reward hacking becomes impossible. I will also examine the broader implications of this emerging capability, discussing how "formalisation on-demand" can serve as a substitute for human social credibility and lay the groundwork for fully autonomous AI mathematical research.

Information about the video

  • Date of recording 28/05/2026
  • Date of publication 12/06/2026
  • Institution IHES
  • Language English
  • Audience Researchers
  • Format MP4

Last related questions on MathOverflow

You have to connect your Carmin.tv account with mathoverflow to add question

Ask a question on MathOverflow




Register

  • Bookmark videos
  • Add videos to see later &
    keep your browsing history
  • Comment with the scientific
    community
  • Get notification updates
    for your favorite subjects
Give feedback