3rd Edition of Mathematics for and by Large Language Models

Collection 3rd Edition of Mathematics for and by Large Language Models

Organisateur(s) Michael Douglas, Amaury Hayat, Julio Parra-Martinez and Yiannis Vlassopoulos
Date(s) 28/05/2026 - 28/05/2026
URL associée https://indico.math.cnrs.fr/event/16396/
00:00:00 / 00:00:00
4 4

Why AI Needs Formal Mathematics

De Edward Lockhart

Current reinforcement learning methods train Large Language Models to generate outputs that satisfy an automated judge. While this drives impressive feats of reasoning, it inadvertently incentivises the superficial appearance of correctness. Models may learn to "reward hack" by glossing over logical flaws or confidently making false claims. In this talk, I will explore how some AI researchers are turning to formal verification to solve this illusion of competence. By pairing LLMs with proof assistants, we can shift AI training from adversarial reward-maximisation to a cooperative process where reward hacking becomes impossible. I will also examine the broader implications of this emerging capability, discussing how "formalisation on-demand" can serve as a substitute for human social credibility and lay the groundwork for fully autonomous AI mathematical research.

Informations sur la vidéo

  • Date de captation 28/05/2026
  • Date de publication 12/06/2026
  • Institut IHES
  • Langue Anglais
  • Audience Chercheurs
  • Format MP4

Dernières questions liées sur MathOverflow

Pour poser une question, votre compte Carmin.tv doit être connecté à mathoverflow

Poser une question sur MathOverflow




Inscrivez-vous

  • Mettez des vidéos en favori
  • Ajoutez des vidéos à regarder plus tard &
    conservez votre historique de consultation
  • Commentez avec la communauté
    scientifique
  • Recevez des notifications de mise à jour
    de vos sujets favoris
Donner son avis