Search, Reason or Recombine? Paradigms for Scaling Formal Proving | Vidéo | Carmin.tv

00:00:00 / 00:00:00

Search, Reason or Recombine? Paradigms for Scaling Formal Proving

By Fabian Glöckle

Appears in collection : Mathematics for an by Large Language Models – 2025 Edition

In the effort to scale test-time computation for language models on mathematical benchmarks, two prominent paradigms have emerged: large-scale search with reinforcement learning, exemplified by methods like AlphaProof, and long chain-of-thought reasoning with emergent self-verification, as seen in models like o1. For the future of reinforcement learning in formal theorem proving, this opens up a spectrum of hybrid methods. These range from line-level tree search with environment feedback to multi-turn iterative whole proof generation, with and without integrated informal reasoning, to hierarchical problem decompositions and recombination of partial proofs. I will explain these methods as inference methods and discuss the challenges faced when applying reinforcement learning to them.

Information about the video

Date of recording 22/05/2025
Date of publication 30/05/2025
Institution IHES
Language English
Audience Researchers
Format MP4

Domain(s)

Last related questions on MathOverflow

You have to connect your Carmin.tv account with mathoverflow to add question

Ask a question on MathOverflow

Copyright Carmin.tv 2025

Give feedback