Mathematics for and by Large Language Models

Collection Mathematics for and by Large Language Models

Organizer(s) François Charton, Michael Douglas, Yiannis Vlassopoulos
Date(s) 23/05/2024 - 23/05/2024
linked URL https://indico.math.cnrs.fr/event/11933/
00:00:00 / 00:00:00
5 7

Mathematics as a Translation Task - the Importance of Training Distributions

By Francois Charton

Many problems of mathematics can be set as translation tasks: problems, represented as sentences in some language, are translated into their solutions, by language models trained from synthetic examples. In this setting, we can choose the distribution of problems and solutions we use to train the model. I present examples from three different experiments, which suggest that this can make a large difference in model performance, and provide intuition on the inner workings of transformer models.

Information about the video

  • Date of recording 23/05/2024
  • Date of publication 25/05/2024
  • Institution IHES
  • Licence CC BY-NC-ND
  • Language English
  • Audience Researchers
  • Format MP4

Last related questions on MathOverflow

You have to connect your Carmin.tv account with mathoverflow to add question

Ask a question on MathOverflow




Register

  • Bookmark videos
  • Add videos to see later &
    keep your browsing history
  • Comment with the scientific
    community
  • Get notification updates
    for your favorite subjects
Give feedback