Multi-armed bandits and beyond

By Shipra Agrawal

Appears in collection: Theoretical Computer Science Spring School: Machine Learning / Ecole de Printemps d'Informatique Théorique : Apprentissage Automatique

In this tutorial I will discuss recent advances in the theory of multi-armed bandits and reinforcement learning, in particular the upper confidence bound (UCB) and Thompson Sampling (TS) techniques for algorithm design and analysis.

Information about the video

Date of recording: 23/05/2022
Date of publication: 21/06/2022
Institution: CIRM
Licence: CC BY NC ND
Language: English
Audience: Researchers, Graduate Students
Director(s): Jean Petit
Format: MP4

Citation data

DOI: 10.24350/CIRM.V.19921203
Cite this video: Agrawal, Shipra (23/05/2022). Multi-armed bandits and beyond. CIRM. Audiovisual resource. DOI: 10.24350/CIRM.V.19921203
URL: https://dx.doi.org/10.24350/CIRM.V.19921203

Copyright Carmin.tv 2026