Multi-armed bandits and beyond

By Shipra Agrawal

Appears in the collection: Theoretical Computer Science Spring School: Machine Learning / Ecole de Printemps d'Informatique Théorique : Apprentissage Automatique

In this tutorial I will discuss recent advances in the theory of multi-armed bandits and reinforcement learning, in particular the upper confidence bound (UCB) and Thompson Sampling (TS) techniques for algorithm design and analysis.

Video information

Recording date 23/05/2022
Publication date 21/06/2022
Institute CIRM
License CC BY NC ND
Language English
Audience Researchers, PhD students
Director(s) Jean Petit
Format MP4

Citation data

DOI 10.24350/CIRM.V.19921203
Cite this video Agrawal, Shipra (23/05/2022). Multi-armed bandits and beyond. CIRM. Audiovisual resource. DOI: 10.24350/CIRM.V.19921203
URL https://dx.doi.org/10.24350/CIRM.V.19921203