Collection 2022 - T3 - WS1 - Non-Linear and High Dimensional Inference

Organisateur(s) Aamari, Eddie ; Aaron, Catherine ; Chazal, Frédéric ; Fischer, Aurélie ; Hoffmann, Marc ; Le Brigant, Alice ; Levrard, Clément ; Michel, Bertrand

Date(s) 03/10/2022 - 07/10/2022

URL associée https://indico.math.cnrs.fr/event/7545/

10 21

Dimensionality reduction in reinforcement learning by randomisation

De Denis Belomestny

In reinforcement learning an agent interacts with an environment, whose underlying mechanism is unknown, by sequentially taking actions, receiving rewards, and transitioning to the next state. With the goal of maximizing the expected sum of the collected rewards, the agent must carefully balance between exploring in order to gather more information about the environment and exploiting the current knowledge to collect the rewards. In this talk, we are interested in solving this exploration-exploitation dilemma by injecting noise into the agent’s decision-making process in such a way that the dependence of the regret on the dimension of state and action spaces is minimised. We also review some recent approaches towards dimension reduction in RL.

Informations sur la vidéo

Date de captation 06/10/2022
Date de publication 03/05/2024
Institut IHP
Langue Anglais
Audience Chercheurs, Doctorants
Réalisateur(s) Alexandre Duplessis, Marco Perez
Format MP4
Lieu Amphitheater Hermite, IHP

Données de citation

DOI 10.57987/IHP.2022.T3.WS1.010
Citer cette vidéo Belomestny, Denis (06/10/2022). Dimensionality reduction in reinforcement learning by randomisation. IHP. Audiovisual resource. DOI: 10.57987/IHP.2022.T3.WS1.010
URL https://dx.doi.org/10.57987/IHP.2022.T3.WS1.010