14Sep2021

An Introduction to Actor Critic Methods for Optimal Control

Sean Meyn, University of Florida, USA 2.27.10116:00 - 17:00

The goal of actor critic methods is to estimate the best policy among a parameterized family for a controlled Markov chain. Through the magic of Markov chain theory, it is possible to obtain unbiased estimates of the objective through the geometry of TD-learning. These algorithms were born from the dissertations of Van Roy and Konda in the 1990s, under the supervision of Tsitsiklis at MIT.

The lecture will consist of two parts. Part 1 is an introduction to the TD(1) algorithm, that is one part of the actor-critic method. The elegant theory is accompanied by a significant warning: while the algorithm solves a projection problem, it is a Monte-Carlo method that can come with massive variance. Part 2 is an introduction to the actor critic algorithm, and the crucial role of the TD(1) algorithm. It seems likely that the variance can be tamed in these algorithms, but this remains a research frontier.

Upcoming Events

15Sep2025

IRTG Workshop: Career Planning, Profile Development, and Self-Presentation for female Researchers I

Pd Dr Mareike Menne, BUSCH & DR MENNE Hochschulconsulting 09:00-13:00

Online Workshop with PD Dr Mareike Menne

In the first part of the workshop, we will cover key principles of career planning, labour market…

more ›

17Sep2025

IRTG Workshop: Career planning, profile development, and self-presentation for female Researchers II

PD Dr Mareike Menne, BUSCH & DR MENNE Hochschulconsulting 09:00 - 13:00

Online Workshop with PD Dr Mareike Menne

In the second part, we will build on this initial work. Often, concrete questions arise at this stage, such…

more ›

23Sep2025

Potsdam DA Days 2025

The Potsdam DA Days 2025 will take place from September 23^rd to 24^th.The DA Days will be opened by the 8^th Kálmán Lecture with Daniela Calvetti with…

more ›