- Seminar Calendar
- Seminar Archive
- 2024-2025 Semester 2
- 2024-2025 Semester 1
- 2023-2024 Semester 2
- 2023-2024 Semester 1
- 2022-2023 Semester 2
- 2022-2023 Semester 1
- 2021-2022 Semester 2
- 2021-2022 Semester 1
- 2020-2021 Semester 2
- 2020-2021 Semester 1
- 2019-2020 Semester 2
- 2019-2020 Semester 1
- 2018-2019 Semester 2
- 2018-2019 Semester 1
- 2017-2018 Semester 2
- 2017-2018 Semester 1
- 2016-2017 Semester 2
- 2016-2017 Semester 1
- 2015-2016 Semester 1
- 2015-2016 Semester 2
- 2014-2015 Semester 2
- 2014-2015 Semester 1
- 2013-2014 Semester 2
- 2013-2014 Semester 1
- 2012-2013 Semester 2
- 2012-2013 Semester 1
- 2011-2012 Semester 2
- 2011-2012 Semester 1
- 2010-2011 Semester 2
- 2010-2011 Semester 1
- 2009-2010 Semester 2
- 2009-2010 Semester 1
- 2008-2009 Semester 2
- 2008-2009 Semester 1
- 2007-2008 Semester 2
- 2007-2008 Semester 1
- 2006-2007 Semester 2
- 2006-2007 Semester 1
- 2005-2006 Semester 2
- 2005-2006 Semester 1
- Contact
- Site Map
Actor-Critic Reinforcement Learning Algorithms for Mean Field Games in Continuous time, State and Action Spaces
----------------------------------------------------------------------------------------------------
Department of Systems Engineering and Engineering Management
The Chinese University of Hong Kong
----------------------------------------------------------------------------------------------------
Date: Wednesday, May 21, 15:00 pm – 16:00 pm
Venue: ERB 513, The Chinese University of Hong Kong
Speaker: Professor Zhiping Chen, Xi'an Jiaotong University
Title: Actor-Critic Reinforcement Learning Algorithms for Mean Field
Games in Continuous time, State and Action Spaces
Abstract: We investigate mean field games in continuous time, state
and action spaces with an infinite number of agents, where each agent
aims to maximize its expected cumulative reward. Using the technique
of randomized policies, we show policy evaluation and policy gradient
are equivalent to the martingale conditions of a process by focusing
on a representative agent. Then combined with fictitious game, we
propose online and offline actor-critic algorithms for solving
continuous mean field games that update the value function and policy
alternatively under the given population state distribution. We
demonstrate through numerical experiments that our proposed algorithms
can converge to the mean field equilibrium quickly and stably.
Bio: Dr. Zhiping Chen is a Professor of Operations Research and
Finance at School of Mathematics and Statistics, Xi'an Jiaotong
University, is vice director of Tianyuan Mathematical Center in
Northwest China. His research interests include reinforcement
learning, machine learning, multistage stochastic programming,
distributionally robust optimization, risk measure and insurance. He
serves as an Editorial Board Member of OR Spectrum, Journal of Xi'an
Jiaotong University and Chinese Journal of Engineering Mathematics,
and as standing committee members of a few academic societies like the
Operations Research Society of China. He has chaired five projects
from the National Natural Science Foundation of China, one project
from the National Key R\&D Program of China and quite a few industrial
projects. He has published more than 120 papers in journals such as
MP, MOR, SIAM J Optim, JOTA, JOGO, EJOR, AOR, JB F, JEDC, IME, SAJ and
top meetings like AAAI.
Date:
Wednesday, May 21, 2025 - 15:00