Seminar: Learning in Linear-quadratic Framework: From Single-agent to Multi-agent, and to Mean-field
----------------------------------------------------------------------------------------------------
Department of Systems Engineering and Engineering Management
The Chinese University of Hong Kong
----------------------------------------------------------------------------------------------------
Date: Friday, February 11, 2022, 1:00 pm HKT
Title: Learning in Linear-quadratic Framework: From Single-agent to Multi-agent, and to Mean-field
Speaker: Professor Renyuan Xu, University of Southern California
Abstract:
The linear-quadratic (LQ) framework is widely studied in the stochastic control, game theory, and mean-field literature because of its simple structure, tractable solutions, and local approximation power for nonlinear control problems. In this talk, we discuss several theoretical results on the policy gradient (PG) method, a popular reinforcement learning algorithm, for LQ problems in which agents have only limited information about the stochastic system. In the single-agent setting, we explain how the PG method is guaranteed to learn the globally optimal policy. In the multi-agent setting, we show that a (modified) PG method can guide agents to a Nash equilibrium provided there is a certain level of noise in the system; the noise can come either from the underlying dynamics or from carefully designed exploration by the agents. Finally, when the number of agents goes to infinity, we propose an exploration scheme with entropy regularization that helps each individual agent explore both the unknown system and the behavior of the other agents. The proposed scheme is shown to speed up and stabilize the learning procedure.
The numerical performance of PG methods is demonstrated with two examples: an optimal execution problem in the single-agent setting and an institutional negotiation/bargaining problem in the multi-agent setting.
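To make the single-agent case concrete, below is a minimal sketch (in Python/NumPy) of a model-free policy gradient iteration for a noisy LQR problem: the controller is a linear policy u = -Kx, and the gradient of the rollout cost with respect to K is estimated from zeroth-order perturbations of the policy. The dynamics (A, B), costs (Q, R), horizon, sample sizes, and step size are illustrative assumptions, not values from the talk.

```python
import numpy as np

# A minimal, model-free policy gradient sketch for a noisy single-agent LQR
# problem.  All problem data (A, B, Q, R, horizon, step sizes) are
# illustrative assumptions chosen for this example, not values from the talk.

np.random.seed(0)
d_x, d_u = 3, 2                       # state and control dimensions
A = 0.9 * np.eye(d_x)                 # assumed stable dynamics x' = A x + B u + w
B = 0.1 * np.random.randn(d_x, d_u)
Q = np.eye(d_x)                       # state cost weight
R = np.eye(d_u)                       # control cost weight
T = 50                                # rollout horizon

def rollout_cost(K, x0):
    """Finite-horizon cost of the linear policy u = -K x started from x0."""
    x, cost = x0, 0.0
    for _ in range(T):
        u = -K @ x
        cost += x @ Q @ x + u @ R @ u
        x = A @ x + B @ u + 0.1 * np.random.randn(d_x)   # process noise
    return cost

def pg_estimate(K, n_samples=100, r=0.05):
    """Zeroth-order (smoothed, one-point) estimate of the policy gradient."""
    grad = np.zeros_like(K)
    for _ in range(n_samples):
        U = np.random.randn(*K.shape)
        U /= np.linalg.norm(U)                    # random direction, unit Frobenius norm
        cost = rollout_cost(K + r * U, np.random.randn(d_x))
        grad += (K.size / (n_samples * r)) * cost * U
    return grad

K = np.zeros((d_u, d_x))              # initial (stabilizing) linear policy
for _ in range(200):
    K -= 1e-4 * pg_estimate(K)        # policy gradient step on K
```

Because only noisy rollout costs are observed, the gradient is estimated from random perturbations of the policy rather than from the model; this is the type of model-free PG update whose global convergence to the optimal linear policy is discussed in the single-agent part of the talk.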
This talk is based on several projects with Xin Guo (UC Berkeley), Ben Hambly (U of Oxford), Huining Yang (U of Oxford), and Thaleia Zariphopoulou (UT Austin).
Biography:
Renyuan Xu is currently a WiSE Gabilan Assistant Professor in the Epstein Department of Industrial and Systems Engineering at the University of Southern California. Before joining USC, she spent two years as a Hooke Research Fellow in the Mathematical Institute at the University of Oxford. She received her Ph.D. in Operations Research from UC Berkeley in 2019. Her research interests lie broadly in machine learning, stochastic control, game theory, and mathematical finance.