BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//132.216.98.100//NONSGML kigkonsult.se iCalcreator 2.20.4//
BEGIN:VEVENT
UID:20260617T082416EDT-10110u69Z1@132.216.98.100
DTSTAMP:20260617T122416Z
DESCRIPTION:\n	Dynamic Games and Applications Seminar\n\n	Speaker: Kaiqing Zh
 ang – University of Illinois at Urbana-Champaign\, United States\n\n	Webina
 r link\n		Webinar ID: 962 7774 9870\n		Passcode: 285404\n\n	Abstract: Recent ye
 ars have witnessed both tremendous empirical successes and fast-growing th
 eoretical development of reinforcement learning (RL)\, in solving many seq
 uential decision-making and control tasks. However\, many RL algorithms ar
 e still several miles away from being applied to practical autonomous syst
 ems\, which usually involve more complicated scenarios with multiple decis
 ion-makers and safety-critical concerns. In this talk\, I will introduce o
 ur work on the development of RL algorithms with provable guarantees\, wit
 h focuses on the multi-agent and safety-critical settings. I will first sh
 ow that policy optimization\, one of the main drivers of the empirical suc
 cesses of RL\, enjoys global convergence and sample complexity guarantees 
 for a class of robust control problems. More importantly\, we show that ce
 rtain policy optimization approaches automatically preserve some 'robustne
 ss' during the iterations\, some property we termed as 'implicit regulariz
 ation'. Interestingly\, such a setting naturally unifies other important b
 enchmark settings in control and game theory: risk-sensitive control desig
 n\, and linear quadratic zero-sum dynamic games\, while the latter is the 
 benchmark multi-agent RL (MARL) setting that mirrors the role played by li
 near quadratic regulators (LQR) for single-agent RL. Despite the nonconvex
 ity and the fundamental challenges in the optimization landscape\, our the
 ory shows that policy optimization enjoys global convergence guarantees in
  these problems as well. The results have then provided some theoretical j
 ustifications for several basic robust RL and MARL settings that are popul
 ar in the empirical RL world. In addition\, I will introduce several other
  works along this line of provable MARL and robust RL\, including decentra
 lized MARL with networked agents\, sample complexity of model-based MARL\,
  etc. Time permitting\, I will also share several future directions based 
 on the previous results\, towards large-scale and reliable autonomy.\n\n
DTSTART:20210218T160000Z
DTEND:20210218T170000Z
LOCATION:CA\, ZOOM
SUMMARY:Provable reinforcement learning for multi-agent and robust control 
 systems
URL:/cim/channels/event/provable-reinforcement-learnin
 g-multi-agent-and-robust-control-systems-328630
END:VEVENT
END:VCALENDAR