BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//132.216.98.100//NONSGML kigkonsult.se iCalcreator 2.20.4//
BEGIN:VEVENT
UID:20260615T030609EDT-0478HSzbPc@132.216.98.100
DTSTAMP:20260615T070609Z
DESCRIPTION:ISS Informal Systems Seminar\n\nSpeaker: Pierre-Luc Bacon – Uni
 versité de Montréal\, Canada \n\n \n\n\n\nPresentation on YouTube\n\nAbstr
 act: Decision awareness is the learning principle according to which the c
 omponents of a learning system ought to be optimized directly to satisfy t
 he global performance criterion: to produce optimal decisions. This end-to
 -end perspective has recently led to significant advances in model-based r
 einforcement learning by addressing the problem of compounding errors plag
 uing alternative approaches. In this talk\, I will present some of our rec
 ent work on this topic: 1. on learning control-oriented transition models 
 by implicit differentiation and 2. on learning neural ordinary differentia
 l equations end-to-end for nonlinear trajectory optimization. Along the wa
 y\, we will also discuss some of the computational challenges associated w
 ith those methods and our attempts at scaling up performance\, specificall
 y: using an efficient factorization of the Jacobians in the forward mode o
 f automatic differentiation through novel constrained optimizers inspired 
 by adversarial learning.\n\n\nBiography: Pierre-Luc Bacon is an assistant 
 professor at the University of Montreal in the Computer Science and Operat
 ions Research department. He is also a core member of Mila and Ivado and a
  Facebook CIFAR chair holder. He leads a research group of 15 students wor
 king on the challenge posed by the curse of the horizon in reinforcement l
 earning and optimal control.\n
DTSTART:20230210T170000Z
DTEND:20230210T180000Z
LOCATION:CA\, ZOOM
SUMMARY:Decision Awareness in Reinforcement Learning 
URL:/cim/channels/event/decision-awareness-reinforceme
 nt-learning-351638
END:VEVENT
END:VCALENDAR