BEGIN:VCALENDAR VERSION:2.0 PRODID:-//132.216.98.100//NONSGML kigkonsult.se iCalcreator 2.20.4// BEGIN:VEVENT UID:20250826T164007EDT-8817c9V245@132.216.98.100 DTSTAMP:20250826T204007Z DESCRIPTION:Virtual Informal Systems Seminar (VISS) Centre for Intelligent Machines (CIM) and Groupe d'Etudes et de Recherche en Analyse des Decision s (GERAD)\n\nDileep Kalathil\n\nAbstract:\n Reinforcement Learning (RL) is the class of machine learning that addresses the problem of learning to co ntrol unknown dynamical systems. RL has achieved remarkable success recent ly in applications like playing games and robotics. However\, most of thes e successes are limited to very structured or simulated environments. When applied to real-world systems\, RL algorithms face two fundamental source s of fragility. Firstly\, the real-world system parameters can be very dif ferent from that of the nominal values used for training RL algorithms. Se condly\, the control policy for any real-world system is required to maint ain some necessary safety criteria to avoid undesirable outcomes. Most dee p RL algorithms overlook these fundamental challenges which often results in learned polices that can performs poorly in the real-world setting. We address these issues in two steps. First\, we propose a robust reinforceme nt learning algorithm to train policies that account for the possible para meter mismatches between the simulation system and real-world system. Seco nd\, we develop a safe reinforcement learning algorithm to learn policies such that the frequency of visiting undesirable states and expensive actio ns satisfies the safety constraints.\n \n Bio:\n Dileep Kalathil is an Assist ant Professor in the Department of Electrical and Computer Engineering at Texas A&M University. His main research area is reinforcement learning\, w ith applications in cyber-physical systems\, intelligent transportation sy stems and power systems. In particular\, his research addresses three fund amental problems in RL: (i) How to develop data efficient RL algorithms? ( ii) How to develop safe and robust RL algorithms? and (iii) How to develop scalable multi-agent RL algorithms? Before joining TAMU\, he was a postdo ctoral researcher in the EECS department at UC Berkeley. He received his P hD from University of Southern California (USC) in 2014 where he won the b est PhD Dissertation Prize in the Department of Electrical Engineering. He received an M. Tech. from IIT Madras where he won the award for the best academic performance in the Electrical Engineering Department.\n DTSTART:20201030T153000Z DTEND:20201030T163000Z LOCATION:CA\, ZOOM SUMMARY:Reinforcement Learning with Robustness and Safety Guarantees URL:/cim/channels/event/reinforcement-learning-robustn ess-and-safety-guarantees-325752 END:VEVENT END:VCALENDAR