The course will provide a rigorous treatment of reinforcement learning by building on the mathematical foundations laid by optimal control, dynamic programming, and machine learning. Topics include model-based methods such as deterministic and stochastic dynamic programming, LQR and LQG control, as well as model-free methods that are broadly identified as Reinforcement Learning. In particular, we will cover on and off-policy tabular methods such as Monte Carlo, Temporal Differences, n-step bootstrapping, as well as approximate solution methods, including on- and off-policy approximation, policy gradient methods, including Deep Q-Learning. The course has a final project where students are expected to formulate and solve a problem based on the techniques learned in class.
Course Prerequisite(s)
***Robotics and Autonomous Systems students only: Must complete core courses first (EN.685.621 AND EN.535.641 AND EN.535.630 AND EN.605.613).
Course Offerings
|
Open
Foundations of Reinforcement Learning
08/31/2026 - 12/11/2026
|