Lecture, four hours; discussion, one hour; outside study, seven hours. Requisite: course 131A. Key concepts, principles, and algorithms of online learning and learning how to make decisions under uncertainty in broad context, including Markov decision processes, optimal stopping, reinforcement learning, structural results for online learning, multiarmed bandits learning, multiagent learning, multiagent deep learning. Letter grading.

Review Summary

Clarity
N/A
Organization
N/A
Time
N/A
Overall
N/A

Course

Previously taught
21S
Formerly offered as
EL ENGR 238

Previous Grades

Grade distributions not available.