Lecture, four hours; discussion, one hour; outside study, seven hours. Requisite: course 131A. Key concepts, principles, and algorithms of online learning and of learning to make decisions under uncertainty in a broad context, including Markov decision processes, optimal stopping, reinforcement learning, structural results for online learning, multiarmed bandit learning, multiagent learning, and multiagent deep learning. Letter grading.

Review Summary

Clarity: N/A
Organization: N/A
Time: N/A
Overall: N/A

Course

Previously taught: 20S
Formerly offered as: EL ENGR 238