Topics

Below you’ll find papers and topics we hope to cover on the seminar. We will also gladly accept talks relevant to AI safety from outside the list.

Introductions

Foundational work

Corrigibility

Machine Learning

Objective functions

Reasoning about other agents

Decision theory

Other

Additional sources