Ryan: Human and Machine Intelligence Alignment Fall 2025
Dan Ryan
*but not about the course; that's what we'll talk about next.
2015
Do YOU think we should worry about intelligent machines that do exactly what we ask them to do?
2019
2019
Dan Ryan on Social Theory and Alignment [6m]
Introduction to AI
Computer Science Ethics
Ethics of AI
AI Alignment (CS Version)
AI 101
Alignment 101
How to read hard stuff
Thinking Analogically
The Four Alignment Problems
Isn't It Just "Ethics"?
Principles Are Not Enough!
Align is a Verb
Shared Meaning
Hierarchy
Groups
Markets
Qualification
Records
Control
Deterrence
Incentives
Safety
Institutions
Foundations
Humans Have Been Doing Alignment Forever
Traits Are Not Enough!
Humans + Humans
Principals + Agents
Normies + Magicians
Humans + Machines
Humans
Organizations
Experts
Machines
27 class meetings.
~25 pre-class work.
<25 post-class work.
Multiple-draft essay.
Alignment card deck.
Oral exam.
PostClass Work
PreClass Work AI Bootcamp
maximize a utility function
inverse reinforcement learning
value alignment