Week 8: Research & Careers in Safety

Empirical Research Workflow, Active Alignment Agendas, and Careers in Safety

Overview

Over the past several weeks, we surveyed both the technical landscape of AI safety and the institutional levers through which it might be governed. This closing session turns from understanding the field to working in it. We start with practical advice on the empirical research workflow — choosing problems, prioritizing experiments for information gain, and moving fast under uncertainty — alongside the broader mindset of explore, understand, distill. We then survey active alignment research agendas and discuss what next steps and careers in AI safety can look like.

Learning Objectives

By the end of Week 8, fellows should be able to:

  • Describe the stages of an empirical research project — exploration, hypothesis-testing, distillation
  • Apply practical heuristics for doing safety research effectively, including prioritizing for information gain and designing fast experimental loops
  • Identify major active research agendas in AI alignment and articulate considerations for evaluating which problems are tractable, neglected, or high-impact

Core Readings

Recommended Readings

Further Readings