Alignment Research Engineer Accelerator
This is where the ARENA course content is hosted. For more information about the ARENA program, including upcoming cohorts and how to apply, visit arena.education.
Fundamentals
Build your foundation in deep learning, from prerequisites through CNNs, optimization, backpropagation, and generative models.
Interpretability
Dive deep into language model interpretability, from linear probes and SAEs to circuit analysis and toy models.
RL
Take a whirlwind tour through RL, starting from tabular learning and Atari, and ending with some of the cutting-edge techniques used in current LLM post-training.
Evals
Learn to build and run evaluations for large language models, including dataset generation and LLM agents.
Alignment Science
Case studies in misalignment, covering a range of topics and techniques (both white-box and black-box).