Alignment Research Engineer Accelerator

This is where the ARENA course content is hosted. For more information about the ARENA program, including upcoming cohorts and how to apply, visit arena.education.

0

Fundamentals

Build your foundation in deep learning, from prerequisites through CNNs, optimization, backpropagation, and generative models.

1

Interpretability

Dive deep into language model interpretability, from linear probes and SAEs to alignment faking and thought anchors.

2

RL

Take a whirlwind tour through RL, starting from tabular learning and Atari, and ending with some of the cutting-edge techniques used in current LLM post-training.

3

Evals

Learn to build and run evaluations for large language models, including dataset generation and LLM agents.