ARENA
Content
Chapter 0: Fundamentals Chapter 1: Transformer Interpretability Chapter 2: Reinforcement Learning Chapter 3: LLM Evaluations Chapter 4: Alignment Science
Planner Setup Instructions FAQ

Evals

  • 3.1 Intro to Evals
  • 3.2 Dataset Generation
  • 3.3 Running Evals with Inspect
  • 3.4 LLM Agents
In this section
    On this page
    1. Content
    2. Evals

    Chapter 3: LLM Evaluations

    Learn to build and run evaluations for large language models, including dataset generation and LLM agents.

    Sections

    3.1 Intro to Evals Design threat models and specifications for evaluating model properties.
    3.2 Dataset Generation Use LLMs to generate and refine high-quality evaluation datasets.
    3.3 Running Evals with Inspect Run standardised LLM evaluations using UK AISI's Inspect library.
    3.4 LLM Agents Build LLM agents with scaffolding to play Wikipedia Racing and other tasks.
    Select Context
    Select exercise content to download as text files, or to provide as context when asking questions below.
    Sections
    File type
    Visit a chapter section to download content
    Ask a Question
    Ask questions about the exercises. The model will use the context you've selected above to provide relevant answers.
    Ask questions about the exercises...

    ARENA - Alignment Research Engineer Accelerator