Chapter 3: LLM Evaluations

Learn to build and run evaluations for large language models, including dataset generation and LLM agents.

Sections

Sections

File type

Python Markdown (full) Markdown (without solutions)

Visit a chapter section to download content

Model:

Ask questions about the exercises...