Evaluating Language Models

On this episode of The Data Exchange podcast, CRFM’s Percy Liang explains foundation models, key findings from the HELM benchmarking project, and the gaps between public and private models.