Skip to main content Skip to secondary navigation
Page Content

Stanford debuts first AI benchmark to help understand LLMs

HAI’s Center for Research on Foundation Models launches Holistic Evaluation of Language Models (HELM), the first benchmarking project aimed at improving the transparency of language models and the broader category of foundation models.