Natural Language Processing

NLP is changing how we interact with machines, enabling more fluid communication and better understanding of human language.

New Approach to Scaling Laws Could Change How AI Models Are Trained

Andrew Myers

May 21, 2026

News

Leveraging statistical concepts from measurement science and education, AI researchers have greatly reduced the computational demand of predicting how the largest of large language models will scale up in the future. The approach could save millions of dollars in training costs.

Natural Language Processing | Generative AI

The Promise and Perils of Artificial Intelligence in Advancing Participatory Science and Health Equity in Public Health

Abby C King, Zakaria N Doueiri, Ankita Kaulberg, Lisa Goldman Rosas

Feb 14, 2025

Research

Current societal trends reflect an increased mistrust in science and a lowered civic engagement that threaten to impair research that is foundational for ensuring public health and advancing health equity. One effective countermeasure to these trends lies in community-facing citizen science applications to increase public participation in scientific research, making this field an important target for artificial intelligence (AI) exploration. We highlight potentially promising citizen science AI applications that extend beyond individual use to the community level, including conversational large language models, text-to-image generative AI tools, descriptive analytics for analyzing integrated macro- and micro-level data, and predictive analytics. The novel adaptations of AI technologies for community-engaged participatory research also bring an array of potential risks. We highlight possible negative externalities and mitigations for some of the potential ethical and societal challenges in this field.

Foundation Models | Generative AI | Machine Learning | Natural Language Processing | Sciences (Social, Health, Biological, Physical) | Healthcare

How Can AI Support Language Digitization and Digital Inclusion?

Juan N. Pava, Thomas S. Mullaney, Caroline Meinhardt, Audrey Gao, Diyi Yang

Deep Dive | Feb 26, 2026

White Paper

This white paper analyzes the varying ways AI tools can advance language digitization work, and provides recommendations for responsibly realizing the potential of AI in supporting the digital inclusion of digitally disadvantaged languages.

Ethics, Equity, Inclusion | International Affairs, International Security, International Development | Natural Language Processing

Christopher Manning

Person

Natural Language Processing | Oct 05

AI Leaders Discuss How To Foster Responsible Innovation At TIME100 Roundtable In Davos

TIME

Jan 21, 2026

Media Mention

HAI Senior Fellow Yejin Choi discussed responsible AI model training at Davos, asking, “What if there could be an alternative form of intelligence that really learns … morals, human values from the get-go, as opposed to just training LLMs on the entirety of the internet, which actually includes the worst part of humanity, and then we then try to patch things up by doing ‘alignment’?”

Ethics, Equity, Inclusion | Generative AI | Machine Learning | Natural Language Processing

LABOR-LLM: Language-Based Occupational Representations with Large Language Models

Susan Athey, Herman Brunborg, Tianyu Du, Ayush Kanodia, Keyon Vafa

Dec 11, 2024

Research

Vafa et al. (2024) introduced a transformer-based econometric model, CAREER, that predicts a worker’s next job as a function of career history (an “occupation model”). CAREER was initially estimated (“pre-trained”) using a large, unrepresentative resume dataset, which served as a “foundation model,” and parameter estimation was continued (“fine-tuned”) using data from a representative survey. CAREER had better predictive performance than benchmarks. This paper considers an alternative where the resume-based foundation model is replaced by a large language model (LLM). We convert tabular data from the survey into text files that resemble resumes and fine-tune the LLMs using these text files with the objective of predicting the next token (word). The resulting fine-tuned LLM is used as an input to an occupation model. Its predictive performance surpasses all prior models. We demonstrate the value of fine-tuning and further show that by adding more career data from a different population, fine-tuning smaller LLMs surpasses the performance of fine-tuning larger models.
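
The tabular-to-text step mentioned in the abstract can be pictured with a small sketch. This is only an illustration under assumptions: the field names (`education`, `region`, `year`, `occupation`), the text template, and the function `career_to_text` are hypothetical and are not taken from the paper.

```python
from typing import Dict, List


def career_to_text(worker: Dict[str, str], history: List[Dict[str, object]]) -> str:
    """Render one worker's tabular career history as resume-like text.

    The layout below is a hypothetical template, not the one used by the authors.
    """
    lines = [
        f"Education: {worker['education']}",
        f"Region: {worker['region']}",
        "",
    ]
    # One line per survey row, in chronological order, so that a language model
    # trained on next-token prediction sees each job after the jobs preceding it.
    for row in sorted(history, key=lambda r: r["year"]):
        lines.append(f"{row['year']}: {row['occupation']}")
    return "\n".join(lines)


worker = {"education": "college graduate", "region": "northeast"}
history = [
    {"year": 2003, "occupation": "Cashiers"},
    {"year": 2001, "occupation": "Retail salespersons"},
]
resume_text = career_to_text(worker, history)
print(resume_text)
```

Text files of this kind would then serve as fine-tuning data for a next-token objective; the fine-tuning itself is outside the scope of this sketch.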

Foundation Models | Natural Language Processing

All Work Published on Natural Language Processing

An AI Social Coach Is Teaching Empathy to People with Autism

Sarah Wells

Aug 13, 2025

News

A specialized chatbot named Noora is helping individuals with autism spectrum disorder practice their social skills on demand.

Healthcare

Natural Language Processing

Generative AI

Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs

Krista Opsahl-Ong, Michael J Ryan, Josh Purtell, David Broman, Christopher Potts, Matei Zaharia, Omar Khattab

Nov 14, 2024

Research

Language Model Programs, i.e. sophisticated pipelines of modular language model (LM) calls, are increasingly advancing NLP tasks, but they require crafting prompts that are jointly effective for all modules. We study prompt optimization for LM programs, i.e. how to update these prompts to maximize a downstream metric without access to module-level labels or gradients. To make this tractable, we factorize our problem into optimizing the free-form instructions and few-shot demonstrations of every module and introduce several strategies to craft task-grounded instructions and navigate credit assignment across modules. Our strategies include (i) program- and data-aware techniques for proposing effective instructions, (ii) a stochastic mini-batch evaluation function for learning a surrogate model of our objective, and (iii) a meta-optimization procedure in which we refine how LMs construct proposals over time. Using these insights we develop MIPRO, a novel algorithm for optimizing LM programs. MIPRO outperforms baseline optimizers on five of seven diverse multi-stage LM programs using a best-in-class open-source model (Llama-3-8B), by as much as 13% accuracy. We have released our new optimizers and benchmark in DSPy at http://dspy.ai.
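
To make the idea of stochastic mini-batch evaluation concrete, here is a minimal sketch that scores candidate instructions on random mini-batches and keeps the one with the best running mean. It is a simplified stand-in under assumptions: it does not implement MIPRO's surrogate model or use the DSPy API, and `minibatch_select`, `toy_program`, and the toy data are hypothetical.

```python
import random
from typing import Callable, Dict, List, Tuple

Example = Tuple[str, str]  # (input text, expected output)


def minibatch_select(
    candidates: List[str],
    run_program: Callable[[str, str], str],
    data: List[Example],
    rounds: int = 10,
    batch_size: int = 4,
    seed: int = 0,
) -> str:
    """Return the candidate instruction with the best mean mini-batch accuracy."""
    rng = random.Random(seed)
    totals: Dict[str, float] = {c: 0.0 for c in candidates}
    for _ in range(rounds):
        # Each round scores every candidate on the same small random batch,
        # which is far cheaper than evaluating on the full dataset.
        batch = rng.sample(data, batch_size)
        for cand in candidates:
            correct = sum(run_program(cand, x) == y for x, y in batch)
            totals[cand] += correct / batch_size
    return max(candidates, key=lambda c: totals[c] / rounds)


def toy_program(instruction: str, text: str) -> str:
    """Hypothetical stand-in for an LM call: obeys an 'uppercase' instruction."""
    return text.upper() if "uppercase" in instruction else text


data = [("a", "A"), ("b", "B"), ("c", "C"), ("d", "D"), ("e", "E"), ("f", "F")]
candidates = ["Repeat the input.", "Return the input in uppercase."]
best = minibatch_select(candidates, toy_program, data)
print(best)
```

In a real multi-stage program the noisy mini-batch scores would feed a model that decides which candidates to try next; the running mean here only illustrates the evaluation step.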

Natural Language Processing

Mind the (Language) Gap: Mapping the Challenges of LLM Development in Low-Resource Language Contexts

Juan N. Pava, Caroline Meinhardt, Haifa Badi Uz Zaman, Toni Friedman, Sang T. Truong, Daniel Zhang, Elena Cryst, Vukosi Marivate, Sanmi Koyejo

Deep Dive | Apr 22, 2025

White Paper

This white paper maps the LLM development landscape for low-resource languages, highlighting challenges, trade-offs, and strategies to increase investment; prioritize cross-disciplinary, community-driven development; and ensure fair data ownership.

International Affairs, International Security, International Development

Natural Language Processing

Ethics, Equity, Inclusion

Percy Liang

Associate Professor of Computer Science, Stanford University | Director, Stanford Center for Research on Foundation Models | Senior Fellow, Stanford HAI

Person

Foundation Models

Generative AI

Machine Learning

Natural Language Processing

Social Science Moves In Silico

Katharine Miller

Jul 25, 2025

News

Despite limitations, advances in AI offer social science researchers the ability to simulate human subjects.

Generative AI

Natural Language Processing

Sciences (Social, Health, Biological, Physical)

ReMix: Optimizing Data Mixtures for Large Scale Imitation Learning

Joey Hejna, Chethan Anand Bhateja, Yichen Jiang, Karl Pertsch, Dorsa Sadigh

Sep 05, 2024

Research

Increasingly large robotics datasets are being collected to train larger foundation models in robotics. However, despite the fact that data selection has been of utmost importance to scaling in vision and natural language processing (NLP), little work in robotics has questioned what data such models should actually be trained on. In this work we investigate how to weigh different subsets or "domains" of robotics datasets during pre-training to maximize worst-case performance across all possible downstream domains using distributionally robust optimization (DRO). Unlike in NLP, we find that these methods are hard to apply out of the box due to varying action spaces and dynamics across robots. Our method, ReMix, employs early stopping and action normalization and discretization to counteract these issues. Through extensive experimentation on both the Bridge and OpenX datasets, we demonstrate that data curation can have an outsized impact on downstream performance. Specifically, domain weights learned by ReMix outperform uniform weights by over 40% on average and human-selected weights by over 20% on datasets used to train the RT-X models.
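
As a rough illustration of how DRO-style methods shift weight toward harder domains, the sketch below applies one exponentiated-gradient update to a set of domain weights. It is a generic update of the kind used in group DRO, shown under assumptions: it is not ReMix's implementation, it omits the early stopping and action normalization described above, and the loss values and `step_size` are made up.

```python
import math
from typing import List


def update_domain_weights(
    weights: List[float], losses: List[float], step_size: float = 1.0
) -> List[float]:
    """One exponentiated-gradient step: domains with higher loss gain weight."""
    scaled = [w * math.exp(step_size * loss) for w, loss in zip(weights, losses)]
    total = sum(scaled)
    # Renormalize so the weights remain a probability distribution over domains.
    return [s / total for s in scaled]


weights = [1 / 3, 1 / 3, 1 / 3]  # start from a uniform mixture of three domains
losses = [0.2, 1.0, 0.5]         # hypothetical per-domain losses
new_weights = update_domain_weights(weights, losses)
print(new_weights)
```

Repeating this update during training concentrates the mixture on the domains where the model is doing worst, which is the intuition behind optimizing for worst-case performance.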

Computer Vision

Robotics

Natural Language Processing
