Machine Learning | Stanford HAI

Policy-Shaped Prediction: Avoiding Distractions in Model-Based Reinforcement Learning

Nicholas Haber, Miles Huston, Isaac Kauvar

Dec 13, 2024

Research

Model-based reinforcement learning (MBRL) is a promising route to sampleefficient policy optimization. However, a known vulnerability of reconstructionbased MBRL consists of scenarios in which detailed aspects of the world are highly predictable, but irrelevant to learning a good policy. Such scenarios can lead the model to exhaust its capacity on meaningless content, at the cost of neglecting important environment dynamics. While existing approaches attempt to solve this problem, we highlight its continuing impact on leading MBRL methods —including DreamerV3 and DreamerPro — with a novel environment where background distractions are intricate, predictable, and useless for planning future actions. To address this challenge we develop a method for focusing the capacity of the world model through synergy of a pretrained segmentation model, a task-aware reconstruction loss, and adversarial learning. Our method outperforms a variety of other approaches designed to reduce the impact of distractors, and is an advance towards robust model-based reinforcement learning.

Policy-Shaped Prediction: Avoiding Distractions in Model-Based Reinforcement Learning

Nicholas Haber, Miles Huston, Isaac Kauvar

Dec 13, 2024

Model-based reinforcement learning (MBRL) is a promising route to sampleefficient policy optimization. However, a known vulnerability of reconstructionbased MBRL consists of scenarios in which detailed aspects of the world are highly predictable, but irrelevant to learning a good policy. Such scenarios can lead the model to exhaust its capacity on meaningless content, at the cost of neglecting important environment dynamics. While existing approaches attempt to solve this problem, we highlight its continuing impact on leading MBRL methods —including DreamerV3 and DreamerPro — with a novel environment where background distractions are intricate, predictable, and useless for planning future actions. To address this challenge we develop a method for focusing the capacity of the world model through synergy of a pretrained segmentation model, a task-aware reconstruction loss, and adversarial learning. Our method outperforms a variety of other approaches designed to reduce the impact of distractors, and is an advance towards robust model-based reinforcement learning.

Machine Learning

Foundation Models

Research

Brief Definitions of Key Terms in AI

Stanford HAI

Apr 01, 2022

Explainer

This explainer provides brief definitions for key terms associated with artificial intelligence, ranging from autonomous systems to deep learning and foundation models.

Brief Definitions of Key Terms in AI

Stanford HAI

Apr 01, 2022

This explainer provides brief definitions for key terms associated with artificial intelligence, ranging from autonomous systems to deep learning and foundation models.

Machine Learning

Foundation Models

Explainer

Meg Cychosz

Assistant Professor of Linguistics

Person

Meg Cychosz

Assistant Professor of Linguistics

Ethics, Equity, Inclusion

Communications, Media

Human Reasoning

Machine Learning

Sciences (Social, Health, Biological, Physical)

Person

The Architects of AI Are TIME’s 2025 Person of the Year

TIME

Dec 11, 2025

Media Mention

HAI founding co-director Fei-Fei Li has been named one of TIME's 2025 Persons of the Year. From ImageNet to her advocacy for human-centered AI, Dr. Li has been a guiding light in the field.

The Architects of AI Are TIME’s 2025 Person of the Year

TIME

Dec 11, 2025

HAI founding co-director Fei-Fei Li has been named one of TIME's 2025 Persons of the Year. From ImageNet to her advocacy for human-centered AI, Dr. Li has been a guiding light in the field.

Machine Learning

Computer Vision

Media Mention

Deciphering the Feature Representation of Deep Neural Networks for High-Performance AI

Tauhidul Islam, Lei Xing

Aug 01, 2024

Research

AI driven by deep learning is transforming many aspects of science and technology. The enormous success of deep learning stems from its unique capability of extracting essential features from Big Data for decision-making. However, the feature extraction and hidden representations in deep neural networks (DNNs) remain inexplicable, primarily because of lack of technical tools to comprehend and interrogate the feature space data. The main hurdle here is that the feature data are often noisy in nature, complex in structure, and huge in size and dimensionality, making it intractable for existing techniques to analyze the data reliably. In this work, we develop a computational framework named contrastive feature analysis (CFA) to facilitate the exploration of the DNN feature space and improve the performance of AI. By utilizing the interaction relations among the features and incorporating a novel data-driven kernel formation strategy into the feature analysis pipeline, CFA mitigates the limitations of traditional approaches and provides an urgently needed solution for the analysis of feature space data. The technique allows feature data exploration in unsupervised, semi-supervised and supervised formats to address different needs of downstream applications. The potential of CFA and its applications for pruning of neural network architectures are demonstrated using several state-of-the-art networks and well-annotated datasets across different disciplines.

Deciphering the Feature Representation of Deep Neural Networks for High-Performance AI

Tauhidul Islam, Lei Xing

Aug 01, 2024

AI driven by deep learning is transforming many aspects of science and technology. The enormous success of deep learning stems from its unique capability of extracting essential features from Big Data for decision-making. However, the feature extraction and hidden representations in deep neural networks (DNNs) remain inexplicable, primarily because of lack of technical tools to comprehend and interrogate the feature space data. The main hurdle here is that the feature data are often noisy in nature, complex in structure, and huge in size and dimensionality, making it intractable for existing techniques to analyze the data reliably. In this work, we develop a computational framework named contrastive feature analysis (CFA) to facilitate the exploration of the DNN feature space and improve the performance of AI. By utilizing the interaction relations among the features and incorporating a novel data-driven kernel formation strategy into the feature analysis pipeline, CFA mitigates the limitations of traditional approaches and provides an urgently needed solution for the analysis of feature space data. The technique allows feature data exploration in unsupervised, semi-supervised and supervised formats to address different needs of downstream applications. The potential of CFA and its applications for pruning of neural network architectures are demonstrated using several state-of-the-art networks and well-annotated datasets across different disciplines.

Machine Learning

Research

A New Direction for Machine Learning in Criminal Law

Kristen Bell, Jenny Hong, Nick McKeown, Catalin Voss

Quick ReadDec 01, 2021

Policy Brief

This brief proposes a machine learning approach to studying decision-making in the criminal legal system as a way to identify and reduce systemic inequalities.

A New Direction for Machine Learning in Criminal Law

Kristen Bell, Jenny Hong, Nick McKeown, Catalin Voss

Quick ReadDec 01, 2021

This brief proposes a machine learning approach to studying decision-making in the criminal legal system as a way to identify and reduce systemic inequalities.

Law Enforcement and Justice

Machine Learning

Policy Brief

Navigate

Participate

Stay Up To Date

All Work Published on Machine Learning

Policy-Shaped Prediction: Avoiding Distractions in Model-Based Reinforcement Learning

Policy-Shaped Prediction: Avoiding Distractions in Model-Based Reinforcement Learning

Brief Definitions of Key Terms in AI

Brief Definitions of Key Terms in AI

Meg Cychosz

Meg Cychosz

The Architects of AI Are TIME’s 2025 Person of the Year

The Architects of AI Are TIME’s 2025 Person of the Year

Deciphering the Feature Representation of Deep Neural Networks for High-Performance AI

Deciphering the Feature Representation of Deep Neural Networks for High-Performance AI

A New Direction for Machine Learning in Criminal Law

A New Direction for Machine Learning in Criminal Law