Computer Vision

Finding Monosemantic Subspaces and Human-Compatible Interpretations in Vision Transformers through Sparse Coding

Romeo Valentin, Vikas Sindhwan, Summeet Singh, Vincent Vanhoucke, Mykel Kochenderfer

Jan 01, 2025

Research

We present a new method of deconstructing class activation tokens of vision transformers into a new, overcomplete basis, where each basis vector is “monosemantic” and affiliated with a single, human-compatible conceptual description. We achieve this through the use of a highly optimized and customized version of the K-SVD algorithm, which we call Double-Batch K-SVD (DBK-SVD). We demonstrate the efficacy of our approach on the sbucaptions dataset, using CLIP embeddings and comparing our results to a Sparse Autoencoder (SAE) baseline. Our method significantly outperforms SAE in terms of reconstruction loss, recovering approximately 2/3 of the original signal compared to 1/6 for SAE. We introduce novel metrics for evaluating explanation faithfulness and specificity, showing that DBK-SVD produces more diverse and specific concept descriptions. We therefore show empirically for the first time that disentangling of concepts arising in Vision Transformers is possible, a statement that has previously been questioned when applying an additional sparsity constraint. Our research opens new avenues for model interpretability, failure mitigation, and downstream task domain transfer in vision transformer models. An interactive demo showcasing our results can be found at https://disentangling-sbucaptions.xyz, and we make our DBK-SVD implementation openly available at https://github.com/RomeoV/KSVD.jl.

Computer Vision

Using AI to Understand Residential Solar Power

Zhecheng Wang, Marie-Louise Arlt, Chad Zanocco, Arun Majumdar, Ram Rajagopal

Quick Read

Sep 28, 2023

Policy Brief

This brief introduces a computer-vision approach to analyzing solar panel adoption in U.S. households that can help policymakers tailor incentive mechanisms.

Energy, Environment

Computer Vision

How a HAI Seed Grant Helped Launch a Disease-Fighting AI Platform

Dylan Walsh

Mar 03, 2026

News

Stanford scientists in Senegal hunting for schistosomiasis—a parasitic disease infecting 200+ million people worldwide—used AI to transform local field work into satellite-powered disease mapping.

Computer Vision

Healthcare

Sciences (Social, Health, Biological, Physical)

Machine Learning

ReMix: Optimizing Data Mixtures for Large Scale Imitation Learning

Joey Hejna, Chethan Anand Bhateja, Yichen Jiang, Karl Pertsch, Dorsa Sadigh

Sep 05, 2024

Research

Increasingly large robotics datasets are being collected to train larger foundation models in robotics. However, despite the fact that data selection has been of utmost importance to scaling in vision and natural language processing (NLP), little work in robotics has questioned what data such models should actually be trained on. In this work we investigate how to weigh different subsets or "domains" of robotics datasets during pre-training to maximize worst-case performance across all possible downstream domains using distributionally robust optimization (DRO). Unlike in NLP, we find that these methods are hard to apply out of the box due to varying action spaces and dynamics across robots. Our method, ReMix, employs early stopping and action normalization and discretization to counteract these issues. Through extensive experimentation on both the Bridge and OpenX datasets, we demonstrate that data curation can have an outsized impact on downstream performance. Specifically, domain weights learned by ReMix outperform uniform weights by over 40% on average and human-selected weights by over 20% on datasets used to train the RT-X models.

Computer Vision

Robotics

Natural Language Processing

Evaluating Facial Recognition Technology: A Protocol for Performance Assessment in New Domains

Daniel E. Ho, Emily Black, Maneesh Agrawala, Fei-Fei Li

Deep Dive

Nov 01, 2020

White Paper

This white paper provides research- and scientifically-grounded recommendations for how to give context to calls for testing the operational accuracy of facial recognition technology.

From Privacy to ‘Glass Box’ AI, Stanford Students Are Targeting Real-World Problems

Nikki Goth Itoi

Feb 27, 2026

An Amazon-backed fellowship will support 10 Stanford PhD students whose work explores everything from how we communicate to understanding disease and protecting our data.

Generative AI

Healthcare

Privacy, Safety, Security

Computer Vision

Sciences (Social, Health, Biological, Physical)

News

America's 250 Greatest Innovators: Celebrating The American Dream

Forbes

Feb 11, 2026

Media Mention

HAI Co-Director Fei-Fei Li named one of America's top 250 greatest innovators, alongside fellow Stanford affiliates Rodney Brooks, Carolyn Bertozzi, Daphne Koller, and Andrew Ng.

Ethics, Equity, Inclusion

AI Can’t Do Physics Well – And That’s a Roadblock to Autonomy

Andrew Myers

Jan 26, 2026

News

QuantiPhy is a new benchmark and training framework that evaluates whether AI can numerically reason about physical properties in video images. QuantiPhy reveals that today’s models struggle with basic estimates of size, speed, and distance but offers a way forward.

Computer Vision

Robotics

Sciences (Social, Health, Biological, Physical)

Spatial Intelligence Is AI’s Next Frontier

TIME

Dec 11, 2025

Media Mention

"This is AI’s next frontier, and why 2025 was such a pivotal year," writes HAI Co-Director Fei-Fei Li.

Computer Vision

Machine Learning

Generative AI
