Seed Research Grants

Status: Closed
Date: Applications closed on September 15, 2025
Related
  • Stanford HAI Funds Groundbreaking AI Research Projects
    Nikki Goth Itoi
    Quick Read, Jan 30
    news

    Thirty-two interdisciplinary teams will receive $2.37 million in Seed Research Grants to work toward initial results on ambitious proposals.

  • Policy-Shaped Prediction: Avoiding Distractions in Model-Based Reinforcement Learning
    Nicholas Haber, Miles Huston, Isaac Kauvar
    Dec 13
    Research

    Model-based reinforcement learning (MBRL) is a promising route to sample-efficient policy optimization. However, a known vulnerability of reconstruction-based MBRL consists of scenarios in which detailed aspects of the world are highly predictable but irrelevant to learning a good policy. Such scenarios can lead the model to exhaust its capacity on meaningless content, at the cost of neglecting important environment dynamics. While existing approaches attempt to solve this problem, we highlight its continuing impact on leading MBRL methods, including DreamerV3 and DreamerPro, with a novel environment where background distractions are intricate, predictable, and useless for planning future actions. To address this challenge, we develop a method for focusing the capacity of the world model through the synergy of a pretrained segmentation model, a task-aware reconstruction loss, and adversarial learning. Our method outperforms a variety of other approaches designed to reduce the impact of distractors, and is an advance toward robust model-based reinforcement learning. (A minimal sketch of such a task-aware reconstruction loss follows this Related list.)

  • LABOR-LLM: Language-Based Occupational Representations with Large Language Models
    Susan Athey, Herman Brunborg, Tianyu Du, Ayush Kanodia, Keyon Vafa
    Dec 11
    Research

    Vafa et al. (2024) introduced a transformer-based econometric model, CAREER, that predicts a worker’s next job as a function of career history (an “occupation model”). CAREER was initially estimated (“pre-trained”) using a large, unrepresentative resume dataset, which served as a “foundation model,” and parameter estimation was continued (“fine-tuned”) using data from a representative survey. CAREER had better predictive performance than benchmarks. This paper considers an alternative in which the resume-based foundation model is replaced by a large language model (LLM). We convert tabular data from the survey into text files that resemble resumes and fine-tune the LLMs on these text files with the objective of predicting the next token (word). The resulting fine-tuned LLM is used as an input to an occupation model. Its predictive performance surpasses all prior models. We demonstrate the value of fine-tuning and further show that, by adding more career data from a different population, fine-tuning smaller LLMs surpasses the performance of fine-tuning larger models. (A sketch of the tabular-to-text step follows this Related list.)

  • How Persuasive Is AI-generated Propaganda?
    Josh A. Goldstein, Jason Chao, Shelby Grossman, Alex Stamos, Michael Tomz
    Feb 20
    Research

    Can large language models, a form of artificial intelligence (AI), generate persuasive propaganda? We conducted a preregistered survey experiment of US respondents to investigate the persuasiveness of news articles written by foreign propagandists compared to content generated by GPT-3 davinci (a large language model). We found that GPT-3 can create highly persuasive text as measured by participants’ agreement with propaganda theses. We further investigated whether a person fluent in English could improve propaganda persuasiveness. Editing the prompt fed to GPT-3 and/or curating GPT-3’s output made GPT-3 even more persuasive, and, under certain conditions, as persuasive as the original propaganda. Our findings suggest that propagandists could use AI to create convincing content with limited effort.

  • Sociotechnical Audits: Broadening the Algorithm Auditing Lens to Investigate Targeted Advertising
    Michelle Lam, Ayush Pandit, Colin H. Kalicki, Rachit Gupta, Poonam Sahoo, Danaë Metaxa
    Oct 04
    Research

    Algorithm audits are powerful tools for studying black-box systems without direct knowledge of their inner workings. While very effective in examining technical components, the method stops short of a sociotechnical frame, which would also consider users themselves as an integral and dynamic part of the system. Addressing this limitation, we propose the concept of sociotechnical auditing: auditing methods that evaluate algorithmic systems at the sociotechnical level, focusing on the interplay between algorithms and users as each impacts the other. Just as algorithm audits probe an algorithm with varied inputs and observe outputs, a sociotechnical audit (STA) additionally probes users, exposing them to different algorithmic behavior and measuring their resulting attitudes and behaviors. As an example of this method, we develop Intervenr, a platform for conducting browser-based, longitudinal sociotechnical audits with consenting, compensated participants. Intervenr investigates the algorithmic content users encounter online, and also coordinates systematic client-side interventions to understand how users change in response. As a case study, we deploy Intervenr in a two-week sociotechnical audit of online advertising (N = 244) to investigate the central premise that personalized ad targeting is more effective on users. In the first week, we observe and collect all browser ads delivered to users, and in the second, we deploy an ablation-style intervention that disrupts normal targeting by randomly pairing participants and swapping all their ads. We collect user-oriented metrics (self-reported ad interest and feeling of representation) and advertiser-oriented metrics (ad views, clicks, and recognition) throughout, along with a total of over 500,000 ads. Our STA finds that targeted ads indeed perform better with users, but also that users begin to acclimate to different ads in only a week, casting doubt on the primacy of personalized ad targeting given the impact of repeated exposure. In comparison with other evaluation methods that only study technical components, or only experiment on users, sociotechnical audits evaluate sociotechnical systems through the interplay of their technical and human components.

  • How Culture Shapes What People Want From AI
    Chunchen Xu, Xiao Ge, Daigo Misaki, Hazel Markus, Jeanne Tsai
    May 11
    Research

    There is an urgent need to incorporate the perspectives of culturally diverse groups into AI developments. We present a novel conceptual framework for research that aims to expand, reimagine, and reground mainstream visions of AI using independent and interdependent cultural models of the self and the environment. Two survey studies support this framework and provide preliminary evidence that people apply their cultural models when imagining their ideal AI. Compared with European American respondents, Chinese respondents viewed it as less important to control AI and more important to connect with AI, and were more likely to prefer AI with capacities to influence. Reflecting both cultural models, findings from African American respondents resembled both European American and Chinese respondents. We discuss study limitations and future directions and highlight the need to develop culturally responsive and relevant AI to serve a broader segment of the world population.

  • Minority-group incubators and majority-group reservoirs for promoting the diffusion of climate change and public health adaptations
    Matthew Adam Turner, Alyson L Singleton, Mallory J Harris, Cesar Augusto Lopez, Ian Harryman, Ronan Forde Arthur, Caroline Muraida, James Holland Jones
    Jan 01
    Research

    Current theory suggests that heterogeneous metapopulation structures can help foster the diffusion of innovations to address pressing issues, including climate change adaptation and public health promotion. In this paper, we develop an agent-based model of the spread of adaptations in simulated populations with minority-majority metapopulation structure, where subpopulations have different preferences for social interactions (i.e., homophily) and, consequently, learn preferentially from their own group. In our simulations, minority-majority-structured populations with moderate degrees of in-group preference better spread and maintained an adaptation compared to populations with more equal-sized groups and weak homophily. Minority groups act as incubators for novel adaptations, while majority groups act as reservoirs for the adaptation once it has spread widely. This suggests that population structure with in-group preference could promote the maintenance of novel adaptations.

  • Interaction of a Buoyant Plume with a Turbulent Canopy Mixing Layer
    Hayoon Chung, Jeffrey R Koseff
    Jun 23
    Research

    This study aims to understand the impact of instabilities and turbulence arising from canopy mixing layers on wind-driven wildfire spread. Using an experimental flume (water) setup with a model vegetation canopy and thermally buoyant plumes, we study the influence of canopy-induced shear and turbulence on the behavior of buoyant plume trajectories. By using the length of the canopy upstream of the plume source to vary the strength of the canopy turbulence, we observed the behavior of the plume trajectory under varying turbulence yet constant cross-flow conditions. Results indicate that increasing canopy turbulence corresponds to increased strength of vertical oscillatory motion and variability in the plume trajectory/position. Furthermore, we find that the canopy coherent structures characterized at the plume source set the intensity and frequency at which the plume oscillates. These perturbations then move longitudinally along the length of the plume at the speed of the free-stream velocity. However, the buoyancy developed by the plume can resist this impact of the canopy structures. Due to these competing effects, the oscillatory behavior of plumes in canopy systems is more pronounced in systems where the canopy turbulence is dominant. These effects also influence the mixing and entrainment of the plumes. We offer scaling analyses to identify flow regimes in which canopy-induced turbulence would be relevant to plume dynamics.

  • Stanford AI Scholars Find Support for Innovation in a Time of Uncertainty
    Nikki Goth Itoi
    Jul 01
    news

    Stanford HAI offers critical resources for faculty and students to continue groundbreaking research across the vast AI landscape.
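
For the model-based RL abstract above, the task-aware reconstruction loss can be pictured with a minimal sketch: down-weight reconstruction error on pixels that a pretrained segmentation model marks as task-irrelevant, so world-model capacity is not spent on predictable but useless background. This is an illustrative stand-in under assumed shapes and weighting, not the authors' implementation.

    import torch.nn.functional as F

    def task_aware_reconstruction_loss(pred, target, relevance_mask, bg_weight=0.05):
        """Per-pixel MSE, re-weighted so task-relevant pixels (mask near 1)
        dominate and predictable-but-irrelevant background (mask near 0)
        contributes little. pred/target: (B, C, H, W); relevance_mask:
        (B, 1, H, W), e.g. from a pretrained segmenter (assumption)."""
        per_pixel = F.mse_loss(pred, target, reduction="none")
        weight = bg_weight + (1.0 - bg_weight) * relevance_mask
        return (per_pixel * weight).mean()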
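
For the LABOR-LLM abstract, the pivotal data step is mechanical enough to sketch: render each survey respondent's job history as resume-like text so a causal language model can be fine-tuned with an ordinary next-token objective. The field names below are hypothetical, not the paper's actual schema.

    def career_record_to_text(record):
        """Render one survey respondent's history as resume-like text; a
        causal LM fine-tuned on such files learns to predict the next
        token, and hence the next occupation. Schema is illustrative."""
        lines = [f"Education: {record['education']}"]
        for year, occupation in record["jobs"]:
            lines.append(f"{year}: {occupation}")
        lines.append("Next job:")
        return "\n".join(lines)

    print(career_record_to_text({
        "education": "High school diploma",
        "jobs": [(2001, "Cashier"), (2005, "Retail supervisor")],
    }))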

Virtual Reality technology has shown great promise in visually immersing users in a variety of environments. However, these environments lack the rich auditory and haptic signals that humans expect during real-life interactions. Our long-term research goal is to create virtual experiences that provide rich multi-sensory feedback with applications in immersive journalism, storytelling, entertainment, education or e-commerce.  We will focus on the challenge of modelling how objects sound when users interact with them. While analytical models of sound exist, they assume precise knowledge of object material and shape, the environment, and user interaction. This makes it challenging to scale up an immersive virtual environment. Additionally, analytical models may be too computationally expensive for real-time feedback. These two challenges can be addressed through learning-based approaches. We propose to develop a generative audio model that is learned from a curated video dataset showing a person interacting with objects of various textures, materials, and physical properties. This model will generate realistic sounds for objects as users interact with them in a virtual environment. This approach promises to alleviate modelling challenges by leveraging data and generalising to new objects and user interactions that are not contained in the training dataset.

Name | Role | School | Department
Jeannette Bohg | PI | School of Engineering | Computer Science
Doug James | Co-PI | School of Engineering | Computer Science
Allison Okamura | Co-PI | School of Engineering | Mechanical Engineering

Our ability to interpret and explain algorithmic decisions is at the center of implementing AI safely and responsibly. In consumer lending, AI comes with the promise of expanding credit access to underserved populations. But machine decisions have to be transparent to detect and mitigate discrimination, assess model robustness across the business cycle, and provide legally mandated explanations to consumers applying for credit. To deploy algorithms in a sensitive area like consumer lending, we postulate that model explanations must diagnose the specific societal impact and economic consequences an AI model has. In this project, we first develop a theoretical framework that derives desired properties of explainability techniques given regulatory objectives in consumer lending. We then propose a step-by-step approach to empirically evaluate the ability of explainability techniques to assess model behavior with respect to those regulatory objectives. Third, we demonstrate the utility of our proposed framework by applying it in a study involving commercially available explainability tools that use a variety of techniques for supplying information about a model’s functioning. Through our partnership with FinRegLab, an independent, nonprofit research organization with extensive expertise in financial policy, regulation, and compliance, we ensure the inclusion of a diverse and interdisciplinary group of stakeholders in formulating research questions, incorporating current legal and regulatory conditions, conducting the evaluation, and disseminating our research findings. This effort is designed to inform and complement ongoing efforts to develop standards for the safe, fair, and responsible use of ML/AI.

Name | Role | School | Department
Laura Blattner | PI | Graduate School of Business
Jann Spiess | Co-PI | Graduate School of Business

As robotic hardware and intelligence improve, robots are becoming more effective teammates for humans. One collaborative task is cooperatively carrying a heavy object. When multiple humans carry a large or heavy object together, they reduce the risk of individual injury and leverage cues from other teammates and the environment to work cooperatively. This cooperation is usually characterized by efficiency: moving quickly toward a goal and not using more force than necessary. The goal of this work is to make a robot an effective teammate in this task, where at least one of the other teammates is human. Having a robotic teammate preserves the same advantage of mitigating personal injury. The challenge is teaching the robot how to leverage cues from the human teammate(s) and the environment in order to carry efficiently. To accomplish this, we plan to leverage behavioral cloning alongside emerging techniques in machine learning to teach the robot how to be an effective teammate in this cooperative transport task.
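
Behavioral cloning, the starting point named above, reduces to supervised regression from observations to demonstrated actions. A minimal sketch under assumed observation and action dimensions (not the project's actual model):

    import torch
    import torch.nn as nn

    # Toy policy: maps a 12-D force/pose observation to a 3-D commanded action.
    policy = nn.Sequential(nn.Linear(12, 64), nn.ReLU(), nn.Linear(64, 3))
    optimizer = torch.optim.Adam(policy.parameters(), lr=1e-3)

    def clone_step(obs, demo_actions):
        """One behavioral-cloning update: regress the policy's output onto
        the human teammate's demonstrated action."""
        loss = nn.functional.mse_loss(policy(obs), demo_actions)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        return loss.item()

    # Random stand-in demonstration batch, just to exercise the loop.
    print(clone_step(torch.randn(32, 12), torch.randn(32, 3)))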

Name | Role | School | Department
Monroe Kennedy | PI | School of Engineering | Mechanical Engineering
Dorsa Sadigh | Co-PI | School of Engineering | Computer Science

We are increasingly surrounded by autonomous mobile robots that augment our capabilities in a variety of applications including household cleaning, package delivery, and security monitoring. These robots operate on a variety of uncertain information, including noisy sensor data and probabilistic decision making processes. In their current state, they often fail and require users with limited training to intervene and troubleshoot the problem. To help users effectively troubleshoot robots, it is important to present relevant information and its uncertainty in an interpretable form without causing information overload. We study how humans reason about spatiotemporal data under uncertainty and subsequently strategize using the information available to them. We then leverage this understanding and explore the design of adaptive interfaces that can assist users in better troubleshooting robots.

Name | Role | School | Department
Sean Follmer | PI | School of Engineering | Mechanical Engineering

Significant lifetime benefits in human health are conferred by exposure to responsive parenting behaviors during early childhood. After adjusting for sociodemographic factors, such early exposure is associated with positive socioemotional, cognitive, and physical health outcomes through adulthood. As a result, much early child-development research requires carefully curated videotaped observations of parent-child interaction. These studies, however, typically employ human annotators to code video observations manually, and because the process is so time-consuming and labor-intensive, study findings have been limited by small and less diverse samples. In this project, we propose to develop an AI-based tool for automated coding of video capture of parent-child interaction, in order to advance measurement detail and scale in child-development research. Once we develop the AI-based tool, we will assess its validity by conducting a longitudinal study of patterns associated with responsive parent-child interactions, using the Study of Environmental Effects on Developing LINGuistic Skills (SEEDLingS) dataset, which captures video encounters of parent-child interactions over time.

Name | Role | School | Department
Serena Yeung | PI | School of Medicine | Biomedical Data Science
Lee Sanders | Co-PI | School of Medicine | Pediatrics

What do political micro-targeting, platform content moderation, and the use of stingray and facial recognition technologies by local police forces have in common? Each of these AI-enabled practices reveals the role of third parties in people’s ability to come together to take action. The Digital Civil Society Lab (DCSL) will develop a path-breaking, multidisciplinary, collaborative volume exploring how artificial intelligence impacts a cornerstone of democratic life: the people’s ability to assemble. The manuscript (tentatively titled Artificial Intelligence, Real Democracy: How AI changes people’s ability to assemble, and what it means for democracy) will be written through a workshop process designed to produce a volume organized around a few central questions, one that stands as an example of collaborative research production. The research team will also draw on our Digital Assembly Research Network (DARN), a global community of 350+ scholars, technologists, policymakers, and civil society actors dedicated to addressing the ways digital systems influence our ability and right to assemble. The volume and the DARN together serve to integrate policy, practice, and scholarship.

Name | Role | School | Department
Rob Reich | PI | School of Humanities and Sciences | Political Science
Lucy Bernholz | Co-Investigator | School of Humanities and Sciences | Center on Philanthropy and Civil Society
Toussaint Nothias | Co-Investigator | School of Humanities and Sciences | Center on Philanthropy and Civil Society

Each day, cable TV news networks determine what information millions of Americans receive. They also set the context and tone in which that information is presented. These decisions shape public opinion and culture. In this project, we aim to develop computational tools that provide greater transparency about these editorial choices. Specifically, we will use modern deep-learning-based image, text, and audio processing techniques on a decade of nearly 24/7 broadcasts from CNN, MSNBC, and Fox News to identify patterns and trends in content, bias, polarization, and editorial choices. We will also develop new systems for quickly validating the accuracy of these machine annotations.

We will work closely with media watchdogs, journalists and other researchers to perform in-depth, data-driven analyses of specific areas of bias in cable TV news coverage. These efforts will address questions like: How much screen time is given to specific topics, and how does this differ across shows or channels? What is the gender and race distribution of expert commentators asked to speak about key topics? Our goal is to discover where such biases appear and to show the potential of AI to aid large-scale quantitative analysis of the news.
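
Once broadcasts are machine-annotated, the screen-time questions above reduce to simple aggregations. A sketch over a hypothetical annotation schema (the real pipeline's labels and units may differ):

    from collections import defaultdict

    def screen_time(annotations):
        """Sum annotated segment durations per (channel, topic).
        `annotations` is a list of (channel, topic, seconds) tuples,
        a hypothetical schema for validated machine annotations."""
        totals = defaultdict(float)
        for channel, topic, seconds in annotations:
            totals[(channel, topic)] += seconds
        return dict(totals)

    print(screen_time([("CNN", "election", 120.0),
                       ("CNN", "election", 45.0),
                       ("Fox News", "economy", 90.0)]))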

Name | Role | School | Department
Kayvon Fatahalian | PI | School of Engineering | Computer Science
Maneesh Agrawala | Co-PI | School of Engineering | Computer Science
James Hamilton | Co-PI | School of Humanities and Sciences | Communication

This project focuses on creating transformative algorithms, solutions, and tools for human-AI augmented control of facilities. It will fulfill a critical need for various next-generation establishments, ranging from power grids to industrial-scale factories to nuclear plants. As these grow in complexity, their control and operation exceed human operators' abilities.

For such applications, human-AI augmentation is essential. Still, pure AI approaches create additional risks, with black-box AI solutions having uncertain validity and robustness. Our research paradigm focuses on using AI agents not to supplant but to supplement human operators. However, the introduction of a highly automated AI agent can cause "out-of-the-loop" syndrome, whereby operators become increasingly detached from the system and may lose their ability to act when the AI fails or becomes unavailable. Another critical weakness of AI prediction systems is their opacity and lack of interpretability. Consequently, our focus will be on designing an AI system that augments human operators but leaves them in charge through interpretability, transparency, and explainability.

Name | Role | School | Department
Eric Darve | PI | School of Engineering | Mechanical Engineering
Ram Rajagopal | Co-PI | School of Engineering | Civil and Environmental Engineering

Recent developments in AI, including the creation of OpenAI’s GPT-3 autoregressive language model, have improved language-generating capabilities in tasks such as translation, question answering, and content creation. Earlier generations of OpenAI’s technology, such as GPT-2, have already been used to write emails to colleagues and blog posts by venture capitalists, and similar technologies may be used by malicious actors for disinformation campaigns. The extent to which GPT-3 poses a threat to U.S. public discourse depends partly on persuasiveness and detectability. In this study, we will conduct multiple survey experiments comparing the persuasiveness and perceived credibility of GPT-3-created content, content created by known disinformation actors, and content created by people sharing their authentic beliefs. We plan to vary topics, content length, and content format in order to assess the conditions under which AI-created text is likely to be effective. We will also look at the effect of learning about the abilities of GPT-3 on general trust in media. If GPT-3 can produce content that is nearly as convincing as, or more convincing than, human-created content, GPT-3 may pose a significant risk to public discourse and the quality of the information environment.

Name | Role | School | Department
Christopher Manning | Co-PI | School of Engineering | Computer Science
Michael Tomz | Co-PI | School of Humanities and Sciences | Political Science

According to the U.S. government, the persistent spread of fake news and other types of misinformation is one of the main ongoing threats to societal cohesion and trust. This misinformation is often disseminated by computational “bots” that first learn about specific social media landscapes and then tailor fake news to be “emotionally and culturally resonant” with those landscapes. Thus, while there is a clear need for digital defense tools, their effectiveness hinges on a deep understanding of why certain affective content garners increased engagement or is “culturally resonant”. Unfortunately, we know relatively little about how affective and cultural factors shape the spread of misinformation on social media. Based on our previous research (Hsu et al., 2020), we propose a values-violation account of virality, in which we predict that the affective content most likely to spread is that which violates dominant cultural values regarding emotion. According to this account, high-arousal negative content spreads the most in the U.S. because it violates the American value placed on maximizing positive feelings and minimizing negative ones. To further test this account, we propose to: (1) build natural language processing tools to examine and compare the spread of affective content on social media, and (2) develop algorithms capable of supporting “affective filters” that can be deployed on social media platforms to flag or modify affective content, providing a culturally adaptable defense against affectively viral misinformation. We will also examine affective virality in specific subgroups within each country. These studies will not only advance AI research by integrating machine learning algorithms with human emotion and culture, but also advance our understanding of the kinds of affective propaganda users are most vulnerable to, so that organizations and individuals can defend against them.

Name | Role | School | Department
Jeanne Tsai | PI | School of Humanities and Sciences | Psychology
Michael Bernstein | Co-PI | School of Engineering | Computer Science
Johannes Eichstaedt | Co-PI | School of Humanities and Sciences | Psychology
Jeffrey Hancock | Co-PI | School of Humanities and Sciences | Communication
Brian Knutson | Co-PI | School of Humanities and Sciences | Psychology

Readmission of patients after discharge from the hospital is common and often costly. In the United States, one-fifth of discharged patients (nearly 2.6 million seniors) face an acute medical complication within 30 days that requires readmission or an emergency department (ED) visit. In this project, we propose strategies to reduce this readmission rate, especially among older adults. We introduce an intervention approach based on monitoring Activities of Daily Living (ADL) trajectories using computer vision (CV) technology.

Name | Role | School | Department
Fei-Fei Li | Co-PI | School of Engineering | Computer Science
Kevin Schulman | Co-PI | School of Medicine | Hospital Medicine
Ehsan Adeli | Co-PI | School of Medicine | Psych/Major Laboratories and Clinical & Translational Neurosciences Incubator

Over the last several years, researchers and practitioners have worked to design fair prediction models, used in criminal justice, health care, social services, and beyond. Less attention, though, has been paid to the inextricable link between predictions and decisions. In this project, we aim to develop methods for creating equitable prediction-decision systems that respect the often complex preferences and constraints that arise in many applied problems. We will develop our conceptual framework within the context of a high-stakes application: designing online advertising strategies to recruit participants for CalFresh, California's Supplemental Nutrition Assistance Program (SNAP), which helps people with lower incomes buy nutritious food. Through scientific advancements in fair machine learning, we have an opportunity to substantially improve food security.

Name | Role | School | Department
Sharad Goel | PI | School of Law
Susan Athey | Co-PI | Graduate School of Business

With the COVID-19 pandemic, the live performance industry has been shut down. Many audiences and performers find Zoom performances unsatisfying, due both to the constraints of web cameras and to slow network speeds. However, there is an unexplored area of technology that could be applied to improving streamable live performances, one that utilizes machine learning algorithms to generate “virtual performers” controlled by a live performer. We approach working with these algorithms from a different perspective: thinking about these systems of technology from a poetic and creative standpoint. Can we improve theatrical streamed performances using machine learning? What creative affordances appear when we apply artificial intelligence to the representations of characters within a live performance? Our project will undertake research both in a creative field, by exploring the aesthetics of a “digital performer,” and through a rigorous scientific analysis of the networking protocol and technology behind it.

Name | Role | School | Department
Michael Rau | PI | School of Humanities and Sciences | Theater and Performance Studies
Tsachy Weissman | Co-PI | School of Engineering | Electrical Engineering
Keith Winstein | Co-PI | School of Engineering | Computer Science

Wildfires are increasing in prevalence and severity, causing incredible human suffering and damage to property. In 2009-2018, 7 million acres burned in California, double the amount from 1979-1988. To combat wildfires in the Western US, ALERTWildfire has deployed 600 remotely operable pan-tilt-zoom cameras that have the potential to enable early detection of wildfires. Unfortunately, human monitoring is too expensive for these vast networks of cameras. We will design statistical algorithms and a runtime system to aid human operators in monitoring these camera networks for early wildfire detection. Designing such algorithms and systems must address key challenges: the need for low-latency, early detection, limited resources in the form of computational resources and human attention, and constantly changing environments in nature. We hope to allow scaling to tens of thousands of cameras, enabling early responses to potentially devastating wildfires.
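
The limited-attention framing suggests a triage loop: score every camera cheaply, then spend scarce operator attention only on the top candidates. A toy sketch; the change score here is a placeholder, not the project's detector:

    import heapq

    def frame_change_score(prev, curr):
        """Cheap per-camera anomaly score: mean absolute pixel change
        between consecutive grayscale frames (stand-in for a learned
        smoke detector). `prev` and `curr` are equal-length pixel lists."""
        return sum(abs(a - b) for a, b in zip(prev, curr)) / len(curr)

    def cameras_for_review(scores, attention_budget):
        """Route the operator to the `attention_budget` highest-scoring
        cameras; `scores` maps camera id to current detector score."""
        return heapq.nlargest(attention_budget, scores, key=scores.get)

    print(cameras_for_review({"cam-01": 0.2, "cam-02": 0.9, "cam-03": 0.4}, 2))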

Name | Role | School | Department
Tatsunori Hashimoto | PI | School of Engineering | Computer Science
Trevor Hebert | Co-PI | School of Humanities and Sciences | Jasper Ridge
Matei Zaharia | Co-PI | School of Engineering | Computer Science

The role of AI in providing us content online through social media feeds and search engine results is critically important—from consumer choices to political opinions, we know that these algorithms can change our beliefs and behaviors. But they often do so in unpredictable and invisible ways; studying AI-mediated content is a complex challenge, since most of these algorithms are proprietary and inaccessible to researchers and the public, and users might interact with dozens of different systems online every day. Our team will spend the coming year building a system to help users understand the role of AI in the online content they consume, and measure the impact this content has on them, specifically in the domain of news media and political information. To do this, we are building an in-browser tool that combines a web application and browser extension to measure the algorithmic content users receive, make insights about this content visible to them, and deploy interventions and surveys to understand how this affects them. This research will highlight the way AI systems influence us through our media environments, and the impact of making that influence more transparent, with the goal of promoting the development of AI-powered online systems that are interpretable and human-centered. 

Name | Role | School | Department
Jeffrey Hancock | PI | School of Humanities and Sciences | Communication

Access to quantitative, robust, yet affordable diagnostic tools is necessary to reduce the global infectious disease burden. Manual microscopy has served as a bedrock of diagnostics with wide adaptability, albeit at the cost of tedious labor and human error. Automated robotic microscopes are poised to enable a new era of smart field microscopy, but current platforms remain cost-prohibitive and largely inflexible, especially for resource-poor and field settings. Here we present Octopi, a low-cost and reconfigurable autonomous microscopy platform capable of automated slide scanning and correlated bright-field and fluorescence imaging. With roughly two orders of magnitude in cost reduction, we aim to develop an Octopi-based large robotic microscope network for improved disease diagnosis, while providing an avenue for collective development of modular instruments and machine learning algorithms for better disease diagnostics for the masses.

Name | Role | School | Department
Manu Prakash | PI | School of Engineering | Bioengineering

Deep neural networks have emerged as leading models for predicting neural data from a variety of brain areas and species. We will explore whether this modeling framework can be used to predict how neural representations of visual stimuli change over development. Like the human visual system, deep network models of the adult visual system are created from a combination of pre-specified “hardware” and an extensive period of exposure to visual images. We will thus approach the modeling in two ways: first, by building models that represent the initial conditions of the visual cortex prior to the onset of visual experience; and second, by using the training phase of models that differ in their architectures and training rules as models of human brain development. Critically, to test these models, we will acquire rich, high-temporal-resolution datasets from developing human infants using high-density EEG recordings.

Name | Role | School | Department
Dan Yamins | PI | School of Humanities and Sciences | Psychology
Anthony Norcia | Co-PI | School of Humanities and Sciences | Psychology
Kalanit Grill-Spector | Co-PI | School of Humanities and Sciences | Psychology

Illegal fishing is a pressing global problem that threatens food security, livelihoods, and ecosystems, and robs governments and communities of billions of dollars in income. Current approaches focus on detecting illegal fishing by commercial vessels in international waters. We propose to develop an AI-based algorithm to detect fishing vessels of all sizes, using computer vision to automatically analyze satellite imagery, focusing on nearshore areas and ports that are not well studied. By pairing these data with available GPS-based data from vessel tracking systems, we aim to paint a comprehensive picture of vessel activities in places where fisheries are central to local communities’ economies and well-being. With this research, we aim to address key gaps in currently available data and understanding, and to highlight priorities for future studies to improve fisheries transparency and eliminate illegal fishing.

Name | Role | School | Department
Fiorenza Micheli | PI | School of Humanities and Sciences | Hopkins Marine Station
Trevor Hastie | Co-PI | School of Humanities and Sciences | Statistics
Jim Leape | Co-PI | | Woods Institute for the Environment
Serena Yeung | Co-PI | School of Medicine | Biomedical Data Science

Physicians face an exponential influx of medical data. The lack of computerized assistance places an immense cognitive burden on physicians, who must manually assimilate, review, and interpret these data to make clinical decisions, leading to treatment variability and physician burnout. Artificial intelligence (AI) techniques show promise to alleviate these problems but suffer from two main limitations: (1) they require enormous quantities of labeled data for training, and curating large labeled datasets is time-consuming and expensive; and (2) training labels are often simplistic and task-oriented (e.g., normal vs. abnormal). Even state-of-the-art AI models trained on such labeled data often fail to generalize to unseen real-world patients. The goal of this project is to develop new methodology for transferring acquired human insights and knowledge to machine learning models in the context of healthcare delivery, stepping away from classic supervision using simplistic labeling or explicit rules, by simply observing skilled humans in action. We introduce a novel paradigm for developing more robust AI models that we call “observational supervision,” wherein we train AI models by observing how experienced physicians examine medical data while making decisions. Our approach not only requires minimal, if any, hand labeling of data, but the signals derived from physicians’ actions (e.g., what data they look at, the order in which they look, length of time, etc.) also reflect the physician’s underlying thought process, which can be injected into model training. Moreover, our work will be applicable to and benefit many other domains beyond medical care, such as augmented and virtual reality (AR/VR), sensor fusion, and environmental monitoring, among others.
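
One concrete way to turn physician gaze into a training signal, sketched under assumed inputs (fixation coordinates and durations; not the project's actual pipeline): accumulate fixations into a smoothed attention map that a model's saliency can be trained to match.

    import numpy as np
    from scipy.ndimage import gaussian_filter

    def gaze_attention_map(fixations, shape=(224, 224), sigma=8.0):
        """Accumulate (x, y, duration_seconds) gaze fixations into a
        smoothed, normalized attention map usable as an auxiliary
        training target. Inputs and shapes are illustrative."""
        heat = np.zeros(shape, dtype=float)
        for x, y, duration in fixations:
            heat[int(y), int(x)] += duration
        heat = gaussian_filter(heat, sigma=sigma)
        return heat / (heat.max() + 1e-8)

    attn = gaze_attention_map([(40, 60, 1.2), (120, 100, 0.4)])
    print(attn.shape, float(attn.max()))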

Name | Role | School | Department
Daniel Rubin | PI | School of Medicine | Biomedical Data Science
Jonathan Chen | Co-PI | School of Medicine | Biomedical Informatics
Chris Re | Co-PI | School of Engineering | Computer Science

Name | Role | School | Department
Sarah Billington | PI | School of Engineering | Civil and Environmental Engineering
James Landay | Co-PI | School of Engineering | Computer Science

Design sketching is both a fundamental skill in artistic expression and a powerful form of visual communication in its own right. However, outside of classroom environments, tools supporting design sketching cannot yet provide the feedback and critique necessary for learning. We propose a joint human-AI approach to providing student feedback on design sketching, thereby augmenting human capacities to learn drawing and design sketching. We will develop Korekta, a human+AI corrective feedback system that balances evaluation between AI, for the skills that can be classified algorithmically, and humans, for the skills that cannot. We will further demonstrate how Korekta's correction process generates training data that helps its AI improve itself as a side effect.

Name | Role | School | Department
Michael Bernstein | PI | School of Engineering | Computer Science
Camille Utterback | Co-PI | School of Humanities and Sciences | Art

In the wake of George Floyd’s death, calls for companies to fight racism and racial bias in their organizations and products have escalated. In the tech world, machine learning algorithms that reproduce, reinforce, and even amplify racial disparities and biases in society have been identified as key sites for intervention. Our team of social psychologists, computational linguists, and computer scientists will collaborate to develop a series of AI tools that will detect and root out bias rather than magnify it.  Working with Nextdoor, a neighborhood-focused social network, we plan to pair our bias detectors with behavioral nudges and interventions that encourage inclusive rather than exclusive behavior among users on the platform. With these detectors, we will be able to experimentally test and iterate multiple interventions across a large number of users, leveraging the strengths of AI and social science to support positive behavioral change and strengthen communities.

Name | Role | School | Department
Jennifer Eberhardt | PI | School of Humanities and Sciences | Psychology
Dan Jurafsky | Co-PI | School of Humanities and Sciences | Humanities
Hazel Markus | Co-PI | School of Humanities and Sciences | Behavioral Sciences

Name | Role | School | Department
Pablo Paredes Castro | PI | School of Medicine | Psych/Major Laboratories and Clinical & Translational Neurosciences Incubator
Mert Pilanci | Co-PI | School of Engineering | Electrical Engineering

One’s voice is arguably one of the most human aspects of a person. Unfortunately, patients with laryngeal cancer often undergo laryngectomies (or surgical removal of the voice organ) and consequently suffer permanent loss of their voice. Current methods for voice rehabilitation rely on modulation of artificially produced vibrations, which results in a voice that is unnatural, monotonous and difficult for an untrained listener to understand. We aim to develop a process to naturalize and individualize the post-laryngectomy voice. More specifically, our approach will be to apply machine learning methods to perform audio-to-audio translation of post-laryngectomy speech, utilizing the unique characteristics of the patient’s pre-laryngectomy voice. We anticipate that any meaningful voice naturalization will help patients regain their individuality, enhance communication, and improve overall quality of life.

Name | Role | School | Department
Fred Baik | PI | School of Medicine | Otolaryngology
Maneesh Agrawala | Co-PI | School of Engineering | Computer Science

Opening the Loop is the tentative name for a documentary film being directed and produced by Muhammad Khattak (Stanford) and Joe Khoury (California Institute of the Arts). The film seeks to critically interrogate how broader issues regarding power, culture and our ethical priorities influence current developments of artificial intelligence. Primarily focusing on AI’s regulatory uses, it aims to shift away from predominant attitudes that technology is merely a neutral tool and focuses on AI’s slow expansion into the public realm. This is part of a broader project to promote non-technical engagements in AI. In conjunction with the film, we will be collaborating with Stanford artists and educators to organize an AI art exhibit and high school outreach program about the societal implications of AI. The latter is meant to introduce ethical questions about AI to the realm of secondary education whereas the former serves to promote unique engagements in current technological changes. Together, these three projects focus on the particular experiences that are oftentimes marginalized in popular representations of AI and aim to endorse more open-ended, philosophical and artistic means of describing AI in current conversations.

Name | Role | School | Department
Russell Berman | PI | School of Humanities and Sciences | Comparative Literature
Ruth Starkman | Co-PI | School of Humanities and Sciences | Writing and Rhetoric

Rapidly developing technologies that use artificial intelligence can be an unprecedented force for good; however, they can also codify and amplify existing forms of racial inequality, discrimination, and bias. The Technology and Racial Equity Action Lab, based at the Center for Comparative Studies in Race and Ethnicity, combines research, teaching, and practice to advance racial justice in the analysis, production, and deployment of new technologies. The Lab will produce public-facing reports, policy briefs, and recommendations on critical issues at the intersection of race and technology. In addition, the Lab expands these impacts through its international Practitioner Fellows Program that brings together these external fellows with Stanford students and faculty.

Name | Role | School | Department
Jennifer DeVere Brody | PI | School of Humanities and Sciences | Theater and Performance Studies
Teresa LaFramboise | Co-PI | Graduate School of Education
Michele Elam | Co-PI | School of Humanities and Sciences | English

More than sixty years after Brown v. Board of Education, segregation by race and class is still a growing problem in public schools in the US. Many school districts adopted student assignment mechanisms in order to provide families more options, while also hoping to disentangle neighborhood segregation and school segregation. Although families express their preferences in such mechanisms, priorities and zones can be designed by the district to achieve societal objectives. Districts have incorporated ad hoc design choices to address these issues, such as assigning priority to students from neighborhoods with low incomes or a high percentage of minorities. However, there is little evidence that such policies in the context of choice have helped to address concerns about equity. In 2018, the San Francisco Unified School District passed a resolution to develop a new student assignment system for elementary schools, which seeks to improve diversity, transparency, and equal access to quality schools. This project will build tools from AI, optimization, and economics to design mechanisms, priorities, and zones toward achieving these goals.
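
A foundational building block for such mechanisms is student-proposing deferred acceptance (Gale-Shapley), sketched below in its textbook form; the preferences, priorities, and capacities are toy inputs, not SFUSD's:

    def deferred_acceptance(student_prefs, school_priority, capacity):
        """Student-proposing deferred acceptance. student_prefs: student ->
        ordered school list; school_priority: school -> ordered student
        list (earlier = higher priority); capacity: school -> seats."""
        rank = {s: {st: i for i, st in enumerate(order)}
                for s, order in school_priority.items()}
        next_choice = {st: 0 for st in student_prefs}
        held = {s: [] for s in school_priority}
        free = list(student_prefs)
        while free:
            st = free.pop()
            prefs = student_prefs[st]
            if next_choice[st] >= len(prefs):
                continue  # exhausted list; student stays unassigned
            school = prefs[next_choice[st]]
            next_choice[st] += 1
            held[school].append(st)
            held[school].sort(key=lambda x: rank[school][x])
            if len(held[school]) > capacity[school]:
                free.append(held[school].pop())  # reject lowest priority
        return held

    print(deferred_acceptance(
        {"ana": ["A", "B"], "bo": ["A", "B"], "che": ["B", "A"]},
        {"A": ["bo", "ana", "che"], "B": ["ana", "che", "bo"]},
        {"A": 1, "B": 2}))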

Name | Role | School | Department
Itai Ashlagi | PI | School of Engineering | Management Science and Engineering
Irene Lo | Co-PI | School of Engineering | Management Science and Engineering

Explanations are critical to how humans understand and learn about the world. As humans, we readily go beyond what happened to reason about why it happened and how things could have played out differently. While today’s AI systems can achieve super-human performance on many challenging tasks, their lack of generalizability, interpretability, and ability to interact with humans limits their potential, especially in high-stakes settings. One key aspect of human intelligence that is missing is the ability to understand and communicate about causality. Drawing inspiration from how humans think and communicate about causality, the Science and Engineering of Explanations (SEE) project pursues the twin goal of developing AI systems that generate and understand causal explanations the way humans do, and that help to improve human explanatory abilities.

Name | Role | School | Department
Tobias Gerstenberg | PI | School of Humanities and Sciences | Psychology
Hyowon Gweon | Co-PI | School of Humanities and Sciences | Psychology
Thomas Icard | Co-PI | School of Humanities and Sciences | Philosophy
Percy Liang | Co-PI | School of Engineering | Computer Science
Jiajun Wu | Co-PI | School of Engineering | Computer Science

Take a moment to attend to the varying degrees of control you have over your surroundings. Your limbs are decidedly part of you: you can control them directly, sensing them through high-bandwidth proprioception. Further along this "agency spectrum" are tool-like objects such as forks, which you can grab, control, and use as extensions of yourself, albeit without proprioceptive feedback. Other objects, like a table, can be manipulated, yet the act of doing so is less direct than with tools. Then there are other agents, which you can affect even less directly, such as through social interaction. Finally, there are aspects of the environment that affect you but cannot easily be affected by you, such as the weather, and yet other aspects which neither affect you nor can be affected by you, such as far-off buildings or background noise. 

We hypothesize that two major mysteries regarding agency are strongly connected. The first is computational: how do we design an artificial system with a complex, capable embodiment that learns how to efficiently leverage the full spectrum of agency available to it in a natural manner, that is, through exploration and without explicit supervision? The second is cognitive: humans represent an agentic self as an entity distinct from the world. What is the utility of this self: why and how do we cognitively track it in this distinguished way? We strive to answer these computational and cognitive questions while also elucidating the underpinnings of dissociation, a psychiatric disruption of the self implicated in prevalent stress- and trauma-related mental disorders.

Name | Role | School | Department
Nick Haber | PI | Graduate School of Education
Karl Deisseroth | Co-PI | School of Engineering | Bioengineering

Progress on the UN Sustainable Development Goals (SDGs) is hampered by a persistent lack of data regarding key social, environmental, and economic indicators, particularly in developing countries. While there have been numerous attempts to fill this gap through the combination of machine learning and satellite imagery, a key challenge in many sustainability applications is the lack of sufficient labeled training data. In particular, geospatial analysis lacks methods like the word-vector representations and pre-trained networks that significantly boost performance across a wide range of natural language and computer vision tasks. To fill this gap, we propose to extend recent advances in contrastive self-supervised learning to remote sensing data. On natural images, these techniques can learn representations in an unsupervised way (i.e., without human annotations) that perform close to their supervised counterparts. By extending these techniques to satellite imagery, we expect to be able to learn representations that significantly improve performance in downstream classification tasks such as poverty and food security estimation.
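
At the core of the contrastive techniques the team proposes to extend is the InfoNCE loss over paired augmented views. A minimal sketch (embedding sizes and temperature are placeholders, not the project's settings):

    import torch
    import torch.nn.functional as F

    def info_nce(z1, z2, temperature=0.1):
        """Contrastive InfoNCE loss for a batch of paired views, e.g. two
        augmentations of the same satellite tile. z1, z2: (B, D) embeddings;
        row i of z1 and row i of z2 are positives, other rows negatives."""
        z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
        logits = z1 @ z2.t() / temperature   # (B, B) cosine similarities
        labels = torch.arange(z1.size(0))    # positives on the diagonal
        return F.cross_entropy(logits, labels)

    print(info_nce(torch.randn(8, 128), torch.randn(8, 128)))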

Name | Role | School | Department
Stefano Ermon | PI | School of Engineering | Computer Science
David Lobell | Co-PI | School of Earth, Energy and Environmental Sciences | Earth System Science

Much of our visual world is highly regular: objects are often symmetric and have repetitive parts; scenes may consist of objects organized in a repetitive layout. The goal of this project is to build machines that infer and represent such regular structures from raw visual data and later leverage them for scene understanding and analysis. The key technical innovation will be methods that integrate program representations into modern deep networks.

Name | Role | School | Department
Jiajun Wu | PI | School of Engineering | Computer Science
Alex Aiken | Co-PI | School of Engineering | Computer Science

Stanford University's Human Trafficking Data Lab is a multi-stakeholder effort to develop scalable technologies that leverage the modern data economy to find human trafficking and support interventions for those who have been enslaved. Our Lab is assembling powerful anti-trafficking data resources to create a unique decision support system combining near real-time machine learning insights with AI technology capable of proactively identifying forced labor and trafficking. This project will further enable us to crosslink satellite imagery with cases of illegal labor camps, producing open-source tools and a newly digitized inspection report cache, in order to disrupt systems of exploitation.

Name | Role | School | Department
Victoria Ward | PI | School of Medicine | Pediatrics
Mike Baiocchi | Co-PI | School of Medicine | Epidemiology and Population Health
Grant Miller | Co-PI | School of Medicine | Health Policy and Primary Care Outcomes

Sea level rise and its associated effects will be among the most immediate and dramatic impacts of climate change in this century. Small changes in mean sea level increase the rate of high sea level events that can have devastating impacts on coastal communities. Despite its importance, sea level rise projections have significant error margins, in large part due to high uncertainty in the response of Greenland and Antarctica to a changing climate. This is a result of sparse measurements, high measurement uncertainty, and incomplete understandings of the governing physics of ice sheets. Advances in autonomous aircraft promise to enable continuous data collection campaigns over ice sheets previously considered logistically impossible, but even a huge volume of data will not resolve current uncertainties if it is collected without regard for the uncertainty in the governing physics. Building on techniques from scientific machine learning and Bayesian inference, we are investigating an uncertainty-aware approach to selecting measurement locations to maximize the value of autonomous data collection aircraft in improving future sea level rise estimates.
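
A simple instance of the uncertainty-aware selection described above, sketched with assumed inputs: fit a Gaussian process surrogate to the measurements gathered so far, then greedily send the aircraft where posterior uncertainty is largest. The 1-D coordinates are illustrative, not real radar data.

    import numpy as np
    from sklearn.gaussian_process import GaussianProcessRegressor

    def next_flight_target(observed_x, observed_y, candidate_x):
        """Greedy uncertainty sampling: pick the candidate location where
        the GP posterior standard deviation is largest."""
        gp = GaussianProcessRegressor().fit(observed_x, observed_y)
        _, std = gp.predict(candidate_x, return_std=True)
        return candidate_x[int(np.argmax(std))]

    x_obs = np.array([[0.0], [1.0], [4.0]])
    y_obs = np.array([2.0, 2.1, 3.5])
    candidates = np.array([[2.0], [3.0], [5.0]])
    print(next_flight_target(x_obs, y_obs, candidates))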

Name | Role | School | Department
Dustin Schroeder | PI | School of Earth, Energy and Environmental Sciences | Geophysics
Mykel Kochenderfer | Co-PI | School of Engineering | Aeronautics and Astronautics

During the COVID-19 pandemic, health care resources are stretched to the limit. Lack of testing capabilities and proliferation of “worried well” patients have put pressure on clinicians to prioritize needs at healthcare facilities. An efficient and effective way to triage patients has become increasingly crucial.

We aim to develop and validate a novel video-based AI algorithm to automatically triage patients with COVID-19 concerns based on a combination of their clinical appearances and their reported symptoms during a telehealth video encounter. The novel algorithm can help determine which patients need emergency assistance, which patients need to come to the clinic for further evaluation, and which patients are safe to stay home.

If successful, the novel algorithm can enable large-scale triage of patients with COVID-19 concerns without using clinicians’ time, thus using digital technologies to boost the health system's capacity. In addition, this video-based self-triage tool could potentially reduce healthcare disparity in underserved populations given their poor access to in-person providers but relatively high rate of mobile phone and internet usage. Finally, the video-based algorithm can be broadly applied to other respiratory outbreaks and diseases beyond COVID-19.

Name | Role | School | Department
Rusty Hofmann | PI | School of Medicine | Interventional Radiology
Nigam Shah | Co-PI | School of Medicine | Biomedical Informatics
Serena Yeung | Co-PI | School of Medicine | Biomedical Data Science
James Quinn | Co-PI | School of Medicine | Emergency Medicine
Wui Ip | | Stanford Health Care | Pediatric Hospital Medicine
Elyse Ruan | | Stanford Health Care | Digital Health Strategy