Stanford
University
  • Stanford Home
  • Maps & Directions
  • Search Stanford
  • Emergency Info
  • Terms of Use
  • Privacy
  • Copyright
  • Trademarks
  • Non-Discrimination
  • Accessibility
© Stanford University.  Stanford, California 94305.
BEHAVIOR Challenge Charts the Way Forward for Domestic Robotics | Stanford HAI

Stay Up To Date

Get the latest news, advances in research, policy work, and education program updates from HAI in your inbox weekly.

Sign Up For Latest News

Navigate
  • About
  • Events
  • Careers
  • Search
Participate
  • Get Involved
  • Support HAI
  • Contact Us
Skip to content
  • About

    • About
    • People
    • Get Involved with HAI
    • Support HAI
    • Subscribe to Email
  • Research

    • Research
    • Fellowship Programs
    • Grants
    • Student Affinity Groups
    • Centers & Labs
    • Research Publications
    • Research Partners
  • Education

    • Education
    • Executive and Professional Education
    • Government and Policymakers
    • K-12
    • Stanford Students
  • Policy

    • Policy
    • Policy Publications
    • Policymaker Education
    • Student Opportunities
  • AI Index

    • AI Index
    • AI Index Report
    • Global Vibrancy Tool
    • People
  • News
  • Events
  • Industry
  • Centers & Labs
news

BEHAVIOR Challenge Charts the Way Forward for Domestic Robotics

Date
September 22, 2025
Topics
Robotics
Machine Learning

In the BEHAVIOR Challenge, robots will compete against each other on 50 everyday domestic tasks, from making toast to tidying a room.

With a first-of-its-kind competition for roboticists everywhere, researchers at Stanford are hoping to push domestic robotics into a new age of autonomy and capability.

Anyone who has ever made a pizza by hand knows that it is not easy to make a pizza. Now, consider the difficulty in training a robot to do the same, from making the pizza dough to setting the oven temperature, to slicing the mushrooms. Such was the challenge for a team of robotics engineers at Stanford over the last several years to not just simulate the making of a single pizza step by step but also 999 other domestic challenges that robots will be expected to complete—autonomously—for their human owners, from turning on the radio to cleaning a bathroom.

“Actually, making pizza is one of the easier tasks in our list,” says Ruohan Zhang with a smile. Zhang is a postdoctoral researcher in the Stanford Vision and Learning Lab run by renowned computer vision experts, Fei-Fei Li and Jiajun Wu, both professors of computer science.

Zhang co-led BEHAVIOR, a yearslong effort to benchmark 1,000 everyday domestic tasks in 50 richly simulated environments—houses, gardens, restaurants, offices, and more—stocked with over 10,000 interactive objects modeled with realistic physics. Li, Wu, and their Stanford Vision and Learning Lab colleagues hope BEHAVIOR will become something of a technical “North Star” that others in the field can use to guide the training of their robots.

"In BEHAVIOR, we see a bold vision of human-centered AI that connects academic research to everyday human life,” says Professor Li, who is the founding director of the Stanford Institute for Human-Centered AI. “This is not just a benchmark—it's a call to imagine and create intelligent agents that can truly assist people in their daily lives. Meanwhile, it represents a step forward in our quest to build generalist AI systems that understand, reason, and act in the complex world of humans."

 “While any one of those tasks is already highly complex in its own right, the target is to create a single robot that can do all of these things,” Wu adds. “Creating these benchmarks now, before the field has evolved too far, will help to set up potential common goals for the community."

A Grand Challenge

Having completed the Herculean effort of simulating all 1,000 tasks, Zhang and colleagues announced the 2025 BEHAVIOR Challenge, a global competition for robotics engineers everywhere to demonstrate the capabilities of their robots to complete a hand-selected subset of 50 of BEHAVIOR’s tasks in one of the most comprehensive simulated households ever built, all before a worldwide audience of top academic researchers and robotics companies.

"By grounding robotics research in realistic human tasks, BEHAVIOR Challenge sets the stage for creating robots that people will actually want and need in their daily lives," says Yunfan Jiang, a doctoral student and BEHAVIOR core member.

Examples include making toast, tidying a room, or setting a table. Each task is defined by initial conditions, required objects, and a clear success criterion. Competitors are not judged on achieving individual tasks, but on a team’s success across all 50 tasks.

“BEHAVIOR stands out as a benchmark because it tackles real-world, long-horizon tasks that require mobile manipulation. I’m excited to see the community build on this foundation and push the boundaries of task success,” says team member Raven Huang, a postdoctoral researcher.

Submissions are welcome from academic labs, independent researchers, or industry teams. Entries are due November 15, 2025, with the winners to be announced at the NeurIPS 2025 Conference in December. The top prize is $1,000.

“This is about more than just a cash prize. The BEHAVIOR Challenge is really a chance to help shape the future of embodied AI,” Zhang says. “We actually don’t know if today’s robotic solutions—hardware or software—are ready to solve these real, everyday tasks. This competition is a chance to see where we stand as a field.”

"By open-sourcing everything—the simulator, demonstrations, and baselines—we're betting that collective progress will move faster than any single institution working in isolation,” says Hang Yin. "We're as excited as anyone to see what solutions emerge—may the best-behaved robot win!"

The Future of Robotics

The timing is not by chance. After great strides in recent years in perception and object manipulation, domestic robotics is at a crossroads where, despite some gains, it still struggles in complex real-world settings. For many roboticists, the competition is an opportunity to stress-test their algorithms in a fun environment with a mutually supportive community.

The BEHAVIOR Challenge is open to anyone worldwide. Teams can download the simulation environment, assets, and task definitions from the official challenge website. Full details are available on the BEHAVIOR Challenge website.

"It'll be encouraging to see robotics researchers strive to solve tasks that humans need. With such diverse scenes, challenging tasks, and high-fidelity simulation, it should indeed be a grand challenge,” says team member Sanjana Srivastava, who recently earned her PhD at Stanford.

Apply to the 2025 BEHAVIOR Challenge by November 15, 2025. Winners will be announced at the NeurIPS 2025 Conference in December.

In the BEHAVIOR Challenge, robots will compete against each other on 50 everyday domestic tasks, from making toast to tidying a room.

Share
Link copied to clipboard!
Contributor(s)
Andrew Myers

Related News

AI Can’t Do Physics Well – And That’s a Roadblock to Autonomy
Andrew Myers
Jan 26, 2026
News
breaking of pool balls on a pool table

QuantiPhy is a new benchmark and training framework that evaluates whether AI can numerically reason about physical properties in video images. QuantiPhy reveals that today’s models struggle with basic estimates of size, speed, and distance but offers a way forward.

News
breaking of pool balls on a pool table

AI Can’t Do Physics Well – And That’s a Roadblock to Autonomy

Andrew Myers
Computer VisionRoboticsSciences (Social, Health, Biological, Physical)Jan 26

QuantiPhy is a new benchmark and training framework that evaluates whether AI can numerically reason about physical properties in video images. QuantiPhy reveals that today’s models struggle with basic estimates of size, speed, and distance but offers a way forward.

AI Leaders Discuss How To Foster Responsible Innovation At TIME100 Roundtable In Davos
TIME
Jan 21, 2026
Media Mention

HAI Senior Fellow Yejin Choi discussed responsible AI model training at Davos, asking, “What if there could be an alternative form of intelligence that really learns … morals, human values from the get-go, as opposed to just training LLMs on the entirety of the internet, which actually includes the worst part of humanity, and then we then try to patch things up by doing ‘alignment’?” 

Media Mention
Your browser does not support the video tag.

AI Leaders Discuss How To Foster Responsible Innovation At TIME100 Roundtable In Davos

TIME
Ethics, Equity, InclusionGenerative AIMachine LearningNatural Language ProcessingJan 21

HAI Senior Fellow Yejin Choi discussed responsible AI model training at Davos, asking, “What if there could be an alternative form of intelligence that really learns … morals, human values from the get-go, as opposed to just training LLMs on the entirety of the internet, which actually includes the worst part of humanity, and then we then try to patch things up by doing ‘alignment’?” 

Stanford’s Yejin Choi & Axios’ Ina Fried
Axios
Jan 19, 2026
Media Mention

Axios chief technology correspondent Ina Fried speaks to HAI Senior Fellow Yejin Choi at Axios House in Davos during the World Economic Forum.

Media Mention
Your browser does not support the video tag.

Stanford’s Yejin Choi & Axios’ Ina Fried

Axios
Energy, EnvironmentMachine LearningGenerative AIEthics, Equity, InclusionJan 19

Axios chief technology correspondent Ina Fried speaks to HAI Senior Fellow Yejin Choi at Axios House in Davos during the World Economic Forum.