AI Agents that “Self-Reflect” Perform Better in Changing Environments

AI Agents that “Self-Reflect” Perform Better in Changing Environments | Stanford HAI

Who would you pick to win in a head-to-head competition — a state-of-the-art AI agent or a mouse? Isaac Kauvar, a Wu Tsai Neurosciences Institute interdisciplinary postdoctoral scholar, and Chris Doyle, a machine learning researcher at Stanford, decided to pit them against each other to find out. Working in the lab of Nick Haber, an assistant professor in the Stanford Graduate School of Education, Kauvar and Doyle designed a simple task based on their longtime interest in a skill set that animals naturally excel at: exploring and adapting to their surroundings.

Kauvar put a mouse in a small empty box and similarly put a simulated AI agent in an empty 3D virtual arena. Then, he placed a red ball in both environments. Kauvar measured to see which would be the quicker to explore the new object.

The test showed that the mouse quickly approached the ball and repeatedly interacted with it over the next several minutes. But the AI agent didn’t seem to notice it. “That wasn’t expected,” said Kauvar. “Already, we were realizing that even with a state-of-the-art algorithm, there were gaps in performance.”

The scholars pondered: Could they use such seemingly simple animal behaviors as inspiration to improve AI systems?

That question catalyzed Kauvar, Doyle, graduate student Linqi Zhou, and Haber to design a new training method called curious replay, which programs AI agents to self-reflect about the most novel and interesting things they recently encountered. Adding curious replay was all that was needed for the AI agent to approach and engage with the red ball much faster. Plus, it dramatically improved performance on a game based on Minecraft, called Crafter. The results of this project, currently published on preprint service arXiv, will be presented at the International Conference on Machine Learning on July 25.

Learning Through Curiosity

It may seem like curiosity offers only intellectual benefits, but it’s crucial to our survival, both in avoiding dangerous situations and finding necessities like food and shelter. That red ball in the experiment could be leaking a deadly poison or covering a nourishing meal, and it would be difficult to find out which if we ignore it.

That’s why labs like Haber’s have recently been adding a curiosity signal to drive the behavior of AI agents and, in particular, model-based deep reinforcement learning agents. This signal tells them to select the action that will lead to a more interesting outcome, like opening a door rather than disregarding it.

Read the full study, Curious Replay for Model-based Adaptation

But this time, the team used curiosity for AI in a new way: to help the agent learn about its world, not just make a decision. “Instead of choosing what to do, we want to choose what to think about, more or less — what experiences from our past do we want to learn from.” said Kauvar. In other words, they wanted to encourage the AI agent to self-reflect, in a sense, about its most interesting or peculiar (and thus, curiosity-related) experiences. That way, the agent may be prompted to interact with the object in different ways to learn more, which would guide its understanding of the environment and perhaps encourage curiosity toward additional items, too.

To accomplish self-reflection in this way, the researchers amended a common method used to train AI agents, called experience replay. Here, an agent stores memories of all its interactions and then replays some of them at random to learn from them again. It was inspired by research on sleep: Neuroscientists have found that a brain region called the hippocampus will “replay” events of the day (by reactivating certain neurons) to strengthen memories. In AI agents, experience replay has led to high performance in scenarios where the environment rarely changes and clear rewards are given for the right behaviors.

But to be successful in a changing environment, the researchers reasoned that it would make more sense for AI agents to prioritize replaying primarily the most interesting experiences — like the appearance of a new red ball — rather than replaying the empty virtual room over and over.

They named their new method curious replay and found that it worked immediately. “Now, all of a sudden, the agent interacts with the ball much more quickly,” said Kauvar.

But they didn’t stop there. They also added curious replay to AI agents playing a game called Crafter, a standard test of creative problem solving for AI agents, where — much like in Minecraft — agents have to figure out how to survive and adapt by learning how to collect wood and stone, make a pickaxe, and collect iron to make additional tools. The curious replay method boosted the current state-of-the-art score from around 14 up to 19 (humans typically score around 50) — with “just this one change,” said Kauvar.

A Curious Future

The success of the curious replay method in both simple and complex tasks suggests that it will be important for a vast array of AI research moving forward. “The overall aim of this work — to make agents that can leverage prior experience and adapt well by efficiently exploring new or changing environments — will lead to much more adaptive, flexible technologies, from household robotics to personalized learning tools,” said Haber.

Kauvar, whose postdoctoral work is jointly mentored by Haber and neuroscientist Karl Deisseroth, the D.H Chen Professor in the departments of Bioengineering and Psychiatry, is excited to continue the theme of taking inspiration from animal behavior to improve AI systems — he plans to continue testing mice and AI agents on more complicated tasks to compare their behavior and abilities. “Lots of people give lip service to saying that they're inspired by animals, but here we are building a direct bridge — not a vague bridge. We are trying to do the exact same [tasks],” he said.

Kauvar hopes that work like this will help “close the loop” between AI research and neuroscience and benefit our understanding of animal behavior and the underlying neural processes, too. “You can imagine that this whole approach might yield hypotheses and new experiments that would never have been thought of before,” he said.

Stanford HAI’s mission is to advance AI research, education, policy and practice to improve the human condition. Learn more.

Navigate

Participate

Stay Up To Date

AI Agents that “Self-Reflect” Perform Better in Changing Environments

Related News

AI Challenges Core Assumptions in Education

AI Challenges Core Assumptions in Education

AI Sovereignty’s Definitional Dilemma

AI Sovereignty’s Definitional Dilemma

America's 250 Greatest Innovators: Celebrating The American Dream

America's 250 Greatest Innovators: Celebrating The American Dream

Learning Through Curiosity

A Curious Future