Doctors Receptive to AI Collaboration in Simulated Clinical Case without Introducing Bias | Stanford HAI
Stanford
University
  • Stanford Home
  • Maps & Directions
  • Search Stanford
  • Emergency Info
  • Terms of Use
  • Privacy
  • Copyright
  • Trademarks
  • Non-Discrimination
  • Accessibility
© Stanford University.  Stanford, California 94305.
Skip to content
  • About

    • About
    • People
    • Get Involved with HAI
    • Support HAI
    • Subscribe to Email
  • Research

    • Research
    • Fellowship Programs
    • Grants
    • Student Affinity Groups
    • Centers & Labs
    • Research Publications
    • Research Partners
  • Education

    • Education
    • Executive and Professional Education
    • Government and Policymakers
    • K-12
    • Stanford Students
  • Policy

    • Policy
    • Policy Publications
    • Policymaker Education
    • Student Opportunities
  • AI Index

    • AI Index
    • AI Index Report
    • Global Vibrancy Tool
    • People
  • News
  • Events
  • Industry
  • Centers & Labs
Navigate
  • About
  • Events
  • AI Glossary
  • Careers
  • Search
Participate
  • Get Involved
  • Support HAI
  • Contact Us

Stay Up To Date

Get the latest news, advances in research, policy work, and education program updates from HAI in your inbox weekly.

Sign Up For Latest News

news

Doctors Receptive to AI Collaboration in Simulated Clinical Case without Introducing Bias

Date
February 20, 2024
Topics
Healthcare

Doctors worked with a prototype AI assistant and adapted their diagnoses based on AI’s input, which led to better clinical decisions.

While many health care practitioners believe generative language models like ChatGPT will one day be commonplace in medical evaluations, it’s unclear how these tools will fit into the clinical environment. A new study points the way to a future where human physicians and generative AI collaborate to improve patient outcomes.

In a mock medical environment with role-playing patients reporting chest pains, doctors accepted the advice of a prototype ChatGPT-like medical agent and even willingly adapted their diagnoses based on the AI’s advice. The upshot was better outcomes for the patients. 

In the trial, 50 licensed doctors reviewed videos of white male and Black female actors describing their chest pains symptoms and electrocardiograms to make triage-, risk-, and treatment-based assessments of the patients. In the study’s next step, the doctors were then presented with ChatGPT-based recommendations derived from the same conversations and asked to reevaluate their own assessments.

Unconventional Wisdom

The study found that the doctors were not just receptive to AI advice but willing to reconsider their own analyses based on that advice. Most important, this willingness led to a “significant improvement” in accuracy of the doctors’ clinical decisions. Notable, as well, is that the racial and gender makeup of the patient pool was not happenstance but carefully structured into the study to ensure that AI did not introduce or intensify existing racial or gender biases — which the study found it did not.

The study’s findings go against the conventional wisdom that doctors may be resistant, or even antagonistic, to the introduction of AI in their workflows.

“This study shows that doctors who work with AI do so collaboratively. It’s not at all adversarial,” said Ethan Goh, a health care AI researcher in Stanford’s Clinical Excellence Research Center (CERC) and the first author of the study. “And, when the AI tool is good, the collaboration produces better outcomes.”

The study was published in preprint by medRxiv and has been formally accepted by a peer-reviewed conference, AMIA Informatics Summit in Boston this March.

Milestone Moment

Goh is quick to point out that the AI tools used in the study are only a prototype and not yet ready or approved for clinical application. The results are nonetheless encouraging about the prospects for future collaborations between doctors and AI, he said.

“The overall point is when we do have those tools, someday, they could prove useful in augmenting the doctors and improving outcomes. And, far from being resistant to such tools, physicians seem willing, even welcoming, of such advances,” Goh said. In a survey following the trial, a majority of the doctors confirmed that they fully anticipate large language model-based (LLM) tools to play a significant role in clinical decision-making.

As such, the authors write that this particular study is “a critical milestone” in the progress of LLMs in medicine. With this study, medicine moves beyond evaluating whether generative LLMs belong in the clinical environment to exactly how they will fit in that environment and how they will support human physicians in their work, not replace them, Goh said. 

“It’s no longer a question of whether LLMs will replace doctors in the clinic — they won’t — but how humans and machines will work together to make medicine better for everyone,” Goh said.

Stanford HAI’s mission is to advance AI research, education, policy and practice to improve the human condition. Learn more. 

Share
Link copied to clipboard!
Contributor(s)
Andrew Myers

Related News

Collaborative Coding, Better Scaling, Health Tracking: HAI Awards $2.17M to Innovative Research
Nikki Goth Itoi
Apr 29, 2026
Announcement
Your browser does not support the video tag.

Seed grants will fund 29 research teams pursuing novel research ideas across disciplines.

Announcement
Your browser does not support the video tag.

Collaborative Coding, Better Scaling, Health Tracking: HAI Awards $2.17M to Innovative Research

Nikki Goth Itoi
HealthcareSciences (Social, Health, Biological, Physical)Apr 29

Seed grants will fund 29 research teams pursuing novel research ideas across disciplines.

An AI Health Coach Could Change Your Mindset
Katharine Miller
Apr 23, 2026
News
A runner with a smartphone laces her shoes

Bloom, a health coaching app created by Stanford researchers, helps people tap into their own motivations.

News
A runner with a smartphone laces her shoes

An AI Health Coach Could Change Your Mindset

Katharine Miller
HealthcareGenerative AIApr 23

Bloom, a health coaching app created by Stanford researchers, helps people tap into their own motivations.

Using LLMs To Improve Workplace Social Skills
Katharine Miller
Apr 20, 2026
News
A woman takes notes while working on a tablet

Practicing specific social skills with AI chatbots helps users build confidence and competence.

News
A woman takes notes while working on a tablet

Using LLMs To Improve Workplace Social Skills

Katharine Miller
Education, SkillsGenerative AIHealthcareApr 20

Practicing specific social skills with AI chatbots helps users build confidence and competence.