Toward Fairness in Health Care Training Data

With recent advances in artificial intelligence (AI), researchers can now train sophisticated computer algorithms to interpret medical images, often with accuracy comparable to that of trained physicians. Yet our recent survey of medical research shows that these algorithms rely on datasets that lack population diversity and could therefore introduce bias into how a patient’s health condition is assessed.
Key Takeaways
Bias arises when we build algorithms using datasets that do not mirror the population. When generalized to broader swaths of the population, these nonrepresentative data can distort research findings.
In the studies we surveyed, the vast majority of the health data used to build AI algorithms came from only three states, with little or no representation from the remaining 47 states.
Policymakers, regulators, industry, and academia need to work together to ensure that medical AI data reflect America’s diversity, not only in geography but also in other important patient attributes. To that end, nationwide data sharing initiatives should be a top priority.