Dan Iancu & Antonio Skillicorn | Interpretable Machine Learning and Mixed Datasets for Predicting Child Labor in Ghana’s Cocoa Sector | Stanford HAI
Seminar


Status: Past
Date: Wednesday, March 18, 2026, 12:00 PM - 1:15 PM PST/PDT
Location: 353 Jane Stanford Way, Stanford, CA, 94305 | Room 119
Topics: Machine Learning; Workforce, Labor; Energy, Environment; Ethics, Equity, Inclusion
Overview

Child labor remains prevalent in Ghana’s cocoa sector and is associated with adverse educational and health outcomes for children.

This exploratory work examines how two surveys that measure child labor in Ghana, NORC and GLSS7, which differ in quality and scale, can be used jointly to reduce prediction bias and to identify key predictors of child labor risk. We further investigate whether district-level satellite indicators, including yield-weighted cocoa-driven deforestation, newly lit area, and newly urban area, enhance predictive performance and play important roles in shaping model predictions. Using non-parametric machine learning models (XGBoost, Random Forest) paired with cross-validation and a hyperparameter grid search, we find that the best-performing model for classifying child laborers achieves an out-of-sample AUC of 0.95 and an F1 score of 0.84. Model interpretability tools (SHAP values, partial dependence plots) highlight influential predictors such as child age, cocoa-driven deforestation, school commute time, newly lit area, and household herbicide expenditures. Cocoa-driven deforestation emerges as the second most explanatory feature and shows a clear nonlinear association with predicted child labor risk. Our approach demonstrates new ways of grappling with data scarcity and bias in child labor measurement, while our findings provide actionable risk profiles to support monitoring efforts and underscore the complex interconnections between child labor and environmental practices.
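For readers less familiar with the two metrics quoted above, the following is a minimal sketch of how an out-of-sample AUC and F1 score can be computed from held-out labels and model scores. It is not the speakers' code: the function names, the 0.5 decision threshold, and the eight toy labels and scores are all assumptions made up for illustration, and they do not come from the NORC or GLSS7 data.

```python
from typing import Sequence


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the share of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def f1_at_threshold(y_true: Sequence[int], scores: Sequence[float],
                    threshold: float = 0.5) -> float:
    """F1 = harmonic mean of precision and recall after thresholding the scores."""
    pred = [1 if s >= threshold else 0 for s in scores]
    tp = sum(1 for y, p in zip(y_true, pred) if y == 1 and p == 1)
    fp = sum(1 for y, p in zip(y_true, pred) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(y_true, pred) if y == 1 and p == 0)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical held-out set: 1 = child laborer, 0 = not (invented values).
y_holdout = [1, 1, 1, 0, 0, 0, 0, 1]
model_scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.1, 0.2, 0.7]

auc_value = roc_auc(y_holdout, model_scores)
f1_value = f1_at_threshold(y_holdout, model_scores)
print(f"AUC = {auc_value:.4f}")  # AUC = 0.9375
print(f"F1  = {f1_value:.4f}")   # F1  = 0.7500
```

AUC depends only on how the model ranks children by predicted risk, whereas F1 depends on the chosen threshold, which is one reason the two numbers in the abstract can differ noticeably for the same model.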

Speakers
Dan Iancu
Associate Professor of Operations, Information, and Technology at the Graduate School of Business
Antonio Skillicorn
PhD Candidate in Civil Engineering, Stanford University
Event Contact
Stanford HAI
stanford-hai@stanford.edu