Dan Iancu & Antonio Skillicorn | Interpretable Machine Learning and Mixed Datasets for Predicting Child Labor in Ghana’s Cocoa Sector | Stanford HAI
Event: Seminar

Dan Iancu & Antonio Skillicorn | Interpretable Machine Learning and Mixed Datasets for Predicting Child Labor in Ghana’s Cocoa Sector

Status: Upcoming
Date: Wednesday, March 18, 2026, 12:00 PM - 1:15 PM Pacific Time
Location: 353 Jane Stanford Way, Stanford, CA, 94305 | Room 119
Topics: Machine Learning; Workforce, Labor; Energy, Environment; Ethics, Equity, Inclusion
Attend Virtually

Child labor remains prevalent in Ghana’s cocoa sector and is associated with adverse educational and health outcomes for children.

This exploratory work examines how two surveys that measure child labor in Ghana, NORC and GLSS7, which differ in quality and scale, can be used jointly to produce less biased predictions and to identify key predictors of child labor risk. We further investigate whether district-level satellite indicators, including yield-weighted cocoa-driven deforestation, newly lit area, and newly urban area, improve predictive performance and play important roles in shaping model predictions. Using non-parametric machine learning models (XGBoost, Random Forest) tuned with cross-validation and a hyperparameter grid search, we find that the best-performing classifier of child laborers achieves an out-of-sample AUC of 0.95 and an F1 score of 0.84. Model interpretability tools (SHAP values, partial dependence plots) highlight influential predictors such as child age, cocoa-driven deforestation, school commute time, newly lit area, and household herbicide expenditures. Cocoa-driven deforestation emerges as the second most explanatory feature and also shows a clear nonlinear association with predicted child labor risk. Our approach demonstrates new ways of addressing data scarcity and bias in child labor measurement, while our findings provide actionable risk profiles to support monitoring efforts and underscore the complex interconnections between child labor and environmental practices.
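
For readers less familiar with the two metrics quoted in the abstract, the following is a minimal sketch of how an out-of-sample AUC and F1 score can be computed from held-out labels and predicted risk scores. It is not the speakers' code: the labels and scores are made-up toy values (not data or results from the study), the function names `roc_auc` and `f1_at_threshold` are hypothetical, and the 0.5 decision threshold is an assumption.

```python
from typing import Sequence


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the share of (positive, negative) pairs ranked correctly.

    A tie between a positive and a negative score counts as half a win.
    """
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def f1_at_threshold(y_true: Sequence[int], scores: Sequence[float],
                    threshold: float = 0.5) -> float:
    """F1 = harmonic mean of precision and recall at a fixed threshold."""
    y_pred = [1 if s >= threshold else 0 for s in scores]
    tp = sum(1 for y, p in zip(y_true, y_pred) if y == 1 and p == 1)
    fp = sum(1 for y, p in zip(y_true, y_pred) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(y_true, y_pred) if y == 1 and p == 0)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical held-out set: 1 = child classified as a child laborer.
y_test = [1, 1, 1, 0, 0, 0, 1, 0]
# Hypothetical predicted risk scores from some fitted classifier.
risk = [0.9, 0.8, 0.4, 0.3, 0.6, 0.1, 0.7, 0.2]

print(f"AUC = {roc_auc(y_test, risk):.4f}")
print(f"F1  = {f1_at_threshold(y_test, risk):.4f}")
```

On these toy values the sketch gives an AUC of 0.9375 and an F1 of 0.75. AUC depends only on how the scores rank positives against negatives, whereas F1 depends on the chosen threshold, which is one reason the two are usually reported together.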

Speakers
Dan Iancu
Associate Professor of Operations, Information, and Technology at the Graduate School of Business
Antonio Skillicorn
PhD Candidate in Civil Engineering, Stanford University
Event Contact
Stanford HAI
stanford-hai@stanford.edu
More from HAI and SDS seminars
  • Hari Subramonyam | Learning by Creating: A Human-Centered Vision for AI in Education
    Seminar, Mar 11, 2026, 12:00 PM - 1:15 PM

Related Events

Zoë Hitzig | How People Use ChatGPT
Mar 09, 2026, 12:00 PM - 1:00 PM

Despite the rapid adoption of LLM chatbots, little is known about how they are used. We approach this question theoretically and empirically, modeling a user who chooses whether to complete a task herself, ask the chatbot for information that reduces decision noise, or delegate execution to the chatbot...

Joel Becker | Reconciling Impressive AI Benchmark Performance with Limited Developer Productivity Impacts
Mar 16, 2026, 12:00 PM - 1:00 PM

AI coding agents now complete multi-hour coding benchmarks with roughly 50% reliability, yet a randomized trial found experienced open-source developers took about 19% longer when allowed frontier AI tools than when tools were disallowed...
