Stanford
University
  • Stanford Home
  • Maps & Directions
  • Search Stanford
  • Emergency Info
  • Terms of Use
  • Privacy
  • Copyright
  • Trademarks
  • Non-Discrimination
  • Accessibility
© Stanford University.  Stanford, California 94305.
Thomas Mullaney and Diyi Yang | How Can AI Support Language Digitization and Digital Inclusion? | Stanford HAI
Skip to content
  • About

    • About
    • People
    • Get Involved with HAI
    • Support HAI
    • Subscribe to Email
  • Research

    • Research
    • Fellowship Programs
    • Grants
    • Student Affinity Groups
    • Centers & Labs
    • Research Publications
    • Research Partners
  • Education

    • Education
    • Executive and Professional Education
    • Government and Policymakers
    • K-12
    • Stanford Students
  • Policy

    • Policy
    • Policy Publications
    • Policymaker Education
    • Student Opportunities
  • AI Index

    • AI Index
    • AI Index Report
    • Global Vibrancy Tool
    • People
  • News
  • Events
  • Industry
  • Centers & Labs
Navigate
  • About
  • Events
  • Careers
  • Search
Participate
  • Get Involved
  • Support HAI
  • Contact Us

Stay Up To Date

Get the latest news, advances in research, policy work, and education program updates from HAI in your inbox weekly.

Sign Up For Latest News

Your browser does not support the video tag.
eventSeminar

Thomas Mullaney and Diyi Yang | How Can AI Support Language Digitization and Digital Inclusion?

Status
Upcoming
Date
Wednesday, April 15, 2026 12:00 PM - 1:15 PM PST/PDT
Location
353 Jane Stanford Way, Stanford, CA, 94305 | Room 119
Topics
Ethics, Equity, Inclusion
International Affairs, International Security, International Development
Natural Language Processing
Attend Virtually

What does digital inclusion look like in the age of AI? Over 6,000 of the world’s 7,000-plus living languages remain digitally disadvantaged.

That might mean few websites exist in that language, for example, or keyboards don’t have the necessary characters. Language communities excluded from digital systems can only participate minimally in a world increasingly mediated by technology. They also can’t generate enough data needed to be represented in AI. And without access to AI, communities face further barriers to digital participation.

Empowering digitally disadvantaged language communities requires holistic progress on a range of foundational language tools (from script encoding to keyboard layouts) and supporting language tools (from grammar checkers to accessibility features). Progress on this language digitization work is often slow due to chronic underfunding and a lack of coordination. 

AI has the potential to scale and accelerate language digitization. In recent years, scholars have increasingly leveraged AI — and especially natural language processing tools — to sidestep major bottlenecks in the field, particularly when it comes to compiling, organizing, and reviewing digital records.

In this seminar, researchers from HAI and SILICON will present key findings from their recent white paper charting the varying ways AI tools and techniques can support language digitization work and digital inclusion efforts more broadly. They explain how AI alone can’t solve the field’s fundamental bottlenecks and highlight what is needed to advance language digitization and digital inclusion in the age of AI while centering community needs and contexts.

Speaker
Thomas S. Mullaney
Professor of History Professor of East Asian Languages and Cultures, by Courtesy
Diyi Yang
Assistant Professor, Computer Science Department, Stanford University
Share
Link copied to clipboard!
Event Contact
Stanford HAI
stanford-hai@stanford.edu
Related
  • How Can AI Support Language Digitization and Digital Inclusion?
    Juan Pava, Thomas S. Mullaney, Caroline Meinhardt, Audrey Gao, Diyi Yang
    Deep DiveFeb 26
    whitepaper

    This white paper analyzes the varying ways AI tools can advance language digitization work, and provides recommendations for responsibly realizing the potential of AI in supporting the digital inclusion of digitally disadvantaged languages.

Related Events

Dan Iancu & Antonio Skillicorn | Interpretable Machine Learning and Mixed Datasets for Predicting Child Labor in Ghana’s Cocoa Sector
SeminarMar 18, 202612:00 PM - 1:15 PM
March
18
2026

Child labor remains prevalent in Ghana’s cocoa sector and is associated with adverse educational and health outcomes for children.

Seminar

Dan Iancu & Antonio Skillicorn | Interpretable Machine Learning and Mixed Datasets for Predicting Child Labor in Ghana’s Cocoa Sector

Mar 18, 202612:00 PM - 1:15 PM

Child labor remains prevalent in Ghana’s cocoa sector and is associated with adverse educational and health outcomes for children.

Wolfgang Lehrach | Code World Models for General Game Playing
SeminarMay 13, 202612:00 PM - 1:15 PM
May
13
2026

While Large Language Models (LLMs) show promise in many domains, relying on them for direct policy generation in games often results in illegal moves and poor strategic play.

Seminar
Your browser does not support the video tag.

Wolfgang Lehrach | Code World Models for General Game Playing

May 13, 202612:00 PM - 1:15 PM

While Large Language Models (LLMs) show promise in many domains, relying on them for direct policy generation in games often results in illegal moves and poor strategic play.

Juan Sebastián Gómez-Cañón | Challenges And Opportunities For Human-Centered Music Emotion Recognition
SeminarJun 03, 202612:00 PM - 1:15 PM
June
03
2026

Music is intertwined with human emotion, memory, and identity, making it a powerful medium for affective experience and regulation.

Seminar
Your browser does not support the video tag.

Juan Sebastián Gómez-Cañón | Challenges And Opportunities For Human-Centered Music Emotion Recognition

Jun 03, 202612:00 PM - 1:15 PM

Music is intertwined with human emotion, memory, and identity, making it a powerful medium for affective experience and regulation.