Stanford
University
  • Stanford Home
  • Maps & Directions
  • Search Stanford
  • Emergency Info
  • Terms of Use
  • Privacy
  • Copyright
  • Trademarks
  • Non-Discrimination
  • Accessibility
© Stanford University.  Stanford, California 94305.
Wikimedia | Wikipedia in the Age of AI and Bots | Stanford HAI
Skip to content
  • About

    • About
    • People
    • Get Involved with HAI
    • Support HAI
    • Subscribe to Email
  • Research

    • Research
    • Fellowship Programs
    • Grants
    • Student Affinity Groups
    • Centers & Labs
    • Research Publications
    • Research Partners
  • Education

    • Education
    • Executive and Professional Education
    • Government and Policymakers
    • K-12
    • Stanford Students
  • Policy

    • Policy
    • Policy Publications
    • Policymaker Education
    • Student Opportunities
  • AI Index

    • AI Index
    • AI Index Report
    • Global Vibrancy Tool
    • People
  • News
  • Events
  • Industry
  • Centers & Labs
Navigate
  • About
  • Events
  • Careers
  • Search
Participate
  • Get Involved
  • Support HAI
  • Contact Us

Stay Up To Date

Get the latest news, advances in research, policy work, and education program updates from HAI in your inbox weekly.

Sign Up For Latest News

Your browser does not support the video tag.
eventSeminar

Wikimedia | Wikipedia in the Age of AI and Bots

Status
Past
Date
Wednesday, February 04, 2026 12:00 PM - 1:15 PM PST/PDT
Location
353 Jane Stanford Way, Stanford, CA, 94305 | Room 119
Topics
Natural Language Processing
Regulation, Policy, Governance
Overview
Watch Event Recording

The rapid growth of large language models has created new challenges for the Wikimedia Foundation in managing changing traffic patterns while upholding its free knowledge mission and fostering human engagement.

Wikipedia has been a source of text data for natural language processing and machine learning for almost 25 years. We’d like to cover how data science and computer scientists can help foster the health of what has been agreed on as an essential public good in the age of AI by giving some context that explains how the datasets are created. Historically, access to the site via scraping and bots has caused multiple issues for Wikimedia Foundation infrastructure, but recent changes in traffic behavior and volume due to growth of large language models (LLMs) have caused an increase in incidents. Managing this expansion has created unique challenges for the organization considering Wikimedia’s free knowledge mission, and the need to continue to foster human traffic growth. Maintaining the sustainability of the platform and prioritizing human and mission-oriented access first has required nuanced approaches to identifying and responding to observed trends.

This talk will cover the basic editorial processes within Wikipedia driven by the large community of volunteers, the emergence of new AI-specific tooling and datasets from Wikimedia, and the best practices for engaging with Wikimedia content to support open data growth. Examples of automated traffic observed on Wikimedia projects will be discussed, highlighting traffic trends, bot behavior, and resource impacts. We will showcase current risk strategies aimed at reducing server load and mitigating potential abuse without impacting general service availability. There will be time for Q&A at the end of the presentation to discuss policy development and how to get involved with the broader Wikimedia movement. 

Speakers
Chris Petrillo
Head of Product: Wikimedia Enterprise, Commercial Partnerships @ the Wikimedia Foundation
Overview
Watch Event Recording
Share
Link copied to clipboard!
Event Contact
Stanford HAI
stanford-hai@stanford.edu
More from HAI and SDS seminars
  • Hari Subramonyam | Learning by Creating: A Human-Centered Vision for AI in Education
    SeminarMar 11, 202612:00 PM - 1:15 PM
    March
    11
    2026

Related Events

Zoë Hitzig | How People Use ChatGPT
Mar 09, 202612:00 PM - 1:00 PM
March
09
2026

Despite the rapid adoption of LLM chatbots, little is known about how they are used. We approach this question theoretically and empirically, modeling a user who chooses whether to complete a task herself, ask the chatbot for information that reduces decision noise, or delegate execution to the chatbot...

Event

Zoë Hitzig | How People Use ChatGPT

Mar 09, 202612:00 PM - 1:00 PM

Despite the rapid adoption of LLM chatbots, little is known about how they are used. We approach this question theoretically and empirically, modeling a user who chooses whether to complete a task herself, ask the chatbot for information that reduces decision noise, or delegate execution to the chatbot...

Hari Subramonyam | Learning by Creating: A Human-Centered Vision for AI in Education
SeminarMar 11, 202612:00 PM - 1:15 PM
March
11
2026
Seminar

Hari Subramonyam | Learning by Creating: A Human-Centered Vision for AI in Education

Mar 11, 202612:00 PM - 1:15 PM
Joel Becker | Reconciling Impressive AI Benchmark Performance with Limited Developer Productivity Impacts
Mar 16, 202612:00 PM - 1:00 PM
March
16
2026

AI coding agents now complete multi-hour coding benchmarks with roughly 50% reliability, yet a randomized trial found experienced open-source developers took about 19% longer when allowed frontier AI tools than when tools were disallowed...

Event

Joel Becker | Reconciling Impressive AI Benchmark Performance with Limited Developer Productivity Impacts

Mar 16, 202612:00 PM - 1:00 PM

AI coding agents now complete multi-hour coding benchmarks with roughly 50% reliability, yet a randomized trial found experienced open-source developers took about 19% longer when allowed frontier AI tools than when tools were disallowed...