Laila Shereen Sakr and Cathy Thomas
Thursday, April 30, 2026
Carsey Wolf-Center
Media Industries in AI Inititiave
PROBLEM
Global data is uneven, multilingual, power-structured
AI systems
flatten context
Result: loss of meaning, misrepresentation
METHOD
SYSTEM
CONTRIBUTION
For example, sentiment analysis fails to recognize sarcastic humor (2012).How can AI systems represent global data without flattening the historical, linguistic, and political relations that give it meaning?
Challenge
Global data is uneven, multilingual, power-structured and AI flattens context.
70+ languages
billions of posts
global archive of social media
Pattern ≠ meaning
Scale ≠ interpretation
Multilingual data ≠ contextual understanding
Archive required new epistemology
Human–AI collaborative platform
Data Visualization
Contextual metadata
Citational lineage
Participatory annotation
Small language models
Bias evaluation Tool
Online Community
Shared Governance
1. Data Layer (historical and culturally based)
contextual metadata
multilingual structure
historical annotation
2. Model Layer (linguistic based)
small language models
comparative training (baseline vs enriched datasets)
3. Evaluation Layer (community based)
interpretability
bias mitigation
semantic fidelity across languages
multiple histories
in relation
without resolution
Capitol Mosaic, 30-ft wall of Jan 6 images from Parler and Twitter,
VJ Um Amel (Qualcomm Gallery, 2021).Collaborators
Students
Denise Alfonso
Ripley Baker
Henry Coburn
Elisa Coccioli
Jessie Ding
Calista Dollag
Lexxus Edison Coffey
Kamaya Jackson
Jiyoo Kim-Jung
Corinna Kelley
Cass Mayeda
Funders
Sofia Mosqueda
Anna Shaverdyan
Shashank Shivashankar
Saide Singh
Justus Wan
Calais Waring
Winston Zuo
Consultants:
Josh Bevan
Sierra Peltcher
r-shief.org