The research network Data Science @ Uni Vienna warmly invites you to attend the Data Science Distinguished Lecture for Summer Semester 2026!
The lecture will be given by Prof. David Smith of Northeastern University, who is an Associate Professor in the Khoury College of Computer Sciences at Northeastern University in Boston. His interdisciplinary work focuses on natural language processing and computational linguistics, particularly through applications in the humanities and social sciences. He is currently a Visiting Professor at the Faculty of Historical and Cultural Studies.
When:
1st of June 2026
17:00-19:00
Where:
Oskar-Morgenstern-Platz 1
Lecture Hall 4
1090 Vienna
If you want to join us online, follow here:
Meeting-ID: 684 6425 8793
Passcode: 933409
https://univienna.zoom.us/j/68464258793?pwd=ONIMGXIaGfmCPRZbBg0UN8A0oZrByg.1
Abstract:
As suggested by Alison Gopnik and others (e.g., Farrell et al., 2025), large language models and similar AI artifacts are "cultural technologies". Like language and writing—and also bureaucracies, democracies, and markets—AI transforms our relationship to memory and our interactions with each other. More particularly, state-of-the-art models are trained on archives collected from the digitized human record. Model trainers are recreating, intentionally or not, processes for selection, categorization, and source criticism that resemble some archival practices. After surveying some of the consequences of this archival view of AI, this talk will present work from our research group that traces the effects of training data composition on training dynamics and of mixtures of genres on high-level LLM capabilities. I will also discuss the ways in which analyzing large-scale patterns in the human record can help us build better models.
We invite you to refreshments after the talk!
Registration:
Please register here:
booking.univie.ac.at/dsdl2026/
