Research statement – Annabel Rothschild

Who:

I am a junior (assistant) professor of computer science at Bard College starting Fall 2025. I hold a PhD in Human-Centered Computing (HCC) from Georgia Tech’s School of Interactive Computing. While there, I was a member of the DataWorks research team, and was advised by Dr. Betsy DiSalvo and Dr. Carl DiSalvo. Before Georgia Tech, I studied computer science at Wellesley College, and I previously held short-term visiting researcher positions at Aalto University and the OFFIS research institute.

What:

At a high level, I am interested in how people collect and curate data, as well as how they begin to make sense of it. Said differently, how do people decide what data represents? I examine the tools and devices people employ to perform these sense-making activities, with an eye towards improving the experiences of documentation and dataset contextualization. Working primarily in critical data studies and responsible AI (R-AI), my approach is also informed by prior research experience and ongoing interest in programming languages, usable security and privacy, and information credibility.

Where:

My dissertation field site was DataWorks, a combined data services firm and work-training program. At DataWorks, we are figuring out how to build an alternative data annotation site that does not reproduce the exploitative work practices common among data annotation platforms and providers. The Data Fellows are full-time university employees with competitive pay and benefits, and become experts in data cleaning, organization, and standardization through a mix of dedicated training modules and work on real client projects. The emphasis is on creating a lucrative, sustainable career in data work.

Through the unique structure of DataWorks, we are also able to create comprehensive understandings of how a dataset came into being, both in origin and its current form, helping determine fair and pro-social later use. At DataWorks, I focus primarily on datasets being used to train and develop AI and ML systems.

When and how:

I defended my dissertation “Developing Pro-Social AI Training Datasets Through Data Workers’ Critical Perspectives” in April 2025. Within the dissertation, I explore a series of projects related to developing an alternative data work site—or, one in which data workers’ lived experiences and perspectives are not only valued, but brought in as valuable assets for the process of dataset collection and curation. To me, data workers can serve as our best chance at auditing a dataset before it is used for something like training an AI system, because the data workers have actually seen the contents of the dataset and noted irregularities or offensive content, which the requesters of such data labor often don’t realize.

At Bard, I will be building a research group to critically examine the status quo of AI: how it is computationally developed, on what ecological infrastructures and theoretical commitments it is based, and of course the data on which it is trained and refined. I am looking forward to developing a group indicative of Bard’s liberal arts focus, with students ranging from practicing studio artists and language arts majors to dedicated programmers.

Other research “hobbies”:

reading about anything related to data and computing. Recently this has included particularly studies of how computing and computational tools are used for surveillance, prescribing boundaries on human movement, and to enact borders, nationality, and immigration;
keeping a kind of data diary of the temporal and financial cost of protecting my personal information on the web – mostly to highlight the barriers to doing so;
right to repair and restoration of old and aging computer systems, along with free & open source software, in protest of surveillance capitalism, planned obsolescence, and platform capture. Right now I’m thinking a lot about how well those values cohere with the democratic process.