A Pipeline for Extracting Data from Videos of Complex Political Events
Abstract
Political scientists now regularly use audio, video, and text data to investigate questions about deliberation, representation, and emotion. However, existing research using multimodal data focuses largely on highly professionalized settings such as national legislatures, where pre-processed data is often available. Although scholars can increasingly access a wide range of data on less standardized environments such as campaign events, committee hearings, and local government meetings, using videos of these events for research poses a number of common measurement challenges, including low-quality transcripts, missing speaker information, idiosyncratic production styles, and varying formats. In this paper, we present a streamlined pipeline using open-source tools to automatically extract text (e.g., transcription), audio (e.g., vocal features), and images (e.g., scene detection) from videos of complex political events. The outputs of our pipeline can then be readily used for a wide range of substantive analyses. We validate our approach through an examination of local government meetings in the United States, showing how we can accurately segment audio, identify speakers, transcribe speech, and detect gender in videos of varying structure and audio/video quality. As a demonstration, we examine participation by gender in over 1,000 hours of school board meeting videos.
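The abstract describes pipeline outputs that are speaker-labeled, timestamped, gender-tagged transcript segments, and a demonstration that measures participation by gender. As a minimal illustration of how such outputs might feed a downstream analysis, the sketch below aggregates speaking time per detected gender over a list of segments. The `Segment` schema, field names, and label values here are hypothetical, not the authors' actual data format or implementation:

```python
from dataclasses import dataclass

@dataclass
class Segment:
    """One diarized, transcribed span of speech (hypothetical schema)."""
    start: float    # segment start time, in seconds
    end: float      # segment end time, in seconds
    speaker: str    # diarization speaker label
    gender: str     # label from a gender classifier, e.g. "F" / "M"
    text: str       # transcribed speech

def speaking_time_by_gender(segments):
    """Total speaking time (seconds) per detected gender label."""
    totals = {}
    for seg in segments:
        totals[seg.gender] = totals.get(seg.gender, 0.0) + (seg.end - seg.start)
    return totals

# Illustrative segments from a hypothetical school board meeting video
segments = [
    Segment(0.0, 12.5, "spk_0", "F", "Call to order."),
    Segment(12.5, 40.0, "spk_1", "M", "First agenda item."),
    Segment(40.0, 55.0, "spk_0", "F", "Motion to approve."),
]
print(speaking_time_by_gender(segments))  # → {'F': 27.5, 'M': 27.5}
```

The same aggregation extends naturally to per-speaker or per-meeting summaries once the pipeline has produced segments of this shape for each video.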