A methodological tutorial in Python for automated content analysis of digital videos using artificial intelligence

Carlos A. Almenara

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background . The exponential growth of short social media videos has created new opportunities for research. Nevertheless, traditional video content analysis remains labor-intensive and therefore difficult to scale. This methodological paper provides a practical step-by-step tutorial for conducting automated content analysis of digital videos using a multimodal large language model (LLM, Gemini 3 Pro) via an application programming interface (API). Methods . Using Python in a cloud notebook environment, we demonstrate how to (1) collect a public dataset of TikTok videos, (2) upload videos to Google API Files, (3) apply a codebook-based prompt to extract structured variables, (4) enforce the outputs to a JSON template, (5) implement robust error handling and reprocessing logic, and (6) export results for statistical analysis. The tutorial is illustrated with an open dataset of 1,028 TikTok videos on weight loss, yielding one JSON record per video that includes video description, topic classification, and identification of explicit weight-loss product advertising, plus additional attributes (e.g., framing, identity, narrative type, call to action) when advertising is detected. Results . The full run produced 1,028 JSON files in 11.39 hours at a cost of USD $20.27 dollars. Human–LLM coding agreement, assessed on a random subset using Krippendorff’s alpha, was high (mean 94.87%). Conclusion . The provided Python code and results demonstrate that the method employed here is very useful and can be escalated to analyze thousands if not hundreds of thousands of short digital videos.

Version published to 10.21203/rs.3.rs-8817867/v1 on Research Square
Mar 6, 2026

TTV-HRM: Hierarchical Reasoning Architecture for Efficient Text-to-Video Generation

This article has 1 author:
1. Ahsan Umar
This article has no evaluationsLatest version Mar 23, 2026
Generative AI : A Comprehensive Overview of Large Language Models for Prompt Engineering and Applications

This article has 9 authors:
1. Ali Daud
2. Mobushira Khan
3. Sakher Ghanem
4. Sami Alesawi
5. Saud Yonbawi
6. Raed Alsini
7. Omar Alghushairy
8. Manal Linjawi
9. Abdulrahman Ahmed Gharawi
This article has no evaluationsLatest version Feb 10, 2026
FME-24: A Film, Music, and Emotion Dataset

This article has 2 authors:
1. Ruby Olivia Nagano Crocker
2. György Fazekas
This article has no evaluationsLatest version Feb 10, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

TTV-HRM: Hierarchical Reasoning Architecture for Efficient Text-to-Video Generation

Generative AI : A Comprehensive Overview of Large Language Models for Prompt Engineering and Applications

FME-24: A Film, Music, and Emotion Dataset