The Risks of Using Large Language Models for Text Annotation in Social Science Research

Abstract

Generative artificial intelligence (AI), particularly large language models (LLMs), has revolutionized computational social science, especially automated textual analysis. In this paper, we systematically evaluate the promises and risks of using LLMs for diverse coding tasks in social movement studies. We propose a framework for social scientists to adopt LLMs for text annotation, either as the primary coding decision-maker or as a coding assistant. We then discuss the associated epistemic risks to validity, reliability, replicability, and transparency, and conclude by offering several practical guidelines for using LLMs in coding tasks.