Missing the Margins: A Systematic Literature Review on the Demographic Representativeness of LLMs

Indira Sen
Marlene Lutz
Elisa Rogers
David Garcia
Markus Strohmaier

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Many applications of Large Language Models (LLMs) require them to either simulate people or offer personalized functionality, making the demographic representativeness of LLMs crucial for equitable utility. At the same time, we know little about the extent to which these models actually reflect the demographic attributes and behaviors of certain groups or populations, with conflicting findings in empirical research. To shed light on this debate, we review 211 papers on the demographic representativeness of LLMs. We find that while 29% of the studies report positive conclusions on the representativeness of LLMs, 30% of these do not evaluate LLMs across multiple demographic categories or within demographic subcategories. Another 35% and 47% of the papers concluding positively fail to specify these subcategories altogether for gender and race, respectively. Of the articles that report subcategories, less than half include marginalized groups in their study. Finally, 35% of 211 papers do not define the target population to whom their findings apply; of those that do define it either implicitly or explicitly, a large majority study only the U.S. Taken together, our findings suggest an inflated perception of LLM representativeness in the broader community. We recommend more precise evaluation methods and comprehensive documentation of demographic attributes to ensure the responsible use of LLMs for social applications.

Version published to 10.31235/osf.io/vk3x6_v1 on OSF Preprints
May 14, 2025

Random forests in corpus research: A systematic review

This article has 1 author:
1. Lukas Sönning
This article has no evaluationsLatest version Jan 17, 2026
Random forests in corpus research: A systematic review

This article has 1 author:
1. Lukas Sönning
This article has no evaluationsLatest version Jan 17, 2026
The Synthetic Nomological Net: A search engine to identify conceptual overlap in measures in the behavioral sciences

This article has 3 authors:
1. Björn Erik Hommel
2. Annika Iris Külpmann
3. Ruben C. Arslan
This article has no evaluationsLatest version Jan 6, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Random forests in corpus research: A systematic review

Random forests in corpus research: A systematic review

The Synthetic Nomological Net: A search engine to identify conceptual overlap in measures in the behavioral sciences