Text Psychometrics: Assessing Psychological Constructs in Text Using Natural Language Processing

Abstract

Large language models (LLMs) have revolutionized natural language processing (NLP). Yet when used to assess psychological constructs in text, they are generally not evaluated for the types of validity, reliability, and standardization typically expected from traditional questionnaires with rating scales. This study bridges that gap by demonstrating how to evaluate the psychometric properties of text-based models, which we call Text Psychometrics.

We first review different NLP methods, compare their ability to address key challenges in psychological research such as explainability, and outline methods for evaluating them on many desirable psychometric properties. We then demonstrate this through two empirical studies. Study 1 classifies thousands of crisis counseling conversations and Reddit posts into different types of mental health issues and introduces a novel method to evaluate text models for content validity — the extent to which a test captures the full range of expressions of a construct. Study 2 examines prospective criterion validity by estimating how 49 known suicide risk factors predict imminent risk in crisis counseling conversations.

In sum, NLP studies in psychology often rely on only a few validation metrics; here, we demonstrate the need for broader psychometric evaluation and provide a practical blueprint and future directions for achieving it.
