Measuring Individual Differences in Meaning: The Supervised Semantic Differential

Hubert Plisiecki
Paweł Lenartowicz
Artur Pokropek
Kinga Małyska
Maria Flakus

Abstract

The Supervised Semantic Differential (SSD) is a new method for measuring differences in how people understand the same concept using their own language rather than questionnaire responses. SSD builds a “personal concept vector” for each participant by aggregating the word embeddings of the local linguistic contexts (±3-word windows) around a small, targeted lexicon (e.g., “climate,” “vaccination”) in their open-ended writing. These vectors are then reduced with PCA, and questionnaire scores are regressed on the reduced vectors; the resulting regression weights are back-projected to recover a single semantic gradient in embedding space that captures how meaning shifts as the psychological construct changes. This gradient is interpreted through nearest-neighbor retrieval and clustering, producing human-readable contrasts in meaning. We applied SSD to seven corpora (Politics, Climate, Vaccination, Trust, Scientists, Health, Immigration) collected from 1,736 Polish adults, each of whom wrote short essays and completed a corresponding attitude item (e.g., perceived political influence, readiness to change lifestyle for the climate, trust in vaccines). Six of seven regressions showed reliable associations between language use and questionnaire scores (adjusted R² ≈ .03–.12); Immigration showed no coherent gradient. SSD revealed theoretically meaningful semantic contrasts (e.g., politics as empowered civic participation vs. politics as corrupt and irrelevant; climate change as urgent emotional distress vs. abstract historical process). Finally, SSD-derived scores showed convergent patterns with demographics and information-seeking behavior that broadly paralleled those of the questionnaires, indicating that SSD recovers psychologically interpretable individual differences. Simulations showed that SSD retains perfect statistical power even in samples of fewer than 100 data points.
Taken together, SSD joins qualitative meaning with quantitative measurement, thereby opening a new avenue for psychological research in which language itself becomes an interpretable, testable operationalization of psychological constructs rather than just their narrative residue.
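
The pipeline described in the abstract (context-window aggregation, PCA, regression, back-projection, nearest-neighbor reading) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the choice of a plain mean as the aggregation, the exclusion of the target word from its own window, the use of ordinary least squares, and the cosine-based retrieval are all assumptions made for illustration. The `embeddings` argument is a hypothetical dictionary mapping words to vectors.

```python
import numpy as np

def personal_concept_vector(tokens, lexicon, embeddings, window=3):
    """Mean embedding of words within +/- `window` tokens of each lexicon hit.

    Assumption: the target word itself is excluded and out-of-vocabulary
    words are skipped. Returns None if no context word was found.
    """
    context = []
    for i, tok in enumerate(tokens):
        if tok in lexicon:
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i and tokens[j] in embeddings:
                    context.append(embeddings[tokens[j]])
    return np.mean(context, axis=0) if context else None

def semantic_gradient(X, y, n_components):
    """PCA-reduce X, regress y on the components, back-project the weights.

    X: (participants, embedding_dim) personal concept vectors.
    y: (participants,) questionnaire scores.
    Returns a unit-length direction in the original embedding space.
    """
    Xc = X - X.mean(axis=0)
    # PCA via SVD: the rows of Vt are the principal axes.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    V = Vt[:n_components].T                 # (embedding_dim, n_components)
    Z = Xc @ V                              # component scores per participant
    # Assumption: ordinary least squares on centered scores.
    beta, *_ = np.linalg.lstsq(Z, y - y.mean(), rcond=None)
    grad = V @ beta                         # back-projection to embedding space
    return grad / np.linalg.norm(grad)

def nearest_words(direction, embeddings, k=5):
    """Words whose (nonzero) embeddings have the highest cosine with `direction`."""
    words = list(embeddings)
    M = np.stack([embeddings[w] for w in words])
    sims = (M @ direction) / (np.linalg.norm(M, axis=1) * np.linalg.norm(direction))
    return [words[i] for i in np.argsort(-sims)[:k]]
```

In this sketch, calling `nearest_words(gradient, embeddings)` and `nearest_words(-gradient, embeddings)` would give the two poles of the contrast; the clustering step mentioned in the abstract is omitted.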

Version published to 10.31234/osf.io/gvrsb_v1 on OSF Preprints
Nov 4, 2025

Related articles

Characterizing the Large-Scale Structure of Multimodal Semantic Networks

This article has 6 authors:
1. Raja Marjieh
2. Pol van Rijn
3. Ilia Sucholutsky
4. Harin Lee
5. Nori Jacoby
6. Thomas L. Griffiths
This article has no evaluations. Latest version Sep 23, 2025

Bridging Behavioral Gaps: Automatic Extrapolation of Concreteness Norms for Arabic and English with a k-Nearest Neighbor Approach

This article has 4 authors:
1. Marina Aziz
2. Urban Knupleš
3. Diego Frassinelli
4. Sabine Schulte im Walde
This article has no evaluations. Latest version Nov 2, 2025

Core vocabulary reveals differences between human word prediction and large language models

This article has 4 authors:
1. Andrew Wang
2. Simon De Deyne
3. Meredith McKague
4. Andrew Perfors
This article has no evaluations. Latest version Oct 27, 2025
