Encoding of pretrained large language models mirrors the genetic architectures of human psychological traits
Abstract
Recent advances in large language models (LLMs) have spurred widespread interest in using them as universal translators for biomedical terms. However, the black-box nature of LLMs has forced researchers to rely on artificially designed benchmarks without understanding what LLMs actually encode. We demonstrate that pretrained LLMs, without any fine-tuning, can already explain up to 51% of the genetic correlation between items on a psychometrically validated neuroticism questionnaire. For psychiatric diagnoses, we found that disorder names aligned better with genetic relationships than diagnostic descriptions did. Our results indicate that pretrained LLMs carry encodings that mirror genetic architectures. These findings highlight LLMs’ potential for validating phenotypes, refining taxonomies, and integrating textual and genetic data in mental health research.
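The comparison the abstract describes can be sketched as follows. This is an illustrative outline, not the authors' pipeline: the embedding matrix and genetic-correlation matrix below are random stand-ins for, respectively, LLM embeddings of questionnaire-item text and item-level genetic correlations estimated from genomic data. The analysis correlates pairwise embedding similarity with pairwise genetic correlation and reports the squared correlation as the fraction of variance explained.

```python
import numpy as np

rng = np.random.default_rng(0)
n_items, dim = 12, 64

# Placeholder for embeddings a pretrained LLM would produce
# for each questionnaire item's text (one row per item).
embeddings = rng.normal(size=(n_items, dim))

# Placeholder for the item-by-item genetic correlation matrix
# (in practice estimated from GWAS summary statistics).
genetic_corr = np.corrcoef(rng.normal(size=(n_items, 30)))

def cosine_sim_matrix(X):
    """Pairwise cosine similarity between row vectors of X."""
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    return Xn @ Xn.T

semantic_sim = cosine_sim_matrix(embeddings)

# Compare only off-diagonal entries: one value per item pair.
iu = np.triu_indices(n_items, k=1)
r = np.corrcoef(semantic_sim[iu], genetic_corr[iu])[0, 1]
variance_explained = r ** 2  # fraction of genetic correlation explained
print(f"variance explained: {variance_explained:.3f}")
```

With random placeholders the number is near zero; the paper's 51% figure refers to real neuroticism items and their estimated genetic correlations.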