Development and validation of a scale assessing perceived trustworthiness in large language models


Abstract

Large language models (LLMs) are increasingly part of everyday life, yet there is no established way to measure how users evaluate their trustworthiness. This study introduces the Perceived Trustworthiness of LLMs scale (PT-LLM-8), developed from the TrustLLM framework and adapted as a human-centred measure. The scale was designed to measure the perceived trustworthiness of a user’s primary LLM and assesses eight dimensions: truthfulness, safety, fairness, robustness, privacy, transparency, accountability, and compliance with laws. Psychometric properties of the scale were tested with 752 LLM users in the United Kingdom (mean age = 28.58, SD = 6.11; 50.3% male, 48.8% female). The PT-LLM-8 functions as a unidimensional measure, with high internal consistency (Cronbach’s alpha = 0.90, composite reliability = 0.91), strong item-total correlations (0.62–0.75), and measurement invariance across gender. The scale yields an overall score of perceived trustworthiness of LLMs, and item-level responses can be examined when insight into specific dimensions is needed. For researchers, practitioners, and developers, the PT-LLM-8 offers a practical instrument for evaluating interventions, comparing groups and contexts, and examining whether technical safeguards are reflected in users’ perceived trustworthiness of LLMs. The scale can also be applied to guide system design, support policy development, and help organisations monitor shifts in user trust toward LLMs over time, making it applicable across research, practice, and governance.