Implicit neural measures of trust in artificial intelligence
Abstract
Trust in AI systems is critical for effective collaboration, yet traditional measures, such as self-reports and behavioral proxies, are limited in capturing its dynamic and latent nature. This study introduces the contralateral delay activity (CDA), a neural marker of visual working memory load, as a novel, objective index of trust. While the CDA has been widely used in change detection tasks to track memory load, we repurpose it here to measure how much working memory users offload to an AI partner. Participants performed a lateralized memory task under low and high working memory load, collaborating with an AI whose reliability was experimentally manipulated across three phases: trust formation, violation, and restoration. In dyad trials, where the AI was responsible for one hemifield, CDA amplitude served as an index of how much information participants chose to maintain themselves versus offload to the AI. As AI reliability increased, CDA amplitude rose, indicating greater trust and reliance. When reliability dropped, participants encoded more items from both hemifields, and CDA amplitude declined. During trust restoration, CDA amplitude returned to pre-violation levels, indicating renewed reliance, though it never matched the high amplitude of solo trials, suggesting lingering mistrust. Behavioral measures (e.g., reliance, compliance, response time) tracked these dynamics but lacked the resolution and specificity of the CDA. Together, these results establish the CDA as a powerful neural index of dynamic trust. It captures trial-by-trial fluctuations in offloading behavior that reflect users’ evolving confidence in AI assistance, offering a continuous, covert, and cognitively grounded measure of trust in interactive settings.
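
For readers unfamiliar with the component, the sketch below illustrates how a CDA amplitude is conventionally derived: voltage at posterior electrodes contralateral to the maintained hemifield minus voltage at ipsilateral electrodes, averaged over the retention interval. This is a minimal illustration of the standard computation only; the electrode labels, time window, and function names are assumptions for the example, not the authors' analysis pipeline.

```python
import numpy as np

def cda_amplitude(erp_contra, erp_ipsi, times, window=(0.4, 1.0)):
    """Mean CDA over a retention-interval window (illustrative parameters).

    erp_contra, erp_ipsi : 1-D trial-averaged voltages over time at posterior
        sites (e.g., PO7/PO8) contralateral vs. ipsilateral to the hemifield
        the participant is maintaining.
    times : 1-D array of time points (s), aligned to memory-array onset.
    window : retention window (s) over which the difference wave is averaged.
    """
    mask = (times >= window[0]) & (times <= window[1])
    cda_wave = erp_contra - erp_ipsi  # contralateral-minus-ipsilateral difference wave
    # The CDA is a sustained negativity: a larger (more negative) value reflects
    # more lateralized maintenance, whereas encoding items from both hemifields
    # shrinks the contralateral-ipsilateral difference, as described above.
    return cda_wave[mask].mean()

# Synthetic example: a sustained -2 microvolt negativity during retention.
times = np.linspace(-0.2, 1.0, 601)
contra = np.where((times > 0.3) & (times < 1.0), -2e-6, 0.0)
ipsi = np.zeros_like(times)
print(cda_amplitude(contra, ipsi, times))  # approximately -2e-6 V
```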