Illusions of Confidence in Artificial Systems
Abstract
Effective collaboration requires that we monitor both the cognitive states (e.g., beliefs) and metacognitive states (e.g., confidence) of other agents. While humans routinely share confidence, metacognitive capabilities are still developing in artificial intelligence (AI), raising the question of how humans attribute metacognition to AI systems. In seven pre-registered experiments, we show that attributions of metacognition are sensitive not only to observed behaviour (e.g., response times) but also to agent type: observers consistently overestimated the confidence of AI agents relative to humans, even when their behaviour was identical. This illusion of confidence was robust across behavioural profiles, agent descriptions, and decision-making tasks (visual perception, general knowledge) but was reduced for more subjective decisions (emotion categorisation). An experimental manipulation further showed that illusions of confidence are rooted in prior beliefs about agents' capabilities. Together, these findings uncover a powerful illusion of confidence in artificial systems and highlight a central role for metacognition in human-AI interactions.