Human-in-the-Loop Oversight of AI is Compromised by Political Preferences
Abstract
Humans are legally responsible for monitoring sensitive Artificial Intelligence (AI) systems, requiring a human-in-the-loop (HITL) to have the final say in approving or rejecting AI decisions. However, behavioral science shows that humans are not free of error and bias. We report the results of three experiments (n = 5,798; 115,960 decisions) modeled on real-world welfare allocation scenarios in which locals and immigrants are considered for financial support. Results revealed that HITL oversight does not reliably correct algorithmic errors and may even exacerbate them. Moreover, the political preferences of the HITL shape the types of errors they make. Interventions that incentivized accuracy or clarified instructions reduced HITL error rates; nevertheless, HITL error rates remained higher than algorithmic error rates. These results indicate that HITL oversight is not a silver-bullet solution across settings. Our work calls for more systematic research into when, and how, oversight of AI can effectively reduce error and improve decision-making quality.