Quantifying Human-AI Synergy
Abstract
We introduce a novel Bayesian Item Response Theory framework to quantify human–AI synergy, separating individual and collaborative ability while controlling for task difficulty in interactive settings. Unlike standard static benchmarks, our approach models human–AI performance as a joint process, capturing both user-specific factors and moment-to-moment fluctuations. We validate the framework by applying it to human–AI benchmark data (n=667) and find significant synergy. We demonstrate that collaboration ability is distinct from individual problem-solving ability: users better able to infer and adapt to others' perspectives achieve superior collaborative performance with AI, but not when working alone. Moreover, moment-to-moment fluctuations in perspective-taking influence AI response quality, highlighting the role of dynamic user factors in collaboration. This principled framework for analyzing human–AI collaboration data allows interactive benchmarks to better complement current single-task benchmarks and crowd-assessment methods. This work informs the design and training of language models that transcend static prompt benchmarks to achieve adaptive, socially aware collaboration with diverse and dynamic human partners.
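To make the idea concrete, the separation of individual and collaborative ability can be sketched with a toy Rasch-style (1PL) model. This is purely illustrative: the additive form, the symbols (`theta` for individual ability, `gamma` for collaborative ability, `b` for item difficulty), and the values below are our assumptions, not the paper's actual specification.

```python
import numpy as np

def success_prob(theta, gamma, b, with_ai=True):
    """Toy IRT success probability for one user on one item.

    theta : individual problem-solving ability (illustrative)
    gamma : collaborative ability, contributing only when
            the user works with the AI (illustrative)
    b     : item (task) difficulty
    """
    logit = theta + (gamma if with_ai else 0.0) - b
    return 1.0 / (1.0 + np.exp(-logit))

# A user with modest solo ability but strong collaborative skill:
p_solo = success_prob(theta=0.2, gamma=1.0, b=0.5, with_ai=False)
p_ai = success_prob(theta=0.2, gamma=1.0, b=0.5, with_ai=True)

# In a model like this, "synergy" appears as the gap between
# collaborative and solo success probability, after difficulty
# is controlled for by the shared b term.
print(p_solo, p_ai)
```

In a full Bayesian treatment, the abilities and difficulties would be latent parameters with priors, estimated jointly from response data rather than fixed by hand as here.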