Brief Commentary: A Framework for Detecting AI Agents in Online Research

Felipe M. Affonso

Abstract

Online behavioral research assumes survey responses come from humans, yet vision-enabled AI agents can now autonomously complete surveys by capturing screenshots, processing questions, and submitting responses. Because these agents perceive the same rendered visual content that humans see, traditional detection methods are ineffective. This article introduces the Cognitive Trap Framework: researchers can turn architectural constraints of vision-language models into survey questions whose correct answers are difficult for AI agents yet easy for humans. Six traps derived from computer science benchmarks demonstrate the framework. In tests with 1,007 human participants (Prolific) and 526 researcher-deployed AI agents (e.g., ChatGPT Agent, Google Project Mariner), cognitive traps detected 97.1% of agents (vs. 2.3% with traditional attention checks) while flagging only 4.1% of humans. Pre-registered replications on Amazon MTurk and CloudResearch Connect demonstrate cross-platform effectiveness, and validation against 34 frontier models spanning two years reveals that model improvement is non-monotonic, because each new architecture reconfigures which constraints it resolves and which it introduces. The framework can thus generate new cognitive traps as AI agent models evolve, and a public repository provides researchers with validated traps ready for deployment: https://FelipeMAffonso.github.io/cognitive-trap-repository.
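To see what the reported rates could mean in a live sample, the arithmetic can be sketched as follows. This is an illustrative calculation, not the author's analysis: it takes the 97.1% agent detection rate and the 4.1% human flag rate from the abstract as given, and the agent prevalence values are hypothetical assumptions chosen only for illustration.

```python
# Rates reported in the abstract.
AGENT_DETECTION_RATE = 0.971  # share of AI agents flagged by cognitive traps
HUMAN_FLAG_RATE = 0.041       # share of humans flagged by cognitive traps


def share_of_flags_that_are_agents(detection_rate, human_flag_rate, prevalence):
    """Bayes' rule: probability that a flagged response came from an agent.

    `prevalence` is the assumed share of agents in the sample (hypothetical;
    the abstract does not report a field prevalence).
    """
    flagged_agents = detection_rate * prevalence
    flagged_humans = human_flag_rate * (1.0 - prevalence)
    return flagged_agents / (flagged_agents + flagged_humans)


if __name__ == "__main__":
    # Hypothetical prevalence values, for illustration only.
    for prevalence in (0.01, 0.05, 0.20):
        share = share_of_flags_that_are_agents(
            AGENT_DETECTION_RATE, HUMAN_FLAG_RATE, prevalence
        )
        print(f"assumed prevalence {prevalence:.0%}: "
              f"{share:.1%} of flagged responses are agents")
```

The lower the assumed share of agents in a sample, the larger the fraction of flagged responses that come from humans, so the same trap can warrant different exclusion decisions on different platforms.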

Version published to 10.31234/osf.io/enuqj_v2 on OSF Preprints
Mar 18, 2026
Version published to 10.31234/osf.io/enuqj_v1 on OSF Preprints
Nov 19, 2025

Related articles

Estimating the threat of AI-agent responding across online survey platforms

This article has 8 authors:
1. Stephanie Chen
2. Oleg Urminsky
3. Grace Zhang
4. Robert Walatka
5. Kianté Fernandez
6. Andrea Low
7. Jonathan Bogard
8. Craig R. Fox
This article has no evaluations. Latest version: Mar 2, 2026

A scale for detecting LLM-generated responses in online survey research

This article has 2 authors:
1. Cameron Stuart Kay
2. Madalina Vlasceanu
This article has no evaluations. Latest version: Mar 21, 2026