Are Eight Chatbots Better Than One? Boosting Chatbot Creative Outcomes via Exposure to Self- and Peer-Generated Examples

Abstract

An important question in the emerging field of human-AI co-creativity is how users can consistently get the most out of whichever AI systems they have at their disposal. To advance this practical know-how, the present study reports an exploratory empirical investigation of whether, and how, exposure to self- and peer-generated examples affects the creative performance of chatbots. We introduce two strategies: (a) “Pick & Mix”, which involves selecting, combining, and enhancing elements from examples, and (b) “Try to Beat”, which uses examples as baselines to outperform. We tested these strategies with eight widely used chatbots (ChatGPT, Claude, Copilot, DeepSeek, Gemini, Grok, Meta, and Perplexity) in realistic usage settings, using a two-round, multi-iteration process built on two standardized creativity tasks: the Divergent Association Task (DAT) and the Alternative Uses Test (AUT). Findings indicate that Pick & Mix is a simple and effective approach for improving chatbots’ creative performance. In contrast, Try to Beat is generally ineffective and rarely outperforms Pick & Mix. Overall, the findings suggest that chatbots can repeatedly identify and improve the best available candidates within a set of provided examples, but struggle to extract and reuse task-relevant features from those examples to generate consistently better alternatives.