Forma mentis networks predict creativity ratings of short texts via interpretable artificial intelligence in human and AI-simulated raters

Edith Haim
Natalie Fischer
Salvatore Citraro
Giulio Rossetti
Massimo Stella

Abstract

Creativity is a fundamental skill of human cognition. We use textual forma mentis networks (TFMN) to extract network features (semantic/syntactic associations) and emotional features from approximately one thousand human-, GPT-3.5-, and Sonnet 3.7-generated stories. Using Explainable Artificial Intelligence (XAI), we test whether features related to Mednick’s associative theory of creativity can explain creativity ratings assigned by human or AI raters. Using XGBoost, we examine five scenarios: (i) humans rating human stories, (ii) GPT-3.5 rating human stories, (iii) GPT-3.5 rating GPT-3.5 stories, (iv) Sonnet 3.7 rating human stories, and (v) Sonnet 3.7 rating Sonnet 3.7 stories. Our findings reveal that GPT-3.5 and Sonnet 3.7 ratings differ significantly from human ratings, not only in their correlations but also in the feature patterns identified with XAI methods. GPT-3.5 and Sonnet 3.7 favour “their own” stories and rate human stories differently from humans. Feature importance analysis with SHAP scores shows that: (i) network features are more predictive of human creativity ratings, and also of GPT-3.5 and Sonnet 3.7 ratings of human stories; (ii) emotional features play a greater role than semantic/syntactic network structure when GPT-3.5 and Sonnet 3.7 rate their own stories. These quantitative results underscore key limitations in the ability of GPT-3.5 and Sonnet 3.7 to align with human assessments of creativity. We emphasise the need for caution when using AI models to assess and generate creative content, as they may not yet capture the nuanced complexity that characterises human creativity.

Version published to 10.31234/osf.io/6zpre_v3 on OSF Preprints
Sep 2, 2025
Version published to 10.31234/osf.io/6zpre_v2 on OSF Preprints
Aug 31, 2025
Version published to 10.31234/osf.io/6zpre_v1 on OSF Preprints
Nov 30, 2024

Related articles

Bot or not: Can people tell the difference between stories written by a human or by an AI system?

This article has 2 authors:
1. Sydney Sears
2. Deena Skolnick Weisberg
This article has no evaluations
Latest version Jan 28, 2026

Multi-Gate Mixture-of-Experts with Explanation for Predictive Computational Personality Analysis

This article has 5 authors:
1. Ahmed R. Elmahalawy
2. Min Wei
3. Xiaohua Wu
4. Xiaohui Tao
5. Lin Li
This article has no evaluations
Latest version Dec 12, 2025

Using Large Language Models to Explore and Predict Human Choice from Verbal Description

This article has 1 author:
1. Eyal Marantz
This article has no evaluations
Latest version Dec 17, 2025
