Human Experts Vs. LLMs: Who is Better at Explaining Students’ Clustering into Knowledge Profiles?
Abstract
Large Language Models (LLMs) are increasingly used in educational settings to enhance assessment and feedback. While prior research has focused primarily on their ability to score responses or model learners’ knowledge, less attention has been given to their use in explaining outputs of machine learning algorithms – a key goal of explainable AI (XAI) in education. In this study, we explore the capacity of ChatGPT to generate natural-language explanations of student knowledge profiles derived from clustering analysis of multi-item chemistry assessments. These explanations are compared to those authored by human experts, with 16 chemistry teachers evaluating both versions in a blind review. While ChatGPT’s explanations were generally preferred for profiles representing simpler student performance patterns, human-authored explanations were favored for more complex profiles requiring nuanced pedagogical reasoning. Our findings highlight the capabilities and limitations of LLMs in generating high-level explanations of algorithmic outputs and suggest that relying on LLMs to analyze multi-item assessment data may actually work against students with more complex knowledge structures.