AI Feedback in Education: The Impact of Prompt Design and Human Expertise on LLM Performance


Abstract

This article investigates the potential of large language models (LLMs) as tools for generating high-quality feedback in higher education, emphasizing the critical roles of prompt design and human supervision. Addressing challenges such as educators' time constraints and variability in feedback quality, two empirical studies evaluate feedback generated by ChatGPT-4, Claude 3, and Gemini Advanced. Study 1 examines how prompt structure influences feedback quality, contributing robust evidence toward a manual for effective prompt engineering. Study 2 compares 459 pieces of LLM-generated feedback on learning goals from 153 pre-service teachers across nine quality dimensions. Findings reveal that domain specificity and clearly stated criteria in prompts significantly enhance feedback quality, with ChatGPT-4 outperforming the other models in every feedback-quality category except errors. While Claude 3 demonstrates minimal content errors, Gemini Advanced provides balanced but lower-quality feedback. These results underscore that prompt engineering is a learnable skill for educators and students, aligning with calls for AI literacy in education. By combining expertly crafted prompts with human oversight, this research provides a framework for addressing feedback challenges in higher education.