Detecting AI-Generated Essays in Writing Assessment: Responsible Use and Generalizability Across LLMs

Jiangang Hao

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Writing is a foundational literacy skill that underpins effective communication, fosters critical thinking, facilitates learning across disciplines, and enables individuals to organize and articulate complex ideas. Consequently, writing assessment plays a vital role in evaluating language proficiency, communicative effectiveness, and analytical reasoning. The rapid advancement of large language models (LLMs) has made it increasingly easy to generate coherent, high-quality essays, raising significant concerns about the authenticity of student-submitted work. This chapter first provides an overview of the current landscape of detectors for AI-generated and AI-assisted essays, along with guidelines for their responsible use. It then presents empirical analyses to evaluate how well detectors trained on essays from one LLM generalize to identifying essays produced by other LLMs, based on essays generated in response to public GRE writing prompts. These findings provide guidance for developing and retraining detectors for practical applications.

Version published to 10.35542/osf.io/76nck_v1 on OSF Preprints
Feb 23, 2026

Argumentative essay assessment with LLMs: A critical scoping review

This article has 5 authors:
1. Lucile Favero
2. Gabrielle Gaudeau
3. Juan Antonio Pérez-Ortiz
4. Tanja Käser
5. Nuria Oliver
This article has no evaluationsLatest version Feb 2, 2026
The AI Paradox in L2 Writing: Why Helpful Feedback Creates Unhelpful Dependency in Higher Education

This article has 2 authors:
1. Mohamed SEDDIKI
2. Souhila KORICHI
This article has no evaluationsLatest version Feb 18, 2026
The AI Paradox in L2 Writing: Why Helpful Feedback Creates Unhelpful Dependency in Higher Education

This article has 2 authors:
1. Mohamed SEDDIKI
2. Souhila KORICHI
This article has no evaluationsLatest version Feb 18, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Argumentative essay assessment with LLMs: A critical scoping review

The AI Paradox in L2 Writing: Why Helpful Feedback Creates Unhelpful Dependency in Higher Education

The AI Paradox in L2 Writing: Why Helpful Feedback Creates Unhelpful Dependency in Higher Education