The Death of Take-Home Assessment in the Era of GenAI: Here Is the Evidence
Abstract
This study examined whether a mature, empirically validated generative artificial intelligence (GenAI) intervention framework can produce reliable process evidence when deployed in unsupervised take-home assessments. Twenty-five group submissions from two cybersecurity management cohorts were audited using a five-check protocol that tested primary evidence presence, traceability, internal data consistency, modification provenance, and reflection specificity. The assessments were designed using the Structured AI-Guided Education (SAGE) framework and incorporated base prompts, structured decision tables, mandatory AI interaction logs, and reflective commentary. Only 3 of 25 submissions (12%) produced evidence chains that were substantially auditable. Full traceability between documented AI outputs and human evaluation claims was not achieved in any of the 25 submissions. The remaining submissions exhibited logical checksum failures, compliance-pattern text in evaluation cells, procedural rather than functional reflection, and structural indicators consistent with audit trail simulation. These patterns were consistent across both cohorts. The paper identifies a compliance gradient in which conscientious students who follow the process in good faith incur a disproportionate documentation burden, while students who simulate compliance can produce comparable outputs with less effort. On the basis of this evidence, the paper argues that take-home assessments can no longer be relied upon as standalone assurance instruments in the GenAI era. SAGE remains a validated pedagogy for fostering AI orchestration competency through scaffolded tutorial practice. However, the burden of assurance must shift to secure, supervised tasks where process fidelity cannot be simulated.