Teacher-Student Framework for Short-Context Classification with Domain Adaptation and Data Augmentation

Abstract

Detecting AI-generated text is a growing challenge as large language models produce content that increasingly resembles human writing. This paper presents a teacher-student framework that improves both the performance and the efficiency of short-context document classification, combining domain adaptation with data augmentation to strengthen detection. The teacher ensembles DeBERTa-v3-large and Mamba-790m, both fine-tuned on domain-specific data, and draws on their semantic knowledge to produce accurate predictions. The student operates on short contexts of 128 and 256 tokens and is distilled from the teacher, balancing accuracy against computational cost. To improve robustness, we built a data generation and augmentation pipeline that applies perturbations such as spelling correction, character removal, and case flipping to increase data variety. The proposed framework outperforms current methods in accuracy and robustness, offering an efficient solution for detecting AI-generated text, and lays the groundwork for future work on multilingual classification and real-time inference in AI text detection.
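
The abstract names two mechanisms concretely: character-level augmentation (character removal, case flipping) and teacher-to-student distillation. The sketch below illustrates both in Python; the function names, perturbation rates, temperature, and loss weighting are illustrative assumptions, not values taken from the paper.

```python
import random
import torch
import torch.nn.functional as F

# Hypothetical character-level augmentations matching the kinds of
# perturbations named in the abstract; the rates are illustrative.
def drop_chars(text: str, p: float = 0.05) -> str:
    """Randomly remove a small fraction of characters."""
    return "".join(c for c in text if random.random() > p)

def flip_case(text: str, p: float = 0.05) -> str:
    """Randomly flip the case of a small fraction of characters."""
    return "".join(c.swapcase() if random.random() < p else c for c in text)

def augment(text: str) -> str:
    """Apply the character-level perturbations in sequence."""
    return flip_case(drop_chars(text))

# Hypothetical distillation objective: soften teacher and student logits
# with a temperature T, then mix the KL term with ordinary cross-entropy.
def distillation_loss(student_logits: torch.Tensor,
                      teacher_logits: torch.Tensor,
                      labels: torch.Tensor,
                      temperature: float = 2.0,
                      alpha: float = 0.5) -> torch.Tensor:
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
    log_student = F.log_softmax(student_logits / temperature, dim=-1)
    kl = F.kl_div(log_student, soft_targets, reduction="batchmean") * temperature ** 2
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kl + (1.0 - alpha) * ce
```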
