Comparison of Algorithms for the Recognition of ChatGPT Paraphrased Texts

Aleksandar Kartelj
Miljana Mladenovic
Stasa Vujicic Stankovic

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The rapid development of artificial intelligence, especially chatbots, is leading to new forms of plagiarism that are difficult to detect using existing methods. Paraphrasing tools make this problem even more difficult and are incredible in minor languages with inadequate resources and tools. This study explores strategies that can help detect plagiarism generated by ChatGPT 4.0 and altered by paraphrasing tools. We propose two new datasets consisting of abstracts of doctoral theses in English and Serbian. Both datasets were subjected to ChatGPT paraphrasing, which allowed us to form two classes of texts: human-generated and AI-generated, i.e. AI-paraphrased. We then perform a comprehensive comparison of 19 widely used classification algorithms based on two feature sets, namely word unigrams and character multigrams. In addition, we compare these to the results of a commercially available pre-trained ChatGPT content detector, ZeroGPT. The results on the English corpus turn out to be very accurate, achieving an accuracy of 95% or more. In contrast, the results on the Serbian corpus were less accurate, achieving an accuracy of just over 85%. We attribute this difference to the lower ability of ChatGPT to parahprase in minor languages such as Serbian.

Version published to 10.21203/rs.3.rs-5107971/v1 on Research Square
Sep 20, 2024

PruneBERT: Context-Aware Sentence Classification through Statistical Relevance Pruning

This article has 5 authors:
1. Raghav Kaushik R
2. Jeganathan L
3. Janaki Meena M
4. Ummity Srinivasa Rao
5. Jayaram Balabaskaran
This article has no evaluationsLatest version Feb 6, 2026
Comparing the Performance of SOTA Text Summarization Models on AI Research Papers

This article has 2 authors:
1. Pradnya Gotmare
2. Sushant Nair
This article has no evaluationsLatest version Jan 22, 2026
Transformer-driven chatbot for Arabic agricultural knowledge

This article has 1 author:
1. Manal AlGhieth
This article has no evaluationsLatest version Mar 13, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

PruneBERT: Context-Aware Sentence Classification through Statistical Relevance Pruning

Comparing the Performance of SOTA Text Summarization Models on AI Research Papers

Transformer-driven chatbot for Arabic agricultural knowledge