Enhancing Text Quality with Human-Machine Collaboration: A Refinement Approach

Yicheng Sun
Yi Wang
Jianwei Yang
Tao Liu
Hanbo Yang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Human writing often exhibits a range of styles and levels of sophistication. However, automated text generation systems typically lack the nuanced understanding required to produce refined and elegant prose. Due to the inherent one-to-many relationship between inputs and outputs in natural language generation tasks, achieving annotator consistency is challenging. This complexity makes the annotation process considerably more difficult compared to tasks focused on natural language understanding. Our study focuses on the typical task of text refinement, which faces annotation difficulties, aiming to generate sentences with more elegant expressions while preserving the original semantics of the input sentence. This paper proposes a semi-automatic data construction method that combines auto-generation with human judgment. Initially, this method translates collected sentences containing elegant expressions into ordinary expressions through back translation. Subsequently, in an iterative quality control process, data filtering and human judgment are introduced to screen the auto-generated data based on quality standards, resulting in a large-scale text refinement dataset. By replacing manual annotation with human judgment and involving only a small amount of data for human judgment in each iteration, this method significantly reduces annotation difficulty and workload. With minimal human effort, it acquires a substantial amount of labeled data for text refinement, laying a foundation for further research in the field.

Version published to 10.21203/rs.3.rs-5506073/v1 on Research Square
Dec 3, 2024

Evaluating an LLM’s Performance in Annotating Discourse Strategies

This article has 2 authors:
1. Taylor Meizlish
2. Chris Ziffo
This article has no evaluationsLatest version Sep 2, 2025
Improving Large Language Models with Concept-Aware Fine-Tuning

This article has 5 authors:
1. Dacheng Tao
2. Michael Chen
3. Xikun ZHANG
4. Jiaxing Huang
5. Yingjie Wang
This article has no evaluationsLatest version Oct 1, 2025
Computation of Sentence Similarity Score through Hybrid Deep Learning with a Special Focus on Negation Sentence.

This article has 5 authors:
1. Rohit M
2. Jeganathan L
3. Srinivasa Rao Ummity
4. Janaki Meena M
5. Jayaram Balabaskaran
This article has no evaluationsLatest version Sep 22, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Evaluating an LLM’s Performance in Annotating Discourse Strategies

Improving Large Language Models with Concept-Aware Fine-Tuning

Computation of Sentence Similarity Score through Hybrid Deep Learning with a Special Focus on Negation Sentence.