Hierarchical Reinforcement Learning for Adaptive Text Summarization

Ahmad Farooq

Abstract

This study presents an approach to adaptive text summarization based on hierarchical reinforcement learning. We develop a T5-based hierarchical summarizer with a level selector, and we implement and compare three reinforcement learning algorithms: Proximal Policy Optimization (PPO), Advantage Actor-Critic (A2C), and Soft Actor-Critic (SAC). The system adapts summary length to time constraints and is evaluated with ROUGE and BERTScore. Experiments on the CNN/DailyMail dataset illustrate the potential of this approach for balancing summary quality against generation speed. PPO achieves the highest ROUGE and BERTScore values, while A2C offers a better balance between quality and efficiency. The paper highlights both the potential and the challenges of applying reinforcement learning to adaptive summarization and points to directions for future research in this area of natural language processing.
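The idea of adapting summary length to a time constraint can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the level names, token caps, time estimates, and reward shape do not come from the paper, and the rule-based `select_level` is only a stand-in for the learned level selector that the paper trains with PPO, A2C, or SAC.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SummaryLevel:
    name: str
    max_tokens: int     # hypothetical length cap for the generated summary
    est_seconds: float  # hypothetical estimate of generation time


# Hypothetical levels; the paper does not specify these values.
LEVELS = (
    SummaryLevel("brief", 32, 0.5),
    SummaryLevel("standard", 64, 1.0),
    SummaryLevel("detailed", 128, 2.0),
)


def select_level(time_budget_s: float, levels=LEVELS) -> SummaryLevel:
    """Pick the longest level whose estimated time fits the budget.

    Falls back to the fastest level when nothing fits. A trained policy
    would replace this rule.
    """
    feasible = [lv for lv in levels if lv.est_seconds <= time_budget_s]
    if not feasible:
        return min(levels, key=lambda lv: lv.est_seconds)
    return max(feasible, key=lambda lv: lv.max_tokens)


def reward(quality: float, elapsed_s: float, time_budget_s: float,
           penalty: float = 1.0) -> float:
    """Quality score minus a linear penalty for exceeding the time budget.

    `quality` is assumed to be a score in [0, 1], such as a ROUGE or
    BERTScore value computed elsewhere.
    """
    overrun = max(0.0, elapsed_s - time_budget_s)
    return quality - penalty * overrun
```

In a reinforcement learning setup, a reward of this kind would be returned after each generated summary, so that the selector learns to prefer shorter levels when the time budget is tight.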

Version published to 10.20944/preprints202503.2300.v1
Mar 31, 2025

Related articles

Multimodal Fusion Network for Multimodal Sentiment Analysis

This article has 3 authors:
1. Blythe Ellison
2. Emily Marwood
3. Huxley Sinclair
This article has no evaluations. Latest version: Feb 21, 2025

Analysis of Short Texts Using Intelligent Clustering Methods

This article has 7 authors:
1. Jamalbek Tussupov
2. Akmaral Kassymova
3. Ayagoz Mukhanova
4. Assyl Bissengaliyeva
5. Zhanar Azhibekova
6. Moldir Yessenova
7. Zhanargul Abuova
This article has no evaluations. Latest version: Apr 1, 2025

Token-Level Pruning in Attention Models

This article has 1 author:
1. Shui Xiuying
This article has no evaluations. Latest version: Mar 10, 2025
