LFOSum: Summarizing Long-form Opinions with Large Language Models

Mir Tafseer Nayeem
Davood Rafiei

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Online reviews play a pivotal role in influencing consumer decisions across various domains, from purchasing products to selecting hotels or restaurants. However, the sheer volume of reviews—often containing repetitive or irrelevant content—leads to information overload, making it challenging for users to extract meaningful insights. Traditional opinion summarization models face challenges in handling long inputs and large volumes of reviews, while newer Large Language Model (LLM) approaches often fail to generate accurate and faithful summaries. To address those challenges, this paper introduces (1) a new dataset of long-form user reviews, each entity comprising over a thousand reviews, (2) two training-free LLM-based summarization approaches that scale to long inputs, and (3) automatic evaluation metrics. Our dataset of user reviews is paired with in-depth and unbiased critical summaries by domain experts, serving as a reference for evaluation. Additionally, our novel reference-free evaluation metrics provide a more granular, context-sensitive assessment of summary faithfulness. We benchmark several open-source and closed-source LLMs using our methods. Our evaluation reveals that LLMs still face challenges in balancing sentiment and format adherence in long-form summaries, though open-source models can narrow the gap when relevant information is retrieved in a focused manner1

Version published to 10.32388/d1mvb5
Oct 28, 2024

LLM Aspect Prediction: Reviewing Academic Papers from Different Aspects with Large Language Model

This article has 3 authors:
1. Zihao Hu
2. Fumiyo Fukumoto
3. Dongjin Yu
This article has no evaluationsLatest version Dec 11, 2025
Identifying Customer Priorities in Online Reviews through Sequence-to-Sequence Learning with Dual Contextual Attention

This article has 3 authors:
1. Sam Rahimzadeh Holagh
2. Jinfeng Zhou
3. Bugao Xu
This article has no evaluationsLatest version Dec 15, 2025
DiLLaB: Discussion Labeling with LLMs for Building Datasets

This article has 6 authors:
1. Ludimila Gonçalves
2. Márcia Lima
3. André Carvalho
4. Walter Nakamura
5. Igor Steinmacher
6. Tayana Conte
This article has no evaluationsLatest version Jan 28, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

LLM Aspect Prediction: Reviewing Academic Papers from Different Aspects with Large Language Model

Identifying Customer Priorities in Online Reviews through Sequence-to-Sequence Learning with Dual Contextual Attention

DiLLaB: Discussion Labeling with LLMs for Building Datasets