Fine-Tuned LLM Workflows for Feature Extraction from App Reviews
Abstract
App reviews contain actionable signals for improvement, but colloquial language, abbreviations, and fuzzy discourse boundaries make automatic extraction difficult. We build and compare single- and multi-node (sequential and parallel) workflows using a fine-tuned LLM extractor. The multi-node designs combine non-fine-tuned and fine-tuned models, parameter variation, and domain-specific instructions at inference. On a public benchmark, the single-node LLM outperforms representative baselines, improving Weighted F1 by up to +69.7% and substantially increasing recall. In multi-node comparisons, using fine-tuned models for all nodes performs best in both sequential and parallel settings, while adding domain-specific instructions at inference harms consistency and accuracy. Overall, (i) preserving the training-time output formats and instructions at inference and (ii) avoiding model mixing and excessive prompt changes are key to maximizing the performance of LLM-based feature extraction. These findings provide practical guidance for deployment-oriented extraction workflows with LLMs.
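To make the workflow structure concrete, below is a minimal sketch of a sequential two-node pipeline of the kind the abstract describes. It is not the authors' implementation: the function names, prompts, and model identifier (`generate`, `EXTRACT_PROMPT`, `REFINE_PROMPT`, `ft-extractor`) are illustrative placeholders, and `generate` must be wired to an actual LLM endpoint.

```python
# Hypothetical sketch of a sequential two-node feature-extraction workflow.
# Per the abstract's finding, the same fine-tuned model is used at every node,
# and the inference prompts mirror the training-time instruction/output format.

from typing import List

EXTRACT_PROMPT = (
    "Extract app features mentioned in the review below. "
    "Return one feature per line.\n\nReview: {review}"
)
REFINE_PROMPT = (
    "Deduplicate and normalize the candidate features below. "
    "Return one feature per line.\n\nCandidates:\n{candidates}"
)

def generate(model: str, prompt: str, temperature: float = 0.0) -> str:
    """Placeholder for a call to a fine-tuned LLM (local or hosted API)."""
    raise NotImplementedError("wire this to your LLM endpoint")

def extract_features(review: str, model: str = "ft-extractor") -> List[str]:
    # Node 1: the fine-tuned extractor proposes candidate features.
    candidates = generate(model, EXTRACT_PROMPT.format(review=review))
    # Node 2: a second node refines the candidates. Reusing the fine-tuned
    # model here (rather than mixing in a non-fine-tuned one) reflects the
    # best-performing configuration reported in the abstract.
    refined = generate(model, REFINE_PROMPT.format(candidates=candidates))
    return [line.strip() for line in refined.splitlines() if line.strip()]
```

A parallel variant would instead fan the review out to several such nodes and merge their outputs; the abstract's conclusion applies either way: keep the model and prompt format consistent across nodes rather than adding new domain-specific instructions at inference.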