Fine-Tuning and Explaining FinBERT for Sector-Specific Financial News: A Reproducible Workflow

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

The increasing use of complex “black-box” models for financial news sentiment analysis presents a challenge in high-stakes settings where transparency and trust are paramount. This study introduces and validates a finance-focused, fully reproducible, open-source workflow for building, explaining, and evaluating sector-specific sentiment models mapped to standard market taxonomies and investable proxies. We benchmark interpretable and transformer-based models on public datasets and a newly constructed, manually annotated gold-standard corpus of 1500 U.S. sector-tagged financial headlines. While a zero-shot FinBERT establishes a reasonable baseline (macro F1 = 0.555), fine-tuning on our gold data yields a robust macro F1 = 0.707, a substantial uplift. We extend explainability to the fine-tuned FinBERT with Integrated Gradients (IG) and LIME and perform a quantitative faithfulness audit via deletion curves and AOPC; LIME is most faithful (AOPC = 0.365). We also quantify the risks of weak supervision: accuracy drops (−21.0%) and explanations diverge (SHAP rank ρ = 0.11) relative to gold-label training. Crucially, econometric tests show the sentiment signal is reactive, not predictive, of next-day returns; yet it still supports profitable sector strategies (e.g., Technology long-short Sharpe 1.88). Novelty lies in a finance-aligned, sector-aware, trustworthiness blueprint that pairs fine-tuned FinBERT with audited explanations and uncertainty checks, all end-to-end reproducible and tied to investable sector ETFs.

Article activity feed