Leveraging Large Language Models for Document Classification in the Banking Sector

Rómulo Nogueira
Hugo Mentzingen
Nuno Garcia

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Document classification serves as foundational step in critical tasks such as information extraction, analysis and decision-making. However, existing approaches often struggle with the variability, volume, and complexity of real-world documents. These methods are further limited by a lack of configurability and explainability, requiring specialized technical expertise to accommodate diverse user needs and often producing results that are difficult to interpret. To address the complexities of modern document processing, this paper introduces a novel zero-shot document classification framework that leverages Large Language Models (LLMs), designed for accessibility and configurability by both technical and non-technical users. Unlike traditional methods, which require extensive labeled data, the zero-shot configuration enables our framework to perform the classification task without any prior exposure to labeled examples of the target categories, relying instead on semantic understanding derived from user-provided label descriptions and document content. Developed and validated using a real-world banking dataset, our framework leverages different strategies for providing context to LLMs during classification. Experimental results demonstrate substantial improvements in both accuracy and efficiency, outperforming current zero-shot methods while also reducing operating costs.

Version published to 10.21203/rs.3.rs-7511605/v1 on Research Square
Oct 27, 2025

A Comprehensive Evaluation of Llama 3 for Text Classification Tasks

This article has 4 authors:
1. AmirAhmad Amjadi
2. Shiva TaghipourEivazi
3. Bahman Arasteh
4. Huseyin Kusetogullari
This article has no evaluationsLatest version Dec 23, 2025
Advancing Sentiment Analysis in Gujarati: Performance Enhancement through a Hybrid Annotation Framework

This article has 2 authors:
1. Neha Shah¹
2. Preeti Baser²
This article has no evaluationsLatest version Jan 6, 2026
LLM Aspect Prediction: Reviewing Academic Papers from Different Aspects with Large Language Model

This article has 3 authors:
1. Zihao Hu
2. Fumiyo Fukumoto
3. Dongjin Yu
This article has no evaluationsLatest version Dec 11, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Comprehensive Evaluation of Llama 3 for Text Classification Tasks

Advancing Sentiment Analysis in Gujarati: Performance Enhancement through a Hybrid Annotation Framework

LLM Aspect Prediction: Reviewing Academic Papers from Different Aspects with Large Language Model