A Performance-efficiency Analysis of Transformer Models for Code-mixed Hausa Sentiment Data
Abstract
The deployment of large language models (LLMs) in African contexts offers significant opportunities for societal innovation but is often hindered by the models' substantial computational requirements. This challenge is particularly acute for code-mixed Hausa sentiment analysis, where the state-of-the-art (SOTA) model achieves high accuracy at a considerable cost in model size and inference latency. This paper addresses this performance-efficiency trade-off by presenting the first comprehensive benchmark of the SOTA model against a suite of smaller, more efficient alternatives, including specialized African-centric and generalist models. Using the established NaijaSenti benchmark, our experiments reveal that a compact, specialized model (castorini/afriberta_small) retains over 82% of the SOTA F1-score (0.761 vs. 0.928) while being 2.25 times faster at inference. Furthermore, our results demonstrate that specialized pre-training is a critical factor: the small African-centric model significantly outperforms its generalist counterpart. We conclude that castorini/afriberta_small represents the optimal "sweet spot" for practical deployment and recommend it for applications that require a balance between high performance and computational efficiency. Our findings provide a data-driven guide for practitioners building scalable NLP solutions in the African context.
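To make the efficiency claim concrete, the following is a minimal sketch (not the authors' code) of how a practitioner might load the recommended castorini/afriberta_small checkpoint with Hugging Face Transformers and measure per-example inference latency. The three-way label head, the example sentence, and the run count are illustrative assumptions; the classification head is freshly initialized here and would need fine-tuning on NaijaSenti before its predictions are meaningful.

```python
# Sketch: timing sentiment inference with castorini/afriberta_small.
import time

import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

MODEL_ID = "castorini/afriberta_small"

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
# NaijaSenti labels tweets as positive/negative/neutral, hence num_labels=3.
# The classification head is randomly initialized and must be fine-tuned.
model = AutoModelForSequenceClassification.from_pretrained(MODEL_ID, num_labels=3)
model.eval()

text = "Wannan fim din yana da kyau sosai!"  # hypothetical Hausa example
inputs = tokenizer(text, return_tensors="pt", truncation=True)

# Warm-up pass so one-time setup cost is excluded from the timing.
with torch.no_grad():
    model(**inputs)

# Average wall-clock latency over repeated forward passes.
n_runs = 50
start = time.perf_counter()
with torch.no_grad():
    for _ in range(n_runs):
        logits = model(**inputs).logits
latency_ms = (time.perf_counter() - start) / n_runs * 1000
print(f"prediction: {logits.argmax(-1).item()}, avg latency: {latency_ms:.1f} ms")
```

Running the same loop against a larger SOTA checkpoint and comparing the averaged latencies is one simple way to reproduce a speedup ratio of the kind reported above.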