Reg2Bangla: An End-to-End Regional Speech Standardization

Samiul Basir Bhuiyan
Md Sazzad Hossain Adib
Mohammed Aman Bhuiyan
Aritra Islam Saswato
Ahmed Faizul Haque Dhrubo
Mohammad Ashrafuzzaman Khan
Mohammad Abdul Qayum

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

This paper presents a complete approach for transcribing twenty regional Bangladeshi dialects into standard Bangla text. We fine tuned the tugstugi bengaliai regional asr whisper medium model, which is a fine tuned variant of the Whisper model from OpenAI trained on external datasets. After this, we further fine tuned the model on a corpus of three thousand three hundred fifty dialectal audio recordings using the annotated label texts in the training set. We also applied post processing with an n gram KenLM language model and used KV cache optimization to achieve faster inference for the audio input. The proposed methodology tackles major challenges that arise from regional variation, pronunciation differences, and vocabulary shifts across linguistic communities in Bangladesh. The proposed pipeline shows strong performance on the evaluation set and demonstrates that a combination of pretrained multilingual speech models and targeted fine tuning, supported by post-processing techniques, can effectively manage the complexity of Bangladeshi dialectal speech.

Version published to 10.21203/rs.3.rs-9118485/v1 on Research Square
Mar 17, 2026

Benchmarking Self-Supervised Speech Models on Multilingual Nigerian Speech

This article has 2 authors:
1. Omotayo Omoyemi
2. Ifeoluwa Oladeni
This article has no evaluationsLatest version Mar 20, 2026
NE-OCR: Unified Optical Character Recognition for 10 Languages of Northeast India

This article has 1 author:
1. Badal Nyalang
This article has no evaluationsLatest version Mar 20, 2026
Towards High-Quality Machine Translation for Kokborok: A Low-Resource Tibeto-Burman Language of Northeast India

This article has 2 authors:
1. Badal Nyalang
2. Biman Debbarma
This article has no evaluationsLatest version Mar 31, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Benchmarking Self-Supervised Speech Models on Multilingual Nigerian Speech

NE-OCR: Unified Optical Character Recognition for 10 Languages of Northeast India

Towards High-Quality Machine Translation for Kokborok: A Low-Resource Tibeto-Burman Language of Northeast India