Enhancing Thai Sentiment Analysis with WangchanBERTa and Instance-Class-Aware Dynamic Kernel Selection

Panida Songram
Suchart Khummanee
Jirapond Muangprathub
Khanabhorn Kawattikul

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Thai sentiment analysis on social media remains challenging due to the lack of explicit word boundaries, informal expressions, code-mixing, short text length, and severe class imbalance, particularly in the WISESIGHT benchmark dataset. Although transformer-based models such as WangchanBERTa provide strong contextual representations for Thai language processing, conventional hybrid architectures using fixed convolutional kernels may fail to capture diverse local semantic patterns across different sentiment classes. To address this limitation, this study proposes WangchanBERTa with Instance-Class-Aware Dynamic Kernel Selection (WangchanBERTa-IC-DKS), a novel hybrid framework that integrates contextualized embeddings with multi-kernel convolutional neural networks and dynamic kernel selection at both the instance and class levels. The proposed model adaptively determines kernel importance according to sentence semantics and class-specific sentiment characteristics, enabling more effective local feature extraction and improved minority-class recognition. Experiments were conducted on the WISESIGHT sentiment benchmark using four kernel candidates: [2, 3, 4], [3, 4, 5], [2, 3, 4, 5], and [1, 2, 3, 4, 5]. The kernel candidate [2, 3, 4, 5] achieves the best performance, with a mean macro-F1 of 63.93% averaged across three independent runs using different random seeds. Ablation studies further show that combining both class-aware and instance-aware dynamic kernel selection improves balanced classification performance compared with fixed multi-kernel CNN baselines. The proposed model also outperforms the previous state-of-the-art Parallel Hybrid model by 1.23 percentage points on the WISESIGHT dataset and by 1.36 percentage points on the 40 Thai children’s tales dataset. These findings demonstrate that dynamic kernel selection is more effective than fixed convolutional kernels across the two evaluated Thai sentiment benchmarks, particularly for short, noisy, and imbalanced Thai texts.

Version published to 10.3390/app16115310
May 25, 2026
Version published to 10.20944/preprints202604.0631.v1
Apr 9, 2026

A Novel Hybrid Deep Learning Model for Aspect-Based Sentiment Analysis (ABSA)

This article has 3 authors:
1. PRIYANKA SHAKYA
2. AANCHAL SINGH
3. Ashish Kumar Mishra
This article has no evaluationsLatest version Apr 9, 2026
Exploring Large Language Models for Multitask Learning in Bengali Text Classification

This article has 4 authors:
1. Md. Sajjad Hossain
2. Kawsar Ahmed
3. Suny Md Ashraf Khan
4. Mohammed Moshiul Hoque
This article has no evaluationsLatest version Apr 3, 2026
Optimizing Fake News Detection in Low-Resource Languages: A Comparative Study of Deep Learning Models Using Sentence-Level FastText Vectors in Kurdish and English

This article has 2 authors:
1. Azad M. Karim¹
2. Bryar A.Hassn
This article has no evaluationsLatest version Apr 15, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Novel Hybrid Deep Learning Model for Aspect-Based Sentiment Analysis (ABSA)

Exploring Large Language Models for Multitask Learning in Bengali Text Classification

Optimizing Fake News Detection in Low-Resource Languages: A Comparative Study of Deep Learning Models Using Sentence-Level FastText Vectors in Kurdish and English