Hybrid Architectures for Chinese Text Processing: Optimizing LLaMA2 with CNN and LSTM

Xize Liu
Yiyi Wang
Nana Niu
Bingyan Zhang
Jingsheng Li

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In the rapidly evolving field of natural language processing (NLP), the processing of the Chinese language, with its unique complexities, presents significant challenges, especially in the context of Large Language Models (LLMs) like LLaMA2. These challenges are further exacerbated by the presence of non-standardized text prevalent across digital Chinese content. To address these challenges, this paper proposes a novel hybrid approach that seamlessly integrates deep contextual embeddings with Convolutional Neural Networks (CNNs) to enhance the processing of standardized Chinese text. Our approach involves a multi-stage process wherein deep contextual embeddings are first utilized to capture the nuanced semantic relationships within text. Following this, CNNs are employed to identify and exploit structural and syntactic patterns, facilitating a comprehensive understanding of the text. This hybrid model significantly improves LLaMA2’s efficiency and accuracy across various Chinese text processing tasks by ensuring that both semantic depth and structural nuances are accurately captured. The effectiveness of our model is demonstrated through rigorous testing across several benchmarks, showcasing its superiority in processing Chinese text with enhanced accuracy and speed. This research not only contributes to the advancement of text processing capabilities of LLMs but also opens new avenues for their application in tasks such as automated translation and sentiment analysis.

Version published to 10.20944/preprints202410.1643.v1
Oct 22, 2024

Fusion of Local and Global Context in Large Language Models for Text Classification

This article has 5 authors:
1. Ran Hao
2. Xin Hu
3. Jiasen Zheng
4. Chong Peng
5. Junjiang Lin
This article has no evaluationsLatest version Sep 19, 2025
A Deep Learning Approach for Multilingual Sentiment Analysis

This article has 3 authors:
1. Bablu Pramanik
2. Santanu Modak
3. Chayan Paul
This article has no evaluationsLatest version Sep 11, 2025
Parameter-Efficient Fine-Tuning (PEFT) Approaches for Large Language Models: A Comparative Analysis on AG News

This article has 1 author:
1. Asmaa Mohammed Shuibi
This article has no evaluationsLatest version Oct 10, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Fusion of Local and Global Context in Large Language Models for Text Classification

A Deep Learning Approach for Multilingual Sentiment Analysis

Parameter-Efficient Fine-Tuning (PEFT) Approaches for Large Language Models: A Comparative Analysis on AG News