Deep Learning Based Optimization of Large Language Models for Code Generation
Abstract
To improve the performance of code generation systems in semantic modeling and structural dependency construction, a deep learning-based multi-layer Transformer encoder-decoder architecture is constructed. The overall architecture stacks 12 Transformer modules, each combining a multi-head self-attention mechanism (8 attention heads) with a position-wise feed-forward network (dimension 2048) to strengthen contextual modeling. The encoder accepts input sequences of up to 512 tokens and fuses positional embeddings with semantic embeddings generated by Item2Vec, enabling accurate capture of variable dependencies and syntactic hierarchy. The decoder introduces multi-task objectives, jointly performing code completion and semantic annotation to improve generalization and contextual adaptation. The platform further combines a neural collaborative filtering structure with a multimodal semantic fusion strategy, significantly enhancing its joint understanding of user behavioral features and the code structure graph. The resulting model reaches 47.82 BLEU-4, 53.67 CodeBLEU, and 28.94 Exact Match, while inference latency is reduced from 257 ms to 87 ms, demonstrating strong accuracy and response efficiency.
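The following is a minimal PyTorch sketch of the configuration described above (12 layers, 8-head attention, 2048-dim feed-forward network, 512-token input cap, fused positional and semantic embeddings, and dual task heads for completion and annotation). The model dimension of 512, the tag-set size, and all class, argument, and head names are illustrative assumptions, not specifications from the paper.

```python
import torch
import torch.nn as nn

class CodeGenTransformer(nn.Module):
    """Sketch of a 12-layer encoder-decoder with 8-head self-attention,
    a 2048-dim position-wise feed-forward network, and multi-task heads.
    Hyperparameters not stated in the abstract (d_model, n_tags) are assumed."""

    def __init__(self, vocab_size, d_model=512, n_heads=8, n_layers=12,
                 d_ff=2048, max_len=512, n_tags=32):
        super().__init__()
        # Semantic token embeddings (could be initialized from Item2Vec vectors)
        self.token_emb = nn.Embedding(vocab_size, d_model)
        # Learned positional embeddings for sequences up to 512 tokens
        self.pos_emb = nn.Embedding(max_len, d_model)

        enc_layer = nn.TransformerEncoderLayer(
            d_model, n_heads, dim_feedforward=d_ff, batch_first=True)
        dec_layer = nn.TransformerDecoderLayer(
            d_model, n_heads, dim_feedforward=d_ff, batch_first=True)
        self.encoder = nn.TransformerEncoder(enc_layer, num_layers=n_layers)
        self.decoder = nn.TransformerDecoder(dec_layer, num_layers=n_layers)

        # Multi-task heads: code completion (next-token prediction)
        # and semantic annotation (per-token tag classification)
        self.completion_head = nn.Linear(d_model, vocab_size)
        self.annotation_head = nn.Linear(d_model, n_tags)

    def embed(self, ids):
        # Fuse positional and semantic embeddings by summation
        pos = torch.arange(ids.size(1), device=ids.device).unsqueeze(0)
        return self.token_emb(ids) + self.pos_emb(pos)

    def forward(self, src_ids, tgt_ids):
        memory = self.encoder(self.embed(src_ids))
        tgt_mask = nn.Transformer.generate_square_subsequent_mask(
            tgt_ids.size(1)).to(src_ids.device)
        hidden = self.decoder(self.embed(tgt_ids), memory, tgt_mask=tgt_mask)
        # Joint outputs for the two training objectives
        return self.completion_head(hidden), self.annotation_head(hidden)
```

In this sketch, the two heads share the decoder's hidden states, so the completion and annotation losses can simply be summed (or weighted) during training; the neural collaborative filtering and multimodal fusion components mentioned in the abstract are outside its scope.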