Semantic Context-Aware Automated Front-End Code Generation for Mobile applications using a Vision Language Code Transformer

Suji Jose
Philip Samuel
Sumam Mary Idicula

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Automated front-end Code Generation (CG) is required due to the rising demand for fast and error-free development of mobile applications. Existing research has overlooked semantic context-aware automated front-end code generation for mobile applications, resulting in less reliable generated interfaces. Thus, this paper proposes a novel Visual Top-k Attention Bidirectional Encoder Representations from Transformers (VisualTABERT) and Code Longformer Attention Text-To-Text Transfer Transformer (CodeLAT5+) enabled semantic context-aware automated front-end CG system. First, the mobile application screenshots are collected and preprocessed. The UI components are then segmented using a Spatial Pyramid Pooling–enhanced YOLOv8 (SPP-YOLOv8) model, followed by text extraction and attribute identification for each detected element. Similar UI elements are grouped based on the extracted text attributes and segmented UI elements employing Density Survival Function Based Spatial Clustering of Applications with Noise (DSFBSCAN). Then using a VisualTABERT based module, the semantic relationships between UI elements and their associated textual content are established. Consequently, by using Affine Polynomial Kernel Transformation (APKT), the coordinate alignment is done for the UI elements. According to the grouped similar UI elements, semantic relationships, and coordinate alignment, the structured intermediate representation is performed via Schema Mapping (SM). Finally, CodeLAT5+ generates the target source code by utilizing the structured intermediate representation obtained from the preceding transformation process. As per the outcome, the proposed model achieved a higher Mean Reciprocal Rank (MRR) (0.9256) than the conventional methods.

Version published to 10.21203/rs.3.rs-8405699/v1 on Research Square
Jan 9, 2026

PRIME: Prompt Refinement via Information-driven Methods and Expansion, A Modular Framework for Context-Aware Prompt Amplification

This article has 1 author:
1. Rajesh More
This article has no evaluationsLatest version Jan 27, 2026
Compositional AI-Service Pipeline to Generate Interactive Structured-Data from Scanned Images

This article has 4 authors:
1. Anthony Savidis
2. Yannis Valsamakis
3. Theodoros Chalkidis
4. Stephanos Soultatos
This article has no evaluationsLatest version Jan 19, 2026
<p style="-qt-block-indent: 0; text-indent: 0px; margin: 0px;">AttnLink: Enhancing Cross-Modal Fusion for Robust Image-to-PointCloud Place Recognition

This article has 2 authors:
1. Ziyu Fang
2. Minghao Ye
This article has no evaluationsLatest version Jan 14, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

PRIME: Prompt Refinement via Information-driven Methods and Expansion, A Modular Framework for Context-Aware Prompt Amplification

Compositional AI-Service Pipeline to Generate Interactive Structured-Data from Scanned Images

<p style="-qt-block-indent: 0; text-indent: 0px; margin: 0px;">AttnLink: Enhancing Cross-Modal Fusion for Robust Image-to-PointCloud Place Recognition