Semantic Bridges for Student Modeling: Leveraging LLM-Generated Narratives for Interpretable Representation Learning
Abstract
Student modeling plays a central role in adaptive learning systems. However, current methods often require a trade-off between representational richness and interpretability. Traditional feature engineering produces interpretable but semantically shallow representations, while deep learning methods achieve rich embeddings at the cost of explainability. We propose a framework that utilizes Large Language Model (LLM)-generated narratives as semantic bridges between structured educational data and dense vector representations. Unlike representations learned from scratch, LLM-derived embeddings inherit semantic structure from pretraining on vast text corpora, encoding domain knowledge about education, academic performance, and student success factors. Our approach generates human-readable narratives from student data, then extracts embeddings that preserve both interpretability (through the narrative layer) and computational utility (through dense vectors). We present a comprehensive quality assessment framework evaluating narratives across five dimensions: content quality, linguistic diversity, coherence, uniqueness, and tone appropriateness. Empirical evaluation on 3,169 undergraduate students demonstrates strong content coverage and coherence, with excellent uniqueness.
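To make the pipeline sketched above concrete (structured student record → human-readable narrative → dense vector), here is a minimal illustrative sketch. The record schema, `generate_narrative` (a template standing in for the LLM call), and `embed` (a toy hashed bag-of-words encoder standing in for an LLM-derived sentence embedding) are all assumptions for illustration, not the authors' implementation:

```python
import hashlib
import math

def generate_narrative(record: dict) -> str:
    # Template stand-in for the LLM narrative-generation step (illustrative only).
    return (
        f"{record['name']} is a year-{record['year']} undergraduate with a "
        f"GPA of {record['gpa']:.2f}, attending {record['attendance']:.0%} "
        f"of classes. Recent performance suggests "
        f"{'steady progress' if record['gpa'] >= 3.0 else 'a need for support'}."
    )

def embed(text: str, dim: int = 64) -> list[float]:
    # Toy hashed bag-of-words embedding; a real system would use an
    # LLM or sentence-encoder embedding of the narrative instead.
    vec = [0.0] * dim
    for token in text.lower().split():
        h = int(hashlib.md5(token.encode()).hexdigest(), 16)
        vec[h % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

record = {"name": "Student A", "year": 2, "gpa": 3.4, "attendance": 0.91}
narrative = generate_narrative(record)   # interpretable narrative layer
vector = embed(narrative)                # dense, computable representation
```

The two-stage design is the point: the narrative remains inspectable by instructors, while downstream models consume only the vector.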