Do Transformers Always Win? An Empirical Study of Semantic Embeddings for Short-Text E-commerce Reviews
Abstract
While transformer-based embeddings such as Sentence-BERT have become the de facto standard for text representation, their performance on short, noisy industrial texts has received limited empirical scrutiny. This study compares TF-IDF, Word2Vec, and Sentence-BERT (SBERT) for clustering 100,000 Amazon product reviews. Our findings challenge prevailing assumptions: Word2Vec achieves superior clustering performance, with a Silhouette score of 0.1828 versus SBERT's 0.0401, a 356% relative improvement. We attribute this to insufficient text length for contextual modeling, domain mismatch between pre-training corpora and e-commerce reviews, and destabilizing variance in cluster centroids induced by contextualized representations. For topic modeling, Non-negative Matrix Factorization (NMF) with a Count Vectorizer achieves the highest coherence (Cv = 0.5836), while Latent Dirichlet Allocation (LDA) produces the most balanced topic distributions. These results suggest that classical methods offer compelling cost-performance advantages for short industrial texts.
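To make the clustering comparison concrete, the sketch below shows one plausible version of the evaluation pipeline: mean-pooled Word2Vec vectors versus off-the-shelf SBERT sentence embeddings, each clustered with k-means and scored by Silhouette. The toy corpus, cluster count (k = 2), and the all-MiniLM-L6-v2 checkpoint are illustrative assumptions, not details taken from the paper.

```python
import numpy as np
from gensim.models import Word2Vec
from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score

# Tiny placeholder corpus; the study uses 100,000 Amazon reviews.
reviews = [
    "great product fast shipping",
    "battery died after two days",
    "works exactly as described",
    "terrible quality broke immediately",
    "love it would buy again",
    "arrived late and box was damaged",
]
tokens = [r.split() for r in reviews]

def cluster_quality(X, k=2, seed=0):
    """Cluster with k-means and return the Silhouette score,
    the evaluation metric reported in the abstract."""
    labels = KMeans(n_clusters=k, n_init=10, random_state=seed).fit_predict(X)
    return silhouette_score(X, labels)

# Word2Vec: train on the review corpus itself, then mean-pool the
# word vectors of each review into a single document vector.
w2v = Word2Vec(tokens, vector_size=100, min_count=1, seed=0)
X_w2v = np.array([w2v.wv[t].mean(axis=0) for t in tokens])

# SBERT: off-the-shelf contextual sentence embeddings
# (checkpoint choice is an assumption, not from the paper).
sbert = SentenceTransformer("all-MiniLM-L6-v2")
X_sbert = sbert.encode(reviews)

print("Word2Vec silhouette:", cluster_quality(X_w2v))
print("SBERT silhouette:   ", cluster_quality(X_sbert))
```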
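Similarly, the topic-modeling result can be reproduced in outline: NMF fitted on Count Vectorizer features, with topic quality scored by gensim's Cv coherence. The topic count and top-word cutoff below are assumed values for illustration; the study reports Cv = 0.5836 on the full corpus.

```python
from sklearn.decomposition import NMF
from sklearn.feature_extraction.text import CountVectorizer
from gensim.corpora import Dictionary
from gensim.models.coherencemodel import CoherenceModel

# Tiny placeholder corpus standing in for the 100,000 reviews.
reviews = [
    "great product fast shipping",
    "battery died after two days",
    "works exactly as described",
    "terrible quality broke immediately",
    "love it would buy again",
    "arrived late and box was damaged",
]
tokens = [r.split() for r in reviews]

# Count Vectorizer features -> NMF topics (n_components is an assumption).
vectorizer = CountVectorizer()
X = vectorizer.fit_transform(reviews)
nmf = NMF(n_components=3, random_state=0).fit(X)

# Extract the top words per topic (top-5 cutoff is an assumption).
terms = vectorizer.get_feature_names_out()
topics = [[terms[i] for i in comp.argsort()[::-1][:5]] for comp in nmf.components_]

# Cv coherence, the metric reported in the abstract.
cm = CoherenceModel(
    topics=topics,
    texts=tokens,
    dictionary=Dictionary(tokens),
    coherence="c_v",
)
print("Cv coherence:", cm.get_coherence())
```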