Paraphrase Tremors: Uncovering TinyLLaMA’s Sensitivity to Subtle Rewordings

Abstract

Natural language instructions can be phrased in countless ways by end users, yet small language models intended for on-device or low-resource deployment may react unpredictably to minor paraphrasing noise. In this paper, we quantify the sensitivity of a 1.1B-parameter chat model (TinyLLaMA-1.1B-Chat) to two independent paraphrases of each prompt, generated by a T5-based paraphrasing strategy. We evaluate response drift across three metrics (embedding cosine similarity, BERTScore, and BLEU) on 80 Alpaca-style instruction prompts. Our results show an average embedding drift of 0.118 (±0.085), with surface-form BLEU drifting by 0.098 and semantic BERTScore-F1 by 0.025. Correlation analysis reveals only a weak, statistically non-significant link (r=0.127, p=0.263) between prompt variation and response variation: how far a paraphrase strays from the original prompt is a poor predictor of how far the response drifts. Qualitative failure cases illustrate domain misinterpretation, style oscillation, and task misclassification even when paraphrases remain semantically close. These findings highlight the need for robustness-aware prompt engineering in small-scale LLM deployments.
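To make the measurement concrete, the sketch below shows one way to compute the per-prompt embedding drift and the prompt/response correlation described above. It is a minimal illustration under stated assumptions, not the authors' released code: the embedding model (all-MiniLM-L6-v2), the function names, and the data layout are hypothetical. The BERTScore-F1 and BLEU drifts would be computed analogously over the same response pairs, for example with the bert-score and sacrebleu packages.

```python
# Minimal sketch of the drift measurement described in the abstract.
# Assumptions (not from the paper): the embedding model name, the
# function names, and the data layout are illustrative only.
import numpy as np
from scipy.stats import pearsonr
from sentence_transformers import SentenceTransformer

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedder

def embedding_drift(text_a: str, text_b: str) -> float:
    """Drift = 1 - cosine similarity of the two texts' embeddings."""
    emb = embedder.encode([text_a, text_b], normalize_embeddings=True)
    return float(1.0 - np.dot(emb[0], emb[1]))

def drift_correlation(prompt_pairs, response_pairs):
    """For each prompt, take its two paraphrases and the model's two
    responses; correlate prompt-side drift with response-side drift."""
    prompt_drift = [embedding_drift(p1, p2) for p1, p2 in prompt_pairs]
    response_drift = [embedding_drift(r1, r2) for r1, r2 in response_pairs]
    r, p = pearsonr(prompt_drift, response_drift)
    return float(np.mean(response_drift)), r, p
```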
