Reasoning Distillation by Prompt Optimization
Abstract
Large Language Models (LLMs) increasingly rely on explicit chain-of-thought (CoT) reasoning to produce reliable and interpretable outputs. However, transferring such reasoning behaviour from one system to another typically requires expensive dataset construction, full-model retraining, or large task-specific corpora for distillation. In this work, we introduce a lightweight prompt-level distillation framework that aligns a student model with the reasoning patterns of an external 'parent' source, which may be a larger model or a human expert. Instead of constructing a curated supervision dataset, our method operates solely on reasoning traces generated by the parent. These traces serve as optimization targets for an automated prompt-search procedure that improves the logical consistency and step-wise reasoning of the student without modifying its parameters. Across multiple reasoning benchmarks, we show that prompt-level distillation substantially narrows the performance gap between student and parent models while eliminating the cost of dataset preparation and model training. This approach offers a practical pathway for disseminating high-quality reasoning behaviours in settings where computational resources, data availability, or human labour are limited.
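To make the mechanism concrete, the sketch below shows one plausible form of the prompt-search loop: candidate prompts are scored by how closely the student's reasoning traces align with the parent's traces, and the best-scoring prompt is kept. It assumes the student is exposed as a plain text-in/text-out callable; the helper names, the token-overlap scorer, and the hint pool are illustrative assumptions, not the authors' implementation.

```python
# Illustrative sketch of prompt-level reasoning distillation.
# All names (jaccard_alignment, mutate_prompt, distill_prompt, HINTS)
# are hypothetical placeholders, not the paper's API.
import random
from typing import Callable

def jaccard_alignment(student_trace: str, parent_trace: str) -> float:
    """Crude trace-alignment score: token-level Jaccard overlap.
    A real system might use embedding similarity or an LLM judge instead."""
    a = set(student_trace.lower().split())
    b = set(parent_trace.lower().split())
    return len(a & b) / len(a | b) if (a | b) else 0.0

# Hypothetical edit pool for the prompt-mutation operator.
HINTS = [
    "Reason step by step before answering.",
    "State each intermediate conclusion explicitly.",
    "Check that each step follows from the previous one.",
]

def mutate_prompt(prompt: str) -> str:
    """Propose a neighbouring prompt by appending a random reasoning hint."""
    return prompt + " " + random.choice(HINTS)

def distill_prompt(student: Callable[[str], str],
                   parent_traces: dict[str, str],
                   seed_prompt: str,
                   iterations: int = 50) -> str:
    """Hill-climb over prompts so the student's reasoning traces align
    with the parent's, leaving the student's parameters untouched."""
    best_prompt, best_score = seed_prompt, -1.0
    for _ in range(iterations):
        candidate = mutate_prompt(best_prompt)
        # Average alignment of student traces to parent traces across
        # all questions for which the parent provided a trace.
        score = sum(
            jaccard_alignment(student(candidate + "\n" + q), trace)
            for q, trace in parent_traces.items()
        ) / len(parent_traces)
        if score > best_score:
            best_prompt, best_score = candidate, score
    return best_prompt
```

In a real pipeline, the alignment scorer and the mutation operator would be replaced by whatever trace-matching objective and prompt-edit operators the automated search procedure actually uses; the loop structure, in which parent traces act only as optimization targets, is the point of the sketch.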