LatentRecurrentDepthLM: An Open-Source Framework for Recurrent-Depth Language Models with Controllable Test-Time Compute
Abstract
LatentRecurrentDepthLM is a modular, production-ready open-source framework implementing a hybrid recurrent-depth language model that decouples effective reasoning depth from parameter count. The model iterates a single weight-shared block over a continuous latent state, enabling controllable test-time compute scaling without generating intermediate tokens or modifying model weights. Built in PyTorch with full Hugging Face Transformers compatibility, the framework provides end-to-end pipelines for dataset preparation, tokenization, training with randomized iteration depth and cosine scheduling, autoregressive generation with temperature and top-k sampling, and one-command Hub deployment via a custom PreTrainedModel subclass. This paper documents the software architecture, core algorithms, training and inference workflows, practical use cases, and comparisons with related tools, and connects the framework to recent advances in recurrent-depth and latent reasoning research, serving researchers, educators, and practitioners exploring parameter-efficient sequence modeling. The repository (codewithdark-git/LatentRecurrentDepthLM) and the Hugging Face model checkpoint are released under the MIT license.
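The core idea described above can be illustrated with a minimal PyTorch sketch. This is not the framework's actual implementation; the block structure and function names here (`RecurrentDepthBlock`, `run_with_depth`) are illustrative assumptions, showing only how iterating one weight-shared block over a latent state makes depth a test-time knob independent of parameter count.

```python
import torch
import torch.nn as nn

class RecurrentDepthBlock(nn.Module):
    """Hypothetical single block whose weights are reused at every depth step."""
    def __init__(self, d_model: int):
        super().__init__()
        self.norm = nn.LayerNorm(d_model)
        self.ff = nn.Linear(d_model, d_model)

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        # Residual update of the continuous latent state.
        return h + self.ff(self.norm(h))

def run_with_depth(block: nn.Module, h: torch.Tensor, num_iters: int) -> torch.Tensor:
    """Apply the same block num_iters times: more iterations = more
    test-time compute, with no new parameters and no intermediate tokens."""
    for _ in range(num_iters):
        h = block(h)
    return h

block = RecurrentDepthBlock(d_model=16)
h0 = torch.randn(2, 5, 16)          # (batch, seq, hidden) latent state
shallow = run_with_depth(block, h0, num_iters=2)
deep = run_with_depth(block, h0, num_iters=8)  # deeper reasoning, same weights
```

During training, the framework's randomized iteration depth would correspond to sampling `num_iters` per batch so the model learns to be useful at many depths; at inference, the caller chooses the depth to trade compute for quality.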