Structural Recomputation Framework: A Deterministic Execution Abstraction for Memory-Constrained Dynamic Programming in Bioinformatics

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Dynamic programming (DP) algorithms are foundational to bioinformatics, yet their application to genomic-scale data is frequently constrained by a quadratic memory bottleneck. This storage requirement primarily arises from the necessity of persisting traceback matrices or trellis structures to enable path reconstruction and posterior decoding. While specialized solutions such as Hirschberg’s algorithm and algorithm-specific checkpointing schemes have been developed to mitigate these constraints, they often remain tightly coupled to specific recurrences and lack a unified execution abstraction. This work introduces the Structured Recomputation Framework (SRF), a deterministic execution-level abstraction that decouples algorithmic recurrence definitions from physical execution schedules. SRF employs a recurrence-agnostic design and a structured recomputation schedule to enforce strict, user-defined memory upper bounds. The framework’s utility is demonstrated across diverse algorithmic paradigms, including sequence alignment (Needleman-Wunsch), Hidden Markov Model evaluation (Forward and Viterbi), and graph-based dynamic programming. Empirical validation on mitochondrial DNA and Gene Ontology datasets confirms a reduction in peak working set size from O(N2) to O(N), with memory savings exceeding 99% for extreme-scale workloads. Deterministic properties are verified through cross-platform and compiler invariance checks, ensuring bit-exact reproducibility across heterogeneous compute environments. SRF provides a stable foundation for memory-efficient biological computing without requiring improvement in asymptotic time complexity, effectively trading compute cycles for spatial capacity to enable exact analyses on commodity hardware.

Article activity feed