LieOTAlign: A Differentiable Protein Structure Alignment Framework Combining Optimal Transport and Lie Algebra

Yue Hu
Zanxia Cao
Yingchao Liu

Abstract

The comparison of protein structures is fundamental to understanding biological function and evolutionary relationships. Existing methods, while powerful, often rely on heuristic search algorithms and non-differentiable scoring functions, which limits their direct integration into end-to-end deep learning pipelines. This paper introduces LieOTAlign, a novel and fully differentiable protein structure alignment framework built on the mathematical principles of Lie algebra and Optimal Transport (OT). LieOTAlign represents rigid-body transformations in the Lie algebra of SE(3), which intrinsically preserves the geometric validity of rotations and translations during optimization. We formulate the alignment task as an optimal transport problem, seeking the most efficient mapping between two protein structures. This formulation yields the Sinkhorn score, a differentiable version of the TM-score derived from the entropically regularized OT solution computed via the Sinkhorn algorithm. Because the entire LieOTAlign pipeline is differentiable, gradient-based optimizers such as AdamW can be used to maximize structural similarity. Benchmarking against the official TM-align on the RPIC dataset shows that LieOTAlign can identify longer, topologically significant alignments and achieve higher TM-scores. Although its RMSD is currently higher than that of TM-align, LieOTAlign provides a powerful and flexible framework for protein structure alignment, paving the way for its integration into next-generation deep learning models for diverse bioinformatics challenges.
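The ingredients named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The function names (`hat`, `se3_exp`, `tm_d0`, `sinkhorn`, `sinkhorn_score`), the uniform marginals, the regularization strength `eps`, the 0.5 Å floor on d0, and the use of a negated TM-style similarity as the transport cost are all assumptions made for illustration. A 6-vector in se(3) is mapped to a rotation and translation by the exponential map, so any parameter value yields a valid rigid motion. An entropically regularized transport plan then weights the TM-style per-pair similarities.

```python
import numpy as np

def hat(w):
    """Map a 3-vector to its skew-symmetric matrix in so(3)."""
    x, y, z = w
    return np.array([[0.0, -z, y], [z, 0.0, -x], [-y, x, 0.0]])

def se3_exp(xi):
    """Exponential map: 6-vector (omega, v) in se(3) -> rotation R, translation t."""
    omega, v = xi[:3], xi[3:]
    theta = np.linalg.norm(omega)
    K = hat(omega)
    if theta < 1e-8:
        # First-order expansion near the identity avoids dividing by zero.
        return np.eye(3) + K, v.copy()
    A = np.sin(theta) / theta
    B = (1.0 - np.cos(theta)) / theta**2
    C = (theta - np.sin(theta)) / theta**3
    R = np.eye(3) + A * K + B * (K @ K)   # Rodrigues formula
    V = np.eye(3) + B * K + C * (K @ K)   # couples rotation and translation
    return R, V @ v

def tm_d0(L):
    """TM-score distance scale; the 0.5 floor for short chains is an assumption."""
    return max(0.5, 1.24 * np.cbrt(L - 15.0) - 1.8)

def sinkhorn(cost, eps=0.05, n_iter=200):
    """Entropic OT plan with uniform marginals (assumed) via Sinkhorn iterations."""
    n, m = cost.shape
    a, b = np.full(n, 1.0 / n), np.full(m, 1.0 / m)
    K = np.exp(-cost / eps)
    u, v = np.ones(n), np.ones(m)
    for _ in range(n_iter):
        u = a / (K @ v)       # rescale rows toward marginal a
        v = b / (K.T @ u)     # rescale columns toward marginal b
    return u[:, None] * K * v[None, :]

def sinkhorn_score(xi, X, Y, eps=0.05):
    """Soft, TM-like similarity between X (moved by exp(xi)) and Y."""
    R, t = se3_exp(np.asarray(xi, dtype=float))
    Xt = X @ R.T + t
    d2 = ((Xt[:, None, :] - Y[None, :, :]) ** 2).sum(-1)  # squared distances
    d0 = tm_d0(min(len(X), len(Y)))
    sim = 1.0 / (1.0 + d2 / d0**2)     # TM-score kernel for every residue pair
    P = sinkhorn(-sim, eps)            # assumed cost: negated similarity
    return float((P * sim).sum())

# Toy check on synthetic coordinates (not real protein data).
rng = np.random.default_rng(0)
X = 10.0 * rng.normal(size=(30, 3))
xi_true = np.array([0.3, -0.2, 0.5, 5.0, -3.0, 2.0])
R_true, t_true = se3_exp(xi_true)
Y = X @ R_true.T + t_true
print("score at identity:", sinkhorn_score(np.zeros(6), X, Y))
print("score at true xi :", sinkhorn_score(xi_true, X, Y))
```

NumPy provides no automatic differentiation, so this sketch only evaluates the score. Every step is a smooth function of `xi`, so the same computation written in an autodiff framework could, in principle, be maximized with a gradient-based optimizer such as AdamW, as the abstract describes.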

Version published to 10.1101/2025.08.21.671657 on bioRxiv
Aug 25, 2025
