KegAlign: Optimizing pairwise alignments with diagonal partitioning

A. Burak Gulhan
Richard Burhans
Robert Harris
Mahmut Kandemir
Maximilian Haeussler
Anton Nekrutenko

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Our ability to generate sequencing data and assemble it into high quality complete genomes has rapidly advanced in recent years. These data promise to advance our understanding of organismal biology and answer longstanding evolutionary questions. Multiple genome alignment is a key tool in this quest. It is also the area which is lagging: today we can generate genomes faster than we can construct and update multiple alignments containing them. The bottleneck is in considerable computational time required to generate accurate pairwise alignments between divergent genomes, an unavoidable precursor to multiple alignments. This step is typically performed with lastZ, a very sensitive and yet equally slow tool. Here we describe an optimized GPU-enabled pairwise aligner KegAlign. It incorporates a new parallelization strategy, diagonal partitioning, with the latest features of modern GPUs. With KegAlign a typical human/mouse alignment can be computed in under 6 hours on a machine containing a single NVidia A100 GPU and 80 CPU cores without the need for any pre-partitioning of input sequences: a ∼150× improvement over lastZ. While other pairwise aligners can complete this task in a fraction of that time, none achieves the sensitivity of KegAlign’s main alignment engine, lastZ, and thus may not be suitable for comparing divergent genomes. In addition to providing the source code and a Conda package for KegAlign we also provide a Galaxy workflow that can be readily used by anyone.

Version published to 10.1101/2024.09.02.610839v1 on bioRxiv
Sep 3, 2024

FastGA: Fast Genome Alignment

This article has 3 authors:
1. Gene Myers
2. Richard Durbin
3. Chenxi Zhou
This article has no evaluationsLatest version Jun 19, 2025
2Pipe: It Starts with a Question. Matching You with the Correct Pipeline for MAG Reconstruction

This article has 2 authors:
1. Jeferyd Yepes Garcí
2. Laurent Falquet
This article has no evaluationsLatest version Jun 9, 2025
Dotplotic: a lightweight visualization tool for BLAST+ alignments and genomic annotations

This article has 2 authors:
1. Hideyuki Miyazawa
2. Toshiyuki Oda
This article has no evaluationsLatest version May 15, 2025

Listed in

Abstract

Article activity feed

Related articles

FastGA: Fast Genome Alignment

2Pipe: It Starts with a Question. Matching You with the Correct Pipeline for MAG Reconstruction

Dotplotic: a lightweight visualization tool for BLAST+ alignments and genomic annotations