Reference-based variant detection with varseek
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Variant detection from sequencing data is fundamental for genomics and is the first step in a wide range of applications, ranging from genome-wide association studies to disease diagnosis. Widely used tools for variant detection utilize a de novo approach that is based on a combination of read mapping algorithms and statistical methods for identifying genetic variation from error-prone sequencing data. This approach has been successful, although the detection of insertion and deletion variants, as well as the detection of variants from low-coverage data, remain challenging problems. We introduce varseek, a reference-based approach to variant detection that provides large improvements in performance in these challenging cases. The varseek approach utilizes a k-mer pseudoalignment approach, which provides the ability to identify variants at single-cell resolution in single-cell transcriptomics data. We showcase the versatility and performance of varseek for detecting tumor-specific COSMIC variants in glioblastoma single-cell sequencing.