TypeAssembly: Copy number estimation and allele typing for haplotype assemblies
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Accurately annotating complex genes in the human genome, particularly from haplotype assemblies, remains a significant challenge. To overcome this, we developed TypeAssembly, a local alignment-based framework for copy number estimation and allele typing. Operating in two modes, mode-FASTA and mode-VCF, TypeAssembly can define alleles by either sequence or variant information. We successfully applied it to annotate 41 genes in the MHC locus, 17 KIR genes, and, for the first time, 15 pharmacogenes across 466 haplotype assemblies. This study establishes TypeAssembly as a robust method for accurately annotating complex genomic regions and provides an evaluation of existing gene annotations and callers.