Comprehensive detection of structural variations in long and short reads dataset of French cattle

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Structural variants (SVs) correspond to different types of genomic variants larger than 50 bp. Many findings suggest the use of long rather than short reads to improve the accuracy of SV detection. Here, we present the results of an in-depth analysis for detection of SVs, mainly large insertions and deletions, in 14 French bovine breeds, based on whole-genome data comprising 176 long-read and 571 short-read samples, with 154 individuals having both long- and short-read data available. We first investigated possible biases on the performances of well-known SV detection tools, namely CUTESV, PBSV, and SNIFFLES, using long reads from different technologies, including PacBio HiFi, Oxford ONT, and PacBio CLR. We subsequently highlighted the abilities of tools for detecting SVs (DELLY, LUMPY, and MANTA) and for genotyping known SVs (GRAPHTYPER, SVTYPER, PARAGRAPH, and VG toolkit) using short-read data. We then show how the incremental composition of samples in the reference panel affected the SV genotyping for six validation individuals sequenced in short reads. We then searched for the optimal parameters and created the final SV reference panel consisting of 25,191 deletions and 30,118 insertions. Finally, we emphasized the landscape of the genotyped SVs segregating across 571 short-read individuals of 14 breeds.

Article activity feed