The unreachable genomic profiling of complex diseases: genotype missingness matters

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

The problem of building genome-wide predictors of individual risk to complex diseases seems to be more challenging than it was thought when the first human genome was sequenced on 2003. We have build different enhanced genetic risk predictors from genome-wide data and different complex diseases, making use of haplotypes accurately ascertained from family trios. We confirmed the widely known inability to accurately predict individual risk to complex diseases returned by the state-of-the-art genome-wide predictors. This result is mainly due to the small effect that most genetic variants have in a disease. We also found out that rates of missing genotypes were usually too high for these small-effect variants, as we could force missing imputation in such a tricky way that we would build highly accurate predictors, either by using our own design or the state-of-the-art genetic predictors. We observed that unknown genotypes were not missing at random but related to disease affectation, with more missing genotypes in affected than in non-affected individuals. We were not able to find a way to accurately reduce missing rates to correctly improve accuracy, but we identified a common pattern of missing data in multiple sclerosis, asthma and autism that makes us think that other complex diseases could behave the same way. Because (1) missing rates are high enough to completely change risk prediction due to the small-effect of most of the variants, and (2) there are more missing genotypes in affected than in non affected individuals, we conclude that perhaps the widely known defeat in genomic profiling for complex diseases may be solved by looking closer to the way current genotyping technologies handle genetic variants that may be rare in reference panels but have some effect in a given complex disease.

Article activity feed