Revealing the range of maximum likelihood estimates in the admixture model

Carola Sophia Heinzel
Franz Baumdicker
Peter Pfaffelhuber

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Many ancestry inference tools, including STRUCTURE and ADMIXTURE, rely on the admixture model to infer both, allele frequencies p and individual admixture proportions q for a collection of individuals relative to a set of hypothetical ancestral populations. We show that under realistic conditions the likelihood in the admixture model is typically flat in some direction around a maximum likelihood estimate . In particular, the maximum likelihood estimator is non-unique and there is a complete spectrum of possible estimates. Common inference tools typically identify only a few points within this spectrum.

We provide an algorithm which computes the set of equally likely , when starting from . It is analytic for K = 2 ancestral populations and numeric for K > 2. We apply our algorithm to data from the 1000 genomes project, and show that inter-European estimators of q can come with a large set of equally likely possibilities. In general, markers with large allele frequency differences between populations in combination with individuals with concentrated admixture proportions lead to small areas with a flat likelihood.

Our findings imply that care must be taken when interpreting results from STRUCTURE and ADMIXTURE if populations are not separated well enough.

Version published to 10.1101/2024.10.18.619150 on bioRxiv
Oct 20, 2024

An Advanced Entropy Approach for Minimizing False Discoveries in Imputation-Based Association Analyses

This article has 4 authors:
1. Zhihui Zhang
2. Dakai Zhu
3. Xiangjun Xiao
4. Christopher I. Amos
This article has no evaluationsLatest version Dec 17, 2025
Impact of scale parameter for marker variance prior in some Bayesian whole-genome regression methods

This article has 2 authors:
1. Özge KOZAKLI
2. Ayhan CEYHAN
This article has no evaluationsLatest version Jan 20, 2026
Testing the validity and adequacy of linguistic phylogenetic analyses

This article has 1 author:
1. Benedict King
This article has no evaluationsLatest version Dec 17, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

An Advanced Entropy Approach for Minimizing False Discoveries in Imputation-Based Association Analyses

Impact of scale parameter for marker variance prior in some Bayesian whole-genome regression methods

Testing the validity and adequacy of linguistic phylogenetic analyses