Genetics. Published Articles Ahead of Print: April 15, 2007, Copyright © 2007
doi:10.1534/genetics.106.066233


A more recent version of this article appeared on July 1, 2007.


REGULAR RESEARCH PAPERS

Estimating the number of ancestral lineages using a maximum likelihood method based on rejection sampling

1 University Grenoble I
2 University of Michigan

* To whom correspondence should be addressed. E-mail: michblum{at}umich.edu.

Submitted on September 25, 2006
Revised on January 24, 2007
Accepted on 12 April 2007


Abstract

Estimating the number of ancestral lineages of a sample of DNA sequences at time t in the past can be viewed as a variation on the problem of estimating the time to the most recent common ancestor. In order to estimate the number of ancestral lineages, we develop a maximum likelihood approach that takes advantage of a prior model of population demography, in addition to the molecular data summarized by the pattern of polymorphic sites. The method relies on a rejection sampling algorithm that is introduced for simulating conditional coalescent trees given a fixed number of ancestral lineages at time t. Computer simulations show that the number of ancestral lineages can be estimated accurately, provided that the number of mutations that occurred since time t is sufficiently large. The method is applied to 986 present-day human sequences located in hypervariable region 1 of the mitochondrion in order to estimate the number of ancestral lineages of modern humans at the time of potential admixture with the Neanderthal population. Our estimates support a view that the proportion of the modern population consisting of Neanderthal contributions must be relatively small, lower than about 5%, if the admixture happened as recently as 30,000 years ago.

Key Words: Ancient admixture, Coalescent theory, Mitochondrial DNA, Neanderthal, Segregating sites