RT Journal Article
SR Electronic
T1 Estimating effective population size and mutation rate from sequence data using Metropolis-Hastings sampling.
JF Genetics
JO Genetics
FD Genetics Society of America
SP 1421
OP 1430
VO 140
IS 4
A1 Kuhner, M K
A1 Yamato, J
A1 Felsenstein, J
YR 1995
UL http://www.genetics.org/content/140/4/1421.abstract
AB We present a new way to make a maximum likelihood estimate of the parameter 4N mu (effective population size times mutation rate per site, or theta) based on a population sample of molecular sequences. We use a Metropolis-Hastings Markov chain Monte Carlo method to sample genealogies in proportion to the product of their likelihood with respect to the data and their prior probability with respect to a coalescent distribution. A specific value of theta must be chosen to generate the coalescent distribution, but the resulting trees can be used to evaluate the likelihood at other values of theta, generating a likelihood curve. This procedure concentrates sampling on those genealogies that contribute most of the likelihood, allowing estimation of meaningful likelihood curves based on relatively small samples. The method can potentially be extended to cases involving varying population size, recombination, and migration.