Genetics, Vol. 159, 401-411, September 2001, Copyright © 2001

Mutations as Missing Data: Inferences on the Ages and Distributions of Nonsynonymous and Synonymous Mutations

Rasmus Nielsen
Department of Biometrics, Cornell University, Ithaca, New York 14853-7801

Corresponding author: Rasmus Nielsen, Department of Biometrics, Cornell University, 439 Warren Hall, Ithaca, NY 14853-7801. E-mail: rn28@cornell.edu

Communicating editor: W. STEPHAN

This article describes a new Markov chain Monte Carlo (MCMC) method applicable to DNA sequence data, which treats mutations in the genealogy as missing data. The method facilitates inferences regarding the age and identity of specific mutations while taking the full complexities of the mutational process in DNA sequences into account. We demonstrate the utility of the method in three applications. First, we demonstrate how the method can be used to make inferences regarding population genetic parameters such as θ (the effective population size times the mutation rate). Second, we show how the method can be used to estimate the ages of mutations in finite sites models and to make inferences regarding the distribution and ages of nonsynonymous and synonymous mutations. The method is applied to two previously published data sets, and we demonstrate that in one of the data sets the average age of nonsynonymous mutations is significantly lower than the average age of synonymous mutations, suggesting the presence of slightly deleterious mutations. Third, we demonstrate how the method in general can be used to evaluate the posterior distribution of a function of a mapping of mutations on a gene genealogy. This application is useful for evaluating the uncertainty associated with methods that rely on mapping mutations on a phylogeny or a gene genealogy.
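
The third application, evaluating the posterior distribution of a function of a mutation mapping, can be illustrated with a small sketch. It is not the article's implementation: it assumes an MCMC sampler has already produced a set of sampled mappings, and it only shows how a function of each mapping (here, the mean age of nonsynonymous mutations minus the mean age of synonymous mutations) could be summarized over those samples. The representation of a mapping as a list of `(kind, age)` pairs, the function names, and the toy numbers are all hypothetical.

```python
from statistics import mean


def age_difference(mapping):
    """Mean nonsynonymous age minus mean synonymous age for one sampled mapping.

    `mapping` is a list of (kind, age) pairs with kind "N" (nonsynonymous)
    or "S" (synonymous). This representation is an assumption made for
    illustration. Returns None if either class is absent from the mapping.
    """
    nonsyn = [age for kind, age in mapping if kind == "N"]
    syn = [age for kind, age in mapping if kind == "S"]
    if not nonsyn or not syn:
        return None
    return mean(nonsyn) - mean(syn)


def posterior_summary(sampled_mappings):
    """Summarize the age difference over mappings sampled from the posterior.

    Returns (posterior mean of the difference,
             fraction of samples in which nonsynonymous mutations are younger).
    Mappings lacking one of the two classes are skipped.
    """
    diffs = [d for d in map(age_difference, sampled_mappings) if d is not None]
    if not diffs:
        raise ValueError("no sampled mapping contains both mutation classes")
    prob_younger = sum(d < 0 for d in diffs) / len(diffs)
    return mean(diffs), prob_younger


# Toy input only: ages are in arbitrary units and are not taken from any data set.
toy_samples = [
    [("N", 0.25), ("S", 0.75), ("S", 1.25)],
    [("N", 0.5), ("N", 0.5), ("S", 1.0)],
    [("N", 1.5), ("S", 1.0)],
    [("S", 0.5)],  # skipped: no nonsynonymous mutation in this mapping
]

post_mean, prob_younger = posterior_summary(toy_samples)
print(post_mean, prob_younger)
```

Because the summary is computed per sampled mapping and then averaged, uncertainty in where and when mutations occurred on the genealogy carries through to the final quantity, rather than being fixed by a single reconstructed mapping.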




