- THIS ARTICLE
- Full Text (Rapid PDF)
- Data Supplement
-
All Versions of this Article:
genetics.108.091116v1
180/3/1511 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Rosset, S.
- Articles by Behar, D. M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Rosset, S.
- Articles by Behar, D. M.
doi:10.1534/genetics.108.091116
A more recent version of this article appeared on November 1, 2008.
REGULAR RESEARCH PAPERS |
Maximum Likelihood Estimation of Site-Specific Mutation Rates in Human Mitochondrial DNA from Partial Phylogenetic Classification
Saharon Rosset 1*, Spencer Wells 2, David F. Soria-Hernanz 2, Chris Tyler-Smith 3, Ajay K. Royyuru 4 and Doron M. Behar 5
1 Tel Aviv University
2 National Geographic Society
3 The Wellcome Trust Sanger Institute
4 IBM Thomas J. Watson Research Center
5 Rambam Health Care Campus
* To whom correspondence should be addressed. E-mail: saharon{at}post.tau.ac.il.
Submitted on May 27, 2008
Revised on July 15, 2008
Accepted on 21 August 2008
The mitochondrial DNA hyper-variable segment I (HVS-I) is widely used in studies of human evolutionary genetics, and therefore accurate estimates of mutation rates among nucleotide sites in this region are essential. We have developed a novel maximum-likelihood methodology for estimating site-specific mutation rates from partial phylogenetic information, such as haplogroup association. The resulting estimation problem is a generalized linear model, with a non-standard link function. We develop inference and bias correction tools for our estimates and a hypothesis testing approach for site independence. We demonstrate our methodology using 16,609 HVS-I samples from the Genographic Project. Our results suggest that mutation rates among nucleotide sites in HVS-I are highly variable. The 16,400--16,500 region exhibits significantly lower rates compared to other regions, suggesting potential functional constraints. Several loci identified in the literature as possible termination associated sequences (TAS) do not yield statistically slower rates than the rest of HVS-I, casting doubt on their functional importance. Our tests do not reject the null hypothesis of independent mutation rates among nucleotide sites, supporting the use of site-independence assumption for analyzing HVS-I. Potential extensions of our methodology include its application to estimation of mutation rates in other genetic regions, like Y-chromosome short tandem repeats.
Key Words: Generalized linear models, Mutation rate estimation, mtDNA control region