- THIS ARTICLE
- Full Text
- Full Text (PDF)
-
All Versions of this Article:
genetics.109.107995v1
183/3/1087 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- GOOGLE SCHOLAR
- Articles by Jenkins, P. A.
- Articles by Song, Y. S.
- PUBMED
- PubMed Citation
- Articles by Jenkins, P. A.
- Articles by Song, Y. S.
Originally published as Genetics Published Articles Ahead of Print on September 7, 2009.
Genetics, Vol. 183, 1087-1103, November 2009, Copyright © 2009
doi:10.1534/genetics.109.107995
Closed-Form Two-Locus Sampling Distributions: Accuracy and Universality
Paul A. Jenkins* and
Yun S. Song*,
,1
* Computer Science Division and
Department of Statistics, University of California, Berkeley, California 94720
1 Corresponding author: Department of Electrical Engineering and Computer Sciences, University of California, 683 Soda Hall #1776, Berkeley, CA 94720-1776.
E-mail: yss{at}cs.berkeley.edu
Sampling distributions play an important role in population genetics analyses, but closed-form sampling formulas are generally intractable to obtain. In the presence of recombination, there is no known closed-form sampling formula that holds for an arbitrary recombination rate. However, we recently showed that it is possible to obtain useful closed-form sampling formulas when the population-scaled recombination rate
is large. Specifically, in the case of the two-locus infinite-alleles model, we considered an asymptotic expansion of the sampling formula in inverse powers of
and obtained closed-form expressions for the first few terms in the expansion. In this article, we generalize this result to an arbitrary finite-alleles mutation model and show that, up to the first few terms in the expansion that we are able to compute analytically, the functional form of the asymptotic sampling formula is common to all mutation models. We carry out an extensive study of the accuracy of the asymptotic formula for the two-locus parent-independent mutation model and discuss in detail a concrete application in the context of the composite-likelihood method. Furthermore, using our asymptotic sampling formula, we establish a simple sufficient condition for a given two-locus sample configuration to have a finite maximum-likelihood estimate (MLE) of
. This condition is the first analytic result on the classification of the MLE of
and is instantaneous to check in practice, provided that one-locus probabilities are known.