Genetics. Published Articles Ahead of Print: September 2, 2005, Copyright © 2005
doi:10.1534/genetics.104.040402


A more recent version of this article appeared on November 1, 2005.


REGULAR RESEARCH PAPERS

Likelihoods from summary statistics: Recent divergence between species

1 Duke University
2 University of Illinois at Urbana-Champaign

* To whom correspondence should be addressed. E-mail: marcy{at}duke.edu.

Submitted on December 29, 2004
Revised on April 22, 2005
Accepted on August 5, 2005


Abstract

We describe an importance sampling method for approximating likelihoods of population parameters based on multiple summary statistics. In this first application, we address the demographic history of closely related members of the Drosophila pseudoobscura group. We base the maximum likelihood estimation of the time since speciation and the effective population sizes of the extant and ancestral populations on the pattern of nucleotide variation at DPS2002, a noncoding region tightly linked to a paracentric inversion that strongly contributes to reproductive isolation. Consideration of summary statistics rather than entire nucleotide sequences permits a compact description of the genealogy of the sample. We use importance sampling first to propose a genealogical and mutational history consistent with the observed array of summary statistics and then to correct the likelihood with the exact probability of the history determined from a system of recursions. Analysis of a subset of the data, for which recursive computation of the exact likelihood was feasible, indicated close agreement between the approximate and exact likelihoods. Our results for the complete dataset also compare well with those obtained through Metropolis-Hastings sampling of fully resolved genealogies of entire nucleotide sequences.

Key Words: importance sampling, maximum likelihood, population structure, speciation time
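
The importance sampling idea summarized in the abstract can be illustrated with a deliberately small toy problem. The sketch below is not the authors' method, which handles multiple summary statistics and corrects each proposed history with exact probabilities from a system of recursions. Instead it assumes a sample of two sequences from a single population, one summary statistic (the number of segregating sites s), and a gamma proposal for the coalescence time built from a driving value theta0. The function names, the driving value, and the proposal are all assumptions chosen for illustration. The toy case has a closed-form likelihood, so the approximation can be checked against it.

```python
import math
import random


def exact_likelihood(s, theta):
    """Closed-form P(S = s | theta) for two sequences: theta^s / (1 + theta)^(s + 1)."""
    return theta ** s / (1.0 + theta) ** (s + 1)


def is_likelihood(s, theta, theta0=1.0, n_draws=20000, seed=1):
    """Importance sampling estimate of P(S = s | theta) (illustrative toy only).

    Latent history: the coalescence time t, with prior density exp(-t).
    Data model:     S | t ~ Poisson(theta * t).
    Proposal:       t ~ Gamma(shape = s + 1, rate = 1 + theta0), an assumed
                    choice that concentrates on times consistent with s.
    """
    rng = random.Random(seed)
    shape = s + 1
    rate = 1.0 + theta0
    total = 0.0
    for _ in range(n_draws):
        # random.gammavariate takes (shape, scale), so pass scale = 1 / rate.
        t = rng.gammavariate(shape, 1.0 / rate)
        log_prior = -t
        log_obs = -theta * t + s * math.log(theta * t) - math.lgamma(s + 1)
        log_q = (shape * math.log(rate) + (shape - 1) * math.log(t)
                 - rate * t - math.lgamma(shape))
        # Importance weight: joint probability of (history, data) over proposal density.
        total += math.exp(log_prior + log_obs - log_q)
    return total / n_draws


if __name__ == "__main__":
    s_obs = 3
    for theta in (0.5, 1.0, 2.0):
        approx = is_likelihood(s_obs, theta)
        exact = exact_likelihood(s_obs, theta)
        print(f"theta={theta:.1f}  approx={approx:.5f}  exact={exact:.5f}")
```

When theta0 equals theta, the proposal coincides with the conditional distribution of the coalescence time given s, and every weight equals the exact likelihood. For other values of theta the average of the weights is a Monte Carlo estimate whose variance grows as theta moves away from theta0.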




This article has been cited by other articles:


C. Becquet and M. Przeworski
A new approach to estimate parameters of speciation models with application to apes
Genome Res., October 1, 2007; 17(10): 1505 - 1519.


S. C. Leman, M. K. Uyenoyama, M. Lavine, and Y. Chen
The evolutionary forest algorithm
Bioinformatics, August 1, 2007; 23(15): 1962 - 1968.


A. S. Chang and M. A. F. Noor
The Genetics of Hybrid Male Sterility Between the Allopatric Species Pair Drosophila persimilis and D. pseudoobscura bogotana: Dominant Sterility Alleles in Collinear Autosomal Regions
Genetics, May 1, 2007; 176(1): 343 - 349.


M. M. Tanaka, A. R. Francis, F. Luciani, and S. A. Sisson
Using Approximate Bayesian Computation to Estimate Tuberculosis Transmission Parameters From Genotype Data
Genetics, July 1, 2006; 173(3): 1511 - 1520.