Genetics, Vol. 162, 2025-2035, December 2002, Copyright © 2002

Approximate Bayesian Computation in Population Genetics

Mark A. Beaumonta, Wenyang Zhangb, and David J. Baldingc
a School of Animal and Microbial Sciences, The University of Reading, Whiteknights, Reading RG6 6AJ, United Kingdom,
b Institute of Mathematics and Statistics, University of Kent, Canterbury, Kent CT2 7NF, United Kingdom
c Department of Epidemiology and Public Health, Imperial College School of Medicine, St. Mary's Campus, Norfolk Place, London W2 1PG, United Kingdom

Corresponding author: Mark A. Beaumont, Whiteknights, PO Box 228, Reading RG6 6AJ, UK., m.a.beaumont{at}reading.ac.uk (E-mail)

Communicating editor: W. STEPHAN

We propose a new method for approximate Bayesian statistical inference on the basis of summary statistics. The method is suited to complex problems that arise in population genetics, extending ideas developed in this setting by earlier authors. Properties of the posterior distribution of a parameter, such as its mean or density curve, are approximated without explicit likelihood calculations. This is achieved by fitting a local-linear regression of simulated parameter values on simulated summary statistics, and then substituting the observed summary statistics into the regression equation. The method combines many of the advantages of Bayesian statistical inference with the computational efficiency of methods based on summary statistics. A key advantage of the method is that the nuisance parameters are automatically integrated out in the simulation step, so that the large numbers of nuisance parameters that arise in population genetics problems can be handled without difficulty. Simulation results indicate computational and statistical efficiency that compares favorably with those of alternative methods previously proposed in the literature. We also compare the relative efficiency of inferences obtained using methods based on summary statistics with those obtained directly from the data using MCMC.





This article has been cited by other articles:


Home page
Mol Biol EvolHome page
L. B. Scheinfeldt, S. Biswas, J. Madeoy, C. F. Connelly, E. E. Schadt, and J. M. Akey
Population Genomic Analysis of ALMS1 in Humans Reveals a Surprisingly Complex Evolutionary History
Mol. Biol. Evol., June 1, 2009; 26(6): 1357 - 1367.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
V. C. Sousa, M. Fritz, M. A. Beaumont, and L. Chikhi
Approximate Bayesian Computation Without Summary Statistics: The Case of Admixture
Genetics, April 1, 2009; 181(4): 1507 - 1519.
[Abstract] [Full Text] [PDF]


Home page
J R Soc InterfaceHome page
M. Baguelin, J.R. Newton, N. Demiris, J. Daly, J.A. Mumford, and J.L.N. Wood
Control of equine influenza: scenario testing using a realistic metapopulation model of spread
J R Soc Interface, April 1, 2009; (2009) rsif.2009.0030v1.
[Abstract] [Full Text] [PDF]


Home page
J R Soc InterfaceHome page
T. Toni, D. Welch, N. Strelkowa, A. Ipsen, and M. P.H Stumpf
Approximate Bayesian computation scheme for parameter inference and model selection in dynamical systems
J R Soc Interface, February 6, 2009; 6(31): 187 - 202.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
D. J. Wilson, E. Gabriel, A. J.H. Leatherbarrow, J. Cheesbrough, S. Gee, E. Bolton, A. Fox, C. A. Hart, P. J. Diggle, and P. Fearnhead
Rapid Evolution and the Importance of Recombination to the Gastroenteric Pathogen Campylobacter jejuni
Mol. Biol. Evol., February 1, 2009; 26(2): 385 - 397.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
Y. Kim and T. Wiehe
Simulation of DNA sequence evolution under models of recent directional selection
Brief Bioinform, January 1, 2009; 10(1): 84 - 96.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. J. Jobin and J. L. Mountain
REJECTOR: software for population history inference from genetic data via a rejection algorithm
Bioinformatics, December 15, 2008; 24(24): 2936 - 2937.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J.-M. Cornuet, F. Santos, M. A. Beaumont, C. P. Robert, J.-M. Marin, D. J. Balding, T. Guillemaud, and A. Estoup
Inferring population history with DIY ABC: a user-friendly approach to approximate Bayesian computation
Bioinformatics, December 1, 2008; 24(23): 2713 - 2719.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
P. K. Ingvarsson
Multilocus Patterns of Nucleotide Polymorphism and the Demographic History of Populus tremula
Genetics, September 1, 2008; 180(1): 329 - 340.
[Abstract] [Full Text] [PDF]


Home page
Biol LettHome page
B. J Evans, J. A McGuire, R. M Brown, N. Andayani, and J. Supriatna
A coalescent framework for comparing alternative models of population structure with genetic data: evolution of Celebes toads
Biol Lett, August 23, 2008; 4(4): 430 - 433.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
L.-z. Gao and H. Innan
Nonindependent Domestication of the Two Rice Subspecies, Oryza sativa ssp. indica and ssp. japonica, Demonstrated by Multilocus Microsatellites
Genetics, June 1, 2008; 179(2): 965 - 976.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. Foll, M. A. Beaumont, and O. Gaggiotti
An Approximate Bayesian Computation Approach to Overcome Biases That Arise When Using Amplified Fragment Length Polymorphism Markers to Study Population Structure
Genetics, June 1, 2008; 179(2): 927 - 939.
[Abstract] [Full Text] [PDF]


Home page
Progress in Physical GeographyHome page
B. R. Riddle, M. N. Dawson, E. A. Hadly, D. J. Hafner, M. J. Hickerson, S. J. Mantooth, and A. D. Yoder
The role of molecular genetics in sculpting the future of integrative biogeography
Progress in Physical Geography, April 1, 2008; 32(2): 173 - 202.
[Abstract] [PDF]


Home page
GeneticsHome page
M. P. Cox, F. L. Mendez, T. M. Karafet, M. M. Pilkington, S. B. Kingan, G. Destro-Bisol, B. I. Strassmann, and M. F. Hammer
Testing for Archaic Hominin Admixture on the X Chromosome: Model Likelihoods for the Modern Human RRM2P4 Region From Summaries of Genealogical Topology Under the Structured Coalescent
Genetics, January 1, 2008; 178(1): 427 - 437.
[Abstract] [Full Text] [PDF]


Home page
Biol LettHome page
A. D Leache, S. C Crews, and M. J Hickerson
Two waves of diversification in mammals and reptiles of Baja California revealed by hierarchical Bayesian analysis
Biol Lett, December 22, 2007; 3(6): 646 - 650.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
N. J. R. Fagundes, N. Ray, M. Beaumont, S. Neuenschwander, F. M. Salzano, S. L. Bonatto, and L. Excoffier
Statistical evaluation of alternative models of human evolution
PNAS, November 6, 2007; 104(45): 17614 - 17619.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
C. Becquet and M. Przeworski
A new approach to estimate parameters of speciation models with application to apes
Genome Res., October 1, 2007; 17(10): 1505 - 1519.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. G. B. Blum and N. A. Rosenberg
Estimating the Number of Ancestral Lineages Using a Maximum-Likelihood Method Based on Rejection Sampling
Genetics, July 1, 2007; 176(3): 1741 - 1757.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. A. Sisson, Y. Fan, and M. M. Tanaka
Sequential Monte Carlo without likelihoods
PNAS, February 6, 2007; 104(6): 1760 - 1765.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
B. Padhukasahasram, J. D. Wall, P. Marjoram, and M. Nordborg
Estimating Recombination Rates From Single-Nucleotide Polymorphisms Using Summary Statistics
Genetics, November 1, 2006; 174(3): 1517 - 1528.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. Wang
A Coalescent-Based Estimator of Admixture From DNA Sequences
Genetics, July 1, 2006; 173(3): 1679 - 1692.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. M. Tanaka, A. R. Francis, F. Luciani, and S. A. Sisson
Using Approximate Bayesian Computation to Estimate Tuberculosis Transmission Parameters From Genotype Data
Genetics, July 1, 2006; 173(3): 1511 - 1520.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
H. Innan
Modified Hudson-Kreitman-Aguade Test and Two-Dimensional Evaluation of Neutrality Tests
Genetics, July 1, 2006; 173(3): 1725 - 1733.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. Jakobsson, J. Hagenblad, S. Tavare, T. Sall, C. Hallden, C. Lind-Hallden, and M. Nordborg
A Unique Recent Origin of the Allotetraploid Species Arabidopsis suecica: Evidence from Nuclear DNA Markers
Mol. Biol. Evol., June 1, 2006; 23(6): 1217 - 1231.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
K. Thornton and P. Andolfatto
Approximate Bayesian Inference Reveals Evidence for a Recent, Severe Bottleneck in a Netherlands Population of Drosophila melanogaster
Genetics, March 1, 2006; 172(3): 1607 - 1619.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
K. Thornton
Recombination and the Properties of Tajima's D in the Context of Approximate-Likelihood Calculation
Genetics, December 1, 2005; 171(4): 2143 - 2148.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
N. Miller, A. Estoup, S. Toepfer, D. Bourguet, L. Lapchin, S. Derridj, K. S. Kim, P. Reynaud, L. Furlan, and T. Guillemaud
Multiple Transatlantic Introductions of the Western Corn Rootworm
Science, November 11, 2005; 310(5750): 992 - 992.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
S. C. Leman, Y. Chen, J. E. Stajich, M. A. F. Noor, and M. K. Uyenoyama
Likelihoods From Summary Statistics: Recent Divergence Between Species
Genetics, November 1, 2005; 171(3): 1419 - 1436.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
N. Ray, M. Currat, P. Berthier, and L. Excoffier
Recovering the geographic origin of early modern humans by realistic and spatially explicit simulations
Genome Res., August 1, 2005; 15(8): 1161 - 1167.
[Abstract] [Full Text] [PDF]


Home page
Phil Trans R Soc BHome page
G. A.T McVean and N. J Cardin
Approximating the coalescent with recombination
Phil Trans R Soc B, July 29, 2005; 360(1459): 1387 - 1393.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
G. Hamilton, M. Stoneking, and L. Excoffier
Molecular analysis reveals tighter social regulation of immigration in patrilocal populations than in matrilocal populations
PNAS, May 24, 2005; 102(21): 7476 - 7480.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
G. Hamilton, M. Currat, N. Ray, G. Heckel, M. Beaumont, and L. Excoffier
Bayesian Estimation of Recent Migration Rates After a Spatial Expansion
Genetics, May 1, 2005; 170(1): 409 - 417.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
C. N. K. Anderson, U. Ramakrishnan, Y. L. Chan, and E. A. Hadly
Serial SimCoal: A population genetics model for data from multiple populations and points in time
Bioinformatics, April 15, 2005; 21(8): 1733 - 1734.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
L. Excoffier, A. Estoup, and J.-M. Cornuet
Bayesian Analysis of an Admixture Model With Mutations and Arbitrarily Linked Markers
Genetics, March 1, 2005; 169(3): 1727 - 1738.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
D. A. Tallmon, G. Luikart, and M. A. Beaumont
Comparative Evaluation of a New Effective Population Size Estimator Based on Approximate Bayesian Computation
Genetics, June 1, 2004; 167(2): 977 - 988.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
P. Marjoram, J. Molitor, V. Plagnol, and S. Tavare
Markov chain Monte Carlo without likelihoods
PNAS, December 23, 2003; 100(26): 15324 - 15328.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. Przeworski
Estimating the Time Since the Fixation of a Beneficial Allele
Genetics, August 1, 2003; 164(4): 1667 - 1676.
[Abstract] [Full Text] [PDF]