Genetics, Vol. 165, 427-436, September 2003, Copyright © 2003

New Explicit Expressions for Relative Frequencies of Single-Nucleotide Polymorphisms With Application to Statistical Inference on Population Growth

A. Polanskia,b and M. Kimmela
a Department of Statistics, Rice University, Houston, Texas 77005
b Institute of Automation, Silesian Technical University, 44-100 Gliwice, Poland

Corresponding author: M. Kimmel, Rice University, M.S. 138, 6100 Main St., Houston, TX 77005., kimmel{at}rice.edu (E-mail)

Communicating editor: N. TAKAHATA

We present new methodology for calculating sampling distributions of single-nucleotide polymorphism (SNP) frequencies in populations with time-varying size. Our approach is based on deriving analytical expressions for frequencies of SNPs. Analytical expressions allow for computations that are faster and more accurate than Monte Carlo simulations. In contrast to other articles showing analytical formulas for frequencies of SNPs, we derive expressions that contain coefficients that do not explode when the genealogy size increases. We also provide analytical formulas to describe the way in which the ascertainment procedure modifies SNP distributions. Using our methods, we study the power to test the hypothesis of exponential population expansion vs. the hypothesis of evolution with constant population size. We also analyze some of the available SNP data and we compare our results of demographic parameters estimation to those obtained in previous studies in population genetics. The analyzed data seem consistent with the hypothesis of past population growth of modern humans. The analysis of the data also shows a very strong sensitivity of estimated demographic parameters to changes of the model of the ascertainment procedure.





This article has been cited by other articles:


Home page
Genome ResHome page
I. Hellmann, Y. Mang, Z. Gu, P. Li, F. M. de la Vega, A. G. Clark, and R. Nielsen
Population genetic analysis of shotgun assemblies of genomic sequences from multiple individuals
Genome Res., July 1, 2008; 18(7): 1020 - 1029.
[Abstract] [Full Text] [PDF]


Home page
J HeredHome page
E. B. Rosenblum and J. Novembre
Ascertainment Bias in Spatially Structured Populations: A Case Study in the Eastern Fence Lizard
J. Hered., July 4, 2007; (2007) esm031v1.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
P. L.F. Johnson and M. Slatkin
Inference of population genetic parameters in metagenomics: A clean look at messy data
Genome Res., October 1, 2006; 16(10): 1320 - 1327.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
R. Nielsen, M. J. Hubisz, and A. G. Clark
Reconstituting the Frequency Spectrum of Ascertained Single-Nucleotide Polymorphism Data
Genetics, December 1, 2004; 168(4): 2373 - 2382.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. M. Adams and R. R. Hudson
Maximum-Likelihood Estimation of Demographic Parameters Using the Frequency Spectrum of Unlinked Single-Nucleotide Polymorphisms
Genetics, November 1, 2004; 168(3): 1699 - 1712.
[Abstract] [Full Text] [PDF]