Genetics, Vol. 156, 439-447, September 2000, Copyright © 2000

Usefulness of Single Nucleotide Polymorphism Data for Estimating Population Parameters

Mary K. Kuhnera, Peter Beerlia, Jon Yamatoa, and Joseph Felsensteina
a Department of Genetics, University of Washington, Seattle, Washington 98195-7360

Corresponding author: Mary K. Kuhner, Department of Genetics, University of Washington, Box 357360, Seattle, WA 98195-7360., mkkuhner{at}genetics.washington.edu (E-mail)

Communicating editor: G. A. CHURCHILL

Single nucleotide polymorphism (SNP) data can be used for parameter estimation via maximum likelihood methods as long as the way in which the SNPs were determined is known, so that an appropriate likelihood formula can be constructed. We present such likelihoods for several sampling methods. As a test of these approaches, we consider use of SNPs to estimate the parameter {Theta} = 4Neµ (the scaled product of effective population size and per-site mutation rate), which is related to the branch lengths of the reconstructed genealogy. With infinite amounts of data, ML models using SNP data are expected to produce consistent estimates of {Theta}. With finite amounts of data the estimates are accurate when {Theta} is high, but tend to be biased upward when {Theta} is low. If recombination is present and not allowed for in the analysis, the results are additionally biased upward, but this effect can be removed by incorporating recombination into the analysis. SNPs defined as sites that are polymorphic in the actual sample under consideration (sample SNPs) are somewhat more accurate for estimation of {Theta} than SNPs defined by their polymorphism in a panel chosen from the same population (panel SNPs). Misrepresenting panel SNPs as sample SNPs leads to large errors in the maximum likelihood estimate of {Theta}. Researchers collecting SNPs should collect and preserve information about the method of ascertainment so that the data can be accurately analyzed.





This article has been cited by other articles:


Home page
J HeredHome page
H. J. Ryynanen, A. Tonteri, A. Vasemagi, and C. R. Primmer
A Comparison of Biallelic Markers and Microsatellites for the Estimation of Population and Conservation Genetic Parameters in Atlantic Salmon (Salmo salar)
J. Hered., November 5, 2007; (2007) esm093v1.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
B. Knudsen and M. M. Miyamoto
Incorporating Experimental Design and Error Into Coalescent/Mutation Models of Population History
Genetics, August 1, 2007; 176(4): 2335 - 2342.
[Abstract] [Full Text] [PDF]


Home page
J HeredHome page
E. B. Rosenblum and J. Novembre
Ascertainment Bias in Spatially Structured Populations: A Case Study in the Eastern Fence Lizard
J. Hered., July 4, 2007; (2007) esm031v1.
[Abstract] [Full Text] [PDF]


Home page
J HeredHome page
L. Pariset, I. Cappuccio, P. Ajmone-Marsan, M. Bruford, S. Dunner, O. Cortes, G. Erhardt, E.-M. Prinzenberg, K. Gutscher, S. Joost, et al.
Characterization of 37 Breed-Specific Single-Nucleotide Polymorphisms in Sheep
J. Hered., September 1, 2006; 97(5): 531 - 534.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. K. Kuhner
LAMARC 2.0: maximum likelihood and Bayesian estimation of population parameters
Bioinformatics, March 15, 2006; 22(6): 768 - 770.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
A. G. Clark, M. J. Hubisz, C. D. Bustamante, S. H. Williamson, and R. Nielsen
Ascertainment bias in studies of human genome-wide polymorphism
Genome Res., November 1, 2005; 15(11): 1496 - 1502.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
R. Nielsen, M. J. Hubisz, and A. G. Clark
Reconstituting the Frequency Spectrum of Ascertained Single-Nucleotide Polymorphism Data
Genetics, December 1, 2004; 168(4): 2373 - 2382.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. M. Adams and R. R. Hudson
Maximum-Likelihood Estimation of Demographic Parameters Using the Frequency Spectrum of Unlinked Single-Nucleotide Polymorphisms
Genetics, November 1, 2004; 168(3): 1699 - 1712.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. Polanski and M. Kimmel
New Explicit Expressions for Relative Frequencies of Single-Nucleotide Polymorphisms With Application to Statistical Inference on Population Growth
Genetics, September 1, 2003; 165(1): 427 - 436.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J.-C. Nicod and C. R. Largiader
SNPs by AFLP (SBA): a rapid SNP isolation strategy for non-model organisms
Nucleic Acids Res., March 1, 2003; 31(5): e19 - e19.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. M. Akey, K. Zhang, M. Xiong, and L. Jin
The Effect of Single Nucleotide Polymorphism Identification Strategies on Estimates of Linkage Disequilibrium
Mol. Biol. Evol., February 1, 2003; 20(2): 232 - 242.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
J. M. Akey, G. Zhang, K. Zhang, L. Jin, and M. D. Shriver
Interrogating a High-Density SNP Map for Signatures of Natural Selection
Genome Res., December 1, 2002; 12(12): 1805 - 1814.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. Tsitrone, F. Rousset, and P. David
Heterosis, Marker Mutational Processes and Population Inbreeding History
Genetics, December 1, 2001; 159(4): 1845 - 1859.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
P. Beerli and J. Felsenstein
Maximum likelihood estimation of a migration matrix and effective population sizes in n subpopulations by using a coalescent approach
PNAS, March 29, 2001; (2001) 81068098.
[Abstract] [Full Text]


Home page
GeneticsHome page
M. K. Kuhner, J. Yamato, and J. Felsenstein
Maximum Likelihood Estimation of Recombination Rates From Population Data
Genetics, November 1, 2000; 156(3): 1393 - 1401.
[Abstract] [Full Text]


Home page
Proc. Natl. Acad. Sci. USAHome page
P. Beerli and J. Felsenstein
Maximum likelihood estimation of a migration matrix and effective population sizes in n subpopulations by using a coalescent approach
PNAS, April 10, 2001; 98(8): 4563 - 4568.
[Abstract] [Full Text] [PDF]