- THIS ARTICLE
- Full Text
- Full Text (PDF)
-
All Versions of this Article:
genetics.106.061317v1
175/4/1787 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Huelsenbeck, J. P.
- Articles by Andolfatto, P.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Huelsenbeck, J. P.
- Articles by Andolfatto, P.
Originally published as Genetics Published Articles Ahead of Print on January 21, 2007.
Genetics, Vol. 175, 1787-1802, April 2007, Copyright © 2007
doi:10.1534/genetics.106.061317
Inference of Population Structure Under a Dirichlet Process Model
John P. Huelsenbeck*,1 and
Peter Andolfatto
* Department of Integrative Biology, University of California, Berkeley, California 94720 and
Section of Ecology, Behavior and Evolution, Division of Biological Sciences, University of California, San Diego, California 92093-0116
1 Corresponding author: Section of Ecology, Behavior and Evolution, Division of Biological Sciences, University of California, La Jolla, CA 92093-0116.
E-mail: johnh{at}berkeley.edu
Inferring population structure from genetic data sampled from some number of individuals is a formidable statistical problem. One widely used approach considers the number of populations to be fixed and calculates the posterior probability of assigning individuals to each population. More recently, the assignment of individuals to populations and the number of populations have both been considered random variables that follow a Dirichlet process prior. We examined the statistical behavior of assignment of individuals to populations under a Dirichlet process prior. First, we examined a best-case scenario, in which all of the assumptions of the Dirichlet process prior were satisfied, by generating data under a Dirichlet process prior. Second, we examined the performance of the method when the genetic data were generated under a population genetics model with symmetric migration between populations. We examined the accuracy of population assignment using a distance on partitions. The method can be quite accurate with a moderate number of loci. As expected, inferences on the number of populations are more accurate when
= 4Neu is large and when the migration rate (4Nem) is low. We also examined the sensitivity of inferences of population structure to choice of the parameter of the Dirichlet process model. Although inferences could be sensitive to the choice of the prior on the number of populations, this sensitivity occurred when the number of loci sampled was small; inferences are more robust to the prior on the number of populations when the number of sampled loci is large. Finally, we discuss several methods for summarizing the results of a Bayesian Markov chain Monte Carlo (MCMC) analysis of population structure. We develop the notion of the mean population partition, which is the partition of individuals to populations that minimizes the squared partition distance to the partitions sampled by the MCMC algorithm.
This article has been cited by other articles:
![]() |
G. M. Volk, C. M. Richards, A. A. Reilley, A. D. Henk, P. A. Reeves, P. L. Forsline, and H. S. Aldwinckle Genetic Diversity and Disease Resistance of Wild Malus orientalis from Turkey and Southern Russia J. Amer. Soc. Hort. Sci., May 1, 2008; 133(3): 383 - 389. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kitada, T. Kitakado, and H. Kishino Empirical Bayes Inference of Pairwise FST and Its Distribution in the Genome Genetics, October 1, 2007; 177(2): 861 - 873. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Jakobsson and N. A. Rosenberg CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure Bioinformatics, July 15, 2007; 23(14): 1801 - 1806. [Abstract] [Full Text] [PDF] |
||||


