- THIS ARTICLE
- Full Text (Rapid PDF)
- Supporting Information
-
All Versions of this Article:
genetics.109.100479v1
182/1/295 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Lynch, M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Lynch, M.
doi:10.1534/genetics.109.100479
A more recent version of this article appeared on May 1, 2009.
REGULAR RESEARCH PAPERS |
Estimation of Allele Frequencies from High-coverage Genome-sequencing Projects
Michael Lynch 1*
1 Indiana University
* To whom correspondence should be addressed. E-mail: milynch{at}indiana.edu.
Submitted on January 6, 2009
Accepted on 6 March 2009
A new generation of high-throughput sequencing strategies will soon lead to the acquisition of high-coverage genomic profiles of hundreds to thousands of individuals within species, generating unprecedented levels of information on the frequencies of nucleotides segregating at individual sites. However, because these new technologies are error prone and yield uneven coverage of alleles in diploid individuals, they also introduce the need for novel methods for analyzing the raw read data. A maximum-likelihood method for the estimation of allele frequencies is developed, eliminating both the need to arbitrarily discard individuals with low coverage and the requirement for an extrinsic measure of the sequence error rate. The resultant estimates are nearly unbiased with asymptotically minimal sampling variance, thereby defining the limits to our ability to estimate population-genetic parameters and providing a logical basis for the optimal design of population-genomic surveys.
Key Words: allele-frequency estimation, genome scans, maximum-likelihood estimation, nucleotide diversity, site-frequency spectrum
This article has been cited by other articles:
![]() |
T. B. Sackton, R. J. Kulathinal, C. M. Bergman, A. R. Quinlan, E. B. Dopman, M. Carneiro, G. T. Marth, D. L. Hartl, and A. G. Clark Population Genomic Inferences from Sparse High-Throughput Sequencing of Two Populations of Drosophila melanogaster Gen Biol Evol, March 1, 2010; 2009(0): 449 - 465. [Abstract] [Full Text] [PDF] |
||||
