Originally published as Genetics Published Articles Ahead of Print on October 18, 2007.

Genetics, Vol. 177, 1725-1731, November 2007, Copyright © 2007
doi:10.1534/genetics.106.069088

Sequence-Level Population Simulations Over Large Genomic Regions

* Department of Epidemiology and Public Health, Imperial College, London W2 1PG, United Kingdom, † Serono International, CH-1211 Geneva 20, Switzerland, and ‡ Noncommunicable Disease Epidemiology Unit, London School of Hygiene and Tropical Medicine, London WC1E 7HT, United Kingdom

2 Corresponding author: Department of Epidemiology and Public Health, Imperial College, St. Mary's Campus, Norfolk Place, London W2 1PG, United Kingdom.
E-mail: c.hoggart@imperial.ac.uk

Simulation is an invaluable tool for investigating the effects of various population genetics modeling assumptions on resulting patterns of genetic diversity, and for assessing the performance of statistical techniques, for example those designed to detect and measure the genomic effects of selection. It is also used to investigate the effectiveness of various design options for genetic association studies. Backward-in-time simulation methods are computationally efficient and have become widely used since their introduction in the 1980s. The forward-in-time approach has substantial advantages in terms of accuracy and modeling flexibility, but at greater computational cost. We have developed flexible and efficient simulation software and a rescaling technique to aid computational efficiency that together allow the simulation of sequence-level data over large genomic regions in entire diploid populations under various scenarios for demography, mutation, selection, and recombination, the latter including hotspots and gene conversion. Our forward evolution of genomic regions (FREGENE) software is freely available from www.ebi.ac.uk/projects/BARGEN together with an ancillary program to generate phenotype labels, either binary or quantitative. In this article we discuss limitations of coalescent-based simulation, introduce the rescaling technique that makes large-scale forward-in-time simulation feasible, and demonstrate the utility of various features of FREGENE, many not previously available.
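As a rough illustration of what forward-in-time simulation and rescaling involve, the sketch below evolves a small diploid population generation by generation with mutation and crossover. It is a toy under stated assumptions and is not FREGENE's implementation: the function names `rescale` and `simulate`, the per-haplotype event probabilities `mu` and `rho`, and the neutral model (no selection, hotspots, or gene conversion) are all hypothetical simplifications. The rescaling shown, which shrinks population size and generation count by a factor while scaling rates up by the same factor, is one common form assumed here, since the abstract does not spell out the details.

```python
import random


def rescale(n_diploid, n_generations, mu, rho, factor):
    """Assumed rescaling: divide N and the number of generations by
    `factor`, multiply rates by `factor`, so N*mu and N*rho are unchanged."""
    return (n_diploid // factor, n_generations // factor,
            mu * factor, rho * factor)


def simulate(n_diploid, n_generations, mu, rho, n_sites, seed=0):
    """Toy neutral Wright-Fisher simulation run forward in time.

    A haplotype is a frozenset of positions carrying a derived allele.
    mu  : probability of one new mutation per haplotype per generation
    rho : probability of one crossover per gamete per generation
    """
    rng = random.Random(seed)
    # Start from a population with no variation.
    pop = [(frozenset(), frozenset()) for _ in range(n_diploid)]

    def gamete(parent):
        # Pick which parental haplotype starts the gamete.
        a, b = parent if rng.random() < 0.5 else parent[::-1]
        if rng.random() < rho:
            # Single crossover at a uniformly chosen breakpoint.
            cut = rng.randrange(1, n_sites)
            hap = {s for s in a if s < cut} | {s for s in b if s >= cut}
        else:
            hap = set(a)
        if rng.random() < mu:
            # Toggle the allele at one random site.
            hap ^= {rng.randrange(n_sites)}
        return frozenset(hap)

    for _ in range(n_generations):
        # Each offspring draws two parents at random (with replacement).
        pop = [(gamete(rng.choice(pop)), gamete(rng.choice(pop)))
               for _ in range(n_diploid)]
    return pop


# Simulate a 10-fold rescaled version of a larger scenario.
n, g, mu, rho = rescale(1000, 10000, 1e-4, 1e-4, factor=10)
population = simulate(n, g, mu, rho, n_sites=100_000, seed=1)
segregating = set().union(*(h for ind in population for h in ind))
print(len(population), "individuals,", len(segregating), "variable sites")
```

The cost of a forward simulation grows with both population size and the number of generations, which is why reducing both by the same factor can cut running time substantially while keeping the population-scaled rates fixed.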



