Skip to main content
  • Facebook
  • Twitter
  • YouTube
  • LinkedIn
  • Google Plus
  • Other GSA Resources
    • Genetics Society of America
    • G3: Genes | Genomes | Genetics
    • Genes to Genomes: The GSA Blog
    • GSA Conferences
    • GeneticsCareers.org
  • Log in
Genetics

Main menu

  • HOME
  • ISSUES
    • Current Issue
    • Early Online
    • Archive
  • ABOUT
    • About the journal
    • Why publish with us?
    • Editorial board
    • Contact us
  • SERIES
    • Centennial
    • Genetics of Immunity
    • Genetics of Sex
    • Genomic Selection
    • Multiparental Populations
    • FlyBook
    • WormBook
    • YeastBook
  • ARTICLE TYPES
    • About Article Types
    • Commentaries
    • Editorials
    • GSA Honors and Awards
    • Methods, Technology & Resources
    • Perspectives
    • Primers
    • Reviews
    • Toolbox Reviews
  • PUBLISH & REVIEW
    • Scope & publication policies
    • Submission & review process
    • Article types
    • Prepare your manuscript
    • Submit your manuscript
    • After acceptance
    • Guidelines for reviewers
  • SUBSCRIBE
    • Why subscribe?
    • For institutions
    • For individuals
    • Email alerts
    • RSS feeds
  • Other GSA Resources
    • Genetics Society of America
    • G3: Genes | Genomes | Genetics
    • Genes to Genomes: The GSA Blog
    • GSA Conferences
    • GeneticsCareers.org

User menu

Search

  • Advanced search
Genetics

Advanced Search

  • HOME
  • ISSUES
    • Current Issue
    • Early Online
    • Archive
  • ABOUT
    • About the journal
    • Why publish with us?
    • Editorial board
    • Contact us
  • SERIES
    • Centennial
    • Genetics of Immunity
    • Genetics of Sex
    • Genomic Selection
    • Multiparental Populations
    • FlyBook
    • WormBook
    • YeastBook
  • ARTICLE TYPES
    • About Article Types
    • Commentaries
    • Editorials
    • GSA Honors and Awards
    • Methods, Technology & Resources
    • Perspectives
    • Primers
    • Reviews
    • Toolbox Reviews
  • PUBLISH & REVIEW
    • Scope & publication policies
    • Submission & review process
    • Article types
    • Prepare your manuscript
    • Submit your manuscript
    • After acceptance
    • Guidelines for reviewers
  • SUBSCRIBE
    • Why subscribe?
    • For institutions
    • For individuals
    • Email alerts
    • RSS feeds
Previous ArticleNext Article

Demographic Inference Using Spectral Methods on SNP Data, with an Analysis of the Human Out-of-Africa Expansion

Sergio Lukić and Jody Hey
Genetics October 1, 2012 vol. 192 no. 2 619-639; https://doi.org/10.1534/genetics.112.141846
Sergio Lukić
*Department of Genetics, Rutgers University, Piscataway, New Jersey 08854†School of Natural Sciences, Institute for Advanced Study, Princeton, New Jersey 08540
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jody Hey
*Department of Genetics, Rutgers University, Piscataway, New Jersey 08854
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Figures & Data
  • Supplemental
  • Info & Metrics
Loading

Article Figures & Data

Figures

  • Tables
  • Additional Files
  • Figure 1 
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 1 

    Decay of the error function between the equilibrium density of allele frequencies and its polynomial approximation and piecewise linear approximation. The horizontal axis denotes the number of polynomials for the lower curve and the number of grid points for the upper curve.

  • Figure 2 
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 2 

    Simulation of two Brownian paths on a two-population tree. The plot illustrates different types of initial conditions for the paths. The initial condition can be either a random allele frequency in the ancestral population at mutation–drift equilibrium (solid lines) or a de novo mutation that arises in one of the populations after the ancestral population leaves the state of equilibrium (shaded lines).

  • Figure 3 
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 3 

    Decay of the chi-square statistic in MultiPop (top) and ∂a∂i (bottom). Four different demographic scenarios with two simultaneous populations and 50 chromosomes sampled per population are considered. For simplicity, the average scaled migration rate is used to label each scenario. The observed AFS were constructed using Π = 50,000 independent loci produced with Monte Carlo simulations.

  • Figure 4 
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 4 

    Decay of the chi-square statistic in MultiPop (top) and ∂a∂i (bottom). Three different demographic scenarios with three simultaneous populations and 20 chromosomes sampled per population are considered. For simplicity, the average scaled migration rate is used to label each scenario. The observed AFS were constructed using Π = 50,000 independent loci produced with Monte Carlo simulations.

  • Figure 5 
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 5 

    A graphical representation of a four-population model for the human expansion out of Africa and peopling of the Americas. The nonconstancy of the population sizes of CEU, CHB, and MEX is modeled by means of an exponential growth model with growth rates rEU, rAS, and rMX.

Tables

  • Figures
  • Additional Files
  • Table 1  Comparison of numerical approximations of the AFS and the simulated AFS
    Model no.Intensity of migration (2Nm)MPop vs. Monte Carlo (chi-square statistic)∂a∂i vs. Monte Carlo (chi-square statistic)
    100.0012650.002479
    2<0.50.008940.005955
    310.010760.006457
    4>10.011400.006286
    5< 0.50.0075580.04684
    610.015110.02882
    7>10.029790.01791
    • Chi-square statistics associated with seven demographic scenarios are shown. The AFS computed by MultiPop (MPop) corresponds to the Λ = 35 AFS, and the AFS computed by ∂a∂i corresponds to a grid size of 40 grid points per population. The frequency spectra were normalized in all the cases such that the total number of SNPs was 1.

  • Table 2  Comparison of maximum-likelihood estimates using different numerical approximations
    ModelModel parameterTrue valueθ MPopθ ∂a∂i
    14NAu111
    1N1/NA11.0151.042
    1N2/NA0.45380.43850.4421
    1T/2NA0.018750.017880.01840
    24NAu111
    2N1/NA11.081.042
    2N2/NA0.45380.64670.6379
    2T/2NA0.018750.023040.02720
    22NAm1→20.30.57063.529
    22NAm2→10.880.87353.228
    34NAu111
    3N1/NA11.0991.044
    3N2/NA0.45380.66480.6497
    3T/2NA0.018750.022870.02783
    32NAm1→20.80.58734.022
    32NAm2→11.760.94693.965
    44NAu111
    4N1/NA11.0861.068
    4N2/NA0.45380.67120.6493
    4T/2NA0.018750.022580.02592
    42NAm1→21.120.61902.574
    42NAm2→12.640.99103.798
    54NAu111
    5N1/NA0.36300.37130.3522
    5N2/NA0.16300.15400.1440
    5N3/NA0.063020.056730.05569
    5Ta/2NA0.0220.021580.02041
    5Tb/2NA0.0130.012250.01152
    64NAu111
    6N1/NA0.36300.37130.3344
    6N2/NA0.16300.178550.1527
    6N3/NA0.063020.059500.06112
    6Ta/2NA0.0220.029960.01749
    6Tb/2NA0.0130.012110.01249
    62NAma1→20.050.045620.0001828
    62NAma2→150.023450.0009895
    62NAmb1→220.0021150.003311
    62NAmb1→30.30.38971.505
    62NAmb2→10.0050.0079820.09358
    62NAmb2→30.30.00076374.4e-09
    62NAmb3→130.68313.398
    62NAmb3→231.7882.640
    74NAu111
    7N1/NA0.36300.38740.3138
    7N2/NA0.16300.17240.1445
    7N3/NA0.063020.058430.05589
    7Ta/2NA0.0220.024150.01296
    7Tb/2NA0.0130.011550.01115
    72NAma1→20.050.029820.7007
    72NAma2→1120.0074390.004225
    72NAmb1→260.00071660.0003699
    72NAmb1→30.30.000901923.1354
    72NAmb2→10.0050.00034130.0020024
    72NAmb2→30.30.00077130.0001095
    72NAmb3→130.0073623.232
    72NAmb3→230.40900.01191
    • Maximum-likelihood estimates of seven different demographic scenarios are shown. Many results are biased due to numerical errors in the calculation of the frequency spectra (see subsection Numerical errors and bias-corrected confidence intervals for a detailed discussion on how numerical errors affect the location of the maximum-likelihood peak). In the case of MultiPop, 40 polynomials were used in the two-population models and 35 polynomials in the three-population models. We used 100 grid points in all models approximated with finite-difference schemes. The observed AFS were constructed using Π = 50,000 independent loci produced with Monte Carlo simulations. The maximum-likelihood peak was found by means of the BFGS method, using the coordinates of the true peak as the initial point in this local optimization algorithm.

  • Table 3  Comparison of computing time
    ModelNo. polynomials (MPop)Computing time of MPop (sec)Grid size (∂a∂i)Computing time of ∂a∂i (sec)
    2150.27150.07
    2200.61500.10
    2251.101000.13
    2301.865001.43
    2352.9210005.68
    6153.1150.16
    6207.95500.86
    62517.311006.33
    63033.320068.41
    63557.41300253.85
    • Computing times required to evaluate an allele-frequency spectrum using MultiPop and ∂a∂i are shown. The demographic scenarios involved two populations and 50 chromosomes sampled per population or three populations and 20 chromosomes sampled per population. The CPU used to measure the computing times was an Intel Core(TM)2 Duo P8600 with speed 2.40 GHz.

  • Table 4  Inference of a four-population model for the human expansion out of Africa and peopling of the Americas
    Model parametersθ MPop95% C.I.θ ∂a∂i, 195% C.I.θ ∂a∂i, 295% C.I.
    NA10,4008,670–12,2007,3004,400–10,100
    NAF17,30010,900–29,30012,30011,500–13,900
    NB2,060346–5,0702,1001,400–2,900
    NEU01,7101,030–3,0201,000500–1,9001,500700–2,100
    rEU (generation−1)0.00550.00351–0.01030.0040.0015–0.00660.00230.0008–0.0045
    NAS0453210–800510310–910590320–800
    rAS (generation−1)0.0160.0102–0.03010.00550.0023–0.00880.00370.0016–0.006
    mAF→B (×10−5)6.060.0–13.62515–34
    mAF→B (×10−5)1.630.0–3.473.02.0–6.0
    mAF→B (×10−5)0.4870.0–1.072515–34
    mAF→B (×10−5)1.540.0–2.969.62.3–17.4
    TAF (yr)125,40054,300–250,000220,000100,000–510,000
    TB (yr)52,40036,000–80,800140,00040,000–270,000
    TEU→AS (yr)29,50023,500–38,00021,20017,200–26,50026,40018,100–43,100
    NMX03,2001,100–6,100800160–1,800
    rMX (generation−1)0.00710.0043–0.0110.0050.0014–0.0117
    TMX (yr)29,30023,000–37,50021,60016,300–26,900
    fMX (%)20.43.2–414842–60
    • Inference of parameters by means of maximum likelihood is shown. Confidence intervals were computed by means of nonparametric bootstrap. The estimated parameters θ with MultiPop correspond to the mean of the bootstrap distribution. The estimates by MultiPop are compared with the estimates by ∂a∂i with two three-population models. ∂a∂i 1 denotes the three-population model for the out-of-Africa event described in Gutenkunst et al. (2009). ∂a∂i 2 denotes the three-population model for the peopling of the Americas studied in Gutenkunst et al. (2009).

Additional Files

  • Figures
  • Tables
  • Files in this data supplement:

    • File S1: Supporting material (PDF, 1 MB)
Previous ArticleNext Article
Back to top

PUBLICATION INFORMATION

Volume 192 Issue 2, October 2012

Genetics: 192 (2)

ARTICLE CLASSIFICATION

INVESTIGATIONS
Population and evolutionary genetics
View this article with LENS
Email

Thank you for sharing this Genetics article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
Demographic Inference Using Spectral Methods on SNP Data, with an Analysis of the Human Out-of-Africa Expansion
(Your Name) has forwarded a page to you from Genetics
(Your Name) thought you would be interested in this article in Genetics.
Print
Alerts
Enter your email below to set up alert notifications for new article, or to manage your existing alerts.
SIGN UP OR SIGN IN WITH YOUR EMAIL
View PDF
Share

Demographic Inference Using Spectral Methods on SNP Data, with an Analysis of the Human Out-of-Africa Expansion

Sergio Lukić and Jody Hey
Genetics October 1, 2012 vol. 192 no. 2 619-639; https://doi.org/10.1534/genetics.112.141846
Sergio Lukić
*Department of Genetics, Rutgers University, Piscataway, New Jersey 08854†School of Natural Sciences, Institute for Advanced Study, Princeton, New Jersey 08540
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jody Hey
*Department of Genetics, Rutgers University, Piscataway, New Jersey 08854
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Permalink:
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation

Demographic Inference Using Spectral Methods on SNP Data, with an Analysis of the Human Out-of-Africa Expansion

Sergio Lukić and Jody Hey
Genetics October 1, 2012 vol. 192 no. 2 619-639; https://doi.org/10.1534/genetics.112.141846
Sergio Lukić
*Department of Genetics, Rutgers University, Piscataway, New Jersey 08854†School of Natural Sciences, Institute for Advanced Study, Princeton, New Jersey 08540
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jody Hey
*Department of Genetics, Rutgers University, Piscataway, New Jersey 08854
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero

Related Articles

Cited By

More in this TOC Section

Investigations

  • Selection To Increase Expression, Not Sequence Diversity, Precedes Gene Family Origin and Expansion in Rattlesnake Venom
  • A Novel Histone Crosstalk Pathway Important for Regulation of UV-Induced DNA Damage Repair in Saccharomyces cerevisiae
  • The RNAi Inheritance Machinery of Caenorhabditis elegans
Show more 3

Population and Evolutionary Genetics

  • Selection To Increase Expression, Not Sequence Diversity, Precedes Gene Family Origin and Expansion in Rattlesnake Venom
  • A Novel Histone Crosstalk Pathway Important for Regulation of UV-Induced DNA Damage Repair in Saccharomyces cerevisiae
  • The RNAi Inheritance Machinery of Caenorhabditis elegans
Show more 3
  • Top
  • Article
    • Abstract
    • Materials and Methods
    • Results
    • Discussion
    • Acknowledgments
    • Appendix
    • Footnotes
    • Literature Cited
  • Figures & Data
  • Supplemental
  • Info & Metrics

GSA

The Genetics Society of America (GSA), founded in 1931, is the professional membership organization for scientific researchers and educators in the field of genetics. Our members work to advance knowledge in the basic mechanisms of inheritance, from the molecular to the population level.

Online ISSN: 1943-2631

  • For Authors
  • For Reviewers
  • For Subscribers
  • Submit a Manuscript
  • Editorial Board
  • Press Releases

GET CONNECTED

RSS  Subscribe with RSS.

email  Subscribe via email. Sign up to receive alert notifications of new articles.

  • Facebook
  • Twitter
  • YouTube
  • LinkedIn
  • Google Plus

Copyright © 2017 by the Genetics Society of America

  • About GENETICS
  • Terms of use
  • Advertising
  • Permissions
  • Contact us
  • International access