Genetics, Vol. 164, 1229-1236, July 2003, Copyright © 2003

Effect of Recombination on the Accuracy of the Likelihood Method for Detecting Positive Selection at Amino Acid Sites

Maria Anisimovaa,b, Rasmus Nielsenc, and Ziheng Yanga
a Department of Biology, University College London, London WC1E 6BT, United Kingdom,
b Center for Mathematics and Physics in the Life Sciences and Experimental Biology (CoMPLEX), University College London, London WC1E 6BT, United Kingdom
c Department of Biometrics, Cornell University, Ithaca, New York 14853-7801

Corresponding author: Maria Anisimova, University College London, Darwin Bldg., Gower St., London WC1E 6BT, United Kingdom., m.anisimova{at}ucl.ac.uk (E-mail)

Communicating editor: J. HEIN

Maximum-likelihood methods based on models of codon substitution accounting for heterogeneous selective pressures across sites have proved to be powerful in detecting positive selection in protein-coding DNA sequences. Those methods are phylogeny based and do not account for the effects of recombination. When recombination occurs, such as in population data, no unique tree topology can describe the evolutionary history of the whole sequence. This violation of assumptions raises serious concerns about the likelihood method for detecting positive selection. Here we use computer simulation to evaluate the reliability of the likelihood-ratio test (LRT) for positive selection in the presence of recombination. We examine three tests based on different models of variable selective pressures among sites. Sequences are simulated using a coalescent model with recombination and analyzed using codon-based likelihood models ignoring recombination. We find that the LRT is robust to low levels of recombination (with fewer than three recombination events in the history of a sample of 10 sequences). However, at higher levels of recombination, the type I error rate can be as high as 90%, especially when the null model in the LRT is unrealistic, and the test often mistakes recombination as evidence for positive selection. The test that compares the more realistic models M7 (ß) against M8 (ß and {omega}) is more robust to recombination, where the null model M7 allows the positive selection pressure to vary between 0 and 1 (and so does not account for positive selection), and the alternative model M8 allows an additional discrete class with {omega} = dN/dS that could be estimated to be >1 (and thus accounts for positive selection). Identification of sites under positive selection by the empirical Bayes method appears to be less affected than the LRT by recombination.





This article has been cited by other articles:


Home page
Brief BioinformHome page
W. Delport, K. Scheffler, and C. Seoighe
Models of coding sequence evolution
Brief Bioinform, October 29, 2008; (2008) bbn049v1.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. F. Storz, F. G. Hoffmann, J. C. Opazo, and H. Moriyama
Adaptive Functional Divergence Among Triplicated {alpha}-Globin Genes in Rodents
Genetics, March 1, 2008; 178(3): 1623 - 1638.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
L. Petersen, J. P. Bollback, M. Dimmic, M. Hubisz, and R. Nielsen
Genes under positive selection in Escherichia coli
Genome Res., September 1, 2007; 17(9): 1336 - 1343.
[Abstract] [Full Text] [PDF]


Home page
J. Gen. Virol.Home page
M. F. Ducatez, C. M. Olinger, A. A. Owoade, Z. Tarnagda, M. C. Tahita, A. Sow, S. De Landtsheer, W. Ammerlaan, J. B. Ouedraogo, A. D. M. E. Osterhaus, et al.
Molecular and antigenic evolution and geographical spread of H5N1 highly pathogenic avian influenza viruses in western Africa
J. Gen. Virol., August 1, 2007; 88(8): 2297 - 2306.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
R. H. Orsi, D. R. Ripoll, M. Yeung, K. K. Nightingale, and M. Wiedmann
Recombination and positive selection contribute to evolution of Listeria monocytogenes inlA
Microbiology, August 1, 2007; 153(8): 2666 - 2678.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
Z. Yang
PAML 4: Phylogenetic Analysis by Maximum Likelihood
Mol. Biol. Evol., August 1, 2007; 24(8): 1586 - 1591.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
C. Seoighe, F. Ketwaroo, V. Pillay, K. Scheffler, N. Wood, R. Duffet, M. Zvelebil, N. Martinson, J. McIntyre, L. Morris, et al.
A Model of Directional Selection Applied to the Evolution of Drug Resistance in HIV-1
Mol. Biol. Evol., April 1, 2007; 24(4): 1025 - 1031.
[Abstract] [Full Text] [PDF]


Home page
J. Gen. Virol.Home page
R. J. P. Brown, A. W. Tarr, C. P. McClure, V. S. Juttla, N. Tagiuri, W. L. Irving, and J. K. Ball
Cross-genotype characterization of genetic diversity and molecular adaptation in hepatitis C virus envelope glycoprotein genes
J. Gen. Virol., February 1, 2007; 88(2): 458 - 469.
[Abstract] [Full Text] [PDF]


Home page
Integr. Comp. Biol.Home page
S. Cohen, J. Tirindelli, M. Gomez-Chiarri, and D. Nacci
Functional implications of Major Histocompatibility (MH) variation using estuarine fish populations
Integr. Comp. Biol., December 1, 2006; 46(6): 1016 - 1029.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
S. Foret and R. Maleszka
Function and evolution of a gene family encoding odorant binding-like proteins in a social insect, the honey bee (Apis mellifera)
Genome Res., November 1, 2006; 16(11): 1404 - 1413.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
C. T. T. Edwards, E. C. Holmes, O. G. Pybus, D. J. Wilson, R. P. Viscidi, E. J. Abrams, R. E. Phillips, and A. J. Drummond
Evolution of the Human Immunodeficiency Virus Envelope Gene Is Dominated by Purifying Selection
Genetics, November 1, 2006; 174(3): 1441 - 1453.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Scheffler, D. P. Martin, and C. Seoighe
Robust inference of positive selection from recombining coding sequences
Bioinformatics, October 15, 2006; 22(20): 2493 - 2499.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
T. H. M. Mes and M. Doeleman
Positive Selection on Transposase Genes of Insertion Sequences in the Crocosphaera watsonii Genome.
J. Bacteriol., October 1, 2006; 188(20): 7176 - 7185.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. L. Kosakovsky Pond, D. Posada, M. B. Gravenor, C. H. Woelk, and S. D. W. Frost
Automated Phylogenetic Detection of Recombination Using a Genetic Algorithm
Mol. Biol. Evol., October 1, 2006; 23(10): 1891 - 1901.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
H. Ichihara, K.-i. Kuma, and H. Toh
Positive Selection in the ComC-ComD System of Streptococcal Species.
J. Bacteriol., September 1, 2006; 188(17): 6429 - 6434.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
L. M. Turner and H. E. Hoekstra
Adaptive Evolution of Fertilization Proteins within a Genus: Variation in ZP2 and ZP3 in Deer Mice (Peromyscus)
Mol. Biol. Evol., September 1, 2006; 23(9): 1656 - 1669.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. L. Chen, C.-S. Hung, J. Xu, C. S. Reigstad, V. Magrini, A. Sabo, D. Blasiar, T. Bieri, R. R. Meyer, P. Ozersky, et al.
Identification of genes subject to positive selection in uropathogenic strains of Escherichia coli: A comparative genomics approach
PNAS, April 11, 2006; 103(15): 5977 - 5982.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. P. Rooney, J. L. Swezey, R. Friedman, D. W. Hecht, and C. W. Maddox
Analysis of Core Housekeeping and Virulence Genes Reveals Cryptic Lineages of Clostridium perfringens That Are Associated With Distinct Disease Presentations
Genetics, April 1, 2006; 172(4): 2081 - 2092.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
A. Carvajal-Rodriguez, K. A. Crandall, and D. Posada
Recombination Estimation Under Complex Evolutionary Models with the Coalescent Composite-Likelihood Method
Mol. Biol. Evol., April 1, 2006; 23(4): 817 - 827.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
D. J. Wilson and G. McVean
Estimating Diversifying Selection and Functional Constraint in the Presence of Recombination
Genetics, March 1, 2006; 172(3): 1411 - 1425.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
K. Scheffler and C. Seoighe
A Bayesian Model Comparison Approach to Inferring Positive Selection
Mol. Biol. Evol., December 1, 2005; 22(12): 2531 - 2540.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. Mondragon-Palomino and B. S. Gaut
Gene Conversion and the Evolution of Three Leucine-Rich Repeat Gene Families in Arabidopsis thaliana
Mol. Biol. Evol., December 1, 2005; 22(12): 2444 - 2456.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. Perez-Losada, R. P. Viscidi, J. C. Demma, J. Zenilman, and K. A. Crandall
Population Genetics of Neisseria gonorrhoeae in a High-Prevalence Community Using a Hypervariable Outer Membrane porB and 13 Slowly Evolving Housekeeping Genes
Mol. Biol. Evol., September 1, 2005; 22(9): 1887 - 1902.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
K. K. Nightingale, K. Windham, and M. Wiedmann
Evolution and Molecular Phylogeny of Listeria monocytogenes Isolated from Human and Animal Listeriosis Cases and Foods
J. Bacteriol., August 15, 2005; 187(16): 5537 - 5551.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
C. T. T. Edwards, K. J. Pfafferott, P. J. R. Goulder, R. E. Phillips, and E. C. Holmes
Intrapatient Escape in the A*0201-Restricted Epitope SLYNTVATL Drives Evolution of Human Immunodeficiency Virus Type 1 at the Population Level
J. Virol., July 15, 2005; 79(14): 9363 - 9366.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
Q. Wu
Comparative Genomics and Diversifying Selection of the Clustered Vertebrate Protocadherin Genes
Genetics, April 1, 2005; 169(4): 2179 - 2188.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
Z. Yang, W. S.W. Wong, and R. Nielsen
Bayes Empirical Bayes Inference of Amino Acid Sites Under Positive Selection
Mol. Biol. Evol., April 1, 2005; 22(4): 1107 - 1118.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. Consuegra, H.-J. Megens, H. Schaschl, K. Leon, R. J. M. Stet, and W. C. Jordan
Rapid Evolution of the MH Class I Locus Results in Different Allelic Compositions in Recently Diverged Populations of Atlantic Salmon
Mol. Biol. Evol., April 1, 2005; 22(4): 1095 - 1106.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
E. E. Smith, E. H. Sims, D. H. Spencer, R. Kaul, and M. V. Olson
Evidence for Diversifying Selection at the Pyoverdine Locus of Pseudomonas aeruginosa
J. Bacteriol., March 15, 2005; 187(6): 2138 - 2147.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. Williamson, S. M. Perry, C. D. Bustamante, M. E. Orive, M. N. Stearns, and J. K. Kelly
A Statistical Characterization of Consistent Patterns of Human Immunodeficiency Virus Evolution Within Infected Patients
Mol. Biol. Evol., March 1, 2005; 22(3): 456 - 468.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
H. Kuang, S.-S. Woo, B. C. Meyers, E. Nevo, and R. W. Michelmore
Multiple Genetic Processes Result in Heterogeneous Rates of Evolution within the Major Cluster Disease Resistance Genes in Lettuce
PLANT CELL, November 1, 2004; 16(11): 2870 - 2894.
[Abstract] [Full Text] [PDF]


Home page
J. Gen. Virol.Home page
E. R. Chare and E. C. Holmes
Selection pressures in the capsid genes of plant RNA viruses reflect mode of transmission
J. Gen. Virol., October 1, 2004; 85(10): 3149 - 3157.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
W. S. W. Wong, Z. Yang, N. Goldman, and R. Nielsen
Accuracy and Power of Statistical Methods for Detecting Adaptive Evolution in Protein Coding Sequences and for Identifying Positively Selected Sites
Genetics, October 1, 2004; 168(2): 1041 - 1051.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
S. Whelan and N. Goldman
Estimating the Frequency of Events That Cause Multiple-Nucleotide Changes
Genetics, August 1, 2004; 167(4): 2027 - 2043.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
J. M. Burrows, L. Bromham, M. Woolfit, G. Piganeau, J. Tellam, G. Connolly, N. Webb, L. Poulsen, L. Cooper, S. R. Burrows, et al.
Selection Pressure-Driven Evolution of the Epstein-Barr Virus-Encoded Oncogene LMP1 in Virus Isolates from Southeast Asia
J. Virol., July 1, 2004; 78(13): 7131 - 7137.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
T. de Oliveira, M. Salemi, M. Gordon, A.-M. Vandamme, E. J. van Rensburg, S. Engelbrecht, H. M. Coovadia, and S. Cassol
Mapping Sites of Positive Selection and Amino Acid Diversification in the HIV Genome: An Alternative Approach to Vaccine Design?
Genetics, July 1, 2004; 167(3): 1047 - 1058.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
D. A. Haake, M. A. Suchard, M. M. Kelley, M. Dundoo, D. P. Alt, and R. L. Zuerner
Molecular Evolution and Mosaicism of Leptospiral Outer Membrane Proteins Involves Horizontal DNA Transfer
J. Bacteriol., May 1, 2004; 186(9): 2818 - 2828.
[Abstract] [Full Text] [PDF]


Home page
J. Gen. Virol.Home page
D. T. Haydon, A. D. S. Bastos, and P. Awadalla
Low linkage disequilibrium indicative of recombination in foot-and-mouth disease virus gene sequence alignments
J. Gen. Virol., May 1, 2004; 85(5): 1095 - 1100.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
R. P. Lane, J. Young, T. Newman, and B. J. Trask
Species Specificity in Rodent Pheromone Receptor Repertoires
Genome Res., April 1, 2004; 14(4): 603 - 608.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
D. Shriner, R. Shankarappa, M. A. Jensen, D. C. Nickle, J. E. Mittler, J. B. Margolick, and J. I. Mullins
Influence of Random Genetic Drift on Human Immunodeficiency Virus Type 1 env Evolution During Chronic Infection
Genetics, March 1, 2004; 166(3): 1155 - 1164.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
A. Kalia and D. E. Bessen
Natural Selection and Evolution of Streptococcal Virulence Genes Involved in Tissue-Specific Adaptations
J. Bacteriol., January 1, 2004; 186(1): 110 - 121.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
T. D. Andrews and T. Gojobori
Strong Positive Selection and Recombination Drive the Antigenic Variation of the PilE Protein of the Human Pathogen Neisseria meningitidis
Genetics, January 1, 2004; 166(1): 25 - 32.
[Abstract] [Full Text] [PDF]