IDT. Quality oligos. Every time.

Genetics, Vol. 158, 1321-1327, July 2001, Copyright © 2001

Disparity Index: A Simple Statistic to Measure and Test the Homogeneity of Substitution Patterns Between Molecular Sequences

Sudhir Kumara and Sudhindra R. Gadagkara
a Department of Biology, Arizona State University, Tempe, Arizona 85287-1501

Corresponding author: Sudhir Kumar, Life Sciences A 371, Department of Biology, Arizona State University, Tempe, AZ 85287-1501., s.kumar{at}asu.edu (E-mail)

Communicating editor: M. K. UYENOYAMA

A common assumption in comparative sequence analysis is that the sequences have evolved with the same pattern of nucleotide substitution (homogeneity of the evolutionary process). Violation of this assumption is known to adversely impact the accuracy of phylogenetic inference and tests of evolutionary hypotheses. Here we propose a disparity index, ID, which measures the observed difference in evolutionary patterns for a pair of sequences. On the basis of this index, we have developed a Monte Carlo procedure to test the homogeneity of the observed patterns. This test does not require a priori knowledge of the pattern of substitutions, extent of rate heterogeneity among sites, or the evolutionary relationship among sequences. Computer simulations show that the ID-test is more powerful than the commonly used {chi}2-test under a variety of biologically realistic models of sequence evolution. An application of this test in an analysis of 3789 pairs of orthologous human and mouse protein-coding genes reveals that the observed evolutionary patterns in neutral sites are not homogeneous in 41% of the genes, apparently due to shifts in G + C content. Thus, the proposed test can be used as a diagnostic tool to identify genes and lineages that have evolved with substantially different evolutionary processes as reflected in the observed patterns of change. Identification of such genes and lineages is an important early step in comparative genomics and molecular phylogenetic studies to discover evolutionary processes that have shaped organismal genomes.





This article has been cited by other articles:


Home page
Syst BiolHome page
N. C. Sheffield, H. Song, S. L. Cameron, and M. F. Whiting
Nonstationary Evolution and Compositional Heterogeneity in Beetle Mitochondrial Phylogenomics
Syst Biol, August 1, 2009; 58(4): 381 - 394.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
S. Kumar and A. Filipski
Multiple sequence alignment: In pursuit of homologous DNA positions
Genome Res., February 1, 2007; 17(2): 127 - 135.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
K. F. Gruber, R. S. Voss, and S. A. Jansa
Base-Compositional Heterogeneity in the RAG1 Locus among Didelphid Marsupials: Implications for Phylogenetic Inference and the Evolution of GC Content
Syst Biol, February 1, 2007; 56(1): 83 - 96.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
C.-H. Huang and J. Peng
Evolutionary conservation and diversification of Rh family genes and proteins
PNAS, October 25, 2005; 102(43): 15512 - 15517.
[Abstract] [Full Text] [PDF]


Home page
J. Exp. Biol.Home page
J. Spaethe and A. D. Briscoe
Molecular characterization and expression of the UV opsin in bumblebees: three ommatidial subtypes in the retina and a new photoreceptor organ in the lamina
J. Exp. Biol., June 15, 2005; 208(12): 2347 - 2361.
[Abstract] [Full Text] [PDF]


Home page
J. Exp. Biol.Home page
A. D. Briscoe and G. D. Bernard
Eyeshine and spectral tuning of long wavelength-sensitive rhodopsins: no evidence for red-sensitive photoreceptors among five Nymphalini butterfly species
J. Exp. Biol., February 15, 2005; 208(4): 687 - 696.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
L. S. Jermiin, S. Y.W. Ho, F. Ababneh, J. Robinson, and A. W.D. Larkum
The Biasing Effect of Compositional Heterogeneity on Phylogenetic Estimates May be Underestimated
Syst Biol, August 1, 2004; 53(4): 638 - 643.
[Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. Spaethe and A. D. Briscoe
Early Duplication and Functional Diversification of the Opsin Gene Family in Insects
Mol. Biol. Evol., August 1, 2004; 21(8): 1583 - 1594.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. J. Lercher, J.-V. Chamary, and L. D. Hurst
Genomic Regionality in Rates of Evolution Is Not Explained by Clustering of Genes of Comparable Expression Profile
Genome Res., June 1, 2004; 14(6): 1002 - 1013.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
K. Tamura, S. Subramanian, and S. Kumar
Temporal Patterns of Fruit Fly (Drosophila) Evolution Revealed by Mutation Clocks
Mol. Biol. Evol., January 1, 2004; 21(1): 36 - 44.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. S. Rosenberg and S. Kumar
Heterogeneity of Nucleotide Frequencies Among Evolutionary Lineages and Phylogenetic Inference
Mol. Biol. Evol., April 1, 2003; 20(4): 610 - 621.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. Yi, D. L. Ellsworth, and W.-H. Li
Slow Molecular Clocks in Old World Monkeys, Apes, and Humans
Mol. Biol. Evol., December 1, 2002; 19(12): 2191 - 2198.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
K. Tamura and S. Kumar
Evolutionary Distance Estimation Under Heterogeneous Substitution Pattern Among Lineages
Mol. Biol. Evol., October 1, 2002; 19(10): 1727 - 1736.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. Zurovcova and F. J. Ayala
Polymorphism Patterns in Two Tightly Linked Developmental Genes, Idgf1 and Idgf3, of Drosophila melanogaster
Genetics, September 1, 2002; 162(1): 177 - 188.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
A. D. Briscoe
Functional Diversification of Lepidopteran Opsins Following Gene Duplication
Mol. Biol. Evol., December 1, 2001; 18(12): 2270 - 2279.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. Kumar and S. Subramanian
Mutation rates in mammalian genomes
PNAS, January 22, 2002; 99(2): 803 - 808.
[Abstract] [Full Text] [PDF]