- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Faith, J. J.
- Articles by Pollock, D. D.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Faith, J. J.
- Articles by Pollock, D. D.
Likelihood Analysis of Asymmetrical Mutation Bias Gradients in Vertebrate Mitochondrial Genomes
Jeremiah J. Faith1,a and David D. Pollockaa Department of Biological Sciences and Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana 70803
Corresponding author: David D. Pollock, Louisiana State University, Baton Rouge, LA 70803., dpollock{at}lsu.edu (E-mail)
| ABSTRACT |
|---|
Protein-coding genes in mitochondrial genomes have varying degrees of asymmetric skew in base frequencies at the third codon position. The variation in skew among genes appears to be caused by varying durations of time that the heavy strand spends in the mutagenic single-strand state during replication (DssH). The primary data used to study skew have been the gene-by-gene base frequencies in individual taxa, which provide little information on exactly what kinds of mutations are responsible for the base frequency skew. To assess the contribution of individual mutation components to the ancestral vertebrate substitution pattern, here we analyze a large data set of complete vertebrate mitochondrial genomes in a phylogeny-based likelihood context. This also allows us to evaluate the change in skew continuously along the mitochondrial genome and to directly estimate relative substitution rates. Our results indicate that different types of mutation respond differently to the DssH gradient. A primary role for hydrolytic deamination of cytosines in creating variance in skew among genes was not supported, but rather linearly increasing rates of mutation from adenine to hypoxanthine with DssH appear to drive regional differences in skew. Substitutions due to hydrolytic deamination of cytosines, although common, appear to quickly saturate, possibly due to stabilization by the mitochondrial DNA single-strand-binding protein. These results should form the basis of more realistic models of DNA and protein evolution in mitochondria.
VERTEBRATE mitochondrial DNA (mtDNA) has been a prevalent model system for genomic biodiversity and molecular evolution research largely due to the genome's manageable size of
17 kb, its high copy number, and a variety of protein and structural RNA-encoding genes that have proven useful for phylogenetic inference (![]()
![]()
![]()
![]()
![]()
The directions of transcription for the 37 genes of the circular mitochondrial genome are unevenly distributed between the heavy (H) strand and the light (L) strand (so named for their different buoyancy densities in CsCl gradients). The light strand of mtDNA codes for 8 tRNAs and 1 protein, while the heavy strand codes for 14 tRNAs, 2 rRNAs, and 12 proteins.
There is strong heterogeneity in the rate of substitution in vertebrate mitochondria, both between taxonomic groups and among sites. Because of the importance of substitution rates in evolutionary analysis, there is considerable interest in the underlying transcription and replication processes that give rise to differences in mitochondrial mutation processes. Much work has gone into deciphering the mechanism of mtDNA replication, especially in the mouse and human (![]()
![]()
-polymerase extends an RNA primer to produce a nascent H strand, while the displaced parental H strand is then coated with single-strand-mtDNA-binding proteins (mtSSBs). About two-thirds of the way around the genome, the replication fork reaches the origin of L-strand replication (OL) and a secondary structure is formed that binds a second polymerase molecule to begin synthesis of a new L strand. The parental H strand becomes double stranded as the L strand is synthesized, but for much of the genome considerable time passes before this occurs. Thus, there is a gradient along the genome of duration of time spent in the single-strand state (DssH; ![]()
![]()
2 hr to complete, with
1 hr spent on DNA synthesis (see ![]()
![]()
|
It has recently been inferred that the strand asymmetry and gradients of exposure to mutagenesis in the single-strand state during replication lead to gradients of asymmetry in the nucleotide substitution process and thereby to gradients of asymmetry in base composition between the strands. In 1991, it was noticed that T/C ratios at third codon positions in vertebrate mitochondria vary along the genome (![]()
![]()
![]()
![]()
![]()
Combining these observations, it was determined that asymmetric compositional bias (skew) in third codon positions and other sites depends on which strand the gene is coded on and on how long the gene spends in the single-strand state during replication (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Previous analyses have considered the effect of asymmetric mutation bias primarily in terms of base frequencies, but as ![]()
![]()
| METHODS |
|---|
Sequence alignment, manipulation, and pruning:
Sequences were obtained for all 118 complete vertebrate mitochondrial genomes that had been submitted to GenBank and revised by the National Center for Biotechnology Information RefSeq program (![]()
![]()
Since gene order is hypothesized to be critical for mutation bias, gene orders of all 118 vertebrate genomes in the database were scanned to determine the largest subset of taxa with identical gene order (including order, duplication, and presence/absence of the origin of light-strand replication and the D-loop). Primates and the European hedgehog are known to have faster rates of evolution than other vertebrates (![]()
|
|
From the complete set of alignments, four data sets were created for analysis. For phylogenetic tree reconstruction, the ribosomal RNAs (rRNAs) and protein-coding regions were concatenated. This data set was 15,054 bp long. Although sites in these genes are probably evolving differently, and the average model is not necessarily accurate for each site, a reasonable joint model is not available and the benefits of a large data set would appear to outweigh concerns over model variations, particularly since the purpose of this analysis is discovery of new model complexity, not improved phylogenetic reconstruction. From the same set of 12 concatenated proteins, third codon positions were parsed out for conserved fourfold redundant sites (4x) and the two types of conserved twofold (2x) redundant sites, coding for either purines (2x R) or pyrimidines (2x Y). Redundancy at a specific codon in a single sequence is easily defined as the number of codons that code for the same amino acid. For example, a site coding for alanine would be fourfold redundant with the vertebrate mitochondrial genetic code, since there are four possible codons (all with the same nucleotides at the first and second codon positions) for this amino acid. We defined conserved redundancy in the alignment as those sites for which all codons in each sequence were in the same redundancy class. For example, the conserved fourfold redundant sites included those sites with any of the amino acids threonine, alanine, arginine, valine, glycine, or proline, but no other amino acids or gaps. This measure of redundancy class is preferable to that measured in a single genome because the site has probably been in the redundancy class over the entire evolutionary tree, whereas for a single genome a site may have recently changed its redundancy class, and therefore its base frequencies may not reflect the equilibrium values for that class. Sites coding for serine and leucine were considered sixfold redundant sites and not included, and methionine at the first amino acid position was also excluded. The 4x, 2x R, and 2x Y data sets were 583, 221, and 445 bp, respectively. Third codon positions from the different redundancy classes do not have variable selective constraints for protein function. The 4x sites are free to vary among all nucleotides, while the 2x sites are restricted to exchange between only two nucleotides [i.e., purine to purine (2x R) or pyrimidine to pyrimidine (2x Y)]. The individual protein-coding regions were also sometimes considered separately for comparison to earlier results, but in these cases ATP8, ND3, and ND4L were discarded, as they contained too few sites to accurately estimate model parameters (see RESULTS for further consideration on tradeoffs among number of model parameters, amount of data, and comparative inference). ND6 was excluded from all redundancy class analyses since it is the only protein coded on the L strand.
Phylogenetic tree reconstruction:
A neighbor-joining tree (NJTree1) was calculated with PAUP* (![]()
![]()
(general time reversible with rates gamma distributed across sites) model. A final neighbor-joining tree (NJTree2) was calculated using the new parameters and maximum-likelihood distance estimates. It has been shown that substitution model parameters are relatively insensitive to errors in the phylogenetic tree estimate as long as the tree is approximately right (![]()
![]()
Calculation of statistics and parameters:
Base frequencies and substitution rates were obtained with PAML v3.1 (![]()
ij
j, where
ij =
ji is the symmetric rate parameter governing exchange between the nucleotides, and
j is the equilibrium frequency of the nucleotide mutated to. The HKY model (![]()
, the ratio of the transition rate parameter (
), and the transversion rate parameter (ß).
For each of the redundancy class data sets, GC and AT skews were calculated using the formula of ![]()
![]() |
(1) |
where fA, fC, fG, and fT are the empirical frequencies of each nucleotide. In the absence of strand bias, fG should equal fC, and fA should equal fT due to base-pairing rules and the fact that the same process are by definition occurring on both strands (![]()
![]()
For a single gene, DssH (the duration of the single-strand state of the parental H strand) was calculated as in ![]()
![]()
for ND1 and ND2, the two genes behind the origins of replication, and DssH = 2((OL -
)/L) for the remaining genes (Fig 1), where L is the length of the genome, OL is the position of the light-strand origin of replication, and
is the average position of the gene. This calculation assumes that the movement of the replication forks is constant along the genome and equal for both strands. Position numbers began at the start of tRNA-Phe (the first position after the D-loop) and increased in the direction of H-strand synthesis during replication. For gene alignments, DssH was calculated for each gene and each site as the average of all organisms in the alignment. We obtained the same relative gene order with respect to DssH as ![]()
To evaluate the effect of DssH continuously across the genome, the most likely model for both low and high DssH values was calculated, and comparative support for these two models was evaluated at each 4x site. The 4x sites with the 70 lowest and 70 highest DssH values were used to obtain parameters for the two extreme models. Maximum-likelihood estimates of model parameters were obtained using the fixed topology and branch lengths of NJTree2 under the GTR model with empirical base frequencies and using PAUP* (![]()
![]()
ln L, the difference in log-likelihood values for the two models. The difference in log-likelihood for the 70 lowest and 70 highest DssH sites evaluated separately and jointly was also calculated. The joint analysis provides the likelihood under the assumption that these two regions are evolving under the same model, and twice this
ln L statistic has an expected distribution of
2 with 9 d.f. A large
ln L indicates that the null hypothesis of a single model for both regions should be rejected, i.e., that these two regions are probably evolving with different substitution rates.
It is of interest to know how much of the site-specific support derives from rate parameters and how much derives from base frequencies. To address this, two further sets of models were obtained: the first used the rate parameters from the previously calculated high and low DssH sets, but used base frequencies obtained from an ML analysis of the entire 4x data set (6 d.f. between independent and joint models); the second, conversely, used base frequencies from the high and low sets, but rate parameters from the entire data set (3 d.f. difference between independent and joint models). Site-specific support was evaluated using
ln L, as before.
| RESULTS |
|---|
Skew and DssH:
In agreement with the results of ![]()
25 sites each. For these partitions, a linear regression for AT skew is also significant (y = 0.295x + 0.242, R2 = 0.687; P < 0.001), as is the GC skew (y = 0.156x - 0.631, R2 = 0.373; P = 0.001). These data points may be distributed on a curved rather than a straight line, however, and logarithmic curves (Fig 3) give better fits in both cases [for the AT curve, y = 0.113 ln(x) + 0.513, R2 = 0.750, P < 0.001; for the GC curve, y = -0.069 ln(x) - 0.782, R2 = 0.581, P < 0.001)]. Although the GC skew appears to fall early on with increasing DssH in both the gene-by-gene graph and the equal-partition graph, the first 25 4x sites already have a GC skew of -0.623, even though these sites spend relatively little time in the single-strand state. The last 25 4x sites have a GC skew of -0.766, which is close to -1.0, the largest possible negative skew.
|
Substitution rates:
Evaluation of changes in substitution rates can establish which types of substitution are most prevalent in the single-strand state. Since it has been hypothesized that transitions are most important, it is useful to analyze the 2x redundant sites in addition to the 4x sites, since transversions at the 4x sites can potentially confound analysis of transition rates. Linear regression of substitution rates on DssH for third codon positions in each gene (Table 2) were performed for 4x, 2x R, and 2x Y sites (see Table 3 for slopes, R2, and rates). For the 4x sites, the TC/CT (Fig 4) and AT/TA regressions are most significant, while nearly all regressions involving G are not (e.g., AG/GA, Fig 4). In a strikingly similar pattern, the regressions of substitution rates for 2x Y sites were significant, while for the 2x R sites they were not (Fig 5). Notably, the slopes of increase in pyrimidine transitions are extremely similar for both 4x and 2x Y sites. A regression of the transition/transversion ratio,
=
/ß, on DssH shows a small but significant increasing trend with DssH (y = 1.94x + 5.36, R2 = 0.545, P = 0.023; Fig 6).
|
|
|
|
|
It is worth noting here that there are explicit tradeoffs in this analysis among the number of parameters being estimated, the size of the data sets, and the accuracy of parameter estimates. When estimating skew, which is dependent only on the four equilibrium frequencies (three independent parameters), we were able to obtain accurate estimates for 23 4x data sets of only 25 sites each. For estimating substitution rates at the 4x sites, there are another six parameters for the reversible model, and the gene average of 63 sites (Table 2) is more appropriate. Fewer sites are necessary (and fewer are available) for the 2x sites, since only one free equilibrium frequency parameter and one rate parameter are in the 2x models. With a greater number of densely sampled taxa, we might be able to accurately sample smaller gene regions, and if the samples were more closely related there might be more sites in the conserved redundancy classes. It is clearly critical that the topology and branch lengths in these analyses are obtained from the entire genome data set and not reestimated for these small data sets, since the addition of so many more free parameters would greatly reduce the precision of substitution rate estimates.
Site-specific analyses:
A regression of relative site-specific support for the two most extreme models (derived from the 70 lowest and highest DssH values) was extremely significant, despite a great deal of noise in the plot (y = -1.62x + 0.335, R2 = 0.115, P < 0.001; Fig 7). The slope was still extremely significant for a regression on the middle points, with the first 70 and last 70 points removed (y = -1.63x + 0.261, R2 = 0.073, P < 0.001). Separate analyses of models with divergent base frequencies only, or divergent rate parameters only, showed that most of the significance was in the equilibrium frequencies (y = -1.38x + 0.312, R2 = 0.098, P < 0.001) rather than in the rate parameters (y = -0.122x + 0.057, R2 = 0.023, P < 0.001; Fig 8). Twice the difference in log-likelihood scores for the smallest and largest 70 sites evaluated both separately and together was 69.03. With only nine free parameters different between these two nested models, the probability that these two sets of sites evolve under the same model is <0.001.
|
|
| DISCUSSION |
|---|
Our analysis indicates that for vertebrates with the "ancestral" gene order and replication process, mutation patterns on the heavy strand during mtDNA replication lead to a strong and consistent increasing gradient of T
C substitutions on the light strand (Fig 9). This is consistent with the logarithmic increase of the AT skew with time in the single-strand state (DssH), and a similar linear increase in estimated rates of T
C substitutions is seen for both 4x and 2x Y redundant sites. The conserved 2x Y sites provide particularly strong evidence, as there is no possibility that this relationship is confounded by variable or unequal transversion rates. In COI, the gene with the smallest DssH value, there is no asymmetric bias in T
C over C
T substitutions for either the 4x or the 2x Y sites, indicating that the development of such a bias in genes with higher DssH values is entirely attributable to time spent in the single-strand state. The linear increase in
, the ratio of transition to transversion rate parameters, is also consistent with this interpretation, although it does not exclude a possible gradient in any of the transversion substitutions. The significant linear regression of relative site-specific support for the extreme high and low DssH value models also lends support to a continuous gradient of mixed substitution rates with changing DssH.
|
It has been hypothesized that C
U mutations on the heavy strand are the most common mutation in the single-strand state, which should lead to an even stronger gradient of G
A than T
C substitutions on the light strand (![]()
A substitutions in either the 4x or the 2x R sites. In both data sets we instead see a strong but nearly constant bias in favor of G
A substitutions over A
G substitutions. This bias is larger than the bias of T
C to C
T substitutions in 4x sites at all points in the genome.
Given the empirical evidence for up to 200-fold higher rates of cytosine deamination in the single-strand vs. double-strand states (![]()
![]()
![]()
![]()
![]()
![]()
T substitution rate biases are not detectable in the nontranscribed strand. An analysis of conserved 4x redundant sites in ND6, the only gene transcribed on the light strand, gave base frequencies of 0.1701 for A, 0.0527 for C, 0.2942 for G, and 0.4830 for T on the light strand, meaning that AT skew on the heavy strand was 0.479, while the GC skew was -0.69. ND6 has a DssH of 1.04, intermediate between ND5 and CYTB, and its AT skew is compatible with this DssH in the absence of any effect of transcription. The GC skew is slightly less negative than the average of heavy-strand-encoded genes with similar DssH values (-0.758), which may be due to sampling error or may indicate a small effect due to transcription, but if so the change is in a direction opposite to expectation. Thus, both transcription and codon bias due to selection play only a minor role, if any, in producing the gradients of asymmetric bias at redundant sites.
A plateau in the rate of light-strand G
A substitutions combined with a linear increase in light-strand T
C substitutions can be explained by either a plateau in the heavy-strand C
U mutation rate or a selective constraint against loss of too many G's in the light strand. A plateau in the C
U mutation rate could be plausibly associated with the coating of the single strand with mtSSBs [which occurs in
5 min (![]()
C substitutions on the light strand, apparently continue unabated in the face of this protection against further C
U deaminations. If selection plays a role, the constraint against loss of G's could be either a general selective pressure on the frequency of G's or a specific pressure against loss of G at specific sites. In the latter case, the G-selected sites would evolve more slowly than unrestricted sites, and therefore the slowest sites should be more heavily biased toward G. We tested this by running a GTR
model with 100 rate categories and fixed NJTree2 on the 4x sites, selecting the 20 sites with the lowest posterior rate probability and evaluating their base composition. The percentage of G content at these sites was 3.9 as opposed to 4.8% G content for all 4x sites combined. Thus, there is no support for site-specific constraints on selection for G in the 4x sites. A linear regression of posterior rate probabilities was also calculated against DssH, but no significant trend was found (slope = -0.11, R2 = 0.002, P = 0.284). There is a great deal of noise in this analysis, but this result is compatible with the hypothesis that the G
A substitutions have saturated early on (Fig 9).
It is worth noting that in the skew plots and in the
plots, and to a lesser extent in the substitution rate plots, points attributable to ND1 and ND2 appear slightly misplaced. These deviations from the rest of the genes could be corrected by a shift in their DssH to slightly higher values. An important assumption in the calculation of DssH is that the length of time in the single-strand state is well predicted by the length of the genome that the two replication forks must traverse and thus that the two replication forks travel at equal and constant speeds. A misplacement of ND1 and ND2 with these calculations might be caused by either a slower rate of light-strand replication or a slowdown of light-strand replication as it crosses secondary structure in the rRNA-coding and control regions.
It appears from this analysis that variation in different kinds of DNA substitutions along the genome can be directly linked to functional aspects of the replication system in vertebrate mitochondria. Although we attempted to determine the ancestral vertebrate substitution process in this study, it is exciting to consider that variation in substitution patterns during the course of evolution might be interpretable in terms of changes in the function of particular proteins, rather than uninterpretable alterations in the average nucleotide frequencies that result. We removed the taxa with the most potential to have changed from the ancestral evolutionary pattern, but there is still no guarantee that these patterns do not change within our data set. With this model, differences in the slope and intercept of the T
C substitution patterns (on the light strand), along with the initial rate of increase and saturation level of the G
A substitutions (Fig 9), can be linked to changes in the rate of replication, the number of replications per generation in the germ line, the efficiency of light-strand replication initiation, the functionality of protection offered by the mtSSB protein, and the location of the origins of replication.
Having definitive knowledge of the evolutionary behavior at individual sites can allow large improvements in phylogenetic inference (![]()
![]()
While the vertebrate mitochondrial genomes are the only taxonomic group with a large set of densely sampled complete genomes with the same gene order, it would be of great interest if new genomes that allow comparative analysis of changes in the mitochondrial replication process become available. Comparison of densely sampled data sets from more closely related organisms will allow evaluation of the degree to which such changes occur. On the basis of current rates of mitochondrial genome sequencing (the number of available complete vertebrate mitochondrial genomes has approximately doubled since this study was initiated), useful data sets are likely to appear in the near future, although current sampling strategies tend to emphasize sampling divergent lineages. We hope that this study will help motivate a focus on higher-density sampling of closely related organisms for the express purpose of more accurately analyzing substitution processes across the genome.
| FOOTNOTES |
|---|
1 Present address: Cold Spring Harbor Laboratory, 1 Bungtown Rd., P.O. Box 100, Cold Spring Harbor, NY 11724. ![]()
| ACKNOWLEDGMENTS |
|---|
We thank Marly Dows and Carla Haslauer for assistance with data preparation and manipulation. We thank Frank Burbrink, Jim McGuire, Mohamed Noor, and two anonymous reviewers for constructive comments on the manuscript. D.D.P. and J.J.F. were supported by a research competitiveness subprogram grant from the State of Louisiana [LEQSF (2001-04)-RD-A-08], and D.D.P. was also supported by the State of Louisiana's Millennium Research Program (Biological Computation and Visualization Center) and by National Institutes of Health grant GM-65612.
Manuscript received August 2, 2002; Accepted for publication June 17, 2003.
| LITERATURE CITED |
|---|
AMES, B. N., M. K. SHIGENAGA, and T. M. HAGEN, 1995 Mitochondrial decay in aging. Biochim. Biophys. Acta 1271:165-170.[Medline]
ASAKAWA, S., Y. KUMAZAWA, T. ARAKI, H. HIMENO, and K. MIURA et al., 1991 Strand-specific nucleotide composition bias in echinoderm and vertebrate mitochondrial genomes. J. Mol. Evol. 32:511-520.[Medline]
BIELAWSKI, J. P. and J. R. GOLD, 2002 Mutation patterns of mitochondrial H- and L-strand DNA in closely related Cyprinid fishes. Genetics 161:1589-1597.
CLAYTON, D. A., 2000 Transcription and replication of mitochondrial DNA. Hum. Reprod. 15:11-17.
DELORME, M. O. and A. HENAUT, 1991 Codon usage is imposed by the gene location in the transcription unit. Curr. Genet. 20:353-358.[Medline]
FRANCINO, M. P. and H. OCHMAN, 1997 Strand asymmetries in DNA evolution. Trends Genet. 13:240-245.[Medline]
FREDERICO, L. A., T. A. KUNKEL, and B. R. SHAW, 1990 A sensitive genetic assay for the detection of cytosine deamination: determination of rate constants and the activation energy. Biochemistry 29:2532-2537.[Medline]
GISSI, C., A. REYES, G. PESOLE, and C. SACCONE, 2000 Lineage-specific evolutionary rate in mammalian mtDNA. Mol. Biol. Evol. 17:1022-1031.
HASEGAWA, M., H. KISHINO, and T. YANO, 1985 Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. J. Mol. Evol. 22:160-174.[Medline]
HOLT, I. J., H. E. LORIMER, and H. T. JACOBS, 2000 Coupled leading- and lagging-strand synthesis of mammalian mitochondrial DNA. Cell 100:515-524.[Medline]
IMPELLIZZERI, K. J., B. ANDERSON, and P. M. BURGERS, 1991 The spectrum of spontaneous mutations in a Saccharomyces cerevisiae uracil-DNA-glycosylase mutant limits the function of this enzyme to cytosine deamination repair. J. Bacteriol. 173:6807-6810.
JERMIIN, L. S., D. GRAUR, R. M. LOWE, and R. H. CROZIER, 1994 Analysis of directional mutation pressure and nucleotide content in mitochondrial cytochrome b genes. J. Mol. Evol. 39:160-173.[Medline]
JERMIIN, L. S., D. GRAUR, and R. H. CROZIER, 1995 Evidence from analyses of intergenic regions for strand-specific directional mutation pressure in metazoan mitochondrial DNA. Mol. Biol. Evol. 12:558-563.
LANAVE, C., G. PREPARATA, C. SACCONE, and G. SERIO, 1984 A new method for calculating evolutionary substitution rates. J. Mol. Evol. 20:86-93.[Medline]
LIMAIEM, J. and A. HENAUT, 1984a Demonstration of a sudden change in the use of codons in the vicinity of transcription termination. Crit. Rev. Acad. Sci. III 299:275-280.
LIMAIEM, J. and A. HENAUT, 1984b Fluctuation of the incidence of the 4 bases along the mitochondrial genome of mammals using correspondence factorial analysis. Crit. Rev. Acad. Sci. III 298:279-286.
LINDAHL, T., 1993 Instability and decay of the primary structure of DNA. Nature 362:709-715.[Medline]
LINNANE, A. W., S. MARZUKI, T. OZAWA, and M. TANAKA, 1989 Mitochondrial DNA mutations as an important contributor to ageing and degenerative diseases. Lancet 1:642-645.[Medline]
LOBRY, J. R., 1996 Asymmetric substitution patterns in the two DNA strands of bacteria. Mol. Biol. Evol. 13:660-665.[Abstract]
PERNA, N. T. and T. D. KOCHER, 1995 Patterns of nucleotide composition at fourfold degenerate sites of animal mitochondrial genomes. J. Mol. Evol. 41:353-358.[Medline]
POLLOCK, D. D. and W. J. BRUNO, 2000 Assessing an unknown evolutionary process: effect of increasing site-specific knowledge through taxon addition. Mol. Biol. Evol. 17:1854-1858.
POLLOCK, D. D., J. A. EISEN, N. A. DOGGETT, and M. P. CUMMINGS, 2000 A case for evolutionary genomics and the comprehensive examination of sequence biodiversity. Mol. Biol. Evol. 17:1776-1788.
PRUITT, K. D., K. S. KATZ, H. SICOTTE, and D. R. MAGLOTT, 2000 Introducing RefSeq and LocusLink: curated human genome resources at the NCBI. Trends Genet. 16:44-47.[Medline]
REYES, A., C. GISSI, G. PESOLE, and C. SACCONE, 1998 Asymmetrical directional mutation pressure in the mitochondrial genome of mammals. Mol. Biol. Evol. 15:957-966.[Abstract]
SANCAR, A. and G. B. SANCAR, 1988 DNA repair enzymes. Annu. Rev. Biochem. 57:29-67.[Medline]
SHADEL, G. S. and D. A. CLAYTON, 1997 Mitochondrial DNA maintenance in vertebrates. Annu. Rev. Biochem. 66:409-435.[Medline]
SUEOKA, N., 1962 On the genetic basis of variation and heterogeneity of DNA base composition. Proc. Natl. Acad. Sci. USA 48:582-592.
SULLIVAN, J., K. E. HOLSINGER, and C. SIMON, 1996 The effect of topology on estimates of among-site rate variation. J. Mol. Evol. 42:308-312.[Medline]
SWOFFORD, D. L., 2000 Phylogenetic Analysis Using Parsimony (*and Other Methods). Sinauer Associates, Sunderland, MA.
TANAKA, M. and T. OZAWA, 1994 Strand asymmetry in human mitochondrial DNA mutations. Genomics 22:327-335.[Medline]
THOMPSON, J. D., D. G. HIGGINS, and T. J. GIBSON, 1994 CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.
WALLACE, D. C., 1999 Mitochondrial diseases in man and mouse. Science 283:1482-1488.
WOLSTENHOLME, D. R., 1992 Animal mitochondrial DNA: structure and evolution. Int. Rev. Cytol. 141:173-216.[Medline]
YANG, Z., 1997 PAML: a program package for phylogenetic analysis by maximum likelihood. Comput. Appl. Biosci. 13:555-556.
YANG, Z., N. GOLDMAN, and A. FRIDAY, 1994 Comparison of models for nucleotide substitution used in maximum-likelihood phylogenetic estimation. Mol. Biol. Evol. 11:316-324.[Abstract]
This article has been cited by other articles:
![]() |
P. Polak and P. F. Arndt Long-Range Bidirectional Strand Asymmetries Originate at CpG Islands in the Human Genome Gen Biol Evol, March 1, 2010; 2009(0): 189 - 197. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Breton, H. D. Beaupre, D. T. Stewart, H. Piontkivska, M. Karmakar, A. E. Bogan, P. U. Blier, and W. R. Hoeh Comparative Mitochondrial Genomics of Freshwater Mussels (Bivalvia: Unionoida) With Doubly Uniparental Inheritance of mtDNA: Gender-Specific Open Reading Frames and Putative Origins of Replication Genetics, December 1, 2009; 183(4): 1575 - 1589. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. F. Noor Mutagenesis from Meiotic Recombination Is Not a Primary Driver of Sequence Divergence between Saccharomyces Species Mol. Biol. Evol., November 1, 2008; 25(11): 2439 - 2444. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M. Fonseca, D. Posada, and D. J. Harris Inverted Replication of Vertebrate Mitochondria Mol. Biol. Evol., May 1, 2008; 25(5): 805 - 808. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Jia and P. G. Higgs Codon Usage in Mitochondrial Genomes: Distinguishing Context-Dependent Mutation from Translational Selection Mol. Biol. Evol., February 1, 2008; 25(2): 339 - 351. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. J. Min and D. A. Hickey DNA Asymmetric Strand Bias Affects the Amino Acid Composition of Mitochondrial Proteins DNA Res, November 1, 2007; (2007) dsm019v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. E. Broughton and P. C. Reneau Spatial Covariation of Mutation and Nonsynonymous Substitution Rates in Vertebrate Mitochondrial Genomes Mol. Biol. Evol., August 1, 2006; 23(8): 1516 - 1524. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. L. Mueller Evolutionary Rates, Divergence Dates, and the Performance of Mitochondrial Genes in Bayesian Phylogenetic Analysis Syst Biol, April 1, 2006; 55(2): 289 - 300. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Z. Raina, J. J. Faith, T. R. Disotell, H. Seligmann, C.-B. Stewart, and D. D. Pollock Evolution of base-substitution gradients in primate mitochondrial genomes Genome Res., May 1, 2005; 15(5): 665 - 673. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Gibson, V. Gowri-Shankar, P. G. Higgs, and M. Rattray A Comprehensive Analysis of Mammalian Mitochondrial Genome Base Composition and Improved Phylogenetic Methods Mol. Biol. Evol., February 1, 2005; 22(2): 251 - 264. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. M. Krishnan, H. Seligmann, C.-B. Stewart, A. P. J. de Koning, and D. D. Pollock Ancestral Sequence Reconstruction in Primate Mitochondrial DNA: Compositional Bias and Effect on Functional Inference Mol. Biol. Evol., October 1, 2004; 21(10): 1871 - 1883. [Abstract] [Full Text] [PDF] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Faith, J. J.
- Articles by Pollock, D. D.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Faith, J. J.
- Articles by Pollock, D. D.
















