- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Supplemental Material
-
All Versions of this Article:
genetics.103.025098v1
169/2/1009 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Bishop, J. G.
- Articles by Rose, J. K. C.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Bishop, J. G.
- Articles by Rose, J. K. C.
Originally published as Genetics Published Articles Ahead of Print on November 15, 2004.
Genetics, Vol. 169, 1009-1019, February 2005, Copyright © 2005
doi:10.1534/genetics.103.025098
Selection on Glycine ß-1,3-Endoglucanase Genes Differentially Inhibited by a Phytophthora Glucanase Inhibitor Protein
J. G. Bishop*,1,
D. R. Ripoll
,
S. Bashir
,2,
C. M. B. Damasceno
,
J. D. Seeds* and
J. K. C. Rose
* School of Biological Sciences, Washington State University, Vancouver, Washington 98686-9600
Computational Biology Service Unit, Cornell Theory Center
Department of Plant Biology, Cornell University, Ithaca, New York 14853
1 Corresponding author: School of Biological Sciences, Washington State University, 14204 NE Salmon Creek Ave., Vancouver, WA 98686.
E-mail: bishop{at}vancouver.wsu.edu
Plant endo-ß-1,3-glucanases (EGases) degrade the cell wall polysaccharides of attacking pathogens and release elicitors of additional plant defenses. Isozymes EGaseA and EGaseB of soybean differ in susceptibility to a glucanase inhibitor protein (GIP1) produced by Phytophthora sojae, a major soybean pathogen. EGaseA, the major elicitor-releasing isozyme, is a high-affinity ligand for GIP1, which completely inhibits it, whereas EGaseB is unaffected by GIP1. We tested for departures from neutral evolution on the basis of partial sequences of EGaseA and EGaseB from 20 widespread accessions of Glycine soja (the wild progenitor of soybean), from 4 other Glycine species, and across dicotyledonous plants. G. soja exhibited little intraspecific variation at either locus. Phylogeny-based codon evolution models detected strong evidence of positive selection on Glycine EGaseA and weaker evidence for selection on dicot EGases and Glycine EGaseB. Positively selected peptide sites were identified and located on a structural model of EGase bound to GIP1. Positively selected sites and highly variable sites were found disproportionately within 4.5 Å of bound GIP1. Low variation within G. soja EGases, coupled with positive selection in both Glycine and dicot lineages and the proximity of rapidly evolving sites to GIP1, suggests an arms race involving repeated adaptation to pathogen attack and inhibition.
AMONG the complex interspecific relationships arising through coevolution, the enmeshed attack and defense systems of plants and their pathogens are among the most intricate (SOMSSICH and HAHLBROCK 1998; DANGL and JONES 2001). One promising approach to understanding their coevolution is to elucidate the long-term pattern of selection acting on individual biochemical components that govern plant-pathogen interactions. Analysis of molecular variation at plant R-genes that function in recognition of invading pathogens has revealed a variety of evolutionary patterns, including strong balancing selection in some cases and evidence of selective sweeps, indicative of an arms race, in others (BERGELSON et al. 2001; MONDRAGóN-PALAMINO et al. 2002; DE MEAUX and MITCHELL-OLDS 2003). Similar analyses of genes involved in defense deployment (rather than recognition) reveal recent or repeated selective sweeps (DE MEAUX and MITCHELL-OLDS 2003). However, in only one case, that of the pathogen-produced cell wall degrading enzyme polygalacturonase (PG) and its plant-produced inhibitor protein (PGIP), has selection on a pair of antagonistically interacting components been examined (STOTZ et al. 2000; GöTESSON et al. 2002). Here we analyze genetic variation at two plant endoglucanase loci, cell wall degrading enzymes whose role in defense and interaction with a pathogen-produced inhibitor protein have been carefully documented (HAM et al. 1997; ROSE et al. 2002; YORK et al. 2004).
Plant cells are surrounded by rigid walls composed of complex polysaccharides and diverse proteins (CARPITA and GIBEAUT 1993; REITER 2002; O'NEILL and YORK 2003). In addition to providing structural support, cell walls constitute an important line of defense against pathogens. To penetrate and nutritionally utilize plant cell walls, pathogens secrete a remarkable array of polysaccharide degrading enzymes, including exo- and endopolygalacturonases, cellulases, pectinases, rhamnogalacturonase, and xylanases (WALTON 1994; DE VRIES and VISSER 2001; LEV and HORWITZ 2003). Some of these exist in large multigene families that exhibit diverse patterns of expression, suggesting functional specialization (GöTESSON et al. 2002). As a countermeasure, plants deploy cell wall-associated inhibitor proteins of these degradative glycanhydrolases (STAHL and BISHOP 2000; DE VRIES and VISSER 2001; QIN et al. 2003). For example, inhibitors of microbial PGs not only directly retard pathogen penetration, but also inhibit pathogen degradation of the cell wall-derived oligogalacturonides that elicit induced defenses (CôTé et al. 1998; ESQUERRE-TUGAYE et al. 2000; RIDLEY et al. 2001).
Plants further guard against pathogens through glycanhydrolytic attack on the cell walls of invading pathogenic bacteria, fungi, and oomycetes. The best-known defensive glycanhydrolases are chitinases and endo-ß-1,3-glucanases (EGases), many of which are expressed in response to pathogen attack and can confer resistance against specific pathogens(BROGLIE et al. 1991; GRISON et al. 1996; JIN et al. 1999; LEUBNER-METZGER and MEINS 1999). Considerable evidence suggests that these enzymes protect plants through two distinct mechanisms. First, they may directly impair microbial growth and proliferation by hydrolyzing the chitin and ß-1,3/1,6-glucan components of the pathogen cell wall, rendering cells susceptible to lysis and additional defense responses. Second, cell wall fragments released by chitinolytic and glucanolytic activity elicit a wide range of further defense responses (CôTé et al. 1998). In turn, pathogens may resist glycanhydrolytic attack by deploying inhibitors of chitinases and endo-ß-1,3-glucanases, by cell wall modification, or by proteolytic attack (ESQUERRE-TUGAYE et al. 2000; STAHL and BISHOP 2000; ROSE et al. 2002; YORK et al. 2004). The systems of attack and counterattack centered on cell walls of both plants and pathogens seem likely to favor a coevolutionary series of advantageous countermeasures in each interactor, which may include selective sweeps of favorable mutations and the recruitment of new proteins to the interaction.
Isozymes of plant glycanhydrolases vary in their susceptibility to pathogen-produced inhibitors. For example, two class I chitinases isolated from Phaseolus vulgaris differed in their susceptibility to allosamidin, a bacterial inhibitor (LONDERSHAUSEN et al. 1996), despite differing by only four amino acid substitutions (J. BISHOP, unpublished data). Variability in pathogen cell wall architecture and enzymatic inhibitors have been proposed as agents of positive selection detected in the active site of class I chitinases (BISHOP et al. 2000), a circumstance conducive to the strong coevolutionary interactions characteristic of an arms race. One prediction of this hypothesis is that both members of an antagonistically interacting glycanhydrolase-inhibitor system should exhibit the signature of rapid adaptive evolution. This hypothesis has been supported by analysis of endoPGs from fungal and oomycetous pathogens and analysis of corresponding plant polygalacturonase inhibitor proteins (PGIPs) that revealed positive selection driving evolution of both the enzyme and the inhibitor (STOTZ et al. 2000; GöTESSON et al. 2002).
Soybean endo-ß-1,3-glucanases and corresponding glucanase inhibitor proteins (GIPs) secreted by the oomycete root pathogen Phytophthora sojae are an attractive system for studying the coevolution of enzyme-inhibitor systems. P. sojae GIPs inhibit up to 85% of soybean endoglucanase activity and appear to be highly specific for particular EGases (HAM et al. 1997). For example, GIP1 completely inhibits soybean EGaseA, which acts as a high-affinity ligand for GIP1, but it does not bind or inhibit the isozyme EGaseB, tobacco PR-2 (an EGase that is structurally similar to EGaseB), or an endogenous P. sojae EGase (HAM et al. 1997). EGaseA and EGaseB further differ in that EGaseA is constitutively produced, releases elicitors of additional defense reactions when soybean is challenged by P. sojae, and was shown experimentally to increase resistance to P. sojae, whereas EGaseB is induced upon pathogen attack and is not known to increase resistance (YOSHIKAWA et al. 1990; ROSE et al. 2002). Although the corresponding genes remain uncloned, GIP activity has also been reported from the fungal pathogen Colletotrichum lindemuthianum (ALBERSHEIM and VALENT 1974), suggesting that dicot EGases may face a broad range of GIPs. In this study we examined EGaseA and EGaseB for evidence of positive selection at three taxonomic scaleswithin the wild progenitor of soybean, Glycine soja, across the genus Glycine, and across all dicotyledonous plants.
50% amino acid identity. Partial EGaseA and EGaseB sequences were amplified from 20 accessions of G. soja drawn from throughout G. soja's range in Japan, China, S. Korea, and far eastern Russia and from 4 other Glycine species native to Australia, G. canescens, G. falcata, G. latrobeana, and G. tabacina. Seeds for all accessions were obtained from the USDA Soybean Germplasm Collection (see supplementary Table 1 at http://www.genetics.org/supplemental/). Genomic DNA was extracted from Glycine spp. leaves using a modified Dellaporta protocol (DELLAPORTA et al. 1985). PCR conditions for all genes were 35 cycles of 1 min at 94°, 1 min at 60°, and 2 min at 72° using AmpliTaq Gold per instructions, with 1 µg total plant DNA as substrate. EGaseA and EGaseB primers (EGaseA forward 5' tccggggtatgttatggaaga 3', reverse 5' ggccatccactctcagacaca 3'; EGaseB forward 5' cggcgtctgttatggaggaaa 3', reverse 5' acaaccttcacatttggtgcc 3') amplified fragments 681 bp (227 codons) and 669 bp (223 codons) in length, respectively. PCR products were gel separated, excised, cleaned in a High Pure PCR product purification spin column (Roche Diagnostics), treated with shrimp alkaline phosphatase and exonuclease I, and sequenced. In most cases this yielded unambiguous sequence of a single product. However, multiple PCR products for EGaseB obtained from G. tabacina, G. falcata, G. canescens, and several G. soja accessions required cloning into pGEMtEZ vectors (Invitrogen, Carlsbad, CA) and several clones were sequenced from each. To guard against misinterpreting base misincorporation by Taq polymerase as polymorphism, each cloned allelic variant was verified by cloning from at least one additional PCR reaction, but no Taq-derived sequence errors were detected.
Sequences were assembled using Seqman 5.03, aligned using Clustal W in Megalign 5.03 (DNASTAR, Madison, WI), and then adjusted by hand. Phylogenies were estimated by maximum likelihood using DNAML (FELSENSTEIN 2001) and by Bayesian inference using MrBayes 3.0 (HUELSENBECK and RONQUIST 2001). For Bayesian estimates the Markov chain Monte Carlo search was run with 4 chains for 600,000 generations, with trees sampled every 100 generations (the first 1000 trees were discarded as "burnin"). The model employed six substitution types ("nst = 6"), with base frequencies set to the empirically observed values ("basefreq = empirical").
Tests for adaptive molecular evolution were performed using phylogeny-based maximum likelihood models of codon evolution implemented in CODEML, part of PAML (YANG et al. 2000). These models allow nonsynonymous/synonymous rate ratios (dN/dS, hereafter referred to as
) to vary among codons. A likelihood ratio test (LRT) is performed to compare the fit of a model that does not allow for positive selection with one that does. CODEML implements models with several different assumptions regarding the distribution of
among sites. Model M1 is a neutral model that assumes all sites either are subject to purifying selection or are neutral (i.e.,
0 = 0 or
1 = 1). Model M2 adds a third category,
2 > 0, estimated from the data. If
2 > 1, then the LRT is also a test of positive selection. However, the M2-M1 comparison lacks power in some cases where a large fraction of sites have 0 <
< 1, in which case
2 is forced to account for these sites rather than for positively selected sites (YANG et al. 2000). Comparing model M3 (three rate categories) to M0 (one rate category) may detect positive selection in such cases because M3 and M0 estimate
-parameters from the data rather than fixing them at 0 or 1 (YANG et al. 2000). As in the M2-M1 comparison, a significant LRT indicates heterogeneity in selection among codons, but only if one
> 0 does this become a test for positive selection. Although simulations indicate that the M3-M0 comparison is generally a conservative test for positive selection, under certain
-distributions it may be more prone to falsely detecting positive selection (ANISIMOVA et al. 2001). We used several different model comparisons to guard against this type of error.
The codon-based analysis used Bayesian consensus trees for all unique alleles of Glycine EGaseA (8 unique sequences and 2 outgroup sequences; Figure 1 and see supplementary Figure 1 at http://www.genetics.org/supplemental/), EGaseB (18 unique sequences; Figure 2), and dicots (21 sequences; Figure 3; Table 1). For EGaseB, the full data set contains a number of G. soja and G. max sequences that differ from each other by a single substitution, resulting in a large number of equally plausible trees. Analyses were repeated on data sets of 14 and 8 sequences (Figure 2 and see supplementary Figure 2 at http://www.genetics.org/supplemental/), obtained after removing 1-bp variants (14-sequence data set) and possible allelic variants (defined here as conspecific sequences having >95% identity) to guard against elevated rates attributable to recombination or poor phylogenetic resolution. Phylogenetic trees were generally well supported (Figures 13), but to guard against results based on inaccurate phylogenetic inferences, models for the dicot and EGaseB data sets were rerun using additional phylogenies. For the dicot data and the 14-sequence EGaseB data set, CODEML models were run for the set of trees composing the 99% Bayesian credibility group and, for the smaller EGaseB data set, models were run for the 11 trees composing the 99% credibility group. Tree files are available in supplementary material online at http://www.genetics.org/supplemental/. Models included correction for codon usage bias and transition/transversion rate bias and were run with two initial parameter values (using initial values
2init = 0.4 and 1.4). Different model assumptions or trees occasionally caused different peptide positions to be classified as positively selected and models employing lower probability trees tended to infer a greater number of positively selected positions. For each data set only positively selected sites identified across all trees are reported.
|
|
|
|
|
Three-dimensional models of Glycine EGaseA and EGaseB were built on the basis of the X-ray structures of barley 1,3-1,4-ß-glucanase (PDB codes 1AQ0 and 1GHS, respectively) as the 3-D structural templates using the MODELLER program (SALI and BLUNDELL 1993; VARGHESE et al. 1994; SALI et al. 1995; MüLLER et al. 1998). Only a few deletions and insertions were needed and residue side-chains were accommodated well, with only minor distortions of the backbone. The active site residues of EGase were identified as Tyr34, Glu238, and Glu295 (VARGHESE et al. 1994). Models for GIP1 were built using a BLAST sequence alignment to chymotrypsin and using PDB accession 1EQ9 as the 3D-template. GIPs are homologous to trypsin-like serine proteases, but the catalytic residues His, Asp, and Ser common to serine protease enzymes have been substituted by Thr (Thr43), Asn (Asn91), and Thr (Thr177) in GIP1. Trypsin-like proteases bind their substrate by means of an Arg/Lys binding pocket containing a buried Asp residue. This binding pocket is preserved in GIP1, with Asp171 being the equivalent to Asp189 in trypsin.
A model of EGaseA docking with GIP1 was generated manually and was done blindly with respect to knowledge of positively selected sites. The model assumes that inhibition of EGaseA by GIP1 involves at least partial occlusion of the catalytic region and that the inhibitor uses a trypsin-like mechanism of recognition; i.e., GIP1 identifies an Arg or Lys residue on the surface of the EGaseA molecule. Because GIP1 inhibits only EGaseA and not EGaseB, we hypothesized that the Arg or Lys residue recognized by GIP1 should be present only on EGaseA but not on EGaseB. Only binding sites that produce a large surface of interaction without major distortions of the enzyme or inhibitor structures were considered. Residues Arg61 and Lys97 met these criteria. Our model assumes recognition based on Lys97 because it produced greater obstruction of the active site. EGase residues were assumed to be in contact with the bound GIP1 if the distance between any heavy atom belonging to the residue and any atom on the inhibitor was within 4.5 Å. We treat this as an a priori hypothesis of the region likely to experience positive selection.
= 0.00052 and
= 0.0012 per site (see supplementary Table 2 at http://www.genetics.org/supplemental/). All were replacement polymorphisms, with two being unique and the other shared by geographically distant accessions from Ehime and Aichi prefectures, Japan. The predominant G. soja haplotype was identical to G. max accession M37753. A single EGaseA sequence was obtained from each of the four other Glycine spp., but the G. canescens EGaseA contained a premature stop codon and was not included in further analyses. Our data appear consistent with the report that EGaseA is a single-copy gene (CHEONG et al. 2000). G. soja EGaseB exhibited seven variable sites, including four nonsynonymous mutations, yielding six haplotypes (see supplementary Table 2 at http://www.genetics.org/supplemental/). The predominant G. soja EGaseB haplotype was identical to G. max accession AF034110. Additional, poorly resolved PCR products were obtained from within G. soja accessions from Hokkaido, Japan and Zhejiang, China that appeared identical to sequences from G. latrobeana and G. tabacina, suggesting that these products likely represent a paralogous (duplicated) locus, as would be expected on the basis of EGaseB's membership in a gene family (JIN et al. 1999). Additional EGaseB paralogs were amplified from G. soja accession 447003 (Nei Mongol, China) as well as from G. latrobeana and G. tabacina (Figure 2 and see supplementary Table 1 at http://www.genetics.org/supplemental/). Because of uncertainty over paralogy and orthology, we do not report diversity statistics for EGaseB. Pseudogenes amplified from G. soja accession 578340A (Khabarovsk, Russia) and from G. falcata were omitted from the analyses.
Tests of selection:
Tests of selection based on intraspecific variation in G. soja were not conducted because of uncertainty over allelic vs. interlocus (paralogous) variation in EGaseB and because polymorphism in EGaseA was inadequate for such tests. Several pairwise sequence comparisons, all involving EGaseA from G. latrobeana, featured
> 1 (
= dN/dS). Although not statistically significant, the result suggests the possibility of strong selection acting on EGaseA. Codon evolution models applied to phylogenies and alignments of Glycine EGaseA, Glycine EGaseB, and a collection of dicot EGaseA and EGaseB sequences (Figures 13), provided evidence of positive selection (
> 1) for all data sets (Table 2 and see supplementary Table 3 at http://www.genetics.org/supplemental/). Significant tests occurred for all Glycine EGaseA data sets and model comparisons, indicating that evidence for positive selection on EGaseA is particularly robust. Model comparisons were less consistently significant for Glycine EGaseB and across dicot EGases. For dicots only the M2 vs. M1 comparison was significant, but this test is fairly conservative. Although only one test was significant in the smaller EGaseB data (see supplementary Table 3 at http://www.genetics.org/supplemental/), the tests are known to lack power in a data set this size (ANISIMOVA et al. 2001, 2002).
|
In each data set, Bayesian analysis of the models predicted positive, diversifying selection acting on 1 to several peptide sites, with a total of 10 positively selected sites across the data sets (Table 2; see Figures 4 and 5a and supplementary Figures 1 and 2 at http://www.genetics.org/supplemental/ for location of these sites). One site, Ser150 (taking the mature peptide of G. max M37753, Glycine2, as a reference), was positively selected in both the EGaseB and the dicot analyses, regardless of whether EGaseB was included in the dicot data set (see position 151 in Figure 4). Unfortunately, our Glycine data set sequences run only from Pro14 to Val235 (EGaseA) and from Gly7 to Val236 (EGaseB) and are therefore missing the 80-residue carboxy-terminal region containing 2 of the catalytic residues. The missing region comprises much of the active site and includes the region where most positively selected sites were found in the dicot data set. Therefore, the results for Glycine likely underestimate the number of selected residues.
|
Output of CODEML includes reconstructed ancestral sequences, based on maximum likelihood assignment of character states to the interior nodes of the phylogeny (YANG et al. 1995). We used these reconstructions for the five positively selected sites in the dicot data set that are situated in close proximity to GIP1 to estimate the number of amino acid substitutions at these sites that involved recurrent or convergent evolution to the same residue. Of an estimated 73 substitutions occurring since the common ancestor of these sequences, 35 (48%) involve convergence to allelic states found at other nodes (see supplementary Figure 5 at http://www.genetics.org/supplemental/).
Structural models:
Our model of GIP1 bound to EGaseA allowed assessment of whether changes in EGase are likely driven by arms race-type interactions with the inhibitor. Although our binding model is based on GIP bound to EGaseA, superimposition of EGaseA and EGaseB models indicates a close correspondence of atomic position in the two models (C-
root mean square deviation = 1.75 Å). Positively selected sites occurred within 4.5 Å of bound GIP nearly four times more often than expected: 15% (46 of 312; Figure 4) of all residues are within 4.5 Å of GIP vs. 60% of positively selected residues (6 of 10;
2 = 13.5, 1 d.f., P = 0.0002). Furthermore, the number of replacement substitutions per site for the dicot data set is significantly greater at sites within 4.5 Å of GIP (328 of 1476 replacements are within 4.5 Å, corresponding to 4.3 replacements per codon near GIP vs. 7.1 replacements per codon away from GIP, t = 3.07, 52 d.f., P = 0.003), whereas silent substitutions are equitably distributed (239 of 1571 silent substitutions are within 4.5 Å of GIP, corresponding to 5.3 substitutions per codon within 4.5 Å vs. 5.5 substitutions per codon outside 4.5 Å, P = 0.64). Despite missing sequence data for 25% of the protein including much of the active site region, 3 of the 4 positively selected residues identified for Glycine data sets are in close proximity to the active site and 2 are within 4.5 Å of GIP (Figures 4 and 5). Although the results provide clear evidence for diversifying selection, the Glycine and dicot data sets often include sequences from only one individual per species and thus cannot distinguish between repeated fixation of advantageous mutations (selective sweep model) and balancing selection combined with rare allele advantage. However, the near absence of intraspecific sequence variation for EGaseA and low levels of variation for EGaseB are inconsistent with a balancing selection model, which predicts elevated polymorphism relative to neutral expectations. Given the broad geographic range of the samples, the near absence of variation in G. soja is surprising, but it would be predicted in the event of a recent sweep to fixation of a favored allele. Given the possibility of widespread pathogen inocula associated with cultivated soybean, a recent species-wide sweep is plausible. Alternatively, neutral demographic patterns, such as a recent range expansion or high unidirectional gene flow between cultivated soybean and G. soja, are also plausible explanations. However, introgression between cultivated and wild soybeans has been measured and is insufficient to produce such swamping (ABE et al. 1999). Intraspecific sequence data for other nuclear protein-coding loci could assist in distinguishing between remaining neutral demographic and selective sweep hypotheses (see, for example, TIFFIN 2004 and TIFFIN and GAUT 2001). No such data are available for the genus Glycine, but several studies document high levels of variation at isozyme and other molecular marker loci in G. soja (TOZUKA et al. 1998; DONG et al. 2001; XU et al. 2001), suggesting that the low levels of variation observed for EGases and particularly for EGaseA may be unusual and indicative of a recent selective sweep.
The actual strength of selection acting on EGases is difficult to judge because relatively few proteins have been examined using codon evolution models. Selection is clearly weaker on EGases than on some class I chitinases and PGIPs, where
> 1 even when averaged over all sites in the protein. The set of genes that has been analyzed using codon evolution models is dominated by those studied because of a priori expectations of strong selection, such as genes involved in defense, in avoiding immune response, or in mating systems. To put our results in context, we surveyed 33 genes for which codon evolution models found evidence of positive selection (at least one parameter
i > 1). On average for these genes
2 = 5.6, and
8% of codons were placed in the positively selected category (BISHOP et al. 2000; STOTZ et al. 2000; YANG et al. 2000; FORD 2001; SWANSON et al. 2001; GöTESSON et al. 2002; TIFFIN 2004). Glycine EGaseA had the third highest
of all genes surveyed, but only a small proportion of sites (1.2%) were placed in this category. Dicot and EGaseB data sets have
in the lower third of the distribution for positively selected genes, but the proportion of sites categorized as positively selected (59%) is near the mean.
Patterns of selection on defense genes likely vary depending on their role in defense and mechanism of action. For example, proteins that act earliest in a sequence of defense responses, such as R-proteins that detect pathogen-associated molecular patterns, may experience more intense or more frequently recurring selection than the array of downstream pathogenesis-related proteins that they induce. In the case of selection on EGase isozymes, EGaseA may experience more intense selection from pathogen countermeasures than EGaseB because of its apparent role in producing elicitors of further defense responses. The values of
obtained here support this hypothesis.
Location of positively selected and highly variable residues:
Bayesian analysis of the codon evolution models identified 10 amino acid sites that had a high probability (P > 0.94) of sustaining repeated advantageous substitutions (Figures 4 and 5a). Several of these sites exhibit remarkable variability. For example, sites 9 and 289, which physically contact each other in the three-dimensional structure, present 15 amino acid combinations among the 21 sequences analyzed in the dicot data set (Figure 5, b and c). These residues form a ledge on the lip of the active site cleft across from catalytic residue Tyr33 and are predicted to interact with GIP1 in our docking model of P. sojae GIP1 and soybean EGaseA. Only two sites away, position 291 is similarly situated with respect to GIP1 but also physically contacts catalytic residue Glu295. Position 291 is occupied by 9 different amino acids among the 21 sequences. Overall, 5 of 10 positively selected sites (positions 9, 54, 183, 289, and 291) are located on the margin of or within the active site cleft (Figure 5, a and b). This pattern is counter to the usual expectation, wherein residues within the active site cleft are highly conserved.Inspection of Figure 5, c and d, indicates that the most rapidly evolving sites appear as a "ring of fire" around the margin of the active site cleft, and statistically they are far more likely to interact with GIP1 than expected under an equitable distribution of highly variable sites throughout the enzyme. This contrasts with the distribution of silent changes, which shows no pattern with respect to bound GIP1. Similarly, positively selected sites were also more likely to contact bound GIP1 than expected by chance. Although most of the positively selected sites from the dicot data set were "missing data" in the Glycine data sets, one site, Ser150, was categorized as positively selected in both the EGaseB and the dicot data sets. Although this residue is external to the binding cleft, one loop of GIP1 is positioned directly over this site. The close proximity of rapidly evolving and positively selected sites to bound GIP strongly suggests an ongoing arms race between plant EGases and their pathogen-produced inhibitors. It will be of interest to examine variation in GIP for reciprocal evolutionary patterns and to test experimentally whether positively selected EGaseA residues modulate the interaction with GIP1 and its effects on glycanhydrolytic activity.
Summary:
Although the molecular genetic mechanisms of plant-pathogen interactions are rapidly becoming understood, elucidation of the coevolutionary processes that give rise to these mechanisms has come more slowly. Retrospective estimates of the strength and form of natural selection acting on a broad sample of the genes involved will characterize the distribution or hierarchy of response types and selection strengths, providing a richly informative context for coevolutionary models. Several competing, but not mutually exclusive, coevolutionary models have already been supported by such retrospective analyses. For example, the discovery that balancing selection maintains resistant and susceptible alleles at RPM1 and other resistance (R) genes in Arabidopsis prompted Stahl et al. to propose a "trench-warfare" model of interaction, in which the frequency of susceptible and resistant alleles cycles according to the population status of the pathogen and the cost of deploying resistant alleles (STAHL et al. 1999; BERGELSON et al. 2001; TIAN et al. 2002; MAURICIO et al. 2003). The trench warfare model has been contrasted with an arms race model, wherein repeated selective sweeps are taken as evidence for ongoing counter adaptation. Other R-genes and a variety of loci involved in defense deployment exhibit this characteristic of an arms race (BISHOP et al. 2000; STOTZ et al. 2000; BERGELSON et al. 2001; TIFFIN and GAUT 2001; MONDRAGóN-PALAMINO et al. 2002; DE MEAUX and MITCHELL-OLDS 2003; TIFFIN 2004). Antagonistic coevolution of enzyme-inhibitor systems modulating plant-pathogen interactions may be particularly prone to arms race dynamics, although such races need not exclude reversion to previously advantageous allelic states. Indeed, likelihood reconstruction of ancestral sequences for the dicot phylogeny indicates that of 73 substitutions estimated to occur at the five positively selected sites contacting GIP, a remarkable 48% involve convergent evolution to the same residues (see supplementary Figure 5 at http://www.genetics.org/supplemental/). This is consistent with the idea that the number of possible adaptive substitutions is rather limited in enzyme-inhibitor systems, owing to the need to preserve enzymatic function and specificity. It may also indicate that distantly related dicot EGases are frequently evolving in response to highly similar antagonists.
2 Present address: Department of Chemistry, Texas A&M University, 700 University Blvd., Kingsville, TX 78363. ![]()
ABE, J., A. HASEGAWA, H. FUKUSHI, T. M IKAMI, M. OHARA et al., 1999 Introgression between wild and cultivated soybeans of Japan revealed by RFLP analysis for chloroplast DNAs. Econ. Bot. 53: 285291.
ALBERSHEIM, P., and B. S. VALENT, 1974 Host-pathogen interactions. VII. Plant pathogens secrete proteins which inhibit enzymes of the host capable of attacking the pathogen. Plant Physiol. 53: 684687.
ANISIMOVA, M., J. P. BIELAWSKI and Z. YANG, 2001 Accuracy and power of the likelihood ratio test in detecting adaptive molecular evolution. Mol. Biol. Evol. 18: 15851592.
ANISIMOVA, M., J. P. BIELAWSKI and Z. YANG, 2002 Accuracy and power of Bayes prediction of amino acid sites under positive selection. Mol. Biol. Evol. 19: 950958.
BERGELSON, J., M. KREITMAN, E. A. STAHL and D. TIAN, 2001 Evolutionary dynamics of plant R-genes. Science 292: 22812284.
BISHOP, J. G., A. M. DEAN and T. MITCHELL-OLDS, 2000 Rapid evolution in plant chitinases: molecular targets of selection in plant-pathogen coevolution. Proc. Natl. Acad. Sci. USA 97: 53225327.
BROGLIE, K., I. CHET, M. HOLLIDAY, R. CRESSMAN, P. BIDDLE et al., 1991 Transgenic plants with enhanced resistance to the fungal pathogen Rhizoctonia solani. Science 254: 11941197.
CARPITA, N. C., and D. M. GIBEAUT, 1993 Structural models of primary cell walls in flowering plants: consistency of molecular structure with the physical properties of the walls during growth. Plant J. 3: 130.[CrossRef][Medline]
CHEONG, Y. H., C. Y. KIM, H. J. CHUN, B. C. MOON, H. C. PARK et al., 2000 Molecular cloning of a soybean class III ß-1,3-glucanase gene that is regulated both developmentally and in response to pathogen infection. Plant Sci. 154: 7181.[Medline]
CôTé, F., K.-S. HAM, M. G. HAHN and C. W. BERGMANN, 1998 Oligosaccharide elicitors in host-pathogen interactions: generation, perception, and signal transduction. Subcell. Biochem. Plant-Microbe Interact. 29: 385432.
DANGL, J. L., and J. D. G. JONES, 2001 Plant pathogens and integrated defence responses to infection. Nature 411: 826833.[CrossRef][Medline]
DE MEAUX, J., and T. MITCHELL-OLDS, 2003 Evolution of plant resistance at the molecular level: ecological context of species interactions. Heredity 91: 345352.[CrossRef][Medline]
DE VRIES, R. P., and J. VISSER, 2001 Aspergillus enzymes involved in degradation of plant cell wall polysaccharides. Microbiol. Mol. Biol. Rev. 65: 497522.
DELLAPORTA, S. L., J. WOOD and J. B. HICKS, 1985 Maize DNA miniprep, pp. 3637 in Molecular Biology of Plants: A Laboratory Course Manual, edited by R. MALMBERG, J. MESSING and I. SUSSEX. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
DONG, Y. S., B. C. ZHUANG, L. M. ZHAO, H. SUN and M. Y. HE, 2001 The genetic diversity of annual wild soybeans grown in China. Theor. Appl. Genet. 103: 98103.[CrossRef]
ESQUERRE-TUGAYE, M. T., G. BOUDART and B. DUMAS, 2000 Cell wall degrading enzymes, inhibitory proteins, and oligosaccharides participate in the molecular dialogue between plants and pathogens. Plant Physiol. Biochem. 38: 157163.[CrossRef]
FELSENSTEIN, J., 2001 PHYLIP (Phylogeny Inference Package). University of Washington, Seattle.
FORD, M. J., 2001 Molecular evolution of transferrin: evidence for positive selection in salmonids. Mol. Biol. Evol. 18: 639647.
GöTESSON, A., J. S. MARSHALL, D. A. JONES and A. R. HARDHAM, 2002 Characterization and evolutionary analysis of a large polygalacturonase gene family in the oomycete pathogen Phytophthora cinnamomi. Mol. Plant-Microbe Interact. 15: 907921.[Medline]
GRISON, R., B. GREZES-BESSET, M. SCHNEIDER, N. LUCANTE, L. OLSEN et al., 1996 Field tolerance to fungal pathogens of Brassica napus constitutively expressing a chimeric chitinase gene. Nat. Biotechnol. 14: 643656.[CrossRef][Medline]
HAM, K., S. WU, A. G. DARVILL and P. ALBERSHEIM, 1997 Fungal pathogens secrete an inhibitor protein that distinguishes isoforms of plant pathogenesis-related endo-beta-1,3-glucanases. Plant J. 11: 169179.[CrossRef]
HUELSENBECK, J. P., and F. RONQUIST, 2001 MRBAYES: Bayesian inference of phylogeny. Bioinformatics 17: 754755.
JIN, W., H. T. HORNER, R. G. PALMER and R. C. SHOEMAKER, 1999 Analysis and mapping of gene families encoding beta-1,3-glucanases of soybean. Genetics 153: 445452.
LEUBNER-METZGER, G., and F. MEINS, JR., 1999 Functions and regulation of plant ß-1,3-glucanases (PR-2), pp. 4976 in Pathogenesis Related Proteins in Plants, edited by S. K. DATTA and S. MUTHUKRISHNAN. CRC Press LLC, Boca Raton, FL.
LEV, S., and B. A. HORWITZ, 2003 A mitogen-activated protein kinase pathway modulates the expression of two cellulase genes in Cochliobolus heterostrophus during plant infection. Plant Cell 15: 835844.
LONDERSHAUSEN, M., A. TURBERG, B. BIESELER, M. LENNARTZ and M. G. PETER, 1996 Characterization and inhibitor studies of chitinases from a parasitic blowfly (Lucilia cuprina), a tick (Boophilus microplus), an intestinal nematode (Haemonchus contortus) and a bean (Phaseolus vulgaris). Pestic. Sci. 48: 305314.[CrossRef]
MAURICIO, R., E. A. STAHL, T. KORVES, D. TIAN, M. KREITMAN et al., 2003 Natural selection for polymorphism in the disease resistance gene Rps2 of Arabidopsis thaliana. Genetics 163: 735746.
MONDRAGóN-PALAMINO, M., B. MEYERS, R. W. MICHELMORE and B. S. GAUT, 2002 Patterns of positive selection in the complete NBS-LRR gene family in Arabidopsis thaliana. Genet. Res. 12: 13051315.
MüLLER, J. J., K. K. THOMSEN and U. HEINEMANN, 1998 Crystal structure of barley 1,31,4-ß-glucanase at 2.0 Å resolution and comparison with Bacillus1,31,4-ß-glucanases. J. Biol. Chem. 273: 34383446.
O'NEILL, M. A., and W. S. YORK, 2003 The composition and structure of plant primary walls, pp. 154 in The Plant Cell Wall, edited by J. ROSE. Blackwell Publishing, Oxford.
QIN, Q., C. W. BERGMANN, J. K. C. ROSE, M. SALADIE, V. S. KUMAR KOLLI et al., 2003 Characterization of a tomato protein that inhibits a xyloglucan-specific endoglucanase. Plant J. 34: 327338.[CrossRef][Medline]
REITER, W.-D., 2002 Biosynthesis and properties of the plant cell wall. Curr. Opin. Plant Biol. 5: 536542.[CrossRef][Medline]
RIDLEY, B. L., M. A. O'NEILL and D. MOHNEN, 2001 Pectins: structure, biosynthesis, and oligogalacturonide-related signaling. Phytochemistry 57: 919967.
ROSE, J. K. C., K.-S. HAM, A. G. DARVILL and P. ALBERSHEIM, 2002 Molecular cloning and characterization of glucanase inhibitor proteins: coevolution of a counterdefense mechanism by plant pathogens. Plant Cell 14: 13291345.
SALI, A., and T. L. BLUNDELL, 1993 Comparative protein modelling by satisfaction of spatial constraints. J. Mol. Biochem. 234: 779815.
SALI, A., L. POTTERTON, F. YUAN, H. VAN VLIJMEN and M. KARPLUS, 1995 Evaluation of comparative protein modeling by MODELLER. Proteins 23: 318326.[CrossRef][Medline]
SOMSSICH, I. E., and K. HAHLBROCK, 1998 Pathogen defense in plantsa paradigm of biological complexity. Trends Plant Sci. 3: 8690.[CrossRef]
STAHL, E. A., and J. G. BISHOP, 2000 Plant-pathogen arms races at the molecular level. Curr. Opin. Plant Biol. 3: 299304.[CrossRef][Medline]
STAHL, E. A., G. DWYER, R. MAURICIO, M. KREITMAN and J. BERGELSON, 1999 Dynamics of disease resistance polymorphism at the RPM1 locus of Arabidopsis. Nature 400: 667671.[CrossRef][Medline]
STOTZ, H. U., J. G. BISHOP, C. W. BERGMANN, M. KOCH, P. ALBERSHEIM et al., 2000 Identification of target amino acids that affect interactions of fungal polygalacturonases and their plant inhibitors. Mol. Physiol. Plant Pathol. 56: 117130.[CrossRef]
SWANSON, W. J., Z. YANG, M. F. WOLFNER and C. F. AQUADRO, 2001 Positive Darwinian selection drives the evolution of several female reproductive proteins in mammals. Proc. Natl. Acad. Sci. USA 98: 25092514.
TAKEUCHI, Y., M. YOSHIKAWA, G. TAKEBA, K. TANAKA, D. SHIBATA et al., 1990 Molecular cloning and ethylene induction of mRNA encoding a phytoalexin elicitor-releasing factor, ß-1,3-endoglucanase. Plant Physiol. 93: 673682.
TIAN, D., H. ARAKI, E. A. STAHL, J. BERGELSON and M. KREITMAN, 2002 Signature of balancing selection in Arabidopsis. Proc. Natl. Acad. Sci. USA 99: 1152511530.
TIFFIN, P., 2004 Comparative evolutionary histories of chitinase genes in the genus Zea and family Poaceae. Genetics 167: 13311340.
TIFFIN, P., and B. S. GAUT, 2001 Molecular evolution of the wound-induced serine protease inhibitor wip1 in Zea and related genera. Mol. Biol. Evol. 18: 20922101.
TOZUKA, A., H. FUKUSHI, T. HIRATA, M. OHARA, A. KANAZAWA et al., 1998 Composite and clinal distribution of Glycine soja in Japan revealed by RFLP analysis of mitochondrial DNA. Theor. Appl. Genet. 96: 170176.[CrossRef]
VARGHESE, J. N., T. P. J. GARRETT, P. M. COLMAN, L. CHEN, P. B. HOJ et al., 1994 Three-dimensional structures of two plant beta-glucan endohydrolases with distinct substrate specificities. Proc. Natl. Acad. Sci. USA 91: 27852789.
WALTON, J. D., 1994 Deconstructing the cell wall. Plant Physiol. 104: 11131118.[Medline]
XU, D. H., J. ABE, A. KANAZAWA, J. Y. GAI and Y. SHIMAMOTO, 2001 Identification of sequence variations by PCR-RFLP and its application to the evaluation of cpDNA diversity in wild and cultivated soybeans. Theor. Appl. Genet. 102: 683688.[CrossRef]
YANG, Z., S. KUMAR and M. NEI, 1995 A new method of inferene of ancestral nucleotide and amino acid sequences. Genetics 141: 16411650.[Abstract]
YANG, Z., R. NIELSEN, N. GOLDMAN and A.-M. KRABBE PEDERSEN, 2000 Codon-substitution models for heterogenous selection pressure at amino acid sites. Genetics 155: 431449.
YORK, W. S., Q. QIN and J. K. C. ROSE, 2004 Proteinaceous inhibitors of endo-ß-glucanases. Biochim. Biophys. Acta 1696: 223233.[Medline]
YOSHIKAWA, M., Y. TAKEUCHI and O. HORINO, 1990 A mechanism for ethylene-induced disease resistance in soybean: enhanced synthesis of an elicitor-releasing factor, ß-1,3-endoglucanase. Physiol. Mol. Plant Pathol. 37: 367376.[CrossRef]
This article has been cited by other articles:
![]() |
M. Shabab, T. Shindo, C. Gu, F. Kaschani, T. Pansuriya, R. Chintha, A. Harzen, T. Colby, S. Kamoun, and R. A.L. van der Hoorn Fungal Effector Protein AVR2 Targets Diversifying Defense-Related Cys Proteases of Tomato PLANT CELL, April 1, 2008; 20(4): 1169 - 1183. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Moeller and P. Tiffin Genetic Diversity and the Evolutionary History of Plant Immunity Genes in Two Species of Zea Mol. Biol. Evol., December 1, 2005; 22(12): 2480 - 2490. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. G. Bishop Directed Mutagenesis Confirms the Functional Importance of Positively Selected Sites in Polygalacturonase Inhibitor Protein Mol. Biol. Evol., July 1, 2005; 22(7): 1531 - 1534. [Abstract] [Full Text] [PDF] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Supplemental Material
-
All Versions of this Article:
genetics.103.025098v1
169/2/1009 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Bishop, J. G.
- Articles by Rose, J. K. C.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Bishop, J. G.
- Articles by Rose, J. K. C.









