- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Clauss, M. J.
- Articles by Mitchell-Olds, T.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Clauss, M. J.
- Articles by Mitchell-Olds, T.
Functional Divergence in Tandemly Duplicated Arabidopsis thaliana Trypsin Inhibitor Genes
M. J. Claussa and T. Mitchell-Oldsaa Department of Genetics and Evolution, Max Planck Institute of Chemical Ecology, 07745 Jena, Germany
Corresponding author: M. J. Clauss, Max Planck Institute of Chemical Ecology, Beutenberg Campus, Hans Knöll Str. 8, 07745 Jena, Germany., clauss{at}ice.mpg.de (E-mail)
Communicating editor: J. BERGELSON
| ABSTRACT |
|---|
In multigene families, variation among loci and alleles can contribute to trait evolution. We explored patterns of functional and genetic variation in six duplicated Arabidopsis thaliana trypsin inhibitor (ATTI) loci. We demonstrate significant variation in constitutive and herbivore-induced transcription among ATTI loci that show, on average, 65% sequence divergence. Significant variation in ATTI expression was also found between two molecularly defined haplotype classes. Population genetic analyses for 17 accessions of A. thaliana showed that six ATTI loci arranged in tandem within 10 kb varied 10-fold in nucleotide diversity, from 0.0009 to 0.0110, and identified a minimum of six recombination events throughout the tandem array. We observed a significant peak in nucleotide and indel polymorphism spanning ATTI loci in the interior of the array, due primarily to divergence between the two haplotype classes. Significant deviation from the neutral equilibrium model for individual genes was interpreted within the context of intergene linkage disequilibrium and correlated patterns of functional differentiation. In contrast to the outcrosser Arabidopsis lyrata for which recombination is observed even within ATTI loci, our data suggest that response to selection was slowed in the inbreeding, annual A. thaliana because of interference among functionally divergent ATTI loci.
NATURAL selection and neutral evolutionary processes can shape functionally important genetic variation at individual genes. In multigene families, we are further challenged to distinguish the contribution of variation among loci and among alleles to functional diversification (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Evolutionary inferences based on patterns in molecular sequence data have been the topic of a large number of recent studies (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Proteinase inhibitors (PIs) are widespread and highly diverse in the plant kingdom. The physiological function of plant proteinase inhibitors includes protection against the proteolytic enzymes of herbivores and pathogens, as well as the regulation of endogenous storage proteinases during seed dormancy and reserve protein mobilization (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Functional investigation of Sinapis trypsin inhibitors has demonstrated expression in immature seeds and in wounded leaves (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
The loci encoding A. thaliana trypsin inhibitors (ATTIs) are members of a multigene family. Six ATTI loci are arranged in tandem within 10 kb on chromosome II and appear to have undergone duplication subsequent to a genome duplication event 2440 million years ago (![]()
| MATERIALS AND METHODS |
|---|
A. thaliana trypsin inhibitors:
Six ATTI loci have been identified in the Columbia accession of A. thaliana on the basis of sequence homology to mustard trypsin inhibitor (MTI2; ![]()
![]()
![]()
|
The ATTI1 locus encodes an 89-amino-acid precursor with 27 amino acids in the amino terminus not represented in the mature protein (![]()
![]()
Herbivory experiment and expression analysis:
Seeds of seven A. thaliana accessions chosen randomly from two sequence-defined haplotype classes (A, Gö-0, Ler-0, Nd-1, and Rsch-0; B, Cvi-1, Fe-1a, and Wei-0; see below for haplotype classification) were sown onto a sterilized potting soil/vermiculite mix (3:1), vernalized at 4° for 1 week, placed in a randomized design into a short-day growth chamber, and reduced to a single individual per pot after germination. Four and 6 weeks after germination, six randomly selected plants per accession were placed into a separate growth chamber for 9 hr; three plants were subjected to the herbivory treatment and three individuals were control plants. The herbivory treatment consisted of one third instar Plutella xylostella (Leptidoptera) larva per plant. The larvae originate from line G88 (Cornell University) and were reared on an artificial diet (![]()
![]()
For each of 12 plants per accession (two ages, two herbivore treatments, and three replicates), RNA was extracted from
100 mg of leaf tissue using a standard protocol employing cell lysis with TRIZOL reagent (GIBCO BRL, Gaithersburg, MD) and RNA purification with phenol-chloroform and ethanol precipitation. Approximately 1 µg of total RNA was used for cDNA synthesis, as described by ![]()
5 ng/µl (RAN; Ras-related nuclear small GTP-binding protein; At5g55190). RAN primers were 5' ACCAGCAAACCGTGGATTACC and 3' CCACAAAGTGAAGATTAGCGTCC (57°; see RT-PCR conditions below). To assess trypsin inhibitor expression, one master mix was made for RT-PCR conducted with primers for all six ATTI loci and RAN. A standard PCR protocol was used with the following cycling scheme: 2 min at 94°; 30 cycles of 30 sec at 94°, 20 sec at primer-specific annealing temperature, and 40 sec at 72°; and a final extension at 72° for 10 min. Locus-specific primers were designed to span the intron, such that PCR products derived from the cDNA could be differentiated from genomic contaminants (primer sequences available upon request). The PCR products were visualized on a 2% agarose gel in a standardized manner. Band intensity provides a semiquantitative measure of the transcription level (ImageQuant 5.1; Molecular Dynamics, Sunnyvale, CA). Prior to analysis of transcription level, each RT-PCR band was standardized to the intensity of the 400-bp band of 2 µl of the Low Mass Ladder (Invitrogen, San Diego) loaded in every ninth well to control for gel effects. For the ATTI RT-PCR products, we standardized bands by the intensity of the RAN band for the given individual and by the PCR efficiency of the primers for that ATTI locus amplifying genomic DNA.
Statistical analysis of transcription:
An analysis of variance was performed on the log-transformed standardized transcript level and, because transcription was sometimes zero, 100 was added to each standardized data point prior to log transformation (PROC GLM; version 8, SAS Institute). Haplotype class, herbivory, locus, age, and accession nested within haplotype class were fixed effects (e.g., ![]()
ATTI tandem arrayamplification and sequencing:
A single individual was sampled from 17 accessions of A. thaliana (L.) Heynh. (Brassicaceae) originating from Eurasia, North Africa, and North America: Col-0, USA; Cvi-1, Cape Verde Islands; Di-0, France; Fe-1a, Germany; Gö-0, Germany; Ita-0, Morocco; Kas-1, India; Le-0, The Netherlands; Ler-0, Germany; Nd-1, Germany; Nok-0, The Netherlands; Rsch-0, Russia; Sah-0, Spain; Ta-0, Czech Republic; Wei-0, Switzerland; Wil-2, Lithuania; and Ws-0, Belarus. All A. thaliana seeds were obtained from the Nottingham Arabidopsis Stock Center. After harvest, young leaf material from each accession was placed in liquid nitrogen and DNA extraction from 0.1 g of tissue followed the protocol for the Nucleon PhytoPure plant DNA extraction kit (Amersham, Arlington, Heights, IL). Primers for PCR amplification were placed in the exons of genes flanking the ATTI array in Columbia (AC002335; arrayF, 5' GGACGGGTCGTTTCAGCTG, and arrayR, 5' GACGTGAGCTTAGAGTTCATAC; 58°). PCR of up to 10 kb of chromosome II was conducted using ELONGase Enzyme mix (GIBCO BRL) with hotstart and an 8-min extension time. PCR products were gel purified and cloned using pCR-Blunt II TOPO vector (Invitrogen). Three clones per accession were sequenced on a 3700 ABI capillary sequencer using primers spaced approximately every 400 bp and designed from the Columbia accession (available upon request). For Ita-0, the ATTI region was amplified in two overlapping segments due to the presence of a large insertion. The ATTI array was amplified from one individual of A. lyrata ssp. petraea from Plech in Bavaria, Germany (49° 54' 99'' N; 11° 30' 64'' E; same as KOCH, Pfaffenhofen b. Neuhaus, Bavaria, Germany, leg. KOCH; ![]()
![]()
Sequence analysis:
Sequences were assembled with SeqMan 5.0 (DNASTAR) and all variable sites were checked manually during the construction of a consensus sequence from three clones for each A. thaliana accession and one A. lyrata individual. Only one allele per accession was included in the population genetic analysis of A. thaliana because individuals are derived from multiple generations of inbreeding in the laboratory. For the outgroup A. lyrata, we sequenced one of two alleles. All sequences were aligned with MegAlign 5.03 set to default gap penalty parameters (DNASTAR). Pairwise BLAST searches confirmed insertion/deletion (indel) breakpoints in regions of high polymorphism and repetitive nucleotide sequence. Coding and noncoding regions were inferred according to homology with MTI2, GenBank annotation, and sequenced cDNA clones in A. thaliana (M. J. CLAUSS, unpublished data). The DnaSP program version 3.84 (![]()
, was estimated according to ![]()
according to ![]()
![]()
![]()
![]()
![]()
Sequences were tested for departure from equilibrium-neutral expectations using statistics from ![]()
![]()
![]()
![]()
per gene (estimated as the average number of nucleotide differences), (2) the sample size, (3) the estimated recombination parameter per gene (R), and (4) the estimated value of Fay and Wu's H. Homogeneity of diversity and divergence in silent sites along the tandem array was tested using a multilocus Hudson, Kreitman, and Aguadé (HKA) test for eight nonoverlapping, artificially defined loci each encompassing
500 nongapped positions (multilocus HKA; distributed by Jody Hey through http://www.lifesci.rutgers.edu/heylab/).
| RESULTS |
|---|
Heterogeneous ATTI expression:
Semiquantitative RT-PCR estimating mRNA levels of ATTI loci in rosette leaves for seven accessions of A. thaliana demonstrated that five of the six ATTI loci were transcribed. Although positive controls for three primer pairs with genomic DNA resulted in ATTI5 amplification, we never observed ATTI5 bands in RT-PCR analyses for any accession or tissue type (including basal leaves, flowers, fruits, and roots; results not shown). Consequently, the ATTI5 locus was excluded from subsequent analyses. The ATTI5 locus was identified here for the first time on the basis of sequence similarity and position within the tandem array from the accessions Cvi-1, Fe-1a, Nd-1, and Wei-0 (see below). The ATTI4 locus, presently not annotated in the Col-0 sequence as a member of the trypsin inhibitor gene family, was transcribed in all accessions tested.
There was differential regulation among ATTI loci, among haplotype classes, and in response to herbivory, as seen by the significant main effects in the ANOVA of log-transformed transcript levels (Table 1). Gene regulation differed among loci with ATTI3 and ATTI6 having the highest transcript levels (Table 1; Fig 2A). The haplotype classes differed significantly in transcription; overall, haplotype A (HA) had higher transcript levels (Table 1; Fig 2B). Only ATTI6 showed the opposite response, with haplotype B (HB) having more transcripts (Fig 2B). Accessions were assigned to haplotype class on the basis of the presence (HB) or absence (HA) of the ATTI5 region, as well as >150 nucleotide polymorphisms in linkage disequilibrium throughout the array (see below). The effect of accession nested within haplotype class explained no additional variation in transcript levels (Table 1). The herbivory treatment accounted for the largest source of variation in the model, with herbivore-induced plants having, on average, 4.5 times greater transcript levels than uninduced control plants (Table 1; Fig 2C). Loci were induced differently by herbivory, such that transcription plasticity was greater than average for ATTI3 and lower than average for ATTI2 (Table 1, linear contrasts; Fig 2C). Plant age did not account for significant variation in expression. However, the two-way interaction of locus and age was significant, indicating the potential for age-specific regulation among loci (Table 1; Fig 2A). The remaining two-way as well as all three- and four-way interaction terms were not significant and, therefore, were removed from the model for clarity.
|
|
Nucleotide variation in the ATTI array:
The alignment of 17 accessions of A. thaliana for the ATTI tandem array located on chromosome II spanned 9019 bp (from 3' of the preceding to 5' of the succeeding gene). An analysis based on 7016 nongapped sites identified 239 polymorphic sites (including 59 singletons), 16 haplotypes, nucleotide diversity estimates of
= 0.0107 and
= 0.0102, and a minimum of six recombination events. Unless otherwise stated, analyses and references to position are based on a 10,352-bp global alignment of 17 A. thaliana accessions with A. lyrata as the outgroup.
Under neutrality, nucleotide sequences exhibiting high divergence among species are also predicted to evolve at high rates within species. In the ATTI region, however, diversity and divergence were uncorrelated (rSpearman = 0.137; P = 0.996; n = 17 nonoverlapping 300-bp windows), and a multilocus HKA test rejected homogeneity of polymorphism and divergence (eight nonoverlapping loci of
500 bp;
2 = 24.71; P < 0.001). In relation to divergence, polymorphism was disproportionately lower in the 5' and 3' ends of the region and higher flanking ATTI4, in the middle of the array (Fig 3).
|
Significant linkage disequilibrium was evident throughout the entire tandem array (55% of comparisons by Fisher's exact test significant at P < 0.001; Fig 4), giving rise to two haplotype clades (Fig 5). In addition to the six recombination events identified via the ![]()
|
|
Within the ATTI array, a peak in species-wide diversity centered on ATTI5 was the result of a large number of fixed differences among haplotypes (Fig 6A;
> 0.04 in one 400-bp window; n = 17 accessions). Diversity within each haplotype was similarly low throughout the array (HA
= 0.0032; HB
= 0.0027) and showed no underlying peaks in polymorphism near ATTI5 (Fig 6A). Using 2 accessions that are representative of HA and HB (Col-0 and Fe-1, respectively), we polarized each polymorphism in the array using the A. lyrata sequence as the outgroup. Equal numbers of derived nucleotide substitutions were found in the two lineages (62 in each Col-0 and Fe-1a). Even within the largest nonrecombining region in the array interior (Fig 4), derived and ancestral substitutions were evenly distributed among haplotypes (Table 2).
|
|
We tested whether the species-wide frequency spectrum of polymorphisms deviated from expectations under neutral equilibrium population dynamics. A sliding window analysis of the ATTI region demonstrated that, on average, Tajima's D was not different from zero (D = 0.22; P > 0.10). However, two 400-bp segments did deviate significantly from zero: D = 2.08 (P < 0.05) and D = 2.26 (P < 0.05), corresponding to the 5' regions of ATTI2 and ATTI6, respectively (Fig 6B; see Single-locus analysis below). A sliding window analysis of Fay and Wu's H demonstrated that, within the context of an overall excess of high-frequency-derived polymorphisms (H = 36.06; P < 0.001), distinct deviations from neutrality were associated with ATTI2 and ATTI4 (Fig 6C).
Insertion/deletion variation in the ATTI array:
The pattern of insertions and deletions contains valuable population genetic information and simultaneously can be the source of hidden biases in the analysis of nucleotide diversity. Alignment algorithms employing a high or low gap penalty can result in elevated or reduced diversity parameters within the resulting alignments, respectively. Indels were found throughout the ATTI array, with an indel peak roughly coinciding with the peak of polymorphism (rSpearman = 0.48; P = 0.049; n = 17 nonoverlapping 400-bp windows; Fig 7). This positive correlation is not expected if high diversity were simply an alignment artifact. When we polarized 230 gapped sites in accessions representative of haplotypes A and B relative to the A. lyrata outgroup (Col-0 and Fe-1a, respectively), we observed approximately equal numbers of insertion and deletion events (20 and 23, respectively), although 2.7 times more sites were deleted (163) than were inserted (67). Derived indels were distributed evenly among haplotype clades surrounding the ATTI5 deletion (Table 2).
|
In the Ita-0 accession, we identified a 4857-bp indel that appears to be a class II (DNA) transposable element insertion as evidenced by the presence of an 11-bp terminal inverted repeat, flanking 5-bp target site duplication, and sequence similarity to A. thaliana MuDR-like elements. The element was located within the 5' untranslated region (UTR) of ATTI4 (85 bp of ATG, with no discernible effect on transcription; M. J. CLAUSS, unpublished data) and was lacking in the remaining 16 accessions as well as in the A. lyrata outgroup. The second largest indel was a 1505-bp deletion observed in 13 of 17 accessions, which includes the previously undescribed ATTI5 locus (Fig 1). The ATTI5+ allele was 43% diverged from the most similar locus; hence ATTI5 probably represents the recent deletion of an old gene family member. Consistent with this hypothesis, diversity within the ATTI5 region (
= 0.0034 for positions 72738777) was not reduced relative to average diversity across the entire tandem array (
= 0.0027 for positions 19710,548) when only the four accessions with ATTI5+ (haplotype B; Fig 6A) were considered. Comparison with the outgroup A. lyrata was not informative because ATTI5, as well as ATTI3, were deleted in all alleles surveyed thus far (M. J. CLAUSS, unpublished data). Although the ATTI5 deletion site in A. thaliana and A. lyrata was within 100 bp, homology of the deletion was uncertain because of ambiguous alignment in this highly repetitive and polymorphic region.
Single-locus analysis:
The six ATTI loci differ in their level of diversity within A. thaliana, the frequency distribution of polymorphisms, and among-species divergence (Table 3, Table 4, and Table 5; Fig 7). The consensus sequence of all ATTI loci was in frame, with no pseudogene signatures. Variation in sequence diversity among the five expressed ATTI loci was positively correlated with variability in transcript levels among accessions (Fig 8). For the nontranscribed ATTI5 locus only, a GA microsatellite located 313 bp 5' to the ATG in each TI locus was large and variable (up to 25 repeats), and there was a single missense replacement substitution in one accession (Cvi-1). As expected for functional genes, average diversity at ATTI loci (
= 0.0057) was lower than diversity across the
10 kb spanning the array (
total = 0.0107 or
silent = 0.0135; Table 3 and Table 4). Within this chromosomal context, diversity for ATTI1 and ATTI2 was extremely low (
= 0.0009), whereas
for ATTI4 was high (0.0110), approaching that of silent-site diversity. Differences in the extent of haplotype structure were responsible for some of this diversity: ATTI1 and ATTI2 had only singleton polymorphisms, whereas the two dominant haplotypes could be identified in the remaining loci (e.g., Wall's B for ATTI4 and ATTI6 was 0.54 and 0.44, respectively; Table 4). Among functional regions, exon 1, coding for most of the signal peptide, had the greatest average diversity, even higher than that of the 5'-UTR (Table 3). Significant deviation in the frequency spectra from equilibrium-neutral expectations was observed for the ATTI2 locus (including flanking region; Tajima's D = 2.02; P < 0.002) and ATTI6 (coding sequence only; D = 2.20; P < 0.01; Table 4). After a sequential Bonferroni correction (![]()
|
|
|
|
Although average ATTI divergence between A. thaliana and A. lyrata was similar to divergence across the entire chromosomal region (K = 0.100), K ranged from 0.0661 to 0.1671 among loci (Table 3). Among coding positions (
270 bp per gene) in the four ATTI loci for which an outgroup comparison was possible, divergence and diversity were positively correlated (rSpearman = 0.9487; P = 0.05; n = 4; Table 3). Although the significant HKA test reported above rejected homogeneity for the entire 10-kb region, HKA was nonsignificant when only the coding regions were compared (results not shown). Among functional regions, the lowest divergence, on average, was seen for coding sequences (8%), whereas the 5'-UTR was on average 17% divergent (Table 3). High- frequency-derived polymorphisms were in significant excess for ATTI2 (Fay and Wu's H; Table 4).
A comparison of synonymous (Ks) and amino acid replacement (Ka) substitutions can provide evidence of the evolutionary rate of functionally important changes on three levels in gene families: intraspecific diversity within loci, species divergence within loci, and divergence among loci. For ATTI, we found lower replacement than synonymous rates of evolution at each level. First, within A. thaliana,
a/
s varied from 0.29 (ATTI1) to 0.76 (ATTI6) along the array, suggesting different levels of purifying selection among loci (Table 5). Second, among species, the Ka/Ks ratio was lower than
a/
s within A. thaliana (Table 5). It is notable that Ka/Ks for ATTI6 was the lowest observed (0.14). Third, comparisons among all pairs of ATTI loci for one representative accession (Fe-1a) also illustrated overall selective constraint in trypsin inhibitor evolution (Ka/Ks << 1): average Ka/Ks ratios for all comparisons including ATTI1 through ATTI6 were 0.33, 0.40, 0.44, 0.43, 0.61, and 0.21, respectively. Thus, ATTI5 was under the least selective constraint (0.61), whereas amino acid evolution appears most constrained for ATTI1 and ATTI6.
Of particular interest for understanding changes in inhibitory function are replacement changes in the reactive site loop (Fig 1). While there were synonymous changes between the P3 to P3' positions in some loci, no within-species replacement polymorphisms were observed within this critical region. The most common residue at the P1 position for ATTI, arginine, was replaced by lysine in ATTI4 and ATTI6 (Fig 1).
| DISCUSSION |
|---|
We found a significant association between gene expression and haplotype class for loci of the A. thaliana trypsin inhibitor (ATTI) tandem gene family. Membership in two molecularly defined haplotype classes explained a significant proportion of the variation in transcription, whereas the contribution of accession nested within haplotype was not significant (Table 1). Haplotype classes differed up to sixfold in transcript level (Fig 2B). Within A. thaliana, linkage disequilbrium generally extends over <250 kb; hence a significant association between haplotype and function suggests cis-regulation at or near the tandemly arrayed ATTI loci, whereas unlinked trans-acting regulation is less likely (![]()
![]()
![]()
An association between naturally occurring sequence haplotypes and function has been observed in A. thaliana for several qualitative phenotypes relating to pathogen resistance (![]()
![]()
![]()
![]()
ATTI loci in the chromosomal landscape:
Tandemly duplicated genes located in close proximity to one another not only share a demographic and functional history, but also exist within a shared chromosomal landscape. For 10,352 aligned nucleotide positions, we observed: (1) average levels of overall polymorphism within A. thaliana and divergence to A. lyrata, (2) two distinct haplotype classes in A. thaliana with similar nucleotide diversity within each class, (3) significant linkage disequilibrium, (4) abnormally low levels of diversity associated with ATTI1 and ATTI2, and (5) a peak of polymorphism spanning the presence/absence polymorphism of the previously undescribed ATTI5 locus. Significant heterogeneity of polymorphism relative to divergence allowed us to reject the hypothesis that the genomic region encompassing the ATTI tandem array has evolved according a homogeneous neutral model (Fig 3; multilocus HKA; P < 0.001).
The co-occurrence of a peak of polymorphism at one functional locus and distinct haplotype classes has previously been reported in several studies of A. thaliana molecular evolution and may indicate balancing or frequency-dependent selection (![]()
![]()
![]()
![]()
![]()
First, the polymorphism peak in the ATTI tandem array spans >1.2 kb of nongapped sequence, including three genes, their promoter regions, and intergenic sequences (Fig 3 and Fig 6). The genes are part of a tandem array of duplicated loci that are functionally differentiated and thus represent several linked targets of selection (e.g., ![]()
![]()
Second, the peak of nucleotide polymorphism in the ATTI array centers on a presence/absence polymorphism for an old, apparently nonfunctional, gene copy (ATTI5). Whereas evidence in favor of an indel polymorphism maintained by selection is strengthened by the presence of at least one functional allele (e.g., RPM1; ![]()
Third, although the peak of polymorphism reflects fixed differences among two haplotype classes spanning several ATTI loci, the allele frequency spectrum does not provide statistical support for a balanced polymorphism. The worldwide frequency of the ancestral haplotype B (estimated by the frequency of the ATTI5+ allele) was 0.17, and nucleotide polymorphisms in linkage disequilibrium with ATTI5 did not deviate significantly from expectations under a neutral equilibrium model (Fig 6B). Diversity across the sampled 10 kb was similar for both ATTI haplotype classes (HA
= 0.0032 and HB
= 0.0027; or if accessions with interhaplotype recombination are excluded, HA
= 0.0017 and HB
= 0.0015), suggesting coexisting allele classes with segregating variation of similar age. The even distribution of derived sites among haplotypes also argues strongly against introgression of a divergent allele (Table 2). One possible explanation for this pattern is frequency-dependent selection for the maintenance of long-lived and divergent clades as proposed by ![]()
![]()
![]()
![]()
![]()
Functional evolution in closely linked ATTI loci:
ATTI loci exhibit substantial sequence divergence (65%), differences in reactive site residues determining functional specificity, and variation in constitutive as well as induced transcription (Fig 1 and Fig 2). In A. thaliana there was an association of nucleotide polymorphism with ATTI transcript level as demonstrated by significant haplotype and locus x haplotype effects on transcription (Table 1; Fig 2) and by the positive correlation between sequence diversity and variability in transcription among loci (Fig 8). This functional and molecular diversity, together with multiple recombination events (Fig 4), suggests the potential for independent adaptive evolution among tandem duplicates in A. thaliana. However, the coding sequences of ATTI loci are in close physical linkage (neighboring genes are separated on average by only 716 bp), and we estimated significant linkage disequilibrium, particularly among polymorphisms in the interior of the tandem array (Fig 4). Below, we interpret the patterns of polymorphism and divergence in functionally differentiated TI loci within the constraints of their genetic backgrounds.
Unique patterns of sequence diversity and expression for the first two TI loci indicate an evolutionary history distinct from the remaining gene family members. ATTI1 and ATTI2 exhibit extremely low levels of intraspecific polymorphism in comparison to other A. thaliana loci (Table 3 and Table 4; Fig 2A; ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
/K ratio (Fig 3) and an excess of high-frequency-derived polymorphisms concentrated in the signal peptide and 5'-UTR of ATTI2 are consistent with recent positive selection (Fig 6B and Fig C; Table 4; ![]()
ATTI3, ATTI4, and ATTI5 are located in a region with excess species-wide polymorphism, excess indels, and almost complete linkage disequilibrium (Fig 4, Fig 6, and Fig 7). Nonetheless, ATTI3 and ATTI4 were functional and show evidence of selective constraint (
a/
s < 1; Table 5). ATTI3 has the common trypsin inhibitor P1 reactive site residue (arginine) and was the most highly transcribed ATTI locus in both control and herbivore-induced treatments (Table 1; Fig 2C). ATTI4 has lysine at the P1 position and showed a low-to-intermediate transcription profile (Fig 2C). Lysine at the P1 position also results in trypsin inhibition (![]()
![]()
![]()
![]()
The function of ATTI6 in A. thaliana and A. lyrata appears to be modified in comparison to upstream TI loci. ATTI6 has lysine in place of arginine at the P1 position (as in ATTI4) and two additional amino acid substitutions in the P3-P3' reactive site loop in comparison to ATTI1 and ATTI2. The potential for independent functional evolution of ATTI6 in A. thaliana was suggested by three recombination events within and flanking this locus (Fig 4). ATTI6 is functionally unique among ATTI loci in that transcript levels for haplotype B exceeded those of haplotype A (Fig 2B). An interhaplotype recombination event located
880 bp 5' of ATTI6 was associated with a switch from haplotype A to B downstream and a concordant dramatic switch in expression. This result identifies candidate polymorphisms for regulatory control of haplotype-specific ATTI6 expression and is consistent with promoter-deletion experiments that have identified the 520 bp 5' of MTI2 essential for gene expression (![]()
a/
s observed at ATTI6, because demographic processes are expected to be pervasive throughout the genome and affect all site categories (Table 3 and Table 4). The role of balancing selection or frequency-dependent selection favoring alternate alleles at ATTI6 must be further explored via ecologically informed functional studies (e.g., ![]()
Interference or traffic occurs when variants are positively selected but fail to go to fixation because of conflicts among multiple linked segregating sites (![]()
![]()
![]()
![]()
![]()
![]()
Population genetics of defense:
Patterns of nucleotide polymorphism in plant defense-related genes can be divided into three categories. First, we find examples that are consistent with selective sweeps, as predicted by models of a coevolutionary arms race between plants and their enemies (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
In gene-for-gene pathogen resistance, selection appears to result overwhelmingly in rapid adaptive evolution among both alleles and paralogs (![]()
![]()
![]()







