Abstract
The S locus receptor kinase and the S locus glycoproteins are encoded by genes located at the S locus, which controls the self-incompatibility response in Brassica. In class II self-incompatibility haplotypes, S locus glycoproteins can be encoded by two different genes, SLGA and SLGB. In this study, we analyzed the sequences of these genes in several independently isolated plants, all of which carry the same S haplotype (S2). Two groups of S2 haplotypes could be distinguished depending on whether SRK was associated with SLGA or SLGB. Surprisingly, SRK alleles from the two groups could be distinguished at the sequence level, suggesting that recombination rarely occurs between haplotypes of the two groups. An analysis of the distribution of polymorphisms along the S domain of SRK showed that hypervariable domains I and II tend to be conserved within haplotypes but to be highly variable between haplotypes. This is consistent with these domains playing a role in the determination of haplotype specificity.
MOST flowering plant species are hermaphrodite and possess flowers in which male (stamen) and female (pistil) reproductive organs are in close proximity. However, outcrossing is favored in the majority of these species due to the presence of self-incompatibility (SI) systems that allow the pistil to recognize and reject self-pollen. In Brassica, SI is controlled by a single genetic locus (the S locus) and self-pollen is rejected on the stigma surface (reviewed in Cock 2000; Nasrallah 2000).
One of the most striking features of the S locus is its extremely high level of polymorphism. For example, >80 different S locus alleles (or haplotypes) have been identified in Brassica oleracea (Ruffio-Châble and Gaude 2001). This high level of polymorphism is thought to be the result of balancing selection, which maintains large numbers of S haplotypes in Brassica populations over long periods of time (Uyenoyama 1995). Another consequence of this unusual mode of evolution is that S haplotypes exhibit trans-specific evolution; they are often more similar to an S haplotype from another species in the Brassicaceae than to other S haplotypes from the same species (Dwyeret al. 1991; Sakamotoet al. 1998).
Physical mapping and sequence analysis of different S haplotypes has revealed large-scale rearrangements that can extend to several hundred kilobases in this region of the genome (Boyes and Nasrallah 1993; Boyeset al. 1997; Cuiet al. 1999; Suzukiet al. 1999). Several expressed genes are located within this highly polymorphic region and two of these genes have recently been shown to play key roles in the SI response. Takasaki et al. (2000) used a transgenic approach to show that SRK encodes the female component of the SI response. SRK encodes the S locus receptor kinase, a membrane-anchored stigma glycoprotein that shares structural similarity with animal receptor protein kinases (Steinet al. 1991; Delormeet al. 1995; Steinet al. 1996). SCR (or SP11) was shown to encode the male component of the self-incompatibility response (Schopferet al. 1999; Takayamaet al. 2000). SCR encodes the S locus cysteine-rich protein, a small secreted protein that is thought to be located in the pollen coating (Stephensonet al. 1997; Schopferet al. 1999; Takayamaet al. 2000).
While the functions of SRK and SCR seem to be relatively clear, that of a third S locus gene, SLG (S locus glycoprotein), is less well understood. SLG encodes a secreted glycoprotein that closely resembles the extracellular (or S) domain of SRK. The similarity between these two genes is often particularly marked within haplotypes, indicating concerted evolution (Steinet al. 1991; Kusabaet al. 1997; Nishio and Kusaba 2000). Initially, several pieces of evidence suggested that SLG functioned in the SI response (Toriyamaet al. 1991; Shibaet al. 1995). However, more recent evidence argues against a role for this protein in the recognition of self-pollen (Gaudeet al. 1995; Cabrillacet al. 1999; Okazakiet al. 1999; Nishio and Kusaba 2000) although SLG may play an auxiliary role by mediating maturation of the SRK protein (Dixitet al. 2000). If this is the case, SLG is probably not the only protein that can carry out this function as self-incompatible plants that lack SLG have been identified (Okazakiet al. 1999; Nishio and Kusaba 2000). It has been suggested that a truncated form of SRK, eSRK (Girantonet al. 1995), may substitute for SLG in these plants (Dixitet al. 2000). Takasaki et al. (2000) showed that coexpression of SLG and SRK in transgenic plants resulted in a stronger SI response than expression of SRK alone, again supporting the hypothesis that SLG plays an auxiliary role. Another function that has been proposed for SLG is as one of the components on the stigma surface that mediate pollen adhesion (Luuet al. 1999). In the latter case, SLG would function independently of the SI system as pollen adhesion is not influenced by the SI response (Luuet al. 1997).
Both SLG and SRK are highly polymorphic. The alleles of these genes can be grouped into two distinct classes, class I and class II, and there is a correlation between these classes and observed SI phenotypes. SI in Brassica is sporophytic and a nonlinear hierarchy of dominance relationships has been shown to exist between S haplotypes (Thompson and Taylor 1966). Class I haplotypes tend to be dominant and to confer a strong SI response whereas class II haplotypes tend to be recessive to the class I haplotypes and to confer a weaker SI response. Two different SLG genes, SLGA and SLGB, have recently been shown to be associated with class II S haplotypes (Cabrillacet al. 1999). Both SLGA and SLGB contain a single intron but SLGA differs from SLGB in that its second exon encodes a membrane-spanning domain, allowing the production of alternative transcripts that encode both secreted and membrane-anchored proteins (Tantikanjanaet al. 1993; Cabrillacet al. 1999).
Data from genetic crosses indicate that each S haplotype encodes an independent recognition system. Selfpollen rejection is initiated as a result of the interaction of male and female components encoded by the same S haplotype. In principle, therefore, comparison of different haplotypes should permit the identification of sequence motifs that determine haplotype specificity. This approach has been used profitably, for example, as an aid to functional analysis of polymorphic disease resistance gene loci (Elliset al. 1999). However, a draw-back with applying this approach to self-incompatibility systems is the very high level of polymorphism at the S locus. For example, searches for regions that potentially determine haplotype specificity in the S domain of SRK have identified several extremely polymorphic hypervariable (HV) regions (Nasrallahet al. 1987; Kusabaet al. 1997). However, the level of polymorphism throughout the rest of the S domain is also very high and it is possible that the HV regions are simply under a low level of selective constraint and that this is sufficient to explain the higher level of polymorphism in these regions (Nasrallah 1997). The approach that we have taken here, to circumvent this problem, is to compare not only S haplotypes that confer different SI specificities (“between haplotype” comparisons) but also several independently isolated S haplotypes that confer the same SI specificity (“within haplotype” comparisons). Regions under relaxed selective constraint are expected to be more divergent (compared to the rest of the protein) in within-haplotype comparisons whereas regions involved in determining haplotype specificity should be conserved in genes that confer the same SI specificity. Sequence heterogeneity at S locus gene loci has been previously reported for the S2 haplotype (Kusabaet al. 2000). Here we analyze an additional 10 S2 haplotypes. Two groups of S2 haplotype were defined, depending on the presence of either only SLGA or only SLGB. We show that, at the sequence level, hypervariable domains HV1 and HV2 are conserved in independent SRK2 alleles but are divergent between haplotypes, indicating a role for these domains in determining S haplotype specificity.
MATERIALS AND METHODS
Plant material and pollination analysis: The seven S2 B. oleracea plants analyzed in this study were from diverse origins and include varieties of kale, broccoli, and Brussels sprouts (Table 1). Incompatibility phenotypes were determined by self- and cross-pollinations using previously described procedures (Ruffio-Châble and Gaude 2001). Homozygous S6 and S8 tester lines were from the collection at the Genetic Resources Unit, Horticulture Research International (Wellesbourne, United Kingdom), which has recently been duplicated at INRA, Le Rheu (Rennes, France).
Immunological detection of SLG proteins in stigma extracts: Protein extraction, separation of proteins by isoelectric focusing, electrotransfer onto nitrocellulose membranes, and immunodetection of antigen with antibodies were performed as described previously (Gaude et al. 1991, 1993). Polyclonal rabbit serum specific for class I SLG proteins and monoclonal antibody MAb 85-36-71, which recognizes class II SLG proteins, have been described previously (Gaude et al. 1991, 1993; Girantonet al. 1995).
DNA blot analysis: DNA blot analysis was carried out as described (Cabrillacet al. 1999). Probes corresponding to intron sequences of SLGA and SLGB (Cabrillacet al. 1999) and to SLA (Pastugliaet al. 1997b) have been described. The SRK probe was derived from the third intron of SRK2 (J. M. Cock, unpublished data) by PCR amplification with SK42 (5′-GTATAAATAATGAAGGAATCACTATGAAAT-3′) and SK40 (5′-GATCACTTATACAAAACCAACAGAGCAG-3′).
PCR and PCR-restriction fragment length polymorphism segregation analysis: SLGA and SLGB alleles were amplified from genomic DNA, using the PCR and oligonucleotides PS3 (see below) and SG17 (5′-TGTTCCGTCTGTCAAGTCCCACTGCTGCGG-3′) for the progeny of plant i/j and PS4CM (5′-CGGAATATGGTATAAGAAAGTCTCCCA-3′) and SG39 (5′-CTTTGCGTTTCAACACGTTGATTCA-3′) for the progeny of plant k/l. SRK alleles were similarily amplified using the oligonucleotides PS5CM (5′-GGAATATGGTATAAAAAAGCCCC CTG-3′) and SG2 (see below). PCR amplification conditions were 94° for 3 min followed by 30 cycles of denaturation at 94° for 40 sec, annealing at 51° for 40 sec, and extension at 72° for 1 min in a GeneAmp PCR system 9600 cycler (Perkin-Elmer, Shelton, CT). PCR products were digested with either TaqI or NdeI restriction enzymes (GIBCO BRL, Gaithersburg, MD) to detect a restriction fragment length polymorphism (RFLP).
Origins of the seven S2 plants analyzed in this study
cDNA cloning and DNA sequencing: Total RNA (1 μg) from stigmas of each of the seven S2 plants was reverse transcribed with Superscript II reverse transcriptase (GIBCO BRL), using oligonucleotide RA1 (Frohmanet al. 1988) as a primer. SRK and SLG sequences were PCR amplified from the resulting cDNA or from genomic DNA extracted from the same plants, using different combinations of the following oligonucleotides:
5′ oligonucleotides:
PS3, 3′-ATGAAAGGGGTACAGAACAT-5′ (Nishioet al. 1996)
SDOM1, 3′-GWTGGTAYCTCGGRATRTGGTA-5′
3′ oligonucleotides:
SK30, 3′-TTCTCGCCCTCATAAACACAACAG-5′
SK38, 3′-CTCCAACCTATGATTTTTCCAGT-5′
SK66DC, 3′-CTCCTCCAAAAGCAGAACACGATAACACTC-5′
SG2, 3′-GGCCTGCAGCAGCATTCAATCTGAC-5′
PCR products were amplified under the same conditions described above and were cloned into pGEM-T easy (Promega, Madison, WI) and sequenced by the dideoxynucleotide chain termination method of Sanger et al. (1977) on an automatic sequencer (Applied Biosystems, Foster City, CA). Sequence data were analyzed with Lasergene sequence analysis software (DNAstar, London).
Sequence analysis: Multiple sequence alignments were constructed using the Megalign program (Lasergene; DNAstar). Neighbor-joining trees were constructed from these alignments using ClustalW software on the Pasteur Institute website (http://bioweb.pasteur.fr/#log; Thompsonet al. 1994). Nonsynonymous substitutions were identified using DnaSP, version 3 (Rozas and Rozas 1999).
The nucleotide sequence data presented in this article have been submitted to the EMBL, GenBank, and DDBJ nucleotide sequence databases under accession nos. AJ306573–AJ306591.
RESULTS
Identification of Brassica plants carrying S2 haplotypes: Diallel crosses were carried out to test the self-incompatibility phenotypes of seven putative S2 B. oleracea plants from diverse origins (including kale, broccoli, and Brussels sprout lines; Table 1). Table 2 shows that a strong incompatible reaction was observed in the majority of cross-pollinations between the seven plants. In several crosses, however, either the SI reaction was weak or the cross was fully compatible. This was often the case, for example, when plant f was used as the female partner or when plant g/h was used as the male partner. This type of variability in the strength of the SI response has been observed previously, particularly with class II haplotypes. For example, Ruffio-Châble et al. (1997) described similar levels of variability in the strength of the SI response between different Brassica lines homozygous for the S15 haplotype. This phenomenon is thought to be due to the action of suppresser and modifier genes that are unlinked to the S locus but that modulate the SI response (Hinataet al. 1994). In general, however, each of the seven lines exhibited a strong SI response with several of the other members of the group, indicating that they all carried a common S specificity. All of the plants tested, with the exception of plant f, were self-incompatible.
Self- and cross-incompatibility phenotypes of the seven S2 plants in diallel crosses
Immunodetection of S-locus-encoded glycoproteins in stigma extracts from the seven S2 plants. Soluble stigma proteins were separated by isoelectric focusing gel electrophoresis. Immunodetection was performed with monoclonal antibody mAb 85-36-7. The α- and β-polypeptides detected by the antibody are indicated.
Immunodetection of SLG in stigma extracts of the S2 plants: Immunological detection of SLG in stigma extracts separated by isoelectric focusing is a highly discriminative method for typing S haplotypes (Ruffio-Châble et al. 1997, 1999; Okazakiet al. 1999). Figure 1 shows that two bands, α and β, characteristic of the S2 haplotype (Gaudeet al. 1995), were specifically recognized by the monoclonal antibody (mAb) 85-36-71 in all of the S2 plants except g/h and i/j. No additional S-locus-specific proteins were detected with either mAb 85-36-71 or with a polyclonal antibody that specifically recognizes class I SLG proteins (Delormeet al. 1995; Gaudeet al. 1995) in any of the seven S2 plants.
Variation in the complement of SLG genes in the various S2 plants: The immunodetection experiments indicated heterogeneity among the S2 plants. To investigate this further, we probed genomic DNA blots with probes that specifically detect four S locus genes, SRK, SLGA, SLGB, and SLA (Figure 2). Plants g/h and i/j showed markedly different patterns to the five other S2 plants. Two bands (2.00 and 1.25 kbp) hybridized to the SRK probe in DNA of both of these two plants and neither of these bands corresponded to that detected in the other five S2 plants with the same probe (a single fragment of 1.27 kbp). A 0.85-kbp fragment that hybridized to the SLGB probe was detected in DNA of plants g/h and i/j but no hybridization was detected to the SLGA probe. Conversely, a 1.10-kbp SLGA fragment was detected in DNA of the five other S2 plants but, in most cases, no SLGB sequence was detected. The only exception was DNA from plant k/l, which contained both the 1.10-kbp SLGA and the 0.85-kbp SLGB bands, indicating the presence of both genes. The SLA probe hybridized to the same samples as the SLGA probe, as expected (SLA has been shown to be very closely linked to SLGA; Boyes and Nasrallah 1995; Pastugliaet al. 1997b). These results were confirmed by probing genomic DNA digested with BamHI (data not shown).
The presence of two bands that hybridized to the SRK probe in DNA from plants g/h and i/j indicated that the plants analyzed might have been heterozygous at the S locus. To investigate this further we analyzed 18 progeny of the plant i/j. Figure 3a (top) shows that the two sequences detected by the SRK probe segregated with a 4:10:4 ratio (homozygous i:heterozygous i/j:homozygous j) that closely approximates a Mendelian 1:2:1 ratio. When BamHI-digested DNA was probed with the SLGB probe, the pattern obtained was consistent with the presence of two polymorphic allelic sequences. However, an additional, nonpolymorphic fragment was also detected in this experiment (Figure 3a, middle). An alternative approach, based on restriction digestion of PCR products, was therefore used to specifically amplify the SLGB alleles and to detect a polymorphism that was predicted from the sequences of the two SLGB alleles (Figure 3a, bottom). These analyses demonstrated that plant i/j was heterozygous at both the SRK and SLGB loci and allowed the identification of the alleles associated with each haplotype.
DNA gel blots indicated that plant k/l carried alleles of SRK, SLGA, and SLGB, suggesting that it may have been heterozygous at the S locus (Figure 2). To investigate this further, we screened genomic DNA from 10 progeny of plant k/l for the presence of SLGA and SLGB, using PCR and gene-specific oligonucleotide pairs. SLGA and SLGB segregated independently in this population, confirming that plant k was heterozygous (Figure 3b).
The conclusions that can be drawn from the data presented in Figures 2 and 3 are summarized in Figure 3c. Figure 3c also includes the two S2 haplotypes characterized by Tantikanjana et al. (1993) and Kusaba et al. (2000), which we have designated as S2a and S2b, respectively. The figure shows that two classes of S2 haplotype could be clearly distinguished. In the first, SRK is associated with SLGA (haplotypes a, c, d, e, f, and k) and in the second SRK is associated with SLGB (haplotypes b, g, h, i, j, and l). None of the plants analyzed carried an S2 haplotype consisting of SRK, SLGA, and SLGB as has been described in the S15 haplotype.
Cloning and sequence analysis of the S locus genes present in the B. oleracea S2 plants: SRK, SLGA, and SLGB sequences were amplified from the seven S2 plants by reverse transcription (RT)-PCR or by PCR amplification of genomic sequences, using different combinations of six oligonucleotides designed to amplify class II S locus genes. We isolated clones corresponding to all of the SRK, SLGA, and SLGB alleles detected by DNA gel blotting except SRK2g and SRK2h. However, note that, based on comparison of SLGB sequences (see below), these two SRK alleles are likely to be highly similar to SRK2i and SRK2j. The cloned fragments shared a region corresponding to almost the entire S domain (represented by a shaded bar in Figure 5a). To demonstrate that the sequences that we designated SRK were not derived from SLGA or SLGB genes, we amplified, by RT-PCR, a fragment that included both S domain and kinase domain sequences (data not shown).
Detection of SRK, SLGA, SLGB, and SLA alleles in B. oleracea plants carrying S2, S3, and S15 haplotypes. A gel blot of genomic DNA digested with EarI was sequentially hybridized with the probes indicated beneath each panel. The positions of molecular length markers are shown at left in kilobases (kbp).
To estimate the level of sequence polymorphism at each S locus gene locus we compared, at both the nucleotide and deduced amino acid sequence levels, the S domains of the SRK, SLGA, and SLGB alleles from the seven S2 plants and from the two S2 haplotypes that have been described previously (Tantikanjanaet al. 1993; Kusabaet al. 2000). Figure 4a shows that polymorphisms were detected in SRK, SLGA, and SLGB although SLGA was significantly less polymorphic than SLGB and SRK. Despite this polymorphism, alleles of the three genes could be clearly distinguished at the sequence level. SLGA2 was less closely related to SRK2 and SLGB2 than these latter genes were to each other.
Figure 4b shows a phylogenetic tree that was constructed from an alignment of the deduced amino acid sequences of the S domains of the S2 sequences with those of other class II S locus genes. The allelic forms of SRK2 clustered in one part of the tree together with the two previously reported sequences, SRK2a and SRK2b. SRK sequences corresponding to other class II alleles fell outside this cluster. Taken together with the data from genetic crosses (Table 2), this confirmed that the seven plants analyzed in this study carried the S2 haplotype.
Surprisingly, two phylogenetically distinct groups of SRK2 sequence could be distinguished (Figure 4b). These two groups correlated exactly with the presence of either SLGA or SLGB in each haplotype (compare Figure 4b with Figure 3c).
Sequence analysis of the S2 plants also provided a possible explanation as to why no SLG proteins were detected in stigma extracts of plants g/h and i/j (Figure 1). The epitope recognized by the mAb 85-36-71 (IYVN-TLSSSE) is conserved at the predicted N terminus of the mature SLGA protein, but a variant sequence (TYV-NTMSSSE) is predicted to occur in the SLGB protein encoded by the i, j, g, and h haplotypes, and no SLGA gene was detected in these plants. From a practical point of view, heterogeneity concerning the presence or absence of SLG genes in haplotypes that confer the same SI phenotype could present problems for S haplotype typing methods that are based on detection of SLG at either the DNA or protein level (Nishioet al. 1996; Ruffio-Châbleet al. 1997; Sakamotoet al. 1998; Okazakiet al. 1999; Ruffio-Châbleet al. 1999). In this respect, methods based on determining the SRK genotype are likely to be more reliable (Nishioet al. 1997).
Sequence polymorphism in the HV domains of SRK: Recognition of self-pollen in self-incompatible Brassica is thought to involve haplotype-specific interaction between SCR on the male side and SRK on the female side. It has, however, been difficult to identify the regions of these two proteins that determine haplotype specificity because of their extremely polymorphic nature. For example, although hypervariable regions have been identified in the extracellular domain of SRK [HV1, HV2, HV3, and a deletable region (DR); Nasrallahet al. 1987; Kusabaet al. 1997], it is not known whether these domains determine haplotype specificity or whether they are merely regions that are under a low level of selective constraint and therefore are free to diverge to a greater degree. One way of addressing this question is to carry out comparisons both between alleles of the same S haplotype (where domains determining S haplotype specificity should be conserved) and between alleles from different S haplotypes (where domains determining S haplotype specificity should be polymorphic).
When the 10 SRK2 alleles were compared and polymorphic residues were positioned with respect to the S domain, we noted that none of the substitutions affected residues of the DR, HV1, or HV2 (Figure 5a, top). In contrast, when SRK sequences from different class II haplotypes were compared, polymorphic residues appeared to be clustered in these regions (Figure 5a, bottom). To further investigate this observation, we calculated the percentage of substitutions observed in each region on a per residue basis so that substitution rates could be compared despite the differences in length of each region (Figure 5b). This analysis suggested that the pattern of substitutions observed when the different SRK2 sequences were compared differed from that observed when SRK sequences were compared between haplotypes. On average, when SRK sequences of the three class II haplotypes were compared, a substitution was more than three times more likely to be found at a residue within the DR, HV1, or HV2 than at a residue elsewhere in the S domain. In contrast, when the SRK2 sequences were compared, no substitutions were observed in these regions. We used a modified version of the Hudson-Kreitman-Aguadé (HKA) test (Hudsonet al. 1987) to determine whether there was a significant difference in the distribution of nonsynonymous substitutions between the DR/HV1/HV2 and the rest of the region analyzed in a comparison of SRK2 alleles vs. a comparison between the SRK2 alleles and SRK15 (i.e., comparing within-haplotype and between-haplotype comparisons). We did not include HV3 with the other HV regions in this analysis because it is divergent between some SRK2 alleles (Figure 5b) and yet conserved in several SRK alleles corresponding to different S haplotypes (see below), suggesting that it has not evolved in the same manner as the other HV domains. Table 3 shows that the HKA test gave a result that was very close to being significant (P = 0.07). This is interesting because the HKA test provides a conservative estimate of significance, particularly in this case, where the number of observed nonsynonymous substitutions is small.
Linkage analysis of the SLG and SRK alleles in plants i/j and k/l. (a) Segregation of SRK and SLGB alleles in self-progeny of plant i/j. DNA gel blot analysis was performed on 5 μg/lane of EarI-(top) or BamHI-digested (middle) genomic DNA of plant i/j and 18 self-progeny using SRK (top) and SLGB (middle) probes. The bottom shows a PCR-RFLP analysis of the segregation of the two SLGB alleles. SLGB sequences were specifically amplified with oligonucleotides PS3 and SG17 and digested with TaqI to reveal a restriction fragment length polymorphism predicted from sequence analysis of the two SLGB alleles. Bands corresponding to alleles of either SRK or SLGB from the S2i and S2j haplotypes are indicated by either a solid circle or an open circle, respectively. The S2 haplotype(s) (S2i or S2j) carried by each of the progeny is (are) indicated below each lane. The positions of molecular length markers are shown at left in kilobases (kbp) or base pairs (bp). (b) Analysis of the segregation of SLGA, SLGB, and SRK in self-progeny of plant k/l. SLGA, SLGB, and SRK sequences were specifically amplified with oligonucleotides PS3 and SG38, PS4CM and SG39, or PS5CM and SG2, respectively. The S2 haplotype(s) (S2k and/or S2l) carried by each of the progeny is (are) indicated below each lane. The positions of molecular length markers are shown at left in base pairs (bp). (c) Schematic representation showing the complement of genes present in S2a, in S2b, and in the 10 S2 haplotypes carried by the seven plants analyzed in this study (including three plants that were heterozygous at the S locus).
Although the total number of polymorphic sites identified when SLGB2 sequences were compared was similar to that observed for the SRK2 sequences, a much higher proportion was located in the DR, HV1, and HV2 (7 out of 14 substitutions, Figure 5a). A Fisher's exact test showed that this difference in the distribution of polymorphisms in SLGB and SRK was highly significant (P = 0.006; Table 4), indicating that these domains are not under the same functional constraints in SLGB as in SRK. These data are consistent with several reports that indicate that SLG does not play a role in haplotype-specific recognition of self-pollen (Gaudeet al. 1995; Cabrillacet al. 1999; Nishio and Kusaba 2000).
Sequence comparison and phylogenetic relationships of the S domains of class II SRK, SLGA, and SLGB alleles. (a) Similarity between alleles of SRK, SLGA, and SLGB from 12 S2 haplotypes. Alleles corresponding to each gene are grouped separately and the range of similarity between genes at the nucleotide (nuc.) and amino acid (aa.) levels is indicated. For each gene, alleles were subgrouped on the basis of similarity and the numbers of synonymous/nonsynonymous substitutions observed within each group and subgroup are indicated. All comparisons were carried out using a region of 1299 bp corresponding to the majority of the S domain (see Figure 5a where this region is indicated by a shaded bar). (b) Neighbor-joining tree based on class II SLG and SRK S domain sequences. SRK2a and SLG2a (Tantikanjanaet al. 1993), SRK2b and SLG2b (Kusabaet al. 2000), SLG5 (Scutt and Croy 1992), SRK5, SRK15, SLGA15, and SLGB15 (Cabrillacet al. 1999) have been described previously. SFR2 (Pastugliaet al. 1997a) was included as an outgroup. Based on sequence similarity and gene structure, SLG5 is an allele of SLGB. The tree was constructed using the PAM250 amino acid substitution matrix. The numbers next to the branches are bootstrap values expressed as percentage confidence level and based on 2000 repeats. The scale indicates amino acid substitutions/site. Note that this tree is presented as a means of representing the relationships between the sequences shown and probably does not correctly describe the evolutionary events that generated these sequences. The phylogeny of these sequences has almost certainly been obscured in many cases by events that tend to result in concerted evolution within haplotypes.
The difference in the distribution of polymorphisms in SRK and SLGB also supports the hypothesis that the DR, HV1, and HV2 domains have been conserved in the SRK2 alleles. To investigate this difference further, we carried out a second HKA test, again to compare distributions of nonsynonymous substitutions between the DR/HV1/HV2 and the rest of the region analyzed, but this time comparing the “within SRK2” data to a comparison between SRK2 and SLGB2. Table 5 shows that a significant difference was detected in this comparison (P = 0.03), suggesting that there has been either selection against changes in the DR, HV1, and HV2 of SRK2 or that there has been positive selection for changes in these regions since the divergence of SRK and SLGB. It will, however, be important to carry out similar analyses on other S haplotypes to ensure that the observed difference is not due to gene conversion having, by chance, preferentially homogenized the non-HV regions in the S2 haplotype.
To date, intrahaplotype polymorphism has been described in only one other Brassica haplotype, S13 (Kusabaet al. 2000). Surprisingly, and in contrast to our findings with the S2 haplotype, when the S domains of SRK13 and SRK13-b were compared, one of the three amino acid substitutions was located in HV2. However, note that, if HV1 and HV2 are involved in determining S haplotype specificity, it is possible, and even likely, that only a subset of residues within the HV domains is involved in this function. Six of the seven differences between the predicted mature SLG13 and SLG13-b proteins were found within HV1, supporting our observation that substitutions are clustered in the HV domains in SLG proteins from the same haplotype.
Distribution of nucleotide and amino acid polymorphisms in SRK, SLGA, and SLGB. (a) Schematic representations showing the positions of observed nucleotide and amino acid substitutions along the S domains of SRK, SLGA, and SLGB following intra- and interhaplotype comparisons. Comparisons were carried out for the 10 SRK2 alleles (SRK2), the 6 SLGA2 alleles (SLGA2), the 6 SLGB2 alleles (SLGB2), and between SRK2 and either SRK5 or SRK15 (SRK class II). Vertical bars and vertical bars with open circles indicate the positions of nucleotide and amino acid residues that exhibited polymorphism. In the bottom only amino acid substitutions are shown, either between SRK2 and SRK5 (above the rectangle) or between SRK2 and SRK15 (below the rectangle). Open rectangles represent signal peptides plus S domains; the region that was compared is indicated by a shaded bar. The positions of the DR (deletable region) and the three hypervariable regions (HV1, HV2, and HV3) are indicated beneath the S domains by solid bars. The hatched portions of the S domain represent the two variable regions defined by Nasrallah et al. (1987). (b) Frequency of observed amino acid substitutions in different regions of the SRK S domain in either inter- or intrahaplotype comparisons. The mean frequencies of observed amino acid substitutions, expressed as percentage substitutions per amino acid residue, are indicated by lightly shaded and darkly shaded bars for comparisons within the 10 SRK2 sequences and for comparisons between different class II SRK sequences (SRK2/SRK5, SRK2/SRK15, and SRK2/SRK29), respectively. SRK2, SRK5, and SRK15 are from B. oleracea, and SRK29 is from B. rapa. DR, deletable region; HV1, HV2, and HV3, hypervariable regions; rest, rest of the region indicated by the shaded bar in Figure 5a, excluding the DR, HV1, HV2, and HV3 regions.
HKA test for difference in distribution of nonsynonymous substitutions between DR/HV1/HV2 and the rest of the S domain in a comparison between SRK2 alleles vs. a comparison between the SRK2 alleles and SRK15
To further investigate the role of the HV domains in SRK function, we compared the HV domains of all the SRK sequences available in the databases (Figure 6). The sequence of the HV3 region was not conserved in the SRK2 sequences and, moreover, the HV3 domains of four of the SRK2 sequences were identical to those of SRK5 and SRK45 from B. oleracea and B. campestris, respectively. In contrast, HV1 and HV2 were both 100% conserved in the 10 SRK2 sequences but no two SRK sequences from different haplotypes exhibited the same sequence in these regions. Taken together with the analyses described above, these data argue against DR, HV1, and HV2 merely being regions of the SRK protein that are under a low level of functional constraint. Rather, they indicate that substitutions in these regions are associated with the acquisition of new SI specificities.
The DR was conserved among the SRK2 sequences but two pairs of class I SRKs shared identical DR domains (BrSRK8/BoSRK23 and BoSRK3/BnSRKA10, where Bo, Br, and Bn refer to the species of origin, B. oleracea, B. rapa, and B. napus, respectively; Figure 6). Therefore, this domain does not determine S haplotype specificity alone although it is possible that it may contribute to specificity by acting in combination with another polymorphic region of the protein such as HV1 or HV2.
DISCUSSION
SLG and S locus structure: In a previous study, we reported the existence of two different SLG genes (SLGA and SLGB) in class II S haplotypes (Cabrillacet al. 1999). Both SLG genes were present in an S15 haplotype but either only SLGA or only SLGB was detected in an S5 and in an S2 haplotype, respectively. More recently, Kusaba et al. (2000) described a novel form of the S2 haplotype, S2b, which exhibited both structural and sequence differences compared to the S2 haplotype originally described by Tantikanjana et al. (1993). In this study, we analyzed the sequences of SLG and SRK genes from 10 additional S2 haplotypes from diverse genetic backgrounds. On the basis of sequence similarities, the 10 S2 haplotypes could be divided into two groups that resembled S2a on one hand and S2b on the other. In the first group, SRK was associated with SLGA, and SLGB was absent, while in the second group SRK was associated with SLGB, and SLGA was absent.
Fisher's exact test to compare distribution of nonsynonymous substitutions between DR/HV1/HV2 and the rest of the S domain in SRK2 and SLGB2
Taken together, these results indicate that SRK, SLGA, and SLGB were already associated at the S locus in an ancestor of the class II haplotypes. SLGA or SLGB would then have been lost from the S locus in some haplotypes, these events occurring, at least in part, after the divergence of the different class II S haplotype specificities. An alternative scenario has been proposed by Kusaba et al. (2000). They suggest that SLGA was originally acquired by the S2 haplotype and then subsequently transmitted to the S15 haplotype. This latter model is based on the fact that SLGB is commonly found in class II haplotypes of both B. oleracea and B. rapa (Scutt and Croy 1992; Hatakeyamaet al. 1998) whereas SLGA has been found only in the S2 and S15 haplotypes so far. Note, however, that SRK15 is more similar to SLGA15 than to SLGB15 whereas the opposite is true for SRK2, suggesting that, if SLGA was acquired after divergence of the class II haplotypes, then this probably occurred in the S15 haplotype. A more detailed analysis of independent isolates of additional class II haplotypes such as S5 should help to distinguish between these different models.
The S locus region has been characterized in detail in several class I S haplotypes and there is currently no evidence for the presence of a second SLG gene in this class of haplotype (Boyeset al. 1997; Cuiet al. 1999; Suzukiet al. 1999). It seems likely, therefore, that the acquisition of a second SLG gene by the class II haplotypes occurred after their divergence from class I haplotypes.
HKA test for difference in distribution of nonsynonymous substitutions between DR/HV1/HV2 and the rest of the S domain in a comparison between SRK2 alleles vs. a comparison between the SRK2 alleles and SLGB2
It is difficult to determine precisely how SRK, SLGA, and SLGB originated due to the highly polymorphic nature of these three genes. Comparison of their introns indicates a complex series of recombination and mutation events that tend to obscure their evolutionary history (Cabrillacet al. 1999). It seems probable, however, that SLGA arose as a result of a duplication of SRK (Tantikanjanaet al. 1993). SLGB may have originated in a similar manner, although it is also possible that it was recruited to the S locus from another part of the genome. Analogous translocation events of this type have been observed between distant disease resistance gene clusters in tomato (Parniske and Jones 1999).
Comparison of the S2 haplotypes indicated the existence of two distinct phylogenetic groups of SRK2 alleles and these two groups correlated with the presence of either SLGA2 or SLGB2, respectively. The fact that SRK alleles from each group can be clearly distinguished at the sequence level suggests that recombination between these two types of S2 haplotype has been suppressed or has occurred only rarely. An alternative possibility is that these two groups of S2 alleles evolved independently in geographically isolated populations and have been brought together only recently. Note, however, that we found no correlation between the presence of SLGA or SLGB and the origin of the plants that were analyzed.
Hypervariable domains and haplotype specificity: In both sporophytic and gametophytic multiallelic SI systems, genes that encode female components of the SI system have been shown to possess regions of extreme sequence polymorphism, the HV domains. In the Solanaceae, S-RNAse HV domain swapping experiments have provided evidence that these domains are responsible for the difference in specificity between closely related alleles (Mattonet al. 1997). However, other regions of these proteins also seem to be important because exchanges of HV domains between more distantly related alleles often result in loss of activity rather than acquisition of new allelic specificities (Kao and McCubbin 1996; Zureket al. 1997).
Sequence comparison of the DR and HV regions from the SRK2 alleles with other class II and class I SRK alleles. Solid bars indicate the positions of each region. Numbers indicate the position of the last residue of each block in relation to the N-terminal end of the predicted unprocessed polypeptide. Residues conserved in a majority of sequences are boxed. *, SRK sequences with identical HV3 regions; ● and ◆, two pairs of identical DR sequences; Bo, B. oleracea; Br, B. rapa; Bn, B. napus; DR, deletable region; HV1, HV2, and HV3, hypervariable regions.
By analogy to the SI systems of the Solanaceae, and bearing in mind the unusually high number of functional S haplotypes in Brassica, it seems likely that the highly polymorphic HV regions of SRK are also involved in determining S haplotype specificity. However, in the absence of a direct experimental demonstration of the role of these domains, it has been difficult to distinguish between this possibility and a second hypothesis in which the HV regions are under a low level of selective constraint and therefore free to diverge to a greater degree (Nasrallah 1997). In this study we have addressed this problem by comparing the level of polymorphism in these regions either within or between haplotypes. These comparisons show that HV1 and HV2 tend to be conserved in SRK molecules that confer the same SI specificity but are diverged in SRK molecules that confer different SI specificities, indicating a role for HV1 and HV2 in determining haplotype specificity.
In the Solanaceae, comparisons of the rates of nonsynonymous and synonymous substitution in the hypervariable region of S-RNases indicated that these regions have been subjected to positive selection (Clark and Kao 1991). We carried out a similar analysis of HV1 and HV2 in the class II SRK genes but did not find any direct evidence of positive selection acting in these regions (data not shown). Similar results were obtained by Awadalla and Charlesworth (1999) when they analyzed the HV regions of class I SRK genes. However, these authors noted that, while there was no strong evidence of positive selection acting on the HV regions (Ka/Ks ratios were close to 1), Ka/Ks ratios for these regions were significantly higher than for the rest of the protein. They suggested that evidence of positive selection may be obscured by the high level of substitution in these regions, particularly if adaptive substitutions can occur at only a small subset of the amino acid positions.
The role of SLG in S locus function and evolution: There is accumulating evidence that SLG is not involved in the haplotype-specific recognition of self-pollen (Gaudeet al. 1995; Cabrillacet al. 1999; Okazakiet al. 1999; Nishio and Kusaba 2000). If this is the case, then it is necessary to explain why SLG and SRK alleles are often highly similar within the same S haplotype (Steinet al. 1991; Goringet al. 1993; Delormeet al. 1995; Kusabaet al. 1997). We have suggested that this concerted evolution of SLG and SRK is due to their close proximity in the genome, which tends to favor gene conversion events (Cabrillacet al. 1999). There is convincing evidence that gene conversion events occur between SLG and SRK, particularly within haplotypes (Goringet al. 1993; Watanabeet al. 1994; Suzukiet al. 1997; Cabrillacet al. 1999). The DR, HV1, and HV2 regions were highly conserved in the SRK2 sequences analyzed in this study but this was not the case for SLGB2, where there was a tendency for substitutions to accumulate in these regions. We suggest that the capacity of SLG to accumulate substitutions in its HV domains may be an important factor in the generation of new S haplotype specificities. Gene conversion events would provide a means of transferring novel HV domain sequences from SLG to SRK and this could accelerate the evolution of novel SI specificities on the female side. Two other factors may be important for the functionality of such a system. The first is the overall similarity between SLG and the S domain of SRK, particularly within haplotypes, which is presumably the result of (reciprocal) sequence homogenization. The second is the maintenance of SLG as a functional protein due to its role in pollen adhesion and/or SRK maturation. In this respect, it is interesting that SLG binds PCPA1, a component of the pollen coat that is related to SCR at the sequence level (Doughtyet al. 1998).
Acknowledgments
We thank Fabienne Deguerry and Anne-Marie Thierry for technical assistance, and Rosemary MacClenaghan, Graham King, Angela Pinnegar, and Dave Astley from HRI Wellesbourne for providing Brassica lines. This work was funded by the Institut National de la Recherche Agronomique, the Centre National de la Recherche Scientifique, and the Ecole Normale Supérieure de Lyon.
Footnotes
-
Communicating editor: D. Charlesworth
- Received May 3, 2001.
- Accepted July 16, 2001.
- Copyright © 2001 by the Genetics Society of America