Genetics, Vol. 149, 1063-1067, June 1998, Copyright © 1998

Possible Assortment of a1 and a2 Region Gene Segments in Human MHC Class I Molecules

George Johnsona and Tai T. Wua
a Departments of Biochemistry, Molecular Biology and Cell Biology, and Biomedical Engineering, Northwestern University, Evanston, Illinois 60208

Corresponding author: Tai T. Wu, Room E267, Technological Institute, 2145 Sheridan Rd., Northwestern University, Evanston, IL 60208, tt{at}immuno.bme.nwu.edu (E-mail).

Communicating editor: S. L. ALLEN


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Using pair-wise comparison of aligned nucleotide sequences of distinct and complete human MHC class I molecules, we have constructed triangular tables to study the similarities and differences of various a1 (exon 2) and a2 (exon 3) region sequences. There are two HLA-A (A*6901 and A*6601) and 13 HLA-B (B*4201, B*8101, B*4102, B*4801, B*4007, B*4001, B*4802, Dw53, B*4406, B*4402, B*3901, B*1514 and B*3702) sequences that have identical a1 sequences with other known MHC class I molecules, while their a2 sequences are the same as those of different ones. Of these 15, A*6901, B*4001 and B*4802 have previously been suggested as the results of recombination between A*6801 and A*0201, B*4101 and B*8101, and B*4801 and B*3501, respectively. However, many other sequences can also be used to generate them by recombination. Furthermore, their reciprocal products have never been identified. Thus, gene conversion has subsequently been suggested as an alternative. Another possible genetic mechanism for generating these nucleotide sequence similarities can be assortment, or that some gene segments can be duplicated or multiplicated to be used in different human MHC class I molecules. Interestingly, this genetic mechanism is probably absent for the generation of different mouse MHC class I molecules.


THE nucleotide sequences of human major histocompatibility complex (MHC) class I molecules have been extensively collected by PARHAM and his collaborators (ARNETT and PARHAM 1995 Down). They have noticed that the a1 region nucleotide sequences of A*6901 and A*6801 are identical, and A*6901 and A*0201 have the same a2 region nucleotide sequence. Therefore, they initially suggested that A*6901 might be the result of recombination of A*6801 and A*0201 molecules (HOLMES and PARHAM 1985 Down). Because the reciprocal product has not been found, and because the differences between A*6901 and A*6801 sequences in their a2 regions are localized to a relatively short segment, HEMMI et al. 1988 Down and PARHAM et al. 1995 Down have subsequently suggested that gene conversion may be a more likely mechanism.

HUGHES et al. 1993 Down have found two similar examples for HLA-B molecules: B*4001 being the product of B*4101 and B*8101 recombination, and B*4802 being that of B*4801 and B*3501 with one nucleotide difference.

A thorough pairwise comparison of these sequences was initiated by PARHAM et al. 1995 Down, and they noticed the following: "In comparing pairs of HLA-A,B,C alleles, only 2 pairs out of a total of 6,400 cannot be distinguished on the basis of sequences in exons 2, 3 and 4... In contrast, when comparison is restricted to either exon 2 or exon 3 alone then the number of ambiguous pairs increases significantly." They also made plots of occurrences vs. differences to separate interlocus and intralocus comparisons.

We have been using similar methods to study human and mouse immunoglobulin V-gene nucleotide sequences (JOHNSON and WU 1997A Down) and possible evolutionary differences between T-cell receptor for antigen and immunoglobulin V-genes (JOHNSON and WU 1997B Down). In addition, we have made use of the triangular tables that list the differences of pairwise comparisons.

In this study, triangular tables are constructed for human and mouse MHC class I a1 (exon 2) and a2 (exon 3) region gene segments. The combined tables of a1/a2 region nucleotide sequence differences are then used for detailed analysis.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The sequence database has been previously described (JOHNSON and WU 1997A Down). Briefly, the aligned sequences of human and mouse MHC class I a1 (exon 2) and a2 (exon 3) regions were obtained through the Kabat database WWW server (JOHNSON et al. 1996 Down; http://immuno.bme.nwu.edu), which has been maintained in our laboratory. This is the only database with aligned nucleotide and amino acid sequences of proteins of immunological interest.

Triangular tables were generated, listing the number of nucleotide differences between any two complete and distinct sequences of human or mouse MHC class I chains for a1 and a2 regions separately. As shown before (JOHNSON and WU 1997A Down), these tables were very large and difficult to print in a journal article. If interested, please write to the authors for copies. Combined triangular tables for a1/a2 regions, listing the difference of a1 region sequences above those of a2 region sequences, were then constructed for three examples: one for 12 human HLA-A sequences (Table 1), one for 15 human HLA-B sequences (Table 2), and one for 5 mouse H-2 sequences (Table 3).


 
View this table:
In this window
In a new window

 
Table 1. Triangular table listing pairwise differences for a1/a2 region sequences of 12 HLA-A molecules


 
View this table:
In this window
In a new window

 
Table 2. Triangular table listing pairwise differences for a1/a2 region sequences of 15 HLA-B molecules


 
View this table:
In this window
In a new window

 
Table 3. Triangular table listing pair-wise differences for a1/a2 region sequences of five H-2 D molecules


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Table 1 illustrates the nucleotide differences of some HLA-A sequences. A*6901 (HOLMES and PARHAM 1985 Down) has an a1 region sequence identical to those of A*6801 and A*6602, and differing by only one nucleotide from the A*6601 a1 sequence, but its a2 region sequence differs by 7, 16, and 16 nucleotides, respectively. On the other hand, A*6901 has an a2 region sequence identical to those of A*0206, A*0211, and A*0201, and differing by only one nucleotide from the A*0204 a2 sequence, but its a1 region sequence differs by 9, 11, 11, and 11 nucleotides, respectively. This finding is summarized as one item in Table 4, where A*6901 is connected to the first row of identical or nearly identical a1 region sequences and to the second row of identical or nearly identical a2 region sequences. This example was initially reported by HOLMES and PARHAM 1985 Down, with A*6901 being the result of recombination between A*6801 and A*0201. As illustrated in Table 1A*6901 and A*6801 differ by only 7 nucleotides, most of which are localized in a relatively short segment. HEMMI et al. 1988 Down and PARHAM et al. 1995 Down have subsequently proposed that gene conversion can be another mechanism for generating new MHC class I molecules.


 
View this table:
In this window
In a new window

 
Table 4. Listing of 2 HLA-A and 13 HLA-B sequences that share identical or nearly identical a1 sequences with one set of other molecules and a2 sequences with another set

A similar situation is observed for some of the HLA-B sequences (Table 2). B*4201 has an a1 region sequence identical to those of B*8101, B*0704, B*0705, and B7''', but its a2 sequence differs by 10, 7, 9, and 9 nucleotides, respectively. On the other hand, B*4201 has an a2 region sequence identical to that of B*4102 and differs from that of B*0801 by 1 nucleotide, but its a1 region sequence differs by 19 and 10 nucleotides, respectively. This is summarized in the third line of Table 4.

As illustrated in Table 4, there are 2 HLA-A and 13 HLA-B molecules with such shared sequence similarities. The circled ones, namely, A*6901, B*4001, and B*4802, have previously been investigated by HOLMES and PARHAM 1985 Down and HUGHES et al. 1993 Down. They have suggested only 1 pair of other sequences to generate each of the above 3 sequences. In fact, for all the 15 sequences listed in Table 4, there are a total of 83 possible pairs of other sequences that may be involved.

For mouse MHC class I sequences (Table 3), however, the nucleotide sequence differences among D-Bm24, D-Bm14 (HEMMI et al. 1988 Down), and Dg7 (GIRGIS et al. 1996 Down) can be explained by gene conversion (HEMMI et al. 1988 Down; PARHAM et al. 1995 Down). Their a2 region sequences are identical, and their a1 region sequences differ by two, seven, and nine nucleotides.


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

A*6901 was considered to be the result of intra-allelic reciprocal recombination between A*6801 and A*0201 by HOLMES and PARHAM 1985 Down. They pointed out that this can be a useful mechanism for generating novel MHC class I molecules. As shown in Table 1A*6901 and A*6801 are identical in their a1 regions and differ by seven nucleotides in their a2 regions, most of which are localized in a short segment. For this reason, these authors subsequently favored the idea of gene conversion as a more likely mechanism of generating such differences (PARHAM et al. 1995 Down). On the other hand, A*6901 can be derived from A*0201 and A*6602 or other pairs (Table 1 and Table 4), where the nucleotide differences are distributed over the entire regions of a1 and a2, rather than localized.

The other molecule, consisting of the a1 region of A*0201 and the a2 region of A*6801, expected as a result of reciprocal recombination, is, however, not found. For HLA-B molecules, the two examples found by HUGHES et al. 1993 Down have similar problems. Indeed, our simple method of using triangular tables to list the pairwise nucleotide differences of a1/a2 regions have uncovered many other examples, as illustrated in Table 4. To account for all of these, a simpler mechanism of gene assortment may, therefore, be needed. Alternatively, the same a1 or a2 region gene segment can be duplicated or multiplicated to be used in different MHC class I molecules. Indeed, gene fragments of human MHC class I molecules have previously been observed (GERAGHTY et al. 1992 Down).

If assortment of a1 and a2 region genes, that is, exons 2 and 3, respectively, occurs in human MHC class I molecules, the logical extension is that this genetic mechanism should be operative in mouse MHC class I molecules also. However, as we have noted, the only similar situation was found for three mouse sequences: D-Bm14, D-Bm24, and Dg7 (Table 3). Their differences are localized in the segment from codons 63 to 80 of the a1 region. As pointed out by HEMMI et al. 1988 Down and by PARHAM et al. 1995 Down, this can be the result of gene conversion.

In conclusion, on the basis of pairwise comparison of a1 (exon 2) and a2 (exon 3) region nucleotide sequences of human MHC class I molecules, in addition to reciprocal recombination and gene conversion, assortment of the entire a1 or a2 region can be a versatile mechanism for generating novel class I molecules in the human system. However, this genetic process does not seem to be important in the mouse system, although gene conversion may play a role.


*  ACKNOWLEDGMENTS

This research was supported in part by National Institutes of Health grant 5 R01 AI-25616-08.

Manuscript received August 5, 1997; Accepted for publication January 26, 1998.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

ARNETT, K. L. and P. PARHAM, 1995  HLA class I nucleotide sequences. Tissue Antigens 46:217-257[Medline].

GERAGHTY, D. E., B. H. KOLLER, J. A. HANSEN, and H. T. ORR, 1992  The HLA class I gene family includes at least six genes and twelve pseudogenes and gene fragments. J. Immunol. 149:1934-1946[Abstract].

GIRGIS, K. R., J. D. CAPRA, and I. STROYNOWSKI, 1996  Nucleotide sequences of H2g7 K and D loci of non-obese diabetic mice. Immunogenetics 41:386.

HEMMI, S., J. GELIEBTER, R. A. JEFF, R. W. MELVOLD, and S. G. NATHENSON, 1988  Three spontaneous H-2Db mutants are generated by genetic micro-recombination (gene conversion) events. Impact on the H-2-restricted immune responsivenesses. J. Exp. Med. 168:2319-2335[Abstract/Free Full Text].

HOLMES, N. and P. PARHAM, 1985  Exon shuffling in vivo can generate novel HLA class I molecules. EMBO J. 4:2849-2854[Medline].

HUGHES, A. L., M. K. HUGHES, and D. I. WATKINS, 1993  Contrasting roles in interallelic recombination at the HLA-A and HLA-B loci. Genetics 133:669-680[Abstract].

JOHNSON, G. and T. T. WU, 1997a  A method of estimating the numbers of human and mouse immunoglobulin V-genes. Genetics 145:777-786[Abstract].

JOHNSON, G. and T. T. WU, 1997b  Profile of numbers of sequence differences among V-genes coding for the variable regions of T cell receptor for antigen alpha and beta chains. J. Mol. Evol. 44:253-257[Medline].

JOHNSON, G., E. A. KABAT and T. T. WU, 1996 Kabat database of proteins of immunological interest, pp. 6.1–6.21 in Weir's Handbook of Experimental Immunology. I. Immunochemistry and Molecular Immunology, edited by L. A. HERZENBERG, W. M. WEIR, L. A. HERZENBERG and C. BLACKWELL. Blackwell Science, Cambridge, MA.

PARHAM, P., E. J. ADAMS, and K. L. ARNETT, 1995  The origin of HLA-A,B,C polymorphism. Immunol. Rev. 143:141-180[Medline].




This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
G. Johnson and T. T. Wu
Kabat Database and its applications: future directions
Nucleic Acids Res., January 1, 2001; 29(1): 205 - 206.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
G. Johnson and T. T. Wu
Kabat Database and its applications: 30 years after the first variability plot
Nucleic Acids Res., January 1, 2000; 28(1): 214 - 218.
[Abstract] [Full Text] [PDF]