Genetics, Vol. 152, 393-400, May 1999, Copyright © 1999

Evolution of HLA Class II Molecules: Allelic and Amino Acid Site Variability Across Populations

Hugh Salamon1,a, William Klitz2,a, Simon Eastealb, Xiaojiang Gao3,b, Henry A. Erlichc, Marcello Fernandez-Viñad, Elizabeth A. Trachtenbergc,e, Shannon K. McWeeneya, Mark P. Nelsona, and Glenys Thomsona
a Department of Integrative Biology, University of California, Berkeley, California 94720-3140,
b Human Genetics Group, John Curtin School of Medical Research, The Australian National University, Canberra, ACT 0200, Australia,
c Department of Human Genetics, Roche Molecular Systems, Alameda, California 94501,
d American Red Cross, National Histocompatibility Laboratory, Baltimore, Maryland 21201
e Children's Hospital Oakland Research Institute, Oakland, California 94609

Corresponding author: Glenys Thomson, Department of Integrative Biology, 3060 Valley Life Sciences Bldg., MC#3140, University of California, Berkeley, CA 94720-3140., glenys{at}allele5.biol.berkeley.edu (E-mail)

Communicating editor: A. G. CLARK


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Analysis of the highly polymorphic ß1 domains of the HLA class II molecules encoded by the DRB1, DQB1, and DPB1 loci reveals contrasting levels of diversity at the allele and amino acid site levels. Statistics of allele frequency distributions, based on Watterson's homozygosity statistic F, reveal distinct evolutionary patterns for these loci in ethnically diverse samples (26 populations for DQB1 and DRB1 and 14 for DPB1). When examined over all populations, the DQB1 locus allelic variation exhibits striking balanced polymorphism (P < 10-4), DRB1 shows some evidence of balancing selection (P < 0.06), and while there is overall very little evidence for selection of DPB1 allele frequencies, there is a trend in the direction of balancing selection (P < 0.08). In contrast, at the amino acid level all three loci show strong evidence of balancing selection at some sites. Averaged over polymorphic amino acid sites, DQB1 and DPB1 show similar deviation from neutrality expectations, and both exhibit more balanced polymorphic amino acid sites than DRB1. Across ethnic groups, polymorphisms at many codons show evidence for balancing selection, yet data consistent with directional selection were observed at other codons. Both antigen-binding pocket- and non-pocket-forming amino acid sites show overall deviation from neutrality for all three loci. Only in the case of DRB1 was there a significant difference between pocket- and non-pocket-forming amino acid sites. Our findings indicate that balancing selection at the MHC occurs at the level of polymorphic amino acid residues, and that in many cases this selection is consistent across populations.


THE function of human leukocyte antigen (HLA) class I and class II molecules is to collect peptide fragments inside the cell and transport them to the cell surface, where the peptide-HLA complex is surveyed by immune system T cells (see, e.g., MONACO 1993 Down). Three regions of HLA class II genes produce functional antigen-presenting heterodimers; these are labeled DR, DQ, and DP. Each class II heterodimer is made up of the noncovalent association of two glycopeptide chains: the {alpha} chain and the ß chain, encoded by, e.g., for DQ, the DQA1 and DQB1 loci, respectively. T cells recognize both MHC and foreign antigen presented together as a complex.

Specificity and affinity for peptides by the antigen recognition sites (ARS) are determined by contacts with the amino acid side chains of the bound peptide and thus are due to the amino acid sequences of both the bound peptide and the HLA allelic variant. Definable pockets in the ARS of the HLA molecules bind to the antigenic peptide side chains to create binding specificity.

The genetic variation of HLA, and the major histocompatibility complex (MHC) regions in other species' genomes, is especially striking in the amino acid residues of the ARS (see, e.g., PARHAM et al. 1989 Down; LAWLOR et al. 1990 Down; HEDRICK et al. 1991 Down). High heterozygosity at positions critical to antigen recognition suggests positive selection in favor of diversity. This is also indicated by analyses of HUGHES and NEI 1988 Down, HUGHES and NEI 1989 Down and TAKAHATA et al. 1992 Down, which show that in the ARS, nonsynonymous differences between alleles are proportionally more common than synonymous ones. In the rest of the molecule, there is an excess of synonymous changes, as is normally observed in protein coding sequences.

The neutral theory of evolution at a locus assumes that all alleles are selectively equivalent, that mutation is the source of new genetic variation, and that genetic drift results in loss of variation. EWENS 1972 Down developed a sampling theory that predicts the distribution of alleles at a locus drawn from a population at equilibrium under the infinite alleles model of neutrality. WATTERSON 1978 Down developed a method, using a homozygosity measure, denoted by F, and the observed number of alleles and sample size, to compare a sample of allele frequencies to neutrality expectations. For a given number of observed alleles, a low value of the homozygosity measure F, compared to the neutrality expectation, corresponds to all alleles having similar frequencies. Some form of balancing selection is implied when the observed allele frequencies are more even than the neutrality expectation (WATTERSON 1978 Down). A high value of F compared to the neutrality expectation corresponds to one allele predominating with the others at low frequencies. Directional selection is implicated in this case. Note that the fit of a sample of allele frequencies to neutrality expectations does not mean that selection is not operating, but simply that one cannot reject the neutral model.

Previous analyses of serologically defined class I (A, B, and C) and class II (DR) alleles and of polymerase chain reaction/sequence-specific oligonucleotide probe (PCR/SSOP) typed class II (DRB1 and DQB1) alleles show that the alleles at these loci are often more even in frequency than expected under neutrality, which implies some form of balancing selection (HEDRICK and THOMSON 1983 Down; KLITZ et al. 1986 Down, KLITZ et al. 1992 Down; BEGOVICH et al. 1992 Down). However, in many, but not all, populations studied to date, the distribution of DPB1 alleles does not differ significantly from neutrality expectations, despite the molecular similarity of the encoded proteins and proximity of the locus to DR and DQ genes (BEGOVICH et al. 1992 Down).

The availability of population data from a number of ethnic groups with PCR/SSOP molecular-defined variation at the HLA class II DRB1, DQB1, and DPB1 loci has allowed us to compare and contrast genetic variation among populations at both the allelic and amino acid levels. Using Watterson's homozygosity statistic, the allele frequency distributions were tested for departure from neutral expectations. We also investigated amino acid frequency distributions with respect to neutrality expectations and determined if the results corresponded with the findings for the allele level investigation. In addition, genetic variation at the amino acid level was examined for the different functional categories of polymorphic amino acid sites.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The Watterson homozygosity test of neutrality:
The homozygosity measure for the allele frequencies at a locus is denoted by FA:

(1)
where pi is the observed frequency of allelic type i, i = 1, 2, ... , k, in a sample of size n individuals, i.e., a total of 2n alleles (WATTERSON 1978 Down). Note that the Watterson homozygosity test of neutrality does not examine deviations from Hardy-Weinberg proportions, but uses the expected Hardy-Weinberg homozygosity as a measure of the distribution of allelic frequencies for a given sample size (2n) and observed number of alleles (k). Values of FA range from 0 to 1. Several confidence levels for the distribution of FA under neutrality are available for combinations of 2n and k up to 500 and 40, respectively (ANDERSON 1979 Down; also see EWENS 1979 Down). Individual values of FA are tested for significant deviation from neutrality by comparison to quantiles of the neutral distribution (matching for k and 2n) as calculated in ANDERSON 1979 Down. If the exact combinations of 2n and k in the HLA data are not observed in the table, values are linearly interpolated. In cases in which 2n > 500, values for 500 are used. This gives a conservative value when the alternative hypothesis is balancing selection, where the homozygosity observed is expected to be lower than the neutrality expectation for the given number of alleles.

To allow comparison of homozygosity F statistics across populations with different numbers of alleles and sample sizes, tabled values of the mean and variance of FA (ANDERSON 1979 Down) were used to calculate the normalized deviate, FndA;

(2)
for each sampled population. A negative value of FndA is in the direction of balancing selection, a value of zero is the neutrality expectation, and a positive value is in the direction of directional selection. The mean of FndA across m populations, A, was calculated

(3)
where FndA j is the normalized deviate for the jth population, j = 1, 2, ... , m. By the central limit theorem,

(4)

The estimated standard error of FndA is given by sA, where

(5)

The overall test of A is a two-tailed test using z = A .

Neutrality testing at the amino acid level is analogous to that for allelic variation. The homozygosity statistic for amino acid polymorphism is calculated as

(6)
where qi is the observed frequency of amino acid residue i, with a total of t amino acids, i = 1, 2, ... , t, at the site under consideration, in a sample of size n individuals, i.e., a total of 2n amino acid "alleles." The normal deviate for faa denoted Fndaa is calculated as for allelic variation (see Equation 2). The Fndaa can be averaged across m populations, or across r polymorphic sites, and the statistical properties and tests of the averages are as in Equation 3Equation 4Equation 5.

Population samples:
Allele and amino acid variation are examined for DRB1 and DQB1 ß1 domains in 23 human populations, including 5 Melanesian: Papua New Guinea Highlanders (from Goroka) and populations from Madang, Rabaul, New Caledonia, and Fiji (GAO et al. 1992A Down); 3 Polynesian populations, Rarotonga, Niue, and West Samoa (GAO et al. 1992B Down); 2 Micronesian populations, Nauru and Kiribati (GAO et al. 1992B Down); two different Indonesian populations, here designated Indonesia and Java (GAO et al. 1992B Down); a "Southern" Filipino population (BUGAWAN et al. 1994 Down); Caucasian populations, the Centre d'Etude du Polymorphisme Humain (CEPH; BEGOVICH et al. 1992 Down), Czech (CERNA et al. 1992 Down), and Norwegian (RONNINGEN et al. 1990 Down); 2 Chinese populations, southern Chinese from Gaunzou and northern Chinese from Beijing (both supplied by M. Fernandez-Viña); a Senegalese population (10th HLA workshop data); North Americans of African descent (FERNANDEZ-VINA et al. 1991 Down); a population sample from the United States of individuals of Mexican origin (ERLICH et al. 1993 Down); and 2 different native South American populations, the Cayapa in Ecuador (TRACHTENBERG et al. 1995 Down) and the Toba of the Gran Chaco in Argentina (CERNA et al. 1993 Down).

Variation in the DPB1-encoded ß1 domain is considered in the following 14 populations: Cayapa, CEPH, Czech, Indonesia, Japan (MITSUNAGA et al. 1992 Down), Mexican (U.S.), Nauru (supplied by S. Easteal), Northern China (Beijing), Norway, Papua New Guinea Highlanders from Goroka (supplied by S. Easteal), Southern China (Gaunzou), Senegal, Filipino, and Toba. DPB1 populations unreferenced in this list are from the same sources as in the DRB1 and DQB1 list. All of the populations were typed using PCR/SSOP molecular typing methods.

Amino acid sequences were obtained from the EMBL ftp server at ftp://FTP.EMBL-Heidelberg.DE/ (sequences as in MARSH and BODMER 1993 Down, MARSH and BODMER 1995 Down; BODMER et al. 1997 Down). Amino acid sites of the ß1 domains were considered: sites 6–90, inclusive for DQB1 and DRB1, and 8–90 for DPB1. All alleles studied were defined at the amino acid level, by molecular, not serological typing.

Pocket vs. nonpocket categories of amino acid sites:
Sites were classified on the basis of a DRB1 structure (BROWN et al. 1993 Down) and on the peptide contacts observed in DR1-influenza peptide complex (STERN et al. 1994 Down). All polymorphic sites that contact antigenic peptide side-chains were assigned to the pocket category. All other sites are considered to be in the nonpocket category.

The DR1 structure was used as a model for class II peptide binding. Sixteen amino acid sites in DRB1 and DQB1 were classified as participating in pockets: 9, 11, 13, 28, 47, 57, 61, 67, 70, 71, 74, 78, 85, 86, 89, and 90. The sites that appear homologous (in the sense of serial homology) to these in DPB1 are 9, 11, 13, 26, 45, 55, 59, 65, 68, 69, 72, 76, 83, 84, 87, and 88. The differences in site designations are due to a relative deletion in DPB1 as compared to DQB1 and DRB1. All other amino acid sites were classified as nonpocket.


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Allelic variation in DRB1, DQB1, and DPB1:
The homozygosity statistic FA and the normalized deviate FndA for the DRB1 locus show evidence of balancing selection (Table 1). Two populations (African-American and Norway) show individually significant negative (balancing selection) values for FndA. Only 5 of the 23 populations show positive FndA values, which is significant at P < 0.005 by a sign test from the neutrality 50% expectation of an equal number of positive and negative values. Thus more populations show deviation in the direction of balancing selection than expected by chance. For DRB1 alleles the mean of the normal deviate of the homozygosity statistic A is marginally significantly different from neutrality expectations (P < 0.06) in the direction of balancing selection. However, note that 2 independently sampled Indonesian populations, Indonesia and Java, respectively, show large positive values with Java significantly different from neutrality, which is indicative of directional selection.


 
View this table:
In this window
In a new window

 
Table 1. DRB1 allele homozygosity statistics in 23 human populations

DQB1 shows the strongest evidence for balancing selection of the three loci investigated (Table 2). Six of the 23 populations exhibit individually significant homozygosity values below neutral expectations. Only 3 out of the 23 populations exhibit positive FndA values, and the test of signs under neutrality gives P < 0.0002. The mean across populations of the normal deviates, A, is well below neutral expectations and highly statistically significant (P << 10-4). Despite the strong linkage disequilibrium between DRB1 and DQB1, the sample designated Indonesia exhibits a negative value for FndA at DQB1 (Table 2) and a positive one at DRB1 (Table 1). On the other hand, the Java sample does in fact exhibit one of the few positive FndA values observed at DQB1, as well as at DRB1.


 
View this table:
In this window
In a new window

 
Table 2. DQB1 allele homozygosity statistics in 23 human populations

For DPB1, the normal deviates of homozygosity, FndA, show a scatter of values spanning zero (the neutrality expectation) with five positive and nine negative (Table 3). No single population exhibits a value of homozygosity significantly different from neutrality. However, the mean value (across all populations) of the normal deviates of homozygosity, A, is negative and in the direction of balancing selection (P <= 0.08).


 
View this table:
In this window
In a new window

 
Table 3. DPB1 allele homozygosity statistics in 14 human populations

To summarize, at the level of allelic variation, DQB1 shows strong evidence of balancing selection. Balancing selection is implicated at DRB1, although the evidence is much less striking than with DQB1. The overall picture for DPB1 is in agreement with neutral theory predictions, although the mean across populations of the normal deviates shows a trend in the direction of balancing selection.

Variation at amino acid sites:
For each population, the frequency distributions of variants at every polymorphic amino acid in the ß1 domain encoded by DRB1, DQB1, and DPB1 were analyzed and the homozygosity statistics were calculated. The mean homozygosity Faa and its normal deviate aa were calculated including every polymorphic site across all populations sampled, and are presented in Figure 1, Figure 2, and Figure 3 for DRB1, DQB1, and DPB1, respectively. For all three loci many sites show strongly balanced polymorphism in the amino acids across human populations, which is indicated by negative values for aa in Figure 1 Figure 2 Figure 3; 17/30 amino acid sites in DRB1 (57%), 24/30 in DQB1 (80%), and 13/17 in DPB1 (76%) exhibit strongly balanced polymorphism. Significantly balanced polymorphism is observed both for amino acid sites classified as ARS and for sites that are not part of the ARS.



View larger version (27K):
In this window
In a new window
Download PPT slide
 
Figure 1. DRB1 amino acid site statistics calculated across 23 human populations. Each site that is observed to be polymorphic in at least one of the 23 populations is shown. The mean number of variants observed across Np populations polymorphic for each site, aa, is given as well as the mean homozygosity, aa, and the mean normal deviate of homozygosity aa. ({blacksquare}) Pocket and ({bullet}) nonpocket sites are distinguished.



View larger version (27K):
In this window
In a new window
Download PPT slide
 
Figure 2. DQB1 amino acid site statistics calculated across 23 human populations, as described in Figure 1.



View larger version (20K):
In this window
In a new window
Download PPT slide
 
Figure 3. DPB1 amino acid site statistics calculated across 14 human populations, as described in Figure 1.

Other sites show neutral or even significantly positive values, the latter indicating directional selection (e.g., the DQB1 amino acid 40 has a positive aa value of 0.081 from the 11 populations polymorphic at this residue). Positive aa values for individual sites are often found in cases where many populations do not exhibit polymorphism at those sites at all. This is not surprising as the sites that show aa values consistent with directional selection are the same ones at which low frequency variants are easily missed in sampling.

We also observe that many amino acid sites maintain levels of balanced polymorphism across populations. Two sites within a locus may both present to us homozygosity values significantly and consistently lower than neutrality expectations, with the homozygosity values consistently higher at one site than the other across human populations. This phenomenon of distinct levels of homozygosity being maintained at certain amino acid sites across human populations is observed in all three loci.

DRB1 exhibits the most polymorphism as measured by numbers of variants segregating at a single site. For example, DRB1 amino acid sites 11, 13, and 30 are observed with up to six amino acid variants in a single population sample. Note, however, that site 30 does not exhibit evidence for balanced polymorphism. Although DRB1 sites exhibit many variants (mean = 3.26), the variants at DPB1 (mean = 2.42) and DQB1 (mean = 2.51) sites tend to have more even frequencies. However, in contrast to this general trend, note that the most balanced polymorphisms, i.e., the lowest aa values at a single site, are observed in DRB1 (e.g., sites DRB1 12 and 67).

The mean value over amino acid sites of all mean normal deviates of homozygosity across populations in DQB1 is not statistically different from that observed for DPB1 (see Figure 2 and Figure 3). DPB1 and DQB1 exhibit significantly more balanced amino acid polymorphisms than does DRB1. The mean normal deviates plus or minus one standard error of the mean value for each locus are aa (DPB1) = -0.959 ± 0.059 (205 observed polymorphisms), (DQB1) = -0.924 ± 0.034 (641 polymorphisms), and (DRB1) = -0.642 ± 0.039 (635 polymorphisms). These values are all significantly below neutrality expectations.

Polymorphic sites were found in both categories of sites (pockets and nonpockets), which allowed statistics for each category to be examined. Overall, the contrast between pockets and nonpockets, with respect to the evenness of amino acid variant distributions, is significant only at DRB1 (Figure 4). At DPB1 and DQB1 the mean normal deviates of homozygosity over all sampled amino acid sites in the nonpockets category are indistinguishable from the levels observed for the sites in the pockets.



View larger version (7K):
In this window
In a new window
Download PPT slide
 
Figure 4. Normal deviates of homozygosity in pocket vs. nonpocket sites. All polymorphic antigenic side-chain-binding amino acid sites ({blacksquare}, pocket category) were used to calculate the mean normal deviates of homozygosity and two standard error of the mean error bars for each locus. Also shown is the same calculation for polymorphic nonside-chain-binding sites ({bullet}).


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Deviation from neutral expectations of the homozygosity F statistic is evident at the level of amino acid sites in class II HLA ß1 domains encoded by DRB1, DQB1, and DPB1; many polymorphic amino acid sites in class II HLA ß1 domains are more evenly distributed than expected under neutrality. This is in contrast to allele level variation in which DQB1 shows very strong evidence for balancing selection, DRB1 some evidence, and DPB1 very little evidence. In one population, the Java, there was significant evidence of directional selection at the allele level. This may indicate that selection is most intense at the amino acid level.

It appears that different evolutionary histories have shaped the three ß1 domains. Two possible explanations for the contrast between DRB1 nonpocket polymorphic sites and DPB1 and DQB1 nonpocket sites are: (1) the designations of site functions are incorrect for DQB1 and DPB1 despite the strong similarity in the sequences and molecules, which indicates that the functional use of sites has diverged in the different ß1 domains; and (2) the pocket and nonpocket categories of sites in DRB1 are uncoupled by intraexonic recombination and/or segmental exchange, while DQB1 and DPB1 categories of sites do not evolve independently in this fashion.

To test hypothesis 1, we used another test for selection, the ratio of nonsynonymous substitutions per nonsynonymous site (dn) to synonymous substitutions per synonymous site (ds) (HUGHES and NEI 1988 Down, HUGHES and NEI 1989 Down). Under neutral expectations, this ratio should equal 1. Values >1 are seen as evidence for overdominant selection. The ratio of dn/ds was examined for sites in DQB1 on the basis of both the current pocket/nonpocket designation, as well as sites grouped by their negative aa values. The ratio for DQB1 pocket sites was 1.5, while the negative aa sites had a ratio of 3.9. This indicates that the current designation for pocket/nonpocket may not be inclusive of all sites.

When we look at only DQB1 and DRB1 sequences, hypothesis 2 also appears plausible, as far fewer DQB1 than DRB1 distinct sequences have been observed, and DQB1 shows larger blocks of amino acid variants that segregate together than does DRB1. Recent work by JAKOBSEN et al. 1998 Down lends support for hypothesis 2. She used compatibility matrices to show that, although DRB1 and DPB1 both have high levels of recombination relative to DQB1, they behave very differently. DRB1 is essentially scrambled so that the pocket and nonpocket sites may be uncoupled. Although there is more recombination at DP than at DR, the allelic diversity at DP is almost entirely due to shuffling of a small number of motifs, and there are consequently relatively few variable sites despite the large number of alleles. The larger number of monomorphic sites at DPB1 than at DQB1 and DRB1 suggests that the persistence time of DPB1 alleles is much shorter than that of DRB1 and DQB1 alleles, due to weaker selection at DPB1 or frequent recombination. This may also provide an explanation for the difference between the allele and amino acid analyses of DP. If selection is acting at the amino acid or motif level it may have no discernible effect at the allele level because of the extent of shuffling of motifs.

The observations presented here can be interpreted as predicting that either the antigenic peptide-binding pockets in DPB1 and DQB1 employ different sets of sites than is the case for DRB1, or the evolution of individual amino acid sites in DRB1 is considerably more independent than it is within DPB1 and DQB1 second exons. Structural analysis of DQB1 and DPB1 molecules will allow these predictions from analysis of population genetic data to be examined.

A technical criticism that initially would suggest that we have not estimated our normal deviates in a conservative fashion is that amino acid sites are limited to 20 "allelic" states, at most. In fact, it is reasonable to assume that fewer amino acid variants could be tolerated even at the most polymorphic HLA amino acid sites. The objection is based on the fact that the neutrality model we used in calculating normal deviates of homozygosity assumed that infinite alleles existed; that is, the models to which we compared data assumed that each mutation produces a state not previously existing in a population. The infinite alleles model is not strictly appropriate for investigating amino acid site polymorphism. A preliminary examination of neutral models with finite allelic states demonstrates that, given reasonable mutation rates and population sizes for human data, the effect of finite alleles on the neutral model (decreasing the homozygosity statistic relative to that expected for the infinite alleles model) is negligible (SALAMON 1995 Down). However, estimating the neutrality parameter {theta} (4Neµ) from empirically derived mutation rates and population sizes is difficult, given the uncertainty in estimates of both parameters. Therefore, further simulations are required and are in progress to address the issue of the appropriate value of {theta} and consequently the robustness of the application of infinite alleles tests to finite allele data. At this stage, interpretations are descriptive at best; the Watterson test provides us with a useful reference point to compare allele and amino acid frequency distributions at the various loci and sites, and among populations.

A criticism of the means and variances of the homozygosity statistic is that although the population samples are statistically independent, they are not historically independent. This is most certainly true in a strict sense as all ethnic groups have a common ancestor several thousand generations ago, and in the case of many of those studied here, far fewer generations ago than that. One could sacrifice sample size to obtain a set of populations that are deemed "more independent" than the entire set analyzed here. However, it is not obvious that there exist sets of ethnic groups that are in some respect equivalent, such that one cluster can be compared fairly to another; nor is it obvious that time since divergence is a good measure of evolutionary independence. When we focus on DRB1 pockets, the normal deviates of homozygosity for Caucasians—Czech, CEPH, and Norwegian samples—are very similar, whereas the values for Melanesians—samples from Fiji, Madang, Papua New Guinea Highlands, and New Caledonia—are widely scattered (data not shown). Clearly Melanesians have diversified into a range of environments and relatively small isolated populations. At the amino acid level, caution must be taken in interpreting P values due to nonindependence of amino acid sites. A neutral site linked to a selected site may deviate from neutrality as a result of hitchhiking.

Despite difficulties with independence of genetic information, there exists strong evidence of balanced polymorphisms at particular sites in HLA class II amino acid sequences, a pattern that is found across diverse human populations. The reproducibility of homozygosity statistics lower than neutral expectation across human populations for some polymorphic sites in the ß1 domains of DPB1, DQB1, and DRB1 is clearly demonstrated here.


*  FOOTNOTES

1 Present address: Division of Infectious Diseases and Geographic Medicine, Stanford University, Stanford, CA 94305. Back
2 Present address: Department of Public Health, University of California, Berkeley, CA 94720. Back
3 Present address: Laboratory of Genomic Diversity, National Cancer Institute, Frederick, MD 21702. Back


*  ACKNOWLEDGMENTS

This research was supported by National Institutes of Health grant GM35326 (H.S., W.K., S.K.M., M.P.N., G.T.).

Manuscript received July 6, 1998; Accepted for publication February 18, 1999.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

ANDERSON, R., 1979 Some stochastic models in population genetics. M. Sc. Thesis, Monash University, Clayton, Victoria, Australia.

BEGOVICH, A. B., G. R. MCCLURE, V. C. SURAJ, R. C. HELMUTH, and N. FILDES et al., 1992  Polymorphism, recombination and linkage disequilibrium within the HLA class II region. J. Immunol. 148:249-258[Abstract].

BODMER, J., S. G. E. MARSH, E. D. ALBERT, W. F. BODMER, and R. E. BONTROP et al., 1997  Nomenclature for factors of the HLA system, 1996. Tissue Antigens 49:297-321[Medline].

BROWN, J. H., T. S. JARDETZKY, J. C. GORGA, L. J. STERN, and R. G. URBAN et al., 1993  Three-dimensional structure of the human class II histocompatibility antigen HLA-DR1. Nature 364:33-39. [see comments].[Medline].

BUGAWAN, T. L., J. D. CHANG, W. KLITZ, and H. A. ERLICH, 1994  PCR/oligonucleotide probe typing of HLA class II alleles in a Filipino population reveals an unusual distribution of HLA haplotypes. Am. J. Hum. Genet. 54:331-340[Medline].

CERNA, M., M. FERNANDEZ-VIÑA, E. IVASKOVA, and P. STASTNY, 1992  Comparison of HLA class II alleles in Gypsy and Czech populations by DNA typing with oligonucleotide probes. Tissue Antigens 39:111-116[Medline].

CERNA, M., M. FALCO, H. FRIEDMAN, E. RAIMONDI, and A. MACCAGNO et al., 1993  Differences in HLA class II alleles of isolated South American Indian populations from Brazil and Argentina [see comments]. Hum. Immunol. 37:213-220[Medline].

ERLICH, H. A., A. ZEIDLER, J. CHANG, S. SHAW, and L. J. RAFFEL et al., 1993  HLA class II alleles and susceptibility and resistance to insulin dependent diabetes mellitus in Mexican-American families. Nat. Genet. 3:358-364[Medline].

EWENS, W. J., 1972  The sampling theory of selectively neutral alleles. Theor. Popul. Biol. 3:87-112[Medline].

EWENS, W. J., 1979 Mathematical Population Genetics. Springer-Verlag, Berlin-New York.

FERNANDEZ-VIÑA, M. A., X. J. GAO, M. E. MORAES, J. R. MORAES, and I. SALATIEL et al., 1991  Alleles at four HLA class II loci determined by oligonucleotide hybridization and their associations in five ethnic groups. Immunogenetics 34:299-312[Medline].

GAO, X., K. BHATIA, R. J. TRENT, and S. W. SERJEANTSON, 1992a  HLA-DR,DQ nucleotide sequence polymorphisms in five Melanesian populations. Tissue Antigens 40:31-37[Medline].

GAO, X., P. ZIMMET, and S. W. SERJEANTSON, 1992b  HLA-DR,DQ sequence polymorphisms in Polynesians, Micronesians and Javanese. Hum. Immunol. 34:153-161[Medline].

HEDRICK, P. W. and G. THOMSON, 1983  Evidence for balancing selection at HLA. Genet. 104:449-456[Abstract/Free Full Text].

HEDRICK, P. W., W. KLITZ, W. P. ROBINSON, M. K. KUHNER and G. THOMSON, 1991 Population genetics of HLA, pp. 248–271 in Evolution at the Molecular Level, edited by R. K. SELANDER, A. G. CLARK and T. S. WHITTAM. Sinauer Associates, Sunderland, MA.

HUGHES, A. L. and M. NEI, 1988  Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection. Nature 335:167-170[Medline].

HUGHES, A. L. and M. NEI, 1989  Nucleotide substitution at major histocompatibility complex class II loci: evidence for overdominant selection. Proc. Natl. Acad. Sci USA 86:958-962[Abstract/Free Full Text].

JAKOBSEN, I. B., S. R. WILSON, and S. EASTEAL, 1998  Patterns of reticulate evolution for the classical class I and II HLA loci. Immunogenetics 48:312-323[Medline].

KLITZ, W., G. THOMSON, and M. P. BAUR, 1986  Contrasting evolutionary histories among HLA-region loci. Am. J. Hum. Genet. 39:340-349[Medline].

KLITZ, W., G. THOMSON, N. BOROT, and A. CAMBON-THOMSEN, 1992  Evolutionary and population perspective of the human HLA complex. Evol. Biol. 26:35-72.

LAWLOR, D. A., J. ZEMMOUR, P. D. ENNIS, and P. PARHAM, 1990  Evolution of class I MHC genes and proteins: from natural selection to thymic selection. Annu. Rev. Immunol. 8:23-63[Medline].

MARSH, S. G. and J. G. BODMER, 1993  HLA class II nucleotide sequences, 1992. Eur. J. Immunogenet. 20:47-79[Medline].

MARSH, S. G. and J. G. BODMER, 1995  HLA class II nucleotide sequences, 1995. Tissue Antigens. 45:258-280[Medline].

MITSUNAGA, S., S. KUWATA, K. TOKUNAGA, C. UCHIKAWA, and K. TAKAHASHI et al., 1992  Family study on HLA-DPB1 polymorphism: linkage analysis with HLA-DR/DQ and two "new" alleles. Hum. Immunol. 34:203-211[Medline].

MONACO, J. J., 1993  Structure and function of genes in the MHC class II region. Curr. Opin. Immunol. 5:17-20[Medline].

PARHAM, P., D. A. LAWLOR, C. E. LOMEN, and P. D. ENNIS, 1989  Diversity and diversification of HLA-A, B, C alleles. J. Immunol. 142:3937-3950[Abstract].

NNINGEN, K. S., A. SPURKLAND, G. MARKUSSEN, T. IWE, and F. VARDAL et al., 1990  Distribution of HLA class II alleles among Norwegian caucasians. Hum. Immunol. 29:275-281[Medline].

SALAMON, H., 1995 Pattern, process, function and disease in the evolution of protein sequences: a study of antigen presenting molecules in human populations. Ph.D. Thesis, University of California, Berkeley.

STERN, L. J., J. H. BROWN, T. S. JARDETZKY, J. C. GORGA, and R. G. URBAN et al., 1994  Crystal structure of the human class II MHC protein HLA-DR1 complexed with an influenza virus peptide. Nature 368:215-221[Medline].

TAKAHATA, N., T. SATTA, and J. KLEIN, 1992  Polymorphism and balancing selection at major histocompatibility complex loci. Genetics 130:925-938[Abstract].

TRACHTENBERG, E. A., H. A. ERLICH, O. RICKARDS, G. F. DESTAFANO, and W. KLITZ, 1995  HLA class II linkage disequilibrium and haplotype evolution in the Cayapa Indians of Ecuador. Am. J. Hum. Genet. 57:415-424[Medline].

WATTERSON, G. A., 1978  The homozygosity test of neutrality. Genetics 88:405-417[Abstract/Free Full Text].