Genetics, Vol. 158, 747-756, June 2001, Copyright © 2001

Contrasting Patterns of Polymorphisms at the ABO-Secretor Gene (FUT2) and Plasma {alpha}(1,3)Fucosyltransferase Gene (FUT6) in Human Populations

Yoshiro Kodaa, Hidenori Tachidab, Hao Panga, Yuhua Liua, Mikiko Soejimaa, Abbas A. Ghaderic, Osamu Takenakad, and Hiroshi Kimuraa
a Division of Human Genetics, Department of Forensic Medicine, Kurume University School of Medicine, Kurume 830-0011, Japan,
b Department of Biology, Faculty of Sciences, Kyushu University, Fukuoka 812-8581, Japan,
c Department of Immunology, Shiraz University of Medical Sciences, Shiraz, Iran
d Department of Biochemistry, Primate Research Institute, Kyoto University, Inuyama 484-0002, Japan

Corresponding author: Hiroshi Kimura, Department of Forensic Medicine, Kurume University School of Medicine, Kurume 830-0011, Japan., hkimura{at}med.kurume-u.ac.jp (E-mail)

Communicating editor: N. TAKAHATA


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The coding sequences (~1 kb) of FUT2 [ABO-Secretor type {alpha}(1,2)fucosyltransferase] and of FUT6 [plasma {alpha}(1,3)fucosyltransferase] were analyzed for allelic polymorphism by direct sequencing in five populations. The nucleotide diversities of FUT2 estimated from pairwise sequence differences were 0.0045, 0.0042, 0.0042, 0.0009, and 0.0008 in Africans, European-Africans, Iranians, Chinese, and Japanese, respectively. The nucleotide diversities of FUT6 were 0.0024, 0.0016, 0.0015, 0.0017, and 0.0020 in Africans, European-Africans, Iranians, Chinese, and Japanese, respectively. At FUT2, excesses in pairwise sequence differences compared to the number of polymorphic sites as indicated by a significantly positive Tajima's D were observed in European-Africans and in Iranians. The data do not fit expectations of the equilibrium neutral model with an infinite number of sites. On the other hand, Tajima's D's at FUT6 in each of the five populations and at FUT2 in Africans, Chinese, and Japanese were not significantly different from zero. FST between the Asians and the others measured at FUT2 was higher than at FUT6. These results suggest that natural selection was responsible for the generation of the FUT2 polymorphism in European-Africans and in Iranians.


THE human {alpha}(1,2)- and {alpha}(1,3)-fucosyltransferase genes constitute a multigene family, and seven genes encoding human fucosyltransferases (FUT1FUT7) have been isolated (KUKOWSKA-LATALLO et al. 1990 Down; LARSEN et al. 1990 Down; LOWE et al. 1991 Down; WESTON et al. 1992A Down, WESTON et al. 1992B Down; SASAKI et al. 1994 Down; KELLY et al. 1995 Down). The {alpha}(1,2)fucosyltransferase plays a key role for tissue expression of the H antigen—a common precursor for the blood group A and B antigens. Two human {alpha}(1,2)fucosyltransferase genes (FUT1 and FUT2) have been isolated (LARSEN et al. 1990 Down; KELLY et al. 1995 Down; ROUQUIER et al. 1995 Down). FUT1 encodes H type {alpha}(1,2)fucosyltransferase (H enzyme) that regulates the expression of the H antigen and thereafter A and/or B antigens in the erythroid lineage and in the vascular endothelia. FUT2 encodes Secretor type {alpha}(1,2)fucosyltransferase (Se enzyme) that regulates the expression of these antigens in the secretory glands and in the digestive mucosa. About 20% of individuals are nonsecretors who fail to express the ABO antigen in saliva, because they are homozygous for the null allele of FUT2.

Recent studies have indicated that the FUT2 polymorphism showed an ethnic group-specific pattern (KELLY et al. 1995 Down; KODA et al. 1996 Down; LIU et al. 1998 Down). A common null allele of FUT2 in East Asians has a missense mutation in codon 129 (Ile-Phe; se385, B6 and B7 alleles in this study), whereas a common null allele of FUT2 in Europeans and Africans has a nonsense mutation in codon 143 (Trp-ter; se428, B3 and B4 alleles in this study). The enzyme encoded by se385 shows weak activity (2–5% activity of the wild-type enzyme), while the enzyme encoded by se428 shows no activity (KELLY et al. 1995 Down; KODA et al. 1996 Down). As compared to the wild-type allele (Se, A1 allele in this study), seven base differences were observed within the 1-kb DNA fragment encompassing the protein coding region in the se428 allele (see Fig 1). On the other hand, only two base changes from the wild-type allele were observed in the se385 allele.



View larger version (39K):
In this window
In a new window
Download PPT slide
 
Figure 1. Polymorphic base positions at 17 FUT2 alleles and at 15 FUT6 alleles. Functional alleles were indicated as A and nonfunctional alleles as B in both FUT2 and FUT6. C1–C5 alleles were found in 15 chimpanzee individuals and all chimpanzee alleles were functional. Vertical labels on the top indicate the positions of nucleotides. An asterisk indicates the sequence matched the reference allele (A1) at that position. A dash indicates a single base deletion at that position at FUT2. The 499i in FUT6 means a single base insertion at that position. Sites are also classified depending on whether the change is replacement (R), synonymous (S), or in the noncoding region (N). The B10 allele, not described here, is the fusion gene allele found in the Japanese population.

FUT3, FUT5, and FUT6 encode {alpha}(1,3)fucosyltransferases and are organized in a cluster, within a 40-kb region, in the short arm of chromosome 19 (19p13.3; MCCURLEY et al. 1995 Down). These three genes maintain intronless coding segments with nucleic acid sequences that share ~90% identity to each other. FUT3 encodes the Lewis {alpha}(1,3/4)fucosyltransferase that synthesizes the Lewis a and Lewis b epitopes, whereas the tissue expression of the FUT5-encoding enzyme is not yet known. FUT6 encodes the plasma {alpha}(1,3)fucosyltransferase and is known to be polymorphic (MOLLICONE et al. 1994 Down; LARSON et al. 1996 Down).

In this study, we analyzed the FUT2 allelic variation by direct DNA sequencing in five populations: Africans (Xhosa), European-Africans of South Africa, Iranians, Chinese, and Japanese. We found that the nucleotide variation at FUT2 did not fit the expectations of the equilibrium neutral model in European-Africans and in Iranians. The allelic variation of FUT6 was also examined for comparison. We found contrasting patterns of variations at these two loci, suggesting the action of selection at FUT2.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Population samples:
Individuals from five populations were sampled: Africans (Xhosa) from Cape Town, European-Africans from Cape Town, Iranians from Shiraz, Chinese from Guangzhou, and Japanese from Fukuoka (KODA et al. 1996 Down; LIU et al. 1998 Down, LIU et al. 1999 Down; PANG et al. 1999 Down). In addition, 15 chimpanzees were sampled. Chimpanzee genomic DNA was obtained from one of the authors (O.T.).

PCR amplification and sequencing of FUT2 and FUT6:
Genomic DNA was prepared from peripheral blood leukocytes by an organic solvent method. The FUT2 and FUT6 genes were amplified from genomic DNA by PCR for sequencing. FUT2 (16- to 999-bp coding region and 1000- to 1029-bp noncoding region) was amplified and sequenced as described previously (KODA et al. 1996 Down; LIU et al. 1998 Down). A DNA fragment (1.2 kb) including the full coding sequence (1080 bp) of FUT6 was amplified and sequenced as described previously (PANG et al. 1999 Down). DNA sequence was determined by the dideoxynucleotide chain termination method using an ABI PRISM Big Dye Terminator cycle sequencing ready reaction kit and an ABI PRISM 310 genetic analyzer (Perkin-Elmer-Japan ABI). Numbers of individuals sequenced in this study in respective populations are as follows: n = 30 for FUT2 in Africans; n = 36 for FUT2 in European-Africans; n = 53 for FUT2 and n = 52 for FUT6 in Iranians; n = 30 for FUT2 and FUT6 in Chinese; and n = 30 for FUT2 in Japanese populations. The FUT6 sequences of Africans (n = 56), European-Africans (n = 52), and Japanese (n = 53) from our previous study (PANG et al. 1999 Down) were added for data analysis. Individuals examined for FUT6 include all individuals examined for FUT2. In addition, the FUT2 region was sequenced in 15 chimpanzee individuals.

Haplotype determination:
For haplotyping of the alleles, PCR products from selective individuals heterozygous at more than one variant site in FUT2 (n = 17) or FUT6 (n = 21) were cloned into plasmid pGEM using the pGEM-T vector system I (Promega, Madison, WI). At least six clones from each subject were analyzed by sequencing to determine the haplotypes (alleles). Probable alleles in other individuals heterozygous at more than one variant site were estimated by the results of linkage of the base substitutions in all individuals homozygous at all sites and heterozygous at one site, and of haplotyping of the alleles in selective individuals as mentioned above.

Statistical analysis:
To measure diversity within a population, the number of segregating sites S for sample size n and the average pairwise sequence difference k were calculated. Under the equilibrium neutral model, these values and k provide alternative estimators of {theta} = 4Neµ, where Ne is the effective population size and µ is the mutation rate per sequence per generation (WATTERSON 1975 Down; NEI and TAJIMA 1981 Down). From the two estimates of {theta}, a test statistic D proposed by TAJIMA 1989 Down was computed. In addition, with the number of singleton sites {eta}s, test statistics D* and F* proposed by FU and LI 1993 Down were computed. Nucleotide diversity {pi} was estimated by dividing the average pairwise sequence difference by the numbers of sites. FST between populations was estimated from the sequence data using the method of HUDSON et al. 1992 Down. All analyses were carried out using DNASP3.1.4 (ROZAS and ROZAS 1999 Down).


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Sequence variation of FUT2 and FUT6:
In previous studies, we sequenced only selective individuals to determine the null alleles of FUT2 and frequencies of identified polymorphic sites were determined by PCR-restriction fragment length polymorphism or allele-specific PCR analysis in Africans, European-Africans, Chinese, and Japanese (KODA et al. 1996 Down; LIU et al. 1998 Down, LIU et al. 1999 Down). To analyze DNA sequence variation of FUT2 in detail, we sequenced the coding region of FUT2 for 355 chromosomes from 179 randomly selected individuals drawn from the five populations and 15 chimpanzees in this study (we deleted 3 chromosomes for calculation of nucleotide diversity because of the presence of three fusion genes in FUT2; KODA et al. 1996 Down). Concerning FUT2 sequence analysis, we include the results of one Japanese, seven Africans, and one European-African (none of Chinese) from previous studies (KODA et al. 1996 Down; LIU et al. 1998 Down). We also sequenced the coding region of FUT6 for 164 chromosomes from 82 individuals from the Iranian and Chinese populations. In addition, the FUT6 sequences of Africans, European-Africans, and Japanese obtained from PANG et al. 1999 Down were reanalyzed.

We found 21 polymorphic sites at FUT2 (1014 bp) and 18 polymorphic sites at FUT6 (1080 bp; Fig 1). At these polymorphic sites, 1 site was a biallelic 1-base deletion and 20 sites were single nucleotide polymorphism (10 replacement, 8 synonymous, and 2 noncoding changes) at FUT2, and 1 site was a biallelic 1-base insertion and 17 sites were single nucleotide polymorphism (9 replacement and 8 synonymous changes) at FUT6. Three of the 21 sites (162 G to A, 278 C to T, and 543 G to A) at FUT2 and none of the 18 sites at FUT6 were newly identified in this study. Six of the 21 sites at FUT2 had only a single copy of the rarer nucleotide (singletons), whereas none of the 18 sites was a singleton at FUT6.

Variations in FUT2 and FUT6 showed different patterns and were distributed unevenly among the five populations. Only 1 of the 21 sites of FUT2 and 7 of the 18 sites of FUT6 showed biallelic segregating variation in all five populations. The African population was found to have the largest numbers of population-specific polymorphic sites both at FUT2 (5 sites) and FUT6 (8 sites). The European-African, Iranian, and Japanese populations each had 1 population-specific polymorphic site at FUT2, while the Chinese population had 3 population-specific polymorphic sites at FUT2. Two population-specific polymorphic sites were found in European-Africans at FUT6, whereas no population-specific polymorphic site was observed in the Japanese, Chinese, and Iranian populations at FUT6.

From the results of direct sequencing and haplotyping of some alleles, we estimated the probable alleles and their frequencies. We found 18 alleles (including a fusion gene; KODA et al. 1996 Down) at FUT2 and 15 alleles at FUT6. Functional and null alleles are designated as A's and B's, respectively. We also found 5 alleles at chimpanzee FUT2 and these alleles are designated as C's. Fig 1 shows the allelic variations found at FUT2 and FUT6 in the five populations and chimpanzees. We found 9 alleles in the Africans, 6 in European-Africans, 5 in Iranians, 6 in Chinese, and 5 alleles in Japanese at FUT2, whereas we found 13 alleles in Africans, 8 in European-Africans, 7 in Iranians, 6 in Chinese, and 5 alleles in Japanese at FUT6. Fig 2 summarizes the allele frequencies in the five populations. To make phylogenetic analyses of FUT2 and FUT6, neighbor-joining trees were drawn using orangutan sequences (AB015636 at FUT2 and AB051859 at FUT6) as outgroups (Fig 3A and Fig B). Human alleles are monophyletic at both loci, although the origination of the FUT2 B3/B4 alleles seemed to be very old (KODA et al. 2000 Down). The results also suggested that nonfunctional alleles (B alleles) have multiple origins at both loci.



View larger version (29K):
In this window
In a new window
Download PPT slide
 
Figure 2. Histogram representations of 18 FUT2 and 15 FUT6 allele frequencies for the five populations.



View larger version (10K):
In this window
In a new window
Download PPT slide
 
Figure 3. The neighbor-joining trees constructed from the human FUT2 alleles and primate FUT2 homologues (A) and from the human FUT6 alleles and primate FUT6 homologues (B). We exclude the A8 allele of FUT2 from the tree because A8 seemed to be generated by crossing over between the A5 (or A6) and the B3 alleles.

There were apparently different allelic distributions in the five populations. At FUT2, A1–A8 are functional alleles (Se), and B1–B10 are null alleles (se). The B10 allele of FUT2 is a fusion gene, which was found in the Japanese population (KODA et al. 1996 Down; LIU et al. 1999 Down). Although we cannot rule out the possibility that the A8 allele is simply intermediate to connect the A5 and B3 alleles possibly generated by point mutations, a more parsimonious interpretation is that A8 was generated by crossing over between the A5 (or A6) and the B3 alleles. A8 was found only in the European population where the A5 and B3 frequencies were high, suggesting that recombination had occurred in this population. The crossover region is within a 310-bp stretch corresponding to the region between nucleotides 429 and 738 of FUT2. Consistent with the results from previous DNA sequencing of a few se428 alleles (KELLY et al. 1995 Down; LIU et al. 1998 Down), all 177 se428 alleles sequenced in this study had at least seven base substitutions within the 1.1-kb DNA fragment as compared to the A1 allele, and the sequence variation within the se428 alleles was very low (only one allele, B4, showed one additional polymorphic site at position 543). In addition, we found a novel allele (B5) with a missense mutation at position 278 in Iranians that may result in an amino acid change from alanine to valine. Determination of enzyme activity from a transient expression study indicated that this allele product has 4% enzyme activity as compared with the wild-type allele products (not shown). Accordingly, we classified this allele into an Se enzyme-deficient allele. The frequencies of functional alleles of FUT2 alleles were evenly distributed in the African population, whereas A5 (Se357) was a common functional allele in European-African and Iranian populations (Fig 2). A1 and A5 were common in the two Asian populations.

At FUT6, A1–A11 are functional alleles and B1–B4 are nonfunctional alleles. As for FUT2 functional alleles, all FUT6 alleles were much more evenly distributed in the African population, and almost half of the alleles found in the Africans were absent in non-African populations (Fig 2). The null allele B2 was detected in the European and Asian populations but not in the Africans. A9 and A11 constituted the two main alleles in the Asian populations, whereas A11 was a common allele in the European and Iranian populations. The genotype frequencies with regard to the functional and null alleles in each population do not deviate significantly from the Hardy-Weinberg expectation (data not shown).

Statistical analysis:
The average number of pairwise nucleotide differences (k), nucleotide diversity ({pi}), {theta}, Tajima's D, number of singleton sites ({eta}s), and Fu and Li's F* and D* were estimated (Table 1 and Table 2). Nucleotide diversity in the coding region of FUT2 ({pi} = 0.0033 ± 0.00007, 985 bp) and that of FUT6 ({pi} = 0.0020 ± 0.00002, 1080 bp) was much higher than in the lipoprotein lipase (LPL) coding region ({pi} = 0.0005, 998 bp; NICKERSON et al. 1998 Down) and other coding regions of 49 loci encompassing 54,193 bp (the {pi} values ranged from a maximum of 0.0011 for fourfold degenerate sites to a minimum of 0.0003 for nondegenerate sites; LI and SADLER 1991 Down). Nucleotide diversity at FUT2 but not at FUT6 was also higher than in the 2.67-kb region of the ß-globin gene ({pi} = 0.0018; HARDING et al. 1997 Down), the 9.7-kb region of the LPL gene ({pi} = 0.002), and the four degenerate sites in melanocortin 1 receptor locus ({pi} = 0.0021; CLARK et al. 1998 Down; NICKERSON et al. 1998 Down; RANA et al. 1999 Down). FUT2 has two polymorphic sites at 10 and 12 bp downstream of the termination codon in addition to those in the coding region. If we add these sites, the diversity of the 1014-bp region at FUT2 is increased ({pi} = 0.0041 ± 0.00007). Nucleotide diversity of replacement sites was 0.0018 and 0.0016 in FUT2 and FUT6, respectively, whereas that of synonymous and noncoding sites was 0.0105 and 0.0035 in FUT2 and FUT6, respectively. In addition, nucleotide diversity within the functional FUT2 alleles was 0.0009 (replacement sites, 0.0003). Thus, high nucleotide diversity is due to large silent-site variation and does not seem to be due to less constraint on replacement changes at the FUT2 locus.


 
View this table:
In this window
In a new window

 
Table 1. Population genetic statistics for the FUT2 variation


 
View this table:
In this window
In a new window

 
Table 2. Population genetic statistics for the FUT6 variation

Tajima's D values at FUT2 were estimated to be 2.86 (P < 0.01) and 3.17 (P < 0.01) in European-Africans and Iranians, respectively, whereas the D values in the other three populations at FUT2 were estimated as -0.35 to 1.52 (Table 1). Tajima's D values at FUT6 were estimated as -0.27 to 1.02 in the five populations (Table 2). We cannot reject the null hypothesis of neutrality at FUT6 in any of the five populations and at FUT2 in the African, Chinese, and Japanese populations. However, the polymorphism at FUT2 is difficult to explain by the equilibrium neutral model with constant effective population size in the European-Africans and Iranians. Fu and Li's test statistics at the FUT2 locus were estimated with or without using chimpanzee sequences (Fig 1) as outgroup. The values with or without outgroup sequences did not differ significantly. Table 1 shows the results obtained from the method with outgroup sequences. Fu and Li's F* also showed a similar pattern. F* was significantly positive in Iranians and European-Africans.

To quantify population differentiation, we computed FST in all pairs of populations (Table 3). First, note that FST's between Iranians and Europeans, and between Chinese and Japanese, were very small at both loci. Both pairs of populations are considered to be not genetically differentiated. The estimate of FST between Africans and Europeans based on the data at FUT6 was 0.081 and comparable to that (0.065 ± 0.014) estimated in the LPL gene region (CLARK et al. 1998 Down). FST values between Africans and non-Africans were not significantly different at the two loci (0.069 at FUT2 and 0.072 at FUT6). However, FST estimated at FUT2 showed a peculiar pattern. First, FST's between any pairs of African, European, and Iranian populations were small. Second, FST between the African/European/Iranian and Asian populations was large (>0.42). The FST analysis shows that the differentiation pattern is very different at the two loci.


 
View this table:
In this window
In a new window

 
Table 3. FST's in all pairs of populations estimated from sequence diversity at FUT2 and FUT6

Chimpanzee FUT2 homologue variation:
Since the sequence variation of human FUT2 was out of the scope of the standard mutation-selection balance model or the equilibrium neutral model with random mating, we examined the sequence variation of the chimpanzee FUT2 homologue. We found 7 polymorphic sites (2 replacement and 5 synonymous single nucleotide changes) and 5 alleles in the chimpanzee FUT2 homologue (Fig 1). In contrast to human, we did not find a null allele in the chimpanzee FUT2 homologue by transient expression study (KODA et al. 1996 Down). The average number of pairwise nucleotide differences (k), nucleotide diversity ({pi}), {theta}, Tajima's D, number of singleton sites ({eta}S), and Fu and Li's F* and D* of the chimpanzee FUT2 homologues were also estimated (Table 1). We cannot reject the null hypothesis of neutrality at the FUT2 homologue in the chimpanzee population. We can test the levels of divergence and polymorphism at silent vs. replacement sites by the MCDONALD and KREITMAN 1991 Down test. Within respective species 12 sites were silent and 13 were replacement polymorphisms in total, whereas the human/chimpanzee comparison revealed 2 silent and 1 replacement fixed differences. This test did not yield a significant result (for Fisher's exact test, P = 1.0). We also performed the McDonald and Kreitman test at the FUT6 locus using a chimpanzee sequence (DDBJ/EMBL/GenBank accession no. Y14035). Within each species 8 sites were silent and 10 were replacement polymorphisms in total, whereas the human/chimpanzee comparison revealed 6 silent and 12 replacement fixed differences. As at the FUT2 locus, the McDonald and Kreitman test did not yield a significant result (for Fisher's exact test, P = 0.73).


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

In this study, we examined DNA polymorphisms at the FUT2 and FUT6 loci in five human populations. We found contrasting patterns of DNA polymorphisms at the two loci. First, Tajima's D and Fu and Li's F* were significantly positive in European-African and Iranian populations but not so in the other populations at FUT2. These statistics are not significant in any populations at FUT6. Second, there was strong differentiation between the Asian (Chinese and Japanese) and non-Asian (African, European-African, and Iranian) populations at FUT2 as indicated by large FST's between these two groups of populations but FST values at FUT6 were between 0.1 and 0.16. The FST values at FUT6 are close to those estimated at 84 protein and 33 blood group loci (NEI and LIVSHITS 1990 Down). The pattern observed at FUT6 is similar to those at LPL (CLARK et al. 1998 Down; NICKERSON et al. 1998 Down), ß-globin (HARDING et al. 1997 Down), and dystrophin (ZIETKIEWICZ et al. 1998 Down) genes. Thus, the pattern at FUT6 seems to reflect the past demographic history of human populations. However, the pattern at FUT2 was very different from those at other loci, suggesting action of some locus-specific factors.

The contrasting pattern at FUT2 results mainly from high frequencies of two null alleles, B3 (se428) in the non-Asian populations and B6 (se385) in the Asian populations. The B3 allele diverged greatly from the functional alleles (A alleles). Indeed, the divergence time between B3 and the functional alleles was estimated to be about 3 million years ago (95% confidence limits were 1.25 and 4.72 million years ago assuming the divergence time of the human and chimpanzee genes to be 5 million years ago; KODA et al. 2000 Down). The ancient origin of B3 was difficult to explain by the neutrality of the null allele and we suggest it to be due to balancing selection. This ancient origin and low variability both within functional alleles and within null alleles result in significant positive D values in the European-African and Iranian populations. The frequency of B3 is also high in the African population, leading to a positive D, but D was not significant because of a modest amount of variability among the functional alleles. Since Tajima's D (and all other neutrality statistics) is not very sensitive to detect selection (SIMONSEN et al. 1995 Down), the irregular pattern found in the European-African and Iranian populations needs some explanation. One simple explanation is balancing selection acting on functional alleles and the B3 allele as suggested previously (KODA et al. 2000 Down). However, we also need to explain large FST's between the Asian and non-Asian populations.

Large FST's between the Asian and non-Asian populations were caused by high frequencies of the B6 allele in the Asian populations. B6 most likely originated from the functional allele, A5 (Se357), that is separated from B6 by only one nucleotide change. Because of the small number of nucleotide changes, Tajima's D is significantly positive in contrast to D values in the European-African and Iranian populations. Balancing selection makes Tajima's D significantly positive if it persisted long (SIMONSEN et al. 1995 Down). Since B6 exists only in the Asian populations, the increase of its frequency occurred after the separation of the Asian and non-Asian populations, the time of which is thought to be less than 100,000 years ago (NEI and LIVSHITS 1990 Down). In fact, a recent study based on 53 complete mtDNA sequences suggests this time to be less than ca. 50,000 years ago (INGMAN et al. 2000 Down). Such a large differentiation between races is also found at other loci. HARRIS and HEY 1999 Down reported that FST between African and non-African populations is 0.617 at the X-linked locus coding for PDHA1. Also, RANA et al. 1999 Down found an allele whose frequency is 0.66 in a Chinese population but that is absent in non-Asian populations at the locus coding for melanocortin 1 receptor. The authors of both articles suggested the action of selection on these two loci. Our case is similar to that of RANA et al. 1999 Down and we can apply their method of inferring selection utilizing the formula of LI 1975 Down. The frequency of B6 is 0.55 in the Chinese population. It takes 140,000 years on average for a neutral allele to reach that frequency if we assume that effective population size is 5000, initial frequency is 1/2N, and one generation is 20 years as were assumed by RANA et al. 1999 Down. Although the time depends on the presumed parameter values such as effective population size and the initial frequency, the action of selection is suspected for the increase of B6 in the Asian populations.

To summarize, the contrasting pattern at FUT2 is due mainly to the long persistence of B3 in the non-Asian populations and a rapid increase of B6 in the Asian populations. Both observations are difficult to explain by the simple neutral model and suggest the action of selection on this locus.

One possible selection model that explains both observations is a two-class overdominance model for the functional and null alleles. Assume that there are two classes of alleles Ai and Bi and that heterozygotes of alleles belonging to different classes (AiBi) have higher fitness than homozygotes and heterozygotes of alleles belonging to the same class. In this system, only two alleles Ai and Bi, each from the two classes, are maintained in a population at a time. Occasionally a new allele rapidly invades the population, replacing the old allele of the same class. In the case of the FUT2 locus, the functional and null alleles can be regarded as Ai and Bi, respectively. Under this scenario, B3 was maintained for a long time because of overdominance and B6 rapidly replaced B3 after the separation of the Asian and non-Asian populations.

To be more concrete and investigate turnovers, consider a simple model with only three alleles, A representing all A's, B3, and B6. Assume that an AB heterozygote has fitness of 1 and a B3B6 heterozygote has fitness of 1 - (t3 + t6)/2. Also assume that fitnesses of homozygotes AA, B3B3, and B6B6 are 1 - s, 1 - t3, and 1 - t6, respectively. Let the frequencies of A, B3, and B6 be 1 - y, y(1 - x), and yx, respectively. If we assume that selection is strong, then the frequency, x, of B6 in the null alleles satisfies the following differential equation:

(1)

Thus, if B6B6 has higher fitness than B3B3 (t3 > t6), the B6 allele rapidly increases its frequency as a semidominant advantageous mutation with a selection coefficient y(t3 - t6). The turnover rate of B's depends on the selective advantage, y (frequency of null alleles), and the mutation rate. This simple analysis shows that the long persistence of B3 and rapid increase of B6 can be explained by the two-class overdominance model if we assume that the mutation rate to new B alleles with higher homozygote fitness is low. One drawback of this explanation is that it cannot explain the low diversity within se428 alleles including B3 unless some modification such as time-dependent change of selection coefficients is incorporated.

Alternatively, low nucleotide diversities within se428 alleles (B3 and B4, {pi} = 0.00002) and within se385 alleles (B6 and B7, {pi} = 0.00003) suggest that frequencies of the se428 alleles and the se385 alleles might have increased relatively recently. Thus we might speculate that the B6 allele is on its way to fixation in the Asian populations and the B3 allele is on its way to fixation in the non-Asian populations by diversifying selection. The similarity of the null allele frequencies among the populations could be explained if we assume that the putative selective agent(s) had started to work almost simultaneously in the populations after the separation of Asian populations from the other populations, although the long persistence of B3 in the non-Asian populations is difficult to explain only by this model.

If polymorphism of FUT2 is due to two-class overdominance for the functional and null alleles or diversifying selection of the null allele, what would be the selective advantage for heterozygotes of the functional and null alleles or for homozygotes of null alleles? This question needs attention because the null alleles are usually considered deleterious or neutral. One hint comes from studies on the {alpha}(1,3)galactosyltransferase gene. The {alpha}(1,3)galactosyltransferase gene had changed to a pseudogene in the lineage of Old World monkeys and hominoids after the separation from the lineage of New World monkeys (GALILI and SWANSON 1991 Down). Old World monkeys and hominoids have a natural antibody against the Gal{alpha}1-3Gal epitope. GALILI and ANDREWS 1995 Down speculated that an infectious microbial agent containing the Gal{alpha}1-3Gal epitope was endemic to the Old World after the speciation of Old World monkeys and New World monkeys, and individuals with the {alpha}(1,3)galactosyltransferase pseudogene had higher fitness than those with the {alpha}(1,3)galactosyltransferase functional gene. Thus, a null allele can have a selective advantage over functional alleles.

In the case of the FUT2 locus, secretors who have at least one functional allele can express ABO(H) and Lewis b antigens in their secretions and digestive and respiratory mucosa, while nonsecretors with two null alleles do not. A recent study suggested that the Lewis b antigen was a possible receptor for Helicobacter pylori and that this microorganism was not bound to gastric mucosa lacking Lewis b expression (BOREN et al. 1993 Down). Most microorganisms may invade through the respiratory and digestive mucosa via ABO(H) and related carbohydrate structures (RAMPHAL et al. 1991 Down; PLOTKOWSKI et al. 1993 Down; KAUFFMANN et al. 1996 Down). Accordingly, the absence of some carbohydrate structures in the respiratory and digestive mucosa may be advantageous for protecting mammals from microorganism infections and secretor status may play a role in susceptibility to some microorganisms (RAZA et al. 1991 Down). Although these arguments suggest just a possible advantage of the null allele over functional alleles, we can build various models such as marginal overdominance to make the two-allele model overdominant.

The two-class overdominance model and recent diversifying selection model are just two of the possible explanations for the contrasting pattern observed at FUT2. We need to investigate the physiological and epidemiological significance of the null alleles in future studies to show that selection is really operating on this locus. In addition, some complex demographic scenarios may also explain the pattern, although we did not explore them here. For example, WALL 2000 Down showed that admixture after a long time of separation of two populations is still compatible with the current DNA data. This or some other population structure may explain the long persistence of B3 without invoking balancing selection (see TAJIMA 1993 Down). Also, expansion of human populations after bottleneck events was suggested by microsatellite data (KIMMEL et al. 1998 Down) and such bottleneck events can explain positive D values (see TAJIMA 1993 Down) although only a few are significant in currently available data of nuclear polymorphisms observed at nuclear loci (HEY 1997 Down; NACHMAN et al. 1998 Down; FAY and WU 1999 Down; HEY and HARRIS 1999 Down; PRZEWORSKI et al. 2000 Down). Therefore, we also need to explore possible neutral models to explain the contrasting pattern at the FUT2 locus as the accumulation of data of human DNA polymorphism will make more accurate modeling of the past demographic history of human populations possible. Although our conclusion for the contrasting pattern at the FUT2 locus is thus not definite, collecting DNA polymorphism data with random sampling certainly will enhance our understanding of the past population structure and give some hints about locus-specific factors such as selective agents acting on human populations.


*  ACKNOWLEDGMENTS

We thank Dr. Yun-Xin Fu for his suggestion of the two-allele balancing selection model and helpful discussion and Dr. Ernette D. du Toit for providing blood samples from Cape Town, South Africa. This work was supported in part by grants-in-aid for Scientific Research from the Ministry of Education, Science and Culture of Japan, and grants from Uehara Memorial Foundation. H.T. was also supported in part by a grant from the Program for Promotion of Basic Research Activities for Innovative Biosciences (PROBRAIN).

Manuscript received August 25, 2000; Accepted for publication February 15, 2001.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

BOREN, T., P. FALK, K. A. ROTH, G. LARSON, and S. NORMARK, 1993  Attachment of Helicobacter pylori to human gastric epithelium mediated by blood group antigens. Science 262:1892-1895[Abstract/Free Full Text].

CLARK, A. G., K. M. WEISS, D. A. NICKERSON, S. L. TAYLOR, and A. BUCHANAN et al., 1998  Haplotype structure and population genetic inferences from nucleotide-sequence variation in human lipoprotein lipase. Am. J. Hum. Genet. 63:595-612[Medline].

FAY, J. and C. I. WU, 1999  A human population bottleneck can account for the discordance between patterns of mitochondrial versus nuclear DNA variation. Mol. Biol. Evol. 16:1003-1005[Medline].

FU, Y. X. and W. H. LI, 1993  Statistical tests of neutrality of mutations. Genetics 133:693-709[Abstract].

GALILI, U. and P. ANDREWS, 1995  Suppression of {alpha}-galactosyl epitopes synthesis and production of the natural anti-Gal antibody: a major evolutionary event in ancestral Old World primates. J. Hum. Biol. 29:433-442.

GALILI, U. and K. SWANSON, 1991  Gene sequences suggest inactivation of {alpha}-1,3-galactosyltransferase in catarrhines after the divergence of apes from monkeys. Proc. Natl. Acad. Sci. USA 88:7401-7404[Abstract/Free Full Text].

HARDING, R. M., S. M. FULLERTON, R. C. GRIFFITHS, J. BOND, and M. J. COX et al., 1997  Archaic African and Asian lineages in the genetic ancestry of modern humans. Am. J. Hum. Genet. 60:772-789[Medline].

HARRIS, E. H. and J. HEY, 1999  X chromosome evidence for ancient human histories. Proc. Natl. Acad. Sci. USA 96:3320-3324[Abstract/Free Full Text].

HEY, J., 1997  Mitochondrial and nuclear genes present conflicting portraits of human origins. Mol. Biol. Evol. 14:166-172[Abstract].

HEY, J. and E. HARRIS, 1999  Population bottlenecks and patterns of human polymorphism. Mol. Biol. Evol. 16:1423-1426[Medline].

HUDSON, R. R., M. SLATKIN, and W. P. MADDISON, 1992  Estimation of levels of gene flow from DNA sequence data. Genetics 132:583-589[Abstract].

INGMAN, M., H. KAESSMANN, S. PAABO, and U. GYLLENSTEN, 2000  Mitochondrial genome variation and the origin of modern humans. Nature 408:708-713[Medline].

KAUFFMANN, F., C. FRETTE, Q-T. PHAM, S. NAFISSI, and J. P. BERTRAND et al., 1996  Associations of blood group-related antigens to EFV1, wheezing, and asthma. Am. J. Respir. Crit. Care Med. 153:76-82[Abstract].

KELLY, R. J., S. ROUQUIER, D. GIORGI, G. G. LENNON, and J. B. LOWE, 1995  Sequence and expression of a candidate for the human Secretor blood group {alpha}(1,2)fucosyltransferase gene (FUT2). Homozygosity for an enzyme-inactivating nonsense mutation commonly correlates with the non-secretor phenotype. J. Biol. Chem. 270:4640-4649[Abstract/Free Full Text].

KIMMEL, M., R. CHAKRABORTY, J. P. KING, M. BAMSHAD, and W. S. WATKINS et al., 1998  Signatures of population expansion in microsatellite repeat data. Genetics 148:1921-1930[Abstract/Free Full Text].

KODA, Y., M. SOEJIMA, Y. LIU, and H. KIMURA, 1996  Molecular basis for secretor type {alpha}(1,2)-fucosyltransferase gene deficiency in a Japanese population: a fusion gene generated by unequal crossover responsible for the enzyme deficiency. Am. J. Hum. Genet. 59:343-350[Medline].

KODA, Y., H. TACHIDA, M. SOEJIMA, O. TAKENAKA, and H. KIMURA, 2000  Ancient origin of the null allele se428 of the human ABO-secretor locus (FUT2). J. Mol. Evol. 50:243-248[Medline].

KUKOWSKA-LATALLO, J. F., R. D. LARSEN, R. P. NAIR, and J. B. LOWE, 1990  A cloned human cDNA determines expression of a mouse stage-specific embryonic antigen and the Lewis blood group {alpha}(1,3/1,4)fucosyltransferase. Genes Dev. 4:1288-1303[Abstract/Free Full Text].

LARSEN, R. D., L. K. ERNST, R. P. NAIR, and J. B. LOWE, 1990  Molecular cloning, sequence, and expression of human GDP-L-fucose: ß-D-galactoside 2-{alpha}-L-fucosyltransferase cDNA that can form the H group antigen. Proc. Natl. Acad. Sci. USA 87:6674-6678[Abstract/Free Full Text].

LARSON, G., C. BORJESON, A. ELMGREN, A. KERNHOLT, and S. HENRY et al., 1996  Identification of a new plasma {alpha}(1,3)fucosyltransferase (FUT6) allele requires an extended genotyping strategy. Vox Sang. 71:233-241[Medline].

LI, W.-H., 1975  The first arrival time and mean age of a deleterious mutant gene in a finite population. Am. J. Hum. Genet. 27:274-286[Medline].

LI, W. H. and L. A. SADLER, 1991  Low nucleotide diversity in man. Genetics 129:513-523[Abstract].

LIU, Y.-H., Y. KODA, M. SOEJIMA, H. PANG, and T. SCHLAPHOFF et al., 1998  Extensive polymorphism of the FUT2 gene in an African (Xhosa) population of South Africa. Hum. Genet. 103:204-210[Medline].

LIU, Y.-H., Y. KODA, M. SOEJIMA, H. PANG, and B.-J. WANG et al., 1999  The fusion gene at the ABO-Secretor locus (FUT2): absence in Chinese populations. J. Hum. Genet. 44:181-184[Medline].

LOWE, J. B., J. F. KUKOWSKA-LATALLO, R. P. NAIR, R. D. LARSEN, and R. M. MARKS et al., 1991  Molecular cloning of a human fucosyltransferase gene that determines expression of the Lewisx and VIM-2 epitopes but not ELAM-1-dependent cell adhesion. J. Biol. Chem. 266:17467-17477[Abstract/Free Full Text].

MCCURLEY, R. S., A. RECINOS, III, A. S. OLSEN, J. C. GINGRICH, and D. SZCZEPANIAK et al., 1995  Physical maps of human {alpha}(1,3)fucosyltransferase genes FUT3-FUT6 on chromosomes 19p13.3 and 11q21. Genomics 26:142-146[Medline].

MCDONALD, J. H. and M. KREITMAN, 1991  Adaptive protein evolution at the Adh locus in Drosophila. Nature 351:652-654[Medline].

MOLLICONE, R., I. REGUIGNE, A. FLETCHER, A. AZIZ, and M. RUSTAM et al., 1994  Molecular basis for plasma {alpha}(1,3)-fucosyltransferase gene deficiency (FUT6). J. Biol. Chem. 269:12662-12671[Abstract/Free Full Text].

NACHMAN, M. W., V. L. BAUER, S. L. CROWELL, and C. F. AQUADRO, 1998  DNA variability and recombination rates at X-linked loci in humans. Genetics 150:1133-1141[Abstract/Free Full Text].

NEI, M., and G. LIVSHITS, 1990 Evolutionary relationships of Europeans, Asians, and Africans at the molecular level, pp. 251–265 in Population Biology of Genes and Molecules, edited by N. TAKAHATA and J. F. CROW. Baifukan, Tokyo.

NEI, M. and F. TAJIMA, 1981  DNA polymorphism detectable by restriction endonucleases. Genetics 97:145-163[Abstract/Free Full Text].

NICKERSON, D. A., S. L. TAYLOR, K. M. WEISS, A. G. CLARK, and R. G. HUTCHINSON et al., 1998  DNA sequence diversity in a 9.7-kb region of the human lipoprotein lipase gene. Nat. Genet. 19:233-240[Medline].

PANG, H., Y. KODA, M. SOEJIMA, T. SCHLAPHOFF, and E. D. DU TOIT et al., 1999  Allelic diversity of the human plasma {alpha}(1,3)fucosyltransferase gene (FUT6). Ann. Hum. Genet. 63:277-284[Medline].

PLOTKOWSKI, M. C., O. BAJOLET-LAUDINAT, and E. PUCHELLE, 1993  Cellular and molecular mechanisms of bacterial adhesion to respiratory mucosa. Eur. Respir. J. 6:903-916[Abstract].

PRZEWORSKI, M., R. R. HUDSON, and A. D. RIENZO, 2000  Adjusting the focus on human variation. Trends Genet. 16:296-302[Medline].

RAMPHAL, R., C. CARNOY, S. FIEVRE, J. C. NICHALSKI, and N. HOUDRET et al., 1991  Pseudomonas aeruginosa recognizes carbohydrate chains containing type 1 (Galß1-3GlcNAc) or type 2 (Galß1-4GlcNAc) disaccharide units. Infect. Immun. 59:700-704[Abstract/Free Full Text].

RANA, B. K., D. HEWETT-EMMETT, L. JIN, B. H.-J. CHANG, and N. SAMBUUGHIN et al., 1999  High polymorphism at the human melanocortin 1 receptor locus. Genetics 151:1547-1557[Abstract/Free Full Text].

RAZA, M. W., C. C. BLACKWELL, P. MOLYNEAUX, V. S. JAMES, and M. M. OGILVIE et al., 1991  Association between secretor status and respiratory viral illness. Br. Med. J. 303:815-818.

ROUQUIER, S., J. B. LOWE, R. J. KELLY, A. L. FERTITTA, and G. G. LENNON et al., 1995  Molecular cloning of a human genomic region containing the H blood group {alpha}(1,2)fucosyltransferase gene and two H locus-related DNA restriction fragments. Isolation of a candidate for the human Secretor blood group locus. J. Biol. Chem. 270:4632-4639[Abstract/Free Full Text].

ROZAS, J. and R. ROZAS, 1999  DnaSP version 3: an integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics 15:174-175[Abstract/Free Full Text].

SASAKI, K., K. KURATA, K. FUNAYAMA, M. NAGATA, and E. WATANABE et al., 1994  Expression cloning of a novel {alpha}1,3fucosyltransferase that is involved in biosynthesis of the sialyl Lewisx carbohydrate determinants in leukocytes. J. Biol. Chem. 269:14730-14737[Abstract/Free Full Text].

SIMONSEN, K. L., G. A. CHURCHILL, and C. F. AQUADRO, 1995  Properties of statistical tests of neutrality for DNA polymorphism data. Genetics 141:413-429[Abstract].

TAJIMA, F., 1989  Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585-595[Abstract/Free Full Text].

TAJIMA, F., 1993  Statistical analysis of DNA polymorphism. Jpn. J. Genet. 68:567-595[Medline].

WALL, J. D., 2000  Detecting ancient admixture in humans using sequence polymorphism data. Genetics 154:1271-1279[Abstract/Free Full Text].

WATTERSON, G. A., 1975  On the number of segregating sites in genetical model without recombination. Theor. Popul. Biol. 7:256-276[Medline].

WESTON, B. W., R. P. NAIR, R. D. LARSEN, and J. B. LOWE, 1992a  Isolation of a novel human {alpha}(1,3)fucosyltransferase gene and molecular comparison to the human Lewis blood group {alpha}(1,3/1,4)fucosyltransferase gene. Syntenic, homologous, nonallelic genes encoding enzymes with distinct acceptor substrate specificities. J. Biol. Chem. 267:4152-4160[Abstract/Free Full Text].

WESTON, B. W., P. L. SMITH, R. J. KELLY, and J. B. LOWE, 1992b  Molecular cloning of a fourth member of a human {alpha}(1,3)fucosyltransferase gene family. Multiple homologous sequences that determine expression of the Lewisx, sialyl Lewisx, and difucosyl sialyl Lewisx epitopes. J. Biol. Chem. 267:24575-24584[Abstract/Free Full Text].

ZIETKIEWICZ, E., V. YOTOVA, M. JARNIK, M. KORAB-LASKOWSKA, and K. K. KIDD et al., 1998  Genetic structure of the ancestral populations of modern humans. J. Mol. Evol. 47:146-155[Medline].




This article has been cited by other articles:


Home page
Genome ResHome page
M. Fumagalli, R. Cagliani, U. Pozzoli, S. Riva, G. P. Comi, G. Menozzi, N. Bresolin, and M. Sironi
Widespread balancing selection and pathogen-driven selection at blood group antigen genes
Genome Res., February 1, 2009; 19(2): 199 - 212.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
M. Thorven, A. Grahn, K.-O. Hedlund, H. Johansson, C. Wahlfrid, G. Larson, and L. Svensson
A Homozygous Nonsense Mutation (428G->A) in the Human Secretor (FUT2) Gene Provides Resistance to Symptomatic Norovirus (GGII) Infections
J. Virol., December 15, 2005; 79(24): 15351 - 15355.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
E. A. Hurd and S. E. Domino
Increased Susceptibility of Secretor Factor Gene Fut2-Null Mice to Experimental Vaginal Candidiasis
Infect. Immun., July 1, 2004; 72(7): 4279 - 4281.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. F. Hammer, F. Blackmer, D. Garrigan, M. W. Nachman, and J. A. Wilder
Human Population Structure and Its Effects on Sampling Y Chromosome Sequence Variation
Genetics, August 1, 2003; 164(4): 1495 - 1509.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
J. M. Akey, G. Zhang, K. Zhang, L. Jin, and M. D. Shriver
Interrogating a High-Density SNP Map for Signatures of Natural Selection
Genome Res., December 1, 2002; 12(12): 1805 - 1814.
[Abstract] [Full Text] [PDF]