Genetics, Vol. 150, 511-513, September 1998, Copyright © 1998


Letter to the Editor

Ancient Polymorphism and the Hypothesis of a Recent Bottleneck in the Malaria Parasite Plasmodium falciparum

Austin L. Hughesa and Federica Verraa
a Department of Biology and Institute of Molecular Evolutionary Genetics, The Pennsylvania State University, University Park, Pennsylvania 16802

Corresponding author: Austin L. Hughes, Department of Biology, 208 Mueller Laboratory, The Pennsylvania State University, University Park, PA 16802., austin{at}hugaus3.bio.psu.edu (E-mail).

RICH et al. (1998) have recently proposed that the current world population of the human malaria parasite Plasmodium falciparum consists of individuals descended from a single haploid genotype or strain (malarial Eve) that occurred between 24,500 and 57,500 years ago. They base this conclusion on an examination of synonymous and nonsynonymous nucleotide polymorphisms at 10 loci. At each of these loci, the authors examined published allelic sequences, with allele numbers ranging from 2 to 25 per locus; they reported some nonsynonymous polymorphisms but no synonymous polymorphisms at any locus. This conclusion is surprising because it ignores previous reports of extensive synonymous polymorphisms at certain loci of P. falciparum encoding surface proteins (HUGHES 1991 Down, HUGHES 1992 Down; HUGHES and HUGHES 1995A Down; JONGWUTIWES et al. 1994 Down). Furthermore, it is inconsistent with estimates that certain polymorphisms at loci encoding immunogenic proteins of P. falciparum have been maintained for millions of years (HUGHES 1992 Down, HUGHES 1993 Down; HUGHES and HUGHES 1995B Down).

The circumsporozoite protein (CSP) is a surface protein of the sporozoite, the infective stage of the malaria parasite. Examination of rates of synonymous and nonsynonymous nucleotide substitution at this locus showed a significantly enhanced rate of nonsynonymous nucleotide substitution in the region of the gene encoding peptides bound by host major histocompatibility complex (MHC) molecules and presented to T cells [T cell epitopes (TCE); GOOD et al. 1988 Down; HUGHES 1991 Down]. This is evidence that CSP polymorphism is maintained by balancing selection relating to the evasion of host immune recognition (HUGHES 1991 Down). The recent discovery of altered peptide ligand (APL) antagonism between certain allelic forms of these TCE in P. falciparum (GILBERT et al. 1998 Down) suggests a possible mechanism for such selection. When antagonistic forms of the TCE are present in the same host, an effective immune response is prevented (GILBERT et al. 1998 Down). Therefore APL antagonism will give rise to overdominant selection, favoring CSP heterozygosity at the level of the zygote that occurs in the mosquito intermediate host and gives rise to the infective sporozoites.

It is well known that balancing selection can maintain polymorphisms for much longer than neutral polymorphisms are expected to persist (TAKAHATA and NEI 1990 Down). A classic example involves the MHC genes of vertebrates. The MHC genes are characterized by a greatly enhanced rate of nonsynonymous nucleotide substitution in the codons encoding the peptide-binding region (PBR) of the molecule (HUGHES and NEI 1988 Down, HUGHES and NEI 1989 Down; HUGHES et al. 1994 Down). This pattern of nucleotide substitution is evidence that some form of balancing selection (such as overdominant selection) acts to maintain diversity at the amino acid level in the PBR. Diversity in the PBR is apparently selectively favored because MHC heterozygosity enhances immune surveillance by enabling the individual to bind and present a wide array of foreign peptides (HUGHES and NEI 1988 Down; HUGHES and HUGHES 1995A Down). MHC loci are also characterized by long-lasting polymorphisms, which have been maintained for millions of years and predate speciation events (KLEIN and TAKAHATA 1990 Down). The fact that there are numerous ancient alleles at human MHC loci is evidence that the human population could not have gone through a recent extreme bottleneck; rather the long-term effective population size of humans has been on the order of 104–105 for the past 5 million years or so (KLEIN et al. 1990 Down; AYALA 1995 Down; TAKAHATA et al. 1995 Down).

Because the CSP locus is subject to a similar sort of balancing selection to that seen at the MHC, it would not be surprising to find that polymorphism at this locus also has been maintained for a long time. The available data support this hypothesis. For example, we computed the mean number of synonymous substitutions per site (dS; NEI and GOJOBORI 1986 Down) in the 5' and 3' nonrepeat regions between 13 distinct CSP alleles of P. falciparum and one of Plasmodium reichenowi, a parasite of chimpanzees. (The central tetrapeptide repeat region of the gene was excluded because it cannot be reliably aligned among alleles; see JONGWUTIWES et al. 1994 Down.) Mean dS between P. falciparum and P. reichenowi was 0.035 ± 0.016 SE. On the other hand, dS between the two most distant P. falciparum alleles was 0.015 ± 0.011. Assuming that P. falciparum and P. reichenowi diverged when human and chimpanzee diverged and that this occurred about 5 million years ago (mya), we estimate the divergence time of the most divergent known CSP alleles in P. falciparum at 2.1 ± 1.5 million years. The standard error of the estimate is large because the number of sites involved is small; nonetheless the range of two standard errors around the mean (0.6–3.6 million years) falls well beyond the very recent date for malarial Eve proposed by RICH et al. 1998 Down. To accumulate the degree of synonymous difference observed between the two most divergent P. falciparum CSP alleles in the roughly 60,000 years proposed by RICH et al. 1998 Down, a rate of synonymous substitution of about 1 x 10-7 substitutions/site/year would be required. This is two orders of magnitude higher than typical rates of synonymous substitution in eukaryotes (LI 1997 Down).

It might seem surprising that we obtained nonzero values of dS for the CSP alleles, when RICH et al. 1998 Down counted no synonymous differences in the same data set. The difference is due to their method of counting and illustrates the pitfalls of merely counting codon differences rather than estimating the number of differences by a method like NEI and GOJOBORI's (1986). For example, in the 5' nonrepeat region of the CSP gene of P. falciparum, there is a codon position at which certain alleles have ATC (isoleucine), while the rest have ACT (threonine). RICH et al. 1998 Down presumably counted this as a nonsynonymous difference. However, NEI and GOJOBORI's method considers the pathways of point mutation by which one can get from ATC to ACT (or vice versa). The possible intermediates are ATT (isoleucine) and ACC (threonine). Thus, by whichever mutational pathway we proceed from ATC to ACT (or vice versa), we must accumulate one synonymous mutation and one nonsynonymous mutation. By ignoring mutational pathways, RICH et al. 1998 Down came to the misleading conclusion that no synonymous substitutions have occurred among CSP alleles.

Merozoite surface antigen-1 (MSA-1) is another antigenic surface protein of Plasmodium. It is encoded by a single polymorphic locus and expressed on the surface of the merozoite, the stage of the parasite that invades host red blood cells (HALL et al. 1984 Down). The evolutionary history of MSA-1 alleles is complex because of numerous past events of interallelic recombination; thus different gene regions have different evolutionary histories (HUGHES 1992 Down). (Note that interlocus recombination cannot have been a factor here because there are no related loci in the P. falciparum genome; HALL et al. 1984 Down.) In one extensive region [called region 6 by HUGHES 1992 Down], the known MSA-1 alleles of P. falciparum form two distinctive families, which are highly divergent from each other. HUGHES 1992 Down, HUGHES 1993 Down estimated that these two families diverged from each other at least 35 mya, making this one of the oldest polymorphisms known in any organism. Mean dS in region 6 between the two families of P. falciparum MSA-1 alleles is 0.681 ± 0.122 (HUGHES 1992 Down). Thus, alleles from these two families are nearly as divergent at synonymous sites as are orthologous genes of human and mouse. To accumulate this degree of difference in 60,000 years would require a synonymous substitution rate of 6 x 10-6 substitutions/site/year—over three orders of magnitude greater than the typical eukaryotic rate.

The genomes of Plasmodium parasites are extraordinarily AT rich. The 334 coding regions of P. falciparum now in the database are 71.7% AT (NAKAMURA et al. 1998 Down). This nucleotide content bias affects codon usage, with codons having A or T in the third position being preferentially used. Of over 200,000 P. falciparum codons in the database, 83.6% have A or T in the third position. It is known from other systems that a strong codon bias will lower the rate of synonymous substitution (SHARP and LI 1989 Down; TICHER and GRAUR 1989 Down; SHARP 1991 Down). Thus, the rate of synonymous substitution is expected to be lower in P. falciparum than in organisms such as humans, whose codon usage is less biased. The fact that the rate of synonymous substitution is reduced may explain the observation of nonsynonymous but not synonymous polymorphisms at certain loci in this species (HUGHES 1991 Down; HUGHES and HUGHES 1995B Down).

The existence of long-lasting polymorphisms and high levels of intraspecific synonymous polymorphism in P. falciparum is strong evidence against the hypothesis of malarial Eve. Most of the loci examined by RICH et al. 1998 Down are housekeeping genes, encoding metabolic enzymes, chaperone proteins, and so forth. The existence of balancing selection at such loci is not expected. Most are probably evolving neutrally. It is not expected that the pattern of coalescence at a neutral locus will be the same as that at a locus under balancing selection; rather, polymorphism at the former will be much less long-lasting than at the latter (TAKAHATA and NEI 1990 Down). Of course, if recent directional selection has led to fixation of an allele at any of the loci studied by RICH et al. 1998 Down, the coalescence time of current alleles could be even more recent than that at a neutrally evolving locus. In either case, the coalescence pattern at one or a few loci cannot be used to estimate the population history of a species (TAKAHATA et al. 1995 Down). Alleles at CSP, MSA-1, and other loci that are under balancing selection clearly have much earlier coalescence times than do alleles at neutrally evolving loci, just as in humans MHC loci have much earlier coalescence times than do mitochondrial DNA.

The fact that long-lasting polymorphisms have been maintained at certain loci of P. falciparum but not at others is also strong evidence that interlocus recombination has been an important factor in the evolution of this species, contrary to the hypothesis of RICH et al. 1997 Down and AYALA 1998 Down. If haploid genomes did not recombine, polymorphism at all loci would hitchhike together, causing polymorphism even at selectively neutral loci to be as ancient as that at MSA-1. The evidence for past interallelic recombination at both CSP and MSA-1 (HUGHES 1992 Down; JONGWUTIWES et al. 1994 Down) loci also shows the importance of recombination in this sexually reproducing diploid.

In conclusion, there is strong evidence against a recent extreme population bottleneck in P. falciparum, involving a single haploid genotype, as proposed by RICH et al. 1998 Down. At this point we cannot rule out the possibility of a moderate bottleneck, which would have allowed continued maintenance of polymorphism at the CSP, MSA-1, and other loci subject to balancing selection (HUGHES and HUGHES 1995B Down). We probably do not yet have sufficient data to estimate reliably the long-term effective population size of P. falciparum, as has been done for humans on the basis of polymorphism at MHC loci (TAKAHATA et al. 1995 Down). To address this question, we need extensive samples of sequence polymorphism from worldwide populations of P. falciparum. It is instructive that the only study that attempts to assay CSP allelic variation in a local population of this species—in Thailand by JONGWUTIWES et al. 1994 Down—revealed several previously undescribed alleles, in fact nearly doubling the number of known alleles at this locus. This suggests that many more alleles at this locus remain to be discovered, which in turn would imply a substantial long-term effective population size. We predict that similar studies in other parts of the species' range will reveal extensive polymorphism, particularly in Africa, where P. falciparum presumably originated.

LITERATURE CITED

AYALA, F. J., 1995  The myth of Eve: molecular biology and human origins. Science 270:1930-1936[Medline].

AYALA, F. J., 1998  Is sex better? Parasites say "no". Proc. Natl. Acad. Sci. USA 95:3346-3348[Full Text].

GILBERT, S. C., M. PLEBANSKI, S. GUPTA, J. MORRIS, and M. COX et al., 1998  Association of malaria parasite population structure, HLA, and immunological antagonism. Science 279:1173-1177[Abstract/Full Text].

GOOD, M. F., D. POMBO, L. A. QUAKYI, E. M. RILEY, and R. A. HOUGHTEN et al., 1988  Human T-cell recognition of the circumsporozoite protein of Plasmodium falciparum: immunodominant T-cell domains map to the polymorphic regions of the molecule. Proc. Natl. Acad. Sci. USA 85:1199-1203[Medline].

HALL, R., J. E. HYDE, M. GOMAN, D. L. SIMMONS, I. A. HOPE, and M. MACKAY et al., 1984  Major surface antigen gene of a human malaria parasite cloned and expressed in bacteria. Nature 311:379-382[Medline].

HUGHES, A. L., 1991  Circumsporozoite protein genes of malaria parasites (Plasmodium spp.): evidence for positive selection on immunogenic regions. Genetics 127:345-353[Abstract].

HUGHES, A. L., 1992  Positive selection and interallelic recombination at the merozoite surface antigen-1 (MSA-1) locus of Plasmodium falciparum. Mol. Biol. Evol. 9:381-393[Abstract].

HUGHES, A. L., 1993 Coevolution of immunogenic proteins of Plasmodium falciparum and the host's immune system, pp. 109–127 in Mechanisms of Molecular Evolution, edited by N. TAKAHATA and A. G. CLARK. Sinauer, Sunderland MA.

HUGHES, A. L. and M. K. HUGHES, 1995a  Natural selection on the peptide-binding regions of major histocompatibility complex molecules. Immunogenetics 42:233-243[Medline].

HUGHES, M. K. and A. L. HUGHES, 1995b  Natural selection on Plasmodium surface proteins. Mol. Biochem. Parasitol. 71:99-113[Medline].

HUGHES, A. L. and M. NEI, 1988  Pattern of nucleotide substitution at MHC class I loci reveals overdominant selection. Nature 335:167-170[Medline].

HUGHES, A. L. and M. NEI, 1989  Nucleotide substitution at major histocompatibility complex class II loci: evidence for overdominant selection. Proc. Natl. Acad. Sci. USA 86:958-962[Medline].

HUGHES, A. L., M. K. HUGHES, C. Y. HOWELL, and M. NEI, 1994  Natural selection at the class II major histocompatibility complex loci of mammals. Philos. Trans. R. Soc. Lond. B Biol. Sci. 345:359-367.

JONGWUTIWES, S., K. TANABE, M. K. HUGHES, H. KANBARA, and A. L. HUGHES, 1994  Allelic variation in the circumsporozoite protein of Plasmodium falciparum from Thai field isolates. Am. J. Trop. Med. Hyg. 51:659-668[Medline].

KLEIN, J. and N. TAKAHATA, 1990  The major histocompatibility complex and the quest for origins. Immunol. Rev. 113:5-25[Medline].

KLEIN, J., J. GUTKNECT, and N. FISCHER, 1990  The major histocompatibility complex and human evolution. Trends Genet. 6:7-11[Medline].

LI, W.-H., 1997 Molecular Evolution. Sinauer, Sunderland, MA.

NAKAMURA, Y., T. GOJOBORI, and T. IKEMURA, 1998  Codon usage tabulated from the international DNA sequence databases. Nucleic Acids Res. 26:334[Abstract/Full Text].

NEI, M. and T. GOJOBORI, 1986  Simple methods for estimating the numbers of synonymous and nonsynonymous substitutions. Mol. Biol. Evol. 3:418-426[Abstract].

RICH, S. M., R. R. HUDSON, and F. J. AYALA, 1997  Plasmodium falciparum antigenic diversity: evidence of clonal population structure. Proc. Natl. Acad. Sci. USA 94:13040-13045[Abstract/Full Text].

RICH, S. M., M. C. LICHT, R. R. HUDSON, and F. J. AYALA, 1998  Malaria's Eve: evidence of a recent population bottleneck throughout the world populations of Plasmodium falciparum. Proc. Natl. Acad. Sci. USA 95:4425-4430[Abstract/Full Text].

SHARP, M. M., 1991  Determinants of DNA sequence divergence between Escherichia coli and Salmonella typhimurium: codon usage, map position, and concerted evolution. J. Mol. Evol. 33:23-33[Medline].

SHARP, P. M. and W.-H. LI, 1989  On the rate of DNA sequence evolution in Drosophila. J. Mol. Evol. 28:398-402[Medline].

TAKAHATA, N. and M. NEI, 1990  Allelic genealogy under overdominant and frequency-dependent selection and polymorphism of major histocompatibility complex loci. Genetics 124:967-978[Abstract].

TAKAHATA, N., Y. SATTA, and J. KLEIN, 1995  Divergence time and population size in the lineage leading to modern humans. Theor. Pop. Biol. 48:198-211.

TICHER, A. and D. GRAUR, 1989  Nucleic acid composition, codon usage, and the rate of synonymous substitution in protein-coding genes. J. Mol. Evol. 28:286-298[Medline].




This article has been cited by other articles:


Home page
Mol Biol EvolHome page
D. E. Neafsey, D. L. Hartl, and M. Berriman
Evolution of Noncoding and Silent Coding Sites in the Plasmodium falciparum and Plasmodium reichenowi Genomes
Mol. Biol. Evol., July 1, 2005; 22(7): 1621 - 1626.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. C. Leclerc, P. Durand, C. Gauthier, S. Patot, N. Billotte, M. Menegon, C. Severini, F. J. Ayala, and F. Renaud
From The Cover: Meager genetic variability of the human malaria agent Plasmodium vivax
PNAS, October 5, 2004; 101(40): 14455 - 14460.
[Abstract] [Full Text] [PDF]


Home page
Clin. Microbiol. Rev.Home page
R. Carter and K. N. Mendis
Evolutionary and Historical Aspects of the Burden of Malaria
Clin. Microbiol. Rev., October 1, 2002; 15(4): 564 - 594.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. Jongwutiwes, C. Putaporntip, R. Friedman, and A. L. Hughes
The Extent of Nucleotide Polymorphism is Highly Variable Across a 3-kb Region on Plasmodium falciparum Chromosome 2
Mol. Biol. Evol., September 1, 2002; 19(9): 1585 - 1590.
[Abstract] [Full Text] [PDF]


Home page
Integr. Comp. Biol.Home page
A. G. Evans and T. E. Wellems
Coevolutionary Genetics of Plasmodium Malaria Parasites and Their Human Hosts
Integr. Comp. Biol., April 1, 2002; 42(2): 401 - 407.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
S. K. Volkman, A. E. Barry, E. J. Lyons, K. M. Nielsen, S. M. Thomas, M. Choi, S. S. Thakore, K. P. Day, D. F. Wirth, and D. L. Hartl
Recent Origin of Plasmodium falciparum from a Single Progenitor
Science, July 20, 2001; 293(5529): 482 - 484.
[Abstract] [Full Text] [PDF]