| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Genetics, Vol. 174, 1689-1694, November 2006, Copyright © 2006
doi:10.1534/genetics.106.062885
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
School of Molecular and Biomedical Science, University of Adelaide, Adelaide, SA 5005, Australia
1 Address for correspondence: School of Molecular and Biomedical Science, Molecular Life Sciences Bldg., Gate 8, Victoria Dr., University of Adelaide, Adelaide, SA 5005, Australia.
E-mail: jack.dasilva{at}adelaide.edu.au
| ABSTRACT |
|---|
|
|
|---|
HIV-1 third variable region:
I investigated this correlation using sequence data from the third variable region (V3) of the HIV-1 exterior envelope glycoprotein (gp120), located on the surface of the virus particle (virion). V3 is the main determinant of which one of two cell-surface chemokine receptors (CCR5 or CXCR4) is used by a virion as a coreceptor to enter a cell, which determines the type of cell infected (SPECK et al. 1997). V3 also modulates the use of each coreceptor (DE JONG et al. 1992; HUNG et al. 1999) and thereby affects the rate-limiting step in cell entry (PLATT et al. 2005). Finally, V3 is also the main target of antibodies that interfere with cell entry (ZOLLA-PAZNER 2004). Because of its critical function and its exposure to neutralizing antibodies, V3 has been the focus of intense study, resulting in the data necessary to test the correlation.
Site-specific marginal fitness:
Data on V3 site-specific fitness effects of amino acids are available from a study that employed site-directed mutagenesis to modify the amino acid at V3 site 25 (HUNG et al. 1999). This study measured the effects of different amino acids at site 25 on the number of cells infected (infectivity) in a standardized assay. Note that infectivity is the appropriate measure of the effect of V3 on fitness under the assay conditions; the sole function of V3 appears to be in cell entry, and this component of fitness is unlikely to trade off against the remaining components of fitness, such as the rate of viral genome integration into the host cell genome, the rate of provirus (integrated viral genome) replication, and virion budding, which are controlled by other viral proteins (COFFIN 1999). The site-directed mutagenesis study was conducted using CCR5-utilizing HIV-1 of subtype B, the phylogenetic clade most common in Europe and North America and the most sequenced and studied.
Site-specific amino acid frequency:
I calculated site-specific frequencies of subtype B V3 amino acids for the viral population infecting a patient and then averaged frequencies across patients. Among-population means of frequencies were used because the site-specific frequency of an amino acid in any one population is expected to be dynamic, being dependent on the population's level of adaptation, and thus a poor predictor of the amino acid's marginal fitness. This procedure was repeated for each of the three coreceptor usage viral phenotypes: exclusively CCR5 utilizing (R5) (269 sequences from 58 patients), exclusively CXCR4 utilizing (X4) (109 sequences from 11 patients), and dual coreceptor utilizing (R5X4) (60 sequences from 16 patients). The rank order of amino acid frequencies at most sites, including site 25, varies among coreceptor-usage phenotypes (Figure 1), as would be expected if the sites function in determining or modulating coreceptor usage.
|
|
Selection coefficients:
V3 site-specific amino acid selection coefficients were calculated relative to the most common residue at each respective site for the R5 phenotype. These calculations assume no linkage among sites, which is reasonable given the high rate of effective recombination (between viral variants) (1.38 x 10–4 recombination events/adjacent nucleotide site/generation) (SHRINER et al. 2004a) relative to the mutation rate (2.4 x 10–5 point mutations/nucleotide site/generation) (MANSKY and TEMIN 1995). Site-specific selection coefficients range from 0.180 to 0.999, with a mean of 0.923 (median = 0.970; Figure 3), suggesting strong selection. Even if a small intrapatient effective population size of Ne = 103 infected cells (ACHAZ et al. 2004; SHRINER et al. 2004b) is assumed and the minimum selection coefficient is used, then 2Nes = 360 >> 1, indicating that selection will prevail over genetic drift.
|
Mutation–selection balance:
Mutation–selection balance cannot explain the high frequencies of less favored amino acids. Given a genetically homogenous virus population at the time of infection (DERDEYN et al. 2004) and the early evolution of V3 toward the R5 sequence with the most common residue at each site (ZHANG et al. 1993), and assuming no migration between patients, amino acid polymorphism within a population infecting a patient is assumed to arise via mutation. At mutation–selection equilibrium, the most favored amino acid at each site is the most frequent, with each alternative (deleterious) amino acid present at a mean frequency of p = µ/s, where µ is the rate of mutation to the deleterious amino acid and s is its selection coefficient relative to the most preferred amino acid (CROW and KIMURA 1970). For example, in the case of glutamic acid (E), the second most common residue at site 25 of R5 V3 sequences (Figure 1), the mutation rate from the most common residue, aspartic acid (D), ignoring any mutation bias, is predicted to be 1.6 x 10–5/codon/generation and s = 0.18. This gives p = 8.9 x 10–5, which is much lower than the observed mean frequency of 0.35.
Episodic adaptation:
The high frequencies of less favored residues may be explained if V3 is displaced from mutation–selection balance, either at the time of infection, after which V3 evolves rapidly toward the R5 sequence with the most common residue at each site, or during chronic infection when it is subject to strong opposing selection, as argued above. Although the alternative coreceptor, CXCR4, may impose strong selection during the late stages of disease progression (TEMPLETON et al. 2004), the immune system is expected to periodically select escape mutants throughout the chronic stage of infection (DA SILVA 2003; WEI et al. 2003; WILLIAMSON 2003; TEMPLETON et al. 2004; FROST et al. 2005). For example, antibodies may impose frequency-dependent selection by targeting epitopes consisting of the most common residues at sites, but such sites may eventually become shielded from antibody surveillance by changes in other protein regions (WEI et al. 2003; FROST et al. 2005). This scenario could result in the episodic adaptation of V3 to CCR5.
The mutational landscape model:
Under weak selection and mutation (see ORR 2002), each step in an episode of adaptation can be described by the mutational landscape model of GILLESPIE (1984, 1991). In this model, a beneficial amino acid of fitness rank j spreads to fixation as the next step in adaptation, when an amino acid of fitness rank i is the current wild type at the site, with probability
![]() | (1) |
is the probability of fixation and the subscripts denote the marginal fitness ranks of the beneficial amino acids within one mutational step of the wild type, with 1 indicating the highest fitness and j < i. This model assumes no recombination and considers mutations at all sites of a sequence. However, the model can be applied to a single amino-acid site if free recombination is assumed or if it is assumed that the most preferred amino acid is present at all other sites (i.e., that there is no potential for adaptation at other sites). Since
j
2sj for a new mutation (HALDANE 1927) and
j
2Nsjpj for standing variation (ORR and BETANCOURT 2001), Pij is, in either case, proportional to sj. For example, in the case of new mutations,
![]() | (2) |
Therefore, Pij is linearly related to sj and wj, and, after scaling both variables between 0 and 1, Pij
wj regardless of which amino acid is the current wild type (Figure 4). This means that regardless of the level of adaptation of the population, a particular beneficial amino acid will be the next to spread to fixation with a probability that is proportional to its marginal fitness. If Pij is taken to be the proportion of populations fixed for an amino acid of fitness rank j at a particular site or the site-specific frequency of this amino acid averaged across many populations, then this relationship may explain why among-population mean site-specific frequencies of amino acids and their marginal relative fitnesses, both scaled between 0 and 1, are equal.
|
The question then is, Is there clonal interference among beneficial mutations at a single amino-acid site? The answer depends on the probability that more than one beneficial mutation segregate at the same site. This, of course, depends on the beneficial mutation rate, which depends on the level of adaptation at a given site (the wild-type amino acid at the site). I calculated the proportion of nucleotide point mutations that produce a beneficial amino acid change using the rank order of fitnesses of amino acids at V3 site 25 (Figure 2). Excluding mutations at multiple codon positions, because the frequency of these is very low (µ2 and µ3) compared to the frequency of mutations at a single codon position (µ), and ignoring any mutation bias, the proportion of nucleotide point mutations that are beneficial, averaged among codons for each amino acid, ranges from 0.037 for arginine (R) to 0.389 for glycine (G) and is 0.182 averaged across all amino acids observed at site 25 [except aspartic acid (D), which has the highest fitness]. Then, the rate of beneficial mutations per codon is the product of the probability of a mutation in the codon (including synonymous mutations) and the proportion of these mutations that are beneficial. Allowing for the highest beneficial mutation rate at site 25, that is, with glycine as the wild-type amino acid, the rate of beneficial mutations per codon is 3 nucleotide sites/codon x 2.4 x 10–5 mutations/nucleotide site/generation x 0.389 beneficial mutations/mutation = 2.8 x 10–5 beneficial mutations/codon/generation. Therefore, with a census population size of
N = 107 infected cells (CHUN et al. 1997) there will be a maximum total of 2.8 x 102 beneficial mutations at V3 site 25 each generation within a population. However, the probability that a new beneficial mutation becomes fixed is proportional to Ne/N (KIMURA 1964), which for HIV-1 is 103/107 = 10–4. Therefore, at most 10–4 x 2.8 x 102 = 2.8 x 10–2 new beneficial mutations each generation ultimately become fixed. This is equivalent to one beneficial mutation that ultimately becomes fixed arising every 36 generations or more, on average, for the highest beneficial mutation rate at site 25. For the mean beneficial mutation rate at site 25 (1.3 x 10–5), one beneficial mutation that ultimately becomes fixed arises every 77 generations or more. To determine whether multiple beneficial mutations will segregate at the same site, we need to know how long it takes a single new beneficial mutation to spread to fixation. This can be calculated numerically from the frequency of a beneficial mutation after one generation of selection in a haploid organism (the HIV-1 provirus is haploid): p' = p(1 + s)/(ps + 1), where p is the frequency in the current generation. With initial frequency p = 1/N = 10–7 for a new mutation, the number of generations taken to reach a frequency of 99% ranges from 6, for mutation G
D (s = 34.9), to 51, for mutation G
R (s = 0.5). Therefore, in the extreme case of the highest beneficial mutation rate and lowest selection coefficient, we can expect some clonal interference (on average, a new beneficial mutation fixes every 36 generations or more and it takes such a mutation 51 generations to spread to near fixation). However, with the average beneficial mutation rate, clonal interference is not expected regardless of the magnitude of the selection coefficient (on average, a new beneficial mutation fixes only every 77 generations or more).
A recent study with a DNA bacteriophage system under moderately strong selection (s = 0.11–0.39) and weak mutation is the only empirical examination of the mutational landscape model to date (ROKYTA et al. 2005). The fitnesses of nine beneficial amino acid replacements that occurred as first steps in episodes of adaptation were measured, and the frequencies of the replacements across 20 replicate populations were compared to expectations under the model. Modifying the model to account for mutation bias and population bottlenecks resulting from the experimental protocol improved its fit to the observed distribution of amino acid replacement frequencies. In the case of HIV-1, population size is constant during chronic infection and mutation bias may be unimportant since selection is strong. The empirical support from the Rokyta et al. study suggests that the model is robust to violations of the assumption of weak selection. Indeed, the model requires only that fixation probabilities be proportional to s (ROKYTA et al. 2005).
Most of the arguments above assume that the source of beneficial variants is new mutations within the population. However, variation may also be introduced by migration in the form of multiple infections of the same patient. Nevertheless, long-term studies of subtype B HIV-1-infected patients have not reported evidence of repeat infections (e.g., SHANKARAPPA et al. 1999). Repeat infections may be rare because sexual transmission is the main route of infection in North America and western Europe (UNAIDS 2004), where subtype B predominates, and the probability of infection per coital act is <0.5% (GRAY et al. 2001). And, perhaps more importantly, sexual transmission involves a severe bottleneck of the donor viral population, possibly due to strong selection by host target cells, resulting in typically only a single variant transmitting successfully (DERDEYN et al. 2004). Consequently, the impact of migration on genetic variation is expected to be minor compared to that of mutation and selection. For example, even with the successful transmission and integration of 102 distinct genomes, a migration event would contribute only a very small fraction of the 2.4 x 106 mutant genomes generated by mutation each generation (
2 days) (MARKOWITZ et al. 2003) in a population (for an HIV-1 genome size of
104 nucleotides). Therefore, mutation produces several orders of magnitude more variation in a population in a single generation than can be contributed by a rare migration event.
Another assumption made in applying the mutational landscape model to HIV-1 is that the HIV-1 populations studied here are true replicates. Strictly speaking, this assumption is violated because populations differ in a variety of ways, including differences in specific immunity (cytotoxic T-lymphocyte and antibody responses) among patients. However, the main source of selection of V3 of interest here is the chemokine receptor CCR5, which is expressed intact on cells targeted by HIV-1 in every infected individual. The intact, expressed protein appears to exhibit little adaptive variation within humans and among higher primates with respect to HIV-1 (ZHANG et al. 2003). Therefore, CCR5 represents a homogenous source of selection among patients. Indeed, HIV-1 adapts to CCR5 shortly after infection, rapidly evolving the V3 sequence with the most common amino acid at each site for the CCR5-utilizing phenotype (Figure 1) (ZHANG et al. 1993). Therefore, although the subtype B HIV-1 populations analyzed here are not true replicates, they are as similar with respect to the source of selection under study as one might expect to find in a nonexperimental setting.
Conclusion:
It will be interesting to see if the correlation between mean frequency and fitness observed here holds more generally, which should depend on whether fluctuating selection within populations combined with constant, strong selection across populations is a common condition. If so, the ability to estimate site-specific marginal fitnesses of amino acids from their among-population mean site-specific frequencies should open new avenues of research into the dynamics of adaptation at the molecular level.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
| LITERATURE CITED |
|---|
|
|
|---|
ACHAZ, G., S. PALMER, M. KEARNEY, F. MALDARELLI, J. W. MELLORS et al., 2004 A robust measure of HIV-1 population turnover within chronically infected individuals. Mol. Biol. Evol. 21: 1902–1912.
BONHOEFFER, S., E. C. HOLMES and M. A. NOWAK, 1995 Causes of HIV diversity. Nature 376: 125.[CrossRef][Medline]
CHUN, T. W., L. CARRUTH, D. FINZI, X. SHEN, J. A. DIGIUSEPPE et al., 1997 Quantification of latent tissue reservoirs and total body viral load in HIV-1 infection. Nature 387: 183–188.[CrossRef][Medline]
COFFIN, J. M., 1999 Molecular biology of HIV, pp. 3–40 in The Evolution of HIV, edited by K. A. CRANDALL. Johns Hopkins University Press, Baltimore.
CROW, J. F., and M. KIMURA, 1970 An Introduction to Population Genetics Theory. Harper & Row, New York.
DA SILVA, J., 2003 The evolutionary adaptation of HIV-1 to specific immunity. Curr. HIV Res. 1: 363–371.[CrossRef][Medline]
DE JONG, J. J., A. DE RONDE, W. KEULEN, M. TERSMETTE and J. GOUDSMIT, 1992 Minimal requirements for the human immunodeficiency virus type 1 V3 domain to support the syncytium-inducing phenotype: analysis by single amino acid substitution. J. Virol. 66: 6777–6780.
DERDEYN, C. A., J. M. DECKER, F. BIBOLLET-RUCHE, J. L. MOKILI, M. MULDOON et al., 2004 Envelope-constrained neutralization-sensitive HIV-1 after heterosexual transmission. Science 303: 2019–2022.
FROST, S. D. W., T. WRIN, D. M. SMITH, S. L. K. POND, Y. LIU et al., 2005 Neutralizing antibody responses drive the evolution of human immunodeficiency virus type 1 envelope during recent HIV infection. Proc. Natl Acad. Sci. USA 102: 18514–18519.
GERRISH, P., 2001 The rhythm of microbial adaptation. Nature 413: 299–302.[CrossRef][Medline]
GERRISH, P. J., and R. E. LENSKI, 1998 The fate of competing beneficial mutations in an asexual population. Genetica 102–103: 127–144.[CrossRef]
GILLESPIE, J. H., 1984 Molecular evolution over the mutational landscape. Evolution 38: 1116–1129.[CrossRef]
GILLESPIE, J. H., 1991 The Causes of Molecular Evolution. Oxford University Press, New York.
GRAY, R. H., M. J. WAWER, R. BROOKMEYER, N. K. SEWANKAMBO, D. SERWADDA et al., 2001 Probability of HIV-1 transmission per coital act in monogamous, heterosexual, HIV-1-discordant couples in Rakai, Uganda. Lancet 357: 1149–1153.[CrossRef][Medline]
HALDANE, J. B. S., 1927 A mathematical theory of natural and artificial selection. V. Selection and mutation. Proc. Camb. Philos. Soc. 28: 838–844.
HUNG, C. S., N. VANDER HEYDEN and L. RATNER, 1999 Analysis of the critical domain in the V3 loop of human immunodeficiency virus type 1 gp120 involved in CCR5 utilization. J. Virol. 73: 8216–8226.
KIMURA, M., 1964 Diffusion models in population genetics. J. Appl. Probab. 1: 177–232.[CrossRef]
KUIKEN, C. L., B. FOLEY, E. GUZMAN and B. T. M. KORBER, 1999 Determinants of HIV-1 protein evolution, pp. 432–468 in The Evolution of HIV, edited by K. A. CRANDALL. Johns Hopkins University Press, Baltimore.
MANSKY, L. M., and H. M. TEMIN, 1995 Lower in vivo mutation rate of human immunodeficiency virus type 1 than that predicted from the fidelity of purified reverse transcriptase. J. Virol. 69: 5087–5094.[Abstract]
MARKOWITZ, M., M. LOUIE, A. HURLEY, E. SUN, M. DI MASCIO et al., 2003 A novel antiviral intervention results in more accurate assessment of human immunodeficiency virus type 1 replication dynamics and T-cell decay in vivo. J. Virol. 77: 5037–5038.
NIELSEN, R., and Z. YANG, 1998 Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. Genetics 148: 929–936.
ORR, H. A., 2002 The population genetics of adaptation: the adaptation of DNA sequences. Evolution 56: 1317–1330.[CrossRef][Medline]
ORR, H. A., and A. J. BETANCOURT, 2001 Haldane's sieve and adaptation from the standing genetic variation. Genetics 157: 875–884.
PLATT, E. J., J. P. DURNIN and D. KABAT, 2005 Kinetic factors control efficiencies of cell entry, efficacies of entry inhibitors, and mechanisms of adaptation of human immunodeficiency virus. J. Virol. 79: 4347–4356.
REZA, S. M., L.-M. SHEN, R. MUKHOPADHYAY, M. ROSETTI, T. PE'ERY et al., 2003 A naturally occurring substitution in human immunodeficiency virus tat increases expression of the viral genome. J. Virol. 77: 8602–8606.
ROKYTA, D. R., P. JOYCE, S. B. CAUDLE and H. A. WICHMAN, 2005 An empirical test of the mutational landscape model of adaptation using a single-stranded DNA virus. Nat. Genet. 37: 441–444.[CrossRef][Medline]
SHANKARAPPA, R., J. B. MARGOLICK, S. J. GANGE, A. G. RODRIGO, D. UPCHURCH et al., 1999 Consistent viral evolutionary changes associated with the progression of human immunodeficiency virus type 1 infection. J. Virol. 73: 10489–10502.
SHRINER, D., A. G. RODRIGO, D. C. NICKLE and J. I. MULLINS, 2004a Pervasive genomic recombination of HIV-1 in vivo. Genetics 167: 1573–1583.
SHRINER, D., R. SHANKARAPPA, M. A. JENSEN, D. C. NICKLE, J. E. MITTLER et al., 2004b Influence of random genetic drift on human immunodeficiency virus type 1 env evolution during chronic infection. Genetics 166: 1155–1164.
SPECK, R. F., K. WEHRLY, E. J. PLATT, R. E. ATCHISON, I. F. CHARO et al., 1997 Selective employment of chemokine receptors as human immunodeficiency virus type 1 coreceptors determined by individual amino acids within the envelope V3 loop. J. Virol. 71: 7136–7139.[Abstract]
TEMPLETON, A. R., R. A. REICHERT, A. E. WEISSTEIN, X.-F. YU and R. B. MARKHAM, 2004 Selection in context: patterns of natural selection in the glycoprotein 120 region of human immunodeficiency virus 1 within infected individuals. Genetics 167: 1547–1561.
UNAIDS, 2004 2004 Report on the Global HIV/AIDS Epidemic: 4th Global Report. Joint United Nations Programme on HIV/AIDS (UNAIDS), Geneva.
WEI, X., J. M. DECKER, S. WANG, H. HUI, J. C. KAPPES et al., 2003 Antibody neutralization and escape by HIV-1. Nature 422: 307–312.[CrossRef][Medline]
WILLIAMSON, S., 2003 Adaptation in the env gene of HIV-1 and evolutionary theories of disease progression. Mol. Biol. Evol. 20: 1318–1325.
YAMAGUCHI, Y., and T. GOJOBORI, 1997 Evolutionary mechanisms and population dynamics of the third variable envelope region of HIV within single hosts. Proc. Natl. Acad. Sci. USA 94: 1264–1269.
YUSTE, E., C. LOPEZ-GALINDEZ and E. DOMINGO, 2000 Unusual distribution of mutations associated with serial bottleneck passages of human immunodeficiency virus type 1. J. Virol. 74: 9546–9552.
ZHANG, L. Q., P. MACKENZIE, A. CLELAND, E. C. HOLMES, A. J. BROWN et al., 1993 Selection for specific sequences in the external envelope protein of human immunodeficiency virus type 1 upon primary infection. J. Virol. 67: 3345–3356.
ZHANG, Y.-W., O. A. RYDER and Y.-P. ZHANG, 2003 Intra- and interspecific variation of the CCR5 gene in higher primates. Mol. Biol. Evol. 20: 1722–1729.
ZOLLA-PAZNER, S., 2004 Identifying epitopes of HIV-1 that induce protective antibodies. Nat. Rev. Immunol. 4: 199–210.[CrossRef][Medline]
Communicating editor: D. BEGUNThis article has been cited by other articles:
![]() |
Y. Liu, J. McNevin, H. Zhao, D. M. Tebit, R. M. Troyer, M. McSweyn, A. K. Ghosh, D. Shriner, E. J. Arts, M. J. McElrath, et al. Evolution of Human Immunodeficiency Virus Type 1 Cytotoxic T-Lymphocyte Epitopes: Fitness-Balanced Escape J. Virol., November 15, 2007; 81(22): 12179 - 12188. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |