- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Shriner, D.
- Articles by Mullins, J. I.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Shriner, D.
- Articles by Mullins, J. I.
Influence of Random Genetic Drift on Human Immunodeficiency Virus Type 1 env Evolution During Chronic Infection
Daniel Shrinera, Raj Shankarappa1,a, Mark A. Jensena, David C. Nicklea, John E. Mittlera, Joseph B. Margolickc, and James I. Mullinsa,ba Department of Microbiology, University of Washington School of Medicine, Seattle, Washington 98195-8070
b Departments of Medicine and Laboratory Medicine, University of Washington School of Medicine, Seattle, Washington 98195-8070
c Department of Molecular Microbiology and Immunology, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, Maryland 21205
Corresponding author: James I. Mullins, University of Washington School of Medicine, Box 358070, Seattle, WA 98195-8070., jmullins{at}u.washington.edu (E-mail)
Communicating editor: D. CHARLESWORTH
| ABSTRACT |
|---|
Human immunodeficiency virus type 1 (HIV-1) has high replication and mutation rates that generate large census populations and high levels of genetic variation. We examined the roles of natural selection, population growth, random genetic drift, and recombination in shaping the variation in 1509 C2V5 env sequences derived from nine men with chronic HIV-1 infection. These sequences were obtained from clinical visits that reflect the first 613.7 years of infection. Pairwise comparisons of nonsynonymous and synonymous distances, Tajima's D test, Fu and Li's D* test, and a test of recurrent mutation revealed evidence for episodes of nonneutral evolution in a total of 22 out of 145 blood samples, representing six of the nine individuals. Using three coalescent-based maximum-likelihood estimators, we found viral effective population sizes in all nine individuals to be
103. We also show that a previous estimate of the effective population size of
105 based on rare haplotype frequencies decreases to
103 upon correcting a biased sampling procedure. We conclude that the genetic variation in these data sets can be explained by a predominance of random genetic drift of neutral mutations with brief episodes of natural selection that were frequently masked by recombination.
A hallmark of human immunodeficiency virus type 1 (HIV-1) infection is a clinical stage of chronic, generally asymptomatic infection of highly variable length, during which time viral genetic variation accumulates (reviewed in ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
The overall role of natural selection in the evolution of env is uncertain. First, the occurrence of viral escape within env during chronic infection and its importance to HIV-1 disease progression have not been critically evaluated in the context of population genetic theory. Second, none of the aforementioned studies demonstrated that putative escape variants fixed in the population faster than one would predict on the basis of random genetic drift alone. Third, HIV-1 has a high rate of recombination (![]()
![]()
![]()
![]()
If natural selection is more important than random genetic drift in shaping env genetic variation, then deterministic models of population genetics might be appropriate (![]()
![]()
108 (![]()
![]()
![]()
105 (![]()
![]()
However, the relevant parameter when modeling genetic changes within populations is the effective, not the census, population size, because it represents the number of individuals who contribute genetic information to successive generations. The effective population size, known as Ne, represents the size of an ideal population that experiences the same magnitude of random genetic drift as the observed population (![]()
![]()
![]()
103, substantially less than the inverse mutation rate. Leigh Brown concluded that the results of the test of neutrality and the low effective population size supported stochastic models of population genetics, wherein random genetic drift may be more important than natural selection in shaping the virus population.
In contrast, analysis of the same sequences by ![]()
105, a population size that supports deterministic models of population genetics. Their analysis used a linkage disequilibrium test based on haplotypes that should be missing in small populations, but are expected to be present in large populations. Recently, ![]()
![]()
In this study, we explored the related issues of the viral effective population size and the roles of natural selection, population growth, random genetic drift, and recombination in shaping env diversity. We analyzed all 1300 sequences from ![]()
| MATERIALS AND METHODS |
|---|
Study samples:
The nine HIV-1 subtype B-infected individuals studied were described previously (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Sequence analysis:
We used four statistical tests of neutrality. In all cases, the null hypothesis was that all mutations were neutral. First, an analysis of nonsynonymous and synonymous distances was performed using MEGA, version 2.1 (![]()
![]()
![]()
![]()
![]()
![]()
Second, TAJIMA's (1989) D test is a test based on the distribution of mutant frequencies. It compares the number of polymorphic sites (S) to the average pairwise number of nucleotide differences (
). Assuming infinite sites (which means that each site may be mutated at most once), no population growth, no recombination, and neutral mutations at equilibrium, ![]()
, in which
is a scalar for n sequences,
= 2NeµL is the mutation rate scaled by the effective population size for a haploid organism, and µ is the mutation rate per generation per site with L sites. Under the same assumptions, ![]()
=
. TAJIMA's (1989) test is therefore based on the expectation that, on average,
(S/an) = 0. Thus, positive test statistics reflect an excess of intermediate-frequency mutations, suggesting diversifying selection, and negative test statistics reflect an excess of low-frequency mutations, suggesting negative selection (![]()
![]()
Third, FU and LI's (1993) D* test, also a test based on the distribution of mutant frequencies, compares the total number of mutations (
) to the number of singletons (
s; i.e., mutations that occur on terminal branches on an unrooted phylogeny) among n sequences.
and
s can be estimated by counting the mutations as they are mapped onto the sample's phylogeny. With the same assumptions as given above for Tajima's test, the test is based on the expectation that, on average, (n/(n 1))
an
s = 0. Thus, similar to Tajima's D test, positive D* test statistics reflect an excess of "internal" or "old" mutations, i.e., mutations at intermediate frequency, and negative test statistics reflect an excess of "external" or "new" mutations, i.e., mutations at low frequency.
For Tajima's and Fu and Li's tests, we ascertained P values that accounted for recombination from null distributions of 10,000 independent replicates that were created by simulating sequences under a neutral coalescent model. We conditioned upon a population-scaled mutation rate
= 38 per locus and a population-scaled recombination rate
= 9 per locus (on the basis of the estimates from the RECOMBINE program, which is described below) with a locus of 639 sites and the appropriate sample size, using DnaSP, version 3.53 (![]()
These three tests of neutrality each looked at the data in different ways. The test based upon synonymous and nonsynonymous distances uses a codon as its unit, and this test is applicable when the population is not in equilibrium. In contrast, Tajima's D test and Fu and Li's D* test both use an individual nucleotide as their unit, but assume that the population is in equilibrium. Fu and Li's D* test is generally more powerful in detecting population shrinkage and background selection (which refers to the variation-reducing effect of a negatively selected polymorphism on linked neutral polymorphism), whereas Tajima's D test is generally more powerful in detecting population growth and hitchhiking (which refers to the variation-reducing effect of a positively selected polymorphism on linked neutral polymorphism; ![]()
![]()
The fourth test addresses the assumption of infinite sites. We first determined which samples had no sites with more than two cosegregating nucleotide states because the variation at such sites can be most parsimoniously explained by one mutational event. For the remaining samples, we tested whether more sites than expected were unambiguously recurrently mutated (i.e., there were more than two cosegregating nucleotide states). To do this, we calculated how many sites were expected to have been recurrently mutated by assuming a Poisson mutational process with a mean conditioned upon the number of total sites and the number of variable sites. P values were generated from 10,000 independent replicates.
Estimating Ne:
Effective population sizes were estimated using three coalescent-likelihood programs from the LAMARC package (![]()
assuming a single panmictic population of constant size without recombination or natural selection (![]()
and g, the exponential growth rate (![]()
and r, the ratio of the recombination rate to the mutation rate, such that
= r
(![]()
were generated from each of the three coalescent-likelihood programs using a transition/transversion ratio estimated from the data (![]()
![]()
for data with recombination under these three sets of assumptions, 10 independent replicates were generated under a neutral coalescent model with recombination, using TREEVOLVE. We conditioned upon a sample size of 10,
= 38/locus,
= 9/locus, and a locus of 639 sites, on the basis of the observed data and the results of RECOMBINE.
A fourth program in the LAMARC package, MIGRATE, relaxes the assumption of panmixis (![]()
![]()
Phylogenetic analysis:
Phylogenetic tree reconstruction and model of evolution estimation were performed in PAUP*, version 4.0 (![]()
![]()
Multiple tests:
Because we statistically tested a large number of samples, our tests needed to be corrected for multiple comparisons. A strict way to correct for multiple comparisons would be to apply a Bonferroni correction, which is accomplished by dividing the significance level by the number of tests performed (![]()
Nucleotide sequence accession numbers:
The GenBank accession numbers are
AF137629,
AF138163,
AF138166,
AF138167,
AF138168,
AF138169,
AF138170,
AF138171,
AF138172,
AF138173,
AF138174,
AF138175,
AF138176,
AF138177,
AF138178,
AF138179,
AF138180,
AF138181,
AF138182,
AF138183,
AF138184,
AF138185,
AF138186,
AF138187,
AF138188,
AF138189,
AF138190,
AF138191,
AF138192,
AF138193,
AF138194,
AF138195,
AF138196,
AF138197,
AF138198,
AF138199,
AF138200,
AF138201,
AF138202,
AF138203,
AF138204,
AF138205,
AF138206,
AF138207,
AF138208,
AF138209,
AF138210,
AF138211,
AF138212,
AF138213,
AF138214,
AF138215,
AF138216,
AF138217,
AF138218,
AF138219,
AF138220,
AF138221,
AF138222,
AF138223,
AF138224,
AF138225,
AF138226,
AF138227,
AF138228,
AF138229,
AF138230,
AF138231,
AF138232,
AF138233,
AF138234,
AF138235,
AF138236,
AF138237,
AF138238,
AF138239,
AF138240,
AF138241,
AF138242,
AF138243,
AF138244,
AF138245,
AF138246,
AF138247,
AF138248,
AF138249,
AF138250,
AF138251,
AF138252,
AF138253,
AF138254,
AF138255,
AF138256,
AF138257,
AF138258,
AF138259,
AF138260,
AF138261,
AF138262,
AF138263,
AF138305,
AF138703, and
AF204402,
AF204670 for the 1300 sequences reported previously (![]()
| RESULTS |
|---|
Testing neutrality:
Each of the three Ne estimators we employed assumed neutrality, so we first addressed this assumption by applying three two-tailed statistical tests. The first test compared nonsynonymous mutations to synonymous mutations. Of 96 viral DNA samples examined (Fig 1, *), we found 1 sample (from participant 6) for which ds exceeded dn and 10 samples (from participants 2, 3, and 9) for which dn exceeded ds. Similarly, of 49 plasma viral RNA samples examined, no samples were found for which ds exceeded dn and 3 samples (from participants 2 and 7) were found for which dn exceeded ds (Fig 1, #). The samples for which neutrality was rejected by this test did not tend to cluster at any particular time during infection (Fig 1). Further, the signal for selection tended to be sporadic, as evident by the single samples in participants 6 and 7 that displayed nonneutral evolution. In summary, by this test, we found evidence for a sporadic excess of synonymous mutations in one of the nine individuals and for sporadic excesses of nonsynonymous mutations in four of the nine individuals.
|
We next addressed the infinite-sites assumption of Tajima's and Fu and Li's tests. Thirty-three samples had no sites with more than two cosegregating nucleotide states (data not shown). For the remaining 112 samples, 1 sample from participant 2 and 3 samples from participant 5 showed significantly more unambiguous recurrent mutation than expected (Fig 1,
). This result indicated that the assumption of a Poisson mutational process is largely valid for these samples. Furthermore, because only 4 samples were found to have experienced significant excesses of unambiguous recurrent mutation, diversifying selection was unlikely to have had a predominant role in the evolution of these sequences.
The preceding analysis underestimates recurrent mutation, because it ignores parallel changes and reversions. Recombination can induce spuriously inferred parallel changes and reversions on the sample's phylogeny. Furthermore, recombination is known to make Tajima's test conservative (![]()
![]()
![]()
|
Using the null distributions with recombination, significant departures from the neutral expectation of Tajima's test (with excesses of low-frequency mutations) were observed for 2 of the 145 samples, both from participant 3 (Fig 1,
). Only 1 sample that yielded significance occurred after genetic diversity stabilized [taken as the time of peak diversity as reported in ![]()
, for the DNA sample and
for the RNA samples). Of these 3, the 2 samples from participant 6 occurred after genetic diversity stabilized. No departures from neutral expectations were detected by either test if recombination remained unacknowledged (data not shown). Accounting for recombination, therefore, revealed stronger evidence for nonneutral evolution in these sequences.
As a further approach for investigating purifying selection, all 1509 sequences were translated and a peptide alignment was constructed. Although in any individual no more than
21% of nucleotide sites were observed to vary at any given time, only one amino acid residue was observed to be 100% conserved (data not shown). This site was G366, which plays a role in CD4 receptor binding (![]()
If HIV-1 populations within infected individuals were experiencing recurring selective sweeps, we would expect to detect fixation events occurring with high frequency. We therefore asked when the first fixation event occurred in each individual. The first fixation event was defined as the first time a sample had a derived nucleotide state reach a frequency of 100% when it started at 0% (sites that were initially polymorphic were disregarded because it was unknown when the initial mutation events occurred). It is the neutral expectation that the first fixation event should occur after an average of 2Ne generations, with variance on the order of Ne2 (![]()
), concurrent with the time when genetic diversity stabilized for four of the individuals studied. For the other five individuals, the first fixation event happened before genetic diversity stabilized, as early as 1.5 years after seroconversion. In none of the individuals did the first fixation event happen after genetic diversity stabilized. The observed mean time to the first fixation event of 3.88 years is not significantly less than the mean time to diversity stabilization of 5.06 years. The observation that fixation occurred at an earlier time in five of the nine individuals, but never later, suggests the presence of positive selection.
In summary, the C2V5 region of HIV-1 env appears to evolve in a manner consistent with neutral expectations in 85% of the samples. However, evidence for episodes of nonneutral evolution was found in seven of the nine individuals. Accounting for the reduction of variance introduced by recombination led to stronger evidence for nonneutral evolution.
Estimating Ne:
Having addressed the assumption of neutrality, we obtained estimates of Ne using three coalescent-likelihood programs. The values of Ne ranged from 311 to 4783 for COALESCE (Fig 1). RECOMBINE, which relaxes the assumption of no recombination, yielded estimates of Ne from 326 to 2886 (Fig 1). As expected, RECOMBINE yielded lower estimates than COALESCE. FLUCTUATE, which relaxes the assumption of a constant effective population size, is known to have an upward bias (![]()
|
Our estimates of Ne are consistent with those of ![]()
![]()
0.2% of pairs would contain 2 highly diverse sites. Thus, their selection of highly diverse sites may have introduced a bias toward higher estimates of diversity and, consequently, higher estimates of Ne.
To test if the effective mutation rate is the same for highly diverse sites as for all sites, we performed a likelihood-ratio test of equal rates across sites vs. gamma-distributed rate heterogeneity across sites (![]()
![]()
2 distribution with 1 d.f. (P = 3.0 x 1011; ![]()
On the basis of the foregoing, we propose that the appropriate mutation rate to use when relating the least frequent haplotype to the product Neµ is the effective mutation rate at the two highly diverse sites in question rather than an average across all sites. From the gamma distribution of rate heterogeneity, we estimated the effective mutation rate for the class of highly diverse sites to be
14, relative to a mean across all sites of 1. Thus, after accounting for Rouzine and Coffin's use of the per site mutation rate µ = 105 instead of 2.5 x 105, the mutation rate they used is 14 x 2.5
35 times too low, yielding estimates of Ne that are therefore
35 times too high. Our proposed corrections estimate Ne to be on the order of 103, alleviating the discrepancy between the estimate based on Rouzine and Coffin's method and estimates based on coalescent theory.
| DISCUSSION |
|---|
Although HIV-1 sequence heterogeneity is widely recognized, its biological origins and consequences are not understood. On the basis of a joint consideration of the results of four statistical tests of neutrality and estimates of the effective population size, we conclude that the genetic variation in the C2V5 region of HIV-1 subtype B env observed in the nine individuals studied can be explained by a predominance of random genetic drift of neutral mutations with brief episodes of natural selection, which were frequently masked by recombination. Of 145 samples, 21 showed a departure from neutral expectations by one of four statistical tests of neutrality, and 1 sample showed departures from neutral expectations by two tests. The remaining 85% of the samples showed no departures from neutral expectations by any of the tests. We estimated Ne to be on the order of
103, consistent with estimates of ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
The four tests we used to detect departures from neutrality included pairwise comparisons of nonsynonymous and synonymous distances, Tajima's D test, Fu and Li's D* test, and a test of recurrent mutation. For both Tajima's and Fu and Li's tests, a negative test statistic that achieves significance indicates an excess of low-frequency mutations. Three possible explanations for an excess of low-frequency mutations are directional selection, purifying selection, or population growth (![]()
![]()
![]()
Our analyses do not support a hypothesis of recurring selective sweeps affecting env during chronic infection. Nucleotide substitutions should happen more rapidly and more frequently when driven by positive selection than when driven by random genetic drift. Thus, we would have expected the first nucleotide substitution to occur much sooner than the average of 3.88 years after seroconversion, assuming that 2Ne generations is 5.06 years. Furthermore, the maintenance of a stable amount of diversity, as shown in Fig 1, implies that the strength of natural selection remains equally stable through chronic infection. It seems highly unlikely that the strength of natural selection should remain constant through time and across individuals.
The estimates of Ne derived from COALESCE and RECOMBINE were remarkably consistent within and among individuals, regardless of the presence or absence of a signal for selection. This suggests that the methods were relatively robust to violations of the assumption of no selection in these sequences. The higher estimates from FLUCTUATE were consistent with inappropriately attempting to explain recombination by a model of population growth. The rate heterogeneity we invoked to correct Rouzine and Coffin's estimate to 103 can be explained by selection and/or recombination (![]()
![]()
average = fcon
con + flow
low + fhigh
high, in which fcon, flow, and fhigh are the frequencies of constant, low-diversity, and high-diversity sites, respectively, and
con,
low, and
high are the estimates for
if one applied Rouzine and Coffin's test to haplotype data from these sites.
average could then be used in conjunction with the average per site mutation rate to estimate Ne.
Previously, ![]()
- In the deterministic case, Ne is high, and selection is more important than random genetic drift. In this scenario, the stabilization of genetic diversity reflects an equilibrium in which mutant frequencies equal the mutation rate divided by the selection coefficient (
ROUZINE et al. 2001 ).
- In the stochastic case, Ne is low, and random genetic drift is more important than selection. In this scenario, the stabilization of genetic diversity reflects the equilibrium between neutral mutations and random genetic drift that is expected to occur after an average of 2Ne generations (
RODRIGO and FELSENSTEIN 1999 ).
- In the alternative stochastic case, Ne is low, but selection is more important than random genetic drift. In this scenario, recurrent selective sweeps (presumably due to immune pressure) reduce diversity and a true equilibrium is never reached. Under this explanation, the reduction in diversity is due to the hitchhiking of neutral mutations linked to the mutation conferring the selective advantage (
MAYNARD SMITH and HAIGH 1974 ;
KAPLAN et al. 1989 ).
Our findings of low Ne values, a predominance of random genetic drift over selection, and a lack of early fixation events favor explanation 2, reflecting a highly stochastic evolutionary process.
TAJIMA's (1989) D test and FU and LI's (1993) D* test are applicable only for equilibrium populations. Therefore, although we performed these tests on all of the 145 samples, the presence of significant results among the 70 samples derived from study visits before the time when genetic diversity stabilized may simply reflect preequilibrium conditions. In Fig 1, values of Ne for all samples are shown; however, as with the two tests, samples prior to the peak of viral diversity do not satisfy the assumption of an equilibrium population. Therefore, estimates of Ne derived from these samples are included to depict the expected accumulation of diversity prior to equilibrium starting from near homogeneity (![]()
The basis for the five or more orders of magnitude difference between the census and the effective viral population size is not understood. One partial explanation is a low ratio of infectious units to virus particles (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
We note that the course of sampling in the individuals we studied included periods of emergence and dominance in the population of viruses with predicted altered cell tropism, periods of low-impact antiviral therapy, clinical AIDS, and terminal disease (![]()
![]()
![]()
![]()
| FOOTNOTES |
|---|
1 Present address: Center for Genomic Sciences, Allegheny-Singer Research Institute, Pittsburgh, PA 15212. ![]()
| ACKNOWLEDGMENTS |
|---|
We thank Yang Wang for helpful discussions and Allen G. Rodrigo and Gerald H. Learn, Jr. for critically reviewing this manuscript. We thank Dr. Learn for assisting with sequence management. We thank the two anonymous reviewers for their comments. D.S. was a Howard Hughes Medical Institute Predoctoral Fellow. This work was supported by grants from the U.S. Public Health Service and the University of Washington Center for AIDS Research.
Manuscript received January 14, 2003; Accepted for publication December 12, 2003.
| LITERATURE CITED |
|---|
ANISIMOVA, M., R. NIELSEN, and Z. YANG, 2003 Effect of recombination on the accuracy of the likelihood method for detecting positive selection at amino acid sites. Genetics 164:1229-1236.
BEERLI, P. and J. FELSENSTEIN, 1999 Maximum-likelihood estimation of migration rates and effective population numbers in two populations using a coalescent approach. Genetics 152:763-773.
BORROW, P., H. LEWICKI, X. WEI, M. S. HORWITZ, and N. PEFFER et al., 1997 Antiviral pressure exerted by HIV-1-specific cytotoxic T lymphocytes (CTLs) during primary infection demonstrated by rapid selection of CTL escape virus. Nat. Med. 3:205-211.[CrossRef][Medline]
BRAVERMAN, J. M., R. R. HUDSON, N. L. KAPLAN, C. H. LANGLEY, and W. STEPHAN, 1995 The hitchhiking effect on the site frequency spectrum of DNA polymorphisms. Genetics 140:783-796.[Abstract]
COFFIN, J. M., 1995 HIV population dynamics in vivo: implications for genetic variation, pathogenesis, and therapy. Science 267:483-489.
DIMITROV, D. S., R. L. WILLEY, H. SATO, L.-J. CHANG, and R. BLUMENTHAL et al., 1993 Quantitation of human immunodeficiency virus type 1 infection kinetics. J. Virol. 67:2182-2190.
EVANS, D. T., D. H. O'CONNOR, P. JING, J. L. DZURIS, and J. SIDNEY et al., 1999 Virus-specific cytotoxic T-lymphocyte responses select for amino-acid variation in simian immunodeficiency virus Env and Nef. Nat. Med. 5:1270-1276.[CrossRef][Medline]
FROST, S. D., M. J. DUMAURIER, S. WAIN-HOBSON, and A. J. BROWN, 2001 Genetic drift and within-host metapopulation dynamics of HIV-1 infection. Proc. Natl. Acad. Sci. USA 98:6975-6980.
FU, Y.-X., 1996 New statistical tests of neutrality for DNA samples from a population. Genetics 143:557-570.[Abstract]
FU, Y.-X., 1997 Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics 147:915-925.[Abstract]
FU, Y.-X. and W.-H. LI, 1993 Statistical tests of neutrality of mutations. Genetics 133:693-709.[Abstract]
GANGE, S. J., A. MUÑOZ, J. S. CHMIEL, A. D. DONNENBERG, and L. M. KIRSTEIN et al., 1998 Identification of inflections in T-cell counts among HIV-1-infected individuals and relationship with progression to clinical AIDS. Proc. Natl. Acad. Sci. USA 95:10848-10853.
GROSSMAN, Z., M. B. FEINBERG, and W. E. PAUL, 1998 Multiple modes of cellular activation and virus transmission in HIV infection: a role for chronically and latently infected cells in sustaining viral replication. Proc. Natl. Acad. Sci. USA 95:6314-6319.
HARTL, D. L., and A. G. CLARK, 1989 Principles of Population Genetics, Ed. 2. Sinauer Associates, Sunderland, MA.
HO, D. D., A. U. NEUMANN, A. S. PERELSON, W. CHEN, and J. M. LEONARD et al., 1995 Rapid turnover of plasma virions and CD4 lymphocytes in HIV-1 infection. Nature 373:123-126.[CrossRef][Medline]
HUELSENBECK, J. P. and B. RANNALA, 1997 Phylogenetic methods come of age: testing hypotheses in an evolutionary context. Science 276:227-232.
KAPLAN, N. L., R. R. HUDSON, and C. H. LANGLEY, 1989 The "hitchhiking effect" revisited. Genetics 123:887-899.
KASLOW, R. A., D. G. OSTROW, R. DETELS, J. P. PHAIR, and B. F. POLK et al., 1987 The Multicenter AIDS Cohort Study: rationale, organization, and selected characteristics of the participants. Am. J. Epidemiol. 126:310-318.
KIMURA, M., 1983 The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge, UK/London/New York.
KUHNER, M. K., J. YAMATO, P. BEERLI and J. FELSENSTEIN, 1995a LAMARClikelihood analysis with metropolis algorithm using random coalescence (http://evolution.genetics.washington.edu/lamarc.html).
KUHNER, M. K., J. YAMATO, and J. FELSENSTEIN, 1995b Estimating effective population size and mutation rate from sequence data using Metropolis-Hastings sampling. Genetics 140:1421-1430.[Abstract]
KUHNER, M. K., J. YAMATO, and J. FELSENSTEIN, 1998 Maximum likelihood estimation of population growth rates based on the coalescent. Genetics 149:429-434.
KUHNER, M. K., J. YAMATO, and J. FELSENSTEIN, 2000 Maximum likelihood estimation of recombination rates from population data. Genetics 156:1393-1401.
KUMAR, S., K. TAMURA, I. B. JAKOBSEN, and M. NEI, 2001 MEGA2: molecular evolutionary genetics analysis software. Bioinformatics 17:1244-1245.
KWONG, P. D., R. WYATT, J. ROBINSON, R. W. SWEET, and J. SODROSKI et al., 1998 Structure of an HIV gp120 envelope glycoprotein in complex with the CD4 receptor and a neutralizing human antibody. Nature 393:648-659.[CrossRef][Medline]
LAYNE, S. P., M. J. MERGES, M. DEMBO, J. L. SPOUGE, and S. R. CONLEY et al., 1992 Factors underlying spontaneous inactivation and susceptibility to neutralization of human immunodeficiency virus. Virology 189:695-714.[CrossRef][Medline]
LEIGH BROWN, A. J., 1997 Analysis of HIV-1 env gene sequences reveals evidence for a low effective number in the viral population. Proc. Natl. Acad. Sci. USA 94:1862-1865.
LI, C. C., 1976 First Course in Population Genetics, pp. 567592. Boxwood Press, Pacific Grove, CA.
LI, W.-H., 1977 Distribution of nucleotide differences between two randomly chosen cistrons in a finite population. Genetics 85:331-337.
MANSKY, L. M., 1996 Forward mutation rate of human immunodeficiency virus type 1 in a T lymphoid cell line. AIDS Res. Hum. Retroviruses 12:307-314.[Medline]
MARGOLICK, J. B., A. MUÑOZ, A. D. DONNENBERG, L. P. PARK, and N. GALAI et al., 1995 Failure of T-cell homeostasis preceding AIDS in HIV-1 infection. Nat. Med. 1:674-680.[CrossRef][Medline]
MARUYAMA, T. and M. KIMURA, 1980 Genetic variability and effective population size when local extinction and recolonization of subpopulations are frequent. Proc. Natl. Acad. Sci. USA 77:6710-6714.
MAYNARD SMITH, J. and J. HAIGH, 1974 The hitch-hiking effect of a favourable gene. Genet. Res. 23:23-35.[Medline]
MITTLER, J. E., M. MARKOWITZ, D. D. HO, and A. S. PERELSON, 1999 Improved estimates for HIV-1 clearance rate and intracellular delay. AIDS 13:1415-1417.[CrossRef][Medline]
NEI, M. and T. GOJOBORI, 1986 Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol. Biol. Evol. 3:418-426.[Abstract]
NEI, M. and L. JIN, 1989 Variances of the average numbers of nucleotide substitutions within and between populations. Mol. Biol. Evol. 6:290-300.[Abstract]
NEI, M., and S. KUMAR, 2000 Molecular Evolution and Phylogenetics. Oxford University Press, New York.
NIELSEN, R. and Z. YANG, 1998 Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. Genetics 148:929-936.
NOWAK, M. A., R. M. MAY, R. E. PHILLIPS, S. ROWLAND-JONES, and D. G. LALLOO et al., 1995 Antigenic oscillations and shifting immunodominance in HIV-1 infections. Nature 375:606-611.[CrossRef][Medline]
NOWAK, M. A., R. M. ANDERSON, M. C. BOERLIJST, S. BONHOEFFER, and R. M. MAY et al., 1996 HIV-1 evolution and disease progression. Science 274:1008-1011.[CrossRef][Medline]
OVERBAUGH, J. and C. R. BANGHAM, 2001 Selection forces and constraints on retroviral sequence variation. Science 292:1106-1109.
PANNELL, J. R., 2003 Coalescence in a metapopulation with recurrent local extinction and recolonization. Evolution 57:949-961.[CrossRef][Medline]
PERELSON, A. S., A. U. NEUMANN, M. MARKOWITZ, J. M. LEONARD, and D. D. HO, 1996 HIV-1 dynamics in vivo: virion clearance rate, infected cell life-span, and viral generation time. Science 271:1582-1586.[Abstract]
PIATAK, M., JR., M. S. SAAG, L. C. YANG, S. J. CLARK, and J. C. KAPPES et al., 1993 High levels of HIV-1 in plasma during all stages of infection determined by competitive PCR. Science 259:1749-1754.
RAMRATNAM, B., S. BONHOEFFER, J. BINLEY, A. HURLEY, and L. ZHANG et al., 1999 Rapid production and clearance of HIV-1 and hepatitis C virus assessed by large volume plasma apheresis. Lancet 354:1782-1785.[CrossRef][Medline]
RICE, W. R., 1989 Analyzing tables of statistical tests. Evolution 43:223-225.[CrossRef]
RINALDO, C. R., JR., P. GUPTA, X. HUANG, Z. FAN, and J. I. MULLINS et al., 1998 Anti-HIV-1 memory cytotoxic T lymphocyte responses associated with changes in CD4+ T cell numbers in the progression of HIV-1 infection. AIDS Res. Hum. Retroviruses 14:1423-1433.[Medline]
RODRIGO, A. G., and J. FELSENSTEIN, 1999 Coalescent approaches to HIV population genetics, pp. 233272 in The Evolution of HIV, edited by K. A. CRANDALL. Johns Hopkins University Press, Baltimore.
ROUZINE, I. M. and J. M. COFFIN, 1999 Linkage disequilibrium test implies a large effective population number for HIV in vivo. Proc. Natl. Acad. Sci. USA 96:10758-10763.
ROUZINE, I. M., A. RODRIGO, and J. M. COFFIN, 2001 Transition between stochastic evolution and deterministic evolution in the presence of selection: general theory and application to virology. Microbiol. Mol. Biol. Rev. 65:151-185.
ROZAS, J. and R. ROZAS, 1999 DnaSP version 3: an integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics 15:174-175.
SCHIERUP, M. H. and J. HEIN, 2000 Consequences of recombination on traditional phylogenetic analysis. Genetics 156:879-891.
SEIBERT, S. A., C. Y. HOWELL, M. K. HUGHES, and A. L. HUGHES, 1995 Natural selection on the gag, pol, and env ge


