- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Kuhner, M. K.
- Articles by Felsenstein, J.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Kuhner, M. K.
- Articles by Felsenstein, J.
Maximum Likelihood Estimation of Population Growth Rates Based on the Coalescent
Mary K. Kuhnera, Jon Yamatoa, and Joseph Felsensteinaa Department of Genetics, University of Washington, Seattle, Washington 98195
Corresponding author: Mary K. Kuhner, University of Washington, Department of Genetics, Box 357360, Seattle, WA 98195-7360, mkkuhner{at}genetics.washington.edu (E-mail).
Communicating editor: S. TAVARÉ
| ABSTRACT |
|---|
We describe a method for co-estimating 4Neµ (four times the product of effective population size and neutral mutation rate) and population growth rate from sequence samples using Metropolis-Hastings sampling. Population growth (or decline) is assumed to be exponential. The estimates of growth rate are biased upwards, especially when 4Neµ is low; there is also a slight upwards bias in the estimate of 4Neµ itself due to correlation between the parameters. This bias cannot be attributed solely to Metropolis-Hastings sampling but appears to be an inherent property of the estimator and is expected to appear in any approach which estimates growth rate from genealogy structure. Sampling additional unlinked loci is much more effective in reducing the bias than increasing the number or length of sequences from the same locus.
THE genealogical structure of a sample from a population contains information about that population's history. The distribution of coalescence times (times at which two of the sampled individuals have a common ancestor) depends on the effective population size Ne: in a diploid population the distribution is proportional to 4Ne. Since coalescence times cannot be directly observed in most cases, but only inferred from the accumulation of mutations, we rescale time proportional to the per-site neutral rate µ. Thus, though we cannot estimate 4Ne directly, we can estimate the product 4Neµ which we will call
.
If the population size has changed over time, the distribution of coalescence times will differ from its expectation in a population where
is constant, and in principle this should be detectable. In particular, if the population has been growing the most rootward branches will be relatively short, whereas if it has been shrinking the most rootward branches will be relatively long.
We have previously described a method for estimating
in a population of constant size (![]()
![]()
![]()
, and then use the sampled genealogies to evaluate the relative likelihood of other values of
. This importance sampling approach concentrates the sampled genealogies in regions of high posterior probability, which is much more efficient than using random genealogies and avoids the bias of using only a single genealogy reconstruction. This algorithm is implemented in our Coalesce program.
In this paper we extend the Metropolis-Hastings genealogy sampling approach to the case of a population experiencing exponential growth or decline. In this case population size is represented by two parameters: the exponential growth rate g and the present-day value of
(that is, the value at the time when the organisms were sampled). The parameters are not independent: the more rapidly a population has grown, the larger its current size is expected to be compared to its "average" size. We have written a program, Fluctuate, which implements this sampler.
Both analytic and simulation results show that the estimate of the growth rate g is biased upwards when a finite number of individuals are analyzed. At least two factors are at work in this bias: the nonlinear relationship between coalescence times and the estimate of g, and truncation of the coalescent distribution, in genealogies of finite numbers of individuals, by the bottommost coalescence. There is also a smaller upwards bias in
due to the correlation between the two parameters. The bias in these estimators can most effectively be reduced by sampling multiple loci.
The method proposed by ![]()
![]()
![]()
![]()
![]()
| MATERIALS AND METHODS |
|---|
The Metropolis-Hastings genealogy sampler for constant-sized populations (![]()
, called
0. In the first phase, a new genealogy is created by locally rearranging the previous genealogy in proportion to the coalescent prior probability P(G|
0) (given by ![]()
![]()
0)P(D|G). This process is repeated, with samples taken from it at intervals to produce a set of genealogies from which a maximum likelihood of
can be made. The estimate is most efficient when
0 is close to
, so it is useful to run several iterations of the sampler, using the estimated
of each iteration as the starting
0 of the next.
Like most calculations involving the coalescent, these equations hold exactly only in the limit as the population size N goes to infinity: in practice the approximation involved should be insignificant as long as the number of individuals sampled is less than the square root of the population size.
Mutational model:
We used the DNA/RNA sequence model of ![]()
![]()
Scaling for population growth:
When the size of the population changes exponentially through time, the coalescent prior becomes P(G|
0, g0) where g0 is a trial value of the exponential growth rate g. (Positive values of g indicate population growth, and negative values indicate decline.) The units of g are 1/µ generations.
In order to sample coalescence times from this prior, we use a time rescaling under which it becomes identical to the simpler constant-population prior. Time is scaled proportional to growth, so that the same expected amount of coalescence occurs in one unit of time regardless of population size. Under this transformation, the coalescent structure of the genealogy becomes identical to the constant-population expectation.
The rescaled time T is derived from the original time t by the following relation (![]()
This rescaled time is then substituted for ordinary time in constructing rearrangements of the genealogy. In cases where g is less than zero, some proportion of the rescaled times will correspond to infinite ordinary time. Our implementation rejects genealogies which contain infinite times, on the grounds that their likelihood for biologically reasonable data will tend to be very small. An upwards bias may be created by this procedure, but in practice it should be trivial.
A series of genealogies generated under a given
0 and g0 can be used to determine the likelihood L(
,g) for other values of
and g. For each genealogy G a product is taken over all coalescence intervals i: in each interval, k is the number of lineages in the genealogy during that interval, ts is the time at the tipward end, and te is the time at the rootward end. Note that these are not rescaled times:
This formula can be shown to be equivalent to that given in GRIFFITHS and TAVARÉ 1994), bearing in mind that they scaled time in units of N generations rather than 1/µ generations and they considered a haploid rather than a diploid case: they also retained some combinatorial constants which we omit, since we are concerned only with ratios of probabilities.
This probability is then corrected for the importance sampling function P(D|G)P(G|
0,g0) (where n is the number of genealogies sampled):
[The terms P(D|G) drop out as they are the same for all values of
and g.]
The maximum of this function, which is a joint maximum likelihood estimate of
and g, can be found by standard methods. Technical difficulties are often encountered due to arithmetic overflow in exponentiation and the characteristic curving-ridge shape of the likelihood surface.
Multiple loci:
The likelihoods can be multiplied together across unlinked loci to generate an overall multi-locus likelihood. Doing so should greatly improve the efficiency of the estimate, especially for g, since doubling the number of loci doubles the amount of information available about the most rootward parts of the genealogy (which are the most informative for growth rate, since they represent the population size most divergent from the modern-day size). Adding additional sequences mainly adds information about the most tipward parts of the genealogy, which contain relatively little information about growth.
If the loci to be combined cannot be assumed to have the same values for the parameters, this must be taken into account when combining them. It is reasonable to assume that the population growth rate affects all loci equally (barring selection), but both the neutral mutation rate µ and the effective population size Ne can vary among loci (for example, Ne is lower for a mitochondrial locus than for a nuclear one).
This can easily be accommodated if the relative values of the parameters for different loci are known (or can be assumed): we simply replace Ne and µ with appropriate locus-dependent functions when calculating the multi-locus likelihood. In the future, a method for dealing with unknown variability in µ among loci could be developed by assuming Gamma distributions for the parameters and integrating over the range of possibilities.
Assessing the accuracy of the estimate:
An advantage of likelihood methods is that information about the accuracy of the estimate can be gleaned from the likelihood curve. We will consider the confidence interval as the set of all parameter values which would not be rejected (via a likelihood ratio test) at the given level. Asymptotically, as the number of loci approaches infinity, the shape of the likelihood curve becomes Gaussian (normal) and we can construct a variance for it using a
2 metric with two degrees of freedom (![]()
Such confidence intervals will be approximate at best for finite numbers of loci. It is not obvious a priori whether bias present in the maximum likelihood parameter estimates will also strongly affect the confidence intervals. We have not solved this problem analytically, but we can assess the usefulness of the approximate confidence intervals by simulation.
Simulation procedures:
Each simulation consisted of 100 replicates. Genealogies of 25 sequences were randomly generated according to given values of
and g, and DNA data were generated randomly from these genealogies using a Kimura 2-parameter model (![]()
we used Watterson's estimate (![]()
![]()
We also performed simulations in which we made maximum likelihood estimates assuming that the true genealogy was known without error. This is equivalent to using infinitely long sequences, since with such sequences the Metropolis-Hastings sampler should unerringly generate the true genealogy. We have called these results "infinite sites" in the Tables.
For each estimation, we noted whether or not the log likelihood for the true
and g was within three units of the log likelihood at the maximum, i.e., whether or not the truth could be rejected at the approximate 95% level.
| RESULTS |
|---|
Table 1 shows results from simulation tests of Fluctuate. We do not present results for the case of
= 0.01, g = 100 with finite numbers of sites because data sets simulated at these values frequently contained no variable sites. On theoretical grounds we expect an invariant data set to produce a zero estimate of
and an indeterminate estimate of g (all values are equally likely).
|
Cases where g is negative entail the possibility that infinite time will be required for coalescence when simulating the genealogy. The probability that this will happen depends on the product of
and g. In practice, the case of
= 0.01, g = -10.0 could be simulated (less than 1% failure to coalesce), but in the case of
= 0.1 a substantial fraction of simulated genealogies failed to coalesce in finite time, and so no results are presented.
In general, estimates of g showed a strong upwards bias, decreasing somewhat with number of sites and more markedly with number of loci. The only exception was the case of
= 0.1, g = 100 in which the estimates appear biased downwards with finite amounts of data, possibly due to saturation of variable sites. The standard deviation of g was much less for high true values of
than for low ones, even with infinite numbers of sites.
Estimates of
also tended to be biased upwards, in contrast to the constant-population case, in which they appear nearly unbiased (![]()
With few exceptions, doubling the number of loci was more effective in reducing bias and standard deviation than doubling the number of sites.
In most cases the true values of
and g were rejected at the 95% level slightly more often than the desired 5%.
Table 2 shows comparable results, for the case in which the true g was zero, from the program Coalesce (![]()
.
|
| DISCUSSION |
|---|
Why is the estimate of g biased? We have identified two processes that contribute to this bias. Both are intrinsic to the estimation of exponential growth from genealogical data and are not due to the Metropolis-Hastings sampler itself: they can be shown in simple cases that do not require any of the Metropolis-Hastings machinery.
One component of the bias results from the nonlinear relationship between the coalescence times and the estimate of g. A simple two-sequence case provides a concrete demonstration. In genealogies of two tips where the true growth rate is zero and
is known without error, the distribution of the coalescence time t follows directly from coalescent theory (![]()
![]()
values (Table 3). The distribution of
is highly skewed, with a mean far above the true value. Essentially, the nonlinear relationship between t and
transforms variance in t into bias in
. Thus, bias is expected not only in our method but in any method that uses t (or measurements depending on it, such as number of mutations) as a basis for estimating exponential growth. For example, the star-phylogeny method of ![]()
|
However, even in the absence of variability in coalescence times some bias is present. Table 4 shows results based on analysis of a "perfect" coalescent genealogy, in which each interval has exactly its expected length; there is no variance in t. A bias is clearly visible in Table 4, although the 95% confidence intervals do include the true value. This component of the bias results from the fact that any genealogy with finite tips truncates the distribution of coalescence times; it has a "final coalescence" at the root, prior to which no further information is available. This presents likelihood estimation with an attractive hypothesis involving a population bottleneck at the time of the final coalescence; such a hypothesis has high likelihood because it maximizes the probability of the final event. Attraction towards this degenerate hypothesis produces a bias in
.
|
Correctness of the sampler:
It is difficult to prove a complex computer program correct, but we tested Fluctuate in several ways to help assure ourselves that the observed bias was not due to program error. If the sampler is run with 100% acceptance (that is, the data are ignored and every proposed genealogy accepted) the genealogies produced should be an autocorrelated but otherwise random sample from a coalescent distribution with the given
and g. We examined large samples of such genealogies and found them consistent with the random coalescent (data not shown). We also tested the sampler with g = 0 and found its results substantively identical to our previous program Coalesce which dealt with the constant-population case (data not shown). Based on these tests, we believe the sampler to be correct. In any case, as is shown in Table 3 and Table 4, bias would be expected in a perfectly functioning sampler.
Overcoming bias:
Given that this method (and other methods involving use of t to estimate g) has bias, how can the most accurate results be obtained? Table 3 and Table 4 show clearly that adding additional sites or sequences is ineffectual, whereas adding additional unlinked loci rapidly reduces the bias. Each new locus will provide additional information about the region of the early branchings, thereby fleshing out this part of the distribution, and the independent variation in coalescence times among loci helps counteract the bias introduced by non-linearity.
It appears that the small bias seen in
is a consequence of correlation between
and g, since it does not appear when g is held constant at zero (as in Coalesce). One positive aspect of these findings is that it is quite possible to estimate current
accurately even if the population has been growing or shrinking; the bias in
is small even when g is far from zero.
Future directions:
Real biological populations often grow or decline in ways more complicated than simple exponential growth, but the bias in the estimator interferes with attempts to fit more complex models. For example, one could imagine fitting a two-stage model with exponential growth followed by a steady-state period; however, because of the sparseness of the rootward part of the genealogy this model would be attracted to wrong solutions featuring very rapid early growth. It is possible that using a sufficiently large number of loci would allow such models to work.
Since relatively little power is available for estimating growth, attempts to differentiate between different models of growth (for example, exponential versus geometric or linear) are unlikely to succeed with reasonably sized data sets. In principle, however, this method could accommodate any growth model for which the time transformation can be worked out.
The algorithm can readily be adapted to data types other than nucleotide sequence data, such as protein sequences, allozyme alleles, or restriction site polymorphisms, as long as an appropriate evolutionary model is available.
It is possible to extend this family of algorithms by including recombination, which will greatly facilitate the analysis of nuclear loci. This may also allow a single long locus to provide some of the advantages of multiple loci, since recombination turns the single genealogy into several partially correlated genealogies. However, the algorithm with recombination will be technically challenging due to the more complex data structures and rearrangement scheme required. ![]()
Availability of software:
The Metropolis-Hastings Monte Carlo algorithm described here is available from the authors as program Fluctuate in the package Lamarc, which uses the same input/output formats as the Phylip package. (The program is written in C and can by obtained by anonymous ftp at evolution.genetics.washington.edu in directory pub/lamarc or via the World Wide Web at http://evolution.genetics.washington.edu/lamarc.html.)
| ACKNOWLEDGMENTS |
|---|
We thank MONTY SLATKIN and SIMON TAVARÉ for helpful discussion and PETER BEERLI for assistance in finding maxima of likelihood surfaces. We also thank the Organizing Committee of the fourth annual meeting of the Society of Molecular Biology and Evolution for inviting the first author to a highly productive meeting. This research was supported by National Science Foundation grants BIR-8918333 and DEB-9207558 and National Institutes of Health grant 2-R55GM41716-04 (all to J.F.).
Manuscript received February 24, 1997; Accepted for publication January 2, 1998.
| LITERATURE CITED |
|---|
COX, D. R., and D. V. HINKLEY, 1974 Theoretical Statistics. Chapman and Hill, London.
FELSENSTEIN, J., 1981 Evolutionary trees from DNA sequences: a maximum likelihood approach. J. Mol. Evol. 17:368-376[Medline].
FELSENSTEIN, J., 1992 Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimates. Genet. Res. 59:139-147[Medline].
FELSENSTEIN, J., 1993 PHYLIP (Phylogeny Inference Package) version 3.5c. Distributed by the author. Department of Genetics, University of Washington, Seattle.
FELSENSTEIN, J. and G. CHURCHILL, 1996 A Hidden Markov Model approach to variation among sites in rate of evolution. Mol. Biol. Evol. 13:93-104[Abstract].
GRIFFITHS, R. C. and P. MARJORAM, 1996 Ancestral inference from samples of DNA sequences with recombination. J. Comput. Biol. 3:479-502[Medline].
GRIFFITHS, R. C. and S. TAVARÉ, 1994 Sampling theory for neutral alleles in a varying environment. Philos. Trans. R. Soc. Lond. B Biol. Sci. 344:403-410[Medline].
HASTINGS, W. K., 1970 Monte Carlo sampling methods using Markov chains and their applications. Biometrika 57:97-109
KIMURA, M., 1980 A simple model for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J. Mol. Evol. 16:111-120[Medline].
KINGMAN, J. F. C., 1982a The coalescent. Stochastic Processes and Their Applications 13:235-248.
KINGMAN, J. F. C., 1982b On the genealogy of large population. J. Appl. Probab. 19A:27-43.
KUHNER, M., J. YAMATO, and J. FELSENSTEIN, 1995 Estimating effective population size and mutation rate from sequence data using Metropolis-Hastings sampling. Genetics 140:1421-1430[Abstract].
METROPOLIS, N., A. W. ROSENBLUTH, M. N. ROSENBLUTH, A. H. TELLER, and E. TELLER, 1953 Equations of state calculations by fast computing machines. J. Chem. Phys. 21:1087-1092.
ROGERS, A. R. and H. HARPENDING, 1992 Population growth makes waves in the distribution of pairwise genetic differences. Mol. Biol. Evol. 9:552-569[Abstract].
SLATKIN, M. and R. R. HUDSON, 1991 Pairwise comparisons of mitochondrial DNA sequences in stable and exponentially growing populations. Genetics 129:555-562[Abstract].
WATTERSON, G. A., 1975 On the number of segregating sites in genetical models without recombination. Theor. Popul. Biol. 7:256-276[Medline].
This article has been cited by other articles:
![]() |
V. N. Minin, E. W. Bloomquist, and M. A. Suchard Smooth Skyride through a Rough Skyline: Bayesian Coalescent-Based Inference of Population Dynamics Mol. Biol. Evol., July 1, 2008; 25(7): 1459 - 1471. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Vasco A Fast and Reliable Computational Method for Estimating Population Genetic Parameters Genetics, June 1, 2008; 179(2): 951 - 963. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. M. Anthony, M. Johnson-Bawe, K. Jeffery, S. L. Clifford, K. A. Abernethy, C. E. Tutin, S. A. Lahm, L. J. T. White, J. F. Utley, E. J. Wickings, et al. The role of Pleistocene refugia and rivers in shaping gorilla genetic diversity in central Africa PNAS, December 18, 2007; 104(51): 20432 - 20436. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Meligkotsidou and P. Fearnhead Postprocessing of Genealogical Trees Genetics, September 1, 2007; 177(1): 347 - 358. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Delport, T. M. Crowe, P. Lloyd, and P. Bloomer Population Growth Confounds Phylogeographic Inference in Namaqua Sandgrouse J. Hered., March 1, 2007; 98(2): 158 - 164. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hey and R. Nielsen Integration within the Felsenstein equation for improved Markov chain Monte Carlo methods in population genetics PNAS, February 20, 2007; 104(8): 2785 - 2790. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. K. Kuhner and L. P. Smith Comparing Likelihood and Bayesian Coalescent Estimation of Population Parameters Genetics, January 1, 2007; 175(1): 155 - 165. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. K. Kuhner LAMARC 2.0: maximum likelihood and Bayesian estimation of population parameters Bioinformatics, March 15, 2006; 22(6): 768 - 770. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Felsenstein Accuracy of Coalescent Likelihood Estimates: Do We Need More Sites, More Sequences, or More Loci? Mol. Biol. Evol., March 1, 2006; 23(3): 691 - 700. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. W. Bowen, A. Muss, L. A. Rocha, and W. S. Grant Shallow mtDNA Coalescence in Atlantic Pygmy Angelfishes (Genus Centropyge) Indicates a Recent Invasion from the Indian Ocean J. Hered., January 1, 2006; 97(1): 1 - 12. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. F. Voight, A. M. Adams, L. A. Frisse, Y. Qian, R. R. Hudson, and A. Di Rienzo Interrogating multiple aspects of variation in a full resequencing data set to infer human population size changes PNAS, December 20, 2005; 102(51): 18508 - 18513. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. C. Leman, Y. Chen, J. E. Stajich, M. A. F. Noor, and M. K. Uyenoyama Likelihoods From Summary Statistics: Recent Divergence Between Species Genetics, November 1, 2005; 171(3): 1419 - 1436. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Drummond, A. Rambaut, B. Shapiro, and O. G. Pybus Bayesian Coalescent Inference of Past Population Dynamics from Molecular Sequences Mol. Biol. Evol., May 1, 2005; 22(5): 1185 - 1192. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Innan, K. Zhang, P. Marjoram, S. Tavare, and N. A. Rosenberg Statistical Tests of the Coalescent Model Based on the Haplotype Frequency Distribution and the Number of Segregating Sites Genetics, March 1, 2005; 169(3): 1763 - 1777. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Adams and R. R. Hudson Maximum-Likelihood Estimation of Demographic Parameters Using the Frequency Spectrum of Unlinked Single-Nucleotide Polymorphisms Genetics, November 1, 2004; 168(3): 1699 - 1712. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Lemey, O. G. Pybus, A. Rambaut, A. J. Drummond, D. L. Robertson, P. Roques, M. Worobey, and A.-M. Vandamme The Molecular Population Genetics of HIV-1 Group O Genetics, July 1, 2004; 167(3): 1059 - 1068. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. B. Beheregaray, J. P. Gibbs, N. Havill, T. H. Fritts, J. R. Powell, and A. Caccone Giant tortoises are not so slow: Rapid diversification and biogeographic consensus in the Galapagos PNAS, April 27, 2004; 101(17): 6514 - 6519. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Arnason Mitochondrial Cytochrome b DNA Variation in the High-Fecundity Atlantic Cod: Trans-Atlantic Clines and Shallow Gene Genealogy Genetics, April 1, 2004; 166(4): 1871 - 1885. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Shriner, R. Shankarappa, M. A. Jensen, D. C. Nickle, J. E. Mittler, J. B. Margolick, and J. I. Mullins Influence of Random Genetic Drift on Human Immunodeficiency Virus Type 1 env Evolution During Chronic Infection Genetics, March 1, 2004; 166(3): 1155 - 1164. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. N. Bennett, E. C. Holmes, M. Chirivella, D. M. Rodriguez, M. Beltran, V. Vorndam, D. J. Gubler, and W. O. McMillan Selection-Driven Evolution of Emergent Dengue Virus Mol. Biol. Evol., October 1, 2003; 20(10): 1650 - 1658. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. P. Lessa, J. A. Cook, and J. L. Patton From The Cover: Genetic footprints of demographic expansion in North America, but not Amazonia, during the Late Quaternary PNAS, September 2, 2003; 100(18): 10331 - 10334. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Joy, X. Feng, J. Mu, T. Furuya, K. Chotivanich, A. U. Krettli, M. Ho, A. Wang, N. J. White, E. Suh, et al. Early Origin and Recent Expansion of Plasmodium falciparum Science, April 11, 2003; 300(5617): 318 - 321. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. J. Hayes, P. M. Visscher, H. C. McPartlan, and M. E. Goddard Novel Multilocus Measure of Linkage Disequilibrium to Estimate Past Effective Population Size Genome Res., April 1, 2003; 13(4): 635 - 643. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. G. Pybus, A. J. Drummond, T. Nakano, B. H. Robertson, and A. Rambaut The Epidemiology and Iatrogenic Transmission of Hepatitis C Virus in Egypt: A Bayesian Coalescent Approach Mol. Biol. Evol., March 1, 2003; 20(3): 381 - 387. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. E. Ramos-Onsins and J. Rozas Statistical Properties of New Neutrality Tests Against Population Growth Mol. Biol. Evol., December 1, 2002; 19(12): 2092 - 2100. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Drummond, G. K. Nicholls, A. G. Rodrigo, and W. Solomon Estimating Mutation Parameters, Population History and Genealogy Simultaneously From Temporally Spaced Sequence Data Genetics, July 1, 2002; 161(3): 1307 - 1320. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Tang, D. O. Siegmund, P. Shen, P. J. Oefner, and M. W. Feldman Frequentist Estimation of Coalescence Times From Nucleotide Sequence Data Using a Tree-Based Partition Genetics, May 1, 2002; 161(1): 447 - 459. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Hugall, C. Moritz, A. Moussalli, and J. Stanisic Reconciling paleodistribution models and comparative phylogeography in the Wet Tropics rainforest land snail Gnarosophia bellendenkerensis (Brazier 1875) PNAS, April 30, 2002; 99(9): 6112 - 6117. [Abstract] [Full Text] [PDF] |
||||
![]() |
T.-K. Seo, J. L. Thorne, M. Hasegawa, and H. Kishino Estimation of Effective Population Size of HIV-1 Within a Host: A Pseudomaximum-Likelihood Approach Genetics, April 1, 2002; 160(4): 1283 - 1293. [Abstract] [Full Text] [PDF] |
||||
![]() |
W.-G. Qiu, D. E. Dykhuizen, M. S. Acosta, and B. J. Luft Geographic Uniformity of the Lyme Disease Spirochete (Borrelia burgdorferi) and Its Shared History With Tick Vector (Ixodes scapularis) in the Northeastern United States Genetics, March 1, 2002; 160(3): 833 - 849. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Strimmer and O. G. Pybus Exploring the Demographic History of DNA Sequences Using the Generalized Skyline Plot Mol. Biol. Evol., December 1, 2001; 18(12): 2298 - 2305. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Fearnhead and P. Donnelly Estimating Recombination Rates From Population Genetic Data Genetics, November 1, 2001; 159(3): 1299 - 1318. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Nielsen Mutations as Missing Data: Inferences on the Ages and Distributions of Nonsynonymous and Synonymous Mutations Genetics, September 1, 2001; 159(1): 401 - 411. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. Wares Biogeography of Asterias: North Atlantic Climate Change and Speciation Biol. Bull., August 1, 2001; 201(1): 95 - 103. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Drummond, R. Forsberg, and A. G. Rodrigo The Inference of Stepwise Changes in Substitution Rates Using Serial Sequence Samples Mol. Biol. Evol., July 1, 2001; 18(7): 1365 - 1371. [Abstract] [Full Text] [PDF] |
||||







