Assessing Allelic Dropout and Genotype Reliability Using Maximum Likelihood
Craig R. Miller (a), Paul Joyce (b), and Lisette P. Waits (a)
(a) Department of Fish and Wildlife, College of Natural Resources, University of Idaho, Moscow, Idaho 83844
(b) Department of Mathematics, Division of Statistics, University of Idaho, Moscow, Idaho 83844
Corresponding author: Craig R. Miller, College of Natural Resources, University of Idaho, Moscow, ID 83844. E-mail: mill8560{at}uidaho.edu
Communicating editor: S. TAVARÉ
| ABSTRACT |
|---|
A growing number of population genetic studies utilize nuclear DNA microsatellite data from museum specimens and noninvasive sources. Genotyping errors are elevated in these low-quantity DNA sources, potentially compromising the power and accuracy of the data. The most conservative method for addressing this problem is effective, but requires extensive replication of individual genotypes. In search of a more efficient method, we developed a maximum-likelihood approach that minimizes errors by estimating genotype reliability and strategically directing replication at loci most likely to harbor errors. The model assumes that false and contaminant alleles can be removed from the dataset and that the allelic dropout rate is even across loci. Simulations demonstrate that the proposed method marks a vast improvement in efficiency while maintaining accuracy. When allelic dropout rates are low (0-30%), the reduction in the number of PCR replicates is typically 40-50%. The model is robust to moderate violations of the even dropout rate assumption. For datasets that contain false and contaminant alleles, a replication strategy is proposed. Our current model addresses only allelic dropout, the most prevalent source of genotyping error. However, the developed likelihood framework can incorporate additional error-generating processes as they become more clearly understood.
THE extraction and amplification of DNA from museum, noninvasive, and forensic sources has great potential for studying and managing wild populations.
The cause of allelic dropout is believed to be stochastic sampling error.
[...] by making the integer i arbitrarily large: i ≥ 1 - (ln α/ln 2). If α = 0.05, i ≥ 6, and if α = 0.01, i ≥ 8.
If multiple loci are considered simultaneously, then an acceptance error at any heterozygous locus renders the genotype erroneous. The worst-case rationale can be extended to multiple loci by casting it as a decision rule, that is, a procedure specified before collecting any data that will yield a correct genotype with probability ≥ 1 - α. The procedure accomplishing this is one that renders a correct genotype with probability = 1 - α under the worst possible scenario (a dropout rate of 1 and all loci heterozygous). Under these circumstances, the probability of obtaining a correct multilocus genotype is
$$P(C) = \left[1 - \left(\tfrac{1}{2}\right)^{i-1}\right]^{L} \qquad (1)$$
where C is the event that the genotype is typed correctly, L is the number of loci, and i is the number of replicates (see Appendix for proof; the word "replicates" is used to specify the per locus number of reactions). This probability can be made arbitrarily large by making i arbitrarily large: i ≥ 1 - [ln(1 - (1 - α)^(1/L))/ln 2] (see Appendix). If L = 8 then 9 replicates are required to meet the α = 0.05 criterion and 11 replicates are required to meet the α = 0.01 criterion. For the remainder of this article, this is called the "worst-case rule" (WCR).
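As a quick check on the replicate counts quoted above, the following sketch computes the smallest number of replicates that satisfies the worst-case rule. It assumes that the worst-case probability of a correct multilocus genotype is [1 - (1/2)^(i-1)]^L, which reproduces the 9 and 11 replicates cited for eight loci. The function name `wcr_replicates` and the parameter names are illustrative and are not taken from the original analysis.

```python
import math


def wcr_replicates(alpha: float, n_loci: int = 1) -> int:
    """Smallest i such that [1 - (1/2)**(i - 1)]**n_loci >= 1 - alpha.

    Assumes the worst case: dropout in every reaction and every locus
    heterozygous, so a locus is mistyped only if all i replicates show
    the same allele.
    """
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must lie strictly between 0 and 1")
    # Largest per-locus probability of a miss that still meets the criterion.
    per_locus_miss = 1.0 - (1.0 - alpha) ** (1.0 / n_loci)
    # Solve (1/2)**(i - 1) <= per_locus_miss for the integer i.
    bound = 1.0 - math.log(per_locus_miss) / math.log(2.0)
    return math.ceil(bound)


if __name__ == "__main__":
    for loci in (1, 8):
        for alpha in (0.05, 0.01):
            print(loci, alpha, wcr_replicates(alpha, loci))
```

Because the required i grows only logarithmically in L, most of the cost of the rule comes from applying it to every observed homozygous locus rather than from the number of loci itself.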
There are both practical and statistical shortcomings to a worst-case approach. Pragmatically, it leads to the need to perform large numbers of replicates. Acquiring accurate genetic information on a population will often involve typing hundreds or thousands of samples. As is shown, a study using eight loci and the WCR could easily require an average of 35 reactions per sample. This equates to 35,000 single reactions to accurately genotype 1000 samples. The financial costs associated with this number would prohibit many noninvasive and historical genetic studies. Furthermore, the limited amount of DNA extract may be consumed before 30 or 40 reactions can be performed. The statistical problem with a worst-case approach is that it makes essentially no use of the data in hand.
In contrast to ignoring the available data, [...]
What are the implications of committing genotyping errors? Clearly, a high genotyping error rate could bias most current applications of microsatellites, including genetic mark-recapture studies, forensic identification of individuals, parentage analysis, population assignment, and estimates of population substructure. D. ROON, L. WAITS and K. KENDALL (unpublished data) and [...]
| METHODS |
|---|
General approach:
Before developing the proposed approach, it is helpful to overview the rationale behind it. Suppose that an individual is genotyped at each of a number of diploid loci i times. Assuming that contamination and false alleles do not occur or can be removed from the data (see CONCLUSIONS AND OUTSTANDING ISSUES), the observation of two different alleles at a locus implies that the individual is a heterozygote. If only one allele is observed, however, then the individual may either be a true homozygote or it may be a heterozygote at which i dropouts of the same allele have occurred. The probability of the latter event can be estimated as it is a function of the probability that the two copies differ (i.e., the heterozygosity) and the probability of a dropout. If the allele frequencies are known and Hardy-Weinberg equilibrium is assumed, then the heterozygosity (conditional on the allele observed) is readily obtained. The dropout probability for the sample at hand can be estimated by finding that dropout rate that makes the observed data most likely. The dropout rate must be estimated for each sample because samples differ in age, environmental exposure, etc., and therefore in quality and quantity of DNA. The (un)reliability of the observed multilocus genotype can then be estimated by weighting the probability that sequential dropout errors have occurred by the probability that the locus is heterozygous multiplied across the observed homozygous loci. Samples that are not reliable must be replicated until they are.
The model:
Consider a population where all loci under study are independent, at Hardy-Weinberg equilibrium, and have known allele frequencies. The model involves two sampling events. First, during reproduction alleles are sampled from the gamete pool and fixed into individuals. Second, during the PCR alleles are sampled from individuals across loci. Let i_j denote the number of times sampling occurs at locus j. Each time, one copy drops out with probability p_j. If this error occurs, we assume that each copy is equally likely to be the dropout. Otherwise, both copies are observed. We do not consider the event that both copies drop out because PCR failure may have causes other than stochastic sampling error, such as PCR reagents, thermocycler problems, etc. Hence, p_j is actually the conditional probability of detecting one allele given that at least one allele amplifies. (It can be shown that, for realistic error rates, neglecting double dropouts has a negligible effect.) If the two copies are labeled a and b (where a could be the same or different from b), the results can be summarized as the number of times in which a, b, and ab are sampled: r_{a,j}, r_{b,j}, and r_{ab,j}, respectively. Let the vector of these counts be denoted by r_j. When the individual's true genotype, g, is known, the likelihood of the data given the genotype and given the dropout rates is trinomial multiplied across the T heterozygous loci.
$$L(p_1,\ldots,p_T \mid \mathbf{r}_1,\ldots,\mathbf{r}_T, g) = \prod_{j=1}^{T} \frac{i_j!}{r_{a,j}!\,r_{b,j}!\,r_{ab,j}!} \left(\frac{p_j}{2}\right)^{r_{a,j}} \left(\frac{p_j}{2}\right)^{r_{b,j}} (1-p_j)^{r_{ab,j}} \qquad (2)$$
Truly homozygous loci can be ignored because the probability of the data at them is 1.
Of course, the true genotype is not known. This is addressed by writing the likelihood as the sum of genotype-specific probabilities weighted by the unconditional probability of the genotype (i.e., its expected frequency) over all possible genotypes. In addition, the model is greatly simplified by assuming that across all samples the dropout rates at different loci are related to one another by a collection of constants such that p_j = c_j p. Then
$$L(p \mid \mathbf{r}_1,\ldots,\mathbf{r}_L) = \sum_{g} P(g)\,P(\mathbf{r}_1,\ldots,\mathbf{r}_L \mid g, c_1 p,\ldots,c_L p) \qquad (3)$$
where the conditional probability of the data has the trinomial form of Equation 2 with p_j = c_j p. If the dropout rates are equal across loci then c_j = 1 for all j and the likelihood reduces to
$$L(p \mid \mathbf{r}_1,\ldots,\mathbf{r}_L) = \sum_{g} P(g)\,P(\mathbf{r}_1,\ldots,\mathbf{r}_L \mid g, p) \qquad (4)$$
Although all subsequent theory and analyses in this article are based on the assumption that the dropout rate is even across loci, this assumption can be relaxed for any of the derivations that follow by substituting c_j p in for p.
It is the reliability of the genotype, not the dropout rate, that is of interest to the investigator. Let E_j be the event that the observed genotype is correct at locus j and let E_g be the event that it is correct across all loci. Let f_j denote the frequency of the observed allele at locus j. Note that it is at the M loci observed as homozygous where errors may be hidden. The reliability of a genotype, P(E_g), is given by
$$P(E_g) = \prod_{j=1}^{M} P(E_j) = \prod_{j=1}^{M} \frac{f_j}{f_j + 2(1-f_j)\left(p/2\right)^{i_j}} \qquad (5)$$
For purposes of study design, the unconditional probability that a genotype will be correctly identified is useful. Let Z_l be the event that the lth locus is heterozygous, Z_l^C the event that the lth locus is homozygous, H_l the heterozygosity at locus l, and L the number of loci typed. Then
$$P(E_g) = \prod_{l=1}^{L} \left[P(E_l \mid Z_l)P(Z_l) + P(E_l \mid Z_l^C)P(Z_l^C)\right] = \prod_{l=1}^{L} \left\{\left[1 - 2\left(\frac{p}{2}\right)^{i_l}\right]H_l + (1 - H_l)\right\} \qquad (6)$$
By comparing Equation 5 and Equation 6 we see how (5) is dependent on the observed data while (6) is not. The dependency on the data occurs in two ways. Since it cannot be determined before viewing the data which loci will be observed as homozygous, the observed allele with frequency f_j at homozygous locus j is itself data dependent. Also, note that the total number of observed homozygous loci, M, is a random variable that is determined only after viewing the data.
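The distinction between the data-dependent and the unconditional reliability can be made concrete with a short sketch. It is written under stated assumptions rather than taken from the original program: under Hardy-Weinberg equilibrium, a locus showing only an allele of frequency f in all i replicates is a true homozygote with probability f / (f + 2(1 - f)(p/2)^i), and a true heterozygote is typed correctly with probability 1 - 2(p/2)^i. All function and parameter names are hypothetical.

```python
from typing import Sequence, Tuple


def locus_reliability(f: float, p: float, i: int) -> float:
    """P(true homozygote | only one allele, of frequency f, seen in i replicates).

    Assumed Bayes form under Hardy-Weinberg equilibrium: prior f**2 for the
    homozygote versus 2*f*(1 - f) for a heterozygote whose other allele
    dropped out in every one of the i replicates, each with chance p/2.
    """
    hidden_heterozygote = 2.0 * (1.0 - f) * (p / 2.0) ** i
    return f / (f + hidden_heterozygote)


def genotype_reliability(homozygous_loci: Sequence[Tuple[float, int]], p: float) -> float:
    """Data-dependent reliability: product over observed homozygous loci.

    homozygous_loci holds (allele frequency, number of replicates) pairs.
    """
    reliability = 1.0
    for f, i in homozygous_loci:
        reliability *= locus_reliability(f, p, i)
    return reliability


def unconditional_reliability(heterozygosities: Sequence[float],
                              replicates: Sequence[int], p: float) -> float:
    """Probability of a correct genotype before any data are seen."""
    reliability = 1.0
    for h, i in zip(heterozygosities, replicates):
        correct_if_heterozygous = 1.0 - 2.0 * (p / 2.0) ** i
        reliability *= correct_if_heterozygous * h + (1.0 - h)
    return reliability


if __name__ == "__main__":
    # Six loci, heterozygosity 2/3, two replicates each, dropout rate 0.4.
    print(unconditional_reliability([2 / 3] * 6, [2] * 6, 0.4))
    # Two observed homozygous loci with allele frequencies 0.2 and 0.5.
    print(genotype_reliability([(0.2, 2), (0.5, 2)], 0.4))
```

The first function needs the observed alleles, so it can be evaluated only after genotyping; the last needs only heterozygosities and a guessed dropout rate, which is why it suits budgeting.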
Estimating the dropout rate, genotype reliability, and number of additional replicates:
The value of p that maximizes Equation 4 is the maximum-likelihood estimate (MLE), p̂. If p̂ is substituted for p into Equation 5 and p̂ ≥ p, then accepting only those genotypes that exceed a reliability criterion (1 - β) of, say, 95% would limit the long-run frequency of accepting false genotypes to ≤5%. But when p > p̂, substituting p̂ into Equation 5 overstates the reliability, a nonconservative error. This error can be guarded against by using an upper confidence bound on p in Equation 5, p̂(up). Formally, we define the estimated reliability,
$$\text{Estimated reliability} = \prod_{j=1}^{M} \frac{f_j}{f_j + 2(1-f_j)\left(\hat{p}_{(up)}/2\right)^{i_j}} \qquad (7)$$
where p̂(up) is chosen so that P(Reliability ≤ Estimated reliability) = β. A method for finding p̂(up) is described in the simulation section below. This general approach of estimating the reliability on the basis of a MLE of the dropout rate is referred to as the MLR method.
Suppose that between one and three PCR replicates are conducted initially at each locus and the reliability is estimated. If the estimated reliability is <1 - β, then further replication is necessary, but how much and at which loci? The answer lies in Equation 7. Because the estimated reliability is a product across individual loci, it follows that the largest per reaction increase in estimated reliability will occur by adding a replicate to the most unreliable locus. In theory, the most efficient procedure is to add one replicate to the most unreliable locus, reestimate the reliability, and continue in this manner until the estimated reliability ≥ 1 - β. Adding replicates one at a time and reevaluating the data between each is called the "single addition method" (SAM). When the (re)evaluation entails the MLR method as just described, the abbreviation MLRSAM is used.
In the laboratory, however, SAM will usually be impractical as it entails a PCR, gel, and analysis between every additional reaction. A more practical approach is to add additional replicates in a block. The following algorithm provides an efficient way to choose the size of the block: mathematically add one to the number of replicates at the most unreliable locus, assume that the same homozygote is observed, and reestimate the reliability using the original p̂(up). Repeat this process until the estimated reliability ≥ 1 - β. Perform this pattern of replication in the laboratory. Unless p̂(up) increases substantially with the new data, this method will yield a genotype estimated as reliable. If p̂(up) does jump and the genotype is still estimated as unreliable, perform another block of replicates. Adding replicates en masse is called the "blockwise addition method" (BAM). When the block size and acceptability criteria are based on the MLR method, the abbreviation MLRBAM is used. (The MLR prefix is necessary here because SAM and BAM are also used in conjunction with the WCR evaluation criteria.)
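The block-size algorithm described above can be sketched as follows. This is a minimal illustration, not the authors' program: it assumes that the per-locus reliability of an observed homozygote with allele frequency f and i replicates is f / (f + 2(1 - f)(p/2)^i), and the names `plan_block`, `p_up`, and `criterion` are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def locus_reliability(f: float, p: float, i: int) -> float:
    """Assumed per-locus reliability of an observed homozygote (HWE, dropout rate p)."""
    return f / (f + 2.0 * (1.0 - f) * (p / 2.0) ** i)


def plan_block(loci: Sequence[Tuple[float, int]], p_up: float,
               criterion: float = 0.95, max_extra: int = 200) -> List[int]:
    """Return the number of extra replicates to run at each observed homozygous locus.

    loci holds (observed allele frequency, replicates already run) pairs.
    Replicates are added "on paper" to the least reliable locus, assuming the
    same homozygote keeps appearing and keeping p_up fixed, until the product
    of per-locus reliabilities reaches the criterion.
    """
    reps = [i for _, i in loci]
    extra = [0] * len(loci)

    def per_locus() -> List[float]:
        return [locus_reliability(f, p_up, i) for (f, _), i in zip(loci, reps)]

    reliabilities = per_locus()
    while math.prod(reliabilities) < criterion:
        if sum(extra) >= max_extra:
            raise RuntimeError("criterion not reached within max_extra replicates")
        worst = reliabilities.index(min(reliabilities))  # most unreliable locus
        reps[worst] += 1
        extra[worst] += 1
        reliabilities = per_locus()
    return extra


if __name__ == "__main__":
    # Two observed homozygous loci, two initial replicates each, upper bound 0.7.
    print(plan_block([(0.2, 2), (0.5, 2)], p_up=0.7))
```

Loci whose observed allele is rare receive more of the block, because a rare allele is more likely to come from a heterozygote than from a homozygote.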
Simulations:
Simulations are used here for two basic purposes: (i) to find the upper confidence bound on p, p̂(up), to be used in Equation 7 and (ii) to evaluate and compare the performances of the various approaches. Specific methods to these ends are detailed in the subsections below, but the same basic simulation algorithm is used throughout. First, model parameters are fixed. These include the number of loci, number of initial replicates, allele frequencies, and the dropout rate. Unless otherwise noted, the dropout rate is equal across loci. In each run a multilocus genotype is created and then sampled with errors as described in The model above, to yield the data. A computer algorithm is used to search the potential dropout rates between zero and one at 1% increments until the MLE from Equation 4 is found. Simulations are carried out using Visual Basic 6.0.
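The basic simulation step and the grid search for the MLE can be sketched in Python (the original program was written in Visual Basic 6.0 and is not reproduced here). The sketch assumes an even dropout rate, Hardy-Weinberg genotype frequencies, and a per-locus likelihood in which an observed heterozygote contributes 2·f_a·f_b·(p/2)^(single-allele results)·(1 - p)^(two-allele results) and an observed homozygote contributes f² + 2f(1 - f)(p/2)^i. Multinomial coefficients are dropped because they do not depend on p. All names are illustrative.

```python
import random
from typing import List, Sequence, Tuple

Observation = Tuple[int, ...]  # alleles seen in one PCR replicate


def simulate_sample(freqs: Sequence[Sequence[float]], n_reps: int, p: float,
                    rng: random.Random):
    """Draw one multilocus genotype under HWE, then type it n_reps times with dropout."""
    genotype, data = [], []
    for f in freqs:
        a, b = rng.choices(range(len(f)), weights=f, k=2)
        genotype.append(tuple(sorted((a, b))))
        observations: List[Observation] = []
        for _ in range(n_reps):
            if a != b and rng.random() < p:
                observations.append((rng.choice((a, b)),))  # one copy dropped out
            else:
                observations.append(tuple(sorted({a, b})))
        data.append(observations)
    return genotype, data


def locus_likelihood(observations: Sequence[Observation], f: Sequence[float],
                     p: float) -> float:
    """Likelihood of one locus's data, summed over genotypes consistent with it."""
    seen = sorted({allele for obs in observations for allele in obs})
    i = len(observations)
    if len(seen) == 2:  # both alleles observed: the genotype is known
        a, b = seen
        singles = sum(1 for obs in observations if len(obs) == 1)
        return 2.0 * f[a] * f[b] * (p / 2.0) ** singles * (1.0 - p) ** (i - singles)
    a = seen[0]  # one allele only: homozygote or hidden heterozygote
    return f[a] ** 2 + 2.0 * f[a] * (1.0 - f[a]) * (p / 2.0) ** i


def mle_dropout(data, freqs) -> float:
    """Search dropout rates from 0 to 1 at 1% increments for the maximum likelihood."""
    best_p, best_likelihood = 0.0, -1.0
    for step in range(101):
        p = step / 100
        likelihood = 1.0
        for observations, f in zip(data, freqs):
            likelihood *= locus_likelihood(observations, f, p)
        if likelihood > best_likelihood:
            best_p, best_likelihood = p, likelihood
    return best_p


if __name__ == "__main__":
    rng = random.Random(2002)
    freqs = [[1 / 3, 1 / 3, 1 / 3]] * 6  # six loci, heterozygosity 67%
    genotype, data = simulate_sample(freqs, n_reps=2, p=0.3, rng=rng)
    print(genotype, mle_dropout(data, freqs))
```

Repeating `simulate_sample` and `mle_dropout` many times for a fixed p yields the sampling distribution of the estimate, which the bias, standard error, and confidence-bound calculations all draw on.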
Performance of the MLE: The performance of Equation 4 in estimating the dropout rate is evaluated by conducting 1000 runs for a given set of parameters and then calculating the (estimated) bias and standard error. Simulations are run to assess the effects of the number of loci (three to six), number of replicates (one to three), heterozygosity (18-67%), and the parametric dropout rate (0-1) on the performance.
Upper confidence bound on the dropout rate:
There are two steps in determining p̂(up). First we need to know how large p could be and still yield estimates as small or smaller than p̂ only a specified proportion of the time (γ). Thus the upper 1 - γ confidence bound on p [denoted p̂(up1-γ)] is defined as the value of p that would yield MLEs ≤ p̂, γ(100)% of the time. Because the data are discrete, there may be a range of p's that meet this criterion; p̂(up1-γ) is the largest of these. Finding p̂(up1-γ) requires knowing the sampling distribution of p̂ for any given p, number of replicates, number of loci, and allele frequencies. When repeated many times, the algorithm described in the first paragraph of this subsection on simulations generates just this. Candidate values of p are examined individually until the largest one having ≥γ of its probability ≤ p̂ is found. This is achieved by a computer algorithm that searches the candidate values of p at 1% increments.
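The search for the upper bound can be outlined as below. The sketch is deliberately generic: `sample_mle` is a hypothetical callable that simulates one dataset at a candidate dropout rate (with the study's number of loci, replicates, and allele frequencies) and returns its MLE, and `gamma` is a label chosen here for the target proportion of estimates at or below the observed one. The fallback to the observed estimate when no candidate qualifies is also an assumption of this sketch.

```python
import random
from typing import Callable, Optional


def upper_bound(p_hat: float,
                sample_mle: Callable[[float, random.Random], float],
                gamma: float,
                n_runs: int = 1000,
                rng: Optional[random.Random] = None) -> float:
    """Largest candidate p (1% grid) whose simulated MLEs fall at or below
    p_hat in at least a proportion gamma of runs.

    Returns p_hat itself if no larger candidate qualifies.
    """
    rng = rng or random.Random()
    bound = p_hat
    for step in range(101):
        p = step / 100
        # Monte Carlo estimate of P(MLE <= p_hat) when the true rate is p.
        hits = sum(sample_mle(p, rng) <= p_hat for _ in range(n_runs))
        if hits / n_runs >= gamma and p > bound:
            bound = p
    return bound


if __name__ == "__main__":
    # Toy estimator for demonstration only: the MLE is the true rate plus noise.
    def noisy(p: float, rng: random.Random) -> float:
        return min(1.0, max(0.0, p + rng.gauss(0.0, 0.1)))

    print(upper_bound(0.30, noisy, gamma=0.30, n_runs=500, rng=random.Random(7)))
```

With a real simulator in place of the toy one, each candidate costs `n_runs` simulated datasets, so caching the sampling distribution for each p is worthwhile when many samples share the same design.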
This establishes how p̂(up1-γ) can be determined for any value of γ; the second task is to determine what γ renders a rate of false inclusions ≤ β. If the reliability were a deterministic function of p, then a 1 - β upper bound on p would render a 1 - β lower bound on the reliability. However, the reliability is a stochastic function depending on both p and the data. As is shown, a 1 - β of 95% corresponds to a 1 - γ of only 70%. There does not appear to be an analytic method for finding γ for a given β. Instead, simulations are employed to test how different upper bounds limit the rate of false inclusions across a range of reasonable conditions. In these simulations, the upper bound (1 - γ), number of loci, number of initial replicates, allele frequencies, and the reliability criterion (1 - β) are fixed. In each run, data are generated by performing the initial replicates, determining p̂(up1-γ), and estimating the reliability according to Equation 7. If the estimated reliability is <1 - β, then SAM is used to add replicates until the estimated reliability is ≥ 1 - β. Upon acceptance, the true and observed genotypes are compared to yield a binary result of either correct or incorrect. This process is repeated 1000 times and the observed incidence of false inclusions is calculated. The appropriate upper bound is defined as the smallest value of 1 - γ that limits the observed incidence of false inclusions to ≤ β for all values of p. Upper bounds are searched at 5% increments.
Efficiency of the different approaches: Four different methods (MLRSAM, MLRBAM, WCRSAM, and WCRBAM) are evaluated and compared by running simulations under common parametric conditions (i.e., fixed dropout rate, number of loci, allele frequencies, and number of initial replicates). In all simulations, the data are evaluated after the initial reactions have been performed. In the MLR simulations, a sample estimated to be unreliable is replicated at observed homozygous loci either in a single (SAM) or a blockwise (BAM) fashion as described above until it is estimated as reliable. By comparing it to the true genotype, the accepted genotype is scored as either correct or incorrect and the number of reactions invested in it is recorded. This is repeated 1000 times to yield an observed incidence of false inclusions and a mean number of reactions per sample.
The WCR simulations are analogous except in the rules governing acceptance and additional replication. Under the WCRSAM, all observed homozygous loci are replicated once and reevaluated, replicated once and reevaluated, and so on. Loci that do not turn up heterozygous by this process are replicated until Equation 1 is satisfied. With the WCRBAM, one block of reactions is added to all observed homozygous loci so that Equation 1 is satisfied. Because the BAM is more practical in the lab, further simulations focus on the MLRBAM and WCRBAM approaches. Specifically, the effects of heterozygosity (50-80%), number of loci (four to eight), number of initial replicates (one to three), and reliability criteria (95-99.9%) on the number of reactions per sample are explored.
Interlocus dropout heterogeneity: Simulations are conducted to investigate how well the MLRBAM approach performs when it is assumed that the dropout rates are even across loci, but they are not. This is accomplished by running successive simulations where the dropout rates across loci are made increasingly uneven but other parameters are held constant. Reliability is estimated under the assumption of dropout rate homogeneity (i.e., using Equation 4 and Equation 7), and replicates are added using BAM. Each simulation consists of 1000 runs from which the incidence of false inclusions is calculated.
| RESULTS AND DISCUSSION |
|---|
Performance of the MLE:
When the number of loci is less than four, genotypes are unreplicated, or the heterozygosity is <50%, there is little information in the data regarding the dropout rate. In this case, the MLE from Equation 4 tends to be biased high for small values of p (data not shown). While no bias is desirable, overestimating the dropout rate is a conservative error. Above these values (or for p > 0.5 below these values), the estimator becomes approximately unbiased. As expected in a binomial model, the largest standard errors are observed when p is near 50%.
Upper confidence bound on the dropout rate:
To make a conservative estimate of the reliability, we need a sufficiently conservative estimate of the dropout rate. Preliminary simulations showed that, irrespective of what upper bound is used, the incidence of false inclusions is highest when p is between 0.5 and 0.8 (data not shown). We therefore concentrated on finding the appropriate upper bound for p in this range as it will be sufficiently large for other values of p. When a 95% reliability criterion is required for acceptance (i.e., 1 - β = 0.95) and two initial replicates are performed, the approximate upper bound on p for four, six, or eight loci with H = 50 or 67% is between 65 and 75% (Table 1). Increasing the reliability criterion to 99% elevates the upper bounds slightly to between 70 and 75%. In the case of three initial replicates, the appropriate upper bounds are in the 60-70% range, while with initially unreplicated data upper bounds are between 75 and 85% (data not shown). These results are used to set the upper bound in subsequent simulations.
Efficiency of the different approaches:
This study is motivated largely by the apparent inefficiency of the worst-case approach. The central question is, therefore, how efficient is the proposed MLR method by comparison? Consider a case where two initial replicates are performed at six loci, all with 67% heterozygosity. The reliability criterion is set at 1 - β = 95% for MLR simulations and the probability of a correct genotype is likewise set at 1 - α = 95% for the WCR simulations. In comparing the mean number of total reactions required to achieve acceptable genotypes under each of the four methods (Fig 1A), several important trends emerge. First, the MLR methods are virtually always more efficient than the WCR method of the same replication strategy. Second, differences between the MLR and the WCR methods are largest when p = 0 and they disappear as p approaches 1. For p ≤ 0.2, the MLR methods require 10-12 reactions fewer than the WCR methods (a 40-50% reduction). We do note, however, that the total number of reactions in this and all other simulations in this article does not include any failed PCR reactions. While PCR failures should be rare for samples with low values of p, failures will increase the total reaction counts and reduce the proportional difference between approaches. Third, the SAM and BAM approaches are nearly equivalent for small values of p, but SAM is increasingly superior as p approaches 1.
This third trend is important because BAM is far more practical than SAM in the laboratory and because dropout rates should be <0.5 for most samples. For example, [...]
The WCR is generally inefficient because it is designed to guard against a worst possible scenario, p = 1. When p is not large, the payoff of (over)replicating is, of course, that it renders genotypes with high estimated reliabilities and very few errors (Fig 1, B and C). In contrast, the MLR methods yield moderate estimated reliabilities and an incidence of false inclusion below, but generally not far below, β. This is true of all the MLR simulations in this article except those involving interlocus dropout heterogeneity (see below). To put these false inclusion rates in perspective, if genotypes in these simulations were unconditionally accepted after initial replication (i.e., without being subject to a reliability criterion) the incidence of erroneous genotypes would be approximately 0, 7, 29, 54, 75, and 93% for p = 0, 0.2, 0.4, 0.6, 0.8, and 1, respectively.
It might be argued that the lower false inclusion rate observed with WCR approaches over most of the range of p suggests it actually outperforms the MLR approaches in one respect. Recall, however, that the investigator is willing to tolerate up to 5% errors to reduce the number of reactions. If the investigator wishes to reduce the incidence of false inclusions below 5%, the reliability criterion in the MLR methods can simply be raised. Fig 2 shows the number of replicates required to achieve estimated reliabilities of 95, 99, and 99.9% with the MLRBAM approach for six loci, two initial replicates, and H = 67%. When p is low, only a few more reactions are required to achieve the higher estimated reliabilities, and even at p = 0.4, 99% estimated reliability requires just four reactions more than 95% estimated reliability. As p gets large the cost of higher estimated reliability grows considerably.
In addition to assuming that p = 1, the WCR assumes that H = 1. As these assumptions are approached, we expect the relative performance of the WCR methods to improve. Examining the comparative effect of heterozygosity on efficiency in the WCRBAM and MLRBAM approaches shows this to be true (Fig 3). But even at H = 80%, MLRBAM outperforms WCRBAM across the range where most samples will realistically fall (for p < 70%). When H = 50% and p is small, MLRBAM renders acceptable genotypes in approximately one-half as many reactions. Interestingly, the efficiency of the MLRBAM approach is only slightly improved by increasing heterozygosity.
Because an error anywhere in a genotype renders it erroneous, adding loci elevates the estimated reliability required of each locus and thereby the number of replicates per locus. The impact of this effect was investigated by running simulations at four and eight loci while holding all other parameters constant (two initial replicates, H = 67%; Fig 4). Surprisingly, when p is near zero, doubling the number of loci approximately doubles the total number of reactions in the MLRBAM approach. This near linear increase reflects a near constancy in per locus replication. As p increases, however, the cost of adding loci escalates in a nonlinear manner (as indicated by the divergent MLRBAM lines in the figure). Fig 4 also shows that unless p is near one, adding loci increases the disparity between the MLRBAM and WCRBAM approaches.
One parameter that affects efficiency and is easily manipulated by the investigator is the number of initial replicates per locus. [...] 0.8. The two- and three-replicate results under the WCRBAM approach are even less efficient except when p is near 1.
Interlocus dropout heterogeneity:
All simulations to this point have assumed that dropout rates are equal across loci. Here we address a simple question: How well does MLRBAM perform when the dropout rates are assumed to be even but they are not? Fig 6A shows the observed incidence of false inclusions under MLRBAM for increasing degrees of dropout rate heterogeneity when the other parameters are fixed (six loci, H = 67%, two initial replicates). An upper bound of 70% is used as would be appropriate when the error rates genuinely are homogeneous (Table 1). Even in the moderately uneven case where two of the loci are at 60% of the maximum rate, two are at 80%, and two are at the full error rate (coded "1 1 .8 .8 .6 .6" in Fig 6), the incidence of false inclusions remains near 5% so long as p ≤ 0.8. When the unevenness is more severe with loci at 40 and 70% of the maximum error rate (1 1 .7 .7 .4 .4), the incidence of false inclusion becomes unacceptably large for p > 0.4.
One remedy might be to use a larger upper bound on p to estimate reliability. Fig 6B shows the incidence of false inclusions across the same set of uneven dropout rates when a 1 - γ = 95% upper bound is used. This reduces the incidence of false inclusions at low and moderate p values, but the effect diminishes as p gets large. For the most uneven case considered, the false inclusion rate is acceptable so long as p < 0.6. Surprisingly, this increase in the upper bound on p from 70 to 95% elevates the number of reactions only slightly (Fig 7). These results suggest that, if the dropout rates across loci are not highly uneven and/or if the base rate is not large, analyzing data under the even dropout rate assumption still yields reliable results. Using a higher upper bound on p increases the range of violations over which the model remains robust while not increasing the number of reactions appreciably.
Study design:
When designing and budgeting a study, it is often valuable to have an estimate of the number of reactions that will be required. Equation 6, the unconditional probability of obtaining a correct genotype, can provide such an estimate. This requires three things of the investigator: (1) confidence that the model is appropriate; (2) knowledge of the heterozygosity per locus, or a willingness to make an educated guess; and (3) knowledge of the dropout rate, or an educated guess. To avoid underbudgeting, the investigator can use conservatively low heterozygosities and conservatively high dropout rates. It should be noted that the number of replicates need not be even across loci; the SAM algorithm can be used to forecast how replication will proceed on average after the initial replicates are performed. We also note that the number of reactions will additionally include failures.
| CONCLUSIONS AND OUTSTANDING ISSUES |
|---|
The most important result of this article is that under the model assumptions the MLRBAM represents an efficient method for obtaining reliable genotypes, especially in comparison to the WCRBAM approach. Although a number of variables are shown to affect this efficiency, two valuable points emerge. First, the MLRBAM method is especially efficient when p is small, and published data suggest it generally will be 0-40% [...]
Just as important as the performance of MLRBAM in simulations is its applicability to real data. Several of the assumptions upon which the model is based warrant closer scrutiny. One such assumption made here is that the dropout rates are even across loci. The findings that the model is robust to mild departures from evenness and the failure of [...]
A second assumption made in the MLR model is that the two alleles in a heterozygote are equally likely to drop out. It has been suggested that the longer allele may drop out more often than the shorter [...]
The most serious assumption made in our model is that there are no false or contaminant alleles in the analyzed data. We do not assume that such alleles never occur, but rather that they can be flagged and removed from the data. Although several studies have reported occurrences of false and contaminant alleles that are non-negligible [...]
Nevertheless, cryptic false and contaminant alleles do occur and when they are undetected, they will cause genotype errors. It may be possible to explicitly incorporate these errors into the likelihood model, for example, by assigning each allele a conditional probability of being true and a probability of being false. Certainly, there is information regarding how likely an allele is to be false vs. true such as its frequency and its length relative to the other allele (most false alleles are one repeat shorter or longer than a true allele). Likewise, the contamination probability can be estimated by using numerous blanks during DNA extraction and PCR. In practice, one current option is to follow [...]
A dilemma arises with this approach, however, when a series of reactions at a locus yield one heterozygote result and the same homozygote for all the rest (e.g., ab, a, a, a). If in replicating we continue to observe the same homozygote, at what point should we begin to have serious doubts about it being a genuine heterozygote? Suppose that the locus is truly a heterozygote. The probability of observing one heterozygous result and i - 1 homozygotes of the same allele given the model is
$$P(\text{one heterozygous result and } i-1 \text{ identical homozygous results}) = 2i\,(1-p)\left(\frac{p}{2}\right)^{i-1} \qquad (8)$$
Replacing p with p̂, or more conservatively with p̂(up), estimated from data at the other loci, a p value can be calculated. Even in the worst case, where p is near 80-85%, it is very unusual not to observe both alleles twice in a true heterozygote within five or six replicates. Once replication drives this p value below, say, 5%, the sample should be abandoned at this locus. The sample cannot be declared a homozygote because casting doubt on the heterozygote hypothesis and accepting the homozygote-false/contaminant allele hypothesis are not equivalent. The competing hypotheses cannot be resolved (as with a likelihood-ratio test) without placing a probability on the event of a false/contaminant allele [...].
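The stopping rule for a locus with a single heterozygous result can be illustrated numerically. The sketch assumes that, for a true heterozygote, the probability of exactly one two-allele result and i - 1 single-allele results of the same allele is 2i(1 - p)(p/2)^(i-1): the heterozygous result may fall in any of the i reactions, and the repeated allele may be either copy. This form is an assumption, chosen because it is largest near p = 0.8 to 0.85 for five or six replicates, in line with the worst case described above. The function name is hypothetical.

```python
def one_het_probability(p: float, i: int) -> float:
    """P(one heterozygous result and i - 1 identical homozygous results | true heterozygote).

    Assumed form: i positions for the heterozygous result, two choices for
    the repeated allele, probability 1 - p for no dropout and p/2 for the
    dropout of one specific copy.
    """
    if i < 2:
        raise ValueError("at least two replicates are needed")
    return 2 * i * (1.0 - p) * (p / 2.0) ** (i - 1)


if __name__ == "__main__":
    # Plug in an upper bound on the dropout rate and watch the value shrink
    # as further replicates keep returning the same homozygote.
    for replicates in range(2, 8):
        print(replicates, one_het_probability(0.8, replicates))
```

Because the value depends strongly on the dropout rate plugged in, using the upper bound rather than the point estimate delays abandonment and is the more cautious choice.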
Though false and contaminant alleles are generally rare, homozygotes are not. Each time a homozygote is observed, there is the possibility that it represents a true heterozygote where consecutive dropout errors have occurred. In this article we developed a general mathematical framework for dealing with this source of uncertainty. While incomplete, the model shows promise for vastly improving the efficiency in acquiring reliable genetic data, a critical step toward realizing the potential of noninvasive, historic, and forensic genetic sampling.
| ACKNOWLEDGMENTS |
|---|
We thank Gordon Luikart for advice on writing the simulation program. This research was supported by the National Science Foundation (NSF) grant DEB-0089756 and the NSF EPSCoR program (Experimental Program to Stimulate Competitive Research), NSF cooperative agreement nos. EPS-9720634 and EPS-0080935.
Manuscript received February 23, 2001; Accepted for publication October 22, 2001.
| APPENDIX |
|---|
PROOF OF Equation 1
Assumptions:
- All L loci are true heterozygotes.
- Allelic dropout occurs in every reaction. Denoting the two alleles a and b, the true genotype is revealed only if a drops out during one replication and b drops out during another.
- The probability of observing a particular allele is one-half per replicate per locus.
- Loci are independent.
Probability experiment:
Each locus is initially replicated k times. The observed homozygotes are then replicated an additional i - k times for a total of i replicates.
Recording the results of the experiment:
If at the end of the experiment all replicates produce the same allele, the locus is typed as a homozygote; otherwise it is typed as a heterozygote. The genotype is correctly identified if at the end of the experiment each locus is typed as a heterozygote.
Mathematics:
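Let $C$ be the event that the genotype is typed correctly. Let $M_k$ equal the number of observed homozygotes after $k$ replicates. Let $X_j = 1$ if, after $k$ replicates, locus $j$ is an observed homozygote and $X_j = 0$ otherwise. It follows that $M_k = X_1 + X_2 + \cdots + X_L$. Under the assumptions above, the $X_j$ are independent with $P(X_j = 1) = 2\left(\tfrac{1}{2}\right)^{k} = \left(\tfrac{1}{2}\right)^{k-1}$. Now let $s$ be any real number. It follows that

$$E\left(s^{M_k}\right) = \prod_{j=1}^{L} E\left(s^{X_j}\right) = \left[1 - \left(\tfrac{1}{2}\right)^{k-1} + s\left(\tfrac{1}{2}\right)^{k-1}\right]^{L}. \tag{A1}$$

Each observed homozygote is revealed as a heterozygote by the additional $i - k$ replicates with probability $1 - \left(\tfrac{1}{2}\right)^{i-k}$, independently across loci. Therefore,

$$P(C) = E\left[P(C \mid M_k)\right] = E\left[\left(1 - \left(\tfrac{1}{2}\right)^{i-k}\right)^{M_k}\right].$$

Now replace $s = 1 - \left(\tfrac{1}{2}\right)^{i-k}$ in (A1) to get

$$P(C) = \left[1 - \left(\tfrac{1}{2}\right)^{k-1} + \left(\tfrac{1}{2}\right)^{k-1} - \left(\tfrac{1}{2}\right)^{i-1}\right]^{L} = \left[1 - \left(\tfrac{1}{2}\right)^{i-1}\right]^{L},$$

which is Equation 1. Note that $P(C)$ does not depend on $k$. The initial number of replicates does not affect the probability of a correct genotype.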
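The independence from k can be checked by simulating the probability experiment directly. The sketch below is illustrative only: it assumes Equation 1 has the closed form (1 - (1/2)^(i-1))^L, which is what the stated assumptions imply, and the function names, the number of runs, and the seed are arbitrary choices made here.

```python
import random


def p_correct_closed_form(L, i):
    """Assumed closed form of Equation 1: (1 - (1/2)**(i - 1))**L."""
    return (1.0 - 0.5 ** (i - 1)) ** L


def simulate_p_correct(L, k, i, runs=20000, seed=0):
    """Monte Carlo estimate of P(C) under the appendix assumptions.

    Every locus is a true heterozygote, dropout occurs in every reaction,
    and each replicate shows allele a or b with probability 1/2.
    """
    if not 1 <= k <= i:
        raise ValueError("need 1 <= k <= i")
    rng = random.Random(seed)
    correct = 0
    for _ in range(runs):
        genotype_ok = True
        for _ in range(L):
            # Initial k replicates; True stands for allele a, False for b.
            results = [rng.random() < 0.5 for _ in range(k)]
            if all(results) or not any(results):
                # Observed homozygote: replicate i - k more times.
                results += [rng.random() < 0.5 for _ in range(i - k)]
            if all(results) or not any(results):
                # Still a single allele after all replicates: typed wrongly.
                genotype_ok = False
                break
        correct += genotype_ok
    return correct / runs
```

Running `simulate_p_correct` with the same L and i but different k should give estimates that agree with each other and with `p_correct_closed_form` up to sampling noise.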
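If a genotype error probability is set to $\alpha$, then $P(C) = 1 - \alpha$. Therefore,

$$\left[1 - \left(\tfrac{1}{2}\right)^{i-1}\right]^{L} = 1 - \alpha.$$

Therefore,

$$\left(\tfrac{1}{2}\right)^{i-1} = 1 - (1 - \alpha)^{1/L},$$

implying

$$i = 1 + \frac{\ln\left[1 - (1 - \alpha)^{1/L}\right]}{\ln(1/2)}.$$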
| LITERATURE CITED |
|---|
BOUZAT, J. L., H. A. LEWIN, and K. N. PAIGE, 1998 The ghost of genetic diversity past: historical DNA analysis of the greater prairie chicken. Am. Nat. 152:1-6.
ERNEST, H. B., M. C. T. PENDO, B. P. MAY, S. M. SYVANEN, and W. M. BOYCE, 2000 Molecular tracking of mountain lions in the Yosemite Valley region in California: genetic analysis using microsatellite and faecal DNA. Mol. Ecol. 9:433-441.
GAGNEUX, P., C. BOESCH, and D. S. WOODRUFF, 1997 Microsatellite scoring errors associated with noninvasive genotyping based on nuclear DNA amplified from shed hairs. Mol. Ecol. 6:861-868.
GERLOFF, U., C. SCHLÖTTERER, K. RASSMANN, I. RAMBOLD, and G. HOHMANN et al., 1995 Amplification of hypervariable simple sequence repeats (microsatellites) from excremental DNA of wild living bonobos (Pan paniscus). Mol. Ecol. 4:515-518.
GOOSSENS, B., L. P. WAITS, and P. TABERLET, 1998 Plucked hair samples as a source of DNA: reliability of dinucleotide microsatellite genotyping. Mol. Ecol. 7:1237-1241.
KOHN, M. H. and R. K. WAYNE, 1997 Facts from feces revisited. Trends Ecol. Evol. 12:223-227.
KOHN, M. H., E. C. YORK, D. A. KAMRADT, G. HAUGHT, and R. M. SAUVAJOT et al., 1999 Estimating population size by genotyping faeces. Proc. R. Soc. Lond. Ser. B 266:1-7.
LEONARD, J. A., R. K. WAYNE, and A. COOPER, 2000 Population genetics of ice age brown bears. Proc. Natl. Acad. Sci. USA 97:1651-1654.
MILLER, L. M. and A. R. KAPUSCINSKI, 1997 Historical analysis of genetic variation reveals low effective population size in northern pike (Esox lucius) population. Genetics 147:1249-1258.
MILLS, L. S., J. J. CITTA, K. P. LAIR, M. K. SCHWARTZ, and D. A. TALLMON, 2000 Estimating animal abundance using noninvasive DNA sampling: promise and pitfalls. Ecol. Appl. 10:283-294.
MORIN, P. A., J. WALLIS, J. J. MOORE, and D. S. WOODRUFF, 1994 Paternity exclusion in a community of wild chimpanzees using hypervariable simple sequence repeats. Mol. Ecol. 3:469-478.
MUNDY, N. I., C. S. WINCHELL, T. BURR, and D. S. WOODRUFF, 1997 Microsatellite variation and microevolution in the critically endangered San Clemente Island loggerhead shrike (Lanius ludovicianus mearnsi). Proc. R. Soc. Lond. Ser. B 264:869-875.
NAVIDI, W., N. ARNHEIM, and M. S. WATERMAN, 1992 A multiple-tubes approach for accurate genotyping of very small DNA samples by using PCR: statistical considerations. Am. J. Hum. Genet. 50:347-359.
PALSBØLL, P. J., J. ALLEN, M. BÉRUBÉ, P. J. CLAPHAM, and T. P. FEDDERSEN et al., 1997 Genetic tagging of humpback whales. Nature 388:767-769.
REED, J. Z., D. J. TOLLIT, P. M. THOMPSON, and W. AMOS, 1997 Molecular scatology: the use of molecular genetic analysis to assign species, sex and individual identity to seal faeces. Mol. Ecol. 6:225-234.
ROY, M. S., E. GEFFEN, D. SMITH, and R. K. WAYNE, 1996 Molecular genetics of pre-1940 red wolves. Conserv. Biol. 10:1413-1424.
TABERLET, P., S. GRIFFIN, B. GOOSSENS, S. QUESTIAU, and V. MANCEAU et al., 1996 Reliable genotyping of samples with very low DNA quantities using PCR. Nucleic Acids Res. 24:3189-3194.
TABERLET, P., J. J. CAMARRA, S. GRIFFIN, E. UHRÈS, and O. HANOTTE et al., 1997 Noninvasive genetic tracking of the endangered Pyrenean brown bear population. Mol. Ecol. 6:869-876.
TABERLET, P., L. P. WAITS, and G. LUIKART, 1999 Noninvasive genetic sampling: look before you leap. Trends Ecol. Evol. 14:323-327.
TESSIER, N. and L. BERNATCHEZ, 1999 Stability of population structure and genetic diversity across generations assessed by microsatellites among sympatric populations of landlocked Atlantic salmon (Salmo salar L.). Mol. Ecol. 8:169-179.
WAITS, J. L. and P. L. LEBERG, 2000 Biases associated with population estimation using molecular tagging. Anim. Conserv. 3:191-200.
WOODS, J. G., D. PAETKAU, D. LEWIS, B. N. MCLELLAN, and M. PROCTOR et al., 1999 Genetic tagging of free-ranging black and brown bears. Wildl. Soc. Bull. 27:616-627.








), MLRSAM (
), MLRBAM (
) across dropout rate, p. (A) Mean total number of reactions to achieve acceptable genotypes. (B) Mean estimated reliability of accepted genotypes. For WCRSAM and WCRBAM, reliability estimated after genotype acceptance using
) Three rounds, () two rounds, (
) one round. Based on 1000 runs/simulation, six loci, two initial replicates/locus, H = 67% [allele frequencies: {0.33, 0.33, 0.34}], reliability criteria 1 - 
), 75% for 

) four loci, WCRBAM; (






