- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Related articles in Genetics
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Stahl, F. W.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Stahl, F. W.
Genetics, Vol. 179, 701-704, May 2008, Copyright © 2008
doi:10.1534/genetics.108.086918
On the "NPD Ratio" as a Test for Crossover Interference
Franklin W. Stahl1
Institute of Molecular Biology, University of Oregon, Eugene, Oregon 97403
1 Address for correspondence: Institute of Molecular Biology, 1229, University of Oregon, Eugene, OR 97403-1229.
E-mail: fstahl{at}uoregon.edu
The "NPD ratio," widely used by yeast geneticists, is of limited applicability and is prone to falsely indicate significant crossover interference in a chi-square test. A simple, better chi-square test for interference in two-factor crosses is described.
FOR organisms, such as yeast, in which the four products of individual meioses can be examined, crossover interference can be assessed in two-factor crosses. In 1952, Haig Papazian noted that, for the null hypothesis of no interference, the frequency of nonparental ditype (NPD) tetrads expected (fNexp) can be written as a function of the frequency of tetratype (TT) tetrads observed (fTobs):
![]() | (1) |
Equation 1, which is equivalent to PAPAZIAN's (1952) Equation 5, assumes the absence of both chromatid and chromosome (chiasma) interference and is a rewritten form of equations by HALDANE (1931). Interference is indicated when the observed frequency of NPDs (fNobs) differs significantly from fNexp. The "NPD ratio" (fNobs/fNexp) is an indication of the intensity of interference, with smaller values indicating stronger positive interference (SNOW 1979a,b).
Papazian's approximate method:
In 1952, in the absence of today's electronic calculators, Papazian provided an easily calculable approximation to Equation 1:
![]() | (2) |
calculated from the approximate equation is only 0.080, which is less than half of its exact expectation of
. Subsequently, STRICKLAND (1958) proposed an approximation that is more useful over a wider range of fTobs values and that, he says, changes Papazian's (allegedly incorrect)
to
.
Some authors, noting correctly that Papazian's exact equation gives meaningless roots when
, have used Equation 2 to calculate fNexp when confronted with data in which
. Apparently, these authors took PAPAZIAN (1952, p. 179) seriously when he wrote that his approximation "... is valid for long intervals and can be used where other methods of measuring interference cannot." However, not only is the approximation a poor one for large fTobs, its use when
is as invalid as is that of the exact expression—fTobs cannot (significantly) exceed
in the absence of interference, as noted by PERKINS (1955). Furthermore, since Equation 1 describes relationships in the presumed absence of interference, the only value for fNexp that can reasonably be based solely on an fTobs that (insignificantly) exceeds
is
, the value for fNexp as map length approaches infinity. Some other authors have simply ignored the upper limit on fT for tetrads in which
and extracted a meaningless "fNexp" from the exact equation.
Tests for significance:
Several users of Papazian's equation test for significance by chi square, but they fail to justify use of this test or to publish particulars. The most obvious, though not necessarily legitimate (see below), application of chi square would seem to be a test of the observed numbers of NPDs and of parental ditypes (PDs) vs. the numbers expected (d.f. = 1), where the frequency of PDs expected is fPexp = 1 – fNexp – fTobs. Some investigators appear to ignore the PDs and calculate chi square as (Nobs – Nexp)2/Nexp (d.f. = 1), where Nexp is fNexp x total tetrads, rounded to the nearest integer. (The rounding seems to imply the inappropriate use of a contingency chi-square test rather than the appropriate "goodness-of-fit" chi-square test.)
Problems with chi square:
PAPAZIAN's (1952) equation uses only a fraction of the data available for testing interference. The resulting inefficiency, which is minimal at small values of fNobs, increases with increasing fNobs. Because the Papazian-based chi-square tests ascribe all of the deviation to two of the three classes, their chi-square values, based on the sum of the squares of those deviations, are enlarged. Accordingly, such Papazian-based methods will be prone to falsely claiming interference.Below, we offer a simple, better way to test two-factor tetrad data for interference and compare it with the above-mentioned Papazian-based methods.
The better way:
In the presumed absence of interference, map length (X, in morgans) may be calculated from the observed recombinant frequency (R) using HALDANE's (1919) equation
![]() | (3) |
![]() | (4) |
The expected frequencies of the three classes of tetrads are written below in a form that is user friendly for calculation. They appear in different form in HALDANE (1931), PAPAZIAN (1952), and SNOW (1979a),
![]() | (5) |
![]() | (6) |
![]() | (7) |
fstahl/. With respect to false positives, the performance of the better way (BW) can be compared with those of the Papazian-based methods and the maximum-likelihood estimation (MLE) (FISHER 1925), using the appropriate likelihood equation of estimation (Equation 7 of SNOW 1979a). When a universe of tetrads lacking interference is sampled, the BW behaves as well as the MLE over the range tested, giving "significance" 5% of the time (Figure 1). The false positives of the Papazian-based methods, on the other hand, increase with increasing map length. For instance, a 2003 yeast article in Molecular and Cellular Biology uses Papazian-based chi-square tests and reports interference significant at the 0.02 level for an interval that has 188 PDs, 16 NPDs, and 235 TTs. When tested by the better way, the P-value for that interval is 0.07. A 2003 article in GENETICS reports negative interference (P = 0.03) on the basis of 542 PDs, 80 NPDs, and 617 TTs. When tested by the BW, the P-value is 0.12.
|
In Table 1, we compare the properties of the three methods for some of the Schizosaccharomyces pombe data of MUNZ (1994) and for fictitious data that vary with respect to map length, interference, and total sample size. Table 1 shows that chi-square values for the BW and the MLE are practically identical. The Papazian-based method, on the other hand, often gives appreciably larger chi-square values. The entries in Table 1 suggest that, in the presence of positive interference, MLEs of X are slightly larger than BW estimates. This suggestion is supported by extensive comparisons (APPENDIX) made (unintentionally) by SNOW (1979a,b).
|
In Figure 2, the BW and the MLE are compared with respect to the efficiency with which they reflect the universe sampled. Figure 2A shows that, even for small samples (100 tetrads each), the mean values of X obtained by the BW reflect the universe mean about as well as those obtained by the MLE over the range tested. Figure 2B shows that the variances of the sample distributions obtained by the BW are also practically indistinguishable from those obtained by the MLE. The test range was limited by the frequency with which sample R values >
prevented the use of the BW.
|
The similarity in performance of the MLE and the BW justifies the use of chi square as an easily calculated test for significance when the estimate of map length (X) is based on Equation 3 rather than on the tedious (or computerized) MLE. The BW is valid for values of fTobs that are
(as long as the observed R is not
). APPENDIX
SNOW (1979a) developed maximum-likelihood equations for estimating interference according to a model by BARRATT et al. (1954). The MLE equations contain parameters for the map length (X) and for the strength of interference (k). Snow calculated what he called X(i) for "X with interference." He was later advised that what he had, in fact, calculated was X as it would be if there were no interference (SNOW 1979b).SNOW (1979a, p. 237) writes: "Because of the nature of tetrad data and the conditions of the interference model, the estimate of x obtained by (9) and (10) is equal to –(1/2)ln p(0) (i.e., p(0) = e–2x), where p(0) is the proportion of tetrads with no exchanges, that is, PD – NPD/N. For example, a Saccharomyces cross involving the markers mat1 and his4 on chromosome 3 (see Table 2) produced 97 PD, 7 NPD, and 174 T. The maximum likelihood estimate of x with interference is 0.56390, which is –(1/2)ln(90/278)."
Note that fP – fN = 1 – 2fN – fT = 1 – 2R. Thus, Snow calculated the X(i) values of his Table 2 from equations exactly equivalent to our Equations 3 and 4, which assume no interference (HALDANE 1919). Snow appeared at that time not to realize what he had done, judging from his remark that "the estimates for x(i) are somewhat smaller than those for x, in keeping with expectations of the interference model" (p. 241). In fact, Snow's estimates of X(i) tend to be less than his MLEs of X in the no-interference model simply because the BW procedure for estimating X assuming the absence of interference is slightly biased. By comparing MLEs and BW estimates of X to five decimal places, SNOW (1979a) has given us examples of the extent of bias in the X values of the BW method. The degree of the bias can be appreciated by examining the many entries for X and X(i) in SNOW's (1979a) Table 2.
BARRATT, R. W., D. NEWMEYER, D. PERKINS and L. GARNJOBST, 1954 Map construction in Neurospora crassa. Adv. Genet. 6: 1–93.[Medline]
FISHER, R. A., 1925 Statistical Methods for Research Workers. Oliver & Boyd, Edinburgh.
HALDANE, J. B. S., 1919 The combination of linkage values, and the calculation of distances between the loci of linked factors. J. Genet. 8: 299–309.
HALDANE, J. B. S., 1931 The cytological basis of genetical interference. Cytologia 3: 55–65.
MUNZ, P., 1994 An analysis of interference in the fission yeast Schizosaccharomyces pombe. Genetics 137: 701–707.[Abstract]
PAPAZIAN, H. P., 1952 The analysis of tetrad data. Genetics 37: 175–188.
PERKINS, D. D., 1955 Tetrads and crossing over. J. Cell. Comp. Physiol. 45(Suppl. 2): 119–149.[CrossRef]
SNOW, R., 1979a Maximum likelihood estimation of linkage and interference from tetrad data. Genetics 92: 231–245.
SNOW, R., 1979b Comment concerning maximum likelihood estimation of linkage and interference from tetrad data. Genetics 93: 285.
STRICKLAND, W. N., 1958 An analysis of interference in Aspergillus nidulans. Proc. R. Soc. Lond. Ser. B 140: 82–101.
Communicating editor: G. R. SMITH
Related articles in Genetics:
ISSUE HIGHLIGHTS
Genetics 2008 179: NP.
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Related articles in Genetics
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Stahl, F. W.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Stahl, F. W.








