help button home button Genetics AJP: Endocrinology
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Zeng, Z.-B.
Right arrow Articles by Laurie, C. C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Zeng, Z.-B.
Right arrow Articles by Laurie, C. C.
Genetics, Vol. 154, 299-310, January 2000, Copyright © 2000

Genetic Architecture of a Morphological Shape Difference Between Two Drosophila Species

Zhao-Bang Zenga, Jianjun Liu1,2,b, Lynn F. Stamb, Chen-Hung Kao3,a, John M. Mercerb, and Cathy C. Laurie4,b
a Department of Statistics, North Carolina State University, Raleigh, North Carolina 27695
b Department of Zoology, Duke University, Durham, North Carolina 27708

Corresponding author: Zhao-Bang Zeng, Department of Statistics, Box 8203, 220F Patterson Hall, North Carolina State University, Raleigh, NC 27695-8203., zeng{at}stat.ncsu.edu (E-mail)

Communicating editor: A. G. CLARK


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The size and shape of the posterior lobe of the male genital arch differs dramatically between Drosophila simulans and D. mauritiana. This difference can be quantified with a morphometric descriptor (PC1) based on elliptical Fourier and principal components analyses. The genetic basis of the interspecific difference in PC1 was investigated by the application of quantitative trait locus (QTL) mapping procedures to segregating backcross populations. The parental difference (35 environmental standard deviations) and the heritability of PC1 in backcross populations (>90%) are both very large. The use of multiple interval mapping gives evidence for 19 different QTL. The greatest additive effect estimate accounts for 11.4% of the parental difference but could represent multiple closely linked QTL. Dominance parameter estimates vary among loci from essentially no dominance to complete dominance, and mauritiana alleles tend to be dominant over simulans alleles. Epistasis appears to be relatively unimportant as a source of variation. All but one of the additive effect estimates have the same sign, which means that one species has nearly all plus alleles and the other nearly all minus alleles. This result is unexpected under many evolutionary scenarios and suggests a history of strong directional selection acting on the posterior lobe.


IN recent years, advances in molecular biology have stimulated a great resurgence of interest in the genetic and developmental bases of morphological diversity. Most of the work in this area is concerned with fundamental differences in the body plans of organisms that diverged from one another many millions of years ago (e.g., SHUBIN et al. 1997 Down). The basic approach is classical comparative biology of the patterns of expression of genes known to play an important role in the development of model organisms. Such studies have suggested hypotheses about how morphological diversity may evolve through changes in patterns of regulatory gene expression (e.g., CARROLL et al. 1995 Down; AVEROF and PATEL 1997 Down). However, these hypotheses are difficult to test and it is likely that a multitude of complex genetic changes underlie macroevolutionary differences such as variation in numbers and types of appendages.

An alternative approach to the study of morphological diversity is direct genetic analysis of phenotypic differences between closely related species. This type of microevolutionary analysis has the potential to reveal the underlying genetic architecture of morphological differences in considerable detail. An understanding of the numbers and types of gene substitutions responsible for species differences eventually can provide insight into how morphology evolves in terms of population genetic processes.

A pair of closely related allopatric species of Drosophila, Drosophila simulans and D. mauritiana, differ dramatically in size and shape of the posterior lobe of the male genital arch (Fig 1). D. simulans females hybridize readily with mauritiana males in the laboratory, producing fertile female and sterile male F1's with an intermediate posterior lobe morphology. When F1 females are backcrossed to parental males, a continuous series of morphologies is produced, suggesting polygenic inheritance. We have previously shown that both the size and shape variation (which are highly correlated) can be quantified by a morphometric descriptor (PC1) based on elliptical Fourier and principal components analyses. We have also reported a preliminary investigation of the genetic architecture of this trait using composite interval mapping (LIU et al. 1996 Down). Here we greatly improve the resolution of the quantitative trait locus (QTL) analysis through increases in sample size and marker number as well as the application of a new interval mapping procedure.



View larger version (8K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 1. Camera lucida outlines of a posterior lobe from one specimen each of D. simulans and D. mauritiana.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Experimental design and data acquisition:
The experimental design and morphological data acquisition methods were described previously in detail (LIU et al. 1996 Down). Briefly, females from an inbred line of D. simulans (13w JJ) were crossed to males of an inbred line of D. mauritiana (Rob A JJ) to produce an F1 population. The parental lines are fixed for different alleles at 45 marker loci. The F1 females were backcrossed to each parental line to produce two populations, referred to as mauritiana backcross (BM) and simulans backcross (BS). These crosses were repeated at two different times to produce two independent samples of males from each backcross population, BS1 (n = 186), BS2 (n = 288), BM1 (n = 192), and BM2 (n = 299). The phenotypic value of an individual is the average over both sides of the first principal component (PC1) of the Fourier coefficients of the posterior lobe.

The genotype of males from BS1 and BM1 was determined at each of 45 marker loci, which are listed in Table 1. The same markers were scored also on BS2 and BM2, except for prd, eve, and plu. Cytological positions in Table 1 are from FLYBASE 1997 Down and genetic map positions are estimated from gametes produced by F1 females in this study.


 
View this table:
[in this window]
[in a new window]

 
Table 1. Summary of markers

Molecular markers were developed by using D. melanogaster sequence in FLYBASE 1997 Down to design PCR primers that amplify a specified region from both mauritiana and simulans parental lines. These PCR products were sequenced to identify interspecific differences to be used as markers. In some cases, the sequence difference alters a restriction site and was detected by restriction digestion and electrophoresis as described by LIU et al. 1996 Down. In most cases, an allele-specific oligonucleotide (ASO) hybridization assay was used to genotype flies [modified after MAEKAWA et al. 1995 Down and SAIKI et al. 1986 Down]. In the ASO assay, two 15-base probes were designed to cover the nucleotide sequence difference, with one matching each allelic form. PCR products from single-fly genomic DNA were blotted onto Hybond N+ filters in a 96-well format and hybridized with each probe in succession. At discriminating wash temperatures, the mismatched probe washes off and the perfectly matched probe remains hybridized, revealing the fly's genotype. PCR and hybridization conditions, along with primer and probe sequences, are given in Table 1. The genetic and phenotypic data of this experiment are available through a link on the QTL Cartographer web site (http://statgen.ncsu.edu/qtlcart/cartographer.html).

QTL analysis:
Two types of interval mapping analyses were applied to the backcross data sets, composite interval mapping (CIM; ZENG 1994 Down) and multiple interval mapping (MIM; KAO and ZENG 1997 Down; KAO et al. 1999 Down).

The CIM analysis involves a single QTL model with multiple regression on marker loci outside of the interval under consideration. In this case the model is

(1)

where i indexes backcross, j indexes sample within backcross, k indexes individuals within sample, l indexes markers, yijk is the phenotypic value, µij is the mean, b*ij is the effect of a putative QTL, x*ijk is a QTL indicator variable, bijl is the partial regression coefficient of yijk on xijkl, xijkl is an indicator variable for the lth marker, {Sigma} is over all markers except those flanking the interval containing the putative QTL, and eijk is the residual. The CIM likelihood function and hypothesis testing for this data set were described previously (ZENG 1994 Down; LIU et al. 1996 Down).

For CIM, several different likelihood ratios corresponding to different hypotheses were calculated and results are presented as LOD = -log10(L0/L1). Fig 2A shows joint mapping with both samples from both backcrosses (H0, b*ij = 0; and H1, b*ij != 0 for i,j = 1 and 2) and joint mapping with both samples in one backcross (H0, b*i1 = 0 and b*i2 = 0; and H1, b*i1 != 0 and b*i2 != 0). In addition, two types of interactions were tested, a QTL x backcross interaction (H0, b*11 = b*21 and b*12 = b*22; and H1, b*11 != b*21 and b*12 != b*22) and a QTL x sample interaction (H0, b*11 = b*12 and b*21 = b*22; and H1, b*11 != b*12 and b*21 != b22). The critical value of LOD score used is (0.217{chi}2), where f is the number of parameters and g is the number of marker intervals. For joint mapping of both samples and both backcrosses, f = 5, g = 42, and LODc = 4.4; for joint mapping of both samples in one backcross, f = 3, g = 42, and LODc = 3.5. As interactions were only tested on previously identified QTL, f = 1, g = 14, and LODc = 1.8.



View larger version (39K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 2. LOD profiles for chromosomes X, 2, and 3 from composite interval mapping (a) and multiple interval mapping (b). The solid curve represents the joint analysis of both backcrosses, while the dashed and dotted curves represent separate analysis of each backcross. In a, the horizontal lines represent the critical values, solid for the joint analysis and dashed for the individual backcrosses. Putative QTL identified in a occur at map positions 1-3, 1-23, 2-1, 2-19, 2-67, 2-143, 3-4, 3-21, 3-47, 3-75, 3-83, 3-94, 3-140, and 3-172. In b, putative QTL are numbered for reference to information in Table 2 and Table 3. Marker positions are given by triangle symbols.

The MIM analysis is quite different, because it involves a multiple QTL model, which may include both main effects and epistatic interactions. The model is

(2)

where i indexes backcross, j indexes sample within backcross, k indexes individuals within sample, m is the number of putative QTL, {alpha}ir is the effect of a putative QTL r in backcross i (assuming that the effect is the same for both samples within a backcross), x*ijkr is an indicator variable denoting the genotype of QTL r (= +1/2 or -1/2), ßirs is an epistatic effect between QTL r and s, {Sigma}r!=s{sub}(1,...,m) is the subset of QTL pairs that show a significant epistatic effect, t is the number of significant pairwise epistatic effects, and eijk is the residual.

The likelihood of this MIM model is

(3)

where Y denotes the quantitative trait data, X denotes the marker data, pijkl is the probability of each multilocus genotype (conditional on marker data), Ei is a vector of QTL parameters ({alpha}'s and ß's), and Dijkl is a vector specifying the configuration of x*'s associated with each {alpha} and ß for the lth genotype. The term in brackets is the weighted sum of a series of normal density functions, one for each of the 2m possible multiple-QTL genotypes. The procedure to obtain maximum-likelihood parameter estimates was described by KAO and ZENG 1997 Down. A large fraction of the 2m possible genotypes have zero or very small probability given marker genotypes and do not need to be evaluated in the analysis.

Selection of the number and map positions of putative QTL to be included in the MIM model followed a stepwise procedure described by KAO et al. 1999 Down. Briefly, the procedure consisted of the following steps: (1) selection began with a model containing m QTL at positions determined from CIM and included no epistatic effects; (2) the genome was scanned to find the best position of an (m + 1)th QTL (i.e., that resulting in the greatest increase in likelihood), denoted {theta}Fm+1; (3) the model with (m + 1) QTL was reevaluated to find the one QTL having the least significant effect, denoted {theta}Bm+1; (4) if {theta}Fm+1 != {theta}Bm+1, {theta}Bm+1 was removed, {theta}Fm+1 was kept, and the process returned to step (2). If {theta}Fm+1 = {theta}Bm+1, then {theta}Fm+1 was retained in the model if it made a significant contribution (i.e., if the likelihood ratio for models with and without {theta}Fm+1 exceeds a prescribed critical value). If {theta}Fm+1 was retained, all QTL locations were reevaluated and the process returned to step (2). In the positional reevaluation, the entire genome was scanned repeatedly until the positions of maximum LOD score were stationary. If {theta}Fm+1 was not retained, selection of QTL number and position terminated.

It is not clear what critical value should be applied to this type of analysis. Initially, the threshold appropriate for a CIM analysis (LOD score 4.4) was used. Later, a residual permutation (or bootstrap) test was used to guide the final model selection (ZENG et al. 1999 Down).

The residual permutation test is a model-dependent, resampling method. After a series of model-fitting cycles, a test is needed to determine whether the least significant QTL in the current model (or the last added QTL) in the model is a statistically significant addition to the model. In this case the test consists of comparing a model of k QTL (null hypothesis) with an alternative model of k + 1 QTL (which consists of the k QTL model plus one additional QTL). The test consists of several steps: (1) the estimated genotypic value for each individual is obtained under the null hypothesis and the corresponding residual is calculated as the difference between the observed phenotypic and estimated genotypic values; (2) a permuted sample is obtained by randomly shuffling the residuals among individuals; (3) the permuted sample is used in a search for a new QTL, conditional on the k QTL model, and the maximum test statistic is recorded; (4) the resampling and testing in (2) and (3) are repeated a number of times to obtain an empirical 95% significance threshold for the test; and (5) finally, this threshold is compared with the test statistic for the (k + 1)th QTL in the original data.

After establishing a model with main effects, the combined forward/backward procedure was applied to identify significant epistatic effects between pairs of identified QTL. A likelihood-ratio test was performed on each fitted epistatic effect, using a threshold adjusted by the number of tests performed in each cycle. In the first cycle, the LOD threshold was set to 3.0 ({cong}0.217{chi}2). An attempt was also made to search for significant epistatic effects between one QTL identified through its marginal effect and another unoccupied position on the genetic map. This was done by searching for the largest epistatic effect between an identified QTL and all unoccupied positions at 1-cM intervals.

For a single QTL, genotypic values may be defined as a for QQ, d for Qq, and -a for qq. With these definitions, the QTL main effect in one backcross is (a + d) and in the other backcross is (a - d). Thus, a and d can be estimated separately. Variances and covariances can also be estimated. The total phenotypic variance ({sigma}2p) can be estimated from the total sum of squares, the total genotypic variance ({sigma}2g) from the model sum of squares, and the environmental variance ({sigma}2e) from the residual sum of squares. The contribution of a single QTL effect to the total genotypic variance can be expressed as the sum of its variance contribution and 1/2 of its covariances with other QTL effects. This quantity is

(4)

where ijkl is the probability of the lth QTL genotype conditioned on the marker genotype and the trait phenotype, Dijklr is the indicator variable (x*) associated with the genotype at a locus or a product of two indicator variables associated with a bilocus combination, Êir is the main ({alpha}) or epistatic (ß) effect associated with a particular QTL locus or a particular bilocus combination, nij is the size of the jth sample in the ith backcross, and ir = .

This experiment consists of two independent samples for each backcross. Thus, the MIM model can be cross-validated by using parameter estimates from one sample to predict phenotypic values for the other sample. The prediction equation is

(5)

from Equation 2, where pijkl is the probability of the lth QTL genotype conditioned on marker genotype only. While j indicates the value for one sample, j' is the corresponding value for the other sample. The first summation is over all possible 2m QTL genotypes and the second summation is over all effects in the model (m main effects and t epistatic effects).


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Descriptive statistics:
The phenotypic variances within the highly inbred parental lines and within the F1 populations provide estimates of the environmental variance. A pooled estimate shows that the mean parental line difference in PC1 equals 34.9 environmental standard deviations. In addition, the environmental variance estimate is an order of magnitude less than the phenotypic variances of the backcross populations, indicating high heritability. The variance estimates (x10-4) are 0.026 environmental vs. 0.546 for BS1, 0.499 for BS2, 0.280 for BM1, and 0.263 for BM2 backcross populations. The large parental difference coupled with high heritability provides a very favorable situation for QTL mapping.

In both replicate experiments there is evidence for partial dominance of mauritiana alleles. With strictly additive gene action, and assuming some effect of the X chromosome, the F1 mean should be greater than the parental midpoint (i.e., more simulans-like), because all F1 males have a simulans X chromosome. However, in both cases the F1 mean is significantly less than the parental midpoint. For example, in sample 1, the F1 mean is 0.0028, while the midpoint is 0.0054 between the mauritiana (-0.0230) and simulans (0.0337) parents.

Composite interval mapping:
Results of the CIM analysis are summarized in Fig 2A. The joint analysis of both samples in both backcrosses provides evidence for at least 14 different QTL at map positions stated in the figure legend. In most cases, these putative QTL exceed the critical value by a substantial margin and clearly indicate different QTL, because they occur in nonadjacent intervals (ZENG 1994 Down). However, two cases are more complicated: (1) although the LOD peak at 2-67 is somewhat less than the critical value, we count it as a putative QTL because it is highly significant when fewer markers are included as cofactors in the CIM analysis; (2) the region from 3-70 to 3-84 has a peak in each of two adjacent intervals. One peak is significant in the simulans backcross only, while the other is significant in the mauritiana backcross only. This region may contain two separate QTL with backcross-specific effects or it may contain a single QTL, in which case the difference in LOD peak positions in the two backcrosses is due to sampling error. These alternatives were tested using the procedure of JIANG and ZENG 1995 Down, which is designed to distinguish pleiotropy and close linkage.

Fig 3 shows a two-dimensional likelihood profile surface for distinguishing between a single QTL with significant effects in both backcrosses and two QTL, each with a significant effect in only one backcross. The two-dimensional surface represents all possible combinations of two QTL positions between 3-70 and 3-84. The diagonal elements represent null hypotheses of just one QTL (i.e., effects at the same map position in both backcrosses), while the off-diagonal elements represent alternative hypotheses of two QTL. A comparison is made between the maximum value on the two-dimensional surface (20.0 LOD score) and the maximum value on the diagonal (17.2 LOD score). The difference between the two likelihoods is asymptotically {chi}2-distrib-uted with 1 d.f. under the null hypothesis (one QTL). Thus the 95% significance threshold for the test is 0.217{chi}21, 0.05 = 0.83 in LOD score, while the observed difference in LOD score is 2.82. Therefore, the test is significant and the hypothesis of two QTL with backcross-specific effects is favored.



View larger version (31K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 3. Two-dimensional likelihood profile surface for the test to distinguish between a single QTL with significant effects in both mauritiana and simulans backcrosses and two QTL, each having a backcross-specific effect. See text for further explanation.

Testing for QTL x sample interactions at each of the 14 putative QTL positions gave a significant LOD score only at position 3-75. Therefore, the mapping results for each backcross are generally consistent across the two independent samples. Testing for QTL x backcross interactions at each of the 14 positions gave significant LOD scores at three positions, 3-75, 3-83, and 3-94, and a nearly significant score at 3-140. These cases provide evidence of nonadditive inheritance (i.e., dominance and/or epistasis).

An analysis of sample 1 (n = 378 over both backcrosses) with 18 markers was published previously (LIU et al. 1996 Down). It is interesting to compare the results of the earlier, low resolution analysis with the present analysis, where the sample size is more than doubled (n = 965 over both backcrosses) and the marker number is increased to 45. This comparison is provided in Fig 4. Although the power to detect QTL effects is increased with larger sample size, the LOD scores in the full analysis ('99 CIM in Fig 4) are less than those in the smaller experiment ('96 CIM) because more background markers are included in the model to control for variation outside of the test interval. Although there is generally a good correspondence between the two analyses, it is clear that increases in sample size and marker number have improved the resolution. Some of the broad peaks in the '96 CIM analysis are resolved in the '99 analysis into two different peaks in nonadjacent intervals, clearly indicating multiple underlying QTL. This comparison suggests that further increases in sample size and marker number might lead to further changes in the estimated number and positions of QTL.



View larger version (26K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 4. LOD profiles for chromosomes X, 2, and 3 from composite interval mapping for the joint analysis of both backcrosses. The profile labeled '99 CIM is analysis of the full data set with two samples per backcross and 45 markers, while the profile labeled '96 CIM is a previously published (LIU et al. 1996 Down) analysis of a smaller data set (one sample per backcross and 18 markers). Horizontal lines represent the critical values. Marker positions are given by triangle symbols; open triangles represent the 18 markers used in the '96 CIM analysis; all markers were used in the '99 CIM analysis.

Multiple interval mapping:
In the MIM analysis, model selection began with an initial genetic model suggested by the CIM results and continued in a search for additional QTL through several cycles of a backward/forward selection procedure. In each cycle, estimates of QTL position were readjusted for the model selected. In the joint analysis of both backcrosses and both samples, a total of 18 putative QTL were detected with a LOD value exceeding the CIM threshold of 4.4. A 19th QTL (number 8 at 2-135 cM) has a LOD value of 3.6, just below the CIM threshold. Fig 2B shows, for each of the putative QTL, a LOD profile that spans the region from one neighboring QTL to the other. The estimated positions and main effects are given in Table 2 and estimates of a and d are plotted in Fig 5.



View larger version (24K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 5. The distribution of additive (a) and dominance (d) effect estimates of putative QTL, arranged in rank order according to the estimate of a. The effects are expressed as a percentage of half the difference between the parental lines.


 
View this table:
[in this window]
[in a new window]

 
Table 2. QTL positions, effects, and variance components estimated by multiple interval mapping

A residual permutation test was performed under the null hypothesis of 18 QTL to determine the significance of adding the 19th QTL (i.e., QTL 8 at 2-135). In a joint analysis of all four samples, a threshold of 4.6 LOD score was obtained. With this test, QTL 8 would not be significant. However, the evidence for this QTL comes mainly from sample BS2 (Table 3), so a residual permutation test was performed for BS2 only. In this case, the threshold value is 2.3 and the comparable test statistic for QTL 8 is 3.9. Given these 19 QTL, no other position shows a significant effect based on either joint or separate analyses of the four samples. Therefore it appears reasonable to include QTL 8 in the model, which brings the total number of QTL to 19.


 
View this table:
[in this window]
[in a new window]

 
Table 3. Summary results of separate and joint MIM analyses

The MIM analysis provides evidence for five QTL that were not detected in the CIM analysis. Their positions are 2-27, 2-114, 2-135, 3-117, and 3-160. CIM gives some indication of QTL at 2-27 and 3-117, but the evidence is not conclusive. However, when CIM is performed under relaxed conditions (i.e., fewer markers in the multiple regression), the LOD scores in these two regions are significant.

There is a large difference between CIM and MIM in the LOD score for the putative QTL at 2-69. The reason is that the test under MIM is conditional on all QTL in the model, while the test under CIM is conditional on all markers. In the case of the putative QTL at 2-69, the neighboring QTL are 87 cM apart, while the neighboring markers are only 37 cM apart. A similar difference in LOD is found for the QTL at 3-47. Note that the magnitude of LOD score is not strictly proportional to magnitude of effect because the LOD also depends on the proximity of conditioning markers or neighboring QTL.

Once the number and positions of QTL main effects were established in a joint analysis of both backcrosses, the MIM proceeded to select QTL pairs that have significant epistatic effects in each backcross separately. No significant interactions were detected in the simulans backcross, while six were detected in the mauritiana backcross. It is notable that the main effect of QTL at 2-0 was significant only in the simulans backcross, yet it shows significant interactions with two other QTL in the mauritiana backcross. However, no completely new QTL were detected by their interaction with QTL identified through marginal effects in one backcross or the other. The six effects detected in the mauritiana backcross together account for only 6.5% of the phenotypic variance. Therefore, epistasis appears to be relatively unimportant in these backcross populations.

A difference in magnitude of effect of a QTL between the two backcrosses may be due to dominance. For example, a case of complete dominance would give a significant effect in one backcross (say, Qq vs. qq), but no effect in the other (QQ vs. Qq). Estimation of a and d from the difference between the backcrosses reveals considerable variation among loci in the estimated degree of dominance (Fig 5). On the average, d is negative, suggesting that mauritiana alleles tend to be dominant, which is consistent with the observation that the F1 mean is significantly less than the midparent value. Only one QTL (3-83) appears to have strong dominance of the simulans allele.

Fig 5 shows the distribution of additive effect estimates for the 19 putative QTL. The shape of this distribution is probably quite different from the distribution of true values of a for two reasons: (1) the true distribution may have a large number of very small effects that cannot be detected with the power of the current experiment; and (2) some of the estimates may represent the combined effects of multiple, closely linked QTL. Nevertheless, Fig 5 contains some important observations. It shows that no one QTL accounts for a large fraction of the parental difference and that nearly all of the effects are positive, which means that simulans has plus alleles, and mauritiana has minus alleles, at nearly all QTL.

Table 2 gives an estimate of the fraction of the phenotypic variance in the backcross population that is accounted for by each putative QTL (). These estimates are likely to be more accurate than the variance components reported in most previous QTL studies, which generally are based on simple or multiple regression of the phenotype on marker genotypes. The sum of () estimates over all putative QTL effects is R2 for the model, which provides an estimate of broad-sense heritability. The heritability estimates are very high: 0.93 for BS and 0.92 for BM (0.85 from Table 2 plus 0.07 for epistasis). Nevertheless, they are consistent with the observation noted above that the environmental variance is an order of magnitude smaller than the phenotypic variance of the backcross populations.

The MIM model-building procedure was evaluated by comparing models obtained through separate and joint analyses of the four different samples (Table 3). Within each backcross, QTL detected in the smaller samples are also detected in the larger samples and the larger samples detect more QTL, as expected. The estimated positions of QTL are very consistent in different samples within each backcross. These results indicate that the MIM method gives very similar results in the analysis of independent samples.

The MIM results are also generally consistent between the two backcrosses, even though one expects some differences due to dominance effects. Table 3 shows that most QTL are detected at similar positions in both backcrosses. However, in some cases, QTL are found at a certain position in one backcross but not the other. Three such cases (3-75, 3-83, and 3-94) showed significant QTL x backcross interactions in the CIM analysis, which can be interpreted as dominance effects, as noted earlier. Other cases might also be due to dominance effects or possibly just sampling errors. In one case (QTL 7), the result seems to indicate that a single QTL in the joint analysis might be due to two QTL with backcross-specific effects and slightly different positions. However, this interpretation is inconclusive due to the flat likelihood profile in the BS samples in this region.

A cross-validation study was performed separately on each backcross to further assess the MIM model-building procedure and effect estimation. In this study, one sample was analyzed by MIM and then used to predict phenotypic values in the other independent sample. The results (Table 3) show a high level of predictability. For predicting sample 1 from sample 2, the R2 is 0.83 for BM (Fig 6) and 0.89 for BS. For predicting sample 2 from sample 1, the R2 is 0.86 for BM and 0.88 for BS. Although the results appear quite impressive, this study may not provide a very sensitive test of the predictive ability of the MIM procedure because of the consistency in direction of most allelic effects and the existence of considerable linkage disequilibrium in the backcross populations.



View larger version (16K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 6. Plot of cross-validation study results for the mauritiana backcross. Sample 2 was analyzed with multiple interval mapping to develop a model and parameter estimates, which were then used to predict the phenotypic values of sample 1 individuals.

The MIM analysis was compared with a multiple regression of phenotypic value on marker genotype using a backward stepwise selection procedure. Table 3 shows that the two types of analysis give similar results in terms of the number of QTL in the MIM model and the number of markers in the regression model. They also give similar results for the R2 of the model and for cross-validation R2 values. The MIM model gives consistently higher R2 values, but not by a large margin (i.e., average of 0.92 vs. 0.88 for model R2 and 0.87 vs. 0.83 for cross-prediction R2). Given the high density of markers relative to the level of recombination in the backcross populations analyzed here, the similarity between the two types of model is not unexpected. It is likely that the difference between MIM and multiple regression on markers would be much larger in experiments with lower marker density.


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

This study is one of the first applications of multiple interval mapping, which has several advantages over previous QTL mapping procedures. These advantages stem mainly from the fact that MIM uses a multiple-QTL model, whereas other procedures like CIM use a different, single-QTL model for each interval analyzed. With a multiple-QTL model, the effects of all putative QTL are analyzed simultaneously so that epistatic terms can be included, and variance components and heritability can be estimated. Thus, it is expected that MIM will be more precise and powerful than single-QTL mapping methods, particularly in parameter estimation. In this study, the utility of MIM was analyzed in a cross-validation study, which showed a very high level of predictability. In addition, MIM analysis of replicate samples gave similar estimates for QTL positions. These results are very promising, but further work is needed to assess the efficacy of MIM in dealing with various types of genetic models that involve less coupling linkage disequilibrium.

The difference in posterior lobe morphology between D. simulans and mauritiana is clearly polygenic. The CIM analysis shows that at least 14 loci contribute to the PC1 trait difference, whereas MIM indicates a somewhat larger number of 19. Effect estimates from both analyses show that no single QTL explains a large fraction of the parental difference. In the MIM analysis, the largest additive effect estimate is 11.4% of the parental difference and 10 of the 19 putative QTL detected have additive effect estimates >=5.0%. Some of these values may be overestimates if there are multiple, closely linked QTL within a single interval.

We have previously proposed a quantitative definition of a major gene effect as one for which the distributions of alternative homozygotes (on a uniform isogenic background) show little overlap, such that the probability of misclassification is <0.05 (TRUE et al. 1997 Down). When both homozygous classes are normally distributed with equal variance, the probability of misclassification is 0.05 when the means are 3.28 standard deviations apart. For PC1, the parental means are 35 environmental standard deviations apart, so a major gene would have an effect that explains 9.4% of the parental difference. Only one putative QTL (3-47) has an additive effect estimate >9.4%. However, seven putative QTL have effect estimates >2.0 environmental standard deviations (5.7%), which can be detected readily with moderate sample sizes. Therefore, identification of individual QTL appears feasible through introgression and other approaches (ALPERT and TANKSLEY 1996 Down; CORMIER et al. 1997 Down).

Epistatic effects appear to be relatively unimportant for PC1 in the interspecific backcross populations. This observation is difficult to interpret biologically, because an interspecific backcross is segregating for alleles that may never have occurred together in the same population before. However, the lack of strong epistasis between alleles that were fixed in different populations may indicate that such alleles are generally "good combiners." CROW 1957 Down once suggested that mutations with largely additive gene action may produce advantageous effects on a variety of genetic backgrounds and therefore be selected efficiently, while those with complex interactions may tend to remain at low frequency. On the other hand, LONG et al. 1995 Down found strong epistatic effects on Drosophila bristle number for genes that responded to strong artificial selection, which indicates that interacting alleles may respond to directional selection. However, strong artificial selection in a small population may produce different results than natural selection in a large population.

A striking result of the QTL analysis reported here is that all but 1 of the 19 additive effect estimates have the same sign (Fig 5). ORR 1998 Down provides a statistical test for whether the observed numbers of plus and minus alleles in one line is greater than expected under neutrality, given that the QTL study begins with a certain phenotypic difference. A simplified form of the test assumes that all QTL have equal effects, which appears to be conservative in the present case where the single negative allele has a smaller effect than any of the 18 plus alleles. Using this test, the probability of observing 18 of 19 plus alleles in the high line is 0.0001. Therefore, it is highly unlikely that the interspecific difference in genital morphology evolved by random drift.

The strong preponderance of plus alleles in one species suggests a history of consistent directional selection operating on the trait. D. mauritiana and simulans diverged from a common ancestor an estimated 0.6–0.9 million years ago (HEY and KLIMAN 1993 Down), which probably corresponds to ~10 million fly generations. A lot of things can happen in 10 million generations. If there were reverses in the direction of selection or a long period of stabilizing selection, a mixture of plus and minus allelic effects is expected. However, consistently positive directional selection throughout the period of isolation would produce the type of genetic architecture observed. It is also possible that a recent strong episode of directional selection has erased the traces of previous evolutionary changes, but this would require that the recent fixations occurred at nearly all of the same loci involved in previous fixations. This situation seems unlikely because studies of artificial selection in Drosophila have shown that replicate selection lines frequently differ in the contributions of the three major chromosomes to selection response [see appendix of CHARLESWORTH et al. 1987 Down] and therefore in what genes respond at any given time.

For most quantitative traits analyzed previously, there is a mixture of plus and minus alleles in each species, leading to transgressive segregation (TANKSLEY 1993 Down). For example, in a cross between two tomato species, 36% of QTL detected had effects opposite to those predicted by the parental phenotypes (DEVICENTE and TANKSLEY 1993 Down). Even in D. mauritiana and simulans, QTL analysis of male-specific bristle number traits revealed a mixture of plus and minus alleles (TRUE et al. 1997 Down). Therefore, the consistent allelic effects on posterior lobe morphology may be a rare situation that reflects a history of unusually strong directional selection.

Divergent male genital structures, such as the posterior lobe, are thought to evolve by sexual selection through cryptic female choice implemented by postmating mechanisms such as remating and sperm displacement (EBERHARD 1996 Down; PRICE 1997 Down; ARNQVIST 1998 Down). A runaway process of sexual selection (LANDE 1981 Down) may produce very strong selection pressures that lead to rapid evolution of sexually dimorphic traits. However, under some models, cyclical change in the direction of selection on a male trait is expected (IWASA and POMIANKOWSKI 1995 Down), which is not consistent with the posterior lobe genetic architecture reported here.


*  FOOTNOTES

1 These authors contributed equally to this work. Back
2 Present address: Genome Center, Columbia University, 1150 St. Nicholas Ave., Unit 109, New York, NY 10032. Back
3 Present address: Institute of Statistical Science, Academia Sinica, Taipei 11529, Taiwan, R.O.C. Back
4 Present address: Cereon Genomics, 45 Sidney St., Cambridge, MA 02139. Back


*  ACKNOWLEDGMENTS

We acknowledge the Duke University Morphometrics Laboratory for the use of equipment and software. This work was supported by U.S. Public Health Service Grants GM-47292 and GM-45344.

Manuscript received November 28, 1998; Accepted for publication September 29, 1999.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

ALPERT, K. B. and S. D. TANKSLEY, 1996  High-resolution mapping and isolation of a yeast artificial chromosome contig containing fw2.2: a major fruit weight quantitative trait locus in tomato. Proc. Natl. Acad. Sci. USA 93:15503-15507[Abstract/Free Full Text].

ARNQVIST, G., 1998  Comparative evidence for the evolution of genitalia by sexual selection. Nature 393:784-786.

AVEROF, M. and N. H. PATEL, 1997  Crustacean appendage evolution associated with changes in Hox gene expression. Nature 388:682-686[Medline].

CARROLL, S. B., S. D. WEATHERBEE, and J. A. LANGELAND, 1995  Homeotic genes and the regulation and evolution of insect wing number. Nature 375:58-61[Medline].

CHARLESWORTH, B., J. A. COYNE, and N. H. BARTON, 1987  The relative rates of evolution of sex chromosomes and autosomes. Am. Nat. 130:113-146.

CORMIER, R. T., K. H. HONG, R. B. HALBERG, T. L. HAWKINS, and P. RICHARDSON et al., 1997  Secretory phospholipase Pla2g2a confers resistance to intestinal tumorigenesis. Nat. Genet. 17:88-91[Medline].

CROW, J. F., 1957 Genetics of DDT resistance in Drosophila, pp. 408–409 in Proceedings of the International Genetics Symposia, 1956, Science Council of Japan, Tokyo.

DEVICENTE, M. C. and S. D. TANKSLEY, 1993  QTL analysis of transgressive segregation in an interspecific tomato cross. Genetics 134:585-596[Abstract].

EBERHARD, W. G., 1996 Female Control: Sexual Selection by Cryptic Female Choice. Princeton University Press, Princeton, NJ.

FLYBASE CONSORTIUM,, 1997  FlyBase: a Drosophila database. Nucleic Acids Res. 25:63-66[Abstract/Free Full Text].

HEY, J. and R. M. KLIMAN, 1993  Population genetics and phylogenetics of DNA sequence variation at multiple loci within the Drosophila melanogaster species complex. Mol. Biol. Evol. 10:804-822[Abstract].

IWASA, Y. and A. POMIANKOWSKI, 1995  Continual change in mate preferences. Nature 377:420-422[Medline].

JIANG, C. and Z.-B. ZENG, 1995  Multiple trait analysis of genetic mapping for quantitative trait loci. Genetics 140:1111-1127[Abstract].

KAO, C.-H. and Z.-B. ZENG, 1997  General formulas for obtaining the MLEs and the asymptotic variance-covariance matrix in mapping quantitative trait loci when using the EM algorithm. Biometrics 53:653-665[Medline].

KAO, C.-H., Z.-B. ZENG, and R. D. TEASDALE, 1999  Multiple interval mapping for quantitative trait loci. Genetics 152:1203-1216[Abstract/Free Full Text].

LANDE, R., 1981  Models of speciation by sexual selection on polygenic traits. Proc. Natl. Acad. Sci. USA 78:3721-3725[Abstract/Free Full Text].

LIU, J., J. M. MERCER, L. F. STAM, G. GIBSON, and Z.-B. ZENG et al., 1996  Genetic analysis of a morphological shape difference in the male genitalia of Drosophila simulans and D. mauritiana.. Genetics 142:1129-1145[Abstract].

LONG, A. D., S. L. MULLANEY, L. A. REID, J. D. FRY, and C. H. LANGLEY et al., 1995  High resolution mapping of genetic factors affecting abdominal bristle number in Drosophila melanogaster.. Genetics 139:1273-1291[Abstract].

MAEKAWA, B., T. G. COLE, R. L. SEIP, and D. BYLUND, 1995  Apolipoprotein E genotyping methods for the clinical laboratory. J. Clin. Lab. Anal. 9:63-69[Medline].

ORR, H. A., 1998  Testing natural selection vs. genetic drift in phenotypic evolution using quantitative trait locus data. Genetics 149:2099-2104[Abstract/Free Full Text].

PRICE, C. S. C., 1997  Conspecific sperm precedence in Drosophila.. Nature 388:663-666[Medline].

SAIKI, R. K., T. L. BUGAWAN, G. T. HORN, K. B. MULLIS, and H. A. ERLICH, 1986  Analysis of enzymatically amplified ß-globin and HLA-DQ{alpha} DNA with allele-specific oligonucleotide probes. Nature 324:163-166[Medline].

SHUBIN, N., C. TABIN, and S. CARROLL, 1997  Fossils, genes and the evolution of animal limbs. Nature 388:639-648[Medline].

TANKSLEY, S. D., 1993  Mapping polygenes. Annu. Rev. Genet. 27:205-233[Medline].

TRUE, J. R., J. LIU, L. F. STAM, Z.-B. ZENG, and C. C. LAURIE, 1997  Quantitative genetic analysis of divergence in male secondary sexual traits between Drosophila simulans and Drosophila mauritiana.. Evolution 51:816-832.

ZENG, Z.-B., 1994  Precision mapping of quantitative trait loci. Genetics 136:1457-1468[Abstract].

ZENG, Z.-B., C.-H. KAO, and C. J. BASTEN, 1999  Estimating the genetic architecture of quantitative traits. Genet. Res. in press.




This article has been cited by other articles:


Home page
GeneticsHome page
R. Yang, H. Gao, X. Wang, J. Zhang, Z.-B. Zeng, and R. Wu
A Semiparametric Approach for Composite Functional Mapping of Dynamic Quantitative Traits
Genetics, November 1, 2007; 177(3): 1859 - 1870.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
R. M. Graze, O. Barmina, D. Tufts, E. Naderi, K. L. Harmon, M. Persianinova, and S. V. Nuzhdin
New Candidate Genes for Sex-Comb Divergence Between Drosophila mauritiana and Drosophila simulans
Genetics, August 1, 2007; 176(4): 2561 - 2576.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. Li, S. Wang, and Z.-B. Zeng
Multiple-Interval Mapping for Ordinal Traits
Genetics, July 1, 2006; 173(3): 1649 - 1663.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. J. Moehring, A. Llopart, S. Elwyn, J. A. Coyne, and T. F. C. Mackay
The Genetic Basis of Prezygotic Reproductive Isolation Between Drosophila santomea and D. yakuba Due to Mating Preference
Genetics, May 1, 2006; 173(1): 215 - 223.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. C. Hall, C. J. Basten, and J. H. Willis
Pleiotropic Quantitative Trait Loci Contribute to Population Divergence in Traits Associated With Life-History Variation in Mimulus guttatus
Genetics, March 1, 2006; 172(3): 1829 - 1844.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. K. Kelly
Epistasis in Monkeyflowers
Genetics, December 1, 2005; 171(4): 1917 - 1931.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. M. Gleason, J.-M. Jallon, J.-D. Rouault, and M. G. Ritchie
Quantitative Trait Loci for Cuticular Hydrocarbons Associated With Sexual Isolation Between Drosophila simulans and D. sechellia
Genetics, December 1, 2005; 171(4): 1789 - 1798.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
C. Lexer, D. M. Rosenthal, O. Raymond, L. A. Donovan, and L. H. Rieseberg
Genetics of Species Differences in the Wild Annual Sunflowers, Helianthus annuus and H. petiolaris
Genetics, April 1, 2005; 169(4): 2225 - 2239.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
F. Zou, J. P. Fine, J. Hu, and D. Y. Lin
An Efficient Resampling Method for Assessing Genome-Wide Statistical Significance in Mapping Quantitative Trait Loci
Genetics, December 1, 2004; 168(4): 2307 - 2316.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. J. Moehring, J. Li, M. D. Schug, S. G. Smith, M. deAngelis, T. F. C. Mackay, and J. A. Coyne
Quantitative Trait Loci for Sexual Isolation Between Drosophila simulans and D. mauritiana
Genetics, July 1, 2004; 167(3): 1265 - 1274.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. M. Gleason and M. G. Ritchie
Do Quantitative Trait Loci (QTL) for a Courtship Song Difference Between Drosophila simulans and D. sechellia Coincide With Candidate Genes and Intraspecific QTL?
Genetics, March 1, 2004; 166(3): 1303 - 1311.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
Y. Tao, S. Chen, D. L. Hartl, and C. C. Laurie
Genetic Dissection of Hybrid Incompatibilities Between Drosophila simulans and D. mauritiana. I. Differential Accumulation of Hybrid Male Sterility Effects on the X and Autosomes
Genetics, August 1, 2003; 164(4): 1383 - 1397.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
Y. Tao, Z.-B. Zeng, J. Li, D. L. Hartl, and C. C. Laurie
Genetic Dissection of Hybrid Incompatibilities Between Drosophila simulans and D. mauritiana. II. Mapping Hybrid Male Sterility Loci on the Third Chromosome
Genetics, August 1, 2003; 164(4): 1399 - 1418.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
H. Lan, J. P. Stoehr, S. T. Nadler, K. L. Schueler, B. S. Yandell, and A. D. Attie
Dimension Reduction for Mapping mRNA Abundance as Quantitative Traits
Genetics, August 1, 2003; 164(4): 1607 - 1614.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
O. Carlborg, S. Kerje, K. Schutz, L. Jacobsson, P. Jensen, and L. Andersson
A Global Search Reveals Epistatic Interaction Between QTL for Early Growth in the Chicken
Genome Res., March 1, 2003; 13(3): 413 - 421.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
C.-H. Kao and Z.-B. Zeng
Modeling Epistasis of Quantitative Trait Loci Using Cockerham's Model
Genetics, March 1, 2002; 160(3): 1243 - 1261.
[Abstract] [Full Text] [PDF]


Home page