Genetics, Vol. 149, 999-1017, June 1998, Copyright © 1998
Two Sites in the Delta Gene Region Contribute to Naturally Occurring Variation in Bristle Number in Drosophila melanogaster
Anthony D. Longa,
Richard F. Lymanb,
Charles H. Langleya, and
Trudy F. C. Mackayb
a Center for Population Biology, University of California, Davis, California 95616 and
b Department of Genetics, North Carolina State University, Raleigh, North Carolina 27695
Corresponding author:
Anthony D. Long, Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697, tdlong{at}ucdavis.edu (E-mail).
Communicating editor: P. D. KEIGHTLEY
 | ABSTRACT |
|---|
A restriction enzyme survey of a 57-kb region including the gene Delta uncovered 53 polymorphic molecular markers in a sample of 55 naturally occurring chromosomes. A permutation test, which assesses the significance of the molecular marker with the largest effect on bristle variation in four genetic backgrounds relative to permuted data-sets, found two sites that were independently associated with variation in bristle number. A common site in the second intron of Delta affected only sternopleural bristle number, and another common site in the fifth intron affected only abdominal bristle number in females. Under an additive genetic model, the polymorphism in the second intron may account for 12% of the total genetic variation in sternopleural bristle number due to third chromosomes, and the site in the fifth intron may account for 6% of the total variation in female abdominal bristle number due to the third chromosomes. These results suggest the following: (1) models that incorporate balancing selection are more consistent with observations than deleterious mutation-selection equilibrium models, (2) mapped quantitative trait loci of large effect may not represent a single variable site at a genetic locus, and (3) linkage disequilibrium can be used as a tool for understanding the molecular basis of quantitative variation.
MANY characters of evolutionary, medical, and agricultural importance are quantitative in nature. Variation in these traits can be partitioned into environmental and genetic components, the genetic component presumed to be due to the segregation of alleles at a number of loci that affect the trait (FALCONER and MACKAY 1996
). A major goal of both medical genetics and agricultural genetics is the identification of the genes underlying quantitative traits. An understanding of the number of loci contributing to quantitative variation, the number and effects of alleles at these loci, how sensitive allelic effects are to variants segregating at other loci (i.e., epistasis) and changes in the environment (i.e., genotype by environment and genotype by sex interactions), and the molecular nature of the variants that give rise to quantitative variation, is also crucial to models that attempt to explain the nature of standing variation in quantitative traits (BARTON and TURELLI 1989
), and in the determination of how the adaptations that distinguish closely related species become established (ORR and COYNE 1992
). Despite the importance of understanding the nature of the genes contributing to quantitative variation, our understanding of quantitative traits is still largely limited to statistical descriptions of observed phenotypic variation under assumed underlying genetic models (BARTON and TURELLI 1989
; FALCONER and MACKAY 1996
).
We have been dissecting the genetic basis of standing variation in abdominal and sternopleural bristle number in Drosophila melanogaster to understand the genetic basis of quantitative variation. Bristle number in Drosophila is a good system for addressing this question, as it has been well characterized using classic quantitative genetic approaches (FALCONER and MACKAY 1996
), and a number of studies have documented stabilizing selection on bristle number (LINNEY et al. 1971
; NUZHDIN et al. 1995
; GARCIA-DORADO and GONZALEZ 1996
). Abdominal and sternopleural bristle are external sensory organs of the peripheral nervous system. A large number of candidate genes have been identified through mutants that affect bristle pattern and spacing, and the signaling pathways these genes act in are partially characterized (reviewed in JAN and JAN 1994
). In addition, previous work on the mapping of quantitative trait loci (QTLs) important in short-term response to artificial selection on bristle number (LONG et al. 1995
), quantitative complementation testing of chromosomes containing high and low QTLs to mutants at candidate loci (LONG et al. 1996
), and associations between molecular variants in a candidate gene region and phenotypic variation among a set of wild-derived chromosomes (MACKAY and LANGLEY 1990
; LAI et al. 1994
) have suggested that a small number of loci make large contributions to standing variation in bristle number and that these loci appear to correspond to candidate genes that are known to be important in bristle development. The generality of these initial results will rely on the results of a thorough survey of candidate loci. This work extends these previous results and demonstrates that naturally occurring variants at Delta contribute to standing variation in bristle number.
The QTL mapping study of LONG et al. 1995
mapped a factor to the cytogenetic interval 89D1 to 92E1 that had an effect of approximately three abdominal bristles and one sternopleural bristle; this cytogenetic interval contains the Delta genetic locus. Additional support that Delta may be contributing to naturally occurring variation in bristle number came from quantitative complementation testing studies, in which a mutant of Dl genetically interacted with chromosomes containing high and low bristle QTLs fixed by artificial selection and likely segregating in nature (LONG et al. 1996
), as well as chromosomes which had accumulated mutations with quantitative effects on bristle number (MACKAY and FRY 1996
). Delta is an important and obvious locus at which naturally occurring genetic variation might be expected to contribute to variation in bristle number. Delta is known to be a component of the lateral inhibition process in bristle forming regions of the pupae (PARKS and MUSKAVITCH 1993
; PARKS et al. 1997
). Lateral inhibition involves cell-cell signaling between cells expressing the Delta ligand and the Notch receptor in patches of cells that have the potential to form neurogenic structures (HAENLIN et al. 1994
; KUNISCH et al. 1994
; ARTAVANIS-TSAKONAS et al. 1995
; PARKS et al. 1997
). Cells initially expressing slightly lower levels of the Delta ligand become committed to a non-neurogenic fate through a down regulation, via the Notch receptor pathway [whose downstream members include H, Su(H), E(spl), and ASC], and subsequent failure to up regulate additional Dl expression in the down regulated cell (ARTAVANIS-TSAKONAS et al. 1995
). Experiments in which overall Dl expression levels are artificially raised (or lowered) tend to decrease (or increase) bristle number, consistent with the molecular understanding of Delta function (HEITZLER and SIMPSON 1991
). Based on the molecular biology of Delta it seems plausible that molecular variants in the Delta region of more subtle, quantitative effect may be responsible for naturally occurring variation in bristle number and patterning.
We report here the results of an experiment to associate DNA polymorphisms in the Delta gene region with variation in abdominal and sternopleural bristle number among a representative sample of 55 chromosomes extracted from a natural population. This association study approach has been used in previous studies of bristle number variation (MACKAY and LANGLEY 1990
; LAI et al. 1994
), and is an approach that will be useful in the future study of human diseases (JORDE 1995
; RISCH and MERIKANGAS 1996
; LONG et al. 1997
). If a neutral molecular marker (referred to as the marker site) and a site contributing to variation in a quantitative trait (the quantitative trait nucleotide or QTN) are physically close to one another, they are likely to be in linkage disequilibrium. This linkage disequilibrium will result from the two sites only rarely recombining from one another and hence sharing a common evolutionary history. Disequilibrium between the marker site and QTN can be detected as a significant regression of marker site allelic state on the phenotypic measure. The work presented here differs from previous studies in two main respects. First, we have used a permutation-based statistical approach for assessing marker/phenotype associations that is robust with respect to the number of correlated molecular markers used (CHURCHILL and DOERGE 1994
; DOERGE and CHURCHILL 1996
); and second, we test for marker/phenotype associations in a number of different genetic backgrounds to increase the power of detecting significant associations. Using this approach, we were able to identify two molecular markers in the Delta region that are at intermediate frequency and are associated with bristle variation.
 | MATERIALS AND METHODS |
|---|
Isogenic lines:
The derivation of the lines employed in this study, Drosophila culturing conditions, the crosses and genetic backgrounds employed, and the experimental design used to determine average abdominal and sternopleural bristle number for each line are described in LYMAN and MACKAY 1998
. Briefly, all lines are derived from a wild collection of Drosophila from the Raleigh, NC Farmer's Market in 1988. Average bristle number was assessed in four different genetic backgrounds for each line: (1) homozygous third chromosomes in an otherwise isogenic Samarkand (SAM) background (W), (2) homozygous introgressed fragments of the third chromosome, including Dl, made by repeated rounds of backcrossing through females mutant for Delta in an otherwise isogenic SAM background (B), (3) the same introgressed fragment heterozygous against the Dl mutant (T), and (4) the same introgressed fragment heterozygous against the SAM chromosome (C). In the case of the B, T, and C bristle measurements were made on two independent introgression lines per whole third chromosome (LYMAN and MACKAY 1997). The lines used in this study were examined cytologically for the presence of large inversions on the right arm of chromosome three.
Restriction map analysis:
The Delta region was examined at two levels of resolution to find sites detectable with restriction endonucleases which could be associated with variation in bristle number. Genomic DNA was prepared from approximately 200 flies either homozygous for the entire third chromosome, or a backcross segment including the Delta region, using an SDS lysis, organic extraction, ethanol precipitation protocol (JOWETT 1986
). Initially, each of the lines was examined with four restriction enzymes (EcoRI, HindIII, BamHI, and PstI) with a six base pair recognition sequence and Southern blotting technique (SOUTHERN 1975
). Filters were made using ~12 µg of genomic DNA per restriction enzyme per line blotted onto S&S Nytran neutral charge membranes (Scheicher & Schuell, Keene, OH). These filters were sequentially probed with four phage clones spanning ~52 kb including the Delta transcriptional unit (see Figure 1; kindly provided by M. MUSCAVITZ). Probes were labeled using the Random Prime kit from Boehringer Mannheim (Indianapolis, IN) and either 200 µCi of [
-P32] dCTP or 200 µCi of both [
-P32] dCTP and [
-P32] dGTP, with unincorporated nucleotides removed over a Sephadex column (SAMBROOK et al. 1989
). Hybridization, washing, and autoradiography conditions were carried out as in KRIETMAN and AGUADE (1986; modified from CHURCH and GILBERT 1984
), with similar hybridization (10 mg/ml BSA (Sigma, St. Louis, MO), 1 mM EDTA pH 7.2, 525 mM NaPi pH 7.2, 7% SDS) and wash (40 mM NaPi pH 7.2, 1mM EDTA pH 7.2, 1% SDS) solutions.

View larger version (52K):
In this window
In a new window
Download PPT slide
|
Figure 1.
Summary of restriction map variation in the Delta region. (A) Four phage clones covering the Delta region used in the six-cutter survey. (B) The exon/intron structure of the Delta gene region and flanking DNA (KOPCZYNSKI et al. 1988 ; HAENLIN et al. 1990 ; M. A. T. MUSCAVITZ, personal communication). The five prime- and three prime-most markers are at -24.7 and +24.0, respectively, and the point at which the scale of the figure changes is -3.5. A BamHI site that is 1637 bp 5' to the AUG translation initiation codon (and 964 bp 5' to a strong transcription initiation site) is designated as position 0.0. Thick lines are exons, unfilled are untranslated, and filled are translated; thin lines are flanking DNA (the 5' flanking region is drawn at two different scales as indicated by 1-kb labels). (C) A summary of linkage disequilibria in the Delta region. Hollow circles connecting the disequilibrium table with exon/intron structure represent boundaries of the different fragments surveyed by long PCR and four-cutter enzymes. Labels on the left of the disequilibrium table correspond to those described in Table 4 (those prefixed with Hi, E, P, or B are six-cutter enzymes, and the remaining are four-cutter enzymes). Those sites with a gray bar to the right of the disequilibrium table are unordered but within the interval specified by the bar. Those sites with an asterisk on the right side of the table are the two sites that are significantly associated with variation in bristle number. Squares within the disequilibrium grid are shaded according to the significance of the disequilibrium between pairs of sites (not corrected for multiple tests) assessed using a Fisher's exact test: black squares P < 0.005, gray squares P < 0.01, light gray squares P < 0.05, and unshaded squares P > 0.05.
|
|
The above six-cutter survey resulted in only a small number of polymorphic sites throughout the Delta region, so a second higher resolution survey was carried out using "Long" PCR (BARNES 1994
; CHENG et al. 1994
) coupled with digestion of the PCR products with four base pair recognition sequence endonucleases. This approach used a series of primers designed using published Delta cDNA sequence to amplify introns of unknown sequence. In the case of Dl-5' and Dl-3'
clone-associated restriction fragments were subcloned into pBluescript and sequenced into the subclone far enough to generate a sequence tagged site 5' or 3' to the Delta transcription unit large enough to design PCR primers. The following PCR conditions were used to amplify genomic DNA: 50 µl reaction volume, 1x Tricine Buffer [5x Tricine Buffer: 0.1 M Tricine pH 8.9, 0.42 M KOAc, 0.01 M Mg(OAc)2], 0.5 mM of both forward and reverse primer (Table 1), 0.2 mM of each dNTP, ~20 ng of gDNA, 2 µl Taq polymerase, and 0.5 µl Taq Extender (Stratagene, La Jolla, CA). Cycling conditions were: an initial 2-min denaturation step at 92° followed by 35 cycles with 45 sec at 92°, 45 sec at the correct annealing temperature for a given primer (Table 1), and 24 min at 70° depending on the expected size of the PCR product. From each PCR reaction, 10 µl was transferred to each of 4 microtiter plates and 0.5 µl of a four-cutter restriction enzyme (Alu1, Cfo1, HaeIII, and HpaII) was added to each sample in each of the plates. In the case of Dl-I2 and Dl-I5 the entire experiment was repeated with four additional enzymes (DdeI, RsaI, ScrFI, and TaqI). Plates were sealed with adhesive microplate film and incubated at 37° (65° in the case of TaqI) for ~3 hr. Under the described conditions, these eight four-cutter enzymes appear to completely digest the PCR products (although partial digestions were occasionally observed with HpaII). After digestion, 3 µl of 6X Ficoll loading buffer (SAMBROOK et al. 1989
, with the addition of EDTA pH 8.0 to 0.01 M) was added to the restricted Long PCR product and the products were run overnight at low power on a submarine "Synergel" [1.5% Synergel (Research Products International, Mt. Prospect, IL), 0.5% agarose, 1x TAE], and visualized by staining with ethidium bromide.
As restriction maps were not available for the unsequenced regions of Delta, polymorphisms were scored as the presence or absence of bands. Thus, unlike the data from the six-cutter survey of the Delta region, which could be analyzed using standard methods for studying nucleotide variation, the data from the four-cutter survey was analyzed using a band sharing approach to estimate nucleotide variation (NEI and LI 1979
). A 95% confidence interval was constructed for the sample estimate of nucleotide variation by estimating the same parameter from 1000 bootstrap estimates of the data with each estimated correct for bias by a factor of n/(n - 1), where n is the number of bootstrap samples drawn from the data which is also equal to the number of lines surveyed (WEIR 1996
). Introns 2 and 5 were sequenced, allowing sites located in the PCR fragments corresponding to these regions to be ordered. These sequences are deposited in GeneBank (accession numbers AF035257 and AF035258).
Statistical analysis of molecular marker/phenotypic associations:
For each line in the study, bristle phenotypes were assessed in each of four genetic backgrounds (W, B, T, and C). For analyses of molecular marker/bristle number associations, arithmetic mean bristle phenotypes were used for each background/line/sex (see LYMAN and MACKAY 1998
for a complete description of the lines). In the case of the W genetic background this mean is based on 20 individuals of each sex (ten individuals per vial times two vials), and for the B, T, and C genetic backgrounds it is based on 40 individuals of each sex (ten individuals per vial times two vials times two independent introgression replicates). The molecular marker data used to test for marker/phenotype associations is listed in APPENDIX A. The marker data is a combination of the six-cutter sites and four-cutter bands, with four-cutter bands having identical patterns of presence and absence among the lines collapsed to a single site for the purpose of association testing. In previous work we have carried out quantitative complementation testing on mapped QTLs (LONG et al. 1996
), and mutation accumulation lines (MACKAY and FRY 1996
). In this study we apply an analogous contrast to that employed in previous work, by testing for associations between molecular markers and the difference in bristle number between flies of genotype Bi/Dl3 (T) and those of genotype Bi/Sam (C) over the i = 1 to n lines of the study. The T - C contrast can be thought of as the effect of failure to complement with a control for the additive effect of wild-type alleles. All subsequent significance testing was carried out on a transformed data-set consisting of three response variables, the first two are homozygous entire third and backcross chromosomes and the third the effect of failure complement (W, B, and T - C). Since the variance among line means is different for each of the three response variables, the significance of molecular marker/phenotypic associations were only carried out on line means transformed by sex and response variable to have zero mean and unit variance (effectively weighting each sex and response variable equally). The original average bristle counts in the four genetic backgrounds and the three scaled response variables will be referred to as the "four genetic background" and "three genetic background" data-sets, respectively.
To assess the significance of marker/phenotype associations, a test statistic was constructed that combines phenotypic information from the three genetic backgrounds. Since the measures of bristle number are correlated over the genetic backgrounds (LYMAN and MACKAY 1998
), an ideal test statistic will take these correlations into account. This requirement was met by performing a principal component analysis on the three response variable data-set to reduce the dimensionality of the observations to one for each character and sex. Principal components were separately estimated for each character and sex from the covariance matrix associated with the three genetic backgrounds, and principal component scores were subsequently created by multiplying the three genetic background z-scores by the eigenvector associated with the largest eigenvalue (i.e., the first principal component score; CHATFIELD and COLLINS 1980
). This resulted in a set of four principal component scores (male abdominal, female abdominal, male sternopleural, and female sternopleural bristle number) which were separately analyzed for each character and molecular marker with the linear model Yijk = µ + si + mj + (s*m)ij + c(m)k(j) + s*c(m)ik(j), where Yijk is the principal component score of the ith sex with the jth allele for the lth wild chromosome at the molecular marker under consideration. The F statistics,
m and
s*m, associated with the model terms of marker and sex X marker, respectively (tested with the chromosome in marker and sex X chromosome in marker mean squares, respectively), can now be assessed for significance. Principal component analysis has advantages and disadvantages relative to other approaches we examined, which are discussed later.
At this point there are two statistics,
m, and
s*m, for each bristle character/molecular marker combination, and we wish to assess if variation in bristle number is associated with any of the m molecular markers of the study. Since the m markers under consideration are correlated with one another, it is difficult to derive a theoretical threshold that the
's must exceed to denote statistical significance. Therefore, we assessed the statistical significance of our results using a permutation testing approach for mapping QTLs (CHURCHILL and DOERGE 1994
) combined with sequential removal of significant markers (DOERGE and CHURCHILL 1996
) applied in the disequilibrium mapping context of this work. This method has the advantage of assessing significance using a distribution free statistical approach, while preserving the linkage disequilibria structure of the markers (i.e., the correlational structure of the markers) over permutations. Significance was assessed by calculating the largest value of
,
max, over the m markers of our study for a given character/statistic combination, and then comparing this value to the distribution of
max derived by randomly permuting entire haplotypes for each line with respect to phenotypic data 1000 times and calculating
max each replication. If the
max associated with the nonpermutated haplotypes is in the upper 5% of the permutation tests (i.e., only 50 or fewer values of
max from the permutated data-sets exceed it), the effect of that molecular marker is statistically removed from each of the principal component scores. After statistical removal of any significant molecular markers, the entire permutation testing procedure is repeated on the residuals generated in this manner until no further markers remain significant.
The effect of a given marker in a given sex and genetic background is calculated as the difference between the mean of all lines having that marker versus the mean of all lines lacking the marker. One standard error on the effect of a given marker can be calculated as
where
2x is the variance among lines that have allele x at the molecular marker and Nx is the number of lines that have allele x at the molecular marker. The variance due to a given marker is calculated using the VARCOMP procedure of SAS. Within VARCOMP the following model was fitted for molecular markers that showed sex limited effects: Yijklm = µ + Mi + L(M)j(i) + B(L M)k(ji) + R(B L M)l(kji) +
mlkji, where Yijklm is the bristle number of the mth individual nested in the lth replicated vial, nested in the kth backcross replicate (this effect was excluded in the case of homozygous third chromosomes that were not replicated), nested in the jth line, nested in the ith marker class. All effects were considered random for the purpose of estimation, except the effect of marker which is fixed. For markers that showed phenotypic effects in both sexes the above model was fitted with the inclusion of an additional fully crossed fixed effect of sex. Under a strictly additive model of quantitative variation 2FVG =
22 + 2
2S*L +
2M + 2
2S*M , and similarly 2FVM =
2M + 2
2S*M , where F is the inbreeding coefficient (FALCONER and MACKAY 1996
). For completely homozygous lines F = 1; thus, obvious estimators of the total additive genetic variation (VG) and the total additive genetic variation due to marker (VM) are suggested by the above equations.
 | RESULTS |
|---|
Patterns of bristle variation among Raleigh third chromosomes:
APPENDIX A gives the mean bristle count by sex and genetic background for each of the lines presented in this study. A complete analysis of the patterns of variation among the introgression lines of this study is given in LYMAN and MACKAY 1998
: here we briefly review some of these patterns. Small standard errors were generally associated with estimates of line means (i.e., 0.26, 0.27, 0.38, 0.35 pooled over sexes for abdominal bristle number in the W, B, T, and C genetic backgrounds, and 0.23, 0.20, 0.25, 0.18 for sternopleural bristle number), indicating that variation between lines is primarily genetical in origin. Within a given sex and bristle character, bristle numbers were comparable over genetic backgrounds, the exception being the complementation testing cross background, which generally had a higher average bristle number. The observation of higher bristle number on average in a genetic background containing a Dl loss-of-function allele is consistent with previous phenotypic results involving Delta mutants (LONG et al. 1996
; MACKAY and FRY 1996
), and the proposed function of Delta in which loss-of-function mutants cause an over-commitment to a neurogenic fate. Also in accord with previous work, within a given bristle trait and genetic background, females generally had a greater average number of bristles than males. This effect of sexual dimorphism was more pronounced for abdominal bristle number (range 2.09 to 3.88) than sternopleural number (range 0.19 to 2.36). For both bristle characters, the ordering of the four genetic backgrounds with respect to sexual dimorphism was the same with: T > B > C > W chromosome. Over all experimental treatments, the variance among line means in abdominal bristle number ranged from 0.33 (males in the C genetic background) to 5.36 (females in the W genetic background), and from 0.16 (females in the C genetic background) to 6.30 (females in the W genetic background) for sternopleural bristle number. For both bristle characters, the ordering of the four genetic backgrounds with respect to variance in bristle number was the same, with W > B > T > C. For both bristle characters, the variance among lines was greater for females than for males (with the exception of the C genetic background for sternopleural bristle number). This effect was more pronounced for abdominal bristle number, where the variance among lines in females was approximately twice that of males (the ratio of female to male variance in abdominal bristle number ranged from 1.59 to 3.33), than for sternopleural bristle number, where the variance among lines was, at most, 26% greater in females than in males (the ratio of female to male variance in sternopleural bristle number ranged from 0.58 to 1.26).
Molecular population genetics of the Delta region:
The region surveyed with four six-cutter enzymes was ~57 kb, of which 22 kb includes the entire transcriptional unit of the Dl locus (Figure 1). APPENDIX A gives the molecular marker genotypes for each of the lines presented in this study. Of the 59 restriction sites surveyed with six-cutters, 18 were polymorphic. Of these 18 polymorphic sites, 15 were 5' to the Dl transcriptional start site and are thus unlikely to be useful for detecting molecular marker/phenotype associations in the coding region. From the six-cutter data we estimate
, whose expectation is 4Neµ for selectively neutral substitutions, to be equal to 6.53 x 10-3, with sample variances of 2.39 x 10-6 and 1.18 x 10-5 under infinite and zero recombination, respectively (HUDSON 1982
). The average per-site heterozygosity,
, which was estimated using three different methods, gave very similar values of 7.13 x 10-3 (with variance 1.54 x 10-5; NEI and LI 1979
), 7.14 x 10-3 (with variance 1.54 x 10-5; NEI and TAJIMA 1981
), and 6.96 x 10-3 (ENGELS 1981
). Under a neutral model of evolution, the expectations of
and
are the same, and departures from neutrality, due to forces such as genetic hitch-hiking, can be detected as the normalized difference between
and
, a statistic referred to a TAJIMA's D (KAPLAN et al. 1989
; TAJIMA 1989
; BRAVERMAN et al. 1996
). TAJIMA's D for the six-cutter of data of this sample is 0.272, providing little evidence for a departure from neutrality. The six-cutter survey revealed an atypically low amount of insertion/deletion variation, with no large insertions, and only three small deletions. Two of the deletions were observed once in the set of 55 lines (a 50-bp deletion between -0.63 and 0.0 seen in line 34, and a 150-bp deletion between 5.9 and 6.2 seen in line 8), the other deletion was seen in four of the 55 lines (a 50-bp deletion between 23.7 and 24.6 in lines 17, 36, 56, and 81).
Using a survey approach that combines Long-PCR, and subsequent restriction digests with a series of four-cutter enzymes, followed by electrophoretic separation, we were able to score 112 polymorphic and 156 monomorphic bands throughout ~27 kb of the Dl region that includes the entire Dl transcription unit. This survey was carried out by dividing Dl into eight PCR fragments corresponding to the 5' region, Introns 1 through 5, Exon 6, and the 3' untranslated region (the most 5' part of the survey was -3.15 and the most 3' was +23.7). This four-cutter survey resulted in data that was difficult to interpret with respect to the presence or absence of restriction sites. Four-cutter surveys are generally carried out on segments with a known consensus sequence that can be used to interpret banding patterns relative to the gain or loss of sites (KREITMAN and AGUADE 1986
), but Dl sequence is not available for much of the region surveyed. Although many sites could be easily interpreted as the gain or loss of sites, some sites could not be interpreted in this manner, so we estimated per-site heterozygosity using a band sharing approach (NEI and LI 1979
). Using this approach on the four-cutter data, we estimate
to be 9.84 x 10-3 for the Dl region, with a bootstrap 95% confidence interval on the sample estimate of 9.13 x 10-3 to 1.06 x 10-2. Estimates of nucleotide variation in the Delta region are typical relative to other loci that have been surveyed in D. melanogaster despite the observation of zero variable sites in Intron 4 and Exon 6. (These two fragments show 20 and 14 monomorphic bands, respectively). Estimates of
associated with the other fragments were typical, ranging from 6.11 x 10-3 to 1.69 x 10-2. As a number of the surveyed bands could not easily be interpreted as site variants, it is possible they represent small insertion/deletion variants not detectable by the six-cutter survey. Thus, it is not possible to partition the estimate of
into variation due to insertions and deletions versus restriction site polymorphisms. As the objective of this study was to find molecular markers throughout the Dl region that may be in linkage disequilibrium with a putative site causing bristle differences in natural populations, it is not important at this stage to distinguish between these two types of molecular events. The Long-PCR/four-cutter approach of this work does seem an efficient means to generate molecular markers in a large candidate gene region of interest.
All polymorphisms that had a frequency of greater than three in the sample of 55 lines were examined for pairwise linkage disequilibrium using Fisher's exact test. Figure 1 shows the patterns of disequilibria for the Delta region. Of the 1378 pairs of sites tested, 40, 57, and 118 were significant at 0.5, 1.0, and 5.0 percent levels, respectively (not corrected for multiple tests). The number of pairs of sites for which significant values of the Fisher's exact test were observed is slightly more than what one would expect by chance: 2.9, 4.1, and 8.6 percent were observed significant at 0.5, 1.0, and 5.0 percent levels, respectively. From Figure 2 it appears this small excess of linkage disequilibria, over what would be expected by chance alone, is primarily due to sites physically close to one another. When R2 (the correlation coefficient is a measure of disequilibrium) for all pairs of markers with frequencies of 20% or greater is plotted against distance between the markers, all pairs of markers with values of R2 greater than 0.3 are within 5 kb of one another. Hudson's estimator of 4Nc (an estimate of four times the effective population size times the probability of a recombinational event per gamete per generation for the region under consideration, subsequently referred to as
) was applied to the data to derive an estimate of the number of recombination events that have occurred in the history of the sample (HUDSON 1987
). Based on the six-cutter data,
was 725, or 14.9
/kb, whereas the estimate of
from the four-cutter data was 165, or 6.2
/kb. In the four-cutter survey, we observed 41 polymorphic sites in 26.8 kb; it follows that the average distance between polymorphic sites was 0.65 kb or 4
. It follows from Hudson's estimator of
for the four-cutter data that one
is approximately 175 bp, which coincides with the inflexion point in the theoretical curve fit to the relationship between R2 and distance between markers of Figure 2.
Associations between molecular variants and bristle number:
We tested whether the 53 polymorphic molecular markers of the survey (those with a frequency of 3 or greater in the survey) were associated with variation in bristle number using a principal component analysis which combined data from the three genetic backgrounds (W, B, and T-C ) coupled with a permutation testing approach. Table 2 lists the results of the analyses carried out to detect associations between molecular variants at Delta and variation in bristle number (marker names are defined in APPENDIX A). Marker associations carried out on the first principal component (which accounts for 48, 62, 56, and 62% of the variation in male abdominal, female abdominal, male sternopleural, and female sternopleural bristle number, respectively) resulted in Ha + 8.6 being significantly associated with variation in sternopleural bristle number (P < 0.009) and S + 18.6 significantly associated with an abdominal bristle number by sex interaction (P < 0.037). The effect of Ha + 8.6 was subsequently statistically removed from the component scores in both sexes and for both bristle characters and the permutation testing procedure repeated. Marker S + 18.6 remained significantly (P < 0.028) associated with an abdominal bristle number by sex interaction. This demonstrates that Ha + 8.6 and S + 18.6 are independently associated with variation in sternopleural and abdominal bristle number. When the effects of both Ha + 8.6 and S + 18.6 are statistically removed from the component scores, no sites remain significantly associated with variation in bristle number. Although not significant, once the effects of Ha + 8.6 and S + 18.6 are statistically removed from the model, Ha - 2.5 approaches statistical significance for abdominal bristle number (P < 0.075).
View this table:
In this window
In a new window
|
Table 2.
Results of permutation testing procedure used to evaluate the significance of marker/bristle trait associations
|
|
The second principal component scores accounted for 33, 21, 26, and 23% of the variation in male abdominal, female abdominal, male sternopleural, and female sternopleural bristle number, respectively, in the three genetic background data-set. Molecular marker/phenotype association analyses were carried out on the second principal component scores and no sites were significantly associated with variation in bristle number. That variation in the second principal component scores was not associated with any molecular markers may be expected, given that the second principal component generally accounted for much less of the total variation in bristle number than the first principal component.
Table 2 gives P-values associated with four rounds of testing for statistically significant associations between DNA sequence polymorphisms and variation in bristle number for each of the four characters in the study, the four characters studied being the two bristle characters (abbreviated AB and SB) and each character for a marker by sex interaction (abbreviated AB*X and SB*X). Although the second principal component scores are independent of the first principal component scores (by definition), a Bonferroni correction to the first principal component scores would be overly conservative. Since the second principal component score accounts for much less of the total variation in the data-set than the first principal component score (on average 21% vs. 62%), tests for molecular marker/phenotype associations with the second principal component score are a priori less likely to be significant compared to those for the first principal component. Abdominal and sternopleural bristle counts are correlated phenotypically, but only weakly genetically correlated, therefore it is unclear how to correct for multiple tests over these two characters. As with the second principal component scores, marker by sex interactions for a given bristle trait have lower statistical power a priori than a marker site main effect test. Thus, it is also unclear how statistical significance of marker by sex interactions should be assessed relative to marker main effects. Ultimately, replication is desirable to firmly establish the significance of any marker/phenotype associations, nonetheless, the significance levels observed in the study indicate these associations are indeed statistically significant.
Figure 3 and Figure 4 are plots of the value of the F statistics (i.e., the
's) associated with the first principal component derived from the three genetic background data-set for each of the molecular markers in this study as a function of marker position. These F statistics are presented separately for sternopleural and abdominal bristle number in Figure 3 and Figure 4, respectively, with the two panels within each figure showing the F statistics for a model that tested the main effect of a given marker over both sexes (labeled SB and AB in the two figures), or for a model that tested the effect of a marker by sex interaction (labeled SB*sex and AB*sex in the two figures). These figures provide a visual means of assessing the significance of different markers relative to their position. In Figure 3 (bottom panel) the marker with a F statistic of 14.6 in Intron 2 is the marker found significant using the permutation test for SB (Ha + 8.6), and in Figure 4 (top panel) the marker with an F statistic of 13.4 in Intron 5 is the marker found significant using the permutation test for AB by sex (S + 18.6). As the significance of associations were assessed using a permutation testing approach, there is no meaningful threshold to place on these figures to indicate statistical significance. A helpful guide may be to recall that an F1,54-statistic of 8.6 is normally considered significant at P < 0.005 for independent tests, and an F1,54-statistic of 12.2 corresponds to an experiment-wise significant level of P < 0.05 (corrected for the 53 markers considered). Considering these thresholds, no markers exceed the theoretical critical F of 8.6 but fail to exceed 12.3, and only two markers exceed 12.3 (Ha + 8.6 for SB and S + 18.6 for AB*X). These are the same two markers that are significant by the permutation testing procedure. In Figure 3 and Figure 4 the large F statistics appear to be clustered. This is to be expected, as markers close to one another are often in linkage disequilibrium and, as a result, are likely detecting the same marker/phenotype association. A lemma of this is that when the effect of a significant marker is statistically removed, "satellite" F statistics disappear.

View larger version (17K):
In this window
In a new window
Download PPT slide
|
Figure 3.
Plots of the values of the F statistics of models testing for molecular markers that are associated with variation in the first principal component score derived from the three genetic background data-set (W, B, and T-C) for sternopleural bristle number. F statistics are plotted as a function of the estimated physical position of the marker site in the Delta gene region. The locations of some markers 5' to the Delta gene are drawn at a different scale as indicated in the small figure of the Dl transcription unit. The top panel shows F statistics associated with a model that tests for a marker by sex association with the phenotype, and the bottom panel a model testing only for the main effect of the molecular marker. The F statistics are proportional to the variance attributable to a given marker.
|
|

View larger version (18K):
In this window
In a new window
Download PPT slide
|
Figure 4.
Plots of the values of the F statistics of models testing for molecular markers that are associated with variation in the first principal component score derived from the three genetic background data-set (W, B, and T-C) for abdominal bristle number.
|
|
Figure 5 gives the effect of Ha + 8.6 on sternopleural bristle number in males (top panel) and females (bottom panel) in each of the four genetic backgrounds and for the first principal component score associated with the three genetic background data-set. It can be seen in Figure 5 that the effect of Ha + 8.6 on sternopleural bristles is very consistent over the two sexes. The estimated effect of this polymorphism is 0.6 and 0.7 bristles in the male and female backcross (B) lines, respectively. The whole chromosome (W) lines gave the largest estimated effect, while the introgressed chromosome over a Delta loss-of-function mutant (T) lines gave an estimate very comparable to the backcross lines, and the introgressed chromosome over a wild-type chromosome (C) effect was about half as large as the backcross lines. In all four genetic backgrounds, effects were in the same direction, with the presence of the Ha + 8.6 marker associated with increased sternopleural bristle number.

View larger version (16K):
In this window
In a new window
Download PPT slide
|
Figure 5.
Sternopleural bristle number effects associated with molecular marker Ha + 8.6 (a HaeIII site in Intron 2) in each of the genetical backgrounds in which it was measured (W, B, T, and C), and the effect associated with the first principal component score (PC1). The top panel gives estimates of effects in males and the bottom in females. Effects are the mean of all the lines having a value of 1 for that marker (presence of a band or restriction site) minus the mean of all the lines having a 0 for that marker (absence of a band or restriction site), with error bars representing one standard error on the estimate of the effect (see text). All effects are expressed in sternopleural bristles except those associated with the principal component scores. Effects associated with marker Ha + 8.6 are very consistent over sexes.
|
|
Figure 6 gives the effect of S + 18.6 on abdominal bristle number in males (top panel) and females (bottom panel) in each of the four genetic backgrounds and for the first principal component scores associated with the three genetic background data-set. Unlike the effect of Ha + 8.6 on sternopleural bristle number, the effect of S + 18.6 on abdominal bristle number is very different in the two sexes. From the figure, it appears that this polymorphism has no measurable effect in males. The observation of sex-specific allelic effects associated with molecular markers in the Delta region for abdominal bristle number is consistent with a phenotypic analysis of the backcross chromosomes where a large sex by line component of variation was observed (LYMAN and MACKAY 1998
). The estimated homozygous effect (from the backcross lines) of S + 18.6 in females is approximately 1.1 abdominal bristles, which is in almost complete agreement with the estimates from the whole chromosome lines and the introgression over Delta loss-of-function lines. The estimated effect associated with the introgressed chromosome over wild-type chromosome lines (C) is 37% of the estimate from the backcross lines, consistent with additivity. Like the effect of Ha + 8.6 on sternopleural bristle number, the effect of S + 18.6 on female abdominal bristle number was in the same direction in all genetic backgrounds (presence of the site associated with decreased bristle number).

View larger version (15K):
In this window
In a new window
Download PPT slide
|
Figure 6.
Abdominal bristle number effects associated with molecular marker S + 18.6 (a ScrFI site in Intron 5) in each of the genetical backgrounds in which it was measured (W, B, T, and C) and the effect associated with the first principal component score (PC1). The top panel gives estimates of effects in males and the bottom in females. Effects are the mean of all the lines having a value of 0 for that marker minus the mean of all the lines having a 1 for that marker, with error bars representing one standard error on the estimate of the effect (see text). All effects are expressed in abdominal bristles except those associated with the principal component scores. Effects associated with S + 18.6 are only significantly different from zero in females, that is, the effect of S + 18.6 appears sex-limited.
|
|
Table 3 gives the variance attributable to sites Ha + 8.6 and S + 18.6, and the total genetic variance, in each of the four genetic backgrounds of this study. The effects associated with the two sites significantly associated with bristle variation appear large relative to total additive genetic variation. For example, considering only effects and variance in females for abdominal bristle number, the effect associated with S + 18.6 is 0.78 whole third chromosome background genetic standard deviations (i.e., effect
), and 1.06 homozygous backcross background genetic standard deviations. Similarly, for sternopleural bristle number averaged over sexes, the effect of Ha + 8.6 is 1.06 whole third chromosome background genetic standard deviations, and 1.26 homozygous backcross background genetic standard deviations. The two molecular markers associated with bristle number variation may also contribute a significant amount to standing genetic variation in bristle number. The additive genetic variation in abdominal bristle number attributable to S + 18.6 in females is 5.7% and 8.2% of the total genetic variation due to homozygous third chromosomes and homozygous backcross segments, respectively. Similarly, the additive genetic variation in sternopleural bristle number attributable to Ha + 8.6 averaged over sexes is 12.4% and 16.6% of the total genetic variation due to homozygous third chromosomes and homozygous backcross segments, respectively. Estimates of variance attributable to these sites is expected to be greater in the homozygous backcross background than the homozygous entire third chromosome background since the total genetic variation due to backcross chromosomes is less than that due to homozygous entire third chromosomes, possibly reflecting the presence of other factors on the third chromosome located outside the introgressed region.
View this table:
In this window
In a new window
|
Table 3.
Variance components associated with significant marker alleles over all genetic backgrounds
|
|
Polymorphic inversions and variation in bristle number:
All the lines included in the survey were Standard cytotype, with the exception of lines 38 [In(3R)86D-90D], 94 [In(3R)Payne], and 97 [In(3R)92E-100E]. In (3R)Payne is a common cosmopolitan inversion often observed in collections of D. melagaster from the wild, and the other two inversions are unique endemics LEMEUNIER and AULARD 1992). Data were reanalysed to determine if the associations observed between DNA polymorphisms in the Delta gene region and variation in bristle number could be due to the presence of inversions. The data were re-analysed with the combined effect of the three inversion containing lines statistically removed from the data, as well as with the three inversion containing lines dropped from the analysis. In both cases, the P-values associated with marker Ha + 8.6 changed by less than one percent. Those associated with S + 18.6 changed less than 1% when the effect of inversions was statistically removed and by 3.3 percent when the three inversion containing lines were dropped from the analysis. In addition, the P-values associated with an analysis of variance of the effect of inversion (and an inversion by sex interaction) on variation in abdominal and sternopleural bristle number did approach statistical significance.
 | DISCUSSION |
|---|
Two polymorphic markers in the Delta gene region are significantly associated with bristle variation:
Ha + 8.6 in Intron 2 is associated with variation in sternopleural bristle number in both males and females and S + 18.6 in Intron 5 is associated in a sex-limited manner with abdominal bristle number in females. These two sites are not in linkage disequilibrium with one another, and the effect associated with each site is apparently unaffected by statistical removal of the other. Thus, these two sites appear to be independently acting on variation in bristle number, despite their close physical proximity (10 kb), and likely mode of action through the same gene product (Dl). The observation of at least two sites at the Delta locus contributing to standing variation may be inconsistent with discussions in the literature regarding how adaptation may occur. In these discussions genes (or mapped factors) of large effect are often equated to mutations of large effect (ORR and COYNE 1992
). Although adaptation may appear to be due to a single large effect genetic locus, the effect of this locus may result from multiple substitutions over evolutionary time. As no genes contributing to adaptive differences between species have been dissected at the resolution necessary to distinguish between one or multiple substitution events at the same locus, we cannot conclude at this time how evolution proceeds. The thesis that multiple substitutions at the same locus may contribute to short-term evolutionary change appears supported by these observations at Delta, where at least two sites contribute to standing variation in bristle number. Additional support comes from previous studies on Adh, where at least three sites in the Adh gene region contribute to standing variation in Adh protein levels (STAM and LAURIE 1996
).
The estimated effects associated with both Ha + 8.6 and S + 18.6 are large relative to standing genetic variation in bristle number; likely too large to be consistent with Gaussian allelic effects models for the maintenance of additive genetic variation by the mutation selection balance theory (BARTON and TURELLI 1989
). It is also unlikely that the QTNs detected in this study are rare. For example, if it is assumed the QTN affecting sternopleural bristle number detected by Ha + 8.6 in the B background is at a frequency of less than 5% in the sample, then for the effect associated with this QTN to be consistent with a 95% lower bound on the estimated effect of Ha + 8.6 the QTN's effect would have to be on the order of four backcross genotypic standard deviations {see LAI et al. 1994, footnote #24, frequency of Ha + 8.6 is 0.40, 95% lower bound is [effect of Ha + 8.6 - (1.96 x standard error from Figure 5)], genotypic standard deviation from Table 3}. QTNs of this magnitude would have been detected by inspection of the phenotypic data, thus the QTNs detected in this study are likely too common to be consistent with House of Cards models for the maintenance of additive genetic variation by the mutation selection balance theory (BARTON and TURELLI 1989
). Recent experiments suggest that there are detectable levels of stabilizing selection on bristle number, so it is difficult to conclude that the variation of this study is neutral with respect to fitness (LINNEY et al. 1971
; NUZHDIN et al. 1995
; GARCIA-DORADO and GONZALEZ 1996
). It is possible to explain the presence of variants of large effect at intermediate frequency if the net effect of such variants with respect to fitness is zero, but then it is difficult to explain the apparent stabilizing selection on the trait (KEIGHTLEY and HILL 1990
). Selective models, with the variants being maintained as balanced polymorphisms (possibly aided by spatial or temporal variation in the environment), or through antagonistic pleiotropy, seem more amenable to further empirical testing at this time. Antagonistic pleiotropy is a particularly attractive hypothesis to test in the case of S + 18.6, where allelic effects are different in the two sexes, as this may represent an example of sex-specific trade-offs.
The estimated bristle number effects associated with Ha + 8.6 and S + 18.6 are smaller than those estimated via a QTL mapping approach of earlier work (1.4 sternopleural bristles averaged over sexes, and 3.6 and 2.4 abdominal bristles in males and females, respectively; LONG et al. 1995
). The discrepancy in the estimates of effects from the two experiments does not necessarily imply these two studies are inconsistent with one another. QTL mapping tends to over-estimate effects if there are linked QTLs, or multiple sites at the same locus, affecting a character. In contrast, it can be shown that association studies tend to under-estimate the variance attributable to a QTN by a factor of 1/R2, where R2 is the correlation between marker and QTN (LAI et al. 1994
, footnote #24). Even considering this discrepancy, the large differences in estimated effects makes it appear that the factors detected in this association study do not completely correspond to those detected in earlier QTL mapping work. For example, the QTL affecting abdominal bristle number that mapped to the Delta region (LONG et al. 1995
) had a larger effect in males than in females, whereas the factor detected in this association study has no effect in males and a large effect in females. It seems likely that the discrepancy in estimates of effects using the two different approaches to characterizing quantitative variation is due to one, or a combination of, the following reasons: (1) QTLs linked to (but not at) the Delta locus led to inflated estimates of effects in the earlier QTL mapping experiment (another candidate gene known to affect bristle number, Hairless, is only 3.5 cM from Delta), (2) the selection experiment used to generate the high and low parental strains used in the QTL mapping experiment may have fixed variants at the Delta locus that are rare or not represented in the sample of 55 random chromosomes used in this study, or (3) differences in estimates of effects from the two experiments result from epistatic interactions between factors at Delta and loosely linked factors that exist in the set of third chromosome recombinant inbred lines of the earlier QTL mapping experiment, but do not exist in the backcross lines of the current experiment. The possibility that the difference between estimates of allelic effects in the two studies is due to epistatic interactions among factors is plausible, as significant epistatic effects were observed in the earlier study of LONG et al. 1995
.
Two previous studies have detected associations between DNA polymorphisms at candidate loci important in peripheral nervous system development and variation in bristle number. MACKAY and LANGLEY 1990
found that insertions of transposable elements in the achaete-scute region were associated with a reduction in bristle number, consistent with achaete-scute weak loss-of-function mutations. As the frequency of any given transposable element insertion is rare in samples from natural populations, these results are consistent with deleterious mutation-selection equilibrium models for the maintenance of quantitative variation. A second study found that DNA polymorphisms in the scabrous gene region were associated with variation in bristle number more often than one would expect by chance alone, and that insertions of transposable elements in the scabrous region were not associated with bristle number variation (LAI et al. 1994
). These results appeared more consistent with models incorporating balancing selection than deleterious mutation-selection models, as the sites that were associated with bristle number variation were generally common. The statistical significance of the associations observed in the scabrous study was assessed by permutation testing the number of markers significant at a marginal significance level of P < 0.05, yielding a set of marginally significant markers. In the present study, significance was assessed by permutation testing the F-statistic associated with the most significant marker, yielding a prioritized set of associated markers. The performance of these two statistical tests under different models of quantitative variation is an interesting topic for further investigation.
The protein product of Delta is conserved in both sequence and function, is utilized in a number of different tissues and times during development, yet contributes to standing variation in bristle number:
The product of the Delta gene is the ligand, and that of the Notch gene is the receptor, in the well-characterized Notch signal transduction pathway (ARTAVANIS-TSAKONAS et al. 1995
). Bristle patterning and spacing is believed to be the result of a process known as lateral inhibition, whereby cell-cell interactions cause feed back loops between cells expressing Notch and Delta, resulting in some cells assuming a neuronal fate (i.e., a bristle) and others a non-neuronal fate (HEITZLER and SIMPSON 1991
). Delta homologues have also been identified in human, mouse, chick, Xenopus, and Caenorhabditis elegans, where the protein sequence is conserved, as is its apparent role in the Notch signaling pathway. In addition, the function of Delta, in defining a commitment to a neuronal versus non-neuronal fate, appears to be conserved in Xenopus (CHITNIS et al. 1995
), chick (HENRIQUE et al. 1995
), and mice (HRABE DE ANGELIS et al. 1997
; in mice Delta appears to have assumed an additional role in the maintenance of somite borders). Delta-Notch signaling is involved in many important developmental processes in addition to bristle patterning and spacing, as evidenced by the pleiotropy observed for different Delta mutants (VASSIN et al. 1987
; ALTON et al. 1988
) and the many different times and tissues in which Delta is expressed during development (HAENLIN et al. 1990
; PARKS and MUSKAVITCH 1993
). The observation that the Delta region has very few insertions of transposable elements (only three small insertions) relative to other loci that have been surveyed in Drosophila is consistent with the idea that the Delta region has a number of cis-acting regulatory sequences for which a change in physical distance from each other and/or the Delta transcription unit is strongly deleterious (i.e., insertions in the Delta region are quickly eliminated in natural populations). The large number of Delta spontaneous mutants that have been identified suggests that the Delta locus is as much of a target for transposable element insertions as other loci (LINDSLEY and ZIMM 1992
). The typical levels of DNA heterozygosity observed for the Delta region provides no support for the hypothesis that Delta has experienced recent selective sweeps (KAPLAN et al. 1989
).
The two sites, Ha + 8.6 and S + 18.6, that are significantly associated with bristle variation are located in the large second (4.0 kb) and fifth (2.9 kb) introns of Delta, respectively. Although the permutation testing approach we describe does not localize the causative site, physical locations close to Ha + 8.6 and S + 18.6 are more likely to harbor the causative site than locations further away. It is likely that the QTNs we have localized to somewhere in the Delta region are in one of the introns or the translated portion of Delta, rather than either the 5' or 3' untranslated flanking regions. It is possible that the Delta variants that are responsible for variation in bristle number are site polymorphisms in binding domains for proteins that regulate Delta expression levels. The first intron of Delta is known to contain an enhancer(s) of expression in the embryo (HAENLIN et al. 1994
), so it is possible that the other introns also have, as yet unidentified, enhancer elements important later in development (021 hr and 730 hr after puparium formation for macrochaetae and microchaetae, respectively) when normal Delta expression is important for bristle spacing (PARKS and MUSKAVITCH 1993
). Our data do not necessarily implicate such regulatory variants as contributing to standing variation though, as we can not rule out polymorphisms resulting in amino acid changes, or variants that affect splicing efficiency of the Delta pre-mRNA. With regard to the last possibility, a peculiar feature of Delta transcription is that the introns of Delta are not immediately degraded after being spliced out of the pre-mRNA (as with other genes), and also remain localized to the sites of transcription (KOPCZYNSKI and MUSKAVITCH 1992
).
The permutation testing approach provides a robust and general method for testing for associations between molecular markers and phenotypic variation:
In addition to applying the permutation testing procedure to F-statistics derived from a univariate analysis of pricipal component scores, we also applied the permutation testing procedure to likelihood ratio test statistics estimated from applying a multivariate general linear model to the data (KENDALL and STUART 1979
; results not shown). The two approaches gave comparable results, although the approach based on the likelihood ratio seemed less powerful than that based on the principal component scores. The difference in power is likely due to a biological constraint that was difficult to incorporate into the multivariate likelihood ratio test statistic. Although we may expect the phenotypic effect of a causative site in linkage disequilibrium with one of our marker sites to have different effects in the different genetic backgrounds (and hence response variables) studied, we expect the direction of the effects to be the same in all genetic backgrounds. Since the multivariate likelihood ratio test statistic is completely a function of the covariance structure of the error (or residual) matrix that results from fitting a specific model, it follows that a marker whose effects are in different directions in the different genetic backgrounds, but always large, would not weaken the model (i.e., the column vector of B's associated with the marker effect of the design matrix could have both large positive and negative values without weakening the overall likelihood; KENDALL and STUART 1979
). Under almost any genetic model that purports to explain the nature of quantitative variation, the assumption that alleles should act in the same direction in a number of genetic backgrounds will be met. Thus we consider the permutation testing approach based on the principal component scores to be a more powerful statistical approach for detect marker/phenotype associations than the likelihood based test statistic. If it is discovered in the future that the alleles that affect quantitative traits are massively epistatic to one another (enough that genetic background can result in reversals of allelic effects), then statistical analyses based the multivariate likelihood ratio test statistic may be more appropriate. If this were the case, principal components other than the first are expected to account for a significant proportion of the total variation over genetic backgrounds, which was not true for Delta.
A molecular marker can be significantly associated with bristle number variation using the permutation test for one of three reasons: (1) there is no QTN in the population and the association between marker and phenotype is due to chance, (2) there is a QTN in the population that is in linkage equilibrium with markers in the Dl gene region, and by chance it is in linkage disequilibrium with a marker in the Dl region in the sample of this study, or (3) there is a QTN in linkage disequilibrium with sites in the Dl region in the population and this association is reflected in the sample. The P-value associated with the permutation testing procedure is the probability that the association is due to outcomes (1) or (2). Given the small P-values of the permutation tests associated with Ha + 8.6 and S + 18.6, the implication is that sites in linkage disequilibrium with polymorphic sites in the Dl region in the population are contributing to bristle number variation. Past studies have observed linkage disequilibrium between inversions and allozyme loci. Disequilibrium between allozymes and inversions on the same arm can be large and replicable over different studies, whereas disequilibrium between inversions and allozymes on opposite arms is small or nonexistent (reviewed in LEMEUNIER and AULARD 1992
). As only three of the lines surveyed contained inversions on the right arm of the third chromosome, and these inversions have little impact on observed associations between the two significant markers and variation in bristle number, it is unlikely that the presence of inversions can account for observed marker/phenotype associations. In Drosophila, linkage disequilibrium is rarely observed over large distances in sets of inversion-free chromosomes (e.g., AQUADRO et al. 1986
; MIYASHITA and LANGLEY 1988
),