- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Balakirev, E. S.
- Articles by Ayala, F. J.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Balakirev, E. S.
- Articles by Ayala, F. J.
DNA Polymorphism in the ß-Esterase Gene Cluster of Drosophila melanogaster
Evgeniy S. Balakireva,b, V. R. Chechetkinc, V. V. Lobzinc, and Francisco J. Ayalaaa Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525,
b Institute of Marine Biology, Vladivostok 690041, Russia, and Academy of Ecology, Marine Biology, and Biotechnology, Far Eastern State University, Vladivostok 690600, Russia
c Troitsk Institute of Innovation and Thermonuclear Investigations (TRINITI), Theoretical Department of Division for Perspective Investigations, 142190 Troitsk, Moscow Region, Russia
Corresponding author: Francisco J. Ayala, 321 Steinhaus Hall, University of California, Irvine, CA 92697-2525., fjayala{at}uci.edu (E-mail)
Communicating editor: N. TAKAHATA
| ABSTRACT |
|---|
We have analyzed nucleotide polymorphism within a 5.3-kb region encompassing the functional Est-6 gene and the
Est-6 putative pseudogene in 28 strains of Drosophila melanogaster and one of D. simulans. Two divergent sequence types were detected, which are not perfectly associated with Est-6 allozyme variation. The level of variation (
) is very close in the 5'-flanking region (0.0059) and Est-6 gene (0.0057), but significantly higher in the intergenic region (0.0141) and putative pseudogene (0.0122). The variation in the 3'-flanking region is intermediate (0.0083). These observations may reflect different levels of purifying selection in the different regions. Strong linkage disequilibrium occurs within the region studied, with the largest values revealed in the putative pseudogene and 3'-flanking region. Moreover, recombination is restricted within
Est-6. Gene conversion is detected both within and (to a lesser extent) between Est-6 and
Est-6. The data indicate that
Est-6 exhibits some characteristics that are typical of nonfunctional genes, while other characteristics are typically attributed to functional genes; the same situation has been observed in other pseudogenes (including Drosophila). The results of structural entropy analysis demonstrate higher structural ordering in Est-6 than in
Est-6, in accordance with expectations if
Est-6 is indeed a pseudogene. Taking into account that the function of
Est-6 is not known (but could exist) and following the terminology of J. Brosius and S. J. Gould, we suggest that the term "potogene" may be appropriate for
Est-6, indicating that it is a potential gene that may have acquired some distinctive but unknown function.
THE ß-esterase gene cluster is on the left arm of chromosome 3 of Drosophila melanogaster, at 68F769A1 in the cytogenetic map (but see ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Est-6. ![]()
![]()
Our earlier investigation of
Est-6 (![]()
Est-6 putative pseudogene and Est-6 gene in a random sample of D. melanogaster derived from a natural population of California. The full sequence now analyzed is 5394 bp long and includes the 5'-flanking region, complete Est-6 gene, intergenic region,
Est-6 putative pseudogene, and 3'-flanking region. The data for the 5'-flanking region and Est-6 gene (1686 bp) are from ![]()
| MATERIALS AND METHODS |
|---|
Drosophila strains:
The 28 D. melanogaster strains were derived from a random sample of wild flies collected by F. J. Ayala (October 1991) in El Rio Vineyard, Acampo, California. The strains were made fully homozygous for the third chromosome by crosses with balancer stocks, as described by ![]()
|
DNA extraction, amplification, and sequencing:
Total genomic DNA was extracted using the tissue protocol of the QIAamp tissue kit (QIAGEN, Valencia, CA). The D. melanogaster Est-6 sequence (GenBank accession nos.
M33780 and
M33781; ![]()
Est-6 sequences.) At least two independent PCR amplifications were sequenced for each polymorphic site in all D. melanogaster strains to prevent possible PCR or sequencing errors.
DNA sequence analysis:
The sequences were assembled using the program SeqMan (Lasergene, DNASTAR, 19941997). The computer programs DnaSP, version 3.4 (![]()
![]()
![]()
![]()
![]()
![]()
![]()
Entropy analysis:
If
Est-6 is in fact a pseudogene or nonessential gene, one could expect lower structural regularity and higher structural divergence in this putative pseudogene than in its functional paralogous gene, Est-6. These features can be quantitatively assessed with the proper structural analysis of the relevant sequences. Our approach is based on spectral methods previously developed (![]()
![]()
![]()
![]()
![]()
![]()
First, we begin with the necessary definitions. The Fourier harmonics corresponding to the nucleotides of type
(where
is A, C, G, or T) in a sequence of length M are defined as
![]() |
(1) |
where
m,
indicates the positions occupied by the nucleotides of type
,
m,
= 1 if the nucleotide of type
occupies the mth site, and 0 otherwise. The amplitudes of Fourier harmonics (or structure factors) are expressed as
![]() |
(2) |
where the asterisk denotes complex conjugation. The zeroth harmonics depending only on the nucleotide composition do not contain structural information and are discarded below. Due to the symmetry property
![]() |
(3) |
the spectra for structure factors can be restricted to their left halves from n = 1 to
![]() |
(4) |
where the brackets denote the integer part of the quotient. The structure factors will always be normalized with respect to mean spectral values,
![]() |
(5) |
where N
is the total number of nucleotides of type
in a sequence of length M. The structural regularity of the nucleotides of type
in a sequence of length M is assessed with the spectral structural entropy
![]() |
(6) |
The value of spectral entropy for a counterpart random sequence having the same nucleotide composition is the highest and the corresponding mean characteristics averaged over an ensemble of various random realizations are given by
![]() |
(7) |
Using the values (7) as the reference characteristics, it is convenient to introduce the relative spectral structural entropy
![]() |
(8) |
as well as the relative normalized deviations
![]() |
(9) |
The value of S
,rel serves for the comparison of structural regularity for the different sequences (generally, also of different lengths); the higher the value of S
,rel, the higher the structural regularity of a sequence, while r
serves for the assessment of the statistical significance of observed deviations. Assuming a Gaussian distribution for r in the case of random deviations, the probability of finding values of r exceeding some threshold r0 is given by
![]() |
(10) |
The value Pr = 0.05 corresponds to that of r0 = 1.64.
The level of structural divergence may be quantitatively estimated with deviations
![]() |
(11) |
where the cross correlation coefficients are determined as
![]() |
(12) |
![]() |
(13) |
The structure factor harmonics F
(qn), the mean spectral values 

, and number N are defined in Equation 2, Equation 4, and Equation 5, while the superscripts 1 and 2 refer to a pair of compared sequences. The definition (12) assumes equal lengths for the compared sequences 1 and 2 corresponding to the patterns of the same gene in two different strains of D. melanogaster. In the presence of insertions/deletions and unequal lengths in the compared sequences, the shorter sequence is supplemented by void sites up to the length of the longer one (![]()

correspond to the higher structural divergence between compared sequences.
| RESULTS |
|---|
Nucleotide polymorphism and recombination:
Fig 1 shows a total of 92 polymorphic sites in a sample of 28 sequences of the
Est-6 putative pseudogene: 62 sites in exon I (1 site involves a 3-bp deletion), 2 sites in the intron, 12 sites in exon II, and 16 sites in the 3'-flanking region. Two indel polymorphisms occur. A 3-bp deletion occurs in the F-775F strain and a 9-bp insertion occurs in the S-510S, S-501S, and S-5F strains (Fig 1). For the
Est-6 coding region we detected 41 replacements (1 site involves a 9-bp insertion) and 33 synonymous polymorphic sites. We previously found 13 replacements and 23 synonymous polymorphic sites in the Est-6 coding region (![]()
Est-6 in the S-438S D. melanogaster strain (data not shown). ![]()
Est-6.
Table 1 shows estimates of nucleotide diversity for the putative pseudogene as well as for the Est-6 gene and the flanking regions. The
value for the full sequence is 0.0084, which is within the range of values observed in other high-recombination gene regions in D. melanogaster (![]()
value is very similar in the 5'-flanking (0.0059) and Est-6 regions (0.0057), but significantly higher in the intergenic region (0.0141) and putative pseudogene (0.0122), and intermediate in the 3'-flanking region (0.0083). The level of synonymous variation is 0.0152 in the Est-6 coding region but 0.0268 (1.76 times higher) in the putative pseudogene. The difference is more pronounced for nonsynonymous variation, which is 0.0026 in the Est-6 gene and 0.0078 (3.0 times higher) in the putative pseudogene. This could indicate different degrees of selective constraint in the Est-6 gene and the putative pseudogene. The level of silent polymorphism in the 3'-flanking region is 0.0083, but 0.0268 (3.2 times higher) in the putative pseudogene. These differences could again indicate differences in selective constraints. The level of silent divergence between D. melanogaster and D. simulans is similar for the Est-6 gene (0.1469) and the putative pseudogene (0.1393), but lower in the 5'-flanking (0.0807) and 3'-flanking (0.0417) regions.
|
The method of ![]()
![]()
![]()
![]()
|
Thus, there is two times more total nucleotide variability in the putative pseudogene (Table 1) but the recombination rate is at least two times higher in the Est-6 gene (Table 2). The association in
Est-6 of a high level of nucleotide variation with low recombination is contrary to the well-documented positive relationship between within-species DNA variation and recombination rates (e.g., ![]()
Haplotype structure:
Previously, ![]()
![]()
![]()
![]()
= 0.0026 ± 0.0006) than the six F haplotypes (
= 0.0096 ± 0.0042). The permutation test of ![]()
|
Gene conversion:
The method of ![]()
![]()
Est-6 (13 regions in all 28 sequences, P = 0.0000). The numbers of significant fragments are 14 for the Est-6 gene (fragment length from 314 to 1183 bp, average 662 bp) and 85 for
Est-6 (fragment length from 154 to 1052 bp, average 669 bp). Gene conversion events between Est-6 and
Est-6 are detected only in the protein alignment (involving a single region between amino acids 41 and 55, P = 0.0102). The number of significant fragment pairs showing intergenic conversion, which involve 23 Est-6 and 6
Est-6 sequences, is 138. Taken together, these results show that gene conversion has played an important role in the evolution of the ß-esterase gene cluster.
Sliding-window analysis:
Fig 3 shows the distribution of polymorphism along the Est-6 (thin line) and
Est-6 (thick line) sequences. There is a distinct peak in the Est-6 sequences at 750950, which includes the F/S replacement site (position 772). We detected this peak previously (![]()
![]()
![]()
![]()
![]()
![]()
Est-6 sequences at 50200, 400600, 8501050, and 13001650. The putative pseudogene peaks are more acute than the Est-6 gene peaks, have a regular distribution along the sequence (with an interval of 200300 bp), and are not centered around the replacement polymorphisms (Fig 1).
|
We measure heterogeneity in the distribution of polymorphic sites along the
Est-6 sequence and discordance between
(within-melanogaster polymorphism) and K (melanogaster-simulans divergence) by means of GOSS and LEWONTIN's (1996) and MCDONALD's (1996, 1998) statistics and assess their significance by Monte Carlo simulations of the coalescent model incorporating recombination (![]()
![]()
![]()
Linkage disequilibrium:
We have calculated the P value of Fisher's exact test in all pairwise comparisons of informative polymorphic sites. The numbers (and percentages) of pairwise comparisons that are significant are, for the whole region, 4235 out of 7626 (55.53%, 2.62% with the Bonferroni correction); for the Est-6 gene, 151 out of 300 (50.33%, 18.33% with the correction); for the putative pseudogene, 1486 out of 1830 (81.20%, 14.81% with the correction); for the 3'-flanking region, 66 out of 78 (84.62%, 57.69% with the correction); and between Est-6 and
Est-6, 927 of 1525 (60.79%, but none with the Bonferroni correction). The significant interlocus linkage disequilibria are caused by six divergent haplotypes, F-517S, F-517F, F-1461S, F-274F, F-357F, and F-775F, which have unique polymorphisms in both Est-6 and
Est-6.
Tests of neutrality:
In a previous study (![]()
Est-6, Kelly's ZnS and Wall's B and Q values are even higher (ZnS = 0.422; B = 0.432; Q = 0.520) than those for the Est-6 gene and significant by coalescent simulations with the population recombination rate ≥0.005 (ZnS statistic) or without recombination (B and Q statistics).
Entropy analysis:
We use this new type of analysis when seeking to ascertain the functionality of
Est-6. We have calculated the relevant characteristics for the exon-intron-exon sequences of Est-6 and
Est-6 before splicing and for exon-exon sequences after splicing. The examples of spectra for structure factor harmonics (see MATERIALS AND METHODS, Equation 5) are illustrated in Fig 4 and Fig 5, where the period p is related to the ordinal number of structure factor harmonic n as p = M/n. The high peaks at p = 3 (n = 561 for Est-6 and n = 563 for
Est-6) are a distinctive feature of protein-coding regions (![]()
![]()
Est-6, and -1.736 (r = 1.20) for spliced
Est-6. The values obtained for
Est-6 are not significantly different from random sequence, while the entropy of the Est-6 gene is significantly higher than expected for random sequence (Table 3). These results demonstrate higher structural ordering in Est-6 than in
Est-6, in accordance with expectations if
Est-6 is indeed a pseudogene. (Note the generally higher values of r, indicating higher structural ordering in the spliced genes, with the exception of a few cases due to structural coupling between exons and intron.)
|
|
|
The mean values of deviations 
averaged over the set of 378 pairwise cross correlation coefficients corresponding to each gene are summarized in Table 4, while the examples of their distributions are presented in Fig 6 and Fig 7. The values 

in the last column in Table 4 are obtained by additional averaging of 
over four types of nucleotides. The insertions/deletions are present only in
Est-6 and produce the main contribution to deviations 
. Comparison of structural divergence in
Est-6 was also performed upon equalizing the lengths of the sequences by removing insertions/deletions. As seen in Table 4, even after equalizing, the structural divergence remains distinctly higher in
Est-6 than in Est-6, as expected if
Est-6 is not a functional gene. Besides that, it is worth noting the correlation between the deviations 
and the heights of peaks for f
at p = 3 (see Equation 5 as well as Fig 4 and Fig 5): the higher the peaks at p = 3 the smaller 
and the narrower their distributions (see Fig 6 and Fig 7 as well as Table 4).
|
|
|
| DISCUSSION |
|---|
Pseudogenes in Drosophila and other organisms:
Relative to what is known in other organisms, especially vertebrates (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
The unusual patterns of pseudogene evolution suggesting some functional constraints have also been revealed in other organisms. Moreover, it was shown that a small number of detrimental alterations is a common feature of pseudogenes; there are many examples of extremely conserved pseudogene sequences exhibiting 90% and higher homology with functional counterparts (for a review, see ![]()
![]()
![]()
![]()
![]()
![]()
By definition, pseudogenes should be transcriptionally and translationally silent; however, nonfunctional (or functional only in some cases) transcripts of many pseudogenes have been described (e.g., ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Est-6 putative pseudogene, for which stop codons were detected only in some lines of D. melanogaster.
Pattern of recombination:
Disruption of homologous sequences by insertion or nucleotide polymorphisms can significantly reduce recombination frequencies. Even individual nucleotide substitutions have been shown to affect recombination (e.g., ![]()
![]()
![]()
Est-6 putative pseudogene has been detected in one D. melanogaster strain studied here (S-438S) and also in strain 12I-11.2 analyzed by ![]()
Recombination is not a random process; recombination hotspots caused by specific initiating sequences are reported for many gene systems. The unprecedented evolutionary stability of simple repeats promoting recombination in the expressed mammalian MHC-DRB genes was detected in some specific genome locations (review in ![]()
![]()
![]()
![]()
![]()
![]()
In several yeast and maize genes, the sequence signals initiating recombination often occur within the promoter but not within the gene itself (![]()
![]()
![]()
Est-6 putative pseudogene is limited to 193 bp consisting of the intergenic region between Est-6 and
Est-6. It might be that the obvious reduction in recombination could be connected with the promoter truncation of
Est-6. The sequences promoting recombination could also be eroded within
Est-6 due to stochastic accumulation of mutations as in the case of HLA-DRB (![]()
![]()
![]()
![]()
Pseudogene function:
A possible role for pseudogenes in development as a source of the intracellular inhibitors was suggested by ![]()
![]()
![]()
![]()
![]()
![]()
Est-6 transcription unit contain elements that modulate the expression of Est-6.
A functional role has been proposed and, in some cases clearly brought out, for pseudogenes in the diversity of vertebrate immune response (e.g., ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Conclusions:
We have detected some contrasting characteristics of nucleotide variation in the Est-6 gene and the
Est-6 putative pseudogene. The level of the total nucleotide variation is 2.1 times higher in
Est-6 than in Est-6. The population recombination rate is at least 2.6 times lower in
Est-6 than in Est-6. As a consequence, linkage disequilibrium is more pronounced in
Est-6 than in Est-6. The haplotype structure of
Est-6 is dimorphic. However, the divergent sequences of
Est-6 are not perfectly associated with Est-6 allozyme variation. Some of the detected features of
Est-6 indicate that it could be a pseudogene: 11 premature stop codons out of 28 strains are hardly compatible with functionality of the encoded protein. The level of nonsynonymous variation is 3.0 times higher in
Est-6 than in Est-6. The results of the structural entropy analysis reveal a lower structural regularity and a higher structural divergence for
Est-6, in accordance with the expectations provided it is a pseudogene or nonfunctional gene. On the other hand, it has been shown that the gene is expressed (![]()
Est-6 produce a catalytically active esterase (![]()
Est-6 is not known (but could exist and may be discovered in the future), we suggest that the term potogene be used for
Est-6, following the terminology of ![]()
Est-6, including its pattern of population variation, suggest that it may already have some functional role (for instance, as a reservoir of sequences, which can recombine with the expressed Est-6 gene), but its designation as a potogene would imply that the function is not known and far from confirmed, although the potentiality exists.
| ACKNOWLEDGMENTS |
|---|
We are grateful to S. A. Sawyer, G. McVean, D. A. Filatov, J. K. Kelly, J. H. McDonald, J. D. Wall, J. M. Comeron, F. Depaulis, and J. Rozas for useful advice on analyses and for providing computer programs. We thank Elena Balakireva, Andrei Tatarenkov, Victor DeFilippis, Martina Zurovkova, and Carlos Márquez for encouragement and help; and W. M. Fitch, B. Gaut, R. R. Hudson, A. Long, and two anonymous reviewers for detailed and valuable comments. This work is supported by National Insititutes of Health grant GM42397 to F. J. Ayala.
Manuscript received December 16, 2002; Accepted for publication February 21, 2003.
| LITERATURE CITED |
|---|
BALAKIREV, E. S. and F. J. AYALA, 1996 Is esterase-P encoded by a cryptic pseudogene in Drosophila melanogaster? Genetics 144:1511-1518.[Abstract]
BALAKIREV, E. S., E. I. BALAKIREV, F. RODRIGUEZ-TRELLES, and F. J. AYALA, 1999 Molecular evolution of two linked genes, Est-6 and Sod, in Drosophila melanogaster.. Genetics 153:1357-1369.
BALAKIREV, E. S., E. I. BALAKIREV, and F. J. AYALA, 2002 Molecular evolution of the Est-6 gene in Drosophila melanogaster: contrasting patterns of DNA variability in adjacent functional regions. Gene 288:167-177.[Medline]
BARD, J. A., S. P. NAWOSCHIK, B. F. O'DOWD, S. R. GEORGE, and T. A. BRANCHEK et al., 1995 The human serotonin 5-hydroxytryptamine1D receptor pseudogene is transcribed. Gene 153:295-296.[Medline]
BEGUN, D., 1997 Origin and evolution of a new gene descended from alcohol dehydrogenase in Drosophila. Genetics 145:375-382.[Abstract]
BEGUN, D. J. and C. F. AQUADRO, 1992 Levels of naturally occurring DNA polymorphism correlate with recombination rates in Drosophila melanogaster.. Nature 356:519-520.[Medline]
BLISKOVSKII, V. V., A. V. KIRILLOV, K. S. SPIRIN, V. M. ZAKHAREV, and I. M. CHUMAKOV, 1993 Son pseudogene does not contain five repeated elements of the area of perfect tandem repeats present in the homologous son gene sequence. Mol. Biol. 27:61-68.
BRAYTON, K. A., D. P. KNOWLES, T. C. MCGUIRE, and G. H. PALMER, 2001 Efficient use of a small genome to generate antigenic diversity in tick-borne ehrlichial pathogens. Proc. Natl. Acad. Sci. USA 98:4130-4135.
BRISTOW, J., S. E. GITELMAN, M. K. TEE, B. STAELS, and W. L. MILLER, 1993 Abundant adrenal-specific transcription of the human P450c21A "pseudogene". J. Biol. Chem. 268:12919-12924.
BROSIUS, J. and S. J. GOULD, 1992 On "genomenclature": a comprehensive (and respectful) taxonomy for pseudogenes and other "junk" DNA. Proc. Natl. Acad. Sci. USA 89:10706-10710.
BUETTNER, J. A., G. GLUSMAN, N. BEN-ARIE, P. RAMOS, and D. LANCET et al., 1998 Organization and evolution of olfactory receptor genes on human chromosome 11. Genomics 53:56-68.[Medline]
CHECHETKIN, V. R. and V. V. LOBZIN, 1996 Levels of ordering in coding and non-coding regions of DNA sequences. Phys. Lett. A 222:354-360.
CHECHETKIN, V. R. and V. V. LOBZIN, 1998 Study of correlations in segmented DNA sequences: application to structural coupling between exons and introns. J. Theor. Biol. 190:69-83.[Medline]
CHECHETKIN, V. R. and A. Y. TURYGIN, 1994 On the spectral criteria of disorder in non-periodic sequences: application to inflation models, symbolic dynamics and DNA sequences. J. Phys. A Math. Gen. 27:4875-4898.
CHECHETKIN, V. R. and A. Y. TURYGIN, 1996 Study of correlations in DNA sequences. J. Theor. Biol. 178:205-217.[Medline]
CHECHETKIN, V. R., L. A. KNIZHNIKOVA, and A. Y. TURYGIN, 1994 Three-quasiperiodicity, mutual correlations, ordering and long-range modulations in genomic nucleotide sequences for viruses. J. Biomol. Struct. Dyn. 12:271-299.[Medline]
COLLET, C., K. M. NIELSEN, R. J. RUSSELL, M. KARL, and J. G. OAKESHOTT et al., 1990 Molecular analysis of duplicated esterase genes in Drosophila melanogaster.. Mol. Biol. Evol. 7:9-28.[Abstract]
COOKE, P. H. and J. G. OAKESHOTT, 1989 Amino acid polymorphisms for esterase-6 in Drosophila melanogaster.. Proc. Natl. Acad. Sci. USA 86:1426-1430.
CRISTIANO, R. J., S. J. GIORDANO, and A. W. STEGGLES, 1993 The isolation and characterization of the bovine cytochrome b5 gene, and a transcribed pseudogene. Genomics 17:348-354.[Medline]
CURRIE, P. D. and D. T. SULLIVAN, 1994 Structure, expression and duplication of genes which encode phosphoglyceromutase of Drosophila melanogaster.. Genetics 138:352-363.
DUMANCIC, M. M., J. G. OAKESHOTT, R. J. RUSSELL, and M. J. HEALY, 1997 Characterization of the EstP protein in Drosophila melanogaster and its conservation in Drosophilids. Biochem. Genet. 35:251-271.[Medline]
FAN, Q., F. XU, and T. PETES, 1995 Meiosis-specific double-strand breaks at the HIS4 recombination hot spot in the yeast Saccharomyces cerevisiae control in cis and trans.. Mol. Cell. Biol. 15:1679-1688.[Abstract]
FICKETT, J. W. and C.-S. TUNG, 1992 Assessment of protein coding measures. Nucleic Acids Res. 20:6441-6450.
FILATOV, D. A. and D. CHARLESWORTH, 1999 DNA polymorphism, haplotype structure and balancing selection in the Leavenworthia PgiC locus. Genetics 153:1423-1434.
FISHER, J. A. and T. MANIATIS, 1985 Structure and transcription of the Drosophila mulleri alcohol dehydrogenase genes. Nucleic Acids Res. 13:6899-6917.
FOTAKI, M. E. and K. IATROU, 1988 Identification of a transcriptionally active pseudogene in the chorion locus of the silkmoth Bombyx mori. Regional sequence conservation and biological function. J. Mol. Biol. 203:849-860.[Medline]
FÜRBAB, R. and J. VANSELOW, 1995 An aromatase pseudogene is transcribed in the bovine placenta. Gene 154:287-291.[Medline]
GAME, A. Y. and J. G. OAKESHOTT, 1990 Associations between restriction site polymorphism and enzyme activity variation for esterase 6 in Drosophila melanogaster.. Genetics 126:1021-1031.[Abstract]
GIMELBRANT, A. A. and T. S. MCCLINTOCK, 1997 A nuclear matrix attachment region is highly homologous to a conserved domain of olfactory receptors. J. Mol. Neurosci. 9:61-63.[Medline]
GLUSMAN, G., A. SOSINSKY, E. BEN-ASHER, N. AVIDAN, and D. SONKIN et al., 2000 Sequence, structure, and evolution of a complete human olfactory receptor gene cluster. Genomics 63:227-245.[Medline]
GOSS, P. J. E. and R. C. LEWONTIN, 1996 Detecting heterogeneity of substitution along DNA and protein sequences. Genetics 143:589-602.[Abstract]
GROMKO, M. H., D. F. GILBERT and R. C. RICHMOND, 1984 Sperm transfer and use in the multiple mating system of Drosophila, pp. 371426 in Sperm Competition and the Evolution of Animal Mating Systems, edited by R. L. SMITH. Academic Press, New York.
HASSON, E. and W. F. EANES, 1996 Contrasting histories of three gene regions associated with In(3L)Payne of Drosophila melanogaster.. Genetics 144:1565-1575.[Abstract]
HEALY, M. J., M. M. DUMANCIC, and J. G. OAKESHOTT, 1991 Biochemical and physiological studies of soluble esterases from Drosophila melanogaster.. Biochem. Genet. 29:365-388.[Medline]
HEALY, M. J., M. M. DUMANCIC, A. CAO, and J. G. OAKESHOTT, 1996 Localization of sequences regulating ancestral and acquired sites of esterase 6 activity in Drosophila melanogaster.. Mol. Biol. Evol. 13:784-797.[Abstract]
HEY, J. and J. WAKELEY, 1997 A coalescent estimator of the population recombination rate. Genetics 145:833-846.[Abstract]
HOLSTE, D., O. WEISS, I. GROSSE, and H. HERZEL, 2000 Are noncoding sequences of Rickettsia prowazekii remnants of "neutralized" genes? J. Mol. Evol. 51:353-362.[Medline]
HUDSON, R. R., 1987 Estimating the recombination parameter of a finite population model without selection. Genet. Res. 50:245-250.[Medline]
HUDSON, R. R., 1990 Gene genealogies and the coalescent process. Oxf. Surv. Biol. 7:1-44.
HUDSON, R. R. and N. KAPLAN, 1985 Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics 111:147-164.
HUDSON, R. R. and N. KAPLAN, 1988 The coalescent process in models with selection and recombination. Genetics 120:831-840.
HUDSON, R. R., D. BOOS, and N. L. KAPLAN, 1992 A statistical test for detecting geographic subdivision. Mol. Biol. Evol. 9:138-151.[Abstract]
INOUYE, M., 1988 Antisense RNA: its functions and applications in gene regulationa review. Gene 72:25-34.[Medline]

denotes a 3-bp deletion of CAG (position 232234, strain F-775F);
denotes the absence of a deletion;
denotes an insertion of ACATTTGAT (position 13791387, strains S-501S, S-510S, and S-5F);
denotes the absence of an insertion.

















