- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Dorer, D. R.
- Articles by Christensen, A. C.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Dorer, D. R.
- Articles by Christensen, A. C.
A Family of Genes Clustered at the Triplo-lethal Locus of Drosophila melanogaster Has an Unusual Evolutionary History and Significant Synteny With Anopheles gambiae
Douglas R. Dorera, Jamie A. Rudnick1,b, Etsuko N. Moriyamab,c, and Alan C. Christensenba Department of Microbiology, Meharry Medical College, Nashville, Tennessee 37208
b School of Biological Sciences, University of Nebraska, Lincoln, Nebraska 68588
c Plant Science Initiative, University of Nebraska, Lincoln, Nebraska 68588
Corresponding author: Alan C. Christensen, 348 Manter Hall, University of Nebraska, Lincoln, NE 68588-0118., achristensen2{at}unl.edu (E-mail)
Communicating editor: S. HENIKOFF
| ABSTRACT |
|---|
Within the unique Triplo-lethal region (Tpl) of the Drosophila melanogaster genome we have found a cluster of 20 genes encoding a novel family of proteins. This family is also present in the Anopheles gambiae genome and displays remarkable synteny and sequence conservation with the Drosophila cluster. The family is also present in the sequenced genome of D. pseudoobscura, and homologs have been found in Aedes aegypti mosquitoes and in four other insect orders, but it is not present in the sequenced genome of any noninsect species. Phylogenetic analysis suggests that the cluster evolved prior to the divergence of Drosophila and Anopheles (250 MYA) and has been highly conserved since. The ratio of synonymous to nonsynonymous substitutions and the high codon bias suggest that there has been selection on this family both for expression level and function. We hypothesize that this gene family is Tpl, name it the Osiris family, and consider possible functions. We also predict that this family of proteins, due to the unique dosage sensitivity and the lack of homologs in noninsect species, would be a good target for genetic engineering or novel insecticides.
WHEN the Drosophila genome was surveyed for dosage-sensitive regions, only one was found that was both triplo-lethal and haplo-lethal (![]()
![]()
![]()
![]()
![]()
![]()
Denell proposed three hypotheses to explain the lack of point mutations at Tpl: (1) the locus is very small so the mutation rate is very low, (2) the locus does not encode a protein and therefore is less sensitive to single base changes, or (3) the locus consists of a gene cluster with at least partial redundancy, such that mutation of one of the genes does not rescue the lethality of a duplication of the entire cluster. The small size hypothesis predicts that as the number of mutagenized chromosomes increases, the chance that a mutant will be found also increases. However, we have subsequently screened >106 chromosomes and still have not isolated point mutations (![]()
![]()
Using the complete genomic sequence of Drosophila melanogaster we have tested the prediction that the Triplo-lethal region contains a cluster of genes with high similarity. To do that, first we defined the molecular limits of Tpl by isolating and mapping duplications and deletions and then examining the sequence within those limits for repeated genetic units. We describe here the discovery of a multigene family in the Triplo-lethal region, consistent with the best hypothesis based on the genetic data. Although the proteins encoded by this family are novel, the sequences have features that allow us to make predictions about their function. We predict that a family of genes whose dosage is so critical will be well conserved and show evidence of strong selection on expression levels. Comparison of the D. melanogaster gene family with the orthologous genes in Anopheles gambiae allows us to analyze the expression, selection, and evolution of the family.
| MATERIALS AND METHODS |
|---|
Drosophila rearrangements and mapping:
Drosophila stocks were previously described (![]()
![]()
2-3 source of transposase and a single P element inserted near Tpl to YSX.YL, In(1)EN y;;Dp(3;3)Tpl pp/Df(3R)Tpl10 pp. Survivors were backcrossed to the Dp/Df line to establish a stock carrying the new rearrangement, often flanked by the starting P element. Three single P-element insertions were used: P{ry+t7.2=PZ}l(3)0108601086, inserted in RM62; P{hsneo} l(3)neo331, inserted in castor (![]()
![]()
![]()
![]()
Bioinformatics:
Sequence similarity searches were done using the BLAST server at http://www.ncbi.nlm.nih.gov (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Phylogenetic relationships were reconstructed with the neighbor-joining (NJ) method (![]()
![]()
Codon usage bias was measured as the "effective number of codons" (ENC) as developed by ![]()
![]()
| RESULTS |
|---|
Molecular mapping of Tpl:
Because we and others have been unable to isolate point or transposon insertion mutations in Tpl, in spite of a very powerful selection, we generated duplications and deletions flanking single P-element insertions (![]()
![]()
334 kb. The smallest deficiency, Df(3R)Tpl6F, also has its distal breakpoint in Pak, confirming that Tpl is located between Rm62 and Pak.
A novel family of proteins is encoded within the Triplo-lethal region:
Examination of this region in the Drosophila genome (![]()
![]()
![]()
![]()
![]()
![]()
|
Four protein sequences from the cluster at 83E that appear to be members of the Osiris family cannot be aligned as well and were not shown in Fig 1, but are aligned in Supplemental Data. These four proteins are Osiris 10, Osiris 13, Osiris 14, and Osiris 17. Each includes the transmembrane domain and the conserved tyrosine motif, however Osiris 10 appears to be internally repeated, and Osiris 13 and Osiris 14 are diverged at the ends, perhaps because of errors in predicting the exons. Osiris 17 includes copies of the cysteine motif at both the amino and carboxyl termini, and is predicted to be mitochondrial. Despite these differences, the four are recognizable as members of the family, and annotation errors may account for the differences.
Through BLAST and PSI-BLAST queries of the NCBI database, we have identified three other members of this family, encoded at three different sites elsewhere in the genome. Osiris 21 (CG14925, polytene region 32E) and Osiris 23 (CG15538, 99F) appear to be typical members of the family (Fig 1). Osiris 22 (CG8644, 87E) lacks the N-terminal cysteines, although it is otherwise very similar to the others (supplemental data at http://www.genetics.org/supplemental/). None of these loci are triplo-lethal or located within haplo-insufficient regions (![]()
The Osiris gene family is highly conserved between Drosophila and Anopheles:
We compared the genes in this region to the A. gambiae genomic sequence (![]()
|
To understand the close relationships between orthologs, we examined the codon usage biases of the genes and the base substitution patterns between orthologous pairs. Codon usage bias was measured as the effective number of codons. It can range from 20 (where only one codon is used for each of the 20 amino acids and thus codon usage is most biased) to 61 (where all possible codons are used and there is no bias). The average ENC for >12,000 genes from D. melanogaster is 49, with a range from 28 to 61 (![]()
![]()
![]()
![]()
A significant negative correlation is observed between the numbers of synonymous substitutions per site and the codon usage bias of orthologous pairs (R = 0.76, P = 0.002). This again implies that these genes are under translational selection. Remarkably, the orthologous pairs are similar enough to estimate the synonymous substitution rates for 15 of them. The average ratio between the synonymous and nonsynonymous substitutions per site is 0.37 and the ratios range from 0.25 to 0.58. These ratios are <1.0 (Fig 3), indicating that these genes are under selection. Not surprisingly, these ratios are higher than those obtained within the Drosophila lineage. ![]()
90% of such ratios obtained from comparisons among three Drosophila species are under 0.2. Interestingly, nonsynonymous substitution rates are correlated with synonymous substitution rates (Fig 3; Spearman Rho = 0.6, P = 0.02). Such correlations have been described in Drosophila (![]()
![]()
![]()
|
The Osiris gene cluster displays significant synteny:
The map locations of most of the A. gambiae orthologs are known (![]()
![]()
|
Osiris family members in other species:
Homologs to the Osiris genes have also been sequenced in the dipteran insects D. pseudoobscura and Aedes aegypti. Not surprisingly, all 23 family members are found in the D. pseudo-obscura genome, completely syntenic with the D. melanogaster genes (http://hgsc.bcm.tmc.edu/drosophila). No other insect genomes have been reported to be completely sequenced as of this writing. However, a partial cDNA from A. aegypti (GenBank accession no.
BQ789636) encodes a homolog of Osiris 11. The cDNA sequence contains the putative transmembrane domain, and the high degree of conservation of this region is shown in Fig 5A.
|
We have also observed Osiris genes in four other insect orders. At least 17 different homologs to the Drosophila genes are found by BLAST searches of the first assembly of the honeybee (Apis mellifera) genome (http://titan.biotec.uiuc.edu/bee/honeybee_project.htm). A honeybee expressed sequence tag (EST; BI512836, BI515793, and BI509951) encodes a probable ortholog to Osiris 14 that contains the signal peptide, the paired cysteines, and the transmembrane domain (Fig 5B). cDNAs of other Osiris genes have been recovered from the lepidopterans Bombyx mori (BP121280, BP117216, BP119727, BP119640, BP119167, and others), Helicoverpa ameriga (BU038419), and Manduca sexta (BF046752); the coleopteran Cicindela campestris (BQ475392); and the hemipteran Toxoptera citricida (CB855036). In contrast to the conservation demonstrated in these insect species, no homologs have been sequenced in any other phyla to date. This suggests that the function of the Osiris proteins may be insect or Arthropod specific.
Expression:
The lethal phase of Tpl aneuploids is late embryonic or early larval, with the tracheae and the gut the first tissues to be affected (![]()
Expression of Osiris genes in later stages has also been observed. Osiris 7 shows two peaks of expression during metamorphosis in addition to one late in embryogenesis (![]()
![]()
![]()
| DISCUSSION |
|---|
The Triplo-lethal locus has been mysterious since its discovery in 1972, primarily because point mutations and transposon insertional mutants have not been isolated (![]()
![]()
![]()
![]()
334 kb and examining the genomic sequence of that region. We found 20 genes that are clustered and closely related and encode a novel family of transmembrane proteins. With 23 total members, this is one of the largest gene families in Drosophila. Of the 1437 sequence similarity groups reported at http://www.fruitfly.org/annot/similarity.html, only 26 groups are larger.
Homologous genes have been found only in insects, and the function of this family is unknown in A. gambiae or any other insect species. We predict that the orthologous families in Anopheles and other insects will be dosage sensitive. Reciprocal crosses between D. melanogaster carrying duplications and deficiencies of Tpl with D. simulans, D. mauritiana, and D. seychellia have shown that Tpl is both triplo- and haplo-lethal to the interspecific hybrids of both sexes (our unpublished data). These genes are also located in one of the longest regions of microsynteny between D. melanogaster and A. gambiae (![]()
The genetic data for Tpl (![]()
![]()
![]()
Without knowing the precise function of these proteins it is hard to know why this genetic region is so dosage sensitive. However, the relative concentrations of membrane proteins can affect rates of association and assembly of complexes (![]()
![]()
![]()
![]()
![]()
| FOOTNOTES |
|---|
1 Present address: Department of Forestry and Natural Resources, Purdue University, West Lafayette, IN 47907-2033. ![]()
| ACKNOWLEDGMENTS |
|---|
We thank Deidre Potter for excellent advice; Larry Harshman, Han Asard, and Bob Weldon for helpful conversations; Laura Smoyer, John Engelman, and Marilyn Cadden for technical assistance; and Joel Eissenberg for critical comments on the manuscript. This work was supported in part by grants to A.C.C from the National Science Foundation and to D.R.D. from the National Institutes of Health (S06 GM08037-31).
Manuscript received March 26, 2003; Accepted for publication May 16, 2003.
| LITERATURE CITED |
|---|
ADAMS, M. D., S. E. CELNIKER, R. A. HOLT, C. A. EVANS, and J. D. GOCAYNE et al., 2000 The genome sequence of Drosophila melanogaster.. Science 287:2185-2195.
AKASHI, H., 1994 Synonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy. Genetics 136:927-935.[Abstract]
ALTSCHUL, S. F., W. GISH, W. MILLER, E. W. MYERS, and D. J. LIPMAN, 1990 Basic local alignment search tool. J. Mol. Biol. 215:403-410.[Medline]
ALTSCHUL, S. F., T. L. MADDEN, A. A. SCHAFFER, J. ZHANG, and Z. ZHANG et al., 1997 Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25:3389-3402.
ARBEITMAN, M. N., E. E. M. FURLONG, F. IMAM, E. JOHNSON, and B. H. NULL et al., 2002 Gene expression during the life cycle of Drosophila melanogaster. Science 297:2270-2275.
BERGMAN, C. M., B. D. PFEIFFER, D. E. RINCON-LIMAS, R. A. HOSKINS, A. GNIRKE et al., 2002 Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome. Genome Biol. 3: 0086.10086.20.
BONIFACINO, J. S. and E. C. DELL'ANGELICA, 1999 Molecular bases for the recognition of tyrosine-based sorting signals. J. Cell Biol. 145:923-926.
BRAVO, A., B. ILLANA, and M. SALAS, 2000 Compartmentalization of phage {phi}29 DNA replication: interaction between the primer terminal protein and the membrane-associated protein p1. EMBO J. 19:5575-5584.[Medline]
COOLEY, L., R. KELLEY, and A. SPRADLING, 1988 Insertional mutagenesis of the Drosophila genome with single P elements. Science 239:1121-1128.
DENELL, R. E., 1976 The genetic analysis of a uniquely dose-sensitive chromosomal region of Drosophila melanogaster.. Genetics 84:193-210.
DORER, D. R. and A. C. CHRISTENSEN, 1990 The unusual spectrum of mutations induced by hybrid dysgenesis at the Triplo-lethal locus of Drosophila melanogaster.. Genetics 125:795-801.[Abstract]
DORER, D. R., A. C. CHRISTENSEN, and D. H. JOHNSON, 1990 A novel RNA helicase gene tightly linked to the Triplo-lethal locus of Drosophila melanogaster.. Nucleic Acids Res. 18:5489-5494.
DORER, D. R., M. A. CADDEN, B. GORDESKY-GOLD, G. HARRIES, and A. C. CHRISTENSEN, 1993 Suppression of a lethal trisomic phenotype in Drosophila melanogaster by increased dosage of an unlinked locus. Genetics 134:243-249.[Abstract]
DORER, D. R., D. H. EZEKIEL, and A. C. CHRISTENSEN, 1995 The Triplo-lethal locus of Drosophila: reexamination of mutants and discovery of a second-site suppressor. Genetics 141:1037-1042.[Abstract]
DURET, L., 2002 Evolution of synonymous codon usage in metazoans. Curr. Opin. Genet. Dev. 12:640-649.[Medline]
EISSENBERG, J. C., J. MA, M. A. GERBER, A. CHRISTENSEN, and J. A. KENNISON et al., 2002 dELL, an essential RNA polymerase II elongation factor with a general role in development. Proc. Natl. Acad. Sci. USA 99:9894-9899.
EMANUELSSON, O., H. NIELSEN, S. BRUNAK, and G. VON HEIJNE, 2000 Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J. Mol. Biol. 300:1005-1016.[Medline]
FELSENSTEIN, J., 2001 PHYLIP (Phylogeny Inference Package), version 3.6.a3. University of Washington, Seattle.
GODER, V. and M. SPIESS, 2001 Topogenesis of membrane proteins: determinants and dynamics. FEBS Lett. 504:87-93.[Medline]
HOFFMAN, S. and G. M. EDELMAN, 1983 Kinetics of homophilic binding by embryonic and adult forms of the neural cell adhesion molecule. Proc. Natl. Acad. Sci. USA 80:5762-5766.
HOLT, R. A., G. M. SUBRAMANIAN, A. HALPERN, G. G. SUTTON, and R. CHARLAB et al., 2002 The genome sequence of the malaria mosquito Anopheles gambiae. Science 298:129-149.
HURST, L. D., E. J. B. WILLIMAS, and C. PÁL, 2002 Natural selection promotes the conservation of linkage of co-expressed genes. Trends Genet. 18:604-606.[Medline]
JONES, D. T., W. R. TAYLOR, and J. M. THORNTON, 1992 The rapid generation of mutation data matrices from protein sequences. Comput. Appl. Biosci. 8:275-282.
KEENAN CURTIS, K. and P. M. KANE, 2002 Novel vacuolar H+-ATPase complexes resulting from overproduction of Vma5p and Vma13p. J. Biol. Chem. 277:2716-2724.
KEPPY, D. O. and R. E. DENELL, 1979 A mutational analysis of the triplo-lethal region of Drosophila melanogaster.. Genetics 91:421-441.
KREBS, K. C., K. L. BRZOZA, and Q. LAN, 2002 Use of subtracted libraries and macroarray to isolate developmentally specific genes from the mosquito, Aedes aegypti.. Insect Biochem. Mol. Biol. 32:1757-1767.[Medline]
KROGH, A., B. LARSSON, G. VON HEIJNE, and E. L. L. SONNHAMMER, 2001 Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J. Mol. Biol. 305:567-580.[Medline]
LI, W. H., 1993 Unbiased estimation of the rates of synonymous and nonsynonymous substitution. J. Mol. Evol. 36:96-99.[Medline]
LINDSLEY, D. L., L. SANDLER, B. S. BAKER, A. T. C. CARPENTER, and R. E. DENELL et al., 1972 Segmental aneuploidy and the genetic gross structure of the Drosophila genome. Genetics 71:157-184.
MAKALOWSKI, W. and M. S. BOGUSKI, 1998 Evolutionary parameters of the transcribed mammalian genome: an analysis of 2,820 orthologous rodent and human sequences. Proc. Natl. Acad. Sci. USA 95:9407-9412.
MORIYAMA, E. N., 2003 Encyclopedia of the Human Genome. Macmillan, London in press.
NIELSEN, H., J. ENGELBRECHT, S. BRUNAK, and G. VON HEIJNE, 1997 Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. Protein Eng. 10:1-6.
PRESTON, C. R., J. A. SVED, and W. R. ENGELS, 1996 Flanking duplications and deletions associated with P-induced male recombination in Drosophila. Genetics 144:1623-1638.[Abstract]
ROEHRDANZ, R. L. and J. C. LUCCHESI, 1980 Mutational events in the triplo- and haplo-lethal region (83DE) of the Drosophila melanogaster genome. Genetics 95:355-366.
ROYLE, S. J., L. K. BOBANOVIC, and R. D. MURRELL-LAGNADO, 2002 Identification of a non-canonical tyrosine-based endocytic motif in an ionotropic receptor. J. Biol. Chem. 277:35378-35385.
SAITOU, N. and M. NEI, 1987 The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4:406-425.[Abstract]
SMOYER, L. K., D. R. DORER, K. W. NICKERSON, and A. C. CHRISTENSEN, 2004 Phenotype of the Triplo-lethal locus of Drosophila melanogaster and its suppression by hyperoxia. Genet. Res. in press.
SPRADLING, A. C., D. STERN, A. BEATON, E. J. RHEM, and T. LAVERTY et al., 1999 The Berkeley Drosophila Genome Project gene disruption project: single P-element insertions mutating 25% of vital Drosophila genes. Genetics 153:135-177.
THOMPSON, J. D., D. G. HIGGINS, and T. J. GIBSON, 1994 CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.
URBAN, S. and M. FREEMAN, 2002 Intramembrane proteolysis controls diverse signalling pathways throughout evolution. Curr. Opin. Genet. Dev. 12:512-518.[Medline]
VINCENT, J.-P., J. A. KASSIS, and P. H. O'FARRELL, 1990 A synthetic homeodomain binding site acts as a cell type specific, promoter specific enhancer in Drosophila embryos. EMBO J. 9:2573-2578.[Medline]
WHITFIELD, C. W., M. R. BAND, M. F. BONALDO, C. G. KUMAR, and L. LIU et al., 2002 Annotated expressed sequence tags and cDNA microarrays for studies of brain and behavior in the honey bee. Genome Res. 12:555-566.
WRIGHT, F., 1990 The effective number of codons used in a gene. Gene 87:23-29.[Medline]
YUAN, J., A. AMEND, J. BORKOWSKI, R. DEMARCO, and W. BAILEY et al., 1999 MULTICLUSTAL: a systematic method for surveying Clustal W alignment parameters. Bioinformatics 15:862-863.
ZDOBNOV, E. M., C. VON MERING, I. LETUNIC, D. TORRENTS, and M. SUYAMA et al., 2002 Comparative genome and proteome analysis of Anopheles gambiae and Drosophila melanogaster. Science 298:149-159.
ZHANG, J. and M. NEI, 1996 Evolution of Antennapedia-class homeobox genes. Genetics 142:295-303.[Abstract]
This article has been cited by other articles:
![]() |
A. Bhutkar, S. W. Schaeffer, S. M. Russo, M. Xu, T. F. Smith, and W. M. Gelbart Chromosomal Rearrangement Inferred From Comparisons of 12 Drosophila Genomes Genetics, July 1, 2008; 179(3): 1657 - 1680. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Ren, C. Zhu, H. Lee, and P. N. Adler Gene Expression During Drosophila Wing Morphogenesis and Differentiation Genetics, October 1, 2005; 171(2): 625 - 638. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. N. Belyakin, G. K. Christophides, A. A. Alekseyenko, E. V. Kriventseva, E. S. Belyaeva, R. A. Nanayev, I. V. Makunin, Heidelberg Fly Array Consortium, F. C. Kafatos, and I. F. Zhimulev Genomic analysis of Drosophila chromosome underreplication reveals a link between replication control and transcriptional territories PNAS, June 7, 2005; 102(23): 8269 - 8274. [Abstract] [Full Text] [PDF] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Dorer, D. R.
- Articles by Christensen, A. C.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Dorer, D. R.
- Articles by Christensen, A. C.







