- THIS ARTICLE
-
Abstract
- Full Text (PDF)
-
All Versions of this Article:
genetics.106.060269v1
173/4/1885 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by He, X.
- Articles by Zhang, J.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by He, X.
- Articles by Zhang, J.
Originally published as Genetics Published Articles Ahead of Print on May 15, 2006.
Genetics, Vol. 173, 1885-1891, August 2006, Copyright © 2006
doi:10.1534/genetics.106.060269
Toward a Molecular Understanding of Pleiotropy
Xionglei He and Jianzhi Zhang1
Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109
1 Corresponding author: Department of Ecology and Evolutionary Biology, University of Michigan, 1075 Natural Science Bldg., 830 N. University Ave., Ann Arbor, MI 48109.
E-mail: jianzhi{at}umich.edu
Pleiotropy refers to the observation of a single gene influencing multiple phenotypic traits. Although pleiotropy is a common phenomenon with broad implications, its molecular basis is unclear. Using functional genomic data of the yeast Saccharomyces cerevisiae, here we show that, compared with genes of low pleiotropy, highly pleiotropic genes participate in more biological processes through distribution of the protein products in more cellular components and involvement in more proteinprotein interactions. However, the two groups of genes do not differ in the number of molecular functions or the number of protein domains per gene. Thus, pleiotropy is generally caused by a single molecular function involved in multiple biological processes. We also provide genomewide evidence that the evolutionary conservation of genes and gene sequences positively correlates with the level of gene pleiotropy.
PLEIOTROPY refers to the observation that a single gene affects two or more distinct and seemingly unrelated traits. Pleiotropy is one of the most commonly observed attributes of genes, with broad implications in genetics, evolution, development, aging, disease, and drug discovery (WILLIAMS 1957; WRIGHT 1968; BARTON 1990; HODGKIN 1998; WAXMAN and PECK 1998; BRUNNER and VAN DRIEL 2004; OTTO 2004; PROMISLOW 2004; VAN DE PEPPEL and HOLSTEGE 2005). Genes of high pleiotropy are expected to be under strong stabilizing selection because they affect multiple traits (HODGKIN 1998). Pleiotropy also causes compromises among adaptations of different traits, because a genetic change beneficial to one trait may be deleterious to another (BARTON 1990; OTTO 2004). It is believed that this property underlies many fundamental principles and phenomena in biology, including senescence, trade-off, and cooperation (WILLIAMS 1957; FOSTER et al. 2004; MACLEAN et al. 2004). For example, it was proposed that mutant genes advantageous to development and reproduction are deleterious after the reproductive age and cause senescence, which may explain why all species have a limited life span (WILLIAMS 1957). Social amoeba Dictyostelium discoideum can aggregate during starvation where some cells die to form a stalk that holds the other cells aloft as reproductive spores (STRASSMANN et al. 2000). A recent study showed that deleting the gene dimA in D. discoideum allows cells to avoid death, but leads to a great reduction in spore production (FOSTER et al. 2004). Hence, the pleiotropic effects of dimA stabilize the cooperation among amoeba. Pleiotropy also has important implications in human diseases. For instance, mutations in the homeobox gene ARX cause ambiguous genitalia and lissencephaly (whole or parts of the surface of the brain appear smooth) (OMIM at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=OMIM). Mutants of AMT, a phosphatidylinositol-3 kinase gene, show the symptoms of cerebellar ataxia, telangiectases (visibly dilated blood vessel on the skin or mucosal surface), immune defects, and a predisposition to malignancy (OMIM).
A central question about pleiotropy is whether the pleiotropic effects of a gene are conferred by multiple molecular functions of the gene or by multiple consequences of a single molecular function (DUDLEY et al. 2005; VAN DE PEPPEL and HOLSTEGE 2005). A typical example of a pleiotropic gene with multiple molecular functions is the mammalian serum albumin, which is well known for binding fatty acids and toxic metabolites, but it is also involved in the oxidation of nitric oxide (RAFIKOVA et al. 2002). By contrast, the yeast gene HIS7 encodes glutamine amidotransferase, yet this single catalytic activity is used in both histidine biosynthesis and purine nucleotide monophosphate biosynthesis. Although the molecular basis of pleiotropy may vary among genes, it is still important to ask whether one of the above two mechanisms explains the majority of pleiotropic genes in a genome. To address this question, we take advantage of a recently generated genomewide data set of gene pleiotropy from the yeast Saccharomyces cerevisiae (DUDLEY et al. 2005). We show that the pleiotropic effects of a gene are not usually conferred by multiple molecular functions of the gene, but by multiple consequences of a single molecular function.
The yeast Gene Ontology (GO) annotations were downloaded from ftp://genome-ftp.stanford.edu/pub/yeast/literature_curation. Genes with the "unknown" annotations (molecular function: 0005554; biological process: 0000004; cellular component: 0008372) were excluded and only nonredundant annotations were considered. Yeast Enzyme Commission (EC) codes were downloaded from ftp://ftpmips.gsf.de/yeast/catalogues/eccat/eccat_data_20062005. Only genes with at least one EC code were considered. The predicted functional domains of yeast proteins were obtained from Munich Information Center for Protein Sequences (MIPS) (ftp://ftpmips.gsf.de/yeast/catalogues/motifs/) and genes with no domain predictions were excluded. The number of genes directly regulated by each transcriptional factor was obtained from the ChIPchip (chromatin immunoprecipitation followed by the identification of immunoprecipitated genomic fragments through the use of whole-genome DNA chips) experiments (HARBISON et al. 2004). The yeast proteinprotein interaction (PPI) data were compiled by Han and colleagues, who used stringent criteria to avoid false-positive data points (HAN et al. 2004). Proteins with no known PPIs were excluded. The yeast stable protein complex data used here were downloaded from ftp://genome-ftp.stanford.edu/pub/yeast/literature_curation/go_protein_complex_slim.tab. We found that genes involved in stable protein complexes were more pleiotropic than genes not involved in protein complexes (P < 104, MannWhitney U-test).
For the evolutionary analysis, we conducted genomewide all-against-all BLASTP searches (E-value cutoff = 1010) between 5773 yeast and 13,434 fruit fly (Drosophila melanogaster) proteins, which were downloaded from the Saccharomyces Genome Database (http://www.yeastgenome.org/) and ENSEMBL (http://www.ensembl.org), respectively. Similar BLASTP searches were also conducted between the yeast proteins and 19,873 nematode (Caenorhabditis elegans) proteins and between the yeast proteins and 4999 fission yeast (Schizosaccharomyces pombe) proteins, respectively. The nematode protein sequences were obtained from ENSEMBL, whereas the fission yeast protein sequences were downloaded from http://www.sanger.ac.uk/Projects/S_pombe. We then considered the 4494 yeast genes with pleiotropy information. Nonsynonymous nucleotide distances (dN) between orthologous genes of the yeasts S. cerevisiae and Saccharomyces bayanus were obtained from ZHANG and HE (2005). We measured the rank correlation between dN and gene pleiotropy. Furthermore, we measured the partial rank correlation between dN and gene pleiotropy when the expression level of the gene was controlled for.
Molecular functions:
In the yeast gene pleiotropy data set that we use here (DUDLEY et al. 2005), the level of pleiotropy was measured for each yeast gene by the number of lab conditions (of the 21 conditions) under which the homozygous gene-deletion strain showed significantly slower growth than under the control condition. We first examine whether the level of gene pleiotropy is correlated with the number of molecular functions per gene. We use the molecular function annotation in GO (ASHBURNER et al. 2000), which describes activities, such as catalytic or binding activities, at the molecular level. A gene may possess one or more than one activity. For example, BMH1 shows both DNA-binding and protein-binding activities, thus having two molecular functions. Nevertheless, we find no correlation between the level of pleiotropy and the number of molecular functions among 2386 yeast genes for which both pleiotropy information and GO annotation are available (Spearman's rank correlation coefficient r = 0.01, two-tailed P = 0.57). We also group genes according to their level of pleiotropy, but find different groups to have similar mean numbers of molecular functions (Figure 1A). Next, we examine EC codes for all yeast enzymes. EC codes are a numerical classification scheme for enzymes based on the chemical reactions that they catalyze. Although the majority of enzymes have only one EC code, some have more than one code because they catalyze multiple different chemical reactions. For instance, THI6 is both a thiaminephosphate diphosphorylase (EC 2.5.1.3) and a hydroxyethylthiazole kinase (EC 2.7.1.50). However, no significant correlation is observed between the number of EC codes and the level of pleiotropy among 917 genes for which both EC and pleiotropy information is available (r = 0.06, P = 0.09; see also Figure 1B). Furthermore, there is no correlation between the number of protein domains per gene and the level of gene pleiotropy (r = 0.01, P = 0.62; see Figure 1C). Thus, all three measures of molecular functions indicate that gene pleiotropy is not attributable to an excess of molecular functions.
|
Biological processes:
We then investigate the relationship between pleiotropy and the number of biological processes in which each gene participates, again using GO. A GO-annotated biological process is series of events accomplished by one or more ordered assemblies of molecular functions, such as pyrimidine metabolism or
-glucoside transport (ASHBURNER et al. 2000). A gene may participate in one or multiple biological processes. Not unexpectedly, a significant positive correlation exists between the level of gene pleiotropy and the number of biological processes in which the gene participates (r = 0.12, P < 1010). When the genes are grouped by the level of pleiotropy, we observe a clear trend that the mean number of biological processes per gene increases with pleiotropy (Figure 2A). We also observe a weak, but significant, positive correlation between the number of cellular components where the product of a gene is located (as annotated by GO) and the level of gene pleiotropy (r = 0.05, P < 0.003; see Figure 2B). Here, a cellular component refers to a component of a cell but with the proviso that it is part of some larger object, which may be an anatomical structure or a gene product group (ASHBURNER et al. 2000). We hypothesize that the correlation between the number of cellular components and pleiotropy arises because gene products distributed among more cellular components have opportunities to participate in more biological processes. Indeed, the number of cellular components and the number of biological processes are significantly correlated (r = 0.15, P < 1014). After we control for the number of biological processes, the correlation between the number of cellular components and pleiotropy is no longer significant (r = 0.03, P = 0.09). On the contrary, the correlation between the number of biological processes and pleiotropy is reduced only slightly by the control of the number of cellular components (r = 0.11, P < 108). These results indicate that gene pleiotropy is likely due to multiple biological processes in which the gene participates, and the multiple participations are realized in part by having the gene product distributed into multiple cellular components.
|
Proteinprotein interactions:
Genomewide studies showed that most genes function by PPIs (VON MERING et al. 2002; HE and ZHANG 2006). We thus hypothesize that many pleiotropic genes participate in multiple biological processes through engaging in multiple PPIs. To test this hypothesis, we analyze a recently compiled yeast PPI data set (HAN et al. 2004). We find that the number of PPIs that a gene has is positively correlated with its level of pleiotropy (r = 0.19, P < 106; see Figure 3A). Although pleiotropic genes also tend to be involved in stable protein complexes (see MATERIALS AND METHODS), the correlation between the number of PPIs and the level of pleiotropy remains significant when proteins involved in protein complexes are removed (r = 0.16, P = 0.0015; see Figure S1 at http://www.genetics.org/supplemental/). Furthermore, from the yeast PPI network we identify 106 pairs (n) of interacting proteins that share at least one phenotype (i.e., condition under which slow growth is observed). The number n between random pairs of genes can be estimated by randomly rewiring the yeast PPI network while keeping the number of interactions constant for every protein. The average n from 10,000 randomly rewired PPI networks is 35.9, with none of the 10,000 n values
106 (Figure 3B). Thus, interacting proteins share phenotypic effects significantly more often than by chance (P < 104), suggesting that PPIs underlie some phenotypic effects of genes. We also identified 15 cases in which a focal gene shares different phenotypes with different interacting partners, indicative of pleiotropy arising from multiple PPIs. Figure 3C shows one example in which the seven phenotypes of the focal gene are shared with four different PPI partners. It should be pointed out that the different fitness effects of different gene deletions were controlled for when the yeast gene phenotypic data were generated (see MATERIALS AND METHODS), so the observed relationship between pleiotropy and the number of PPIs is not due to the fact that genes with more PPIs tend to show detectable phenotypes upon deletions (JEONG et al. 2001; HE and ZHANG 2006).
|
We reason that having multiple PPIs contributes to gene pleiotropy because a gene can participate in multiple biological processes through different PPIs. Because two randomly picked genes have only a negligible chance (1.56%) to share a biological process, biological processes shared between interacting proteins are likely dependent on their interaction, although the biological processes of some genes may have been inferred from the PPI information. We find a strong positive correlation between the number of PPIs that a gene has and the total number of nonredundant biological processes that the gene shares with its interacting partners (r = 0.33; P < 1018). This result provides strong evidence that at least one of the molecular mechanisms by which highly pleiotropic genes participate in more biological processes is by multiple PPIs. Another potential mechanism is proteinDNA interaction. We find that although highly pleiotropic transcription factors do regulate more target genes on average, the correlation is not statistically significant (r = 0.14, P = 0.22). This result, however, could be due to the lack of statistical power, as only 83 transcriptional factor genes are included in our data set (see MATERIALS AND METHODS). In an earlier study, PROMISLOW (2004) found a positive correlation between the number of PPIs and gene pleiotropy. However, his gene pleiotropy was not directly measured, but was inferred from the number of functional classifications listed in MIPS. In effect, his finding was a correlation between the number of PPIs and the number of biological processes.
To test the robustness of our results, we analyzed another yeast gene pleiotropy data set (PARSONS et al. 2004). In this data set, the growth rate of each single-gene deletion strain was measured in the presence and absence of 1 of 12 diverse inhibitory compounds. The pleiotropic level of a gene is defined by the number of compounds that inhibit the growth of the deletion strain. Similar to the above results, we do not find a significant correlation between the number of GO-defined molecular functions and the pleiotropic level of a gene (r = 0.03, two-tailed P = 0.13, n = 2386; see Figure S5a at http://www.genetics.org/supplemental/ for binned data). In contrast, a significant positive correlation between the number of GO-defined biological processes and pleiotropy is observed (r = 0.13, two-tailed P < 109, n = 2822; see Figure S5b at http://www.genetics.org/supplemental/ for binned data). Thus, our conclusion on the molecular basis of yeast gene pleiotropy appears robust for different data sets. At this time, however, no genomewide data sets of gene pleiotropy are available for other organisms, making it difficult to conclude whether the mechanisms revealed from the yeast apply to all organisms.
Revelation of the molecular basis of pleiotropy has several implications. Much effort has been employed to identify alleles of a pleiotropic gene, each of which affects one of the many phenotypic effects of the gene. This strategy deserves reevaluation because, if the pleiotropic effects of a gene are usually due to the same molecular function, it would be difficult or at least inefficient to isolate gene mutants that affect only one trait. This concern is particularly meaningful in human genetics where isolation of symptom-specific alleles is thought to be important for developing effective treatments (DUDLEY et al. 2005). Our finding suggests that developing drugs that target only one particular phenotypic effect of a pleiotropic gene is likely to be difficult. However, targeting a specific protein interaction or an interacting partner of the pleiotropic gene might be a useful strategy. Our results also show the universality that the same molecular function of a gene is repeatedly used in different biological processes.
Pleiotropic genes are widely believed to be evolutionarily conserved because they are subject to purifying selection acting on multiple traits and are less likely to experience beneficial mutations (FISHER 1958; HODGKIN 1998). We find that 39.5 ± 0.8% of nonpleiotropic yeast genes (no phenotype in any condition) have detectable homologs in the fruit fly D. melanogaster (Figure 4A). In comparison, 49.2 ± 2.3% of low pleiotropic genes (with phenotypes in one to two conditions) and 54.7 ± 3.6% of high pleiotropic genes (with phenotypes in more than two conditions) have fruit fly homologs. Together, pleiotropic genes are significantly more likely to be retained in long-term evolution than nonpleiotropic genes (
2 = 29, P < 107). Similarly, 52.6 ± 2.7% of pleiotropic yeast genes have detectable homologs in the nematode C. elegans, in comparison to 38.3 ± 1.1% of nonpleiotropic genes (
2 = 20, P < 105). When the fungus S. pombe is compared, 71.7 ± 3.3% of pleiotropic yeast genes have detectable homologs, in comparison to 58.4 ± 1.3% of nonpleiotropic genes (
2 = 41, P < 109). We also computed the nonsynonymous nucleotide distance (dN) between orthologous genes of S. cerevisiae and S. bayanus, two closely related yeast species, and observed a negative correlation between pleiotropy and dN (r = 0.12, P < 1011), suggesting that pleiotropic genes tend to evolve more slowly at nonsynonymous sites (Figure 4B), consistent with a recent study that was based on fewer genes (SALATHE et al. 2006). Our result is robust (r = 0.09, P < 107) even when we control for the level of gene expression, the most important determinant of dN in yeasts (PAL et al. 2001; ZHANG and HE 2005; DRUMMOND et al. 2006). Thus, genomewide analyses demonstrate that pleiotropy leads to the evolutionary conservation of genes and gene sequences.
|
ASHBURNER, M., C. A. BALL, J. A. BLAKE, D. BOTSTEIN, H. BUTLER et al., 2000 Gene ontology: tool for the unification of biology: the Gene Ontology Consortium. Nat. Genet. 25: 2529.[CrossRef][Medline]
BARTON, N. H., 1990 Pleiotropic models of quantitative variation. Genetics 124: 773782.[Abstract]
BRUNNER, H. G., and M. A. VAN DRIEL, 2004 From syndrome families to functional genomics. Nat. Rev. Genet. 5: 545551.[CrossRef][Medline]
DRUMMOND, D. A., A. RAVAL and C. O. WILKE, 2006 A single determinant dominates the rate of yeast protein evolution. Mol. Biol. Evol. 23: 327337.
DUDLEY, A., D. JANSE, A. TANAY, R. SHAMIR and G. CHURCH, 2005 A global view of pleiotropy and phenotypically derived gene function in yeast. Mol. Syst. Biol. 1: 2005.0001.
FISHER, R. A., 1958 The Genetic Theory of Natural Selection. Dover, New York.
FOSTER, K. R., G. SHAULSKY, J. E. STRASSMANN, D. C. QUELLER and C. R. THOMPSON, 2004 Pleiotropy as a mechanism to stabilize cooperation. Nature 431: 693696.[CrossRef][Medline]
HAN, J. D., N. BERTIN, T. HAO, D. S. GOLDBERG, G. F. BERRIZ et al., 2004 Evidence for dynamically organized modularity in the yeast protein-protein interaction network. Nature 430: 8893.[CrossRef][Medline]
HARBISON, C. T., D. B. GORDON, T. I. LEE, N. J. RINALDI, K. D. MACISAAC et al., 2004 Transcriptional regulatory code of a eukaryotic genome. Nature 431: 99104.[CrossRef][Medline]
HE, X., and J. ZHANG, 2006 Why do hubs tend to be essential? PLoS Genet. 2: e88.[CrossRef][Medline]
HODGKIN, J., 1998 Seven types of pleiotropy. Int. J. Dev. Biol. 42: 501505.[Medline]
JEONG, H., S. P. MASON, A. L. BARABASI and Z. N. OLTVAI, 2001 Lethality and centrality in protein networks. Nature 411: 4142.[CrossRef][Medline]
MACLEAN, R. C., G. BELL and P. B. RAINEY, 2004 The evolution of a pleiotropic fitness tradeoff in Pseudomonas fluorescens. Proc. Natl. Acad. Sci. USA 101: 80728077.
OTTO, S. P., 2004 Two steps forward, one step back: the pleiotropic effects of favoured alleles. Proc. Biol. Sci. 271: 705714.
PAL, C., B. PAPP and L. D. HURST, 2001 Highly expressed genes in yeast evolve slowly. Genetics 158: 927931.
PARSONS, A. B., R. L. BROST, H. DING, Z. LI, C. ZHANG et al., 2004 Integration of chemical-genetic and genetic interaction data links bioactive compounds to cellular target pathways. Nat. Biotechnol. 22: 6269.[CrossRef][Medline]
PROMISLOW, D. E., 2004 Protein networks, pleiotropy and the evolution of senescence. Proc. Biol. Sci. 271: 12251234.
RAFIKOVA, O., R. RAFIKOV and E. NUDLER, 2002 Catalysis of S-nitrosothiols formation by serum albumin: the mechanism and implication in vascular control. Proc. Natl. Acad. Sci. USA 99: 59135918.
SALATHE, M., M. ACKERMANN and S. BONHOEFFER, 2006 The effect of multifunctionality on the rate of evolution in yeast. Mol. Biol. Evol. 23: 721722.
STRASSMANN, J. E., Y. ZHU and D. C. QUELLER, 2000 Altruism and social cheating in the social amoeba Dictyostelium discoideum. Nature 408: 965967.[CrossRef][Medline]
VAN DE PEPPEL, J., and F. C. HOLSTEGE, 2005 Multifunctional genes. Mol. Syst. Biol. 1: 2005.0003.
VON MERING, C., R. KRAUSE, B. SNEL, M. CORNELL, S. G. OLIVER et al., 2002 Comparative assessment of large-scale data sets of protein-protein interactions. Nature 417: 399403.[Medline]
WAXMAN, D., and J. R. PECK, 1998 Pleiotropy and the preservation of perfection. Science 279: 12101213.
WILLIAMS, G. C., 1957 Pleiotropy, natural selection, and the evolution of senescence. Evolution 11: 398411.[CrossRef]
WRIGHT, S., 1968 Evolution and the Genetics of Populations. University of Chicago Press, Chicago.
ZHANG, J., and X. HE, 2005 Significant impact of protein dispensability on the instantaneous rate of protein evolution. Mol. Biol. Evol. 22: 11471155.
Communicating editor: S. YOKOYAMA
This article has been cited by other articles:
![]() |
T. Shlomi, M. Herrgard, V. Portnoy, E. Naim, B. O. Palsson, R. Sharan, and E. Ruppin Systematic condition-dependent annotation of metabolic genes Genome Res., November 1, 2007; 17(11): 1626 - 1633. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Gu Evolutionary Framework for Protein Sequence Evolution and Gene Pleiotropy Genetics, April 1, 2007; 175(4): 1813 - 1822. [Abstract] [Full Text] [PDF] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
-
All Versions of this Article:
genetics.106.060269v1
173/4/1885 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by He, X.
- Articles by Zhang, J.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by He, X.
- Articles by Zhang, J.





