| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |

* Department of Plant and Microbial Biology, University of California, Berkeley, California 94720
College of Natural Resources, University of California, Berkeley, California 94720
3 Corresponding author: Plant and Microbial Biology, 111 Koshland Hall, University of California, Berkeley, CA 94720-3102.
E-mail: kustu{at}nature.berkeley.edu
| ABSTRACT |
|---|
|
|
|---|
4400 for the genes of Escherichia coli), analysis of the data has necessitated the development of a variety of tools. More than 50 different commercial, shareware, and free software products are currently available (for a brief summary, see GOODMAN 2002), most of which focus on statistical normalization and analysis of numerical data. Powerful statistical clustering algorithms have been developed for interpreting data from many experiments (reviewed in SHERLOCK 2000). To complement the tools available, we have developed simple programs for visualizing gene expression patterns in E. coli in their genomic context and for identifying known biological relationships among lists of genes (ZIMMER et al. 2000; WENDISCH et al. 2001; SOUPENE et al. 2003). These are particularly helpful to biologists who wish to interpret relatively small numbers of experiments. | MATERIALS AND METHODS |
|---|
|
|
|---|
7 MB each, 10-µm resolution) representing fluorescence intensities for the Cy3- and Cy5-labeled cDNAs hybridized to slides were generated using a GenePix scanner (Axon Instruments, Union City, CA). These images were overlaid and analyzed in Scanalyze 2.x (http://rana.lbl.gov/EisenSoftware.htm) or GenePix 3.0. Global intensity normalization was used to calculate a normalization factor for each pair of images (SCHENA et al. 1995) and the image intensities were normalized accordingly in Scanalyze. The normalized overlaid image was then saved as a bitmap image, which was converted to an 8-bit color GIF image and then to a portable network graphic file (PNG) using standard image manipulation software. At this point, some of the quantitative information is lost. Glass-slide DNA microarray data were stored in AMAD (Another MicroArray Database; http://www.microarrays.org/AMADFaq.html). AMAD, developed by Joe DeRisi, is a flat file database written in Perl and JavaScript that allows storage and retrieval of raw scanned slide images and extraction of numerical data.
Generation of genome images:
Genome images were built on AMAD as a core component. All image-manipulating scripts were written in Perl programming language (WALL et al. 2000), with the CGI.pm and the GD.pm modules of Lincoln Stein (http://stein.cshl.org/WWW/software/) installed.
From the PNG files described above, the program that generates genome images extracts rectangles containing the spots and arranges them according to their E. coli b number (BLATTNER et al. 1997). For E. coli microarrays, the resulting genome images contain 45 rows of 100 spots/row, with each spot in a 10-pixel square (the original size of the scanned spot). The output of the program is a PNG "genome image" file and an HTML document containing an image map of the b numbers, gene names, gene descriptions, and links to the raw data. The image is stored in a local database along with raw data files, and both can be easily accessed through a web-based interface that is provided. By clicking on a spot in a genome image, the user is transferred to the E. coli Entry Point (see below), which allows quick access to biological information on the gene corresponding to this spot.
On a separate page the user can display a list of genes corresponding to spots that fulfill certain criteria (see below). The spots can then be outlined in blue boxes on the genome image and can also be transferred directly to the E. coli Entry Point (see below), from which other biological information can be accessed.
A generalized version of the program for generating genome (and other sorts of) images uses as input: (1) a tab-delimited "ORDER" text file containing the headings (ORDER, TOP, LEFT, RIGHT, BOTTOM, NAME, DESC, LINK), where the TOP, LEFT, RIGHT, and BOTTOM parameters refer to the corresponding pixel positions of each spot in the original microarray image, and (2) a PNG image file. The program aligns the spots according to the order specified in the ORDER file, yielding an HTML page and an image. An image map identifies each spot and includes a user-specified hyperlink. The generalized genome image program is written in Perl and requires that all of the appropriate Perl modules (GD, CGI) be installed. It can be accessed at http://coli.berkeley.edu/genomeimages/ and the stand-alone version can be downloaded from the same site.
A GenePix-specific version of the program for generating genome images was written to accommodate the large number of users of the Axon GenePix software. This program uses as input: (1) a GenePix results file (GPR), (2) a PNG or JPEG image file, and (3) an ORDER file. For the tab-delimited ORDER file, the ORDER and ID fields are required, and the NAME, DESC, and LINK fields are optional but recommended. The TOP, LEFT, RIGHT, and BOTTOM fields are not used by the GenePix version of this program because they are calculated from fields that are present in the GPR file. The program joins the ORDER table to the GPR table by the ID field present in both files. For E. coli, we have used the b number as the ID in both the GPR file and the ORDER file. The GenePix-specific program can also be accessed at http://coli.berkeley.edu/genomeimages.
Alignment of genome images:
The program that aligns genome images takes as input a list of genome images in PNG format (each assumed to be 100 cells wide with each 10-pixel cell containing a spot). The program vertically concatenates corresponding rows from each of the genome images to generate a new larger image of the data with rows of spot images aligned. A generalized version of the genome image alignment program works as follows: (1) The user is first prompted for the number of genome images he would like to align, and then (2) on a second page the user must upload all of the images to be aligned, preferably in PNG format, and must upload the ORDER file described above. Currently a maximum of 12 images can be aligned, but this can be reconfigured at local installations. The generalized version of the genome image alignment program can be run or downloaded from http://coli.berkeley.edu/genomeimages/.
The AMAD core database of DeRisi (accessible through a web-based interface) allows the user to extract from multiple experiments lists of genes corresponding to spots that fulfill specified criteria, e.g., have a normalized median red-to-green (R/G) ratio higher than a specified cutoff value. Outputs can be saved directly to the local computer.
The E. coli Entry Point database:
The E. coli Entry Point programs are written in Perl using the CGI.pm module. Data are stored in a MySQL database (http://www.mysql.com/) and accessed using the DBI.pm and DBD::MySQL Perl modules (DESCARTES and BUNCE 2000; http://www.cpan.org/modules/by-module/DBI/; http://www.cpan.org/modules/by-module/DBD/). The E. coli Entry Point is composed of a main script and several subsidiary scripts. Their functions, features, and data resources, which can be accessed at http://coli.berkeley.edu/genomeimages/, are outlined below, along with those of the additional databases to which the Entry Point has links.
The main page allows the user to display annotation information for lists of E. coli genes. The primary source of data that was used is the ecoli.ptt file (NC_000913.ptt), which was compiled as part of the E. coli sequencing effort and downloaded from the National Center for Biotechnology Information (NCBI) (ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Escherichia_coli_K12/). The user begins by entering or selecting a list of genes using any one of several nomenclatures. The heading "Display standard fields" allows the user to display basic annotation information on these genes, including b number, gene name, gene position on chromosome (left and right), strand orientation, protein length, GenBank ID, functional description (called gene description), and operon ID. We have updated some of the gene names on the basis of evidence in the primary literature. The heading "Sorting" allows the user to sort and group the genes being displayed by genome position or functional category (see below). When genes are sorted by position, the background color of the row (alternating between yellow and white) is used to indicate different operons. Similarly, when genes are sorted by category, genes belonging to the same category are indicated with the same color. Additional fields that can be displayed are:
-factors that have documented or predicted binding sites within 2000 nt of each gene according to GLASNER et al. (2003)(http://asap.ahabs.wisc.edu/annotation/php/ASAP1.htm). After a list of genes has been generated, a set of clickable buttons at the bottom of the main E. coli Entry Point page allows access to information from the subsidiary programs and to supplementary data taken from external sources. The fields available are (in order of presentation): chromosome position, operons, functional category, COG description, protein binding sites, selfBLAST, features, gene sequences, and AMAD (takes the user to Genome Images/AMAD database). A brief description of each follows.
-factors and/or selected protein transcriptional regulators. The left and right ends of the binding site for each
-factor or regulatory protein are specified, along with the status of the site (documented or predicted). The distance to the starting ATG of the gene and the position of the transcriptional start are also indicated (GLASNER et al. 2003). | RESULTS |
|---|
|
|
|---|
|
3 are boxed in blue (see below). The strain grows slightly less rapidly on taurine than on sulfate and appears to perceive some degree of sulfur limitation. As expected from previous work (VAN DER PLOEG et al. 2001), two operons under control of the regulators CysB and Cbl (CysB-like) were more highly expressed on taurine. These are tauABCD (b0365b0368), a catabolic operon for taurine, and ssuEADCB (b0937b0933), a catabolic operon for utilization of alkanesulfonates. They are easily identified on the image as striking strings of red spots. A number of single red spots are also clearly visible. Two that are easily understood are a spot corresponding to the cbl regulatory gene (b1987) and one corresponding to sbp (b3917), the gene for a periplasmic sulfate transport component known to be highly expressed under sulfur-limiting conditions (QUADRONI et al. 1996). Note that tauB and cbl are not boxed because their R/G ratios were <3. The reproducibility and significance of other red spots is currently being assessed (P. GYANESHWAR, unpublished results). Also available from genome images is visual information on spot intensities, information that may be lost in some higher-level analyses of the data (e.g., clustering based on R/G ratios). By displaying the spots rather than pseudocolors representing R/G ratios, we can discern, for example, bright yellow spots, genes for which there is probably a large amount of mRNA in both cultures. Several long strings of bright yellow spots in Figure 1 correspond to operons of ribosomal protein genes or clusters of such operons (e.g., b32943298, b32993310, b33113321, b33393342, b39833984, b39853986), which are always highly expressed in E. coli (NEIDHARDT et al. 1990). Other strings of bright yellow spots correspond to genes of the flagellar and chemotaxis regulon (b10701083, b18811892, b19201926) and to operons encoding the F1F0 ATPase (b37313739) and the tricarboxylic acid cycle enzymes succinate dehydrogenase (b07210724) and 2-oxoglutarate dehydrogenase and succinyl-CoA synthetase (b0726b0729). Although high intensity is probably an indication of high mRNA levels, long length of a gene and/or a large amount of DNA (PCR product) attached to the slide may contribute. Low intensity of a spot may have many causes but low intensity of a group of adjacent spots corresponding to an operon(s) probably indicates that the operon is not highly expressed under either condition chosen for the comparison and hence R/G ratios should be evaluated accordingly.
The quality of spots can be assessed directly on genome images without the need for complex statistical procedures because the images are composed of the actual scanned pixels. Dark spots within operons can be seen easily when they are surrounded by spots that are otherwise red, green, or bright yellow. Such spots often indicate failed PCR products or damaged print tips. In Figure 1 there are two black spots (b3309 and b3310) in the middle of the string of ribosomal protein genes between b3294 and b3321. They were reproducibly black in several prints and hence probably are failed PCR products.
Finally, in conjunction with analyses at the E. coli Entry Point (see below), genome images can be helpful in detecting misannotated operons and artifactual differential expression. For example, we determined that the gltIJK and L genes probably constitute a single operon, as do yhdWXY and Z, although gltI was not originally included with the other glt genes and the yhd operon was split in half (ZIMMER et al. 2000). Likewise we showed that apparent overexpression of the cynX gene upon IPTG induction of the lactose operon was an artifact of readthrough transcription (WENDISCH et al. 2001, Figure 3C). The cynX gene is adjacent to lacA and is the last gene in the cynTSX operon, which is transcribed toward lac. The high signal seen in IPTG-induced cells is probably due to the presence of antisense RNA because many transcripts for the lac operon terminate at least one-third of the way into the cynX gene (HEDIGER et al. 1985; MCCORMICK et al. 1991).
As indicated in MATERIALS AND METHODS, the AMAD database, in which our genome images are stored, allows extraction of the corresponding numerical data. AMAD was developed by Joe DeRisi. Extraction of numerical data can, of course, also be accomplished with other microarray data analysis programs.
Alignment of genome images:
Aligned genome images are used to identify similarities and differences in gene expression (mRNA levels) in several experiments and to assess reproducibility of these differences (see, for example, an alignment of four images for an E. coli K12 strain grown on taurine vs. sulfate at http://nature.berkeley.edu/~opaliy/papers/GenomeImages.html). An alignment of two E. coli genome images can be viewed on a single screen at 1280 x 1024 resolution. Alignments of several images (we have used up to a dozen; e.g., see Figure 2) must be scrolled. As for single images, image maps allow ready identification of gene names and functions corresponding to particular spots.
|
E. coli Entry Pointtools for identifying relationships among E. coli genes:
The E. coli Entry Point is a set of tools for identifying known biological relationships among groups of genes. A user can enter any list and then display various sorts of biological information for each gene, including information on chromosome position and inclusion in an operon, promoter and
-factor controlling expression, regulatory proteins that bind upstream and their binding sites, sequence, function of the gene product, and homology relationships to other gene products (see MATERIALS AND METHODS). The full list of genes together with the information requested is shown on one web page, allowing fast comparisons and interpretations. In addition, the information can easily be copied into a spreadsheet program such as Microsoft Excel for further analysis locally.
As an illustration (Figure 3) we show a screen shot of the E. coli Entry Point page displaying information on genes that were highly expressed on taurine vs. sulfate in the experiment of Figure 1. Data for spots with R/G ratio
3 were first extracted from AMAD and the corresponding list of genes was transferred directly to the E. coli Entry Point. The screen shot shows some of the basic annotation information available from the options "Display Standard Fields" and "Display Additional Fields." Note that the background shading of the rows alternates between operons. Note, too, that there are direct links to the other major E. coli databases listed. Thus, if the user wishes additional information on a gene(s) of interest, he or she can go to a gene-specific page of any of these databases with one click of the mouse button. A screen shot of all the additional information available from the option "Display Additional Fields" is provided at http://nature.berkeley.edu/~opaliy/papers/GenomeImages.html, along with comments.
|
3. By clicking the button at the bottom of the operon page the user can now return to the E. coli Entry Point with all genes of the operons being considered and obtain additional information for all of them. Entering b0367 in AMAD allows the user to determine that the R/G median for this gene was 2.0, whereas ratios for the 9 genes originally in the list were between 3.2 and 14.8. Screen shots obtained by using all of the clickable buttons at the bottom of the E. coli Entry Point page for the expanded list of 10 genes are given at http://nature.berkeley.edu/~opaliy/papers/GenomeImages.html, along with comments on the timeliness and accuracy of the information currently available.
|
Finally, if the user wishes first to determine the gene set meeting a certain criterion, e.g., all the genes containing "tau" in their name, he or she can begin with the option "Select Genes" at the Entry Point and then return to the Entry Point with the resulting list. Criteria for selecting genes include gene name, description, b number, position on the genome, and length. The user can also generate lists of genes from other programs or E. coli resources on the internet and import them into the E. coli Entry Point.
| DISCUSSION |
|---|
|
|
|---|
A major goal of aligning genome images is similar to that of powerful statistical methods for data analysis such as hierarchical clustering (EISEN et al. 1998). The two approaches are complementary, with alignment having two distinct strengths for small numbers of experiments. First, the alignment of several genome images can be viewed on a single page, whereas the complete cluster analysis for E. coli requires many more pages. (Results of the latter are usually organized into a figure/table that is L experiments wide and N genes high, where N is
4400 for E. coli.) The compactness of genome images facilitates rapid qualitative analysis of the data and reduces its complexity by allowing immediate consideration of operons without the need to sift through lists of hundreds of genes. Apart from the 1125 genes that are transcribed separately in E. coli, the remaining 3100 protein-coding genes are partitioned into only about one-quarter as many operons (
750; GLASNER et al. 2003; http://asap.ahabs.wisc.edu/annotation/php/ASAP1.htm). In the case of the NtrC and Nac regulons, the 75 genes involved were members of only 25 operons (ZIMMER et al. 2000). A second advantage of genome images is that members of operons are contiguous whereas this often is not the case in a hierarchical cluster. The nested NtrC and Nac regulons provide an interesting example. Many of the operons under control of these regulatory proteins encode ABC transport systems for nitrogen-containing compounds, and, biologically, members of each operon are their own closest neighbors because they must function together. Genome images showed clearly that expression of all genes in each operon changed in the same direction when various strains and growth conditions were compared (ZIMMER et al. 2000). Nevertheless, members of different operons were intermingled in hierarchical clusters (http://nature.berkeley.edu/~opaliy/papers/GenomeImages.html) at least partly because expression (mRNA levels) of the genes in each operon apparently did not change to the same extent. [As discussed previously, we think this has a biological explanation (ZIMMER et al. 2000).] Apart from problems with operons, however, the results of hierarchical clustering and interpretation of genome images (ZIMMER et al. 2000) were remarkably congruent, illustrating the complementarity of the two means of analysis. A cluster of only 39 genes contained 32 genes in operons directly under NtrC control and a second cluster of only 24 genes contained 17 genes in operons under Nac control. In all, two-thirds of the 75 genes we had identified previously were in these two clusters (http://nature.berkeley.edu/~opaliy/papers/GenomeImages.html).
In their masterful study using glass-slide DNA microarrays and hierarchical clustering to analyze tryptophan metabolism in E. coli, KHODURSKY et al. (2000) mentioned that only five known multigene operons were fully represented in the set of 169 genes that they selected to analyze, whereas 37 operons were represented by only a single gene. For example, expression of only a few of the 50 genes in the flagellar and chemotaxis regulon appeared to respond to tryptophan availability. However, examination of the data in genome images showed that expression of genes in many operons, including those of the flagellar and chemotaxis regulon, differed in the same direction in particular comparisons between growth conditions or strains (strings of contiguous red or green spots; see example at http://nature.berkeley.edu/~opaliy/papers/GenomeImages.html). One image revealed a striking artifact: an apparent difference in expression of the flagellar and chemotaxis regulon between a wild-type strain (W3110) and a strain lacking the tryptophan repressor CY15682. The W3110 wild-type strain is in the same lineage as MG1655 (BACHMANN 1996) and expresses flagellar genes poorly (see above), whereas the particular trpR2 strain used for this and one other experiment is apparently noncongenic with W3110 and expresses these genes well. The difference was not seen when congenic trpR2 and wild-type strains were compared (KHODURSKY et al. 2000; P. GYANESHWAR, unpublished results). The use of genome images to examine the data of KHODURSKY et al. (2000), which we are analyzing in further detail elsewhere (P. GYANESHWAR, A. JONES, A. KHODURSKY and S. KUSTU, unpublished results), illustrated the value of these images as an adjunct to hierarchical clustering.
The E. coli Entry Point:
After examining genome images and using data sorting and filtering methods to determine a list of genes whose expression differs in a microarray experiment, an investigator can use the E. coli Entry Point to extract biological information about these genes. Examples of the uses of the Entry Point are given in the RESULTS. We previously used the Entry Point to determine relationships among the genes of the NtrC and Nac regulons and to update their annotations. In addition, we used it recently to export a list of all
4400 E. coli genes together with appropriate functional information into a spreadsheet file that was used to compare the protein and mRNA profiles of E. coli on a global scale (CORBIN et al. 2003). The comparison was also visualized in artificial color in an analog of an aligned genome image (http://coli.berkeley.edu/protein_profile/).
The Entry Point consists of simple programs that extract and visualize data, which can be downloaded from a variety of publicly available sources (see MATERIALS AND METHODS). The capacity to visualize this data in new ways rests on the flexibility given by being able to access it from a MySQL database that was implemented locally. As illustrated in the RESULTS and at http://nature.berkeley.edu/~opaliy/papers/GenomeImages.html, the quality of the information obtained from the Entry Point depends on whether information in other databases is current and accurate. One very useful feature of the Entry Point is that it facilitates access to primary literature from PubMed (EcoGene Bibliography) and to information from other databases. Data from these sources can be cross-checked to obtain the best possible information on a list of genes at any given time.
Conclusions:
Global expression technologies have led to a rapid increase in our knowledge and understanding of metabolic pathways and regulatory networks in a variety of microbes and other organisms. As the use of DNA microarrays becomes more widespread among biologists of all generations, it will be useful to have biologist-friendly software and visualization tools available to supplement more mathematical tools. Genome images and the E. coli Entry Point should be useful in this regard. Our current efforts are directed at improving these tools for E. coli, making them widely available, and generalizing them to other microorganisms.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
2 Present address: Department of Biochemistry and Molecular Biology, Wright State University, Dayton, OH 45435. ![]()
| LITERATURE CITED |
|---|
|
|
|---|
BACHMANN, B. J., 1996 Derivations and genotypes of some mutant derivatives of Escherichia coli K-12, pp. 24602488 in Escherichia coli and Salmonella: Cellular and Molecular Biology, Ed. 2, edited by F. C. NEIDHARDT, R. CURTISS III, E. C. C. LIN, J. INGRAHAM, B. K. LOW et al. ASM Press, Washington, DC.
BAIROCH, A., and R. APWEILER, 2000 The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 28: 4548.
BERLYN, M. B., and S. LETOVSKY, 1992 Genome-related datasets within the E. coli Genetic Stock Center database. Nucleic Acids Res. 20: 61436151.
BLATTNER, F. R., G. PLUNKETT, III, C. A. BLOCH, N. T. PERNA, V. BURLAND et al., 1997 The complete genome sequence of Escherichia coli K-12. Science 277: 14531474.
CORBIN, R. W., O. PALIY, F. YANG, J. SHABANOWITZ, M. PLATT et al., 2003 Toward a protein profile of Escherichia coli: comparison to its transcription profile. Proc. Natl. Acad. Sci. USA 100: 92329237.
DESCARTES, A., and T. BUNCE, 2000 Programming the Perl DBI. O'Reilly & Associates, Cambridge, MA.
EISEN, M. B., P. T. SPELLMAN, P. O. BROWN and D. BOTSTEIN, 1998 Cluster analysis and display of genome-wide expression patterns. Proc. Natl. Acad. Sci. USA 95: 1486314868.
GLASNER, J. D., P. LISS, G. PLUNKETT, III, A. DARLING, T. PRASAD et al., 2003 ASAP, a systematic annotation package for community analysis of genomes. Nucleic Acids Res. 31: 147151.
GOODMAN, N., 2002 A dim summary of microarray software. Genome Technol. 19: 5864.
GUTNICK, D., J. M. CALVO, T. KLOPOTOWSKI and B. N. AMES, 1969 Compounds which serve as the sole source of carbon or nitrogen for Salmonella typhimurium LT-2. J. Bacteriol. 100: 215219.
HEDIGER, M. A., D. F. JOHNSON, D. P. NIERLICH and I. ZABIN, 1985 DNA sequence of the lactose operon: the lacA gene and the transcriptional termination region. Proc. Natl. Acad. Sci. USA 82: 64146418.
KARP, P. D., M. RILEY, M. SAIER, I. T. PAULSEN, S. M. PALEY et al., 2000 The EcoCyc and MetaCyc databases. Nucleic Acids Res. 28: 5659.
KHODURSKY, A. B., B. J. PETER, N. R. COZZARELLI, D. BOTSTEIN, P. O. BROWN et al., 2000 DNA microarray analysis of gene expression in response to physiological and genetic changes that affect tryptophan metabolism in Escherichia coli. Proc. Natl. Acad. Sci. USA 97: 1217012175.
LEHNEN, D., C. BLUMER, T. POLEN, B. WACKWITZ, V. F. WENDISCH et al., 2002 LrhA as a new transcriptional key regulator of flagella, motility and chemotaxis genes in Escherichia coli. Mol. Microbiol. 45: 521532.[CrossRef][Medline]
LERCHER, M. J., A. O. URRUTIA and L. D. HURST, 2002 Clustering of housekeeping genes provide a unified model of gene order in the human genome. Nat. Genet. 31: 180183.[CrossRef][Medline]
MASUDA, N., and G. M. CHURCH, 2003 Regulatory network of acid resistance genes in Escherichia coli. Mol. Microbiol. 48: 699712.[CrossRef][Medline]
MCCORMICK, J. R., J. M. ZENGEL and L. LINDAHL, 1991 Intermediates in the degradation of mRNA from the lactose operon of Escherichia coli. Nucleic Acids Res. 19: 27672776.
MEDIGUE, C., A. VIARI, A. HENAUT and A. DANCHIN, 1993 Colibri: a functional data base for the Escherichia coli genome. Microbiol. Rev. 57: 623654.
NEIDHARDT, F. C., J. L. INGRAHAM and M. SCHAECHTER, 1990 Physiology of the Bacterial Cell: A Molecular Approach. Sinauer Associates, Sunderland, MA.
QUADRONI, M., W. STAUDEMANN, M. KERTESZ and P. JAMES, 1996 Analysis of global responses by protein and peptide fingerprinting of proteins isolated by two dimensional gel electrophoresis: application to sulfate starvation responses of Escherichia coli. Eur. J. Biochem. 239: 773781.[Medline]
RILEY, M., and B. LABEDAN, 1996 E. coli gene products: physiological functions and common ancestries, pp. 21182202 in Escherichia coli and Salmonella: Cellular and Molecular Biology, Ed. 2, edited by F. C. NEIDHARDT, R. CURTISS III, E. C. C. LIN, J. INGRAHAM, B. K. LOW et al. ASM Press, Washington, DC.
RILEY, M., and D. B. SPACE, 1996 Genes and proteins of Escherichia coli (GenProtEC). Nucleic Acids Res. 24: 40.
ROY, P. J., J. M. STUART, J. LUND and S. K. KIM, 2002 Chromosomal clustering of muscle-expressed genes in Caenorhabditis elegans. Nature 418: 975979.[Medline]
RUDD, K. E., 2000 EcoGene: a genome sequence database for Escherichia coli K-12. Nucleic Acids Res. 28: 6064.
SALGADO, H., A. SANTOS-ZAVALETA, S. GAMA-CASTRO, D. MILLAN-ZARATE, E. DIAZ-PEREDO et al., 2001 RegulonDB (version 3.2): transcriptional regulation and operon organization in Escherichia coli K-12. Nucleic Acids Res. 29: 7274.
SCHENA, M., D. SHALON, R. W. DAVIS and P. O. BROWN, 1995 Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science 270: 467470.
SHERLOCK, G., 2000 Analysis of large-scale gene expression data. Curr. Opin. Immunol. 12: 201205.[CrossRef][Medline]
SOUPENE, E., W. C. VAN HEESWIJK, J. PLUMBRIDGE, V. STEWART, D. BERTENTHAL et al., 2003 Physiological studies of Escherichia coli strain MG1655: growth defects and apparent cross-regulation of gene expression. J. Bacteriol. 85: 56115626.
SPELLMAN, P. T., and G. M. RUBIN, 2002 Evidence for large domains of similarly expressed genes in the Drosophila genome. J. Biol. 1: 5.[CrossRef][Medline]
TATUSOV, R. L., M. Y. GALPERIN, D. A. NATALE and E. V. KOONIN, 2000 The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 28: 3336.
VAN DER PLOEG, J. R., E. EICHHORN and T. LEISINGER, 2001 Sulfonate-sulfur metabolism and its regulation in Escherichia coli. Arch. Microbiol. 176: 18.[CrossRef][Medline]
WALL, L., T. CHRISTIANSEN and J. ORWANT, 2000 Programming Perl. O'Reilly & Associates, Cambridge, MA.
WENDISCH, V. F., D. P. ZIMMER, A. KHODURSKY, B. PETER, N. COZZARELLI et al., 2001 Isolation of Escherichia coli mRNA and comparison of expression using mRNA and total RNA on DNA microarrays. Anal. Biochem. 290: 205213.[CrossRef][Medline]
ZIMMER, D. P., E. SOUPENE, H. L. LEE, V. F. WENDISCH, A. B. KHODURSKY et al., 2000 Nitrogen regulatory protein C-controlled genes of Escherichia coli: scavenging as a defense against nitrogen limitation. Proc. Natl. Acad. Sci. USA 97: 1467414679.
This article has been cited by other articles:
![]() |
E.-B. Goh, P. J. Bledsoe, L.-L. Chen, P. Gyaneshwar, V. Stewart, and M. M. Igo Hierarchical Control of Anaerobic Gene Expression in Escherichia coli K-12: the Nitrate-Responsive NarX-NarL Regulatory System Represses Synthesis of the Fumarate-Responsive DcuS-DcuR Regulatory System J. Bacteriol., July 15, 2005; 187(14): 4890 - 4899. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Gyaneshwar, O. Paliy, J. McAuliffe, A. Jones, M. I. Jordan, and S. Kustu Lessons from Escherichia coli genes similarly regulated in response to nitrogen and sulfur limitation PNAS, March 1, 2005; 102(9): 3453 - 3458. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Gyaneshwar, O. Paliy, J. McAuliffe, D. L. Popham, M. I. Jordan, and S. Kustu Sulfur and Nitrogen Limitation in Escherichia coli K-12: Specific Homeostatic Responses J. Bacteriol., February 1, 2005; 187(3): 1074 - 1090. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |