- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by de Boer, J. G.
- Articles by Glickman, B. W.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by de Boer, J. G.
- Articles by Glickman, B. W.
The lacI Gene as a Target for Mutation in Transgenic Rodents and Escherichia coli
Johan G. de Boera and Barry W. Glickmanaa Centre for Environmental Health, University of Victoria, Victoria, British Columbia, Canada V8W 3N5
Corresponding author: Johan G. de Boer, Centre for Environmental Health, University of Victoria, P.O. Box 3020, Victoria, BC Canada V8W 3N5, jdboer{at}uvic.ca (E-mail).
| ABSTRACT |
|---|
The lacI gene has been used extensively for the recovery and analysis of mutations in bacteria with various DNA repair backgrounds and after exposure to a wide variety of mutagens. This has resulted in a large database of information on mutational mechanisms and specificity of many mutagens, as well as the effect of DNA repair background on mutagenicity. Most importantly, knowledge about the mutational sensitivity of the lacI gene is now available, yielding information about mutable nucleotides. This popularity and available knowledge resulted in the use of the lacI gene in transgenic rodents for the study of mutagenesis in mammals, where it resides in ~40 repeated copies. As the number of sequenced mutations recovered from these animals increases, we are able to analyze the sites at which mutations have been recovered in great detail and to compare the recovered sites between bacteria and transgenic animals. The nucleotides that code for the DNA-binding domain are nearly saturated with base substitutions. Even after determining the sequences of ~10,000 mutations recovered from the animals, however, new sites and new changes are still being recovered. In addition, we compare the nature of deletion mutations between bacteria and animals. Based on the nature of deletions in the animals, we conclude that each deletion occurs in a single copy of the gene.
| MUTATIONAL spectra |
|---|
Occasional changes in the sequence of nucleotides are referred to as mutations. Such mutations, when in moderation, are the driving force behind evolution. Mutations, however, may also result in cancer and inherited disease. Endogenous processes in the cell related to cellular metabolism and DNA replication can bring about mutations (![]()
![]()
![]()
![]()
![]()
A:T transitions (![]()
A:T transitions at G nucleotides preceded 5' by a purine (the "PuG effect"; ![]()
![]()
A:T transitions produced by ethyl methanesulfonate do not depend on the PuG context (![]()
T:A transversions in mammalian cells, mainly at runs of guanines flanked by adenine residues (![]()
![]()
A:T transitions at methylated cytosines in 5'-CpG-3' dinucleotide sequences. This specificity of mutation is what makes it possible to infer an induced response from the determination of the sequence alterations in mutants recovered after potential exposure. It is thus the changes in mutational spectra that can reveal mutagenic exposures.
The facile analysis of mutation requires a nonessential target gene in which mutations can be detected efficiently. Several such systems are currently available, including the mammalian hypoxanthine-guanine phosphoribosyl transferase (hprt) and adenine phosphoribosyl transferase (aprt), and the bacterial gpt and supF genes, which code for a bacterial homolog of hprt and a suppressor tRNA molecule, respectively, and the bacterial lacI and lacZ genes.
| The lacI gene |
|---|
The lacI gene of the lac operon has been used extensively for the recovery and analysis of mutations in bacteria with various DNA repair backgrounds and after exposure to a wide variety of mutagens. The LacI protein product represses the transcription of the adjacent lacZ gene by binding to its operator sequence (![]()
![]()
![]()
![]()
![]()
-lacZ fragment, is produced. This fragment may complement a carboxy-terminal or omega fragment that is provided by an appropriate host cell. This full complement has ß-galactosidase activity, which can be used in a screening and in a selective assay.
The first studies using the lacI gene in bacteria for mutational analysis were pioneered by the group of MILLER (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Recently, the lacI gene was used as the mutation target in transgenic mice (![]()
![]()
![]()
![]()
![]()
![]()
![]()
| Origin of the mutants |
|---|
The DNA sequences of mutants recovered from bacteria have been obtained from published literature (~14,000 mutants) and entered into a computer database (![]()
![]()
| The nucleotide targets in the lacI gene |
|---|
The coding sequence of the lacI gene contains 1083 nucleotide pairs (including the termination codon) and codes for a 359-amino acid polypeptide. The numbering of the base positions used is according to FARABAUGH (1978). The first transcribed nucleotide is at position 1, the first translated codon is at position 2931, and the termination codon is at position 11091111. The sense strand of the gene is comprised of a total of 311 guanines and 299 cytosines, for a total of 610 G:C base pairs, and 240 adenines and 233 thymines for a total of 473 A:T base pairs (Table 1).
|
The G:C content of the gene is 56% compared to an average of 44% in the mouse genome (![]()
![]()
| Mutational saturation |
|---|
Knowing the complete set of sites in a gene where mutations can be recovered greatly facilitates the interpretation of mutational spectra. Figure 1 shows the number of base substitution sites that have been recovered and the total number of different changes at these sites as a function of the number of mutants from Big Blue animals that we have sequenced in our laboratory. Surprisingly, even after sequencing 10,000 mutants, the gene has obviously not yet been saturated with base substitution mutations. The data include mutants recovered from various tissue types from control animals and after treatment with a wide variety of mutagens. More sites will be recovered as additional mutagenic treatments with different sequence specificity are included. The number of sites and base substitution changes found in bacteria (also based on ~10,000 substitution mutations) is relatively small, compared to those found in the transgenic animals. It should be kept in mind, however, that in many published studies using bacteria, only mutations in the DNA-binding (NC+) region were analyzed. Because this involves only approximately the first 200 base pairs, this would result in fewer sites recovered overall. Even when only the first 200 base pairs are considered in both mice and bacteria, however, the Big Blue database still has more recovered sites and changes that are unique to the transgenic system than the bacterial database, even though considerably more bacterial mutants are available for this part of the gene. At a number of sites, however, mutations were recovered in bacteria but not in the animal (Figure 2). The lacI gene is the only gene in which such a degree of mutational saturation has been achieved.
|
|
Figure 2 presents all nucleotide positions at which substitution mutations have been recovered. It also shows which new nucleotides were recovered at those positions, and the amino acids that are coded for by the new sequences. The data from bacterial and animal studies are summarized in Table 2AC. A total of 86 As, 151 Cs, 157 Gs, and 113 Ts have been found mutated in the lacI gene recovered from both transgenic animals and bacteria (Table 2C). The largest class in this tabulation is the C
T transition, undoubtedly because of its contribution to spontaneous or background mutation in the animal. In the NC+ region (from base pairs 29205), we found 75.7% (134 out of 177) of the sites mutated in the bacterial plus Big Blue data combined. A total of 305 individual changes were found at these 134 sites. At 62 sites in the NC+ region, all three possible base changes have been recovered (green colored nucleotides in Figure 2). We note that the classic definition of the NC+ region ends where a large clustering of mutation ends. The recovery approximates the theoretical limit, when most changes at third positions in codons are considered silent. This region can therefore be considered mutationally saturated, at least when considering nucleotide positions.
|
![]()
![]()
![]()
The carboxy-terminal end of the gene (after position 1015) is particularly devoid of mutation (Figure 2). This is true in the data collection recovered from bacterial as well as from Big Blue and, therefore, less likely to be an artifact of sequencing strategy. Coding of this part of the gene includes the terminal
-helix that is involved in the tetrahelix bundle formation. This helix bundle is important for the formation of a tetramer structure. Failure of the formation of a tetramer because of mutation does not abolish repression ability (![]()
| Differences in mutation spectrum recovered in different parts of the gene |
|---|
We have analyzed the mutational spectrum, as recovered from Big Blue animals, as a function of position in the lacI gene. Figure 3 shows the main components of the spontaneous spectrum (G:C
A:T transitions, G:C to A:T at CpG sites, G:C
T:A transversions, and deletions combined from various sources) in the first 400 and the next 600 base pairs. This analysis involves ~2000 spontaneous mutants recovered from various tissues. Interestingly, the contribution of G:C
A:T transitions, especially their distribution over 5'-CpG-3' sites, is particularly concentrated in the first 400 base pairs, where 6585% of them are located at 5'-CpG-3' sites. In the remaining part of the gene, only 20% of the recovered G:C
A:T transitions are found at 5'-CpG-3' sites. In contrast, deletion events are threefold more common after the first 400 base pairs. It can be concluded that the characteristic large percentage of G:C
A:T transitions that occurs at 5'-CpG-3' sites in the lacI gene in animals reflects the prevalence of these mutations in the DNA-binding region. The two parts of the lacI gene can therefore be considered as two different sequences and we show here that these exhibit quite different mutational characteristics. Even identical sequences, which are present in different parts of the genome, can be subject to different mutational events. This was shown in the hrpt gene by ![]()
|
The density of 5'-CpG-3' sites (total nucleotides = 2 x CpGs) as a function of 200 base pair blocks along the lacI gene is shown in Figure 4. In addition, the number of those sites (as nucleotides) that have actually been found mutated and as a fraction are shown. The largest fraction of affected sites is found in the first part of the gene. The largest contribution of CpG sites in mutations at G:C base pairs is in the first 400 base pairs of the gene. The density of 5'-CpG-3' dinucleotide sequences is similar for the NC+ region and the rest of the gene (0.096 CpGs per base pair in the NC+ region vs. 0.086 CpGs per base pair in the remainder of the gene.) The fraction of G:C base pairs that can be found at a CpG sequence is 0.351 for the NC+ region and 0.256 for the remainder of the gene.
|
| Differences in methylation patterns between mice and bacteria |
|---|
Spontaneous deamination of methylated cytosines results in G:C
A:T transitions (![]()
A:T transitions at 5'-CpG-3' dinucleotide sequences, in reflection of this methylation. As a consequence, ~75% of all G:C
A:T transitions in the lacI gene in Big Blue are found at these sites. The gene in mice is reported to be fully methylated in all copies present in the genome (![]()
![]()
![]()
![]()
A:T transitions, was now shifted to the 5' end of the gene, where mutation at the remaining CpG sites was enhanced (T. SKOPEK, personal communication).
Of the 610 G:C base pairs in the lacI gene, a total of 190 are part of the 95 5'-CpG-3' sites. This amounts to 31% of all G:C base pairs. Randomly distributed, a maximum of 31% of mutations at G:C base pairs would therefore be expected to be found at CpG sites, when all changes would be recoverable. In reality, this percentage is lower because not all of those changes will result in detectable phenotypic changes. Mutations in Big Blue have been recovered at 258 G:C base pairs. G:C
A:T transitions at CpG sites have only been found at 47 nucleotides, while G:C
T:A transversions at CpG sites have been found at 51 nucleotides. This results in 18.2 and 19.8%, respectively, that can be expected to be found at 5'-CpG-3' dinucleotide sequences. About 75% of all G:C
A:T transitions are typically found at CpG sites. In addition, we notice that ~45% of all G:C
C:G and G:C
A:T transversions in all spontaneous mutations are found at CpG sites. Both of these percentages are significantly higher than expected. The transitions are typically attributed to deamination events at methylated cytosines, when they are part of CpG sequences.
An additional explanation has been offered, however; 5'-CpG-3' dinucleotide sequences, or CpG steps, are unusually malleable (![]()
![]()
![]()
![]()
![]()
T transversions. If this radical reaction or the mispairing is favored by CpG methylation or by CpG structural features, we would expect to see an increase in transversions at CpG sites.
| Two systems to detect mutations |
|---|
Mutations in the lacI gene can be detected by two principal methods, both of which use the presence or absence of ß-galactosidase activity. The first method relies on the color of a plaque or a colony for the screening for mutants. A mutation in the lacI gene results in the expression of ß-galactosidase. Besides galactose, which is its normal substrate, this enzyme can also cleave X-gal. Cleavage of X-gal, a chromogenic compound, results in the production of an insoluble dye that colors the colony or plaque blue. A lacI gene with an altered promoter, lacIq (![]()
T base substitution at position -35 in the "-35" promoter box. This mutation results in a higher level of expression of LacI protein and, therefore, in better repression of background lacZ expression. Reversion mutants at this site have been found among mutants recovered from the transgenic animals (Figure 2).
A second system for the detection of mutations in the lacI gene uses a selective assay. In addition to galactose and X-gal, ß-galactosidase can also cleave P-gal. This results in the production of galactose, which can provide a sole carbon source, thus permitting the growth of bacterial colonies. A wild-type lacI gene will, therefore, not support colony formation. This system has been used by most researchers in bacterial studies.
A great many publications resulted from the use of the lacI gene in the study of mutational mechanisms and mutagenic specificity. Mutations in the recovered mutants can be detected in either the entire lacI gene or only in the portion coding for the DNA-binding domain, up to nucleotide position 205, which is highly sensitive to base substitutions. Mutants in this region can be selected for by a NC+ assay (![]()
![]()
Some researchers used the lacI s gene instead, which carries a single C
T transition at position 617, resulting in a replacement of Arg 197 by Cys (![]()
![]()
The nature of the lacI gene (such as q or s mutations) and the detection system are determining factors in the nature of the mutations that are recovered, as has been already pointed out by ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
|
We would expect differences in the spectra of mutants collected with X-gal and P-gal, where this would result in a LacI protein activity level that may discriminate between the two requirements. When the large collections of mutants recovered from bacteria with P-gal and from Big Blue animals with X-gal screening are compared, some differences are apparent. Several mutations were recovered in bacteria with P-gal selection that have not been seen with X-gal screening in the large Big Blue database, notably around nucleotide positions 110 and 147. Many sites where mutations were found in Big Blue were not recovered in bacteria; however, this may partially result from the lower number of substitutions recovered in bacteria.
| Plaque color and threshold of detection |
|---|
Bacteriophage particles with mutated lacI genes are detected in the Big Blue assay on the basis of a blue plaque phenotype. The intensity of the blue color, however, depends on the residual functionality of the LacI protein. Plaques that are a very light blue color may therefore be missed during the screening. The standardized assay uses four color standards (the CM series) that consists of four mutants with increasing color intensity (![]()
![]()
|
Interestingly, frameshift and stop codon mutations can also result in the very light blue phenotype. Table 4 shows selected frameshift and stop codon mutations, as well as their positions, that were recovered as very light blue (CM0 or CM1) or as dark (CM3). CM0 itself is a translation termination at base pair position 530. In general, however, a termination results in a dark color, even near the end of the gene. Frameshifts generally result in dark-colored plaques. Interestingly, frameshifts at 503 and 507 and the introduction of a stop codon at 530 result in a faint blue plaque color. The frameshift at 503 and 507 adds the same 27 and 26 nonnative amino acids, respectively, before terminating at the same position. This deletion would remove approximately half of the protein but would possibly leave dimerization potential intact. Visualization of the protein structure using RasMol molecular modeling software (![]()
|
When the detection threshold for the blue color increases, the light-colored mutants would be missed, potentially skewing the mutational spectrum. This may happen when the plating media do not conform to standardized protocols (![]()
![]()
![]()
| A comparison of deletion mutations |
|---|
Deletion of DNA sequence is frequently mediated by repeated sequences (![]()
![]()
![]()
![]()
|
| Summary |
|---|
No other gene has been used so frequently in mutation studies as the lacI gene. The number of mutations and its mutational saturation is unparalleled. With the advent of transgenic technology, lacI has entered a new era of mutational study in which mutation can be studied in various tissues and in multiple species of different ages, both males and females. Thus, the lacI gene will continue to be an excellent target for the study of mutational specificity and mechanism.
| ACKNOWLEDGMENTS |
|---|
The authors thank DAVE WALSH, KEN SOJONKI, JAMES HOLCROFT, PAM WARRINGTON, and NAHEED HAGUE for excellent technical assistance. This work was supported by contract NO1-ES-35365 from the National Institute for Environmental Health Sciences, USA, and the National Cancer Institute of Canada.
| LITERATURE CITED |
|---|
ADAMS, W. T. and T. R. SKOPEK, 1987 Statistical test for the comparison of samples from mutational spectra. J. Mol. Biol. 194:391-396[Medline].
BOTTEMA, C. D., M. J. BOTTEMA, R. P. KETTERLING, H. S. YOON, and H. L. JANCO et al., 1991 Why does the human factor IX gene have a G+C content of 40%? Am. J. Hum. Genet. 49:839-850[Medline].
BRENOWITZ, M., N. MANDAL, A. PICKAR, E. JAMISON, and S. ADHYA, 1991 DNA-binding properties of a Lac repressor mutant incapable of forming tetramers. J. Biol. Chem. 266:1281-1288
BRIDGES, B. A., 1996 Mutation in resting cells: the role of endogenous DNA damage. Cancer Surv. 28:155-167[Medline].
BURNS, P. A., F. L. ALLEN, and B. W. GLICKMAN, 1986 DNA sequence analysis of mutagenicity and site specificity of ethyl methanesulfonate in UvrB+ and UvrB- strains of Escherichia coli.. Genetics 113:811-819
BURNS, P. A., A. J. E. GORDON, and B. W. GLICKMAN, 1987 Influence of neighbouring base sequence on N-methyl-N'-nitro-N-nitrosoguanidine mutagenesis in the lacI gene of Escherichia coli.. J. Mol. Biol. 194:385-390[Medline].
BURNS, P. A., A. J. E. GORDON, K. KUNSMANN, and B. W. GLICKMAN, 1988 Influence of neighboring base sequence on the distribution and repair of N-ethyl-N-nitrosourea-induced lesions in Escherichia coli.. Cancer Res. 48:4455-4458
CALOS, M. P. and J. H. MILLER, 1981 Genetic and sequence analysis of frameshift mutations induced by ICR-191. J. Mol. Biol. 153:39-66[Medline].
CHRISTENSEN, R. B., J. R. CHRISTENSEN, and C. W. LAWRENCE, 1985 Conjugation-dependent enhancement of induced and spontaneous mutation in the lacI gene of E. coli.. Mol. Gen. Genet. 201:35-37[Medline].
COULONDRE, C., J. H. MILLER, P. J. FARABAUGH, and W. GILBERT, 1978 Molecular basis of base substitution hotspots in Escherichia coli.. Nature 274:775-780[Medline].
DAVIDSON, M., 1960 The Biochemistry of Nucleic Acids, Ed. 8. Chapman & Hall, London.
DE BOER, J. G., 1995 Software package for the management of sequencing projects using lacI transgenic animals. Environ. Mol. Mutagen. 25:256-262[Medline].
DE BOER, J. G. and B. W. GLICKMAN, 1991 Mutational analysis of the structure and function of the adenine phosphoribosyltransferase enzyme of Chinese hamster. J. Mol. Biol. 221:163-174[Medline].
DE BOER, J. G., H. ERFLE, D. WALSH, J. HOLCROFT, and J. S. PROVOST et al., 1997 The spectrum of spontaneous mutants in liver tissue of lacI transgenic mice. Environ. Mol. Mutagen. 30:273-286[Medline].
DE BOER, J. G., S. PROVOST, N. GORELICK, K. TINDALL, and B. W. GLICKMAN, 1998 Spontaneous mutation in lacI transgenic mice: a comparison of tissues. Mutagenesis in press.
DRAKE, J. W., 1991 Spontaneous mutation. Annu. Rev. Genet. 25:125-146[Medline].
DRAKE, J. W., 1996 The antievolutionary component of antimutagenesis and anticarcinogenesis: where do mutation rates come from and where are they going? Mutation Res. 350:5-8.
DYCAICO, M. J., G. S. PROVOST, P. L. KRETZ, S. L. RANSOM, and J. C. MOORES et al., 1994 The use of shuttle vectors for mutation analysis in transgenic mice and rats. Mutation Res. 307:461-478.
EHRLICH, M., X.-Y. ZHANG, and N. M. INAMDAR, 1990 Spontaneous deamination of cytosine and 5-methylcytosine in DNA and replacements of 5-methylcytosine residues with cytosine residues. Mutat. Res. 238:277-286[Medline].
EL ANTRI, S., O. MAUFFRET, and M. MONNOT, 1993 Structural deviations of CpG provide a plausible explanation for the high frequency of mutations at this site. Phosphorus nuclear magnetic resonance and circular dichroism studies. J. Mol. Biol. 230:373-378[Medline].
FARABAUGH, P. J., U. SCHMEISSNER, M. HOFER, and J. H. MILLER, 1978 Genetic studies of the lac repressor. VII. On the molecular nature of spontaneous hotspots in the lacI gene of Escherichia coli. J. Mol. Biol. 126:847-857[Medline].
FOSTER, P. L., E. EISENSTADT, and J. H. MILLER, 1983 Base substitution mutations induced by metabolically activated aflatoxin B1. Proc. Natl. Acad. Sci. USA 80:2695-2698
GILBERT, W., and B. MULLER-HILL, 1970 The lactose repressor, in The Lactose Operon, edited by J. R. BECKWITH and D. ZIPSER. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
GLICKMAN, B. W., 1990 Study of mutational specificity in the lacI gene of Escherichia coli as a window on the mechanisms of mutation. Environ. Mol. Mutagen 16:48-54[Medline].
GLICKMAN, B. W., M. J. HORSFALL, A. J. GORDON, and P. A. BURNS, 1987 Nearest neighbor affects G:C to A:T transitions induced by alkylating agents. Environ. Health Perspect. 76:29-32[Medline].
GORDON, A. J. and J. A. HALLIDAY, 1995 Transgenic system for in vivo mutational analysis. Letter to the editor. Mutation Res. 306:103-105.
GORDON, A. J., J. A. HALLIDAY, M. J. HORSFALL, and B. W. GLICKMAN, 1991 Spontaneous and 9-aminoacridine-induced frameshift mutagenesis: second-site frameshift mutation within the N-terminal region of the lacI gene of Escherichia coli.. Mol. Gen. Genet. 227:160-164[Medline].
GU, M., A. AHMED, C. WEI, N. GORELICK, and B. W. GLICKMAN, 1994 Development of a lambda-based complementation assay for the preliminary localization of lacI mutants from the Big Blue mouse: implications for a DNA-sequencing strategy. Mutat. Res. 307:533-540

) and number of individual substitutions (
) as a function of the number of mutants sequenced in our laboratory.






