Genetics, Vol. 165, 695-705, October 2003, Copyright © 2003

Likelihood Models of Somatic Mutation and Codon Substitution in Cancer Genes

Ziheng Yangb, Simon Roa, and Bruce Rannalaa
a Department of Medical Genetics, University of Alberta, Edmonton, Alberta T6G 2H7, Canada
b Department of Biology, Galton Laboratory, University College London, London WC1E 6BT, England

Corresponding author: Bruce Rannala, 8-39 Medical Sciences Bldg., University of Alberta, Edmonton, AB T6G 2H7, Canada., brannala{at}ualberta.ca (E-mail)

Communicating editor: S. TAVARÉ

The role of somatic mutation in cancer is well established and several genes have been identified that are frequent targets. This has enabled large-scale screening studies of the spectrum of somatic mutations in cancers of particular organs. Cancer gene mutation databases compile the results of many studies and can provide insight into the importance of specific amino acid sequences and functional domains in cancer, as well as elucidate aspects of the mutation process. Past studies of the spectrum of cancer mutations (in particular genes) have examined overall frequencies of mutation (at specific nucleotides) and of missense, nonsense, and silent substitution (at specific codons) both in the sequence as a whole and in a specific functional domain. Existing methods ignore features of the genetic code that allow some codons to mutate to missense, or stop, codons more readily than others (i.e., by one nucleotide change, vs. two or three). A new codon-based method to estimate the relative rate of substitution (fixation of a somatic mutation in a cancer cell lineage) of nonsense vs. missense mutations in different functional domains and in different tumor tissues is presented. Models that account for several potential influences on rates of somatic mutation and substitution in cancer progenitor cells and allow biases of mutation rates for particular dinucleotide sequences (CGs and dipyrimidines), transition vs. transversion bias, and variable rates of silent substitution across functional domains (useful in detecting investigator sampling bias) are considered. Likelihood-ratio tests are used to choose among models, using cancer gene mutation data. The method is applied to analyze published data on the spectrum of p53 mutations in cancers. A novel finding is that the ratio of the probability of nonsense to missense substitution is much lower in the DNA-binding and transactivation domains (ratios near 1) than in structural domains such as the linker, tetramerization (oligomerization), and proline-rich domains (ratios exceeding 100 in some tissues), implying that the specific amino acid sequence may be less critical in structural domains (e.g., amino acid changes less often lead to cancer). The transition vs. transversion bias and effect of CpG dinucleotides on mutation rates in p53 varied greatly across cancers of different organs, likely reflecting effects of different endogenous and exogenous factors influencing mutation in specific organs.





This article has been cited by other articles:


Home page
GeneticsHome page
S. Ro and B. Rannala
Inferring Somatic Mutation Rates Using the Stop-Enhanced Green Fluorescent Protein Mouse
Genetics, September 1, 2007; 177(1): 9 - 16.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
C. Greenman, R. Wooster, P. A. Futreal, M. R. Stratton, and D. F. Easton
Statistical Analysis of Pathogenicity of Somatic Mutations in Cancer
Genetics, August 1, 2006; 173(4): 2187 - 2198.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
E. Mathe, M. Olivier, S. Kato, C. Ishioka, P. Hainaut, and S. V. Tavtigian
Computational approaches for predicting the biological effect of p53 missense mutations: a comparison of three sequence analysis based methods
Nucleic Acids Res., March 6, 2006; 34(5): 1317 - 1325.
[Abstract] [Full Text] [PDF]