Genetics, Vol. 167, 1813-1820, August 2004, Copyright © 2004
doi:10.1534/genetics.104.029082

A Genomic Basis for the Evolution of Vertebrate Transcription Factors Containing Amino Acid Runs

INSERM E0021 Génomique et Développement, IFR Alfred Jost, Hôpital Cochin, 75014 Paris, France

1 Corresponding author: INSERM E0021 Génomique et Développement, IFR Alfred Jost, Hôpital Cochin, Pavillon Baudelocque, 123 Blvd. de Port Royal, 75014 Paris, France.
E-mail: veitia{at}cochin.inserm.fr

We have previously shown that polyAla (A) tract-containing proteins frequently present runs of glycine (G), proline (P), and histidine (H) and that, in their ORFs, GC content at all codon positions is higher than that in the rest of the genome. In this study, we present new analyses of these human proteins/ORFs. We detected striking differences in codon usage for A, G, and P in and out of runs. After dividing the ORFs, we found that 5' halves were richer in runs than 3' halves. Afterward, when removing the runs, we observed that the run-rich halves (grouped irrespectively of their 5' or 3' position) had a marked statistical tendency to have more homo- and hetero-dicodons for A, G, P, and H than the run-poor halves. This suggests that, in addition to the necessary GC-rich genomic background, a specific codon organization is probably required to generate these coding repeats. Homo-dicodons may indeed provide primers for run formation through polymerase slippage. The compositional analysis of human HOX genes, the most polyAla-rich family, and their comparison with their zebrafish homologs, support these hypotheses and suggest possible effects of genomic environment on ORF evolution and organismal diversification.




This article has been cited by other articles:


Home page
Hum Mol GenetHome page
L. Moumne, A. Dipietromaria, F. Batista, A. Kocer, M. Fellous, E. Pailhoux, and R. A. Veitia
Differential aggregation and functional impairment induced by polyalanine expansions in FOXL2, a transcription factor involved in cranio-facial and ovarian development
Hum. Mol. Genet., April 1, 2008; 17(7): 1010 - 1019.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. Legendre, N. Pochet, T. Pak, and K. J. Verstrepen
Sequence-based estimation of minisatellite and microsatellite repeat variability
Genome Res., December 1, 2007; 17(12): 1787 - 1796.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
N. G. Faux, G. A. Huttley, K. Mahmood, G. I. Webb, M. Garcia de la Banda, and J. C. Whisstock
RCPdb: An evolutionary classification and codon usage database for repeat-containing proteins
Genome Res., July 1, 2007; 17(7): 1118 - 1127.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. A. Huntley and G. B. Golding
Selection and Slippage Creating Serine Homopolymers
Mol. Biol. Evol., November 1, 2006; 23(11): 2017 - 2025.
[Abstract] [Full Text] [PDF]


Home page
J. Med. Genet.Home page
S Caburet, A Demarez, L Moumne, M Fellous, E De Baere, and R A Veitia
A recurrent polyalanine expansion in the transcription factor FOXL2 induces extensive nuclear and cytoplasmic protein aggregation
J. Med. Genet., December 1, 2004; 41(12): 932 - 936.
[Abstract] [Full Text] [PDF]