Sobreposição de códigos dentro de sequências codificadoras de proteínas

segunda-feira, novembro 01, 2010

Overlapping codes within protein-coding sequences

Shalev Itzkovitz1,3, Eran Hodis1,3 and Eran Segal1,2,4

-Author Affiliations
1 Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot 76100, Israel;
2 Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot 76100, Israel

↵3 These authors contributed equally to this work.

Abstract

Genomes encode multiple signals, raising the question of how these different codes are organized along the linear genome sequence. Within protein-coding regions, the redundancy of the genetic code can, in principle, allow for the overlapping encoding of signals in addition to the amino acid sequence, but it is not known to what extent genomes exploit this potential and, if so, for what purpose. Here, we systematically explore whether protein-coding regions accommodate overlapping codes, by comparing the number of occurrences of each possible short sequence within the protein-coding regions of over 700 species from viruses to plants, to the same number in randomizations that preserve amino acid sequence and codon bias. We find that coding regions across all phyla encode additional information, with bacteria carrying more information than eukaryotes. The detailed signals consist of both known and potentially novel codes, including position-dependent secondary RNA structure, bacteria-specific depletion of transcription and translation initiation signals, and eukaryote-specific enrichment of microRNA target sites. Our results suggest that genomes may have evolved to encode extensive overlapping information within protein-coding regions.

Footnotes

↵4 Corresponding author.

E-mail eran.segal@weizmann.ac.il.

[Supplemental material is available online at http://www.genome.org.]

Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.105072.110.

Received March 22, 2010.
Accepted September 10, 2010.
Copyright © 2010 by Cold Spring Harbor Laboratory Press

+++++

Professores, pesquisadores e alunos de universidades públicas e privadas com acesso ao site CAPES/Periódicos podem ler gratuitamente este artigo da Genome Research e de mais 22.440 publicações científicas.