Circular Code Motifs in the Ribosome: A Missing Link in the Evolution of Translation?

The origin of the genetic code remains enigmatic five decades after it was elucidated, although there is growing evidence that the code coevolved progressively with the ribosome. A number of primordial codes were proposed as ancestors of the modern genetic code, including comma-free codes such as the RRYRNY, or GNC codes (R = G or A, Y = C or T, N = any nucleotide), and the X circular code, an error-correcting code that also allows identification and maintenance of the reading frame. It was demonstrated previously that motifs of the X circular code are significantly enriched in the protein-coding genes of most organisms, from bacteria to eukaryotes. Here, we show that imprints of this code also exist in the ribosomal RNA (rRNA). In a large-scale study involving 133 organisms representative of the three domains of life, we identified 32 universal X motifs that are conserved in the rRNA of >90% of the organisms. Intriguingly, most of the universal X motifs are located in rRNA regions involved in important ribosome functions, notably in the peptidyl transferase center and the decoding center that form the original “proto-ribosome.” Building on the existing accretion models for ribosome evolution, we propose that error-correcting circular codes represented an important step in the emergence of the modern genetic code. Thus, circular codes would have allowed the simultaneous coding of amino acids and synchronization of the reading frame in primitive translation systems, prior to the emergence of more sophisticated start codon recognition and translation initiation mechanisms.

The primordial tRNA acceptor stem code from theoretical minimal RNA ring clusters

The aim is to understand what makes RNA rings such useful, and perhaps efficient, simulators of prebiotic evolution, independently of the possibility that RNA rings are the actual primordial sequences.

theoretical RNA rings evolved along physicochemical constraints affecting nucleotide substitutions, apparently devoid of effects on their coding properties on amino acid sequences, in line with a pre-translational origin of diversification of RNA rings that would at a later stage become the population of primordial coding and decoding RNAs.

Comparisons Between Small Ribosomal RNA and Theoretical Minimal RNA Ring Secondary Structures Confirm Phylogenetic and Structural Accretion Histories

It is probable that the structural methods are more prone to errors due to evolutionary convergences than the phylogenetic method, though convergences remain the main difficulty in reconstructing evolution.

Origin and Evolution of the Universal Genetic Code

We propose an experimentally testable scenario for the evolution of the code that combines recognition of amino acids by unique sites on proto-tRNAs (distinct from the anticodons), expansion of the code via proto-tRNA duplication, and frozen accident.

The problems of the nature, origin, and evolution of the genetic code appear to be unique in combining extreme outward simplicity with excruciating difficulty.

In our view, theoretical study of the genetic code as a cryptographic problem has largely run its course. The best hope for further progress in understanding the origin and evolution of the code seems to lie with the technically challenging but conceptually clear experimentation aiming at recapitulation of the inferred steps in the translation system evolution.

+++++++

+++++++

+++++++

+++++++

+++++++

+++++++

+++++++

Evolution of Life on Earth: tRNA, Aminoacyl-tRNA Synthetases and the Genetic Code

Life on Earth and the genetic code evolved around tRNA and the tRNA anticodon. We posit that the genetic code initially evolved to synthesize polyglycine as a cross-linking agent to stabilize protocells. We posit that the initial amino acids to enter the code occupied larger sectors of the code that were then invaded by incoming amino acids. Displacements of amino acids follow selection rules. The code sectored from a glycine code to a four amino acid code to an eight amino acid code to an ~16 amino acid code to the standard 20 amino acid code with stops. The proposed patterns of code sectoring are now most apparent from patterns of aminoacyl-tRNA synthetase evolution. The Elongation Factor-Tu GTPase anticodon-codon latch that checks the accuracy of translation appears to have evolved at about the eight amino acid to ~16 amino acid stage. Before evolution of the EF-Tu latch, we posit that both the 1st and 3rd anticodon positions were wobble positions. The genetic code evolved via tRNA charging errors and via enzymatic modifications of amino acids joined to tRNAs, followed by tRNA and aminoacyl-tRNA synthetase differentiation. Fidelity mechanisms froze the code by inhibiting further innovation.