On the Diversification of the Translation Apparatus across Eukaryotes

Diversity is one of the most remarkable features of living organisms. Current assessments of eukaryote biodiversity reaches 1.5 million species, but the true figure could be several times that number. Diversity is ingrained in all stages and echelons of life, namely, the occupancy of ecological niches, behavioral patterns, body plans and organismal complexity, as well as metabolic needs and genetics. In this review, we will discuss that diversity also exists in a key biochemical process, translation, across eukaryotes. Translation is a fundamental process for all forms of life, and the basic components and mechanisms of translation in eukaryotes have been largely established upon the study of traditional, so-called model organisms. By using modern genome-wide, high-throughput technologies, recent studies of many nonmodel eukaryotes have unveiled a surprising diversity in the configuration of the translation apparatus across eukaryotes, showing that this apparatus is far from being evolutionarily static. For some of the components of this machinery, functional differences between different species have also been found. The recent research reviewed in this article highlights the molecular and functional diversification the translational machinery has undergone during eukaryotic evolution. A better understanding of all aspects of organismal diversity is key to a more profound knowledge of life.


Protein Synthesis Is a Fundamental Process of Life
Proteins are one of the elementary components of life and account for a large fraction of mass in the biosphere. They catalyze most reactions that sustain life and play structural, transport, and regulatory roles in all living organisms. Hence, "translation," that is, the synthesis of proteins by the ribosome using messenger (m)RNA as the template, is a fundamental process for all forms of life, and a large proportion of an organism's energy is committed to translation [1,2]. Accordingly, regulating protein synthesis is crucial for all organisms. Indeed, many mechanisms to control gene expression at the translational level have evolved in eukaryotes [3]. These mechanisms have endowed eukaryotes with the potential to rapidly and reversibly respond to stress or sudden environmental changes [1,2,4]. Translational control also plays a crucial role in tissues and developmental processes where transcription is quiescent, or where asymmetric spatial localization of proteins is required, such as early embryogenesis, learning and memory, neurogenesis, and gametogenesis [5][6][7][8][9][10]. Moreover, recent global gene expression measurements have shown that the cellular abundance of proteins in mammalian cells is predominantly controlled at the level of translation [11,12]. Eukaryotic translation is a sophisticated, tightly regulated, multistep process, the basic steps of which are conserved in all eukaryotes. It is performed by the ribosome together with multiple auxiliary "translation" factors (proteins) and is divided into four steps: initiation, elongation, termination, and recycling. These basic processes of translation were established experimentally in eukaryotes some decades ago, and many regulatory mechanisms have been subsequently elucidated [13,14]. However, it was only recently that, with the use of powerful genome-wide sequencing, proteomics and bioinformatics-based technologies, a surprising diversity in components of the translation apparatus across eukaryotes was unveiled. In some cases, even functional differences between same molecules from different species have also been identified. Additionally, there is evidence that even the genetic code itself has continued to evolve in some phyla. These findings indicate that after eukaryotes emerged, the translational apparatus further evolved during eukaryotic diversification. In this article, we will review recent research revealing the diversification that the genetic code and many components of the translational machinery have undergone across eukaryotes.

Overview of the Translation Process in Eukaryotes
2.1. Initiation. The aim of the initiation step is both to ensure the recruitment of the mRNA to the ribosome and the positioning the ribosome in the proper frame at the start codon, which is achieved in a set of steps mediated by eukaryotic initiation factors (eIF). For most eukaryotic mRNAs, this happens by the so-called cap-dependent mechanism ( Figure 1) [15][16][17][18]. It begins with the dissociation of the ribosome into its 60S and 40S subunits by eIF6. Free 40S subunit, which is stabilized by eIF3, eIF1, and eIF1A, binds to a ternary complex (consisting of eIF2 bound to an initiator Met-tRNA Met i and GTP) to form a 43S preinitiation complex. On the other hand, the cap structure (m 7 GpppN, where N is any nucleotide) of the mRNA is recognized by eIF4E in complex with the scaffold protein eIF4G. Then, the 43S preinitiation complex is recruited to the 5 end of the mRNA, a process that is coordinated by eIF4E through its interactions with eIF4G and the 40S ribosomal subunitassociated eIF3. The ribosomal complex then scans in a 5 −→ 3 direction along the 5 -untranslated region (UTR) through interactions with the eIF4G-bound RNA helicase eIF4A and eIF4B to reach the start codon, usually an AUG. During scanning, eIF4B stimulates the activity of eIF4A which unwinds secondary RNA structures in the mRNA. eIF1, eIF1A, and eIF5 assist in the positioning of the 40S ribosomal subunit at the correct start codon so that eIF2 can deliver the anti-codon of the initiator Met-tRNA Met i as the cognate partner for the start codon, directly to the peptidyl (P)-site of the 40S ribosomal subunit. Once the ribosomal subunit is placed on the start codon, a 48S pre-initiation complex is formed. Then, eIF5 promotes GTP hydrolysis by eIF2 to release the eIF proteins. Finally, the GTPase eIF5B is required for the joining of the 60S ribosomal subunit to the 40S subunit to form an 80S initiation complex. The poly Abinding protein (PABP) is able to interact with the 3 -poly(A) tail and eIF4G promoting circularization of the mRNA and increasing the efficiency of subsequent rounds of initiation ( Figure 1) [15][16][17][18][19][20].
In the case of some viral and cellular mRNAs, 5 -UTR recognition by the 40S ribosomal subunit happens without involvement of eIF4E and is, instead, driven by RNA structures located in cis within the mRNA itself. Such structures are operationally defined as internal ribosome entry site (IRES) and are located in the proximity of the start codon ( [21][22][23]; Martinez-Salas et al. this issue).

Elongation.
After initiation, the 80S ribosome is assembled at the start codon of the mRNA containing a Met-tRNA Met i in the P-site. Then, elongation takes place ( Figure 1); this is the process of decoding codons and formation of peptide bonds sequentially to add amino acid residues to the carboxy-terminal end of the nascent peptide [16,[24][25][26]. This process is assisted by elongation factors (eEF) and involves four major steps. (1) Formation of the ternary complex eEF1A·GTP·aminoacyl-tRNA and delivery of the first elongator aminoacyl-tRNAs to an empty ribosomal tRNA-binding site called the A-(acceptor) site. It is in the A-site where codon/anticodon decoding takes place. (2) Interaction of the ribosome with the mRNA-tRNA. This duplex activates eEF1A·GTP hydrolysis and guanine nucleotide exchange on eEF1A. (3) Peptide bond formation then occurs between the P-site peptidyl-tRNA and the incoming aminoacyl moiety of an A-site aminoacyl-tRNA. This reaction is catalyzed by the peptidyl transferase center of the 60S ribosomal subunit, and the products comprise of a new peptidyl-tRNA that is one amino acid residue longer and a deacylated (discharged) tRNA. (4) Binding of eEF2·GTP and GTP hydrolysis promotes the translocation of the mRNA such that the deacylated tRNA moves to the E-(exit) site, the peptidyl-tRNA is in the P-site, and the mRNA moves by three nucleotides to place the next mRNA codon into the Asite. The deacylated tRNA in E-site is then ejected from the ribosome. The whole process is repeated along the mRNA sequence until a stop codon is reached and the process of termination is initiated [16,[24][25][26].

Termination.
Translation termination is mediated by two polypeptide chain-release factors, eRF1 and eRF3 ( Figure 1). When any of the termination codons (UAA, UAG, and UGA) is exposed in the A-site, eRF1 recognizes the codon, binds the A-site, and triggers the release of the nascent polypeptide from the ribosome by hydrolysing the ester bond linking the polypeptide chain to the P-site tRNA. This reaction leaves the P-site tRNA in a deacylated state, leaving it to be catalyzed by the peptidyl transferase center of the ribosome. eRF1 recognizes stop signals and functionally acts as a tRNA-mimic, whereas eRF3 is a ribosome-and eRF1dependent GTPase that, by forming a stable complex with eRF1, stimulates the termination process [16,27,28]. During Initiation, most eukaryotic mRNAs are translated by the capdependent mechanism, which requires recognition by eIF4E (green crescent) complexed with eIF4G (red) and eIF4A (light green)-the so-called eIF4F complex-of the cap structure at the 5 end. A 43S preinitiation complex (consisting in a 40S ribosomal subunit (dark gray) loaded with eIF3 (pink), eIF1 and eIF1A (light grey), initiator Met-tRNA Met i (blue clover), eIF2 (dark green), and GTP binds the eIF4F-mRNA complex and scans along the 5 -UTR of the mRNA to reach the start codon (usually an AUG triplet). During the scanning eIF4A, stimulated by eIF4B (dark blue), unwinds secondary RNA structure in an ATP-dependent manner. The poly A-binding protein (PABP, dark brown) binds both the poly(A) tail and eIF4G promoting mRNA circularization. Elongation is assisted by elongation factors eEF1A and eEF2 (light brown). During this step, aminoacyl-tRNA synthetases (aaRSs, purple) catalyze the binding of amino acids (aa) to cognate tRNAs. Termination is mediated by the release factors eRF1 (gray) and eRF3 (light blue) and happens when a termination codon (STOP) of the mRNA is exposed in the A-site of the ribosome. In this step, the completed polypeptide (red) is released. During Recycling, which is required to allow further rounds of translation, both ribosomal subunits dissociate from the mRNA. eRF1 remains associated with the posttermination complexes after polypeptide release.

Recycling.
In the recycling step, both ribosomal subunits are dissociated, releasing the mRNA and deacetylated tRNA, so that both ribosomal subunits can be used for another round of initiation [16,27,28] (Figure 1). The closed-loop model proposes that, during translation, cross-talk occurs between both ends due to the circular conformation of the mRNA. According to this model, termination and recycling may not release the 40S ribosomal subunit back into the cytoplasm. Instead, this subunit may be passed from the poly(A) tail back to the 5 -end of the mRNA, so that a new round of initiation can be started [16,27].

Divergence in the Genetic Code
The deciphering of the genetic code in the early 1960's established one of the basic foundations of modern biology. Soon after, the essential universality of the genetic code was recognized, that is, the assignment of 20 amino acids to 64 codons and two punctuation marks (start and stop signals) is substantially the same for all extant forms of life on earth [29]. Nevertheless, variations to the "universal" genetic code, wherein the meaning of a "universal" codon is changed to a different one, have recently been uncovered in a wide range of bacteria, organelles, and the nuclear genome of eukaryotes, revealing that the genetic code is still evolving in some lineages [30][31][32][33]. In eukaryotes, deviations from the standard nuclear genetic code have arisen independently multiple times in unicellular organisms of five lineages, namely, ciliates, Diplomonads, fungi (in the genus Candida and some ascomycetes), polymastigid oxymonads, and green algae (in Dasycladales and Cladophorales) [30,31,[33][34][35][36][37][38][39]. Most codon variations in eukaryotes are found to be the reassignment of the stop codons UAG and UAA to glutamine, and the stop codon UGA to tryptophan or cysteine ( Figure 2). All reported code variations in ciliates, Diplomonads, and green algae belong to this kind. In contrast, Candida ambiguously utilizes the codon CUG (universally used for leucine) for both serine and leucine. The underlying mechanisms of codon reassignment are mutations in tRNA genes that affect decoding, RNA editing, or mutations in eRF1 [30,31,[34][35][36][37][38][39].
The observation that the same codon reassignments have occurred independently in closely related species (within the yeasts, green algae, and ciliate taxa) supports the notion that these changes provide a selective advantage in similar ecological niches [30]. Whether there is a restriction for the genetic code to change in multicellular organisms is not known.

Functional Divergence of eIF Proteins.
While the fundamental principles of translation are well conserved across all forms of life, in eukaryotes the initiation step has undergone substantial increase in complexity as compared to prokaryotes [3,22,[40][41][42][43][44]. Most evidence for molecular and functional diversification among the translation components has been found in the eIF4 proteins ( Figure 2). Most eukaryotic phyla possess several paralog genes for members of the eIF4 families, with well-documented differential expression patterns and variable biochemical properties among paralogs of the same organism . For eIF4E and eIF4G cognates, even evidence of physiological specialization has been found among both unicellular and multicellular organisms ( Table 1). These findings support the hypothesis that in organisms with several paralogs, an ubiquitous set of eIF4 factors supports global translation initiation whereas other paralogs perform their activity in specific cellular processes [45]. In some cases, eIF4E cognates have evolved towards translational repressors. Class 2 eIF4Es are exemplified by eIF4E-homolog protein (4E-HP) in human, eIF4E-2 in mouse [63], eIF4E-8 in Drosophila [52,58,73], IF4 in C. elegans [74,75], and nCBP in A. thaliana [76], and they can bind the 5 cap structure of mRNA but do not bind eIF4G [58,77], thereby acting as a translational repressors of mRNAs associated with it [73,78]. Class 2 eIF4Es are widespread across metazoa, plants, and some fungi although absent in the model ascomycetes S. cerevisiae and S. pombe [46]. Since the Arabidopsis [76] and Caenorhabditis [74] orthologs promote translation of some mRNAs, it seems most likely 4E-HP diverged from a widespread ancestral eIF4E to form a translational repressor in metazoa [3]. A similar example is eIF4E-1B, which emerged only in vertebrates as a translational repressor of a subset of oocyte mRNAs [57,59,79], and Leishmania eIF4E-1, which under heat shock conditions binds to a Leishmania-specific 4E-BP and becomes translationally inactive [71]. In other cases, eIF4E cognates have evolved towards a new molecular function not related to translation. This is the case with Trypanosoma eIF4E-1 and eIF4E-2, which are essential nuclear and cytoplasmic proteins, respectively [49], and Giardia (eIF4E-2), which binds only to nuclear noncoding small RNAs [64]. However, it is also possible that this was an ancestral function of eIF4E [22,40].
Whereas the need for distinct eIF4 proteins in different tissues may have been the driving force behind the evolution of various paralogs in multicellular organisms, in unicellular eukaryotes different paralogs may be differentially needed during distinct life stages [49]. Specific features of mRNA metabolism in some phyla also might have driven the evolution of eIF4Es in specific organisms, such as the use of different cap structures (usually mono-and trimethylated) in mRNAs from worms of the phylum Nematoda [50,51,54,80], and flagellate protists of the order Kinetoplastida [49,65,66]. These mRNAs result from the trans-splicing process to produce mature mRNAs.
Other eIFs have also undergone molecular diversification across eukaryotes, including the multisubunit eIF3 whose subunit composition ranges from 5 to 13 nonidentical polypeptides in different phyla [99], and eIF6 that is duplicated into two or three paralogs in plants [100]. However, the functional relevance of these phenomena (if any) is not known.

Multiple RNA Helicases for Translation Initiation.
The evolution of cap-dependent translation has led to a dependency on RNA helicase activity to unwind the 5 -UTR secondary structure during the scanning [22,40]. The DEADbox RNA helicase/ATPase eIF4A is the main helicase thought to perform this activity. Recently, other RNA helicases from diverse organisms have also been found to facilitate translation of specific mRNAs with structured 5 -UTRs ( Figure 2). Such is the case of the mammalian, Drosophila and yeast DEAD-box helicases DDX3 and Ded1, as well as the human DExH-box helicases RHA and DHX29 [101][102][103]. In Drosophila, the DEAD-box helicase Vasa interacts with eIF5B and regulates the translation of gurken and mei-P26 mRNAs. Evidence supports the idea that Vasa is a translational activator of specific mRNAs involved in germline development [6,7]. In contrast, orthologs of the Xenopus helicase Xp54 (DEAD-box, DDX6-like helicases) in a spectrum of organisms, including Drosophila Me31B, Saccharomyces Dhh1, human rck/p54, and Caenorhabditis CGH-1 have been found to repress translation of stored mRNAs and promote aggregation into germplasm-containing structures [104].
Most RNA helicases involved in translation also play a variety of roles in other processes of RNA metabolism, including mRNA RNP assembly, RNA degradation, RNA export, and splicing [103]. This functional versatility of RNA helicases leads us to speculate that a wider diversity of other, yet unidentified, helicases might be involved in translation in all eukaryotes. This could be the case of the Arabidopsis eIF4F complex, which contains eIF4A in proliferating cells but different RNA helicases in quiescent cells [105]. Whether these helicases play a role in translation is not known.

Divergence in the Regulation of Initiation: Diversity of eIF4E-Binding Proteins.
Almost twenty years ago, it was discovered that eIF4E is negatively regulated in mammalian cells by three related proteins, the eIF4E-binding proteins (4E-BPs) 1, 2, and 3. These proteins share with eIF4G the motif YXXXXLφ (where X is any amino acid and φ is a hydrophobic residue) that interacts with the convex dorsal surface of eIF4E, so binding of 4E-BPs to eIF4E precludes its association with eIF4G and represses cap-dependent Comparative and Functional Genomics . In contrast, mammalian species express two eRF3s (viz. eRF3a and eRF3b). Ribosomes from all eukaryotes perform Elongation with eEF1A and eEF2. However, the yeast S. cerevisiae requires an additional essential factor, eEF3 (light purple), for Elongation to proceed. Genes encoding eEF3 have been found exclusively in many species of fungi. Evidence supports the notion that eEF3 activity promotes ribosome recycling. Variations to the "universal" genetic code, wherein the meaning of a "universal" codon is changed to a different one, exist in several species of in unicellular eukaryotes. Most codon variations are the reassignment of the stop codons UAG and UAA to glutamine, and the stop codon UGA to tryptophan or cysteine.

A A A A A A A A A A A A A A A A A A A A A A
translation [8,106]. In the last years, a myriad of 4E-binding proteins has been discovered in species from distantly related taxa, including mammals, plants, Drosophila, Caenorhabditis, yeast [3,8,106], and Leishmania [71] (Figure 2). Interestingly, most 4E-BPs are phylogenetically unrelated to each other and control translation in disparate, species-specific processes, such as embryogenesis in Drosophila, neurogenesis in mammals, or pseudohyphal growth in yeast. Moreover, some 4E-BPs utilize non-canonical motifs to bind eIF4E. These observations support the idea that binding to eIF4E evolved independently in multiple taxonomic groups [3].  [107][108][109][110]. Mammals possess two Mnk genes (MKNK1/2) which in humans, but not mice, give rise to four Mnk isoforms by alternative splicing; these isoforms have distinct properties in terms of activity, regulation, and subcellular localization [111]. In Drosophila, the single Mnk orthologue, LK6, also phosphorylates eIF4E-1 at a serine residue corresponding to mammalian Ser-209, a phosphorylation that is critical for development and cell growth [112][113][114][115]. However, the effects of phosphorylation on eIF4E activity and its physiological relevance are different across eukaryotes. Indeed, a residue equivalent to Ser-209 is present in metazoan eIF4Es but is absent in different fungi, protists and plants ( [67];R. Jagus et al., this issue). Accordingly, Mnk is conserved among metazoans, but no Mnk ortholog exists in S. cerevisiae or plants, whose eIF4Gs  [116] and S. cerevisiae eIF4E [117] are phosphorylated on residues which are not equivalent to mammalian Ser-209, and S. cerevisiae cells expressing a nonphosphorylatable version as sole source of eIF4E do not display any evident defect on global protein synthesis or cell growth [117]. These observations support the idea that eIF4E phosphorylation at Ser-209 by the MAPK-Mnk signaling pathway evolved only in metazoans and that, perhaps, alternative mechanisms regulate eIF4E in nonmetazoan eukaryotes [3].

Diversity in the Regulation of Initiation: The Case of eIF2alpha Phosphorylation.
Under different stress conditions, general protein synthesis is inhibited through phosphorylation of the alpha subunit of eIF2 at Ser-51 by a family of kinases that are present in widely scattered lineages. They include the double-stranded RNA protein kinase (PKR) that is activated during viral infection, the heme-regulated inhibitor kinase (HRI) that is activated under heme deprivation or arsenite exposure, the PKR-like endoplasmatic reticulum kinase (PERK) that is activated by unfolded proteins in the lumen of the endoplasmic reticulum, and the general control nonderepressible 2 (GCN2) that is activated by uncharged tRNA and thus senses amino acid starvation [107,118] (Figure 2). The presence of eIF2alpha kinases varies in different lineages; while GCN2 is present in all eukaryotes; PERK is found in only metazoans; HRI is found in vertebrates, the dipteran Anopheles, the fungi Schizosaccharomyces, and the echinoderm Strongylocentrotus; PKR is only found in vertebrates [53,118]. Interestingly, in some teleost fishes, PKR has undergone further duplication into PKR and PKZ, which perhaps led teleost fishes to respond to an extended range of viral infections [119].

Divergence in the Aminoacyl-tRNA Synthetases.
The process of elongation is highly conserved among all forms of life [16,24,25]. Key molecules for elongation are aminoacyl-tRNA synthetases (aaRSs), which catalyze the aminoacylation reaction whereby an amino acid is attached to the cognate tRNA. aaRSs are the only components of the gene expression machinery that function at the interface between nucleic acids and proteins. Thus, by performing their activity, aaRSs establishes the fundamental rules of the universal genetic code and, thus, of translation. aaRSs constitute a family of 20 essential cellular enzymes that are grouped into two classes: class I, in which the aminoacylation domain has a Rossmann nucleotide-binding fold, and class II, in which this domain is a seven-stranded beta-sheet with flanking alpha-helices. The conservation of the genetic code suggests that aaRSs evolved very early before the emergence of the last universal common ancestor [120,121]. Throughout evolution of multicellularity, different domains, such as the WHEP domain, the oligonucleotide binding fold-containing EMAPII domain, the tripeptide ELR (Glu-Leu-Arg), the glutathione S-transferase (GST) domain and a specialized amino-terminal helix (N-helix), have been progressively added to different aaRSs in distinct phyla ( Figure 2). The tripeptide ELR and the EMAPII domain were incorporated simultaneously to TyrRSs in metazoans starting from insects; the WHEP domain is present in TrpRS only in chordates; a unique sequence motif, UNE-S, became fused to the C-terminal of SerRS of all vertebrates [120,121]. In bilaterian animals, the glutamylRS and prolylRS were linked via WHEP domains giving rise to a bifunctional glutamyl-prolylRS [120,121]. It was recently found that this fused enzyme is also present in the cnidarian Nematostella, which pushes the origin of glutamyl-prolylRS back to the cnidaria-bilaterian ancestor [122], and suggests that this enzyme further underwent fission in the nematode C. elegans where glutamylRS and prolylRS enzymes are separated. GlutamylRS and prolylRS are also separate in plants and fungi [120][121][122].
It has been found that the function of the aaRSs was either increased or impaired by the addition of the new domains. Whereas the WHEP domain regulates interaction of TrpRS with its cognate receptor, with MetRS this domain plays a tRNA-sequestering function. The Leuzipper motif in ArgRS is important for the formation of multi aaRSs complex (MSC), which enhances channeling of tRNA to the ribosome. Moreover, different aaRSs play diverse roles in cellular activities beyond translation, such as stress response, plant and animal embryogenesis, cell death, immune responses, transcriptional regulation, and RNA splicing [120,121,123]. It was found that the incorporation of domains to aaRSs correlates positively with the increase in organism's complexity. For example, the number of aaRSs carrying the GST domain increases from two in fungi to four in insects, to five in fish, and six in humans [121]. Thus, it has been proposed that the newly fused aaRSs domains triggered the appearance of new biological functions for these proteins in different lineages and that the fusion of domains to aaRSs could have played an important part in expanding the complexity of newly emerging metazoan phyla [121].

Divergence in Elongation
Factors. eEF1A plays a critical role in translation. It binds and delivers aa-tRNAs to the A-site of ribosomes during the elongation step. Because homologs of this essential protein occur in all domains of life, it was thought to exist in all eukaryotes. Strikingly, a recent genome-wide survey revealed that a number of lineages lack eEF1A and instead possess a related factor called elongation factor-like (EFL) protein that retains the residues critical for eEF1A function [124] (Figure 2). It was later found that EFL-encoding species are scattered widely across eukaryotes and that eEF1A and EFL genes display mutually exclusive phylogenetic distributions. Thus, it is assumed that eEF1A and EFL are functionally equivalent [124][125][126][127][128][129][130][131][132]. Since EFL is present only in eukaryotes, it is thought that eEF1A is ancestral to all extant eukaryotes and that a single duplication event in a specific lineage gave rise to EFL. EFL genes were then spread to other lineages via multiple independent lateral gene transfer events, where EFL took over the original eEF1A function resulting in secondary loss of the endogenous eEF1A. It is thought that both genes coexisted for some time before one or the other was lost. Indeed, the diatom Thalassiosira bears both EFL and eEF1A genes [129] and might be an example of this situation. It is also possible that there was a single gain of EFL early in evolution followed by differential loss of it [124,128,129,131,132]. So far, EFL genes have been identified in widespread taxa, including diatoms, green and red algae, fungi, euglenozoans, foraminiferans, cryptophytes, goniomonads, katablepharid, chlorarachniophytes, oomycetes, dinoflagellates, choanozoans, centrohelids, and haptophytes [124][125][126][127][128][129][130][131][132]. Most of them are unicellular organisms. In contrast, eEF1A is found in most eukaryotes, and multiple copies of this gene have been found in some insect orders, including Coleoptera, Hymenoptera, Diptera, Thysanoptera, and Hemiptera [133].

Comparative and Functional Genomics
The eEF1A activity is modulated by diverse posttranslational modifications, including phosphorylation, lysine methylation, and methyl-esterification. eEF1A also undergoes modification by covalent binding of ethanolamine phosphoglycerol (EPG), whose function is not known and for whom the number of moieties attached varies in different eukaryotes [134]. Moreover, in addition to its role in translation, eEF1A has been reported to play several "moonlighting" functions, including binding to cytoskeletal proteins, signal transduction, protein nuclear export and import of tRNAs into mitochondria [134]. It is not known whether EFL undergoes the same posttranslational modifications as eEF1A does and whether it also displays non-translational activities.

Divergence in the Termination Step
The termination of protein synthesis is governed by eRF1, which is a monophyletic and highly conserved protein that is universally present in eukaryotes. Comprehensive analyses of genomic datasets show that eRF1 was inherited by eukaryotes from archaeal ancestors and that most eukaryotes encode only one eRF1. Known exceptions are Arabidopsis thaliana, which possesses three eRF1 genes, and the ciliates Tetrahymena, Oxytricha, Nyctotherus, Oxytricha, Euplotes, and Paramecium which have two eRF1 genes [135][136][137][138]. Interestingly, unusually high rates of eRF1 evolution have been found in organisms with variant genetic codes, especially in the N-terminal domain, which is responsible for stop-codon recognition [30,34,135,136,138,139]. eRF1 displays structural similarity to tRNA molecules and mimics its activity during binding of ribosomal A-site during recognition of a stop codon [34,[139][140][141]. Since mutations in eRF1 N-terminal domain switch from omnipotent to bipotent mode for stop-codon specificity [35][36][37][38]141], most likely the accelerated evolution of eRF1 in organisms with variations to the nuclear genetic code has been driven mainly to accommodate these variations [30, 31, 34-38, 135, 138-141].
eRF3 is a GTPase that stimulates the activity of eRF1 during the translation termination process. eRF3 arose in early eukaryotes by the duplication of the GTPase eEF1A. Consistent with this, eRF3 binds and transports eRF1, a structural mimic of tRNA, to the ribosomal A-site, similar to the role of eEF1A in binding and delivering aminoacyl-tRNAs to the same site during translation elongation [142,143]. eRF3 is much more divergent than eRF1, especially in its N-terminal domain. In addition, eRF3 is universal among eukaryotes, and most organisms only contain singlecopies of this gene [137,143]. In contrast, mammalian species express two eRF3s (viz. eRF3a and eRF3b; Figure 2). They possess different N regions and display drastically different tissue distribution and expression profiles during the cell cycle [143,144]. Moreover, eRF3b but not eRF3a can substitute for yeast eRF3 in translation termination [145]. These observations indicate duplication and further functional divergence of eRF3 proteins in this lineage.

Divergence in the Recycling Step
Ribosomes from all eukaryotes perform elongation with eEF1A and eEF2. Interestingly, it has been known for some time that the yeast Saccharomyces cerevisiae requires an additional essential factor, eEF3, for the elongation cycle to proceed [146]. Genes encoding eEF3 were subsequently identified exclusively in other fungi (both yeasts and filamentous), including Candida, Pneumocystis, Neurospora, Aspergillus, and Mucor [147][148][149][150] (Figure 2). eEF3 is an ATPase that interacts with both ribosomal subunits and stimulates binding of aminoacyl-tRNA to the ribosomal Asite by enhancing the rate of deacylated tRNA dissociation from the E-site. Because E-site release is needed for efficient A-site binding of aminoacyl-tRNA, it was thought that eEF3 functions as a so-called "E-site" factor [16,151]. Most recently, it was shown that post-termination complex, consisting of a ribosome, mRNA, and tRNA, is disassembled into single components by ATP and eEF3. Because the release of mRNA and deacylated tRNA and ribosome dissociation takes place simultaneously and no 40S-mRNA complexes remain, it is proposed that eEF3 activity promotes ribosome recycling [152]. "What were the evolutionary forces that led to the emergence of eEF3 exclusively in fungi?" is a very interesting, still open question.

Concluding Remarks
One of the most conspicuous features of life is its prominent ability to diversify. Current assessments of the biodiversity on Earth reaches 2 million species, although the true number of living organisms could easily be four times that number and likely much higher [153,154]. The diversification of life has occurred at different levels, including the occupancy of ecological niches, behavioral patterns, body plans, and organismal complexity, and metabolic needs and capabilities. More recently, intensive whole-genome shotgun sequencing of microbial communities from different environments has unveiled a vast profusion of diversification also at the genetic level [155][156][157]. We have discussed that diversity also exists in the machinery that performs a fundamental process, translation, across eukaryotes. We speculate that the molecular diversification of the translation apparatus is among the basis that provided to early eukaryotes the scope to invade new ecological niches and overcome the different environmental and biological challenges this represented. Different evolutionary mechanisms might have been the driving forces leading to this molecular diversification in different lineages, including natural selection, sexual selection, genetic drift and neutral evolution. However, at this point, we can be nothing but speculative on the biological meaning of the molecular diversification reviewed here.
Traditional studies on so-called model organisms have taught us the global processes of eukaryotic translation. In the last years, the use of modern genome-wide, highthroughput technologies to study many non-model eukaryotes from different taxa has unveiled that diversification of the translation machinery configuration is far more expansive than previously thought. Collectively, these studies Comparative and Functional Genomics 9 show that the translation apparatus in eukaryotes is far from being evolutionarily static. Therefore, we anticipate that, as more organisms are studied, additional diversification of components of the translation apparatus will be revealed. We believe that a better understanding of the diversity of all levels of organism will provide us a more profound understanding of Life.