Abstract
Eukaryotic translation initiation factor 2-alpha kinase (EIF2AK) proteins inhibit protein synthesis at translation initiation level, in response to various stress conditions, including oxidative stress, heme deficiency, osmotic shock, and heat shock. Origin and functional diversification of EIF2AK sequences remain ambiguous. Here we determine the origin and molecular evolution of EIF2AK proteins in lower eukaryotes and studied the molecular basis of divergence among sub-family sequences. Present work emphasized primitive origin of EIF2AK4 sub-family gene in lower eukaryotes of protozoan lineage. Phylogenetic analysis supported common origin and sub-family based classification of EIF2AKs. Functional divergence studies across sub-families revealed several putative amino acid sites, which assist in altered protein interactions of kinase domains. The data can facilitate designing site-directed experimental studies aiming at elucidating diverse functional aspects of kinase domains regarding down-regulation of protein synthesis.
Introduction
The eIF2α kinases are a family of four distinct serine-threonine kinases, EIF2AK1, EIF2AK2, EIF2AK3 and EIF2AK4, referred to as HRI (Heme-Regulated Inhibitor), PKR (Protein Kinase double-stranded RNA-dependent), PERK (protein kinase R (PKR)-like endoplasmic reticulum kinase), and GCN2 (General Control Nonderepressible2) respectively[1]. Mammalian EIF2AKs have a conserved Kinase Domain (KD) but distinct regulatory domains that mediate activation by diverse stress signals[2]. These are stress-activated protein kinases that downregulate protein synthesis and simultaneously up-regulated transcriptional activators at the translational level. This dual response reduces cellular protein synthesis while reprogramming transcription to favor gene products with functions in stress management[3]. These kinases carry phosphorylation of translation initiation factor 2 (eIF2α) at the Ser51 residue of α-subunit. eIF2 binds GTP to onset initiation pathway delivering methionyl initiator tRNA to 40s ribosomal subunit, which produces the 43S pre-initiation complex. Initiator tRNA recognizes AUG codon, activating eIF2B, an exchange factor for guanine nucleotide, which hydrolyzes GTP releasing eIF2GDP from 40S subunit for recycling back to eIF2GTP. Phosphorylated eIF2 is an inhibitor of eIF2B, which reduces the concentration of eIF2 GTP inhibiting translation initiation. Reduced eIF2GTP levels stimulate translation of GCN4 mRNA in yeasts and ATF4 mRNA in mammals, which encode transcriptional activators of stress genes[4–6].
EIF2AK1 or HRI maintains heme levels in the cell by playing a key role in the synthesis of globin genes and promoting erythroid precursors survival[7]. Proposed model suggests that HRI binds to Heme in its N-terminal end. This triggers an intermolecular autophosphorylation generating HRI-HRI component, which senses Heme concentrations. Further autophosphorylation events activate HRI in heme deficiency conditions and it proceeds to phosphorylate eIF2α. Protein makes up of N-terminus and C-terminus regions flanking a central KD. N-terminus and KD regions harbor binding sites for heme. N-terminus contains a stable heme-binding site, needed for heme regulation of HRI while KD harbors reversible heme-binding site[8]. Flanking histidine residues align heme molecule at both sites.
EIF2AK2 or PKR phosphorylates eIF2α in response to viral infection, thereby blocking translation of viral mRNAs and promoting apoptosis[9]. PKR plays a key role in activation response to diverse signals, such as oxidative and endoplasmic reticulum (ER) stress, cytokine signaling and growth factors[10–13]. PKR’s N-terminal region contains a regulatory double-stranded RNA binding domain (dsRBD) consisting of two dsRNA binding motifs that are separated by an unstructured linker region[14]. C-terminal contains a KD, which bears catalytic functions of protein and has crucial dimerization interface for activation of PKR. Binding of viral dsRNA to PKR induces dimerization of C-terminal KD[15,16]. Autophosphorylation events stabilize resulting dimer, which phosphorylates eIF2α.
Misfolded proteins accumulate in the endoplasmic reticulum (ER) which creates a phenomenon of ER stress activating EIF2AK3 or PERK[17]. PERK is an ER transmembrane protein consisting of the N-terminal region important for dimerization, regulation, and association with ER chaperone, immunoglobulin binding protein (BiP) and HspA5, while the C-terminal region is cytosolic and contains KDs with autophosphorylation sites[18]. Misfolded proteins induce BiP bound PERK in the active state to dissociate and allow dimerization and autophosphorylation activating PERK[19,20]. Additional proposed activation mechanism involves PERK functions as a sensor of ER calcium levels[21,22]. Reduction in ER calcium levels activates PERK.
EIF2AK4 or GCN2 is activated by deprived levels of amino acids and by deprived levels of glucose in cells. GCN2 has a more complex structure than PERK and PKR[23,24]. It harbors the typical KD; it also harbors a pseudo-KD and a histidyl-tRNA synthetase (HisRS) related domain, which has a higher affinity for uncharged tRNAs. Binding of uncharged transfer RNAs (tRNAs) to HisRS domain of protein dimerizes and autophosphorylates protein as other eIF2α kinases which activate GCN2[25]. Activated GCN2 phosphorylates eIF2α, which causes a general inhibition of translation initiation and simultaneously favors selective translation of some mRNAs containing short open reading frames (ORFs)[26]. These encode bZIP (basic leucine zipper) transcription factors that induce expression of genes, which aid cellular adaptation to amino acid deprivation[27,28].
EIF2AKs share a common physiological role of phosphorylating eIF2α substrate that acts as an initiator of the translation process. EIF2AKs recognizes diverse signals at cellular levels due to their diverse signaling domains. KD is a key factor aiding in dimerization and in eIF2α phosphorylation, shows conservation across four sub-families and across species. An understanding divergence of these sequence sub-families provides insights into substrate specificity of kinases. We used bioinformatics and computational approaches to identify novel annotated EIF2AK sequences in lower organisms, address the evolutionary and phylogenetic relationship of EIF2AKs across four sub-families, analyze sites responsible for the evolution of functional divergence of EIF2AK sub-families.
Methodology
Identification of EIF2AK sequences
Annotated EIF2AK sequences were retrieved from UniProt (http://www.uniprot.org/)[29]. Hidden Markov Model (HMM) is built with HMMBUILD program of HMMER3 (http://hmmer.janelia.org/)[30] using above retrieved sequences. HMM profile is invoked as a query for HMMSEARCH program to search 250 vertebrate proteomes retrieved from Ensembl[31], Joint Genome Institute (JGI) (http://www.jgi.doe.gov/)[32] and NCBI genome (http://www.ncbi.nlm.nih.gov/genome/) databases[33]. Simultaneously, a search of NCBI Protein (http://www.ncbi.nlm.nih.gov/protein) sequence database using keyword search “EIF2AK” and sequences are manually curated. Identical sequences in the retrieved sequences were removed using CD-HIT software[34] with default tuple size and a cutting threshold of sequence identity of 95%.
Multiple sequence alignment and phylogenetic tree construction
A multiple sequence alignment (MSA) is obtained based on final dataset of EIF2AK sequences trimmed to conserved KDs. Local alignment on sequences is performed using MAFFT software[35] (http://mafft.cbrc.jp/alignment/software/), with PAM as scoring matrix to align functional motifs of diversified sequences. Based on quality of alignment unaligned or packed columns in MSA were removed manually using Jalview[36] software retaining good quality columns. Phylogenetic trees were built based on Bayesian Inference using MrBayes software version 3.2[37] using mixed amino acid model. Markov Chain Monte Carlo (MCMC) analysis is utilized to approximate posterior probabilities of phylogenetic trees. An analysis is executed for 20 million generations with sampling of every hundredth tree. A stop rule with a stop value of 0.05 is applied to terminate MCMC run and to assess convergence of MCMC. Sump value of 0.25 discards first 25% of sampled trees at burnin period after MCMC’s had converged and substitution model parameters, including mean, mode, and 95% credibility interval of each parameter were estimated. Sumt command is used to build a consensus tree from remaining 75% of sampled trees using 50% majority rule. Phylogenetic trees were viewed using Dendroscope 3 program[38].
Analysis of gene structure and gene order
Gene Structure Display Server (GSDS)[39] (http://gsds.cbi.pku.edu.cn/) is used to analyze gene structure. Representative gene sequences of each phyla of animal kingdom are submitted to GSDS server for generating visualization of gene features such as composition and position of exons and introns. Cinteny server (http://cinteny.cchmc.org/)[40] is used to analyze synteny and evolutionary distances in terms of genome rearrangements for selected organisms. Cinteny server allows comparing multiple genomes and performing sensitivity analysis for synteny block detection and for subsequent computation of reversal distances. Users can interactively browse gene blocks conserved in multiple genomes using Cinteny server, to facilitate genome annotation and validation of assemblies for newly sequenced genomes, and to construct and assess phylogenetic trees.
Comparison of selection pressure
Evolutionary pressure of substitution rates is calculated for selected EIF2AK sequences from different phyla of animal kingdom to analyze nature of substitutions in these sub-families. PAL2NAL (http://www.bork.embl.de/pal2nal/) server[41] is utilized to generate a codon based alignment of DNA sequences based on MSA of protein sequences. Resulting codon alignment is used in CODEML program of the PAML package[42] to calculate rate of non-synonymous and synonymous substitution (ω = dN/dS). An ω value >1 with increased dN value indicates rapid amino acid substitution implying positive selection on gene, and in contrast ω value < 1 indicates elimination of disadvantage replacements, suggesting purifying selection. If dN = dS (or ω = 1) gene evolves neutrally by silent or missense mutations with no significant alterations in gene function.
Estimation of functional divergence
Functional divergence between four sub-families of EIF2AK sequences and related kinase sequences is inferred by type I and type II divergence analyses utilizing Diverge 3.0[43]. Type I functional divergence suggests heterogeneous evolutionary rates between duplicated genes, while type II functional divergence suggests radical changes to biochemical properties like charge (positive or negative), nature (hydrophilic/hydrophobic), between duplicates. Site-specific estimation of posterior probability of radical changes related to type II functional divergence is performed to assess probable regions and shifts of biochemical properties between paralogous groups[44].
Results and discussion
Identification and classification of EIF2AK sequences
852 sequences in total consisting of annotated and unannotated sequences were identified using HMMSEARCH of sequenced genomes and NCBI sequence keyword based searching. Further classification of sequences at sub-family level identified a set of 169 EIF2AK1, 148 EIF2AK2, 206 EIF2AK3 and 329 EIF2AK4 sequences. EIF2AK1 and EIF2AK3 sub-family sequences show their origin from Nemathelminthes and Platyhelminthes. EIF2AK2 shows a later evolutionary origin in phylogenetic lineage with its origin from higher chordates Osteichthyes (Bony fish). EIF2AK4 occurs down evolutionary lineage with occurrence from lower eukaryotic species like Viridiplantae, which comprises of lower algal forms inferring its primitive ancestry in EIF2AK family. All EIF2AK sub-families show a continuous distribution by their presence in higher organisms across evolutionary order. HMMSEARCH of bacterial model organisms provide insight to prokaryotic origin of EIF2AKs from Protein kinase B (PknB) a Serine/threonine-protein kinase (STPK). PknB shares striking similarity in protein fold, catalytic machinery and kinase regulation mechanism with eukaryotic STPKs[45–47]. S1 Table shows distribution of EIF2AKs across phyla along with reference number.
Domain organization of EIF2AK family
EIF2AK family proteins show a varied domain organization majorly at signaling domain due to a wide range of activation signals they recognize (Fig 1). EIF2AK1 has a partial KD followed by a functional full-length KD. EIF2AK2 in humans has two Double-Stranded RNA-binding Motifs (DSRM), which contribute to recognition of dsRNA of viral origin and a functional KD. EIF2AK3 has a single KD with heme recognition motif, which is sensitive to heme levels. In lower non-chordates, EIF2AK3 has a dimerized Pyrrolo-quinoline quinone (PQQ) domain that aids in electron transfer reactions like transfer of phosphate moiety due to relatively high redox potentials of PQQ/PQQH2 couple[48]. EIF2AK4 has a complex structure with two nonfunctional pseudo KDs followed by a functional KD. It also harbors a histidyl-tRNA synthetase (HisRS)-related domain, which binds uncharged tRNAs. RWD domain at N terminal is located in RING finger and WD repeat containing proteins and DEXDc-like helicases. It also harbors a HGCTP anticodon domain, which aids in down regulation of translation process in stress conditions. Domain organization of EIF2AK4 shows a similar domain architecture in lower eukaryotes with loss of partial KDs. Domain organization of bacterial PknB kinase shows a typical KD, which plays a critical role in signal transduction pathway that regulates cell growth, cell shape and cell division[49–51]. Three PASTA (Penicillin-binding protein and Serine/Threonine kinase Associated) domains activate protein by dimerization of KD by allosteric mechanism and trigger auto phosphorylation events[52,53].
Multiple sequence alignment and phylogenetic analysis
To investigate evolutionary relationships of identified EIF2AK sequences we adopted Bayesian methods of phylogenetic reconstruction with identified 852 EIF2AK family sequences and outgroup sequences of closely related families via NEK2, TOPK, TTK, WEE1 and bacterial STPK. The inferred Bayesian tree topology shows an unrooted tree with a four closely grouped clades, supported by a good confidence value of 100 percent representing clustering of four sub-families of EIF2AK sequences (Fig 2). Outgroup sequences cluster into definite groups showing similarity among EIF2AK sequences compared to outgroup sequences. These phylogenetic groupings are consistent with aforementioned identity based methods of hmm and BLAST based classification methods. This phylogenetic classification of sequences is also useful in classification of highly ambiguous sequences. Four clades branched out were further split into divergent sub-clades, which are organized by evolutionary hierarchy of taxa present at each node. EIF2AK1 clade shows nematodes, arthropod, molluscans and hemichordates closer to node showing primitive origin of gene in lower organisms. Higher chordates from fish, reptiles and mammals group into three distinct subgroups. EIF2AK2 sequences show only presence of chordates that are further divided into discrete two sub groups, one consisting of sequences of Reptiles and Aves, and other group consisting of mammals, highly evolved chordates. EIF2AK3 sequences show a widespread distribution of subclades from Platyhelminthes to higher chordates. EIF2AK4 sequences show a highly clustered sequences which consist of lower evolutionary fungal sequences, other making up of Reptiles, Aves, and third cluster consisting of Mammals.
Four sub-families of EIF2AK exhibit homology with overall average sequence identity of 41%. Alignment file shows conserved sequence patterns of glycine rich loop GxGxxGxV, in EIF2AK sequence family, which interacts with phosphate moiety of ATP. In outgroup sequences and prokaryotic STPK sequences motif is variant with high substituted third Gly and ultimate Val residues. Conserved Asp residue of N-terminal end HRD[LI]KPxN motif stabilizes pronated form of transferred phosphate moiety of ATP molecule. In distinct related protein sub-families like TTK, WEE1 and TOPK Arg residue is substituted with Ser, Met and Gly respectively. Similarly DFG motif interacting with Mg2+ cofactor is highly conserved in serine threonine kinases but is highly substituted in WEE1 and TOPK. An average sequence identity of 68%, 56%, 70% and 48% is found in EIF2AK subgroups, which was calculated using ALISTAT server[54]. Alignment of subfamily sequences shows a similar conservation of above mentioned sites, as they are primarily engaged in phosphorylation of eIF2α. Conservation of residues in alignment is plotted with reference sequences using ESPript 3.0 server[55] (Fig 3).
Gene structure and order analysis
Orthologous genes from completely sequenced genomes share a conserved exon–intron structure through evolutionary lineage[56–58]. A comparative analysis of gene structures of four EIF2AK sub-families was adopted (Fig 4). Four members of EIF2AK family show an altered gene structure in subgroups as well as within subgroups during course of evolution. EIF2AK1 gene is encoded by 15 exons whereas EIF2AK2 gene is encoded by 17–18 exons that remain unvaried in chordate evolution. EIF2AK3 is coded by 15–22 exons, which alters, based on class of evolution. Larger number of exons of 39–40, code EIF2AK4 gene that is the largest of all EIF2AK family sequences. EIF2AK3 and EIF2AK4 of invertebrates are coded by 4 and 10 exons, which show a conservation in numbers. The few intron loss or gain events that occur in these sequences seem to be dependent on lineage-specific event. Higher conservation of exon and coding position show evolutionary conservation of EIF2AK family.
Synteny is conservation of blocks of order within two sets of chromosomes that are being compared with each other. EIF2AK sequences show a conserved sequence in closely related chordates Homo sapiens, Mus musculus and Rattus norvegicus (Fig 5). EIF2AK1, EIF2AK2, EIF2AK3 family sequences show a conservative order of 5–6 genes showing conserved chromosomal origin across these species. EIF2AK4 sequence which shows an evolutionary lineage in lower organisms has a high conserved gene order in above mentioned species with 11 sequences conserved. Gene order in lower non-chordates is highly variable and shows a little conservation due to less number of genes they code.
Selection pressure
Selection pressure acting on four EIF2AK sub-families is estimated to determine type of natural selection acting on these genes (Fig 6). EIF2AK genes that show an increased number of sites are subjected to negative selection, which aids in the removal of deleterious alleles. EIF2AK1, EIF2AK3 and EIF2AK4 genes show an increase in number of sites under an increased negative selection. EIF2AK2 gene not only shows a fall in numbers of the sites under negative selection but also an increase in number of sites under positive selection. Due to rapid increase in positive selection site numbers of EIF2AK2, gene shows an increased ω value of 0.35 compared to other EIF2AK’s ω value ranging from 0.06–0.17. These studies show that EIF2AK2 genes involved in host-pathogen interactions are subjected to an intense positive selection. Leigh Van Valen’s Red Queen hypothesis explains the evolutionary dynamics of host–pathogen interactions[59]. It hypothesizes the arms race of coevolution between two intensely competing species to gain advantage over their rival, leading to evolutionary stasis in the long term, whereby the two species continue to coexist. EIF2AK2 is an example of host gene that is under higher positive selection to deny access to the pathogen, which constantly devises novel strategies for evading respective host’s defensive systems[60].
Functional divergence of KD
Amino acid residues are said to be functionally important if it they are evolutionarily conserved. Putative changes in evolutionary conservation of amino acids at a particular residue may indicate its involvement in functional divergence. KD of four EIF2AKs have same functional role of protein synthesis inhibition, through eIF2α phosphorylation. However, KD of these proteins also has altered functions of phosphorylation of varied substrates like p53/TP53, PPP2R5A, DHX9, ILF3, IRS1 and have specific evolved inhibitors like TAT, K3L etc. Type I functional divergence refers to evolutionary process resulting in site-specific rate shifts after gene duplication, which alters highly conserved amino acid in one duplicate gene to a highly variable in other one. Type II functional divergence results in a site-specific property shift. Two evolutionally conserved orthologous genes homologous residues are subjected to a radical shift of amino acid property e.g., positively vs. negatively charged have occurred between two duplicate genes; otherwise, they are both evolutionarily conserved within each of orthologous genes. The coefficient of functional divergence between duplicate genes is defined as probability of being linked to functional divergence, with a higher value indicating a high level of divergence. Furthermore, a site-specific profile based on empirical posterior analysis can be used to predict residues that are liable to be responsible for functional divergence between two clusters.
S2 Table contains information of type I and type II sites, with posterior probability cutoff determined by a steep fall in the values (S1 Fig), along with residues from pairs of protein. In present study, a set of 72 EIF2AK sequences consisting of four sub-families along with closely related WEE1 and distinctly related STPK’s of bacterial origin is used to estimate coefficient of type I and type II functional divergence among pairs of these families and sub-families. Estimates of coefficient of divergence (θ) signify occurrence of both type I and type II divergence in KD of four EIF2AK proteins along with WEE1 and STPK. θI values for proteins range from 0.167 to 0.894 showing a little occurrence of type I functional divergence. Forty putative type I divergence sites are predominantly lying on C lobe of KD, are identified with posterior probability above the cutoff. Protein sequences show less incidence of type II divergence with a putative number of seventeen sites in MSA falling under this category. A few sites merely occur in N lobe and are widespread in C lobe. θII values for EIF2AK proteins range from 0.011 to 0.755 with a higher standard error values to diversified amino acids. Majority of these sites lie in the organized structural regions of KD whose substations impart an altered function.
Family pairs show a higher occurrence of type I divergence sites than type II divergence sites between pairs of sub-families, showing high substitution rates with a conservation of aa features (S3 Table). This is evident by a higher level of purifying selection acting on these sites. A few sites type I and type II divergence are observed in EIF2AKs, showing a conservation of KD. EIF2AKs show an increased occurrence of type I divergence with closely related WEE1 sequences than STPK of earlier origin, which predicts earlier diversification of EIF2AKs in lower organisms. Inter family comparison of EIF2AKs along with WEE1 and STPK shows fewer occurrences of type II divergent sites, which show evolutionary conservation of KD. Unique type I divergent values of EIF2AK4 sequences in comparison with other EIF2AKs illustrate its primitive origin in lower multicellular forms and its close conservation with STPK manifest origin of EIF2AKs from bacterial serine threonine kinases.
Altered interactions of proteins can be analyzed by functional divergence studies on binding site residues. Substrate protein eIF2α[61] and inhibitor protein K3L[62] binding sites are analyzed for significant functional divergence (Table 1). In regards to eIF2α; EIF2AKs do not show a significant functional divergence sites whereas non-interacting proteins like WEE1 and STPK show significant number of sites with type II divergence. Nonspecific inhibitor protein K3L has sites with significant functional divergence across EIF2AK family as well as WEE1 and STPK families. Common binding site 491 shows a significant number of type II divergence unlike other binding sites, which is asserted by selection pressure of both eIF2α and K3L proteins. Thus, binding sites of EIF2AK are under a constant selection pressure, which substitute functional site residues with drastic chemical variation of type II divergence altering interaction mechanisms to increase binding abilities.
Table 1. Functional divergence of binding sites.
Residues | Alignment position | EIF2AK1 | EIF2AK3 | EIF2AK4 | WEE1 | STPK | |||||
---|---|---|---|---|---|---|---|---|---|---|---|
θI | θII | θI | θII | θI | θII | θI | θII | θI | θII | ||
eIF2α binding site | |||||||||||
451 | 375 | 0.116 | 0.000 | 0.079 | 0.000 | 0.106 | 0.000 | 0.795 | 5.913 | 0.474 | 0.000 |
452 | 376 | 0.280 | 0.166 | 0.082 | 0.144 | 0.108 | 0.576 | 0.532 | 0.884 | 0.357 | 1.370 |
453 | 377 | 0.497 | 0.501 | 0.383 | 0.157 | 0.276 | 0.632 | 0.438 | 0.856 | 0.217 | 0.849 |
484 | 420 | 0.638 | 0.473 | 0.183 | 0.597 | 0.210 | 0.318 | 0.483 | 0.216 | 0.658 | 0.470 |
490 | 426 | 0.116 | 0.000 | 0.079 | 0.000 | 0.143 | 0.000 | 0.971 | 0.334 | 0.477 | 1.602 |
491 | 427 | 0.378 | 1.149 | 0.361 | 2.218 | 0.169 | 0.933 | 0.724 | 1.186 | 0.289 | 0.634 |
K3L binding site | |||||||||||
405 | 268 | 0.116 | 0.000 | 0.617 | 1.032 | 0.141 | 0.000 | 0.459 | 0.000 | 0.499 | 0.734 |
455 | 379 | 0.169 | 2.290 | 0.079 | 0.000 | 0.566 | 0.617 | 0.795 | 1.565 | 0.598 | 0.000 |
473 | 408 | 0.679 | 0.426 | 0.125 | 0.287 | 0.202 | 1.673 | 0.677 | 0.000 | 0.234 | 1.555 |
480 | 415 | 0.116 | 0.000 | 0.079 | 0.000 | 0.105 | 0.000 | 0.459 | 0.000 | 0.138 | 0.000 |
491 | 427 | 0.378 | 1.149 | 0.361 | 2.218 | 0.169 | 0.933 | 0.724 | 1.186 | 0.289 | 0.634 |
504 | 451 | 0.187 | 0.271 | 0.346 | 0.000 | 0.532 | 0.000 | 0.536 | 0.000 | 0.316 | 0.000 |
Conclusion
A large number of putative EIF2AK sequences were identified and classified in lower non-chordates, which show primitive origin of these genes. Comparative analysis at amino acid levels and gene structures supported orthologous origin of KD from bacterial PknB and it might have gained diversified signaling domains through events like chromosomal crossover. Phylogenetic studies show a separation of EIF2AK families into distinct clades and subclades, which show orthologous origin of these genes. All four genes are subject to a purifying selection, but EIF2AK2 shows an increased ω value due to its viral interactions. Functional divergence shows fewer sites with type I divergence and type II divergence sites with a conservation in KD structural features. Cluster specific analysis tells variation of functional divergence sites of closely related EIF2AKs with distinctly related families of WEE1 and STPK. Altered interactions of KD with substrates like eIF2α and K3L, which activates novel signaling cascade reactions, can be accountable to functional divergence, which imparts diversity and specificity in recognizing interacting proteins to KD. Site directed mutagenesis studies of proposed divergence sites would aid in understanding anonymous functions of KDs.
Supporting information
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
The authors received no specific funding for this work.
References
- 1.Donnelly N, Gorman AM, Gupta S, Samali A (2013) The eIF2alpha kinases: their structures and functions. Cell Mol Life Sci 70: 3493–3511. doi: 10.1007/s00018-012-1252-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Dever TE (2002) Gene-specific regulation by general translation factors. Cell 108: 545–556. [DOI] [PubMed] [Google Scholar]
- 3.Hinnebusch AG (2005) eIF2alpha kinases provide a new solution to the puzzle of substrate specificity. Nat Struct Mol Biol 12: 835–838. doi: 10.1038/nsmb1005-835 [DOI] [PubMed] [Google Scholar]
- 4.Lu PD, Harding HP, Ron D (2004) Translation reinitiation at alternative open reading frames regulates gene expression in an integrated stress response. J Cell Biol 167: 27–33. doi: 10.1083/jcb.200408003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Vattem KM, Wek RC (2004) Reinitiation involving upstream ORFs regulates ATF4 mRNA translation in mammalian cells. Proc Natl Acad Sci U S A 101: 11269–11274. doi: 10.1073/pnas.0400541101 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Harding HP, Novoa I, Zhang Y, Zeng H, Wek R, et al. (2000) Regulated translation initiation controls stress-induced gene expression in mammalian cells. Mol Cell 6: 1099–1108. [DOI] [PubMed] [Google Scholar]
- 7.Chen JJ (2007) Regulation of protein synthesis by the heme-regulated eIF2alpha kinase: relevance to anemias. Blood 109: 2693–2699. doi: 10.1182/blood-2006-08-041830 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Rafie-Kolpin M, Chefalo PJ, Hussain Z, Hahn J, Uma S, et al. (2000) Two heme-binding domains of heme-regulated eukaryotic initiation factor-2alpha kinase. N terminus and kinase insertion. J Biol Chem 275: 5171–5178. [DOI] [PubMed] [Google Scholar]
- 9.Galluzzi L, Brenner C, Morselli E, Touat Z, Kroemer G (2008) Viral control of mitochondrial apoptosis. PLoS Pathog 4: e1000018 doi: 10.1371/journal.ppat.1000018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Ito T, Yang M, May WS (1999) RAX, a cellular activator for double-stranded RNA-dependent protein kinase during stress signaling. J Biol Chem 274: 15427–15432. [DOI] [PubMed] [Google Scholar]
- 11.Nakamura T, Furuhashi M, Li P, Cao H, Tuncman G, et al. (2010) Double-stranded RNA-dependent protein kinase links pathogen sensing with stress and metabolic homeostasis. Cell 140: 338–348. doi: 10.1016/j.cell.2010.01.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Onuki R, Bando Y, Suyama E, Katayama T, Kawasaki H, et al. (2004) An RNA-dependent protein kinase is involved in tunicamycin-induced apoptosis and Alzheimer's disease. EMBO J 23: 959–968. doi: 10.1038/sj.emboj.7600049 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ruvolo PP, Gao F, Blalock WL, Deng X, May WS (2001) Ceramide regulates protein synthesis by a novel mechanism involving the cellular PKR activator RAX. J Biol Chem 276: 11754–11758. doi: 10.1074/jbc.M011400200 [DOI] [PubMed] [Google Scholar]
- 14.Meurs E, Chong K, Galabru J, Thomas NS, Kerr IM, et al. (1990) Molecular cloning and characterization of the human double-stranded RNA-activated protein kinase induced by interferon. Cell 62: 379–390. [DOI] [PubMed] [Google Scholar]
- 15.Nanduri S, Rahman F, Williams BR, Qin J (2000) A dynamically tuned double-stranded RNA binding mechanism for the activation of antiviral kinase PKR. EMBO J 19: 5567–5574. doi: 10.1093/emboj/19.20.5567 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Wu S, Kaufman RJ (1997) A model for the double-stranded RNA (dsRNA)-dependent dimerization and activation of the dsRNA-activated protein kinase PKR. J Biol Chem 272: 1291–1296. [DOI] [PubMed] [Google Scholar]
- 17.Szegezdi E, Logue SE, Gorman AM, Samali A (2006) Mediators of endoplasmic reticulum stress-induced apoptosis. EMBO Rep 7: 880–885. doi: 10.1038/sj.embor.7400779 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Harding HP, Zhang Y, Ron D (1999) Protein translation and folding are coupled by an endoplasmic-reticulum-resident kinase. Nature 397: 271–274. doi: 10.1038/16729 [DOI] [PubMed] [Google Scholar]
- 19.Sanderson TH, Deogracias MP, Nangia KK, Wang J, Krause GS, et al. (2010) PKR-like endoplasmic reticulum kinase (PERK) activation following brain ischemia is independent of unfolded nascent proteins. Neuroscience 169: 1307–1314. doi: 10.1016/j.neuroscience.2010.05.076 [DOI] [PubMed] [Google Scholar]
- 20.Bertolotti A, Zhang Y, Hendershot LM, Harding HP, Ron D (2000) Dynamic interaction of BiP and ER stress transducers in the unfolded-protein response. Nat Cell Biol 2: 326–332. doi: 10.1038/35014014 [DOI] [PubMed] [Google Scholar]
- 21.Cavener DR, Gupta S, McGrath BC (2010) PERK in beta cell biology and insulin biogenesis. Trends Endocrinol Metab 21: 714–721. doi: 10.1016/j.tem.2010.08.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Gupta S, McGrath B, Cavener DR (2010) PERK (EIF2AK3) regulates proinsulin trafficking and quality control in the secretory pathway. Diabetes 59: 1937–1947. doi: 10.2337/db09-1064 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Deval C, Chaveroux C, Maurin AC, Cherasse Y, Parry L, et al. (2009) Amino acid limitation regulates the expression of genes involved in several specific biological processes through GCN2-dependent and GCN2-independent pathways. FEBS J 276: 707–718. doi: 10.1111/j.1742-4658.2008.06818.x [DOI] [PubMed] [Google Scholar]
- 24.Ye J, Kumanova M, Hart LS, Sloane K, Zhang H, et al. (2010) The GCN2-ATF4 pathway is critical for tumour cell survival and proliferation in response to nutrient deprivation. EMBO J 29: 2082–2096. doi: 10.1038/emboj.2010.81 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Dong J, Qiu H, Garcia-Barrio M, Anderson J, Hinnebusch AG (2000) Uncharged tRNA activates GCN2 by displacing the protein kinase moiety from a bipartite tRNA-binding domain. Mol Cell 6: 269–279. [DOI] [PubMed] [Google Scholar]
- 26.Marion V, Sankaranarayanan S, de Theije C, van Dijk P, Lindsey P, et al. (2011) Arginine deficiency causes runting in the suckling period by selectively activating the stress kinase GCN2. J Biol Chem 286: 8866–8874. doi: 10.1074/jbc.M110.216119 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Bruhat A, Cherasse Y, Maurin AC, Breitwieser W, Parry L, et al. (2007) ATF2 is required for amino acid-regulated transcription by orchestrating specific histone acetylation. Nucleic Acids Res 35: 1312–1321. doi: 10.1093/nar/gkm038 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Chaveroux C, Lambert-Langlais S, Cherasse Y, Averous J, Parry L, et al. (2010) Molecular mechanisms involved in the adaptation to amino acid limitation in mammals. Biochimie 92: 736–745. doi: 10.1016/j.biochi.2010.02.020 [DOI] [PubMed] [Google Scholar]
- 29.UniProt C (2015) UniProt: a hub for protein information. Nucleic Acids Res 43: D204–212. doi: 10.1093/nar/gku989 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Eddy SR (1996) Hidden Markov models. Curr Opin Struct Biol 6: 361–365. [DOI] [PubMed] [Google Scholar]
- 31.Yates A, Akanni W, Amode MR, Barrell D, Billis K, et al. (2016) Ensembl 2016. Nucleic Acids Res 44: D710–716. doi: 10.1093/nar/gkv1157 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Nordberg H, Cantor M, Dusheyko S, Hua S, Poliakov A, et al. (2014) The genome portal of the Department of Energy Joint Genome Institute: 2014 updates. Nucleic Acids Res 42: D26–31. doi: 10.1093/nar/gkt1069 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Pruitt KD, Tatusova T, Brown GR, Maglott DR (2012) NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy. Nucleic Acids Res 40: D130–135. doi: 10.1093/nar/gkr1079 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Fu L, Niu B, Zhu Z, Wu S, Li W (2012) CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 28: 3150–3152. doi: 10.1093/bioinformatics/bts565 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Katoh K, Standley DM (2013) MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30: 772–780. doi: 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Waterhouse AM, Procter JB, Martin DM, Clamp M, Barton GJ (2009) Jalview Version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics 25: 1189–1191. doi: 10.1093/bioinformatics/btp033 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Ronquist F, Huelsenbeck JP (2003) MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19: 1572–1574. [DOI] [PubMed] [Google Scholar]
- 38.Huson DH, Scornavacca C (2012) Dendroscope 3: an interactive tool for rooted phylogenetic trees and networks. Syst Biol 61: 1061–1067. doi: 10.1093/sysbio/sys062 [DOI] [PubMed] [Google Scholar]
- 39.Hu B, Jin J, Guo AY, Zhang H, Luo J, et al. (2015) GSDS 2.0: an upgraded gene feature visualization server. Bioinformatics 31: 1296–1297. doi: 10.1093/bioinformatics/btu817 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Sinha AU, Meller J (2007) Cinteny: flexible analysis and visualization of synteny and genome rearrangements in multiple organisms. BMC Bioinformatics 8: 82 doi: 10.1186/1471-2105-8-82 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Suyama M, Torrents D, Bork P (2006) PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res 34: W609–612. doi: 10.1093/nar/gkl315 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Yang Z (2007) PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol 24: 1586–1591. doi: 10.1093/molbev/msm088 [DOI] [PubMed] [Google Scholar]
- 43.Gu X, Zou Y, Su Z, Huang W, Zhou Z, et al. (2013) An update of DIVERGE software for functional divergence analysis of protein family. Mol Biol Evol 30: 1713–1719. doi: 10.1093/molbev/mst069 [DOI] [PubMed] [Google Scholar]
- 44.Zheng Y, Xu D, Gu X (2007) Functional divergence after gene duplication and sequence-structure relationship: a case study of G-protein alpha subunits. J Exp Zool B Mol Dev Evol 308: 85–96. doi: 10.1002/jez.b.21140 [DOI] [PubMed] [Google Scholar]
- 45.Boitel B, Ortiz-Lombardia M, Duran R, Pompeo F, Cole ST, et al. (2003) PknB kinase activity is regulated by phosphorylation in two Thr residues and dephosphorylation by PstP, the cognate phospho-Ser/Thr phosphatase, in Mycobacterium tuberculosis. Mol Microbiol 49: 1493–1508. [DOI] [PubMed] [Google Scholar]
- 46.Ortiz-Lombardia M, Pompeo F, Boitel B, Alzari PM (2003) Crystal structure of the catalytic domain of the PknB serine/threonine kinase from Mycobacterium tuberculosis. J Biol Chem 278: 13094–13100. doi: 10.1074/jbc.M300660200 [DOI] [PubMed] [Google Scholar]
- 47.Young TA, Delagoutte B, Endrizzi JA, Falick AM, Alber T (2003) Structure of Mycobacterium tuberculosis PknB supports a universal activation mechanism for Ser/Thr protein kinases. Nat Struct Biol 10: 168–174. doi: 10.1038/nsb897 [DOI] [PubMed] [Google Scholar]
- 48.Anthony C (2001) Pyrroloquinoline quinone (PQQ) and quinoprotein enzymes. Antioxid Redox Signal 3: 757–774. doi: 10.1089/15230860152664966 [DOI] [PubMed] [Google Scholar]
- 49.Kang CM, Abbott DW, Park ST, Dascher CC, Cantley LC, et al. (2005) The Mycobacterium tuberculosis serine/threonine kinases PknA and PknB: substrate identification and regulation of cell shape. Genes Dev 19: 1692–1704. doi: 10.1101/gad.1311105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Dasgupta A, Datta P, Kundu M, Basu J (2006) The serine/threonine kinase PknB of Mycobacterium tuberculosis phosphorylates PBPA, a penicillin-binding protein required for cell division. Microbiology 152: 493–504. doi: 10.1099/mic.0.28630-0 [DOI] [PubMed] [Google Scholar]
- 51.Fernandez P, Saint-Joanis B, Barilone N, Jackson M, Gicquel B, et al. (2006) The Ser/Thr protein kinase PknB is essential for sustaining mycobacterial growth. J Bacteriol 188: 7778–7784. doi: 10.1128/JB.00963-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Wehenkel A, Fernandez P, Bellinzoni M, Catherinot V, Barilone N, et al. (2006) The structure of PknB in complex with mitoxantrone, an ATP-competitive inhibitor, suggests a mode of protein kinase regulation in mycobacteria. FEBS Lett 580: 3018–3022. doi: 10.1016/j.febslet.2006.04.046 [DOI] [PubMed] [Google Scholar]
- 53.Barthe P, Mukamolova GV, Roumestand C, Cohen-Gonsaud M (2010) The structure of PknB extracellular PASTA domain from mycobacterium tuberculosis suggests a ligand-dependent kinase activation. Structure 18: 606–615. doi: 10.1016/j.str.2010.02.013 [DOI] [PubMed] [Google Scholar]
- 54.Eddy S (2005) SQUID—C function library for sequence analysis The Eddy/Rivas Laboratory, Cambridge, USA: Eddy/Rivas Lab. [Google Scholar]
- 55.Robert X, Gouet P (2014) Deciphering key features in protein structures with the new ENDscript server. Nucleic Acids Res 42: W320–324. doi: 10.1093/nar/gku316 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Rogozin IB, Sverdlov AV, Babenko VN, Koonin EV (2005) Analysis of evolution of exon-intron structure of eukaryotic genes. Brief Bioinform 6: 118–134. [DOI] [PubMed] [Google Scholar]
- 57.Sanchez D, Ganfornina MD, Gutierrez G, Marin A (2003) Exon-intron structure and evolution of the Lipocalin gene family. Mol Biol Evol 20: 775–783. doi: 10.1093/molbev/msg079 [DOI] [PubMed] [Google Scholar]
- 58.Banyai L, Patthy L (2004) Evidence that human genes of modular proteins have retained significantly more ancestral introns than their fly or worm orthologues. FEBS Lett 565: 127–132. doi: 10.1016/j.febslet.2004.03.088 [DOI] [PubMed] [Google Scholar]
- 59.VanValen L (1973) A new evolutionary law. Evolutionary Theory 1: 1–30. [Google Scholar]
- 60.Rothenburg S, Seo EJ, Gibbs JS, Dever TE, Dittmar K (2009) Rapid evolution of protein kinase PKR alters sensitivity to viral inhibitors. Nat Struct Mol Biol 16: 63–70. doi: 10.1038/nsmb.1529 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Dar AC, Dever TE, Sicheri F (2005) Higher-order substrate recognition of eIF2alpha by the RNA-dependent protein kinase PKR. Cell 122: 887–900. doi: 10.1016/j.cell.2005.06.044 [DOI] [PubMed] [Google Scholar]
- 62.Seo EJ, Liu F, Kawagishi-Kobayashi M, Ung TL, Cao C, et al. (2008) Protein kinase PKR mutants resistant to the poxvirus pseudosubstrate K3L protein. Proc Natl Acad Sci U S A 105: 16894–16899. doi: 10.1073/pnas.0805524105 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.