Skip to main content
iMeta logoLink to iMeta
. 2023 Nov 19;2(4):e148. doi: 10.1002/imt2.148

VeloPro: A pipeline integrating Ribo‐seq and AlphaFold deciphers association patterns between translation velocity and protein structure features

Bian Bian 1,2, Toshitaka Kumagai 3, Yutaka Saito 1,2,4,5,
PMCID: PMC10989810  PMID: 38868219

Abstract

VeloPro integrates Ribo‐seq data and AlphaFold2‐predicted 3D protein structure information for characterization of the association patterns between translation velocity and many protein structure features in prokaryotic and eukaryotic organisms across different taxonomical clades such as bacteria, fungi, protozoa, nematode, plants, insect, and mammals. We illustrated that association patterns between translation velocity and protein structure features differ across organisms, partially reflecting their taxonomical relationship.

graphic file with name IMT2-2-e148-g001.jpg

INTRODUCTION

Translation is an essential process for protein biogenesis, and translation velocity (also known as ribosome dwell times) plays a crucial role in modulating protein co‐translational folding [1]. The protein co‐translational folding process starts when a nascent peptide chain emerges within the ribosome exit tunnel [2]. Translation velocity variation may modulate protein folding by enabling the formation of structural intermediates within the ribosome exit tunnel [2, 3]. Moreover, the co‐translational folding is facilitated by the electrostatic interactions between the proteins and the vestibule of the ribosomal tunnel [4]. Many studies have shown that larger proteins with more sophisticated structural properties may rely more on co‐translational folding to avoid misfolding and aggregation, which further illustrates the paramount importance of co‐translational folding [5]. Ribosomes actively engage in the folding of knotted proteins by threading nascent chains, which emerge from ribosome exit channels, through loops created by an earlier synthesized segment of the same protein; thus, co‐translational folding processes can be necessary to achieve the native structure of proteins whose polypeptide chain is deeply knotted [6, 7].

With rapid development in deep sequencing technologies, a revolutionary approach, ribosome profiling (also called Ribo‐seq) has been utilized to probe and quantify a local translation velocity at the codon resolution level in vivo [8]. As Ribo‐seq footprint density is considered to negatively correlate with the local translation velocity, Ribo‐seq has been successfully utilized to understand local translation velocity in bacteria and eukaryotes [9, 10, 11, 12].

Translation velocity is known to be associated with mRNA sequence features [13, 14, 15, 16]. Many pioneering studies illustrated that Shine–Dalgarno (SD)‐like sequences strongly impede translation [17] while some research argues that SD‐like motifs have little effect on translation elongation [18]. Codon usage bias is another critical factor strongly affecting translation kinetics [19]. Codon usage bias is considered to correlate with tRNA concentration in vivo and rare codons cause ribosome stalling on an mRNA during translation [1, 11]. Furthermore, mRNA secondary structure influences translation velocity, as the mRNA hairpin or pseudoknot structure could hinder ribosome movement during translation [20, 21]. More recently, Tajima et al. found that the relationship between translation efficiency and mRNA sequence features differs across nine microorganisms partially reflecting their taxonomy [13].

Despite the accumulating studies on mRNA sequence features, few studies have investigated the association between protein features and translation velocity [22, 23]. Zhou et al. observed that RNA regions coding for alpha‐helix or beta‐sheet preferentially use frequent codons, while those coding for coil regions prefer to use less frequent codons or rare codons in the filamentous fungus Neurospora [24]. A previous study also indicated that translation velocity is different between RNA regions coding for coil and structured regions; it becomes slower in mRNA sequences coding for coil regions than those coding for alpha‐helix or beta‐strand regions in yeast and humans [23], while they raised codon usage bias as a factor to explain these differences. Moreover, previous research has shown that positively charged amino acid is the primary inhibitor of translation velocity in yeast [22] and could also lead to ribosome stalling in Arabidopsis chloroplast [25]. Another study suggested that some short nascent peptide sequences could be associated with ribosome stalling in yeast, such as PPP and LKK amino acid sequences [26]. However, the previous studies have focused on a few specific organisms, mostly yeast and humans, raising the question that to what extent these association patterns apply to diverse organisms (Supporting Information S2: Table S1). Revealing the association between translation velocity and protein features in a broader range of taxonomy will help us better understand factors that may affect co‐translational folding.

In this study, we developed VeloPro, a pipeline that integrates publicly available ribosome profiling data and AlphaFold2‐predicted 3D protein structure information to investigate the association pattern between translation velocity and protein structure features in 12 organisms (Supporting Information S2: Table S2) across different taxonomical clades, including bacteria (Escherichia coli and Pseudomonas aeruginosa), fungi (Saccharomyces cerevisiae and Candida albicans), protozoa (Trypanosoma brucei), nematode (Caenorhabditis elegans), plants (Arabidopsis and maize), insect (fruit fly), and mammals (human, rat, and mouse). For each organism, we analyze more protein features than in previous studies, including protein secondary structure elements, proline residue, positively charged amino acid, relatively accessible surface area (rASA), intrinsically disordered region (IDR) scores, and local contact order. We revealed that the association pattern between translation velocity and protein structure features differs across organisms, partially reflecting their taxonomical relationship. We found that the association between translation velocity and positively charged amino acids previously reported in yeast did not apply to prokaryotes. We also found that the association patterns for secondary structure, rASA, and local absolute contact order are widely conserved in most organisms.

RESULTS

Workflow of VeloPro

In this study, we developed a pipeline VeloPro by combining ribosome profiling and AlphaFold‐predicted protein 3D structure data and then investigated the association pattern between translation velocity and protein structure features. The workflow is described in Figure 1.

Figure 1.

Figure 1

Workflow of VeloPro. (A) Pipeline for translation velocity quantification including adapter trimming, mapping, calibration, and normalization. (B) Pipeline for protein structure features determination from AlphaFold. (C) Schematic of the comparative data analysis and visualization between translation velocity and protein structure features used in this study.

For ribosome profiling analysis, previous reports indicate that Ribo‐seq results are influenced by many factors, including growth conditions, cell or tissue heterogeneity, protocol‐specific biases, sequencing coverage depth and biases, experimental artifacts, and unknown organism‐specific inherent factors [27]. Moreover, previous research reported that some bioinformatic bias may also lead to conflicting conclusions [12]. Therefore, in our analysis, we adopted the unified bioinformatic analysis pipeline for all the Ribo‐seq data analyses and protein feature determination in all the organisms. Moreover, for the quantification of translation velocity through Ribo‐seq data analysis, we used stringent offset identification and calibration (Figure 1A).

In our workflow, we first checked the quality of FASTQ files by FastQC (Supporting Information S2: Table S3) and then performed read mapping. The resultant BAM files were subsequently fed into the calibration process to obtain the scaled footprints that were considered inversely correlated with the translation velocity (Figure 1A). Then, protein‐predicted 3D structure information from the AlphaFold database was used to determine the protein structure features including protein secondary structure elements, rASA, IDR scores, and local contact order (Figure 1B). Next, we performed the comparative analysis between translation velocity and protein structure features, including rASA, IDR scores, and local contact order, and metagene analysis for proline residue and positively charged amino acids (Figure 1C).

Quantification of translation velocity by Ribo‐seq analysis in diverse organisms

To quantify the translation velocity for the association and metagene analysis, we analyzed publicly available Ribo‐seq data sets (Supporting Information S2: Table S2). We selected 12 organisms in diverse taxonomical clades, including bacteria, protozoa, fungi, nematodes, insects, plants, and mammals. Depending on the organisms, 15–604 million reads were uniquely mapped to the reference sequences (Supporting Information S2: Tables S4 and S5). The distribution of the lengths of mapped reads varied in each organism (Supporting Information S1: Figure S1, Supporting Information S2: Table S4). To calibrate the position of RPF to the ribosomal P site in each organism, we used the MiMB_ribosome_profiling pipeline [28]. For each data set, we extracted reads whose lengths were most abundant in Supporting Information S1: Figure S1. Then, the offset for each read length was determined according to the manual's guidance. The determined offset values are summarized in Supporting Information S2: Table S4. After calibration, the coverage plots were generated around the start codon and stop codon, which exhibited three‐nucleotide periodicity within the coding region in all the organisms (Supporting Information S1: Figure S2). Such three‐nucleotide periodicity has been considered an indication of good data quality for performing codon‐resolution analysis of Ribo‐seq data [28]. We used this coverage information to calculate the scaled footprints as a measure of translation velocity at each codon position (Supplementary materials and methods). Fewer footprints were considered to indicate higher translation velocity, while more footprints were considered to indicate low translation velocity or ribosomal pausing.

Comparison of translation velocity between different secondary structure elements in diverse organisms

Proteins could be folded during the translation process, which is called the co‐translational folding process [29]. Some rare codons have been associated with co‐translational folding intermediates of a single domain or a secondary structure within a protein, suggesting pauses during protein synthesis could facilitate co‐translational folding, thereby promoting the formation of its native structures [30]. Here, we compared the translation velocity between different protein secondary structure elements to investigate their associations.

In both prokaryotes and most eukaryotes, the translation velocity in RNA sequence coding for coil regions was significantly slower than that coding for alpha‐helix and beta‐sheet regions (Figure 2), suggesting that ribosomes tend to stall in the RNA sequence coding for the unstructured regions than that coding for the structured regions. A previous study reported a higher stalling probability in RNA sequence coding for unstructured regions than that coding for structured regions, while the analysis was limited in yeast and humans [23]. We revealed that this pattern is consistent across broader organisms. Unexpectedly, the pattern in rat and T. brucei was the opposite; RNA sequence coding for coil regions exhibited faster translation velocity than that coding for alpha‐helix and beta‐sheet regions (Figure 2E,K).

Figure 2.

Figure 2

Comparisons of scaled footprints between the structured (i.e., α‐helix and β‐sheet) and coil regions in 12 organisms. p‐Values were calculated by two‐sided Wilcoxon rank‐sum test.

The effect of proline residue on translation velocity is widely conserved

The previous report has illustrated that proline amino acid acts as both a poor A‐site acceptor and peptidyl donor during the peptide bond formation process [31] and proline could slow down peptide bond formation in the translation process [32]. Many previous studies have indicated that polyproline stretches within an amino acid sequence could induce ribosome pausing in both prokaryotes and eukaryotes [26, 31]. However, few studies have analyzed the effect of proline residue on translation velocity in a single amino acid resolution.

In our metagene analysis for proline residue, we observed a strong peak around proline residue in yeast and humans suggesting that proline‐caused stalling phenomenon could be successfully detected by our strategy (Supporting Information S1: Figure S3C,J), which is consistent with the previous studies that proline residue leads to ribosome stalling events in human and yeast. In addition, we also observed one or more peaks around proline residue in many eukaryotes, such as C. albicans, Arabidopsis, maize, and fruit fly (Supporting Information S1: Figure S3). Intriguingly, we did not observe a peak around the proline residue in prokaryotes such as E. coli and P. aeruginosa (Supporting Information S1: Figure S3A,B) although previous research claimed that the proline–proline motif could cause ribosome stalling in bacteria [33].

Positively charged amino acids may impose a different effect on translation velocity between prokaryotes and eukaryotes

In a previous study, positively charged amino acids in a nascent peptide chain have been shown to strongly inhibit translation elongation speed in yeast [22].

Our metagene analysis in yeast agreed with the previous research (Supporting Information S1: Figure S4C), showing a strong positive peak around the site of positively charged amino acids. In addition, we observed similar positive peaks around positively charged amino acids in most eukaryotes such as T. brucei, C. albicans, fruit fly, Arabidopsis, maize, and humans (Supporting Information S1: Figure S4), suggesting that translation pausing by positively charged amino acids is widely conserved across eukaryotes. For prokaryotes, there are no previous reports regarding the effect of positively charged amino acids on translation velocity. We found that the positively charged amino acids exhibited a different manner in prokaryotes compared to that in eukaryotes; they indicated negative peaks in the metagene plot (Supporting Information S1: Figure S4A,B).

Correlation analysis between translation velocity and codon usage frequency among diverse organisms

Codon usage has been shown to affect local translation velocity [12]. Rare or nonoptimal codons could slow down the translation velocity, providing sufficient time for protein structures to fold correctly [34], thus fine‐tuning the co‐translational folding process [3, 12].

Our partial correlation analysis in E. coli, C. albicans, fruit flies, humans, rats, and mice exhibited a significant negative correlation between the scaled footprints and codon usage frequency (Figure 3; Supporting Information S2: Table S6), which was in agreement with the previous reports. However, we also observed an opposite result, that is, a significant positive correlation between codon usage frequency and scaled footprints in S. cerevisiae, C. elegans, and Arabidopsis (Figure 3; Supporting Information S2: Table S6). These results indicate that codon usage alone does not explain all the variations in translation velocity, suggesting the contribution of other unknown factors. To explore these factors, we performed the association analysis between protein features and translation velocity as described in the next sections.

Figure 3.

Figure 3

Heatmap of partial Spearman correlation coefficients between scaled footprints and protein structure features in 12 organisms. The mean of the partial correlation coefficients of all the analyzed genes is shown. Bonferroni‐corrected p‐value for multiple testing was calculated by one sample t‐test for the deviation of the mean from zero: *p < 0.05, **p < 2.2e−5, ***p < 2.2e−10, ****p < 2.2e−15. N.S, not significant. The raw values of the means and the Bonferroni‐corrected p‐values are shown in Supporting Information S2: Table S6.

Divergent association pattern between translation velocity and IDR scores in diverse organisms

IDR is defined as sequence segments that do not form specific 3D structures. They are often characterized by a high frequency of polar or charged amino acids with the lack of enough hydrophobic amino acids to enable cooperative folding [35, 36]. IDR plays a vital role in mediating interactions with other molecules by serving as the platform or scaffold [36, 37, 38]. In a previous study, the relationship between IDR and translation was investigated using the codon adaptation index (CAI) as a surrogate of the translation elongation rate [24]. They observed a significant negative correlation between CAI and protein disorder tendency in E. coli, S. cerevisiae, and C. elegans [24]. However, no study has shown direct evidence based on translation velocity measured by Ribo‐seq. Since our analysis showed the difference in translation velocity depending on secondary structures (Figure 2), we also explored whether IDR is associated with translation velocity.

In our partial correlation analysis, we observed a significant positive correlation between the scaled footprints and IDR scores in E. coli and S. cerevisiae (Figure 3; Supporting Information S2: Table S6), and this pattern was reminiscent of the previous CAI‐based findings [24]. However, in the bacteria clade, we also observed an opposite association pattern in P. aeruginosa compared to that in E. coli. In the fungi clade, both S. cerevisiae and C. albicans exhibited a positive correlation between scaled footprints and IDR scores. In plant clades, we observed opposite patterns between Arabidopsis (negative correlation) and maize (positive correlation).

In addition, we also explored the correlation between translation velocity and IDR score in different regions of proteins, including the N‐terminal, middle, and C‐terminal segments (Supporting Information S1: Figure S5; Supporting Information S2: Table S7). In the N‐terminal region, the correlation was overall weak, while a significantly positive correlation was detected for a few organisms. In the C‐terminal region, S. cerevisiae and C. albicans exhibited a significantly positive correlation, while higher eukaryotes such as Arabidopsis, fruit flies, and mice showed a significantly negative correlation. The correlation pattern in the middle region remained consistent with the analysis using the entire protein region.

In summary, the association between IDR scores and translation velocity was divergent among different organisms. This may reflect their evolutionary divergence while we could not make deeper interpretations.

Conserved association pattern of rASA and absolute contact order with translation velocity

The rASA is a measure of the solvent molecule contact of an amino acid residue in a protein 3D structure [39, 40]. Based on the rASA value, the interface of protein could be classified into core, rim, and support. Lower rASA indicates a buried residue inside a protein structure (core), while higher rASA indicates a residue exposed to solvent (rim) [39].

In our partial correlation analysis, most of the organisms exhibited a positive correlation with a significant p‐value between scaled footprints and rASA (Figure 3; Supporting Information S2: Table S6), suggesting that RNA sequence coding for buried regions is preferentially translated faster than that coding for the exposed region. An exception was the rat, which showed a negative correlation, being opposite to the other organisms.

Contact order is a measure of the locality of amino acid contacts in a protein 3D structure [41]. A larger value of contact order indicates that amino acids tend to get in contact with distant amino acids with large separations in a sequence. Contact order has been used to compare the topological difference and shown to correlate with many protein properties, such as protein folding rates and transition state placements [41, 42].

In our partial correlation analysis, we found that the scaled footprints are significantly negatively correlated with local absolute contact order in E. coli (Figure 3; Supporting Information S2: Table S6). This is consistent with the previous report that protein substructures with lower contact orders are associated with conserved rare codons in E. coli [43]. Furthermore, we found a significantly negative correlation between scaled footprints and local absolute contact order in most of the organisms except rats (Figure 3; Supporting Information S2: Table S6).

DISCUSSION

Translation velocity plays an essential role in regulating protein abundance and ensuring the protein's functional integrity through co‐translational folding [44]. Previous reports have indicated that translation velocity is not uniform along an mRNA sequence and tried to find sequence features determining these variations [1, 25]. However, most previous studies focused on the association between mRNA sequence features and translation velocity in specific model organisms. Recently, Tajima et al. [13] discovered the diversity and commonalities of mRNA sequence determinants for protein expression across nine microorganisms. However, they did not explore the associated protein structure features. The recent revolution in the prediction of protein 3D structure enables the proteome‐wide investigation of structural information in diverse organisms [45, 46]. In our study, by combining Ribo‐seq and AlphaFold‐predicted protein 3D structure data, we revealed many protein structure features associated with translation velocity. By analyzing the 12 organisms across various evolutionary clades, we also identified the features showing the conserved association pattern among many organisms while revealing several features with divergent association patterns. Thus, our study provides a comprehensive view of protein features correlated with translation velocity, which may affect the co‐translational folding process across the clades.

Overall, translation velocity patterns in mRNA sequence coding for structured (alpha‐helix and beta‐sheet) regions and disorder (coil) regions were widely observed in both prokaryotes and eukaryotes, which may reflect the evolutionary conservation and the same evolutionary pressure. This pattern suggests that translation velocity becoming slower in the mRNA sequence coding for disordered regions could ensure sufficient time for correct co‐translational folding in structured regions or domains. This finding is consistent with previous reports that the ribosome pausing sites located at protein domain boundaries facilitate and favor co‐translational folding, thus avoiding the potential misfolding [47].

In addition, the association pattern between rASA and translation velocity was also conserved in both prokaryotes and eukaryotes, suggesting that RNA sequence coding for exposed regions tends to be translated slower than that coding for buried regions in all the tested organisms except rats. In addition, this association pattern was also observed between translation velocity and local absolute contact order.

The association pattern in rats was different from the other mammals in many cases; for example, it showed lower‐scaled footprints in coil regions compared to alpha‐helix and beta‐sheet regions, while it was the opposite in the other mammals (Figure 2J–L). To explore whether these exceptional results are contingent on the data set, we conducted an additional analysis using another data set derived from rat brain, mouse kidney, E. coli, and human hela cell, respectively (Supporting Information S2: Table S2). Notably, the two data sets chosen for each organism originate from different research groups. Intriguingly, the association patterns for each organism were similar between the two data sets for most of the protein features including protein secondary structure, proline residue, positive charge, IDR scores, and local absolute contact order (Supporting Information S1: Figures S6S10). These results suggest that the extraordinary pattern in rats may not be caused by data set‐specific artifacts, but probably due to unknown inherent factors. A possible explanation is the more tissue‐specific nature of gene expression in higher eukaryotes. Our integrative analysis of Ribo‐seq and protein features requires genes with high read coverages (>60%); thus, the number of analyzed genes was relatively small in higher eukaryotes (Supporting Information S2: Table S4), which might affect the results in each data set.

Positively charged amino acids could strongly arrest ribosome movement, and this influence cannot be compensated by the mRNA structure and codon usage bias in yeast [22]. Moreover, a positive charge also induces ribosome pausing in the Arabidopsis chloroplast [25]. The most likely explanation is that positively charged residues in the nascent peptide chain could interact with the negatively charged exit tunnel walls, and these electrostatic interactions retard translation [22, 48]. In our analysis, we indeed observed a similar pattern in yeast and Arabidopsis; there were several peaks of scaled footprints around positively charged amino acids in yeast and one peak in Arabidopsis, suggesting that positively charged amino acids inhibit translation in yeast and Arabidopsis. On the contrary, our results suggested that positive charges may speed up the translation in prokaryotes such as E. coli and P. aerugisona (Supporting Information S1: Figure S4A,B). The possible explanation is that there is an evolutionary divergence between prokaryotes and eukaryotes in the geometry of the exit tunnel, which is induced by constricted regions (troughs in the exit tunnel) that determine the electrostatic potential [49]. This variation in electrostatic potential may lead to different association patterns between positively charged amino acid residues and translation velocity in diverse organisms.

Proline tends to cause ribosome pausing in both prokaryotes and eukaryotes, as the amino side chain of proline is known to act as both a poor A‐site acceptor and a poor donor in the P‐site in the peptidyl transfer reaction [50]. This is likely due to its restricted conformational flexibility, which impedes translational elongation, as reported in S. cerevisiae, Saccharomyces paradoxus, and zebrafish [27]. In addition, the effect of proline on ribosome stalling could be additive [23]; the translational stalling becomes strong at di‐proline motifs (XPPX) and polyproline stretches (XPPPX). However, the ribosome stalling at polyproline motifs could be rescued by the translation elongation factor EF‐P in bacteria [51], or initiation factor IF5A in eukaryotes [31, 52] and loss of function of EF‐P will lead to strong ribosome pausing at proline‐rich motifs in E. coli [53]. Both EP‐P and eIF5A could enhance the peptidyl transferase activity of the ribosome and facilitate the reactivity of poor substrates such as proline [52]. In our result (Supporting Information S1: Figure S3), the effect of the proline residue in prokaryotes and eukaryotes was different, possibly due to the protein function diversity in resolving proline‐induced ribosome stalling (EF‐P in bacteria and IF5A in eukaryotes).

In this study, we solely explored the relationship between translation velocity and protein structure features. Nonetheless, other relevant studies have indicated that the rate of protein folding is likewise correlated with protein structures such as protein size, shape, packing, and cross‐sectional radius [54, 55, 56]. Additionally, it is noteworthy that proteins exhibiting simple folding kinetics tend to fold more rapidly in bacterial systems compared to eukaryotic systems, which gives hints regarding the evolutionary divergence in protein structures [57].

CONCLUSION

In summary, we performed the association analysis between translation velocity and various protein features based on publicly available Ribo‐seq data. We revealed the conserved relationship between translation velocity and protein structure features (secondary structure elements, rASA, local absolute contact order), suggesting that organisms impose different translation velocities on protein domains to ensure correct protein folding. Our study could also provide insights into mRNA sequence design to enhance recombinant protein expression. Previous studies have only considered mRNA sequence features such as codon usage and mRNA secondary structures to design mRNA. Our study shows that protein structures are also associated with translation velocity, suggesting the possibility that protein structure features can also be incorporated to design mRNA to fine‐tune co‐translation folding. Our findings could contribute to recombinant protein expression in biomanufacturing fields.

AUTHOR CONTRIBUTIONS

Bian Bian and Yutaka Saito conceived this study and designed the method. Bian Bian, Toshitaka Kumagai, and Yutaka Saito wrote the codes. Bian Bian and Yutaka Saito analyzed and interpreted the data. Bian Bian and Yutaka Saito wrote and revised the manuscript. Yutaka Saito supervised this project. All authors have read the final manuscript and approved it for publication.

CONFLICT OF INTEREST STATEMENT

The authors declare no conflict of interest.

Supporting information

Supporting_information.

IMT2-2-e148-s001.docx (4.4MB, docx)

Supporting_information.

IMT2-2-e148-s002.xlsx (34.5KB, xlsx)

ACKNOWLEDGMENTS

This work was partly supported by JSPS KAKENHI (22H03691; 19K20409), AMED grant (JP19ak0101122), JST CREST (JPMJCR23N1), and JACI Prize for Encouraging Young Researcher. The computations were partially performed on the NIG supercomputer at ROIS National Institute of Genetics, and the SQUID supercomputer at Cybermedia Center, Osaka University through the HPCI System Research Project (hp230057).

Bian, Bian , Kumagai Toshitaka, and Saito Yutaka. 2023. “VeloPro: A Pipeline Integrating Ribo‐seq and AlphaFold Deciphers Association Patterns Between Translation Velocity and Protein Structure Features.” iMeta 2, e148. 10.1002/imt2.148

DATA AVAILABILITY STATEMENT

All data generated or analyzed in this study are included in this manuscript and Supporting Information. VeloPro pipeline is publicly available on GitHub (https://github.com/ytksailab-org/VeloPro). Supplementary materials (methods, figures, tables, scripts, graphical abstract, slides, videos, Chinese translated version, and updated materials) may be found in the online DOI or iMeta Science (http://www.imeta.science/).

REFERENCES

  • 1. Samatova, Ekaterina , Daberger Jan, Liutkute Marija, and Rodnina Marina V.. 2021. “Translational Control by Ribosome Pausing in Bacteria: How a Non‐uniform Pace  of Translation Affects Protein Production and Folding.” Frontiers in Microbiology 11: 619430. 10.3389/fmicb.2020.619430 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Buhr, Florian , Jha Sujata, Thommen Michael, Mittelstaet Joerg, Kutz Felicitas, Schwalbe Harald, Rodnina Marina V., and Komar Anton A.. 2016. “Synonymous Codons Direct Cotranslational Folding Toward Different Protein Conformations.” Molecular Cell 61: 341–351. 10.1016/j.molcel.2016.01.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Pechmann, Sebastian , and Frydman Judith. 2013. “Evolutionary Conservation of Codon Optimality Reveals Hidden Signatures of Cotranslational Folding.” Nature Structural & Molecular Biology 20: 237–243. 10.1038/nsmb.2466 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Wruck, Florian , Tian Pengfei, Kudva Renuka, Best Robert B., von Heijne Gunnar, Tans Sander J., and Katranidis Alexandros. 2021. “The Ribosome Modulates Folding Inside the Ribosomal Exit Tunnel.” Communications Biology 4: 523. 10.1038/s42003-021-02055-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Chaney, Julie L. , and Clark Patricia L.. 2015. “Roles for Synonymous Codon Usage in Protein Biogenesis.” Annual Review of Biophysics  44: 143–166. 10.1146/annurev-biophys-060414-034333 [DOI] [PubMed] [Google Scholar]
  • 6. Dabrowski‐Tumanski, Pawel , Piejko Maciej, Niewieczerzal Szymon, Stasiak Andrzej, and Sulkowska Joanna I.. 2018. “Protein Knotting by Active Threading of Nascent Polypeptide Chain Exiting from the Ribosome Exit Channel.” Journal of Physical Chemistry B 122: 11616–11625. 10.1021/acs.jpcb.8b07634 [DOI] [PubMed] [Google Scholar]
  • 7. Chwastyk, Mateusz , and Cieplak Marek. 2015. “Cotranslational Folding of Deeply Knotted Proteins.” Journal of Physics: Condensed Matter 27: 354105. 10.1088/0953-8984/27/35/354105 [DOI] [PubMed] [Google Scholar]
  • 8. Ingolia, Nicholas T. , Ghaemmaghami Sina, Newman John R. S., and Weissman Jonathan S.. 2009. “Genome‐Wide Analysis in Vivo of Translation with Nucleotide Resolution Using Ribosome Profiling.” Science 324: 218–223. 10.1126/science.1168978 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Ingolia, Nicholas T. 2016. “Ribosome Footprint Profiling of Translation throughout the Genome.” Cell 165: 22–33. 10.1016/j.cell.2016.02.066 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Wu, Colin Chih‐Chien , Zinshteyn Boris, Wehner Karen A., and Green Rachel. 2019. “High‐Resolution Ribosome Profiling Defines Discrete Ribosome Elongation States and Translational Regulation during Cellular Stress.” Molecular Cell 73: 959–970.e5. 10.1016/j.molcel.2018.12.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Mohammad, Fuad , Green Rachel, and Buskirk Allen R.. 2019. “A Systematically‐Revised Ribosome Profiling Method for Bacteria Reveals Pauses at Single‐Codon Resolution.” eLife 8: e42591. 10.7554/eLife.42591 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Yu, Chien‐Hung , Dang Yunkun, Zhou Zhipeng, Wu Cheng, Zhao Fangzhou, Sachs Matthew S., and Liu Yi. 2015. “Codon Usage Influences the Local Rate of Translation Elongation to Regulate Co‐translational Protein Folding.” Molecular Cell 59: 744–754. 10.1016/j.molcel.2015.07.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Tajima, Naoyuki , Kumagai Toshitaka, Saito Yutaka, and Kameda Tomoshi. 2021. “Comparative Analysis of the Relationship Between Translation Efficiency and Sequence Features of Endogenous Proteins in Multiple Organisms.” Genomics 113: 2675–2682. 10.1016/j.ygeno.2021.05.037 [DOI] [PubMed] [Google Scholar]
  • 14. Fluman, Nir , Navon Sivan, Bibi Eitan, and Pilpel Yitzhak. 2014. “mRNA‐Programmed Translation Pauses in the Targeting of E. coli Membrane Proteins.” eLife 3: e03440. 10.7554/eLife.03440 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Nakahigashi, Kenji , Takai Yuki, Shiwa Yuh, Wada Mei, Honma Masayuki, Yoshikawa Hirofumi, Tomita Masaru, Kanai Akio, and Mori Hirotada. 2014. “Effect of Codon Adaptation on Codon‐Level and Gene‐Level Translation Efficiency In Vivo.” BMC Genomics 15: 1115. 10.1186/1471-2164-15-1115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Pop, Cristina , Rouskin Silvi, Ingolia Nicholas T., Han Lu, Phizicky Eric M., Weissman Jonathan S., and Koller Daphne. 2014. “Causal Signals Between Codon Bias, mRNA Structure, and the Efficiency of Translation and Elongation.” Molecular Systems Biology 10: 770. 10.15252/msb.20145524 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Li, Gene‐Wei , Oh Eugene, and Weissman Jonathan S.. 2012. “The anti‐Shine‐Dalgarno Sequence Drives Translational Pausing and Codon Choice in Bacteria.” Nature 484: 538–541. 10.1038/nature10965 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Mohammad, Fuad , Woolstenhulme Christopher J., Green Rachel, and Buskirk Allen R.. 2016. “Clarifying the Translational Pausing Landscape in Bacteria by Ribosome Profiling.” Cell Reports 14: 686–694. 10.1016/j.celrep.2015.12.073 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Quax, Tessa E. F. , Wolf Yuri I., Koehorst Jasper J., Wurtzel Omri, van der Oost Richard, Ran Wenqi, Blombach Fabian, et al. 2013. “Differential Translation Tunes Uneven Production of Operon‐Encoded Proteins.” Cell Reports 4: 938–944. 10.1016/j.celrep.2013.07.049 [DOI] [PubMed] [Google Scholar]
  • 20. Mao, Yuanhui , Liu Huiling, Liu Yanlin, and Tao Shiheng. 2014. “Deciphering the Rules by which Dynamics of mRNA Secondary Structure Affect Translation Efficiency in Saccharomyces cerevisiae .” Nucleic Acids Research 42: 4813–4822. 10.1093/nar/gku159 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Qu, Xiaohui , Wen Jin‐Der, Lancaster Laura, Noller Harry F., Bustamante Carlos, and Tinoco Ignacio. 2011. “The Ribosome Uses Two Active Mechanisms to Unwind Messenger RNA during Translation.” Nature 475: 118–121. 10.1038/nature10126 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Charneski, Catherine A. , and Hurst Laurence D.. 2013. “Positively Charged Residues are the Major Determinants of Ribosomal Velocity.” PLoS Biology 11: e1001508. 10.1371/journal.pbio.1001508 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Zhang, Sai , Hu Hailin, Zhou Jingtian, He Xuan, Jiang Tao, and Zeng Jianyang. 2017. “Analysis of Ribosome Stalling and Translation Elongation Dynamics by Deep Learning.” Cell Systems 5: 212–220. 10.1016/j.cels.2017.08.004 [DOI] [PubMed] [Google Scholar]
  • 24. Zhou, Mian , Wang Tao, Fu Jingjing, Xiao Guanghua, and Liu Yi. 2015. “Nonoptimal Codon Usage Influences Protein Structure in Intrinsically Disordered Regions.” Molecular Microbiology 97: 974–987. 10.1111/mmi.13079 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Gawroński, Piotr , Jensen Poul Erik, Karpiński Stanisław, Leister Dario, and Scharff Lars B.. 2018. “Pausing of Chloroplast Ribosomes is Induced by Multiple Features and is Linked to the Assembly of Photosynthetic Complexes.” Plant Physiology 176: 2557–2569. 10.1104/pp.17.01564 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Sabi, Renana , and Tuller Tamir. 2017. “Computational Analysis of Nascent Peptides That Induce Ribosome Stalling and Their Proteomic Distribution in Saccharomyces cerevisiae .” RNA 23: 983–994. 10.1261/rna.059188.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Artieri, Carlo G. , and Fraser Hunter B.. 2014. “Accounting for Biases in Riboprofiling Data Indicates a Major Role for Proline in Stalling Translation.” Genome Research 24: 2011–2021. 10.1101/gr.175893.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Bartholomäus, Alexander , Kolte Baban, Mustafayeva Ayten, Goebel Ingrid, Fuchs Stephan, Benndorf Dirk, Engelmann Susanne, and Ignatova Zoya. 2021. “smORFer: A Modular Algorithm to Detect Small ORFs in Prokaryotes.” Nucleic Acids Research 49: e89. 10.1093/nar/gkab477 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Liutkute, Marija , Samatova Ekaterina, and Rodnina Marina V.. 2020. “Cotranslational Folding of Proteins on the Ribosome.” Biomolecules 10: 97. 10.3390/biom10010097 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Liu, Yi . 2020. “A Code within the Genetic Code: Codon Usage Regulates Co‐translational Protein Folding.” Cell Communication and Signaling 18: 145. 10.1186/s12964-020-00642-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Starosta, Agata L. , Lassak Jürgen, Peil Lauri, Atkinson Gemma C., Virumäe Kai, Tenson Tanel, Remme Jaanus, Jung Kirsten, and Wilson Daniel N.. 2014. “Translational Stalling at Polyproline Stretches is Modulated by the Sequence Context Upstream of the Stall Site.” Nucleic Acids Research 42: 10711–10719. 10.1093/nar/gku768 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Pavlov, Michael Y. , Watts Richard E., Tan Zhongping, Cornish Virginia W., Ehrenberg Måns, and Forster Anthony C.. 2009. “Slow Peptide Bond Formation by Proline and Other N‐Alkylamino Acids in Translation.” Proceedings of the National Academy of Sciences of the United States of America 106: 50–54. 10.1073/pnas.0809211106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Krafczyk, Ralph , Qi Fei, Sieber Alina, Mehler Judith, Jung Kirsten, Frishman Dmitrij, and Lassak Jürgen. 2021. “Proline Codon Pair Selection Determines Ribosome Pausing Strength and Translation Efficiency in Bacteria.” Communications Biology 4: 589. 10.1038/s42003-021-02115-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Rodnina, Marina V . 2016. “The Ribosome in Action: Tuning of Translational Efficiency and Protein Folding.” Protein Science 25: 1390–1406. 10.1002/pro.2950 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Uversky, Vladimir N. , Gillespie Joel R., and Fink Anthony L.. 2000. “Why Are ‘Natively Unfolded’ Proteins Unstructured under Physiologic Conditions?.” Proteins 41: 415–427. [DOI] [PubMed] [Google Scholar]
  • 36. Babu, M Madan . 2016. “The Contribution of Intrinsically Disordered Regions to Protein Function, Cellular Complexity, and Human Disease.” Biochemical Society Transactions 44: 1185–1200. 10.1042/bst20160172 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Van Roey, Kim , Uyar Bora, Weatheritt Robert J., Dinkel Holger, Seiler Markus, Budd Aidan, Gibson Toby J., and Davey Norman E.. 2014. “Short Linear Motifs: Ubiquitous and Functionally Diverse Protein Interaction Modules Directing Cell Regulation.” Chemical Reviews 114: 6733–6778. 10.1021/cr400585q [DOI] [PubMed] [Google Scholar]
  • 38. Tompa, Peter , Davey Norman E., Gibson Toby J., and Babu M. Madan. 2014. “A Million Peptide Motifs for the Molecular Biologist.” Molecular Cell 55: 161–169. 10.1016/j.molcel.2014.05.032 [DOI] [PubMed] [Google Scholar]
  • 39. Levy, Emmanuel D . 2010. “A Simple Definition of Structural Regions in Proteins and Its Use in Analyzing Interface Evolution.” Journal of Molecular Biology 403: 660–670. 10.1016/j.jmb.2010.09.028 [DOI] [PubMed] [Google Scholar]
  • 40. Kim, Hyunsoo , and Park Haesun. 2004. “Prediction of Protein Relative Solvent Accessibility with Support Vector Machines and Long‐Range Interaction 3D Local Descriptor.” Proteins 54: 557–562. 10.1002/prot.10602 [DOI] [PubMed] [Google Scholar]
  • 41. Shi, Yi , Zhou Jianjun, Arndt David, Wishart David S., and Lin Guohui. 2008. “Protein Contact Order Prediction from Primary Sequences.” BMC Bioinformatics 9: 255. 10.1186/1471-2105-9-255 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Ivankov, Dmitry N. , Garbuzynskiy Sergiy O., Alm Eric, Plaxco Kevin W., Baker David, and Finkelstein Alexei V.. 2003. “Contact Order Revisited: Influence of Protein Size on the Folding Rate.” Protein Science 12: 2057–2062. 10.1110/ps.0302503 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Zhao, Victor , Jacobs William M., and Shakhnovich Eugene I.. 2020. “Effect of Protein Structure  on Evolution of Cotranslational Folding.” Biophysical Journal 119: 1123–1134. 10.1016/j.bpj.2020.06.037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Saunders, Rhodri , and Deane Charlotte M.. 2010. “Synonymous Codon Usage Influences the Local Protein Structure Observed.” Nucleic Acids Research 38: 6719–6728. 10.1093/nar/gkq495 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Jumper, John , Evans Richard, Pritzel Alexander, Green Tim, Figurnov Michael, Ronneberger Olaf, Tunyasuvunakool Kathryn, et al. 2021. “Highly Accurate Protein Structure Prediction with AlphaFold.” Nature 596: 583–589. 10.1038/s41586-021-03819-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Varadi, Mihaly , Anyango Stephen, Deshpande Mandar, Nair Sreenath, Natassia Cindy, Yordanova Galabina, Yuan David, et al. 2022. “AlphaFold Protein Structure Database: Massively Expanding the Structural Coverage of Protein‐Sequence Space with High‐Accuracy Models.” Nucleic Acids Research 50: D439–D444. 10.1093/nar/gkab1061 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Weinberg, David E. , Shah Premal, Eichhorn Stephen W., Hussmann Jeffrey A., Plotkin Joshua B., and Bartel David P.. 2016. “Improved Ribosome‐Footprint and mRNA Measurements Provide Insights into Dynamics and Regulation of Yeast Translation.” Cell Reports 14: 1787–1799. 10.1016/j.celrep.2016.01.043 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Lu, Jianli , and Deutsch Carol. 2008. “Electrostatics in the Ribosomal Tunnel Modulate Chain Elongation Rates.” Journal of Molecular Biology 384: 73–86. 10.1016/j.jmb.2008.08.089 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Dao Duc, Khanh , Batra Sanjit S., Bhattacharya Nicholas, Cate Jamie H. D., and Song Yun S.. 2019. “Differences in the Path to Exit the Ribosome across the Three Domains of Life.” Nucleic Acids Research 47: 4198–4210. 10.1093/nar/gkz106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Doerfel, Lili K. , Wohlgemuth Ingo, Kothe Christina, Peske Frank, Urlaub Henning, and Rodnina Marina V.. 2013. “EF‐P is Essential for Rapid Synthesis of Proteins Containing Consecutive Proline Residues.” Science 339: 85–88. 10.1126/science.1229017 [DOI] [PubMed] [Google Scholar]
  • 51. Peil, Lauri , Starosta Agata L., Lassak Jürgen, Atkinson Gemma C., Virumäe Kai, Spitzer Michaela, Tenson Tanel, Jung Kirsten, Remme Jaanus, and Wilson Daniel N.. 2013. “Distinct XPPX Sequence Motifs Induce Ribosome Stalling, which is Rescued by the Translation Elongation Factor EF‐P.” Proceedings of the National Academy of Sciences of the United States of America 110: 15265–15270. 10.1073/pnas.1310642110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Gutierrez, Erik , Shin Byung‐Sik, Woolstenhulme Christopher J., Kim Joo‐Ran, Saini Preeti, Buskirk Allen R., and Dever Thomas E.. 2013. “eIF5A Promotes Translation of Polyproline Motifs.” Molecular Cell 51: 35–45. 10.1016/j.molcel.2013.04.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Woolstenhulme, Christopher J. , Guydosh Nicholas R., Green Rachel, and Buskirk Allen R.. 2015. “High‐Precision Analysis of Translational Pausing by Ribosome Profiling in Bacteria Lacking EFP.” Cell Reports 11: 13–21. 10.1016/j.celrep.2015.03.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Galzitskaya, Oxana V. , Reifsnyder Danielle C., Bogatyreva Natalya S., Ivankov Dmitry N., and Garbuzynskiy Sergiy O.. 2008. “More Compact Protein Globules Exhibit Slower Folding Rates.” Proteins 70: 329–332. 10.1002/prot.21619 [DOI] [PubMed] [Google Scholar]
  • 55. Glyakina, Anna V. , and Galzitskaya Oxana V.. 2020. “How Quickly Do Proteins Fold and Unfold, and What Structural Parameters Correlate with These Values?” Biomolecules 10: 197. 10.3390/biom10020197 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Ivankov, Dmitry N. , Bogatyreva Natalya S., Lobanov Michail Yu, and Galzitskaya Oxana V.. 2009. “Coupling Between Properties of the Protein Shape and the Rate of Protein Folding.” PLoS One 4: e6476. 10.1371/journal.pone.0006476 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Galzitskaya, O. V. , Bogatyreva N. S., and Glyakina A. V.. 2011. “Bacterial Proteins Fold Faster than Eukaryotic Proteins with Simple Folding Kinetics.” Biochemistry (Moscow) 76: 225–235. 10.1134/s000629791102009x [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting_information.

IMT2-2-e148-s001.docx (4.4MB, docx)

Supporting_information.

IMT2-2-e148-s002.xlsx (34.5KB, xlsx)

Data Availability Statement

All data generated or analyzed in this study are included in this manuscript and Supporting Information. VeloPro pipeline is publicly available on GitHub (https://github.com/ytksailab-org/VeloPro). Supplementary materials (methods, figures, tables, scripts, graphical abstract, slides, videos, Chinese translated version, and updated materials) may be found in the online DOI or iMeta Science (http://www.imeta.science/).


Articles from iMeta are provided here courtesy of Wiley

RESOURCES