Abstract
Translation involves the biosynthesis of a protein sequence following the decoding of the genetic information embedded in a messenger RNA (mRNA). Typically, the eukaryotic mRNA was considered to be inherently monocistronic, but this paradigm is not in agreement with the translational landscape of cells, tissues, and organs. Recent ribosome sequencing (Ribo-seq) and proteomics studies show that, in addition to currently annotated reference proteins (RefProt), other proteins termed alternative proteins (AltProts), and microproteins are encoded in regions of mRNAs thought to be untranslated or in transcripts annotated as non-coding. This experimental evidence expands the repertoire of functional proteins within a cell and potentially provides important information on biological processes. This review explores the hitherto overlooked alternative proteome in neurobiology and considers the role of AltProts in pathological and healthy neuromolecular processes.
Keywords: mRNA, non-coding RNA (ncRNA), neurobiology, polycistronic, proteome
Introduction
The quantitative composition of molecules in living systems is constantly changing (Cohn et al., 1976; Filo et al., 2019; Lehallier et al., 2019). Multiple mechanisms modulate the molecular landscape in response to a changing environment (Stadler et al., 2008; Geyer et al., 2016; Thiemicke et al., 2019). These variations in the molecular landscape are particularly reflected in the proteome and the interactome (Aebersold et al., 2018). Recently, the Human Proteome Organization reported a meritorious human proteome coverage of 90.4% (Adhikari et al., 2020). This coverage is based on the number of proteins with evidence of their existence at the protein level out of the total protein entries in the NeXtProt database. However, 75% of spectra in large-scale mass spectrometry-based proteomics studies are not matched to a specific protein (Griss et al., 2016). Although a fraction of spectra remains unidentified because of unknown post-translational modifications or low signal-to-noise events, another fraction of unmatched spectra likely results from the incompleteness of conventional protein sequence databases.
Protein synthesis is the process in which ribosomes read genetic instructions in a messenger RNA (mRNA) and assemble a chain of amino acids. Translation in eukaryotes comprises four main phases: initiation, elongation, termination, and recycling (Jackson et al., 2010; Aitken and Lorsch, 2012; Hinnebusch, 2014). The initiation process involves scanning of mRNA for the optimal initiation site and is hypothesized to be impacted by the length of the open reading frame (ORF) and by the position of an AUG codon. Seminal work from Kozak (1986) established that the consensus sequence GCCRCCAUGG (in which R is a purine, and AUG is the start codon) is the preferred nucleotide sequence for translation initiation. The longest ORF of a transcript is typically considered the only protein coding sequence (CDS) and codes for the reference protein (RefProt) (Furuno et al., 2003). As such, a consensus exists that the majority of eukaryotic mRNAs are monocistronic, implying translation of one distinct protein sequence per transcript. An additional arbitrary criterion for annotating protein-coding ORFs is that transcripts derived from pseudogenes, and transcripts that do not contain an ORF of >100 codons are automatically considered non-coding. The advent of ribosome sequencing (Ribo-seq), a technique that identifies ORFs undergoing translation, shed light on the shortcomings of previous annotations since a large fraction of translated ORFs do not correspond to known CDSs but to unannotated ORFs (Ingolia, 2014). In fact, large-scale Ribo-seq studies and proteogenomics studies including alternative proteins (AltProts) coded by unannotated ORFs identified novel proteins dubbed AltProts (Menschaert et al., 2013; Vanderperre et al., 2013; Ji et al., 2015; Chen et al., 2020). These proteins are sometimes also termed microproteins when they are shorter than 100 amino acids. Here, we will use the term AltProt independently of the length of the unannotated protein. AltProts are translated from regions previously annotated as non-coding, i.e., 5′- or 3′-UTR of mature mRNAs, from ORFs that overlap the CDS in a different reading frame, or from transcripts that were labeled as non-coding. Hence, AltProts are novel proteins and should not be confounded with protein isoforms (Figure 1). Some of these novel proteins fulfill key (patho) physiological functions and thus expand the functional proteome landscape in human cells (Gagnon et al., 2021).
FIGURE 1.
Diversity in the location of protein-coding sequences (CDSs). (A) In a typical messenger RNA (mRNA), the canonical protein CDS, or CDS (dark blue) encodes the reference protein (RefProt) and is flanked by the 5′- and 3′- untranslated regions (UTRs). (B) Protein isoforms from the RefProt (light blue) may result from translation initiation at upstream or downstream initiation sites, or from the translation of alternatively spliced transcripts. (C) In addition to the conventional CDS, an mRNA may carry other CDSs or alternative open reading frames (ORFs) (purple), alternative ORFs (AltORFs) are present in regions of the transcriptome previously annotated as non-coding: 5′- and 3′-UTR, overlapping the CDS out-of-frame, and transcripts annotated as non-coding RNA (ncRNA).
In this review, we analyze and explore AltProts in neurobiology. To begin, we discuss the alternative proteome and briefly review transcription and translation in the brain. Then we discuss the potential mechanisms elemental to aberrant translation initiation and the impact of AltProts on protein diversity. Finally, we review identified AltProts in the brain and discuss their potential role in pathological or healthy neuromolecular processes.
The alternative proteome
In the past 20 years, multiple mRNAs were found to not only code for a RefProt but also for a novel distinct protein sequence (dual-coding mRNA) (Klemke et al., 2001; Kondo et al., 2007; Autio et al., 2008; Akimoto et al., 2013; Chalick et al., 2016; Delcourt et al., 2017; Samandi et al., 2017). This led to the identification of novel functional proteins (e.g., AltMRVI1, AltSLC35A4, AltCDKN2, and PEP7) (Ouelle et al., 1995; Vanderperre et al., 2013; Andreev et al., 2015; Yosten et al., 2016). These unexpected translation events were considered exceptional and the polycistronic features of mRNA were disregarded in proteomics and functional genomics studies. As such, functional studies of a specific gene are focused on the protein encoded by the reference CDS, or by their isoforms expressed from variably spliced transcripts. Similarly, large-scale mass spectrometric-based proteomics methods rely on conventional annotations that do not include AltProts, precluding their discovery. Indeed, protein identification involves the digestion of proteins followed by the detection of peptide fragments and their corresponding experimental spectra. Subsequently, the data is compared to theoretical spectra that are generated in silico, based on protein sequence databases. Identified peptides are then matched to their corresponding proteins. Hence, “identification” of proteins in a biological sample means uncovering the presence of previously annotated proteins rather than identifying the entire set of proteins in the sample. The most popular protein sequence databases are those from UniProtKB (The UniProt Consortium, 2017, 2021). As indicated above, proteins translated from alternative ORFs (AltORFs) are not included in these databases and cannot be detected.
A more comprehensive approach was required to test the hypothesis of the widespread existence of AltProts. Ribosome profiling is the first technique that enables the discovery of ORFs being translated (Brar and Weissman, 2015). This approach sequences ribosome-protected mRNA fragments and thus reveals ribosome positions on these transcripts. As such, Ribo-seq can establish deep insight into global protein translation. However, ribosome occupancy alone is not sufficient to annotate ORFs as a genuine CDS (Guttman et al., 2013). In addition, ORFs overlapping a CDS are difficult to identify with a usual Ribo-seq approach (Bazzini et al., 2014; Smith et al., 2014). To overcome these shortcomings, strategies were developed to stall the initiating ribosomes at the start codon and sequence the protected RNA fragment at the initiation site (Ingolia et al., 2011; Gao et al., 2015). Overall, Ribo-seq led to the detection of thousands of translated AltORF and protein isoforms, in addition to annotated CDSs (Ji et al., 2015; Chen et al., 2020). Isoforms are shorter or longer versions of any RefProt following initiation at an in-frame downstream or upstream initiation codon. AltProts on the other hand, have distinct amino acid sequences that do not share significant similarities with the RefProt translated from the same transcript. The stoichiometry between proteins translated from the same mRNA is largely unknown, but some AltProts were determined to be the main gene product (Delcourt et al., 2018). Translation of these novel proteins can initiate at near-cognate (non-AUG) codons, further expanding the repertoire of prospective protein-coding-ORFs (Figure 2; Chen et al., 2020). The capacity of ORFs starting with near-cognate start codons to produce functional proteins has been demonstrated earlier (e.g., MYC) (Hann et al., 1988).
FIGURE 2.
The generation of protein diversity at the translational level. In contrast to the monocistronic transcript paradigm, protein diversity arises from the multi-coding properties of messenger RNA (mRNA) that expand the functional proteome. (A) The Reference protein (RefProt) is translated from the coding sequence (CDS) that is initiated by an AUG codon and is enclosed by the 3′- and 5′-UTR. Alternative proteins (AltProts) are translated from unannotated open reading frame (ORFs) that can initiate at AUG, CUG, GUG, or UUG codons. (B) These ORFs can overlap the CDS which will generate distinct protein sequences when the ORFs are out-of-frame. (C) In addition, translation from ORFs located within the 3′- or 5′-UTR, or from an out-of-frame ORF nested within the CDS will generate distinct protein sequences. (D) Finally, AltProts can be translated from transcripts that were previously considered non-coding. This perception can significantly expand protein diversity within cells.
Of note, the majority of AltProts are shorter than 100 amino acids. In general, small proteins, or peptides, are assumed to not be able to fold into stable structures. As such, they are generally considered non-functional and are therefore excluded from proteome annotations. Yet, small proteins can fulfill key physiological functions (Couso and Patraquim, 2017). Multiple functional endogenous peptides have been detected in the human body (Pauli et al., 2014; Huang et al., 2017; Lin et al., 2019). Of particular note, functional peptides are enriched in the brain. The 38–42 amino acid long Amyloid-beta peptides are involved in the pathogenesis of Alzheimer’s disease, but also display physiological functions, including repairing leaks in the blood-brain barrier (Brothers et al., 2018). The 33 amino acid long peptide Orexin-A is a functional peptide involved in sleep and wakefulness and has recently been linked to dysfunction of hippocampal neurogenesis (Lin et al., 1999; Forte et al., 2021). In fact, neuropeptides are that enriched in the brain that they are appreciated as a specific class (Burbach, 2010). These peptides are generated by proteolytically processing of preproproteins and should therefore not be confounded with AltProts, which are translated directly from mRNA. The contribution of neuropeptides to the functional neuroproteome is a substantial precedent for further exploring small proteins and their function in the brain.
The convoluted network of transcription and translation in neurobiology
The human brain is an extraordinarily intricate organ with neurons that are connected into vast networks by synapses. At the genomic level, the brain is characterized by an unusually high level of alternative splicing compared to other tissues in the body. Alternative splicing is the process in which pre-mRNA derived from a multi-exonic gene, produces variably spliced transcripts. The expression of these isoforms is restricted to certain tissues and cell types (Shalek et al., 2013). The transcriptome of the human brain exhibits a high number of splicing variants, which significantly increases the transcriptomic and proteomic diversity encoded by the same gene (Yeo et al., 2004; Melé et al., 2015). In approximately half of the alternative splicing events the reading frame is shifted, but these transcripts are often subjected to nonsense-mediated decay due to the presence of a premature termination codon (Lewis et al., 2003; Zhang et al., 2007). As such, the majority of translated isoforms mainly exhibit partial common amino acid sequences. Yet, they can have vastly different interaction profiles and/or an altered subcellular localization (Yang et al., 2016; Yang and Carstens, 2017).
Another specific feature in the brain is found at the translational level: there is no correlation between mRNA levels and the concentration of the corresponding proteins, with the exception of protein classes involved in protein modification, regulation of metabolic and synaptic activity, and proteins associated with a cellular membrane (Liu et al., 2016; Bauernfeind and Babbitt, 2017). The fact that factors beyond mRNA determine protein concentration suggests a prominent role for the regulation of protein translation in the brain. A recent study established that the natural anti-sense transcript, MAPT Antisense RNA 1 (MAPT-AS1), regulates and represses tau production within brain cells (Simone et al., 2021). Silencing MAPT-AS1 led to increased tau levels in neurons and was found to correlate with tau pathologies. The mammalian-wide interspersed repeat, embedded in the anti-sense transcript, competes for ribosomal pairing with the internal ribosomal entry site of MAPT, and thus blocks the translation of tau. The same study reports the involvement of other natural anti-sense transcripts in the regulation of Amyloid-beta precursor protein (APP), which is associated with Alzheimer’s disease, and α-synuclein which is associated with Parkinson’s disease and Lewy body dementia (Spillantini et al., 1997; Jellinger, 2018; Hampel et al., 2021). This finding shows how the translation of proteins involved in neurological disorders can potentially be controlled through the regulation of the interaction between the mRNA and the ribosome.
Alternative modes of translation initiation
Reference proteins are generally translated by canonical cap-dependent translation, but the recent discovery of AltProts suggests alternative modes of translation. We provide a brief overview of initiation modes that might be elemental to the translation of AltProts (Figure 3). Non-canonical translation is often linked to survival mechanisms as a response to cellular stress (Bertram et al., 2008; Kwan and Thompson, 2019; Bohlen et al., 2020). Alternative modes of translation are therefore often linked to cancer (Sriram et al., 2018). We believe that cellular stresses may induce AltProt translation in pathologies or disturb the stoichiometry between AltProt and RefProt translation from a specific transcript. Therefore, AltProts might play a role in diseases in which oxidative stress is involved in their pathogenesis such as several types of dementia (Butterfield and Halliwell, 2019).
FIGURE 3.
Translation initiation modes. (A) Canonical cap-dependent translation; the 43S ribosomal subunit is recruited by the 5′ end of an messenger RNA (mRNA), and subsequently binds to the mRNA and scans for the AUG start codon. The ribosome is dissociated after the translation of the protein. (B) Translation re-initiation follows the same steps as a cap-dependent translation but after translation of the protein, the 40S ribosomal subunit stays associated with the mRNA. While moving in the 3′ direction, the 40S ribosomal subunit can bind initiation factors recovering its initiation potential. As such, translation initiation can take place at a downstream start codon. (C) Leaky scanning; 43S ribosomal subunits can skip potential start codons when they are located in a weak Kozak sequence. (C1) When the start codons are in the same reading frame and there is no stop codon in between, this can result in the translation of two isoforms from the same transcript. Alternatively, when there is a stop codon in between this will result in the translation of two distinct proteins from the same transcript. (C2) When the start codons are out-of-frame two distinct proteins will be translated from the same transcript.
Canonical cap-dependent translation
Reference proteins are generally translated by a mechanism that is dependent on both the 7-methylguanosine cap (m7G), which is located at the 5′ end of an mRNA, and ribosome scanning (Aitken and Lorsch, 2012). The process starts when transfer-RNA (tRNA) and Guanosine-5′-triphosphate (GTP) bind to the eukaryotic initiation factor 2 (eIF2) complex, yielding a ternary complex that assembles with the small ribosomal subunit (40S) (Kimball, 1999). Together with other initiation factors the 40S forms the pre-initiation complex (43S). The 43S complex is then recruited to the 5′ end of an mRNA marked with the m7G cap and the eukaryotic initiation factor 4F (eIF4F) complex among other factors (Fraser, 2015). The 43S complex subsequently scans the 5′-UTR for a start codon within the correct nucleotide sequence. Scanning stops at the start codon and the initiation factors are released. The large ribosomal subunit (60S) then binds to the 40S complex on the mRNA to form the 80S ribosome to enable translation elongation.
Leaky scanning
Leaky scanning can occur when multiple potential start codons are present on the transcript and the first start codon is not in a strong Kozak context (Wang and Rothnagel, 2004; Palam et al., 2011). As such, the 43S complex may initiate at the first start codon, or skip the first start codon, continue scanning and initiate at a downstream start codon. If the start codons are in the same reading frame and there is no stop codon in-between, two protein isoforms are produced. Alternatively, if there is a stop codon in-between the start codons, or if the start codons are out-of-frame, this will result in distinct proteins. The frequency of leaky scanning therefore determines the stoichiometry between the RefProt and the AltProt or between isoforms that are translated from the same mRNA.
Translation re-initiation
Approximately half of all human mRNA contains at least one small ORF upstream of the CDS, also termed an upstream ORF (uORF) (Calvo et al., 2009). Translation re-initiation can occur in transcripts that exhibit start codons in a strong Kozak sequence. The initiation starts at the uORF and the protein is translated. After translation of the uORF, the 40S complex can remain associated with the mRNA. While moving in the 3′ direction, the 40S subunit can recover initiation capacity by binding the initiation factors. In the absence of the initiation factors, the 40S complex may skip subsequent start codons. As such, translation re-initiation results in the translation from downstream initiations sites.
Alternative proteins and the regulation of reference protein translation
Upstream open reading frames (uORFs) were initially thought to exclusively act as negative regulators for the translation of the RefProt. In this mechanism, the translation of the RefProt is inhibited because the ribosome is dissociated into the small and the large subunit before reaching the CDS. Alternatively, ribosomes can be stalled at the uORF and therefore block the translation of subsequent ORFs (Young et al., 2016). In both translation initiation modes, leaky scanning and re-initiation, proteins can be translated from the 5′-UTR. Recently, a large-scale study established that disruption of upstream translation is associated with human disease (Lee et al., 2021). Thus far, it is not clear if translated uORFs that regulate RefProt expression in addition exhibit the capacity to produce functional AltProts, or if they should be considered as a separate subset of proteins.
Somatic mutation as a possible mechanism for de novo generation of protein-coding sequences
Somatic mutations are variations in the DNA that are acquired after conception (Biesecker and Spinner, 2013). These postzygotic variants can generate genetically distinct cells within a single organism and are known to appear and accumulate in the brain during development and aging. This might be due to errors in DNA replication, or defects in DNA repair mechanisms induced by extensive oxidative stress (Wang et al., 2014; Maynard et al., 2015). The accumulation of somatic mutations in neurons during aging, or genosenium, has been linked to dysregulation of tau phosphorylation and Alzheimer’s disease (Park et al., 2019; Miller et al., 2021). The mutations include synonymous and non-synonymous single-nucleotide variants (SNVs). Synonymous SNVs are in general disregarded because they do not change the amino acid sequence of the protein. However, synonymous variations in the reading frame of the CDS may be non-synonymous and have a great impact on an overlapping AltORF (Brunet et al., 2021a). Furthermore, variations of low significance within untranslated regions (UTRs) may have a significant impact if they result in a missense mutation within AltORFs. Finally, variations may create translation initiation sites, resulting in a functional ORF coding a novel AltProt (Figure 4).
FIGURE 4.

Somatic mutation, more specifically synonymous single-nucleotide variants (SNVs), as a candidate mechanism for de novo generation of protein-coding sequences (CDS). (A) Messenger RNA (mRNA) transcribed from the reference DNA, (B) mRNA transcribed from DNA with an SNV on the third nucleotide. The introduction of an adenine before uracil and guanine, respectively, forms an AUG codon on which initiation potentially takes place.
Impact of alternative proteins on proteomic- and functional diversity
The field of functional genomics uses genomics, transcriptomics, proteomics, and interactomics analyses to better understand the genotype to phenotype relationship. The discovery of unannotated ORFs adds another level of complexity. OpenProt is a proteogenomic resource including AltProts and novel isoforms in addition to RefProts. This resource uses a comprehensive prediction method based on a 3-frame in silico translation of transcripts retrieved from Ensembl and NCBI RefSeq. The output yields an exhaustive database of predicted proteins coded by any ORF with a minimum size of 30 codons in the transcriptome of 10 species. In the latest release (OpenProt v1.6), OpenProt also includes the initiation of AltProts at non-AUG codons (Brunet et al., 2021b). Based on the re-analyses of mass spectrometry-based proteomics and ribosome profiling data, OpenProt provides expression evidence for more than 40,000 AltProts in humans, largely expanding the proteome listed in conventional databases. As more studies use the OpenProt annotation to discover new proteins, it is expected that the number of AltProts will increase accordingly. OpenProt does not annotate microproteins smaller than 30 amino acids. As such, very small peptides like PEP7 would remain undetected using OpenProt databases (Yosten et al., 2016). Hence, determining the contribution of very small proteins to the proteome is still a challenge. Elucidating the function of thousands of AltProts is not an easy endeavor. Investigations on small proteins comes also with significant technological challenges. Despite these challenges, we predict that including AltProts in proteomics studies will likely help discover biomarkers for precision medicine.
Exploring the alternative neuroproteome
Alternative proteins elude detection in the brain because they are not included in protein databases and the potential dual-coding feature of mRNAs and coding potential of RNAs annotated as non-coding are overlooked. However, there are some substantial arguments to investigate the existence of these novel neuroproteins. First, weak Kozak sequences are enriched in genes involved in neurobiology, which might lead to increased leaky scanning and downstream initiation events (Acevedo et al., 2018). Second, approximately 40% of long non-coding RNAs (lncRNAs) are specifically expressed in the brain, which increases the potential translation of AltProts from ncRNA (Derrien et al., 2012). Finally, somatic mutations can potentially create a translation initiation site for a new AltORF. As such, AltProts might be translated in age-related neurodegenerative disorders. Although no proteome discovery study specifically targeted the alternative neuroproteome, multiple AltProts have been detected in the brain. Humanin is translated from a short ORF from mitochondrial ribosomal RNA (mt rRNA). It was detected serendipitously in surviving brain cells of familial Alzheimer’s patients and was found to protect against Amyloid-beta toxicity (Hashimoto et al., 2001a,b; Tajima et al., 2002). In addition, it is involved in multiple neuroprotective processes such as inhibition of inflammatory responses, protection against neuronal cell death, and prevention of synapse loss (Sponne et al., 2004; Zhao et al., 2013; Zárate et al., 2019). Due to its cytoprotective properties it is now considered a therapeutic target for degenerative diseases (Zuccato et al., 2019).
Several studies identified AltProts in the brain encoded in dual-coding transcripts. For instance, the alternative prion protein (AltPrP) is translated from an out-of-frame ORF (+3 reading frame) nested within the annotated CDS of the prion protein (PRNP) locus (Vanderperre et al., 2011). The RefProt, the PrP is involved in the pathogenesis of transmissible spongiform encephalopathies (TSEs). The function of AltPrP is unknown, but the increased expression after proteasome inhibition and endoplasmic reticulum stress might indicate its biomarker potential in TSEs or other neurodegenerative diseases like Parkinson’s and Alzheimer’s (Yoshida, 2007; Salminen et al., 2009; Vanderperre et al., 2011). The transcript coding for the A2A Adenosine receptor (A2AR) is also dual-coding (Lee et al., 2014). A2AR is a drug target for multiple neurodegenerative disorders like Parkinson’s and Huntington’s (Chiang et al., 2009; Kulisevsky and Poyurovsky, 2012). The AltProt AltA2AR is translated from an out-of-frame ORF (+2 reading frame) that starts in the 5′-UTR and partially overlaps the CDS. Stimulation of A2AR increased translation of the AltProt, and upregulation of A2AR transcripts led to increased translation of both the RefProt and the AltProt. AltA2AR was found to be involved in the modulation of the expression of several genes in the mitogen-activated protein kinase (MAPK) pathway. The AltProt AltAtaxin-1 is expressed from the same mRNA as Ataxin-1 but is translated from an out-of-frame (+3 reading frame) ORF that is nested within the CDS (Bergeron et al., 2013). Ataxin-1 is involved in the neurodegenerative disorder spinocerebellar ataxia type 1 and is moreover associated with increased Alzheimer’s disease risk (Orr et al., 1993; Bertram et al., 2008; Suh et al., 2019). The AltProt is expressed in normal and in pathological conditions but whether it has a function in spinocerebellar ataxia type 1 is unknown. More recently, a transcript from the FUS gene was found to be bicistronic. AltFUS is translated from an out-of-frame ORF nested within the CDS (Brunet et al., 2021a). The RefProt FUS has been linked to the neurodegenerative diseases amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (Deng et al., 2014; Nolan et al., 2016). AltFUS was found to induce motor neuron toxicity. Thus, the RefProt and AltProt contribute both to FUS-mediated toxicity (Brunet et al., 2021a). Finally, the presence of AltProts was demonstrated in extracellular vesicles (EVs) produced by glioma cells that can contribute to carcinogenesis (Murgoci et al., 2020). In total, six AltProts were identified among other proteins involved in tumor progression. This study not only highlights the usefulness of targeting glioma EVs for diagnosis but also indicates the biomarker potential for AltProts.
Perspective
Ribosome sequencing studies shifted the paradigm of RNA expression in eukaryotes by identifying novel translated ORFs in regions of transcripts previously believed to be non-coding. Some of these novel proteins are stable, bioactive, and/or functionally independent of the RefProt translated from the same transcript. Yet, conventional protein sequence databases do not include AltProts because their role is not well understood, which in turn hampers subsequent research for determining their function.
The neurotranscriptome is characterized by a large fraction of canonical translation initiation sites with a weak Kozak sequence and a significant number of lncRNA. These two features suggest that neuronal AltORFs are numerous and their translation may be increased. Ribo-seq on the neurotranscriptome and proteomics studies on the neuroproteome would help determine the extent of the translation of AltORFs. Non-canonical translation events are linked to cellular stress responses, which might indicate their implication in carcinogenesis and neurodegenerative diseases like Alzheimer’s, Parkinson’s, TSEs, and ALS. There are several examples in the literature of functional AltProts in the brain with functions associated with normal or pathological states. This novel group of proteins can therefore be important to help unravel (patho)physiological functions, and potentially represents novel biomarkers in neurodegenerative diseases and therapeutic targets. The field of AltORFs and AltProts is recent, and much work needs to be done to determine the role of AltProts in the nervous system (Figure 5). As such, efforts to advance research on non-canonical ORF will likely advance the field (Mudge et al., 2021).
FIGURE 5.
The neuroproteome vs. the alternative neuroproteome. This value proposition of the alternative neuroproteome highlights the key findings in this exploratory review article. The alternative neuroproteome is an undiscovered area that can increase our understanding of the intricate neuroproteome and is a novel source of potential biomarkers and therapeutic targets. However, subsequent studies have to determine which alternative open reading frame (ORFs) are protein coding, and if the translated alternative protein (AltProt) is either functional, non-functional, and/or involved in the regulation of reference protein (RefProt) translation.
Very large databases, including those provided by OpenProt are useful for the discovery of AltProts but they are too large and cannot be routinely used with typical proteomic workflows. OpenProt allows subdivision by species, genes, transcripts, proteins, ncRNAs, mRNAs, or by the locations on mRNA (e.g., 5′-UTR, CDS, etc.). Additionally, it is possible to exclusively select AltProts with a minimum degree of experimental evidence. Currently, it is not possible to further divide ncRNA by biotype. Studies specifically targeting a specific biotype (e.g., processed pseudogenes) could leverage this additional filter, allowing usage of a restricted database. For example, ncRNA can be subdivided into short ncRNA, long ncRNA, and pseudogenes. Alternatively, a more comprehensive subdivision by biotype could be achieved by incorporating the biotype annotation from Ensembl/Havana or Gencode (Aken et al., 2016; Frankish et al., 2021). However, the most comprehensive approach for deciphering a cellular proteome would be to generate custom databases with RNA-sequencing data. In this approach, the protein sequence database is built from ORFs present in transcripts identified by RNA-seq rather than built with all predicted ORFs in the genome. In addition to providing a smaller database, RNA-seq data enables the identification of genetic variants.
The discrepancy between conventional annotations and experimental translational landscapes stresses the necessity for a better understanding of the mechanism of translation initiation and its regulation. In particular, transcripts may encode several proteins rather than a single protein as predicted from conventional annotations. A strong Kozak sequence is certainly an important feature for maximal translation of a specific ORF, yet a significant fraction of initiation sites, particularly in genes involved in neurobiology does not fit a consensus Kozak sequence.
Concluding remarks
The detection of novel functional ORFs is essential to further unravel the neuroproteome. The purpose of this review was to create awareness that assumptions in genome annotations have created an unintentional bias in proteomics research. Particularly in a convoluted environment like the human brain, it is important to consider the multi-coding potential of transcripts to relate gene expression and function in physiological and pathological settings. The identification and mapping of AltORFs and functional AltProts are elemental for widespread acceptance of the potential multi-coding properties of eukaryotic mRNA and to contradict the “non-coding” theorem.
Author contributions
PM, CH, and SL conceptually designed the work. PM, XR, CH, CD, and JV wrote the manuscript. PM made the illustrations. CH, XR, and SL reviewed the manuscript. All authors read, commented on the manuscript, and approved the final version.
Acknowledgments
We are grateful to all individuals for participating in this review. Figures were created with BioRender.com.
Funding
This project has received funding from the European Union’s Horizon 2020 Research and Innovation Programme under the Marie Skłodowska-Curie grant agreement No. 860197 MIRIADE.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
- Acevedo J. M., Hoermann B., Schlimbach T., Teleman A. A. (2018). Changes in global translation elongation or initiation rates shape the proteome via the Kozak sequence. Sci. Rep. 8:4018. 10.1038/s41598-018-22330-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Adhikari S., Nice E. C., Deutsch E. W., Lane L., Omenn G. S., Pennington S. R., et al. (2020). A high-stringency blueprint of the human proteome. Nat. Commun. 11:5301. 10.1038/s41467-020-19045-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aebersold R., Agar J. N., Amster I. J., Baker M. S., Bertozzi C. R., Boja E. S., et al. (2018). How many human proteoforms are there? Nat. Chem. Biol. 14 206–214. 10.1038/nchembio.2576 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aitken C. E., Lorsch J. R. (2012). A mechanistic overview of translation initiation in eukaryotes. Nat. Struct. Mol. Biol. 19 568–576. 10.1038/nsmb.2303 [DOI] [PubMed] [Google Scholar]
- Aken B. L., Ayling S., Barrell D., Clarke L., Curwen V., Fairley S., et al. (2016). The Ensembl gene annotation system. Database 2016:baw093. 10.1093/database/baw093 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Akimoto C., Sakashita E., Kasashima K., Kuroiwa K., Tominaga K., Hamamoto T., et al. (2013). Translational repression of the McKusick–Kaufman syndrome transcript by unique upstream open reading frames encoding mitochondrial proteins with alternative polyadenylation sites. Biochim. Biophys. Acta Gen. Subj. 1830 2728–2738. 10.1016/j.bbagen.2012.12.010 [DOI] [PubMed] [Google Scholar]
- Andreev D. E., O’Connor P. B., Fahey C., Kenny E. M., Terenin I. M., Dmitriev S. E., et al. (2015). Translation of 5’ leaders is pervasive in genes resistant to eIF2 repression. eLife 4:e03971. 10.7554/eLife.03971 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Autio K. J., Kastaniotis A. J., Pospiech H., Miinalainen I. J., Schonauer M. S., Dieckmann C. L., et al. (2008). An ancient genetic link between vertebrate mitochondrial fatty acid synthesis and RNA processing. FASEB J. 22 569–578. 10.1096/fj.07-8986 [DOI] [PubMed] [Google Scholar]
- Bauernfeind A. L., Babbitt C. C. (2017). The predictive nature of transcript expression levels on protein expression in adult human brain. BMC Genomics 18:322. 10.1186/s12864-017-3674-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bazzini A. A., Johnstone T. G., Christiano R., Mackowiak S. D., Obermayer B., Fleming E. S., et al. (2014). Identification of small ORFs in vertebrates using ribosome footprinting and evolutionary conservation. EMBO J. 33 981–993. 10.1002/embj.201488411 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bergeron D., Lapointe C., Bissonnette C., Tremblay G., Motard J., Roucou X. (2013). An out-of-frame overlapping reading frame in the ataxin-1 coding sequence encodes a novel ataxin-1 interacting protein*. J. Biol. Chem. 288 21824–21835. 10.1074/jbc.M113.472654 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bertram L., Lange C., Mullin K., Parkinson M., Hsiao M., Hogan M. F., et al. (2008). Genome-wide association analysis reveals putative Alzheimer’s disease susceptibility loci in addition to APOE. Am. J. Hum. Genet. 83 623–632. 10.1016/j.ajhg.2008.10.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Biesecker L. G., Spinner N. B. (2013). A genomic view of mosaicism and human disease. Nat. Rev. Genet. 14 307–320. 10.1038/nrg3424 [DOI] [PubMed] [Google Scholar]
- Bohlen J., Harbrecht L., Blanco S., Clemm von Hohenberg K., Fenzl K., Kramer G., et al. (2020). DENR promotes translation reinitiation via ribosome recycling to drive expression of oncogenes including ATF4. Nat. Commun. 11:4676. 10.1038/s41467-020-18452-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brar G. A., Weissman J. S. (2015). Ribosome profiling reveals the what, when, where and how of protein synthesis. Nat. Rev. Mol. Cell Biol. 16 651–664. 10.1038/nrm4069 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brothers H. M., Gosztyla M. L., Robinson S. R. (2018). The physiological roles of amyloid-β peptide hint at new ways to treat Alzheimer’s disease. Front. Aging Neurosci. 10:118. 10.3389/fnagi.2018.00118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brunet M. A., Jacques J.-F., Nassari S., Tyzack G. E., McGoldrick P., Zinman L., et al. (2021a). The FUS gene is dual-coding with both proteins contributing to FUS-mediated toxicity. EMBO Rep. 22:e50640. 10.15252/embr.202050640 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brunet M. A., Lucier J.-F., Levesque M., Leblanc S., Jacques J.-F., Al-Saedi H. R. H., et al. (2021b). OpenProt 2021: Deeper functional annotation of the coding potential of eukaryotic genomes. Nucleic Acids Res. 49 D380–D388. 10.1093/nar/gkaa1036 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burbach J. P. H. (2010). Neuropeptides from concept to online database www.neuropeptides.nl. Eur. J. Pharmacol. 626 27–48. 10.1016/j.ejphar.2009.10.015 [DOI] [PubMed] [Google Scholar]
- Butterfield D. A., Halliwell B. (2019). Oxidative stress, dysfunctional glucose metabolism and Alzheimer disease. Nat. Rev. Neurosci. 20 148–160. 10.1038/s41583-019-0132-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Calvo S. E., Pagliarini D. J., Mootha V. K. (2009). Upstream open reading frames cause widespread reduction of protein expression and are polymorphic among humans. Proc. Natl. Acad. Sci. U.S.A. 106 7507–7512. 10.1073/pnas.0810916106 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chalick M., Jacobi O., Pichinuk E., Garbar C., Bensussan A., Meeker A., et al. (2016). MUC1-ARF—a novel MUC1 protein that resides in the nucleus and is expressed by alternate reading frame translation of MUC1 mRNA. PLoS One 11:e0165031. 10.1371/journal.pone.0165031 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen J., Brunner A.-D., Cogan J. Z., Nuñez J. K., Fields A. P., Adamson B., et al. (2020). Pervasive functional translation of non-canonical human open reading frames. Science 367 1140–1146. 10.1126/science.aay0262 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chiang M.-C., Chen H.-M., Lai H.-L., Chen H.-W., Chou S.-Y., Chen C.-M., et al. (2009). The A2A adenosine receptor rescues the urea cycle deficiency of Huntington’s disease by enhancing the activity of the ubiquitin–proteasome system. Hum. Mol. Genet. 18 2929–2942. 10.1093/hmg/ddp230 [DOI] [PubMed] [Google Scholar]
- Cohn S. H., Vaswani A., Zanzi I., Aloia J. F., Roginsky M. S., Ellis K. J. (1976). Changes in body chemical composition with age measured by total-body neutron activation. Metabolism 25 85–95. 10.1016/0026-0495(76)90163-3 [DOI] [PubMed] [Google Scholar]
- Couso J.-P., Patraquim P. (2017). Classification and function of small open reading frames. Nat. Rev. Mol. Cell Biol. 18 575–589. 10.1038/nrm.2017.58 [DOI] [PubMed] [Google Scholar]
- Delcourt V., Brunelle M., Roy A. V., Jacques J.-F., Salzet M., Fournier I., et al. (2018). The protein coded by a short open reading frame, not by the annotated coding sequence, is the main gene product of the dual-coding gene MIEF1. Mol. Cell. Proteomics 17 2402–2411. 10.1074/mcp.RA118.000593 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Delcourt V., Franck J., Leblanc E., Narducci F., Robin Y.-M., Gimeno J.-P., et al. (2017). Combined mass spectrometry imaging and top-down microproteomics reveals evidence of a hidden proteome in ovarian cancer. EBioMedicine 21 55–64. 10.1016/j.ebiom.2017.06.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deng H., Gao K., Jankovic J. (2014). The role of FUS gene variants in neurodegenerative diseases. Nat. Rev. Neurol. 10 337–348. 10.1038/nrneurol.2014.78 [DOI] [PubMed] [Google Scholar]
- Derrien T., Johnson R., Bussotti G., Tanzer A., Djebali S., Tilgner H., et al. (2012). The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression. Genome Res. 22 1775–1789. 10.1101/gr.132159.111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Filo S., Shtangel O., Salamon N., Kol A., Weisinger B., Shifman S., et al. (2019). Disentangling molecular alterations from water-content changes in the aging human brain using quantitative MRI. Nat. Commun. 10:3403. 10.1038/s41467-019-11319-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Forte N., Boccella S., Tunisi L., Fernández-Rilo A. C., Imperatore R., Iannotti F. A., et al. (2021). Orexin-A and endocannabinoids are involved in obesity-associated alteration of hippocampal neurogenesis, plasticity, and episodic memory in mice. Nat. Commun. 12:6137. 10.1038/s41467-021-26388-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Frankish A., Diekhans M., Jungreis I., Lagarde J., Loveland J. E., Mudge J. M., et al. (2021). GENCODE 2021. Nucleic Acids Res. 49 D916–D923. 10.1093/nar/gkaa1087 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fraser C. S. (2015). Quantitative studies of mRNA recruitment to the eukaryotic ribosome. Biochimie 114 58–71. 10.1016/j.biochi.2015.02.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Furuno M., Kasukawa T., Saito R., Adachi J., Suzuki H., Baldarelli R., et al. (2003). CDS annotation in full-length cDNA sequence. Genome Res. 13 1478–1487. 10.1101/gr.1060303 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gagnon M., Savard M., Jacques J.-F., Bkaily G., Geha S., Roucou X., et al. (2021). Potentiation of B2 receptor signaling by AltB2R, a newly identified alternative protein encoded in the human bradykinin B2 receptor gene. J. Biol. Chem. 296:100329. 10.1016/j.jbc.2021.100329 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gao X., Wan J., Liu B., Ma M., Shen B., Qian S.-B. (2015). Quantitative profiling of initiating ribosomes in vivo. Nat. Methods 12 147–153. 10.1038/nmeth.3208 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Geyer P. E., Wewer Albrechtsen N. J., Tyanova S., Grassl N., Iepsen E. W., Lundgren J., et al. (2016). Proteomics reveals the effects of sustained weight loss on the human plasma proteome. Mol. Syst. Biol. 12:901. 10.15252/msb.20167357 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Griss J., Perez-Riverol Y., Lewis S., Tabb D. L., Dianes J. A., del-Toro N., et al. (2016). Recognizing millions of consistently unidentified spectra across hundreds of shotgun proteomics datasets. Nat. Methods 13 651–656. 10.1038/nmeth.3902 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guttman M., Russell P., Ingolia N. T., Weissman J. S., Lander E. S. (2013). Ribosome profiling provides evidence that large non-coding RNAs do not encode proteins. Cell 154 240–251. 10.1016/j.cell.2013.06.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hampel H., Hardy J., Blennow K., Chen C., Perry G., Kim S. H., et al. (2021). The amyloid-β pathway in Alzheimer’s disease. Mol. Psychiatry 26 5481–5503. 10.1038/s41380-021-01249-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hann S. R., King M. W., Bentley D. L., Anderson C. W., Eisenman R. N. (1988). A non-AUG translational initiation in c-myc exon 1 generates an N-terminally distinct protein whose synthesis is disrupted in Burkitt’s lymphomas. Cell 52 185–195. 10.1016/0092-8674(88)90507-7 [DOI] [PubMed] [Google Scholar]
- Hashimoto Y., Ito Y., Niikura T., Shao Z., Hata M., Oyama F., et al. (2001a). Mechanisms of neuroprotection by a novel rescue factor Humanin from Swedish mutant amyloid precursor protein. Biochem. Biophys. Res. Commun. 283 460–468. 10.1006/bbrc.2001.4765 [DOI] [PubMed] [Google Scholar]
- Hashimoto Y., Niikura T., Tajima H., Yasukawa T., Sudo H., Ito Y., et al. (2001b). A rescue factor abolishing neuronal cell death by a wide spectrum of familial Alzheimer’s disease genes and Aβ. Proc. Natl. Acad. Sci. U.S.A. 98 6336–6341. 10.1073/pnas.101133498 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hinnebusch A. G. (2014). The scanning mechanism of eukaryotic translation initiation. Annu. Rev. Biochem. 83 779–812. 10.1146/annurev-biochem-060713-035802 [DOI] [PubMed] [Google Scholar]
- Huang J.-Z., Chen M., Chen D., Gao X.-C., Zhu S., Huang H., et al. (2017). A peptide encoded by a putative lncRNA HOXB-AS3 suppresses colon cancer growth. Mol. Cell 68 171–184.e6. 10.1016/j.molcel.2017.09.015 [DOI] [PubMed] [Google Scholar]
- Ingolia N. T. (2014). Ribosome profiling: New views of translation, from single codons to genome scale. Nat. Rev. Genet. 15 205–213. 10.1038/nrg3645 [DOI] [PubMed] [Google Scholar]
- Ingolia N. T., Lareau L. F., Weissman J. S. (2011). Ribosome profiling of mouse embryonic stem cells reveals the complexity and dynamics of mammalian proteomes. Cell 147 789–802. 10.1016/j.cell.2011.10.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jackson R. J., Hellen C. U. T., Pestova T. V. (2010). The mechanism of eukaryotic translation initiation and principles of its regulation. Nat. Rev. Mol. Cell Biol. 11 113–127. 10.1038/nrm2838 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jellinger K. A. (2018). Dementia with Lewy bodies and Parkinson’s disease-dementia: Current concepts and controversies. J. Neural Transm. 125 615–650. 10.1007/s00702-017-1821-9 [DOI] [PubMed] [Google Scholar]
- Ji Z., Song R., Regev A., Struhl K. (2015). Many lncRNAs, 5’UTRs, and pseudogenes are translated and some are likely to express functional proteins. eLife 4:e08890. 10.7554/eLife.08890 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kimball S. R. (1999). Eukaryotic initiation factor eIF2. Int. J. Biochem. Cell Biol. 31 25–29. 10.1016/S1357-2725(98)00128-9 [DOI] [PubMed] [Google Scholar]
- Klemke M., Kehlenbach R. H., Huttner W. B. (2001). Two overlapping reading frames in a single exon encode interacting proteins—a novel way of gene usage. EMBO J. 20 3849–3860. 10.1093/emboj/20.14.3849 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kondo T., Hashimoto Y., Kato K., Inagaki S., Hayashi S., Kageyama Y. (2007). Small peptide regulators of actin-based cell morphogenesis encoded by a polycistronic mRNA. Nat. Cell Biol. 9 660–665. 10.1038/ncb1595 [DOI] [PubMed] [Google Scholar]
- Kozak M. (1986). Point mutations define a sequence flanking the AUG initiator codon that modulates translation by eukaryotic ribosomes. Cell 44 283–292. 10.1016/0092-8674(86)90762-2 [DOI] [PubMed] [Google Scholar]
- Kulisevsky J., Poyurovsky M. (2012). Adenosine A2A-receptor antagonism and pathophysiology of Parkinson’s disease and drug-induced movement disorders. Eur. Neurol. 67 4–11. 10.1159/000331768 [DOI] [PubMed] [Google Scholar]
- Kwan T., Thompson S. R. (2019). Noncanonical translation initiation in eukaryotes. Cold Spring Harb. Perspect. Biol. 11:a032672. 10.1101/cshperspect.a032672 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee C., Lai H.-L., Lee Y.-C., Chien C.-L., Chern Y. (2014). The A2A adenosine receptor is a dual coding gene. J. Biol. Chem. 289 1257–1270. 10.1074/jbc.M113.509059 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee D. S. M., Park J., Kromer A., Baras A., Rader D. J., Ritchie M. D., et al. (2021). Disrupting upstream translation in mRNAs is associated with human disease. Nat. Commun. 12:1515. 10.1038/s41467-021-21812-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lehallier B., Gate D., Schaum N., Nanasi T., Lee S. E., Yousef H., et al. (2019). Undulating changes in human plasma proteome profiles across the lifespan. Nat. Med. 25 1843–1850. 10.1038/s41591-019-0673-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lewis B. P., Green R. E., Brenner S. E. (2003). Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans. Proc. Natl. Acad. Sci. U.S.A. 100 189–192. 10.1073/pnas.0136770100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin L., Faraco J., Li R., Kadotani H., Rogers W., Lin X., et al. (1999). The sleep disorder canine narcolepsy is caused by a mutation in the hypocretin (Orexin) receptor 2 gene. Cell 98 365–376. 10.1016/S0092-8674(00)81965-0 [DOI] [PubMed] [Google Scholar]
- Lin Y.-F., Xiao M.-H., Chen H.-X., Meng Y., Zhao N., Yang L., et al. (2019). A novel mitochondrial micropeptide MPM enhances mitochondrial respiratory activity and promotes myogenic differentiation. Cell Death Dis. 10:528. 10.1038/s41419-019-1767-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu Y., Beyer A., Aebersold R. (2016). On the dependency of cellular protein levels on mRNA abundance. Cell 165 535–550. 10.1016/j.cell.2016.03.014 [DOI] [PubMed] [Google Scholar]
- Maynard S., Fang E. F., Scheibye-Knudsen M., Croteau D. L., Bohr V. A. (2015). DNA damage, DNA repair, aging, and neurodegeneration. Cold Spring Harb. Perspect. Med. 5:a025130. 10.1101/cshperspect.a025130 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Melé M., Ferreira P. G., Reverter F., DeLuca D. S., Monlong J., Sammeth M., et al. (2015). The human transcriptome across tissues and individuals. Science 348 660–665. 10.1126/science.aaa0355 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Menschaert G., Van Criekinge W., Notelaers T., Koch A., Crappé J., Gevaert K., et al. (2013). Deep proteome coverage based on ribosome profiling aids mass spectrometry-based protein and peptide discovery and provides evidence of alternative translation products and near-cognate translation initiation events*. Mol. Cell. Proteomics 12 1780–1790. 10.1074/mcp.M113.027540 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miller M. B., Reed H. C., Walsh C. A. (2021). Brain somatic mutation in aging and Alzheimer’s disease. Annu. Rev. Genomics Hum. Genet. 22 239–256. 10.1146/annurev-genom-121520-081242 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mudge J. M., Ruiz-Orera J., Prensner J. R., Brunet M. A., Gonzalez J. M., Magrane M., et al. (2021). A community-driven roadmap to advance research on translated open reading frames detected by Ribo-seq. bioRxiv [Preprint]. 10.1101/2021.06.10.447896 [DOI] [Google Scholar]
- Murgoci A.-N., Cardon T., Aboulouard S., Duhamel M., Fournier I., Cizkova D., et al. (2020). Reference and ghost proteins identification in rat C6 glioma extracellular vesicles. iScience 23:101045. 10.1016/j.isci.2020.101045 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nolan M., Talbot K., Ansorge O. (2016). Pathogenesis of FUS-associated ALS and FTD: Insights from rodent models. Acta Neuropathol. Commun. 4:99. 10.1186/s40478-016-0358-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Orr H. T., Chung M., Banfi S., Kwiatkowski T. J., Servadio A., Beaudet A. L., et al. (1993). Expansion of an unstable trinucleotide CAG repeat in spinocerebellar ataxia type 1. Nat. Genet. 4 221–226. 10.1038/ng0793-221 [DOI] [PubMed] [Google Scholar]
- Ouelle D. E., Zindy F., Ashmun R. A., Sherr C. J. (1995). Alternative reading frames of the INK4a tumor suppressor gene encode two unrelated proteins capable of inducing cell cycle arrest. Cell 83 993–1000. 10.1016/0092-8674(95)90214-7 [DOI] [PubMed] [Google Scholar]
- Palam L. R., Baird T. D., Wek R. C. (2011). Phosphorylation of eIF2 facilitates ribosomal bypass of an inhibitory upstream ORF to enhance CHOP translation. J. Biol. Chem. 286 10939–10949. 10.1074/jbc.M110.216093 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Park J. S., Lee J., Jung E. S., Kim M.-H., Kim I. B., Son H., et al. (2019). Brain somatic mutations observed in Alzheimer’s disease associated with aging and dysregulation of tau phosphorylation. Nat. Commun. 10:3090. 10.1038/s41467-019-11000-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pauli A., Norris M. L., Valen E., Chew G.-L., Gagnon J. A., Zimmerman S., et al. (2014). Toddler: An embryonic signal that promotes cell movement via apelin receptors. Science 343:1248636. 10.1126/science.1248636 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Salminen A., Kauppinen A., Suuronen T., Kaarniranta K., Ojala J. (2009). ER stress in Alzheimer’s disease: A novel neuronal trigger for inflammation and Alzheimer’s pathology. J. Neuroinflammation 6:41. 10.1186/1742-2094-6-41 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Samandi S., Roy A. V., Delcourt V., Lucier J.-F., Gagnon J., Beaudoin M. C., et al. (2017). Deep transcriptome annotation enables the discovery and functional characterization of cryptic small proteins. eLife 6:e27860. 10.7554/eLife.27860 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shalek A. K., Satija R., Adiconis X., Gertner R. S., Gaublomme J. T., Raychowdhury R., et al. (2013). Single-cell transcriptomics reveals bimodality in expression and splicing in immune cells. Nature 498 236–240. 10.1038/nature12172 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simone R., Javad F., Emmett W., Wilkins O. G., Almeida F. L., Barahona-Torres N., et al. (2021). MIR-NATs repress MAPT translation and aid proteostasis in neurodegeneration. Nature 495 117–123. 10.1038/s41586-021-03556-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith J. E., Alvarez-Dominguez J. R., Kline N., Huynh N. J., Geisler S., Hu W., et al. (2014). Translation of small open reading frames within unannotated RNA transcripts in Saccharomyces cerevisiae. Cell Rep. 7 1858–1866. 10.1016/j.celrep.2014.05.023 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spillantini M. G., Schmidt M. L., Lee V. M.-Y., Trojanowski J. Q., Jakes R., Goedert M. (1997). α-Synuclein in Lewy bodies. Nature 388 839–840. 10.1038/42166 [DOI] [PubMed] [Google Scholar]
- Sponne I., Fifre A., Koziel V., Kriem B., Oster T., Pillot T. (2004). Humanin rescues cortical neurons from prion-peptide-induced apoptosis. Mol. Cell. Neurosci. 25 95–102. 10.1016/j.mcn.2003.09.017 [DOI] [PubMed] [Google Scholar]
- Sriram A., Bohlen J., Teleman A. A. (2018). Translation acrobatics: How cancer cells exploit alternate modes of translational initiation. EMBO Rep. 19:e45947. 10.15252/embr.201845947 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stadler A. M., Digel I., Artmann G. M., Embs J. P., Zaccai G., Büldt G. (2008). Hemoglobin dynamics in red blood cells: Correlation to body temperature. Biophys. J. 95 5449–5461. 10.1529/biophysj.108.138040 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Suh J., Romano D. M., Nitschke L., Herrick S. P., DiMarzio B. A., Dzhala V., et al. (2019). Loss of ataxin-1 potentiates Alzheimer’s pathogenesis by elevating cerebral BACE1 transcription. Cell 178 1159–1175.e17. 10.1016/j.cell.2019.07.043 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tajima H., Niikura T., Hashimoto Y., Ito Y., Kita Y., Terashita K., et al. (2002). Evidence for in vivo production of Humanin peptide, a neuroprotective factor against Alzheimer’s disease-related insults. Neurosci. Lett. 324 227–231. 10.1016/s0304-3940(02)00199-4 [DOI] [PubMed] [Google Scholar]
- The UniProt Consortium (2017). UniProt: The universal protein knowledgebase. Nucleic Acids Res. 45 D158–D169. 10.1093/nar/gkw1099 [DOI] [PMC free article] [PubMed] [Google Scholar]
- The UniProt Consortium (2021). UniProt: The universal protein knowledgebase in 2021. Nucleic Acids Res. 49 D480–D489. 10.1093/nar/gkaa1100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thiemicke A., Jashnsaz H., Li G., Neuert G. (2019). Generating kinetic environments to study dynamic cellular processes in single cells. Sci. Rep. 9:10129. 10.1038/s41598-019-46438-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vanderperre B., Lucier J.-F., Bissonnette C., Motard J., Tremblay G., Vanderperre S., et al. (2013). Direct detection of alternative open reading frames translation products in human significantly expands the proteome. PLoS One 8:e70698. 10.1371/journal.pone.0070698 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vanderperre B., Staskevicius A. B., Tremblay G., McCoy M., O’Neill M. A., Cashman N. R., et al. (2011). An overlapping reading frame in the PRNP gene encodes a novel polypeptide distinct from the prion protein. FASEB J. 25 2373–2386. 10.1096/fj.10-173815 [DOI] [PubMed] [Google Scholar]
- Wang X., Wang W., Li L., Perry G., Lee H., Zhu X. (2014). Oxidative stress and mitochondrial dysfunction in Alzheimer’s disease. Biochim. Biophys. Acta Mol. Basis Dis. 1842 1240–1247. 10.1016/j.bbadis.2013.10.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang X.-Q., Rothnagel J. A. (2004). 5’-Untranslated regions with multiple upstream AUG codons can support low-level translation via leaky scanning and reinitiation. Nucleic Acids Res. 32 1382–1391. 10.1093/nar/gkh305 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang X., Coulombe-Huntington J., Kang S., Sheynkman G. M., Hao T., Richardson A., et al. (2016). Widespread expansion of protein interaction capabilities by alternative splicing. Cell 164 805–817. 10.1016/j.cell.2016.01.029 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang Y., Carstens R. P. (2017). Alternative splicing regulates distinct subcellular localization of epithelial splicing regulatory protein 1 (Esrp1) isoforms. Sci. Rep. 7:3848. 10.1038/s41598-017-03180-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yeo G., Holste D., Kreiman G., Burge C. B. (2004). Variation in alternative splicing across human tissues. Genome Biol. 5:R74. 10.1186/gb-2004-5-10-r74 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yoshida H. (2007). ER stress and diseases. FEBS J. 274 630–658. 10.1111/j.1742-4658.2007.05639.x [DOI] [PubMed] [Google Scholar]
- Yosten G. L. C., Liu J., Ji H., Sandberg K., Speth R., Samson W. K. (2016). A 5’-upstream short open reading frame encoded peptide regulates angiotensin type 1a receptor production and signalling via the β−arrestin pathway. J. Physiol. 594 1601–1605. 10.1113/JP270567 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Young S. K., Palam L. R., Wu C., Sachs M. S., Wek R. C. (2016). Ribosome elongation stall directs gene-specific translation in the integrated stress response. J. Biol. Chem. 291 6546–6558. 10.1074/jbc.M115.705640 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zárate S. C., Traetta M. E., Codagnone M. G., Seilicovich A., Reinés A. G. (2019). Humanin, a mitochondrial-derived peptide released by astrocytes, prevents synapse loss in hippocampal neurons. Front. Aging Neurosci. 11:123. 10.3389/fnagi.2019.00123 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang C., Krainer A. R., Zhang M. Q. (2007). Evolutionary impact of limited splicing fidelity in mammalian genes. Trends Genet. 23 484–488. 10.1016/j.tig.2007.08.001 [DOI] [PubMed] [Google Scholar]
- Zhao S.-T., Zhao L., Li J.-H. (2013). Neuroprotective peptide Humanin inhibits inflammatory response in astrocytes induced by lipopolysaccharide. Neurochem. Res. 38 581–588. 10.1007/s11064-012-0951-6 [DOI] [PubMed] [Google Scholar]
- Zuccato C. F., Asad A. S., Nicola Candia A. J., Gottardo M. F., Moreno Ayala M. A., Theas M. S., et al. (2019). Mitochondrial-derived peptide Humanin as therapeutic target in cancer and degenerative diseases. Expert Opin. Ther. Targets 23 117–126. 10.1080/14728222.2019.1559300 [DOI] [PubMed] [Google Scholar]




