Skip to main content
iScience logoLink to iScience
. 2022 Nov 10;25(12):105547. doi: 10.1016/j.isci.2022.105547

Casting CRISPR-Cas13d to fish for microprotein functions in animal development

Anthony James Treichel 1, Ariel Alejandro Bazzini 1,2,
PMCID: PMC9700322  PMID: 36444300

Summary

Protein coding genes were originally identified with sequence-based definitions that included a 100-codon cutoff to avoid annotating irrelevant open reading frames. However, many active proteins contain less than 100 amino acids. Indeed, functional genetics, ribosome profiling, and proteomic profiling have identified many short, translated open reading frames, including those with biologically active peptide products (microproteins). Yet, functions for most of these peptide products remain unknown. Because microproteins often act as key signals or fine-tune processes, animal development has already revealed functions for a handful of microproteins and provides an ideal context to uncover additional microprotein functions. However, many mRNAs during early development are maternally provided and hinder targeted mutagenesis approaches to characterize developmental microprotein functions. The recently established, RNA-targeting CRISPR-Cas13d system in zebrafish overcomes this barrier and produces potent knockdown of targeted mRNA, including maternally provided mRNA, and enables flexible, efficient interrogation of microprotein functions in animal development.

Subject areas: Genetics, Developmental biology, Complex system biology

Graphical abstract

graphic file with name fx1.jpg


Genetics; Developmental biology; Complex system biology

Pioneering the discovery of eukaryotic microproteins

At the onset of the genomics era, stringent sequence-based gene definitions were adopted to avoid annotating biologically irrelevant open reading frames (ORFs)—especially short ORFs (≤ 100 codons long) that exhibit a high frequency based on probability alone. For example, in zebrafish (Danio rerio) there are over 425,000 ORFs with canonical (AUG) start codons between 10 and 100 codons within messenger RNAs (mRNAs) and long non-coding RNAs alone. Only approximately 1,000 (0.02%) of these short ORFs are currently described as protein coding, leaving countless short ORFs with the potential to be translated as “beautiful needles in the haystack”1 (Figure 1A).

Figure 1.

Figure 1

Finding needles in the haystack: Short open reading frame translation is widespread across diverse taxa

(A) Rendition of a histogram representing all transcribed open reading frames (ORFs) in a single genome binned by length and categorized into short ORFs with translation potential (yellow haystack, left axis) and annotated proteins (blue books, right axis). Needle illustrates a short ORF that encodes a biologically active microprotein. Note: Left axis has a 100x scale compared to the right axis for ease of visualization. Based on1

(B) Ribosome profiling, proteomics, and functional genetics have cooperated to reveal widespread short ORF translation.

(C) 1. Translation of upstream (in mRNA sequence once called the 5′ UTR) short ORFs (uORFs) represses translation of the canonical ORF, but the list of uORFs that encode functional microproteins continues to grow. 2. Downstream (within sequence once called the 3′ UTR) short ORFs (dORFs) translation enhances canonical ORF translation. It remains unknown whether dORFs encode active microproteins. 3. Short ORFs that overlap with a canonical ORF (oORFs) are intrinsically tricky to interrogate, but a handful produce active microproteins. 4. RNAs that were initially annotated as non-coding (e.g., long non-coding RNAs, circular RNAs) contain short, translated ORFs (microprotein ORFs) that produce microproteins with diverse functions.

Functional genetics in budding yeast first confirmed the existence of eukaryotic “needles” upstream of the canonical ORF encoding GCN4,2 a transcription factor that activates genes involved in amino acid biosynthesis.3 Translation of four short, upstream (within mRNA sequence once defined as the 5′ untranslated region (UTR)) ORFs (uORFs) in GCN4 is required for nutrient-dependent regulation of GCN4 translation but relies on the uORF’s ability to engage ribosomes rather than a biologically active peptide product. However, subsequent functional genetic studies revealed short ORFs that encode biologically active peptide products (from here on: microproteins) in diverse organisms from budding yeast4 to legumes5 and insects6,7,8 (Figure 1B). Notably, all initially discovered microproteins are encoded on polycistronic mRNAs, and parallel bioinformatic analyses revealed that many mRNAs contain uORFs,9,10,11,12,13,14,15 including a population that shows amino acid conservation in mammals.9 This evidence collectively questioned the fundamental assumption that eukaryotic mRNAs are translated into a single peptide product.

Further supporting initial functional genetics and bioinformatics, the advent of ribosome profiling16,17,18 and targeted proteomic approaches19,20,21,22,23,24,25 over the last decade has revolutionized our ability to detect short ORF translation. Ribosome profiling, for example, leverages that ribosomes protect short fragments of bound mRNAs which are amenable to isolation and next-generation sequencing. Remarkably, some methods yield ribosome protected fragments that are consistent enough to define the reading frame18 (Figure 1B). Complementary detection of short ORF encoded peptide products with targeted proteomics has supported widespread translation of short ORFs on polycistronic mRNAs across many eukaryotic taxa16,17,21,23,24,26,27,28,29,30,31,32,33,34,35,36,37,38 (Figure 1B). Omics-based approaches have consequently completed the paradigm shift in RNA biology—many eukaryotic mRNAs exhibit multiple, distinct translation events rather than producing a single peptide product.

Furthermore, omics techniques have defined diverse classes of translated short ORFs and polycistronic mRNA structures (Figure 1C, reviewed extensively in39,40,41,42). For instance (and like those in GCN4), translation of uORFs often acts to suppress translation of the canonical coding sequence.2,12,43,44,45,46 On the other hand, translation of downstream short ORFs (dORFs) (within mRNA sequence once called the 3′ UTR) enhances canonical ORF translation.38 Although many uORFs and dORFs regulate translation independently of microprotein activity, the catalog of uORFs that encode functional microproteins continues to grow.4,33,47,48 In fact, even short ORFs nested within canonical coding sequences (overlapping ORFs) are translated into functional microproteins.49,50 In addition, many short ORFs are translated from novel mRNAs (previously annotated as non-coding) and circular RNAs, with a handful known to encode biologically active microproteins.27,28,29,33,51,52,53,54,55,56,57,58,59,60,61,62 However, although techniques continue to evolve and shine light on new short, translated ORFs, the potential functions encoded within the vast majority of identified short ORF peptide products remain unknown.

Microproteins are integral in diverse eukaryotic biology, including animal development

The arginine attenuator peptide (AAP) in budding yeast is one of the first functional microproteins identified in a eukaryote. AAP is a 25 amino acid microprotein produced from a uORF in the Saccharomyces cerevisiae CPA1 gene, which encodes an enzyme involved in arginine synthesis. AAP is required for free arginine to repress CPA1 translation, enabling arginine to negatively regulate its own biosynthesis.4 However, with a few notable exceptions5,6,7,8, most functional microproteins have been discovered in the last ten years with the aid of ribosome profiling16,17 and advances in proteomic and biochemical tools.21,24,25,63 Although microproteins encoded in animal genomes will be the main focus of this review, microproteins also play critical and emerging roles in fungi,4,12,64 bacteria65,66,67 (reviewed in68), and plants5,69,70 (reviewed in36,71).

In animals, microproteins play integral roles in diverse biological processes including transcription56,72,73 (Figure 2A), ribosome biogenesis,49 cell growth and viability,33 cancer cell survival,35 embryonic stem cell pluripotency74 (Figure 2B) and differentiation,75 immune responses48,60,62 and both cardiac34,53,54,76 (Figure 2B) and skeletal27,28,29 muscle function and development (Figure 2C). In fact, developmental contexts have provided significant insights into microprotein functions6,7,8,29,52,53,74,75,76,77 (Figures 2A–2C). For instance, parallel developmental studies in Tribolium castaneum and Drosophila melanogaster identified the first animal microprotein gene called mille-pattes or tarsal-less/polished rice, respectively, that produces a polycistronic mRNA that encodes functionally redundant 11–32 amino acid microproteins.6,7,8 Tarsal-less microproteins interact with Ubr3, an E3 ubiquitin ligase, and enable it target the transcription factor shavenbaby.72,78 Consequent proteosome-mediated cleavage of the shavenbaby N-terminus switches it from a transcriptional repressor to a transcriptional activator, inducing the expression of genes critical for distal limb patterning in Drosophila6,7 and abdominal segment identity in many other insects8,73 (Figure 2A). Similarly, characterized microproteins often function as essential regulators of protein activity or key signaling molecules, supporting that uncharacterized short ORF peptide products likely have additional functions in orchestrating animal development.

Figure 2.

Figure 2

Established microproteins play critical roles in animal development

(A) Tarsal-less/polished rice/mille-pattes is conserved throughout insects and produces a polycistronic mRNA that encodes functionally redundant microproteins critical for regulating the activity of the transcription factor shavenbaby for its role in specifying distal limb identity in Drosophila melanogaster and abdominal segment identity in the larvae of many insects, including Tribolium castaneum.

(B) apela/elabela/toddler encodes a signaling molecule that is conserved from zebrafish to humans that is critical for mesodermal migration and heart formation in both zebrafish and mouse, resulting in a lethal phenotype. Loss of Apela/Elabela/Toddler function in mouse also affects placental development and maternal health. Furthermore, APELA/ELABELA/TODDLER is required for maintaining pluripotency in human embryonic stem cells.

(C) Myomixer/Minion encodes a conserved, membrane-tethered ligand that is both necessary and sufficient for the cellular fusion required for muscle cells to develop into multinucleated muscle fibers in mouse. Because Myomixer/Minion is necessary for the development of the diaphragm muscle, knockout mice fail to inflate their lungs, resulting in perinatal lethality

Short ORF translation is prevalent during vertebrate development

Early stages of vertebrate development and vertebrate pluripotent stem cells show widespread short ORF translation.17,32,33,38,54,79,80 Among many vertebrate model systems, zebrafish (D. rerio) has emerged as a premier model system because of its high offspring number, ex-utero development, and genetic tractability. Not to mention, zebrafish exhibit many conserved cellular, molecular, and genetic foundations of development,81 and more than 70% of human genes have at least one ortholog in zebrafish, making zebrafish studies highly informative for human biology.82 Critically, zebrafish exhibit hundreds of translated short ORFs across their first five days of development.32,54 Although first identified in 2014, only twelve zebrafish developmentally translated short ORFs are currently characterized.53,54,77,83 Targeted mutagenesis has revealed that four of these short ORFs produce functional microproteins within the first seven days of development. The pioneer vertebrate functional microprotein called apela (elabela, toddler) is a conserved signaling molecule that is critical for zebrafish mesodermal migration, heart formation in zebrafish and mouse,53,54,76 pluripotency in human embryonic stem cells,74 and placental development in mouse76 (Figure 2B). Specifically in zebrafish, the microprotein bouncer localizes to the oocyte membrane and is necessary and sufficient for fertilization.83 Two additional zebrafish microproteins encoded in linc-mipep and linc-wrb have been recently reported to coordinate to regulate locomotor activity in larvae (4–7 days post-fertilization), directly or indirectly changing chromatin accessibility in CNS cells called oligodendrocytes.77 Still, hundreds of developmental short ORF encoded peptide products in zebrafish alone remain without insights into their functions.

Maternally provided RNAs impede microprotein characterization in early development

Animal development presents unique challenges to elucidating microprotein functions. During maturation, animal oocytes are packed with maternal RNAs and proteins. Immediately after fertilization, maternally provided RNAs and proteins control the developmental program while the zygote remains transcriptionally silent. As the zygote awakens its genome and ramps up production of its own transcripts, it is weaned from maternal control in a process known as the maternal-to-zygotic transition (reviewed extensively in84). Still, the majority of RNAs during the maternal-to-zygotic transition are maternally provided and present a significant barrier to elucidating microprotein functions in early animal development (Figure 3A). Indeed, a developmental function for a microprotein encoded on a maternally provided mRNA has yet to be described. apela mRNA is transcribed de-novo from the zygotic genome, which facilitated its discovery with a zygotic mutant.53,54 Similarly, linc-mipep and linc-wrb zygotic mutants show phenotypes that do not involve contributions from any maternal mRNA.77

Figure 3.

Figure 3

Maternally provided RNA presents a significant barrier to targeted mutagenesis approaches in early animal development that is overcome with CRISPR-Cas13d

(A) Representation of RNA dynamics during early animal development (Based on84). Traces show individual contributions from maternal (red) and zygotic (blue) RNA, whereas the rectangle above represents the cumulative RNA population over time. As the zygote generates its own RNAs, it takes control of its development through a process known as the maternal-to-zygotic transition. However, maternal RNAs still constitute a significant proportion of all RNAs throughout the maternal-to-zygotic transition.

(B) In a routine, targeted mutagenesis strategy, parents are maintained as heterozygotes for the loss-of-function allele (denoted by a yellow box with an orange star). With this approach, homozygous mutant offspring will possess wildtype maternal RNA in early development—known as zygotic mutants (left panel). To eliminate both maternal and zygotic RNA, the mother must be homozygous for the loss-of-function allele, but biallelic deletion of early developmental genes is often lethal. It is therefore necessary to generate a maternal line that is homozygous mutant in only its germline—a maternal mutant. If fertile, crossing this maternal mutant with a heterozygous male generates homozygous mutant offspring without wildtype maternal RNA—called maternal/zygotic mutants (right panel).

(C) Injection of CRISPR-Cas13d protein or mRNA with guide RNA in zebrafish produces potent degradation of both maternal and zygotic targeted RNAs, eliminating the need to generate maternal/zygotic mutants for loss-of-function analyses during the maternal-to-zygotic transition.

Elucidating microprotein functions in early development therefore requires either an over-expression approach or a loss-of-function strategy that targets both maternal and zygotic gene products to prevent maternal mRNA from masking any effect of a deletion in the offspring alone (Figure 3B). Targeted mutagenesis approaches (e.g., CRISPR-Cas9) therefore require the time-consuming generation of a dual maternal/zygotic mutant line, making it an inherently risky and expensive approach to interrogate the function of even a few developmental microproteins (Figure 3B). Alternatively, conventional RNAi is generally ineffective in zebrafish and other teleosts (e.g., medaka, killifish, Mexican tetra), which has led to widespread use of morpholinos as the preeminent knockdown technology. Although morpholinos have correctly elucidated gene function in select studies, the general utility of morpholinos has been questioned over off-target effects and general toxicity.85 In some cases, loss-of-function mutant and morpholino-treated animals exhibit discordant phenotypes.86,87 More strikingly, additional phenotypes have been observed on morpholino treatment in loss-of-function mutants,88 indicating cellular effects unrelated to the targeted RNAs. In fact, some morpholinos can trigger cellular immune responses and off-target mis-splicing.89,90 On top of these concerns, morpholinos are expensive and their validation can be cumbersome.

CRISPR-Cas13d knocks down barriers to studying maternally provided mRNAs

Following the discovery and widespread application of DNA-targeting CRISPR systems (e.g., CRISPR-Cas9), microbial (meta)genomic mining revealed a unique CRISPR-associated (Cas) protein—Cas13a (originally C2c2).91 Instead of DNA-nuclease domains (HNH and/or RuvC-like domains like Cas9 and Cas12a), Cas13a contains two RNA-nuclease domains (HEPN domains92) that enable Cas13a to function as an autonomous, programmable (through its CRISPR (guide) RNA) RNA-targeting nuclease.93,94,95,96 More Cas13 protein families—denoted Cas13b,97 Cas13c,98 and Cas13d99,100—were subsequently identified. The Cas13d family exhibits the smallest proteins (average length = 930 amino acids), and Cas13d from Ruminococcus flavefaciens produces potent mRNA knockdown in mammalian cell culture.100

To overcome the limitations in targeting maternally provided mRNAs, we established the RNA-targeting CRISPR-Cas13d system in zebrafish where it autonomously and potently degrades target mRNAs through at least the first 48 hours of development (see system development in101 and detailed methods in102). Because it targets both maternally provided and zygotically transcribed RNA (Figure 3C), CRISPR-Cas13d is a straightforward tool for loss-of-function studies of early developmental RNAs, including microprotein mRNAs and long non-coding RNAs. RT-qPCR offers a simple first-pass knockdown validation and a diagnostic tool for guide RNA design. For targets with ≥ 60% knockdown and a phenotype, it is simple to generate a high number of knockdown embryos to analyze the onset and the spectrum of the developmental effect. Further, this high offspring number is amenable to interrogating the molecular phenotype (e.g., genes and processes affected) and transcriptome-wide CRISPR-Cas13d targeting specificity with RNA-sequencing (Figure 4A).

Figure 4.

Figure 4

CRISPR-Cas13d is an efficient and versatile tool for interrogating microproteins in vertebrate development

(A) Standard pipeline for targeting microprotein mRNAs with CRISPR-Cas13d in zebrafish embryos. Cas13d protein or mRNA with guide RNA(s) are injected and developmental phenotypes are assessed at multiple timepoints. RT-qPCR is employed as a first-step knockdown validation. For targets with ≥ 60% knockdown and with consistent phenotypes between guide RNAs, RNA-seq assesses the molecular phenotype and further interrogates CRISPR-Cas13d targeting.

(B) When microprotein mRNA knockdown results in a phenotype, synthetic microprotein or synthetic microprotein mRNA is co-injected to test for microprotein activity. Synthetic mRNA provides insights into either microprotein and/or non-coding RNA activity with simple frameshift and synonymous mutations induced in the in vitro transcription vectors.

(C) When rescue experiments support microprotein activity, tagged microproteins can be generated within in vitro transcription vectors (and without the need for endogenous tagging). Rescue of the CRISPR-Cas13d knockdown phenotype with any tagged microprotein validates its activity and enables imaging to determine microprotein localization and co-immunoprecipitation to define microprotein interaction partners.

Further, CRISPR-Cas13d guide RNAs are sequence-unconstrained, which enables effective targeting for any RNA of interest. Similarly, cross-validation of observed phenotypes with either multiple cocktails of guide RNAs or individual guide RNAs is feasible because their synthesis is simple and relatively inexpensive. Depending on the application, Cas13d guide RNAs can be co-injected with either Cas13d protein (quick knockdown, long production) or mRNA (long knockdown, quick production). Regardless of approach (see detailed methods in102), CRISPR-Cas13d is an effective and efficient knockdown system. A CRISPR-Cas13d novice could reasonably perform a dozen unique CRISPR-Cas13d knockdowns in six months. Finally, although constitutive, heterologous expression systems in cell culture have revealed non-specific CRISPR-Cas13d RNA degradation when targeting abundant transcripts,103 the injection-based tools in zebrafish are powerful assets to interrogate any potential off-target effects. For example, co-injection of mRNA with the tbxta coding sequence rescues the “No-Tail” phenotype observed under CRISPR-Cas13d tbxta knockdown (targeting the endogenous 3′ UTR of tbxta), eliminating off-target concerns in this knockdown paradigm.101 CRISPR-Cas13d therefore opens the door to the interrogation of developmental functions for microproteins in zebrafish.

In addition to microprotein mRNAs, CRISPR-Cas13d enables loss-of-function studies for any maternally provided RNA of interest. For example, analyses of single zygotic mutants for many zebrafish long non-coding RNAs failed to find developmental functions.88,104 Although zygotic mutants enable maternally provided RNA to mask any developmental effect, CRISPR-Cas13d eliminates this maternal RNA and reduces the likelihood of false negatives in screens for non-coding RNA functions in development (Figure 3C). Outside of maternally provided RNA, double (or triple, etc.) mutants are required to uncover the impact of long non-coding RNAs with complementary functions. In contrast, CRISPR-Cas13d facilitates simple targeting of multiple RNAs to test for overlapping functions. Intriguingly, many microproteins are encoded on RNAs once described as non-coding, raising the possibility that a notable subset of RNAs have both coding and non-coding functions60 or translation-dependent, non-coding functions (somewhat analogous to uORFs and dORFs). CRISPR-Cas13d, alongside established injection-based approaches in zebrafish, provides a truly feasible system in vertebrates to simultaneously interrogate both non-coding and coding functions within diverse RNAs.

Unraveling microprotein and non-coding RNA functions with CRISPR-Cas13d

On top of its potent loss-of-function effects, the flexibility of CRISPR-Cas13d guide RNAs enables routine injection-based approaches to provide key insights into developmental microprotein mRNA and/or non-coding RNA functions. When CRISPR-Cas13d knockdown of a microprotein mRNA produces a developmental phenotype, the paramount question is whether the phenotype is a result of microprotein activity (Figure 4B). Because microproteins are short, a chemically synthesized microprotein can be co-injected in knockdown embryos to test its ability to rescue the phenotype. However, RNA-based approaches are more flexible and efficient. In-vitro transcription vectors in zebrafish enable the generation of a “rescue” mRNA with the microprotein coding sequence flanked by exogenous 5′ and-3′ UTRs,105 allowing guide RNAs targeting the microprotein UTRs to knockdown only the native zebrafish mRNA (Figure 4B). Microprotein activity is supported when a “rescue” mRNA alleviates the knockdown phenotype, but a frameshifted version fails to do so. Likewise, synonymous mutations in the microprotein sequence in the “rescue” mRNA should also alleviate the phenotype.

Rescue experiments can be leveraged for additional insights into conservation of function and critical amino acid residues. If microprotein function is conserved, homologous microprotein sequences (or syntenic non-coding RNAs) from other species in “rescue” mRNAs should similarly alleviate the developmental effect. In parallel, for any critical amino acid residue, a mutation in the “rescue” mRNA will fail to alleviate the phenotype. In a like manner, any developmentally critical non-coding RNA discovered with CRISPR-Cas13d can be mutated to assess for functional sequences and/or structures. Importantly, fluorescent or epitope tags can be added to microprotein coding sequences in the transcription vector alone, without the need for endogenous tagging. Although tagging has the potential to disrupt microprotein function, rescue of its CRISPR-Cas13d knockdown phenotype validates the activity of any tagged microprotein. Tag-based imaging can define microprotein subcellular localization whereas co-immunoprecipitation experiments based on the tag can identify microprotein interaction partners (Figure 4C).

Novel combinations of CRISPR-Cas13d will provide additional insights

RNA-seq is very effective at defining the transcriptome-wide effects in CRISPR-Cas13d knockdowns. However, a plethora of techniques are eagerly waiting to work in concert with CRISPR-Cas13d to provide further insights into microprotein and/or non-coding RNA functions. For example, v3 HCR in situ hybridization106 in knockdown embryos can provide spatiotemporal readouts for transcripts of interest from RNA-seq to determine if their localization(s) is changing (Figure 5A). In-situs for cell-type specific markers (e.g., endoderm, ectoderm, and mesoderm) in knockdown embryos allow interrogation of potential cell fate decision and differentiation defects. Further, a key process during the maternal-to-zygotic transition is the activation of transcription from the zygotic genome (Figure 3A), which involves major changes to chromatin accessibility. CRISPR-Cas13d is amenable to genomics-based techniques (e.g., ChIP-seq,107ATAC-seq108) to interrogate chromatin states under microprotein knockdown because it is straightforward to generate many, homogeneous knockdown embryos (Figure 5B).

Figure 5.

Figure 5

Future combinations of CRISPR-Cas13d will paint a more complete picture of developmental microprotein functions

(A) Spatiotemporal gene expression in CRISPR-Cas13d knockdown embryos can be visualized for transcripts of interest from RNA-seq or for cells of interest using v3 HCR in situ hybridization.

(B) Chromatin accessibility during zygotic genome activation can be assessed in knockdown embryos using omics-based techniques such as ATAC-seq.

(C) CRISPR-Cas9 is highly efficient at generating bi-allelic deletions in F0 zebrafish. In combination with CRISPR-Cas13d, analyzing F0 zygotic mutants will provide insights into the individual role of zygotically transcribed RNA. Similarly, a germline targeted Cas9 potentiates the creation of F0 maternal mutants to test the individual role of maternally provided RNA.

Although CRISPR-Cas13d enables the dissection of gene function during the maternal-to-zygotic transition without maternal/zygotic mutants, parallel targeted mutagenesis (e.g., CRISPR-Cas9) enables investigations into the individual phenotype contributions from maternally provided and/or zygotically transcribed RNA. In zebrafish, CRISPR-Cas9 is highly optimized and can produce over 90% biallelic deletions in F0 mutants,109,110 enabling rapid comparisons with CRISPR-Cas13d knockdowns. For instance, if zygotic RNA is the major contributor, zygotic F0 mutants will recapitulate the CRISPR-Cas13d knockdown phenotype. Alternatively, if maternal RNA is the major contributor, zygotic mutants will fail to recapitulate the phenotype (Figure 5C). In this case, a germline targeted Cas9 construct is available in zebrafish that can effectively generate maternal mutants in F0109. Importantly, crossing F0 maternal mutants to wildtype males would validate the essential role of maternally provided RNA in a single generation (Figure 5C).

Further, simultaneous targeting of multiple microprotein or non-coding RNA genes is feasible with CRISPR-Cas13d. In fact, multiple targeting is especially useful in zebrafish where at least 26% of genes have a paralog from the teleost-specific genome duplication that could retain overlapping function.82 Moreover, CRISPR-Cas13d is effective in mouse, killifish, and medaka, enabling interrogation of conserved microprotein and non-coding RNA functions in development.101 Although this review has focused on development, adult zebrafish afford a large repertoire of tissue and cell-type specific drivers and human disease models. Cas13d could therefore be employed in a selective manner to interrogate microprotein or non-coding RNA functions in homeostasis or in zebrafish models of human disease.

Future perspectives for CRISPR-Cas13, microproteins, and non-coding RNAs

In all, CRISPR-Cas13d in zebrafish not only expands a solid vertebrate developmental model system but also opens novel angles for diverse developmental questions about microprotein and non-coding RNAs, particularly those that involve the maternal-to-zygotic transition. Alongside CRISPR-Cas13d, RNA and protein injection-based tools in zebrafish enable relatively simple dissection of diverse microproteins (e.g., encoded in overlapping ORFs, 2–10 codon ORFs) and non-coding RNAs (e.g., coordinate functions, and complementary roles). The utility of CRISPR-Cas13 across teleosts and mammals potentiates powerful comparative studies that can test conservation of function and provide insights into the evolution of both microproteins and non-coding RNAs. Similarly, implementation of CRISPR-Cas13d in plants could provide insights into microprotein and/or non-coding RNA function(s) during early plant development, which may involve contributions from both maternally and paternally provided RNAs.111,112,113 One caveat to the CRISPR-Cas13d system is that it degrades maternally provided microprotein mRNAs, not the microproteins themselves. Any maternally provided microproteins whose function depends entirely on their maternal microprotein pool would escape detection in a CRISPR-Cas13d loss-of-function screen.

Beyond its current role in RNA knockdown, the CRISPR-Cas13 system has exciting potential. For instance, catalytically dead Cas13d fused to translation initiation factors was recently shown to increase translation in E. coli.114 If established in zebrafish, this would enable over-expression studies for microproteins without the need for injection of in vitro transcribed RNA that may fail to recapitulate the native microprotein mRNA expression pattern. Furthermore, application of the dead Cas13-based RNA imaging approaches (developed in mammalian cell culture) to the accessible live imaging provided by developing zebrafish would illuminate the spatiotemporal dynamics of microprotein mRNA and/or non-coding RNA in a living organism.115 Finally, implementing novel microprotein tagging approaches (e.g., MicroID,24 non-canonical amino acid labeling63) in zebrafish would complement functional studies with CRISPR-Cas13d to better elucidate microprotein functions in their native contexts.

Acknowledgments

We thank Mark Miller for assistance with illustrations, and Michay Diez, Josefina Doffo, Cameron Berry, Gabriel da Silva Pescador, and Danielson Baia Amaral for constructive feedback on the manuscript. This work was supported by the Stowers Institute for Medical Research. A.A.B was awarded a Pew Innovation Fund and the US National Institutes of Health (NIH-R01 GM136849). This was compiled as part of thesis research for A.J.T., Graduate School of the Stowers Institute for Medical Research.

Author contributions

Conceptualization, A.J.T. and A.A.B.; Writing—Original Draft, A.J.T.; Writing—Review and Editing, A.J.T. and A.A.B.; Visualization, A.J.T.

Declaration of interests

The authors declare no competing interests.

Inclusion and diversity

We support inclusive, diverse, and equitable conduct of research.

References

  • 1.Basrai M.A., Hieter P., Boeke J.D. Small open reading frames: beautiful needles in the haystack. Genome Res. 1997;7:768–771. doi: 10.1101/gr.7.8.768. [DOI] [PubMed] [Google Scholar]
  • 2.Mueller P.P., Hinnebusch A.G. Multiple upstream AUG codons mediate translational control of GCN4. Cell. 1986;45:201–207. doi: 10.1016/0092-8674(86)90384-3. [DOI] [PubMed] [Google Scholar]
  • 3.Hope I.A., Struhl K. Functional dissection of a eukaryotic transcriptional activator protein, GCN4 of yeast. Cell. 1986;46:885–894. doi: 10.1016/0092-8674(86)90070-x. [DOI] [PubMed] [Google Scholar]
  • 4.Werner M., Feller A., Messenguy F., Piérard A. The leader peptide of yeast gene CPA1 is essential for the translational repression of its expression. Cell. 1987;49:805–813. doi: 10.1016/0092-8674(87)90618-0. [DOI] [PubMed] [Google Scholar]
  • 5.Rohrig H., Schmidt J., Miklashevichs E., Schell J., John M. Soybean ENOD40 encodes two peptides that bind to sucrose synthase. Proc. Natl. Acad. Sci. USA. 2002;99:1915–1920. doi: 10.1073/pnas.022664799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Galindo M.I., Pueyo J.I., Fouix S., Bishop S.A., Couso J.P. Peptides encoded by short ORFs control development and define a new eukaryotic gene family. PLoS Biol. 2007;5:e106. doi: 10.1371/journal.pbio.0050106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kondo T., Hashimoto Y., Kato K., Inagaki S., Hayashi S., Kageyama Y. Small peptide regulators of actin-based cell morphogenesis encoded by a polycistronic mRNA. Nat. Cell Biol. 2007;9:660–665. doi: 10.1038/ncb1595. [DOI] [PubMed] [Google Scholar]
  • 8.Savard J., Marques-Souza H., Aranda M., Tautz D. A segmentation gene in tribolium produces a polycistronic mRNA that codes for multiple conserved peptides. Cell. 2006;126:559–569. doi: 10.1016/j.cell.2006.05.053. [DOI] [PubMed] [Google Scholar]
  • 9.Crowe M.L., Wang X.Q., Rothnagel J.A. Evidence for conservation and selection of upstream open reading frames suggests probable encoding of bioactive peptides. BMC Genom. 2006;7:16. doi: 10.1186/1471-2164-7-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Morris D.R., Geballe A.P. Upstream open reading frames as regulators of mRNA translation. Mol. Cell Biol. 2000;20:8635–8642. doi: 10.1128/MCB.20.23.8635-8642.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Tautz D. Polycistronic peptide coding genes in eukaryotes--how widespread are they? Brief. Funct. Genomic. Proteomic. 2009;8:68–74. doi: 10.1093/bfgp/eln054. [DOI] [PubMed] [Google Scholar]
  • 12.Zhang Z., Dietrich F.S. Identification and characterization of upstream open reading frames (uORF) in the 5' untranslated regions (UTR) of genes in Saccharomyces cerevisiae. Curr. Genet. 2005;48:77–87. doi: 10.1007/s00294-005-0001-x. [DOI] [PubMed] [Google Scholar]
  • 13.Hayden C.A., Bosco G. Comparative genomic analysis of novel conserved peptide upstream open reading frames in Drosophila melanogaster and other dipteran species. BMC Genom. 2008;9:61. doi: 10.1186/1471-2164-9-61. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Hayden C.A., Jorgensen R.A. Identification of novel conserved peptide uORF homology groups in Arabidopsis and rice reveals ancient eukaryotic origin of select groups and preferential association with transcription factor-encoding genes. BMC Biol. 2007;5:32. doi: 10.1186/1741-7007-5-32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Neafsey D.E., Galagan J.E. Dual modes of natural selection on upstream open reading frames. Mol. Biol. Evol. 2007;24:1744–1751. doi: 10.1093/molbev/msm093. [DOI] [PubMed] [Google Scholar]
  • 16.Ingolia N.T., Ghaemmaghami S., Newman J.R.S., Weissman J.S. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science. 2009;324:218–223. doi: 10.1126/science.1168978. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Ingolia N.T., Lareau L.F., Weissman J.S. Ribosome profiling of mouse embryonic stem cells reveals the complexity and dynamics of mammalian proteomes. Cell. 2011;147:789–802. doi: 10.1016/j.cell.2011.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Brar G.A., Weissman J.S. Ribosome profiling reveals the what, when, where and how of protein synthesis. Nat. Rev. Mol. Cell Biol. 2015;16:651–664. doi: 10.1038/nrm4069. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Ma J., Ward C.C., Jungreis I., Slavoff S.A., Schwaid A.G., Neveu J., Budnik B.A., Kellis M., Saghatelian A. Discovery of human sORF-encoded polypeptides (SEPs) in cell lines and tissue. J. Proteome Res. 2014;13:1757–1765. doi: 10.1021/pr401280w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Schwaid A.G., Shannon D.A., Ma J., Slavoff S.A., Levin J.Z., Weerapana E., Saghatelian A. Chemoproteomic discovery of cysteine-containing human short open reading frames. J. Am. Chem. Soc. 2013;135:16750–16753. doi: 10.1021/ja406606j. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Slavoff S.A., Mitchell A.J., Schwaid A.G., Cabili M.N., Ma J., Levin J.Z., Karger A.D., Budnik B.A., Rinn J.L., Saghatelian A. Peptidomic discovery of short open reading frame-encoded peptides in human cells. Nat. Chem. Biol. 2013;9:59–64. doi: 10.1038/nchembio.1120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Tinoco A.D., Tagore D.M., Saghatelian A. Expanding the dipeptidyl peptidase 4-regulated peptidome via an optimized peptidomics platform. J. Am. Chem. Soc. 2010;132:3819–3830. doi: 10.1021/ja909524e. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Khitun A., Slavoff S.A. Proteomic detection and validation of translated small open reading frames. Curr. Protoc. Chem. Biol. 2019;11:e77. doi: 10.1002/cpch.77. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Na Z., Dai X., Zheng S.J., Bryant C.J., Loh K.H., Su H., Luo Y., Buhagiar A.F., Cao X., Baserga S.J., et al. Mapping subcellular localizations of unannotated microproteins and alternative proteins with MicroID. Mol. Cell. 2022;82:2900–2911.e7. doi: 10.1016/j.molcel.2022.06.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Pino L.K., Just S.C., MacCoss M.J., Searle B.C. Acquiring and analyzing data independent acquisition proteomics experiments without spectrum libraries. Mol. Cell. Proteomics. 2020;19:1088–1103. doi: 10.1074/mcp.P119.001913. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Mudge J.M., Ruiz-Orera J., Prensner J.R., Brunet M.A., Calvet F., Jungreis I., Gonzalez J.M., Magrane M., Martinez T.F., Schulz J.F., et al. Standardized annotation of translated open reading frames. Nat. Biotechnol. 2022;40:994–999. doi: 10.1038/s41587-022-01369-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Anderson D.M., Makarewich C.A., Anderson K.M., Shelton J.M., Bezprozvannaya S., Bassel-Duby R., Olson E.N. Widespread control of calcium signaling by a family of SERCA-inhibiting micropeptides. Sci. Signal. 2016;9:ra119. doi: 10.1126/scisignal.aaj1460. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Bi P., McAnally J.R., Shelton J.M., Sánchez-Ortiz E., Bassel-Duby R., Olson E.N. Fusogenic micropeptide Myomixer is essential for satellite cell fusion and muscle regeneration. Proc. Natl. Acad. Sci. USA. 2018;115:3864–3869. doi: 10.1073/pnas.1800052115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Bi P., Ramirez-Martinez A., Li H., Cannavino J., McAnally J.R., Shelton J.M., Sánchez-Ortiz E., Bassel-Duby R., Olson E.N. Control of muscle formation by the fusogenic micropeptide myomixer. Science. 2017;356:323–327. doi: 10.1126/science.aam9361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Ladoukakis E., Pereira V., Magny E.G., Eyre-Walker A., Couso J.P. Hundreds of putatively functional small open reading frames in Drosophila. Genome Biol. 2011;12:R118. doi: 10.1186/gb-2011-12-11-r118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Dunn J.G., Foo C.K., Belletier N.G., Gavis E.R., Weissman J.S. Ribosome profiling reveals pervasive and regulated stop codon readthrough in Drosophila melanogaster. Elife. 2013;2:e01179. doi: 10.7554/eLife.01179. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Bazzini A.A., Johnstone T.G., Christiano R., Mackowiak S.D., Obermayer B., Fleming E.S., Vejnar C.E., Lee M.T., Rajewsky N., Walther T.C., Giraldez A.J. Identification of small ORFs in vertebrates using ribosome footprinting and evolutionary conservation. EMBO J. 2014;33:981–993. doi: 10.1002/embj.201488411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Chen J., Brunner A.D., Cogan J.Z., Nuñez J.K., Fields A.P., Adamson B., Itzhak D.N., Li J.Y., Mann M., Leonetti M.D., Weissman J.S. Pervasive functional translation of noncanonical human open reading frames. Science. 2020;367:1140–1146. doi: 10.1126/science.aay0262. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.van Heesch S., Witte F., Schneider-Lunitz V., Schulz J.F., Adami E., Faber A.B., Kirchner M., Maatz H., Blachut S., Sandmann C.L., et al. The translational landscape of the human heart. Cell. 2019;178:242–260.e29. doi: 10.1016/j.cell.2019.05.010. [DOI] [PubMed] [Google Scholar]
  • 35.Prensner J.R., Enache O.M., Luria V., Krug K., Clauser K.R., Dempster J.M., Karger A., Wang L., Stumbraite K., Wang V.M., et al. Noncanonical open reading frames encode functional proteins essential for cancer cell survival. Nat. Biotechnol. 2021;39:697–704. doi: 10.1038/s41587-020-00806-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Hanada K., Higuchi-Takeuchi M., Okamoto M., Yoshizumi T., Shimizu M., Nakaminami K., Nishi R., Ohashi C., Iida K., Tanaka M., et al. Small open reading frames associated with morphogenesis are hidden in plant genomes. Proc. Natl. Acad. Sci. USA. 2013;110:2395–2400. doi: 10.1073/pnas.1213958110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Ji Z., Song R., Regev A., Struhl K. Many lncRNAs, 5'UTRs, and pseudogenes are translated and some are likely to express functional proteins. Elife. 2015;4:e08890. doi: 10.7554/eLife.08890. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Wu Q., Wright M., Gogol M.M., Bradford W.D., Zhang N., Bazzini A.A. Translation of small downstream ORFs enhances translation of canonical main open reading frames. EMBO J. 2020;39:e104763. doi: 10.15252/embj.2020104763. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Wright B.W., Yi Z., Weissman J.S., Chen J. The dark proteome: translation from noncanonical open reading frames. Trends Cell Biol. 2022;32:243–258. doi: 10.1016/j.tcb.2021.10.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Couso J.P., Patraquim P. Classification and function of small open reading frames. Nat. Rev. Mol. Cell Biol. 2017;18:575–589. doi: 10.1038/nrm.2017.58. [DOI] [PubMed] [Google Scholar]
  • 41.Plaza S., Menschaert G., Payre F. In search of lost small peptides. Annu. Rev. Cell Dev. Biol. 2017;33:391–416. doi: 10.1146/annurev-cellbio-100616-060516. [DOI] [PubMed] [Google Scholar]
  • 42.Orr M.W., Mao Y., Storz G., Qian S.B. Alternative ORFs and small ORFs: shedding light on the dark proteome. Nucleic Acids Res. 2020;48:1029–1042. doi: 10.1093/nar/gkz734. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Barbosa C., Peixeiro I., Romão L. Gene expression regulation by upstream open reading frames and human disease. PLoS Genet. 2013;9:e1003529. doi: 10.1371/journal.pgen.1003529. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Chew G.L., Pauli A., Schier A.F. Conservation of uORF repressiveness and sequence features in mouse, human and zebrafish. Nat. Commun. 2016;7:11663. doi: 10.1038/ncomms11663. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Johnstone T.G., Bazzini A.A., Giraldez A.J. Upstream ORFs are prevalent translational repressors in vertebrates. EMBO J. 2016;35:706–723. doi: 10.15252/embj.201592759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Vattem K.M., Wek R.C. Reinitiation involving upstream ORFs regulates ATF4 mRNA translation in mammalian cells. Proc. Natl. Acad. Sci. USA. 2004;101:11269–11274. doi: 10.1073/pnas.0400541101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Jayaram D.R., Frost S., Argov C., Liju V.B., Anto N.P., Muraleedharan A., Ben-Ari A., Sinay R., Smoly I., Novoplansky O., et al. Unraveling the hidden role of a uORF-encoded peptide as a kinase inhibitor of PKCs. Proc. Natl. Acad. Sci. USA. 2021;118 doi: 10.1073/pnas.2018899118. e2018899118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Starck S.R., Tsai J.C., Chen K., Shodiya M., Wang L., Yahiro K., Martins-Green M., Shastri N., Walter P. Translation from the 5' untranslated region shapes the integrated stress response. Science. 2016;351:aad3867. doi: 10.1126/science.aad3867. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Cao X., Khitun A., Harold C.M., Bryant C.J., Zheng S.J., Baserga S.J., Slavoff S.A. Nascent alt-protein chemoproteomics reveals a pre-60S assembly checkpoint inhibitor. Nat. Chem. Biol. 2022;18:643–651. doi: 10.1038/s41589-022-01003-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Cao X., Khitun A., Luo Y., Na Z., Phoodokmai T., Sappakhaw K., Olatunji E., Uttamapinant C., Slavoff S.A. Alt-RPL36 downregulates the PI3K-AKT-mTOR signaling pathway by interacting with TMEM24. Nat. Commun. 2021;12:508. doi: 10.1038/s41467-020-20841-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.D'Lima N.G., Ma J., Winkler L., Chu Q., Loh K.H., Corpuz E.O., Budnik B.A., Lykke-Andersen J., Saghatelian A., Slavoff S.A. A human microprotein that interacts with the mRNA decapping complex. Nat. Chem. Biol. 2017;13:174–180. doi: 10.1038/nchembio.2249. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Zhang Q., Vashisht A.A., O'Rourke J., Corbel S.Y., Moran R., Romero A., Miraglia L., Zhang J., Durrant E., Schmedt C., et al. The microprotein Minion controls cell fusion and muscle formation. Nat. Commun. 2017;8:15664. doi: 10.1038/ncomms15664. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Chng S.C., Ho L., Tian J., Reversade B. ELABELA: a hormone essential for heart development signals via the apelin receptor. Dev. Cell. 2013;27:672–680. doi: 10.1016/j.devcel.2013.11.002. [DOI] [PubMed] [Google Scholar]
  • 54.Pauli A., Norris M.L., Valen E., Chew G.L., Gagnon J.A., Zimmerman S., Mitchell A., Ma J., Dubrulle J., Reyon D., et al. Toddler: an embryonic signal that promotes cell movement via Apelin receptors. Science. 2014;343:1248636. doi: 10.1126/science.1248636. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Chu Q., Martinez T.F., Novak S.W., Donaldson C.J., Tan D., Vaughan J.M., Chang T., Diedrich J.K., Andrade L., Kim A., et al. Regulation of the ER stress response by a mitochondrial microprotein. Nat. Commun. 2019;10:4883. doi: 10.1038/s41467-019-12816-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Koh M., Ahmad I., Ko Y., Zhang Y., Martinez T.F., Diedrich J.K., Chu Q., Moresco J.J., Erb M.A., Saghatelian A., et al. A short ORF-encoded transcriptional regulator. Proc. Natl. Acad. Sci. USA. 2021;118 doi: 10.1073/pnas.2021943118. e2021943118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Rathore A., Chu Q., Tan D., Martinez T.F., Donaldson C.J., Diedrich J.K., Yates J.R., 3rd, Saghatelian A. MIEF1 microprotein regulates mitochondrial translation. Biochemistry. 2018;57:5564–5575. doi: 10.1021/acs.biochem.8b00726. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Slavoff S.A., Heo J., Budnik B.A., Hanakahi L.A., Saghatelian A. A human short open reading frame (sORF)-encoded polypeptide that stimulates DNA end joining. J. Biol. Chem. 2014;289:10950–10957. doi: 10.1074/jbc.C113.533968. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Senís E., Esgleas M., Najas S., Jiménez-Sábado V., Bertani C., Giménez-Alejandre M., Escriche A., Ruiz-Orera J., Hergueta-Redondo M., Jiménez M., et al. TUNAR lncRNA encodes a microprotein that regulates neural differentiation and neurite formation by modulating calcium dynamics. Front. Cell Dev. Biol. 2021;9:747667. doi: 10.3389/fcell.2021.747667. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Lee C.Q.E., Kerouanton B., Chothani S., Zhang S., Chen Y., Mantri C.K., Hock D.H., Lim R., Nadkarni R., Huynh V.T., et al. Coding and non-coding roles of MOCCI (C15ORF48) coordinate to regulate host inflammation and immunity. Nat. Commun. 2021;12:2130. doi: 10.1038/s41467-021-22397-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Zhang S., Reljić B., Liang C., Kerouanton B., Francisco J.C., Peh J.H., Mary C., Jagannathan N.S., Olexiouk V., Tang C., et al. Mitochondrial peptide BRAWNIN is essential for vertebrate respiratory complex III assembly. Nat. Commun. 2020;11:1312. doi: 10.1038/s41467-020-14999-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Bhatta A., Atianand M., Jiang Z., Crabtree J., Blin J., Fitzgerald K.A. A mitochondrial micropeptide is required for activation of the Nlrp3 inflammasome. J. Immunol. 2020;204:428–437. doi: 10.4049/jimmunol.1900791. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Lafranchi L., Schlesinger D., Kimler K.J., Elsässer S.J. Universal single-residue terminal labels for fluorescent live cell imaging of microproteins. J. Am. Chem. Soc. 2020;142:20080–20087. doi: 10.1021/jacs.0c09574. [DOI] [PubMed] [Google Scholar]
  • 64.Tollis S., Singh J., Palou R., Thattikota Y., Ghazal G., Coulombe-Huntington J., Tang X., Moore S., Blake D., Bonneil E., et al. The microprotein Nrs1 rewires the G1/S transcriptional machinery during nitrogen limitation in budding yeast. PLoS Biol. 2022;20:e3001548. doi: 10.1371/journal.pbio.3001548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Aoyama J.J., Raina M., Zhong A., Storz G. Dual-function Spot 42 RNA encodes a 15-amino acid protein that regulates the CRP transcription factor. Proc. Natl. Acad. Sci. USA. 2022;119 doi: 10.1073/pnas.2119866119. e2119866119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Wang H., Yin X., Wu Orr M., Dambach M., Curtis R., Storz G. Increasing intracellular magnesium levels with the 31-amino acid MgtS protein. Proc. Natl. Acad. Sci. USA. 2017;114:5689–5694. doi: 10.1073/pnas.1703415114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Sberro H., Fremin B.J., Zlitni S., Edfors F., Greenfield N., Snyder M.P., Pavlopoulos G.A., Kyrpides N.C., Bhatt A.S. Large-scale Analyses of human microbiomes reveal thousands of small. Cell. 2019;178:1245–1259.e14. doi: 10.1016/j.cell.2019.07.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Gray T., Storz G., Papenfort K. Small proteins; big questions. J. Bacteriol. 2022;204:e0034121. doi: 10.1128/JB.00341-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Graeff M., Straub D., Eguen T., Dolde U., Rodrigues V., Brandt R., Wenkel S. MicroProtein-mediated recruitment of CONSTANS into a TOPLESS trimeric complex represses flowering in arabidopsis. PLoS Genet. 2016;12:e1005959. doi: 10.1371/journal.pgen.1005959. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Hong S.Y., Sun B., Straub D., Blaakmeer A., Mineri L., Koch J., Brinch-Pedersen H., Holme I.B., Burow M., Lyngs Jørgensen H.J., et al. Heterologous microProtein expression identifies LITTLE NINJA, a dominant regulator of jasmonic acid signaling. Proc. Natl. Acad. Sci. USA. 2020;117:26197–26205. doi: 10.1073/pnas.2005198117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Bhati K.K., Dolde U., Wenkel S. MicroProteins: expanding functions and novel modes of regulation. Mol. Plant. 2021;14:705–707. doi: 10.1016/j.molp.2021.01.006. [DOI] [PubMed] [Google Scholar]
  • 72.Kondo T., Plaza S., Zanet J., Benrabah E., Valenti P., Hashimoto Y., Kobayashi S., Payre F., Kageyama Y. Small peptides switch the transcriptional activity of Shavenbaby during Drosophila embryogenesis. Science. 2010;329:336–339. doi: 10.1126/science.1188158. [DOI] [PubMed] [Google Scholar]
  • 73.Ray S., Rosenberg M.I., Chanut-Delalande H., Decaras A., Schwertner B., Toubiana W., Auman T., Schnellhammer I., Teuscher M., Valenti P., et al. The mlpt/Ubr3/Svb module comprises an ancient developmental switch for embryonic patterning. Elife. 2019;8:e39748. doi: 10.7554/eLife.39748. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Ho L., Tan S.Y.X., Wee S., Wu Y., Tan S.J.C., Ramakrishna N.B., Chng S.C., Nama S., Szczerbinska I., Sczerbinska I., et al. ELABELA is an endogenous growth factor that sustains hESC self-renewal via the PI3K/AKT pathway. Cell Stem Cell. 2015;17:435–447. doi: 10.1016/j.stem.2015.08.010. [DOI] [PubMed] [Google Scholar]
  • 75.Fu H., Wang T., Kong X., Yan K., Yang Y., Cao J., Yuan Y., Wang N., Kee K., Lu Z.J., Xi Q. A Nodal enhanced micropeptide NEMEP regulates glucose uptake during mesendoderm differentiation of embryonic stem cells. Nat. Commun. 2022;13:3984. doi: 10.1038/s41467-022-31762-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Ho L., van Dijk M., Chye S.T.J., Messerschmidt D.M., Chng S.C., Ong S., Yi L.K., Boussata S., Goh G.H.Y., Afink G.B., et al. ELABELA deficiency promotes preeclampsia and cardiovascular malformations in mice. Science. 2017;357:707–713. doi: 10.1126/science.aam6607. [DOI] [PubMed] [Google Scholar]
  • 77.Tornini V.A., Lee H.-J., Miao L., Tang Y., Dube S.E., Gerson T., Schmidt V.J., Du K., Kuchroo M., Kroll F., et al. bioRxiv; 2022. <em>linc-mipep</em> and <em>linc-Wrb</em> Encode Micropeptides that Regulate Chromatin Accessibility in Vertebrate-specific Neural Cells. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Zanet J., Benrabah E., Li T., Pélissier-Monier A., Chanut-Delalande H., Ronsin B., Bellen H.J., Payre F., Plaza S. Pri sORF peptides induce selective proteasome-mediated protein processing. Science. 2015;349:1356–1358. doi: 10.1126/science.aac5677. [DOI] [PubMed] [Google Scholar]
  • 79.Chew G.L., Pauli A., Rinn J.L., Regev A., Schier A.F., Valen E. Ribosome profiling reveals resemblance between long non-coding RNAs and 5' leaders of coding RNAs. Development (Cambridge, U. K.) 2013;140:2828–2834. doi: 10.1242/dev.098343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Wang H., Wang Y., Yang J., Zhao Q., Tang N., Chen C., Li H., Cheng C., Xie M., Yang Y., Xie Z. Tissue- and stage-specific landscape of the mouse translatome. Nucleic Acids Res. 2021;49:6165–6180. doi: 10.1093/nar/gkab482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Kimmel C.B., Ballard W.W., Kimmel S.R., Ullmann B., Schilling T.F. Stages of embryonic development of the zebrafish. Dev. Dynam. 1995;203:253–310. doi: 10.1002/aja.1002030302. [DOI] [PubMed] [Google Scholar]
  • 82.Howe K., Clark M.D., Torroja C.F., Torrance J., Berthelot C., Muffato M., Collins J.E., Humphray S., McLaren K., Matthews L., et al. The zebrafish reference genome sequence and its relationship to the human genome. Nature. 2013;496:498–503. doi: 10.1038/nature12111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Herberg S., Gert K.R., Schleiffer A., Pauli A. The Ly6/uPAR protein Bouncer is necessary and sufficient for species-specific fertilization. Science. 2018;361:1029–1033. doi: 10.1126/science.aat7113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Vastenhouw N.L., Cao W.X., Lipshitz H.D. The maternal-to-zygotic transition revisited. Development (Cambridge, U. K.) 2019;146:dev161471. doi: 10.1242/dev.161471. [DOI] [PubMed] [Google Scholar]
  • 85.Schulte-Merker S., Stainier D.Y.R. Out with the old, in with the new: reassessing morpholino knockdowns in light of genome editing technology. Development (Cambridge, U. K.) 2014;141:3103–3104. doi: 10.1242/dev.112003. [DOI] [PubMed] [Google Scholar]
  • 86.Kok F.O., Shin M., Ni C.W., Gupta A., Grosse A.S., van Impel A., Kirchmaier B.C., Peterson-Maduro J., Kourkoulis G., Male I., et al. Reverse genetic screening reveals poor correlation between morpholino-induced and mutant phenotypes in zebrafish. Dev. Cell. 2015;32:97–108. doi: 10.1016/j.devcel.2014.11.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Joris M., Schloesser M., Baurain D., Hanikenne M., Muller M., Motte P. Number of inadvertent RNA targets for morpholino knockdown in Danio rerio is largely underestimated: evidence from the study of Ser/Arg-rich splicing factors. Nucleic Acids Res. 2017;45:9547–9557. doi: 10.1093/nar/gkx638. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Goudarzi M., Berg K., Pieper L.M., Schier A.F. Individual long non-coding RNAs have no overt functions in zebrafish embryogenesis, viability and fertility. Elife. 2019;8:e40815. doi: 10.7554/eLife.40815. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Gentsch G.E., Spruce T., Monteiro R.S., Owens N.D.L., Martin S.R., Smith J.C. Innate immune response and off-target mis-splicing are common morpholino-induced side effects in Xenopus. Dev. Cell. 2018;44:597–610.e10. doi: 10.1016/j.devcel.2018.01.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Lai J.K.H., Gagalova K.K., Kuenne C., El-Brolosy M.A., Stainier D.Y.R. Induction of interferon-stimulated genes and cellular stress pathways by morpholinos in zebrafish. Dev. Biol. 2019;454:21–28. doi: 10.1016/j.ydbio.2019.06.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Shmakov S., Abudayyeh O.O., Makarova K.S., Wolf Y.I., Gootenberg J.S., Semenova E., Minakhin L., Joung J., Konermann S., Severinov K., et al. Discovery and functional characterization of diverse class 2 CRISPR-cas systems. Mol. Cell. 2015;60:385–397. doi: 10.1016/j.molcel.2015.10.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Anantharaman V., Makarova K.S., Burroughs A.M., Koonin E.V., Aravind L. Comprehensive analysis of the HEPN superfamily: identification of novel roles in intra-genomic conflicts, defense, pathogenesis and RNA processing. Biol. Direct. 2013;8:15. doi: 10.1186/1745-6150-8-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Abudayyeh O.O., Gootenberg J.S., Konermann S., Joung J., Slaymaker I.M., Cox D.B.T., Shmakov S., Makarova K.S., Semenova E., Minakhin L., et al. C2c2 is a single-component programmable RNA-guided RNA-targeting CRISPR effector. Science. 2016;353:aaf5573. doi: 10.1126/science.aaf5573. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Abudayyeh O.O., Gootenberg J.S., Essletzbichler P., Han S., Joung J., Belanto J.J., Verdine V., Cox D.B.T., Kellner M.J., Regev A., et al. RNA targeting with CRISPR-Cas13. Nature. 2017;550:280–284. doi: 10.1038/nature24049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.East-Seletsky A., O'Connell M.R., Burstein D., Knott G.J., Doudna J.A. RNA targeting by functionally orthogonal type VI-A CRISPR-cas enzymes. Mol. Cell. 2017;66:373–383.e3. doi: 10.1016/j.molcel.2017.04.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.East-Seletsky A., O'Connell M.R., Knight S.C., Burstein D., Cate J.H.D., Tjian R., Doudna J.A. Two distinct RNase activities of CRISPR-C2c2 enable guide-RNA processing and RNA detection. Nature. 2016;538:270–273. doi: 10.1038/nature19802. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Smargon A.A., Cox D.B.T., Pyzocha N.K., Zheng K., Slaymaker I.M., Gootenberg J.S., Abudayyeh O.A., Essletzbichler P., Shmakov S., Makarova K.S., et al. Cas13b is a type VI-B CRISPR-associated RNA-guided RNase differentially regulated by accessory proteins Csx27 and Csx28. Mol. Cell. 2017;65:618–630.e7. doi: 10.1016/j.molcel.2016.12.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Shmakov S., Smargon A., Scott D., Cox D., Pyzocha N., Yan W., Abudayyeh O.O., Gootenberg J.S., Makarova K.S., Wolf Y.I., et al. Diversity and evolution of class 2 CRISPR-Cas systems. Nat. Rev. Microbiol. 2017;15:169–182. doi: 10.1038/nrmicro.2016.184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Yan W.X., Chong S., Zhang H., Makarova K.S., Koonin E.V., Cheng D.R., Scott D.A. Cas13d is a compact RNA-targeting type VI CRISPR effector positively modulated by a WYL-domain-containing accessory protein. Mol. Cell. 2018;70:327–339.e5. doi: 10.1016/j.molcel.2018.02.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Konermann S., Lotfy P., Brideau N.J., Oki J., Shokhirev M.N., Hsu P.D. Transcriptome engineering with RNA-targeting type VI-D CRISPR effectors. Cell. 2018;173:665–676.e14. doi: 10.1016/j.cell.2018.02.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Kushawah G., Hernandez-Huertas L., Abugattas-Nuñez Del Prado J., Martinez-Morales J.R., DeVore M.L., Hassan H., Moreno-Sanchez I., Tomas-Gallardo L., Diaz-Moscoso A., Monges D.E., et al. CRISPR-Cas13d induces efficient mRNA knockdown in animal embryos. Dev. Cell. 2020;54:805–817.e7. doi: 10.1016/j.devcel.2020.07.013. [DOI] [PubMed] [Google Scholar]
  • 102.Hernandez-Huertas L., Kushawah G., Diaz-Moscoso A., Tomas-Gallardo L., Moreno-Sanchez I., da Silva Pescador G., Bazzini A.A., Moreno-Mateos M.A. Optimized CRISPR-RfxCas13d system for RNA targeting in zebrafish embryos. STAR Protoc. 2022;3:101058. doi: 10.1016/j.xpro.2021.101058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Ai Y., Liang D., Wilusz J.E. CRISPR/Cas13 effectors have differing extents of off-target effects that limit their utility in eukaryotic cells. Nucleic Acids Res. 2022;50:e65. doi: 10.1093/nar/gkac159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Pauli A., Valen E., Lin M.F., Garber M., Vastenhouw N.L., Levin J.Z., Fan L., Sandelin A., Rinn J.L., Regev A., Schier A.F. Systematic identification of long noncoding RNAs expressed during zebrafish embryogenesis. Genome Res. 2012;22:577–591. doi: 10.1101/gr.133009.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Hyatt T.M., Ekker S.C. Vectors and techniques for ectopic gene expression in zebrafish. Methods Cell Biol. 1999;59:117–126. doi: 10.1016/s0091-679x(08)61823-3. [DOI] [PubMed] [Google Scholar]
  • 106.Choi H.M.T., Schwarzkopf M., Fornace M.E., Acharya A., Artavanis G., Stegmaier J., Cunha A., Pierce N.A. Third-generation in situ hybridization chain reaction: multiplexed, quantitative, sensitive, versatile, robust. Development (Cambridge, U. K.) 2018;145:dev165753. doi: 10.1242/dev.165753. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Bogdanović O., Fernández-Miñán A., Tena J.J., de la Calle-Mustienes E., Gómez-Skarmeta J.L. The developmental epigenomics toolbox: ChIP-seq and MethylCap-seq profiling of early zebrafish embryos. Methods. 2013;62:207–215. doi: 10.1016/j.ymeth.2013.04.011. [DOI] [PubMed] [Google Scholar]
  • 108.Doganli C., Sandoval M., Thomas S., Hart D. Assay for transposase-accessible chromatin with high-throughput sequencing (ATAC-Seq) protocol for zebrafish embryos. Methods Mol. Biol. 2017;1507:59–66. doi: 10.1007/978-1-4939-6518-2_5. [DOI] [PubMed] [Google Scholar]
  • 109.Moreno-Mateos M.A., Vejnar C.E., Beaudoin J.D., Fernandez J.P., Mis E.K., Khokha M.K., Giraldez A.J. CRISPRscan: designing highly efficient sgRNAs for CRISPR-Cas9 targeting in vivo. Nat. Methods. 2015;12:982–988. doi: 10.1038/nmeth.3543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Vejnar C.E., Moreno-Mateos M.A., Cifuentes D., Bazzini A.A., Giraldez A.J. Optimization strategies for the CRISPR-cas9 genome-editing system. Cold Spring Harb. Protoc. 2016;2016 doi: 10.1101/pdb.top090894. pdb.top090894. [DOI] [PubMed] [Google Scholar]
  • 111.Autran D., Baroux C., Raissig M.T., Lenormand T., Wittig M., Grob S., Steimer A., Barann M., Klostermeier U.C., Leblanc O., et al. Maternal epigenetic pathways control parental contributions to Arabidopsis early embryogenesis. Cell. 2011;145:707–719. doi: 10.1016/j.cell.2011.04.014. [DOI] [PubMed] [Google Scholar]
  • 112.Nodine M.D., Bartel D.P. Maternal and paternal genomes contribute equally to the transcriptome of early plant embryos. Nature. 2012;482:94–97. doi: 10.1038/nature10756. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Zhao P., Zhou X., Shen K., Liu Z., Cheng T., Liu D., Cheng Y., Peng X., Sun M.X. Two-step maternal-to-zygotic transition with two-phase parental genome contributions. Dev. Cell. 2019;49:882–893.e5. doi: 10.1016/j.devcel.2019.04.016. [DOI] [PubMed] [Google Scholar]
  • 114.Otoupal P.B., Cress B.F., Doudna J.A., Schoeniger J.S. CRISPR-RNAa: targeted activation of translation using dCas13 fusions to translation initiation factors. Nucleic Acids Res. 2022;50:8986–8998. doi: 10.1093/nar/gkac680. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Yang L.Z., Wang Y., Li S.Q., Yao R.W., Luan P.F., Wu H., Carmichael G.G., Chen L.L. Dynamic imaging of RNA in living cells by CRISPR-cas13 systems. Mol. Cell. 2019;76:981–997.e7. doi: 10.1016/j.molcel.2019.10.024. [DOI] [PubMed] [Google Scholar]

Articles from iScience are provided here courtesy of Elsevier

RESOURCES