Abstract
TGF-β family proteins including Nodal are known as central regulators of early development in metazoans, yet our understanding of the scope of Nodal signaling’s downstream targets and associated physiological mechanisms in specifying developmentally appropriate cell fates is far from complete. Here, we identified a highly conserved, transmembrane micropeptide—NEMEP—as a direct target of Nodal signaling in mesendoderm differentiation of mouse embryonic stem cells (mESCs), and this micropeptide is essential for mesendoderm differentiation. We showed that NEMEP interacts with the glucose transporters GLUT1/GLUT3 and promotes glucose uptake likely through these interactions. Thus, beyond expanding the scope of known Nodal signaling targets in early development and showing that this target micropeptide augments the glucose uptake during mesendoderm differentiation, our study provides a clear example for the direct functional impact of altered glucose metabolism on cell fate determination.
Subject terms: Growth factor signalling, Embryonic stem cells, Differentiation
Fu et al. identify the highly conserved, transmembrane micropeptide, NEMEP, as a direct target of Nodal signaling, essential for mesendoderm differentiation. NEMEP interacts with the glucose transporters GLUT1/GLUT3 and promotes glucose uptake.
Introduction
Glucose is a prominent carbohydrate in the metabolism of most organisms, and an increasing number of developmental biology studies are lending support to the hypothesis that glucose metabolic processes per se functionally contribute to cell fate determination in early embryogenesis1–3. Mouse embryos start to rely on glucose as a major energy source at the stage of compaction and blastocyst formation (around E4–E5)4, and there is genetic evidence confirming that active glycolysis is required for gastrulation5. Clear phenotypes support that two major glucose transporter proteins (GLUT1 and GLUT3) are essential for early development: Glut1−/− mice are embryonic lethal at E106, and deletion of GLUT3 results embryonic lethality at the gastrulation stage7. And studies have shown that increasing the expression of GLUT1 and GLUT3 in the pluripotent inner cell mass (ICM) increases glucose uptake, which in turn triggers enhancement of glycolytic flux for ATP generation and a subsequent increase in lactate excretion8–10. Despite these exciting observations, to date very little is known about nature of any regulatory system(s) that direct the activation of glucose uptake to meet the specific needs of distinct cell populations during early development.
Transforming growth factor β (TGF-β) signaling represents a centrally important regulatory influence on early development, and members of the TGF-β family are also known to function in widespread and diverse roles in tissue homeostasis, wound healing, immunity, and metabolism11–15. These proteins mediate their multifunctional effects in cells by eliciting transcriptional responses on many target genes, specifically through the receptor-activated SMADs (R-SMADs). R-SMADs—SMAD2 and SMAD3 for TGFβ, Nodal, Activin, and myostatin signaling—translate TGF-β pathway signals into the specific transcriptional programs in specific cell contexts, doing so via their interactions with SMAD4 and other transcriptional activators, coactivators/corepressors, and/or epigenetic regulators at enhancer elements in specific genes16–21. Nodal signaling initiates gastrulation in vivo and promotes embryonic stem cells (ESCs) to differentiate towards mesendoderm fates in vitro; these regulatory processes are mediated by the transcriptional upregulation of so-called “mesendoderm lineage determining transcription factors” (LDTFs)22–24. Further, the interactions of SMAD2 and SMAD3 with FOXH1 and TRIM33 are known to regulate essential transcriptional programs to direct mesendoderm differentiation25–27.
The insights about transcriptional regulation of mesendoderm LDTFs have advanced our understanding about how Nodal signaling regulates mesendoderm differentiation, but it is far from clear that the only impacts of Nodal signalling on early development are mesendoderm LDTFs. Long non-coding RNAs (lncRNAs) are defined as polyadenylated RNA molecules which are longer than 200 nucleotides, are weakly conserved, and are transcribed by RNA polymerase II28–30. Studies have shown that lncRNAs exert diverse functions in biological processes, including development, metabolism, immunity, and disease31–36. Our understanding of lncRNAs is expanding rapidly, and it is now known that some cytosol-localized lncRNAs can be translated into functional micropeptides in vivo37.
Here, we identified a Nodal signaling direct target gene, Gm11549, which was originally annotated as a lncRNA. To our considerable surprise, we later found that Gm11549 actually encodes a highly conserved 63 amino acid single-pass transmembrane micropeptide, NEMEP (Nodal Enhanced MEsendoderm Peptide). Through multiple follow-up functional investigations, we discovered that NEMEP interacts with two glucose transporters (GLUT1 and GLUT3) and promotes glucose uptake. Interestingly, NEMEP is specifically accumulated in the primitive streak of mouse embryo at E7.0, and depletion of NEMEP causes two strong phenotypes: dramatic impairment of mesendoderm differentiation and a significant decrease in glucose uptake in early differentiated mESCs. Hence, our study establishes an additional role of Nodal signaling in mesendoderm differentiation beyond its induction of LDTFs and shows how a Nodal-regulated micropeptide functions to augment glucose uptake into specific subpopulations of mesendodermal cells.
Results
Gm11549 is a direct target gene of Nodal signaling
Mouse embryonic stem cells (mESCs) are widely used to experimentally recapitulate early embryonic development, for example with in vitro assays examining embryoid body (EB) formation. The LIF/STAT3 (Leukemia Inhibitory Factor/Signal Transducer and Activator of Transcription-3) signaling pathway promotes self-renewal and blocks the differentiation of mouse ESCs. LIF removal from culture media is the first step in a common protocol for inducing the differentiation of mESCs38–40. This system is developmentally informative because mESCs have the potential to differentiate into all three germ layers upon selective exposure to appropriate culture conditions41. mESCs express ALK4, ALK7, ActR-II, and ActR-IIB and produce autocrine Nodal42. We set out to explore lncRNAs responsive to Nodal signaling during mesendoderm differentiation. Activin A was used as a substitute for Nodal in our study because it is easier-to-obtain and because these two protein ligands act through the same receptors; one notable distinction is that Nodal requires the co-receptors Cryptic and Cripto; Activin A does not43.
A previous RNA-seq study of day 2.5 EBs treated with either the Nodal signaling agonist Activin A (hereafter, Activin) or ALK4/5/7 inhibitor SB431542 globally characterized the mesendoderm LDTFs genes induced by Nodal/Activin signaling44,45: these include the homeobox protein gene Mixl1, the T-box transcription factor Brachyury gene (Bra; also known as T), the homeobox protein gene Goosecoid (Gsc), and the hepatocyte nuclear factor 3-β gene Foxa2. Note that each of these LDTFs is gradually induced in EBs between day 2 and day 4. Here, we re-analyzed the Wang et al. (2017) dataset44 with a particular focus on identifying Nodal/Activin signaling target lncRNAs, and thus identified 62 lncRNAs whose expression in SB431542-treated cells was increased or decreased by at least 2-fold upon Activin addition (Fig. 1a and Supplementary Fig. 1a).
Among the Nodal/Activin responsive lncRNAs, we subsequently filtered for candidates using two criteria: strong expression in EBs at day 3 and day 4 and the presence of a predicted SMAD2 or SMAD3 binding site in their promoter regions25,44. This identified one candidate, Gm11549 (Fig. 1b, Supplementary Fig. 1b, c), and we characterized the Gm11549 transcript using 3′end RACE from EBs at day 3, and the Gm11549 transcript we found in EBs differs from Genbank and Refseq (Fig. 1b). ChIP-seq analysis showed that the Gm11549 locus does have TRIM33, SMAD2 or SMAD3, SMAD4, and FOXH1 occupancy at its promoter region in Activin-treated EBs at day 2.5 and does not have SMAD2 or SMAD3 binding in SB431542-treated cells, suggesting that Gm11549 expression is responsive to Nodal/Activin signaling (Fig. 1b). Pretreatment of cycloheximide, the protein synthesis inhibitor, to the EBs did not block Activin response of Gm11549 demonstrating that Gm11549 is a primary transcriptional target of Nodal/Activin signaling, but is not regulated by the Nodal/Activin signaling downstream target genes (Fig. 1c).
We then examined the distribution of Gm11549 transcripts throughout an in vitro differentiation time course of EBs and found that Gm11549 is induced at a similar time as the known mesendodermal marker genes Mixl1, Gsc, T, and Foxa2 (Fig. 1d). Consistently, we analyzed published in vivo data46,47 for mouse embryos and found that, Gm11549 transcripts accumulate in the posterior primitive streak of mid-gastrulation embryos (~E7.0) (Fig. 1e and Supplementary Fig.1d). Notably, in adult mouse tissues, Gm11549 is highly abundant and enriched in the brain compared to other organs (Supplementary Fig. 1e).
To determine which factors in Nodal/Activin signaling induce Gm11549 transcription, we examined Gm11549 levels and responsivity to Activin in series of gene knockout mESCs. Briefly, both Gm11549 transcription and responsivity of Gm11549 to Nodal/Activin are dependent on SMAD2, SMAD3, and SMAD4 individually, as both phenotypes were impaired in Smad2 or Smad3 or Smad4 depletion cells or Smad2 KO and Smad3 KD cells (Fig. 1f, and Supplementary Fig. 1f, g). We also found that TRIM33— a Nodal signaling-specific chromatin reader that associates with H3K9me3 and H3K18ac dual marks to regulate the expression of mesendoderm LDTFs21,25—is essential for Gm11549 induction during mesendoderm differentiation and Gm11549 transcription was not induced by Nodal/Activin stimulation in Trim33 null cells (Fig. 1g, and Supplementary Fig. 1h, i). Finally, we found that Gm11549 transcriptional induction during mesendoderm differentiation requires the known early development regulator FOXH1 (Fig. 1h, and Supplementary Fig. 1j, k). In addition, depletion of co-receptors of Nodal, Cryptic, and Cripto (Supplementary Fig. 1l), dramatically impairs Gm11549 expression during mesendoderm differentiation, as well as Mixl1 (Fig. 1i, Supplementary Fig. 1m), suggesting that Gm11549 is indeed a Nodal signaling target gene. Collectively, these results establish that Nodal signaling components directly induce the transcription of Gm11549 during mesendoderm differentiation and demonstrate that Gm11549 is specifically expressed in the primitive streak.
Gm11549 encodes a transmembrane micropeptide: NEMEP
Portions of the Gm11549 genomic sequence are highly conserved between human and mouse (Fig. 2a); specifically, exon1 (224–405) and exon3 (1121–1446) share 87% and 75% identity respectively at nucleic acid sequence level, whereas there is no obvious conservation for introns. Moreover, the neighboring genes on both sides of the Gm11549 locus are conserved in both species (Fig. 2a). RNA-FISH (Fluorescence in situ hybridization) and fractionation-based analyses revealed that Gm11549 transcript is mainly localized in the cytoplasm (Supplementary Fig. 2a, b). Recent studies reported that some cytosol-localized lncRNAs can be translated into functional micropeptides in vivo37, which prompted us to further investigate whether Gm11549 may have the potential to be translated into a protein product.
First, we identified 4 putative ORFs in a sense (+) orientation, with initiating ATG codon (Fig. 2b). After cloning these ORFs, we examined the translational potential of these ORFs by transiently expressing them as C-terminal tagged fusions bearing FLAG epitopes in HEK293T cells. Only the ORF1-FLAG was detected by western blotting (Fig. 2c); mutagenesis at the ATG codon abolished ORF1-FLAG expression (Supplementary Fig. 3a, b); note that ORF1 encodes a 63 amino acid micropeptide that is conserved among mammals (Fig. 2d). Moreover, polysome profiling analysis of EBs at day 3 showed that Gm11549 is strongly associated with the polysome-like coding gene Gsc, whereas non-coding RNA H19 is associated with the monosomes (Fig. 2e). These data provided additional evidence that Gm11549 has translation potential.
Further, peptides from ORF1 were detected in HEK293T cells transiently expressing the ORF1-FLAG product by pull-down with the FLAG antibody followed by mass-spectrometry analysis (Fig. 2f). Moreover, we raised a polyclonal rabbit antibody against a 25-residue region of the ORF1 gene product (Fig. 2d), an analysis of adult mouse brain samples with pull-down using the polyclonal antibody again detected the Gm11549 ORF1 protein (Fig. 2g). We generated ORF1-FLAG knock-in mESCs using CRISPR/Cas9 (Supplementary Fig. 3c), and a FLAG-tagged protein with a molecular weight of about 7KD was detected in these cells by western blotting (Fig. 2h). These results together establish that the micropeptide from Gm11549 ORF1 is indeed translated in vivo. We refer to this Nodal signaling target gene product as Nodal Enhanced MEsendoderm microPeptide (NEMEP).
The human ortholog of Gm11549 is TMEM155, which was annotated as a coding gene for a 130 aa gene product; note that there are to date no reports of experimental confirmation of the existence of these 130 aa proteins. In experiments to examine the translation product of TMEM155, we performed similar assays as with Gm11549 and found that TMEM155 likewise encodes a 63 aa peptide (“hNEMEP”, sharing 93.6% identity with mNEMEP) (Supplementary Fig. 3d).
The protein topology prediction analyses indicated that NEMEP is a single-pass transmembrane protein (TMHMM Server 2.0) (Supplementary Fig. 3e), and NEMEP has a predicted alpha-helix domain between residues 7 and 29 (Supplementary Fig. 3f). Immunostaining of ectopically expressed, FLAG-tagged NEMEP in mouse ES cells revealed that NEMEP was localized at the plasma membrane (Fig. 2i), and further, fractionation-based analysis and immunoblotting of mESCs expressing NEMEP-GFP indicated strong accumulation of an NEMEP-GFP fusion protein in the membrane fraction (Fig. 2j).
NEMEP is required for mesendoderm differentiation
We next used CRISPR/Cas9 to generate a Gm11549-NEMEP frameshift mutants (hereafter, Nemep KO): we picked two frameshift mutant colonies, one with an 8nt deletion (KO-1) and another with a 77nt deletion (KO-2); neither of these mutant Gm11549 loci produced the NEMEP peptide in cells (Supplementary Fig. 4a–c). However, note that the in vitro differentiation induced elevation of Gm11549 RNA transcription was not altered by either of these mutations (Fig. 3a). The absence of NEMEP in EBs dramatically impaired the expression of well-known mesendoderm LDTFs including Gsc, Mixl1, T, Eomes, Foxa2, and Sox17 (Fig. 3a). Consequently, the expression of the late mesoderm marker Nkx2–5 was significantly decreased in the Nemep KO EBs (Fig. 3a). Strikingly, although Gm11549 transcription is induced during ectoderm differentiation, we found that this RNA molecule is not required for ectoderm differentiation (Fig. 3b and Supplementary Fig. 4d). We also found that the phosphorylation level of SMAD2 and SMAD3 at C-terminal tail is not affected in Nemep KO cells compared to wild type cells (Fig. 3c), indicating that NEMEP deletion does not interfere with the activity of known Nodal/Activin signaling receptors. Moreover, we found that neither depletion of exon2 or exon3 has impact on mesendoderm differentiation, excluding any regulatory impacts from Gm11549 region other than the 5′ UTR and coding region for 63 aa NEMEP (Supplementary Fig. 4a and 5a, b). This data also suggests that it is less likely Gm11549 RNA contributes to regulate mesendoderm differentiation.
Again, consistent with the EB differentiation functional impact from the NEMEP peptide frameshift mutation (Nemep KO), we found that mESCs harboring a promoter KO variant of the Gm11549 locus (generated by CRISPR/Cas9) (Supplementary Fig. 4a and 5c) displayed defects during in vitro EB differentiation, including a sharply reduced extent of Gm11549 induction and strong reductions in the expression of mesendoderm genes as shown by the transcriptome analysis (Fig. 3d). Interestingly, some pluripotency genes such as Esrrb, Dnmt3l, and Utf1 were not declined in Gm11549 promoter KO EBs compared to the wild type EBs (Fig. 3d). Consistent with this notion of NEMEP-dependent progression towards a mesendoderm differentiation fate, the FOXA2, and T protein levels were significantly decreased in cells harboring mutant EBs including the frameshift and the promoter KO mutant cells by immunostaining assay (Fig. 3e, f). Offering further support, similar results were obtained in RNAi and CRISPRi-dCas9-KRAB experiments for depletion of Gm11549 transcripts from EBs (Supplementary Fig. 5d, e). Thus, NEMEP is required for mesendoderm differentiation.
NEMEP deletion leads to early defects in mouse embryo chimeras
Embryonic chimeras of the mouse are well-established tools for studying cell lineage and cell potentials48. To further investigate the role of NEMEP in mesendoderm differentiation, we microinjected GFP-labeled WT, or Nemep KO (Nemep−/−) mESCs into WT blastocysts to generate chimeric embryos, which were collected and examined at E7.5 (Fig. 4a). At E7.5, we observed no obvious differences between WT and Nemep−/− chimeras (Fig. 4b). Immunofluorescence analysis of the chimeras for expression of the mesendoderm marker FOXA2 and T showed that WT but not E7.5 Nemep−/− chimeras expressed FOXA2 and T (Fig. 4c, d). These chimeric embryo experiments strongly support that NEMEP is necessary for mesendoderm specification.
NEMEP interacts with GLUT1/GLUT3 and promotes glucose uptake
To explore the molecular mechanism of NEMEP in mesendoderm differentiation, we conducted pull-down experiments with EBs at day 3 using NEMEP-GFP as bait. Mass spectrometry analysis of co-purified proteins revealed two glucose transporters (GLUT1 and GLUT3) among the top-ranking candidate NEMEP-interacting proteins. Co-immunoprecipitation (co-IP) studies validated that both GLUT1 and GLUT3 do physically interact with NEMEP (Fig. 5a, Supplementary Data 1). Then, we confirmed that the homologous human NEMEP protein also interacts with the human GLUT1 and GLUT3 proteins (Fig. 5b). GLUT1 to GLUT4 are class I facilitative glucose transporters49. Glut1 and Glut3 have much higher expression in mESCs compared to Glut2 or Glut4 (Supplementary table 1). We conducted bimolecular fluorescence complementation (BiFC) analysis and successfully validated the interaction between NEMEP and these four of class I facilitative glucose transporters (Supplementary Fig. 6a).
Moreover, both confocal microscope images from BiFC assay and transient transfection of both GLUT1-FLAG or GLUT3-FLAG and NEMEP-HA vectors into mESCs show the colocalization of GLUT1 and GLUT3 with NEMEP on the cell membrane (Supplementary Fig. 6b, c). We also validated that NEMEP binds to endogenous GLUT1 and GLUT3 using NEMEP-3xFLAG knock-in mESCs (Fig. 5c). Further, deletion of the transmembrane domain (TMD) and of a 7-residue region (lacking H51-F57) of the NEMEP C terminus disrupted the interactions between NEMEP and GLUT1/GLUT3 proteins (Fig. 5d and Supplementary Fig. 6d). Domain swapping of the NEMEP-TMD domain for the ACVR1-TMD or ITGB1-TMD also impaired the interactions (Fig. 5d). These results demonstrate that the TMD domain and NEMEP residues H51-F57 are essential for NEMEP-GLUT1 and -GLUT3 interactions.
We next examined whether NEMEP impacts glucose transport by using the CRISPR-dCas9-VP64 activator system (CRISPRa) to generate mESCs that overexpress the endogenous mRNA encoding NEMEP (Gm11549) (Supplementary Fig. 7a). Compared to WT mESCs, we found that overexpression of NEMEP resulted in a ~30% increase in glucose consumption as well as a ~60% increases in lactate excretion (Fig. 5e, f). Consistently, a Seahorse-based analysis measuring the extracellular acidification rate (ECAR) showed that the glycolysis activity of the NEMEP-overexpressing mESCs was significantly higher than in WT mESCs (Fig. 5g and Supplementary Fig. 7b, c). Moreover, the overexpression of human NEMEP in HepG2 cells (Supplementary Fig. 7d) also significantly increased glucose consumption, lactate excretion, glycolysis activity (Supplementary Fig. 7e–g).
We then used Promega Glucose Uptake-GloTM Assays to measure mESC glucose uptake to examine whether NEMEP impacts glucose transporter activity. Note that the measurement readout of the assay specifically reflects the glucose transporter activity. No changes in transporter activity were detected in mESCs expressing diverse NEMEP mutant variants (including TMD deletion, H51-F57 deletion, and the domain-swapped ACVR1-TMD and ITGB1-TMD variants); note that we did detect the expected increase in glucose transporter activity in mESCs expressing the full-length, wild type NEMEP (Fig. 5h).
Intriguingly, we detected NEMEP-mediated enhancements on both glucose uptake and glycolysis activities in mESCs in experiments examining overexpression of GLUT1 or GLUT3. Specifically, the expression of WT NEMEP but not TMD-deletion NEMEP or H51-F57-deletion NEMEP in mESCs overexpressing GLUT1 or GLUT3 resulted in significantly higher glucose uptake compared to the GLUT1 or GLUT3 overexpressing mESCs (Fig. 5i, j). Since H51-F57-deletion NEMEP failed to form complex with GLUT1 or GLUT3 (Fig. 5d), it suggests that NEMEP may act to facilitate glucose uptake through interaction with GLUT1 or GLUT3. However, it does not exclude the possibility that NEMEP might facilitate glucose uptake via other factors. In addition, the expression of WT NEMEP but not TMD-deletion NEMEP in mESCs overexpressing GLUT1 or GLUT3 resulted in significantly higher glycolysis activities compared to the GLUT1 or GLUT3 overexpressing mESCs (Supplementary Fig. 8a–h). Therefore, NEMEP’s boosting glucose uptake likely through interactions with the known glucose transporters GLUT1 and GLUT3.
NEMEP facilitates glucose uptake during mesendoderm differentiation
We next explored the potential impacts of altered glucose uptake and differential glycolysis activity on mesendoderm differentiation. In vitro differentiation assays revealed that decreased glucose concentration in the culture medium reduces the expression levels of mesendoderm developmental marker genes in WT EBs (Fig. 6a). Further, knockdown of GLUT1 and GLUT3 led to severe defects in mesendoderm differentiation and in glucose uptake in EBs at day 3 (Fig. 6b, c). We also found that increasing the glucose concentration in the culture medium caused a significant reduction in the expression levels of multiple mesendoderm developmental marker genes in WT EBs (Fig. 6d). Thus, GLUT1 and GLUT3 are essential for normal mesendoderm differentiation. Moreover, it appears that normal mesendoderm differentiation requires a developmentally-appropriate supply of glucose.
We then measured glucose uptake, ECAR, and oxygen consumption rate (OCR) in WT and Nemep KO EBs at day 3. Compared to WT EBs, the Nemep KO EBs displayed significant reductions in glucose uptake, glycolytic function, and mitochondrial respiration (Fig. 6e–j and Supplementary Fig. 9a–e). Consistently, a metabolomics analysis detected significant declines in the levels of glycolysis and TCA cycle (Tricaboxylic Cycle) metabolites in Nemep KO EBs at day 3 (Fig. 6k). However, a companion RNA-seq analysis found no differences between day 3 WT and Nemep promoter KO EBs for the genes encoding glucose metabolism enzymes or glucose transporters (Supplementary Fig. 9f), and AKT activation which have been reported that regulates glucose uptake in other type of cells is not impacted by depletion of NEMEP (Supplementary Fig. 9g)50–52. These results indicate that the aforementioned physical interaction of NEMEP and GLUT1/GLUT3 may support mesendoderm differentiation by facilitating glucose uptake.
Our analysis of data from a previously published transcriptome analysis of WT EBs from day 0 to day 4 revealed that a group of well-known glucose metabolism genes including Glut1/Glut3 (Slc2a1/ Slc2a3) are up-regulated at day 3 EBs44, the same time point that mesendoderm genes are induced (Supplementary Fig. 9h).
Despite previous reports that TGF-β signaling can impact glucose metabolism in mesangial cells and NIH 3T3 cells53–55, we are unaware of any studies examining whether Nodal signaling regulates glucose metabolism during mesendoderm differentiation. Pursuing this, we found that treatment of day 3 EBs with the known Nodal signaling activator Activin significantly increased glucose uptake (Fig. 7a). Further, compared to WT EBs the extent of glucose uptake was significantly reduced in EBs deficient for the Cripto and Cryptic, deficient for SMAD2 and SMAD3, and deficient for TRIM33 (Fig. 7b–d). Moreover, Activin treatment does not induce glucose uptake in EBs deficient for SMAD2 and SMAD3, or deficient for TRIM33 compared to WT EBs (Fig. 7c, d). Noteworthy, Activin treatment induces glucose uptake in EBs deficient for the Cripto and Cryptic (Fig. 7b), since Cripto/Cryptic-depleted cells remained responsive to Activin43. Together, these in vitro findings indicate that the Nodal signaling functions to positively regulate glucose uptake during mesendoderm differentiation. However, given that the expression of well-known glucose metabolism genes (including Slc2a1 and Slc2a3) was not altered by activation (Activin A treatment) or inhibition (SB431542 treatment) of Nodal signaling activity (Supplementary Fig. 9i), we suspected that the Nodal-signaling-mediated promotion of glucose uptake is through unknown glucose metabolism genes, and NEMEP may be one of them.
To explore potential mechanisms that may reconcile our observation of Nodal signaling’s promotion of glucose uptake during mesendoderm differentiation with our findings that the NEMEP-GLUT1/GLUT3 physical interaction and NEMEP promoting glucose uptake, we conducted experiments combining stimulation of Nodal signaling and genetic knockout of NEMEP. Consistently, we found that the Activin-induced increase in glucose uptake was significantly stronger in WT EBs than in Nemep KO EBs (Fig. 7e). An expression of NEMEP in the Nemep KO EBs rescued the defect of glucose uptake (Fig. 7f and Supplementary Fig. 9j). Given that NEMEP KO does not affect Nodal signaling activity (Fig. 3c) and the fold induction in response to Activin remains the same as in control cells for glucose uptake (Fig. 7e), we suspect that: (i) Nodal signaling induces the transcriptional activation of other, as-yet-unknown genes that somehow regulate glucose uptake; (ii) crosstalk of Nodal signaling with other signaling pathways to facilitate glucose uptake; (iii) Nodal signaling may modulate post-translational modifications (i.e. phosphorylation) of proteins involved in facilitating glucose uptake56. Hence, even if NEMEP is missing, the cells can respond to Nodal signaling to regulate glucose uptake, but at relatively lower level.
That is, our results support a model of mesendoderm differentiation from pluripotent mESCs wherein activated Nodal signaling induces the expression of a micropeptide, NEMEP that may function by interacting with GLUT1 and GLUT3 or other facilitators involved in glucose uptake, and augmenting the glucose uptake to meet the energy needs during mesendoderm differentiation (Fig. 7g).
Discussion
Our study identified a highly conserved cytosolic lncRNA Gm11549 as a direct target of Nodal signaling during mesendoderm differentiation. Interestingly, there is an “ORF” hidden within Gm11549 that encodes a transmembrane micropeptide which we named NEMEP. We discovered that NEMEP is essential for mesendoderm differentiation in mESCs, a finding that both expands the list of Nodal target genes essential for mesendoderm differentiation and provides a clear example for a functional micropeptide acting as a developmental “stage-specific facilitator” during early development. Given the widespread temporal and spatial expression patterns for many lncRNAs during early development46, it seems likely that additional “lncRNA-coding peptides” will be identified along with improvements in bioinformatics and biological research technologies57–61. It is conceivable that such a “facilitator” role for micropeptides could represent a common mechanism for achieving the required fine-scale temporal and spatial regulation to support proper cell fates in early development.
Studies based on ribosomal profiling and mass spectrometry showed that a fraction of putative small open reading frames (sORFs) hidden in some cytosol-localized lncRNAs are translated into functional micropeptides62–64. These micropeptides (shorter than 100 amino acids) are essential in diverse biological processes62,65–71. We speculate that there could be more micropeptides regulated by TGF-β members in different cell contexts. Experimentally deciphering the in vivo production of such micropeptides and their molecular functions can deepen our understanding of the apparently quite broad and mechanistically diverse impacts of TGF-β signaling during early development.
Currently, studies of micropeptides are still in their infancy, and there is insufficient data to enable the definition of a canonical mode of action for these biomolecules. Nevertheless, there are examples for a group of micropeptides that function through their interactions with larger proteins64. By interacting with larger proteins, these micropeptides are involved in regulating the activities of protein complex machinery. Our work in the present study demonstrates that NEMEP interacts with the GLUT1/GLUT3 glucose transporter proteins and augments glucose uptake during mesendoderm differentiation. We validated that glucose is essential for mesendoderm differentiation, and demonstrated that loss of NEMEP both significantly impaired mesendoderm differentiation and decreased glucose uptake. Intriguingly, Nodal signaling induces glucose uptake through NEMEP, and NEMEP can rescue the defect of glucose uptake in Nemep KO cells. Understood together, these results establish that the Nodal signaling mediated induction of NEMEP regulates mesendoderm differentiation by facilitating glucose uptake.
The precise biochemical mechanism whereby NEMEP cooperates with GLUT1/GLUT3 to facilitate glucose uptake remains to be elucidated. Although we have lines of evidence supporting the interaction of NEMEP and GLUT1/GLUT3 (BiFC, co-IP), the precise nature of their binding interactions is not yet clear. If the interaction is direct, then it will be interesting to solve a crystal or cryo-EM structure of NEMEP in complex with GLUT1/GLUT3, which should help reveal how the interaction with the glucose transporters facilitates glucose uptake. If the interaction is indirect, then identification of other components in the complex should still enable similar structure-based studies to reveal NEMEP’s glucose-uptake augmentation mechanism.
Beyond its long-understood central functions in cellular energy production, there is growing evidence supporting that glycolysis must be understood as a crucial regulator of cell identity under physiological conditions4,5,72. For example, a study of human pluripotent stem cells demonstrated metabolic switching from glycolysis to oxidative phosphorylation during mesoderm and endoderm differentiation; this switch was not detected during ectoderm differentiation3. A seminal genetics study demonstrated that glycolysis is necessary for gastrulation in mouse5. The primitive streak is the organizing center for amniote gastrulation; this region is known to define the future embryo midline and to serve as a conduit of cell migration for germ layer formation73,74. The RNA transcript for NEMEP is specifically expressed in the primitive streak of E7.0 mouse embryos. And the expression pattern is highly similar to that of mesendoderm LDTFs (e.g., Mixl1, Gsc, T, and Foxa2) which are Nodal signaling transcriptional target genes. Given its function in augmenting glucose uptake, it is plausible to speculate that the induction of NEMEP by Nodal signaling at the primitive streak “turns on” a precisely spatiotemporally regulated additional glucose source to meet the highly specific energy needs of a small number of cells at this development stage. At minimum, our discoveries provide another line of evidence illustrating how metabolism actively regulates cell identity during embryonic development.
Our study also has implications outside of early development. Recall our finding that the brain is apparently the only adult tissue in which NEMEP is highly expressed. Given the known role of glucose as the required fuel for brain function under non-starvation conditions (more than 40% of the blood glucose is consumed by brain)75,76, we speculate that NEMEP may function to help meet the energy requirements for brain function. Conditional knockout mice will be helpful for elucidating this potential role of NEMEP in the brain, as will profiling of NEMEP expression in particular brain structures in differential states of activation. It is conceivable that specific cells in the brain may employ NEMEP to augment glucose uptake in a manner similar to our observations from mesendoderm differentiation.
Methods
Animal ethics statement
Experiments involving mice were all in accordance with institutional guidelines for the care and use of animals. The animal experiments that were conducted as part of this research were completed in accordance with the guidelines provided by the Tsinghua University Institutional Animal Care and Use Committee (IACUC) and were in compliance with the relevant ethical regulations regarding animal research. All animal procedures were approved by Institutional Animal Care and Use Committee of Tsinghua University (IACUC:16-XQR2). C57BL/6J mouse strains were obtained from Laboratory Animal Research Center in Tsinghua University. C57BL/6J mouse strains were housed under SPF (specific-pathogen-free) at 20–22 degree with 12 h:12 h light: dark cycles at 50–60% humidity.
Generation of chimeric embryos
GFP-expressing WT and Nemep−/− (Nemep KO) mESCs were micro-injected after culturing for 3 days on irradiated mouse embryonic fibroblast (MEF) feeder layers. 15 to 20 mESCs from each group were injected into embryonic day (E) 3.5 blastocysts (C57BL/6J). Injected blastocysts were cultured in KSOM/AA (Millipore) at 37 oC in an atmosphere of 5% CO2 for 2–3 h to allow for recovery of blastocyst morphology and then implanted into the uterine horns (up to 10 embryos per horn) of E2.5 pseudopregnant females (C57BL/6J). Chimeric embryos were recovered at E7.5. All the C57BL/6J mouse strains were obtained from Laboratory Animal Research Center in Tsinghua University. Female mice were used between ages 6–8 weeks.
Cell lines
Mouse ES cell lines E14Tg2a.IV were purchased from the American Type Culture Collection (ATCC; https://www.atcc.org:443/). mESCs E14Tg2a.IV, CCE, Smad4 null (BNN)77, Smad2 null (KT15)78, Smad3 KD/Smad2 null, and Trim33 null25 lines were maintained on 0.1% gelatin-coated plates in LIF-supplemented medium at 37 °C with 5% CO2 as previously described25,79. Basic mESC medium contains Dulbecco’s modified Eagle’s medium (DMEM, Thermofisher), 15% Fetal Bovine Serum (FBS) (ExCell Bio), 1% penicillin-streptomycin-amphotericin B solution (Biological Industries), 1% non-essential amino acids (Thermofisher), 1% L-glutamine (Biological Industries), 1% sodium pyruvate (Sigma), 100 μM β-mercaptoethanol (Sigma), 103 U/mL mouse LIF. HEK293T and Hep G2 cells were cultured in DMEM supplemented with 10% FBS (ExCell Bio) and 1% penicillin-streptomycin-amphotericin B solution. We maintained all cell lines at 37 oC in a 5% CO2 cell culture incubator and tested all cell lines routinely for mycoplasma contamination.
mESCs differentiation
Mouse EBs formation and differentiation were carried out as described by the supplier (ATCC). Briefly, for mesendoderm differentiatrion, mESCs were cultured in mESC medium without LIF in an ultra-low attachment dish, embryoid bodies(EBs) were collected at the indicated time points.Ectoderm differentiation assays were carried out as described previously80,81. Briefly, mESCs were cultured in mESC medium without LIF in an ultra-low attachment dish, and 1 µM Retinoic acid was added in culture medium. Samples were collected at the indicated time points, and medium was changed every other day.
shRNA mediated transcript knocking down
Annealed shRNAs oligonucleotides were cloned into AgeI- and EcoRI-digested pLKO.1-puro lentiviral vector (Addgene #8453). shRNA sequences for Gm11549, Slc2a1, and Slc2a3 are listed in Supplementary Table 2. Lentivirus packaging was performed as previously described25. HEK293T cells were plated onto a 10 cm dish, and then were transfected with 12 µg pLKO.1 vector along with 6 µg pMDL-RRE(packaging plasmid), 3 µg pVSVG (envelope plasmid), and 3 µg pRev (packaging plasmid) and using Lipofectamine 2000 reagent (Invitrogen), following the manufacturer’s protocol. Medium was changed 24 h later. The lentivirus-containing medium were collected at 48 h and 72 h post-transfection and incubated overnight at 4 °C with lenti-X concentrator (631232, TaKaRa), and were then spun at 3000 g at 4 °C for 30 min. The viral pellets were used for transducing the cells.
Rapid amplification of cDNA ends (RACE)
5′ and 3′ RACE was performed using the primers listed in Supplementary Table 3 and the SMART RACE cDNA Amplification Kit (Clontech), according to the manufacturer’s recommendations.
CRISPR-Cas9 mediated genome editing
The designated sgRNA sequences were designed using online software (https://portals.broadinstitute.org/gpp/public/analysis-tools/sgrna-design).
The annealed oligonucleotide pairs for the mutants: Gm11549 promoter knock-out (KO) (promoter-KO-1 and promoter-KO-2), exon KOs (exon-1 KO, exon-2 KO, exon-3 KO-1, and exon-3 KO-2), NEMEP frameshift mutants (KO-1 and KO-2) and 3xFLAG knock-in (KI), was ligated into BbsI-linearized pSpCas9 (BB)-2A-GFP (PX458) (Addgene #48138)82.
To generate Gm11549 promoter KOs and variant exon KOs mESCs, two PX458-sgRNAs to designated mutants were co-transfected into mESCs using Lipofectamine 2000 (Life Technologies) according to the manufacturer’s instructions. 48 h post-transfection, GFP-positive cells were sorted by Fluorescence-activated Cell Sorting (FACS) as a single cell and cultured in a 96-well plate for 2 weeks. PCR-based genotyping was applied to pick clones showing the deletion of the targeted region in Gm11549/NEMEP genomic DNA.
To generate NEMEP frameshift mutants (Nemep KO) mESCs, PX458-sgRNA was transfected into mESCs using Lipofectamine 2000. 48 h post-transfection, GFP-positive cells were sorted by FACS as a single cell and cultured in a 96-well plate for 2 weeks. PCR primers flanking the targeted region were used to amplify the affected region for genotyping. After AfeI (NEB) digestion for 2 h at 37 °C, the PCR product was electrophoresed in an agarose gel to identify homozygous wild-type, homozygous knockout, and heterozygous clones.
To generate NEMEP-3xFLAG KI mESCs, PX458-sgRNA and the donor plasmid were co-transfected into mESCs using Lipofectamine 2000. 48 h post-transfection, GFP-positive cells were sorted by FACS sorted as a single cell and cultured in a 96-well plate for 2 weeks. PCR primers flanking the homologous arms were used. After EcoRV (NEB) digestion for 2 h at 37 °C, the PCR products were electrophoresed in agarose gel to identify wild-type and 3xFLAG knock-in clones, which were confirmed by immunoblot.
For activating endogenous Gm11549 expression (CRISPRa)83: sgRNAs were cloned into MS2-sgRNA-Zeo (Addgene #61427) using the BsmBI restriction sites. Lentiviruses containing each of the plasmids (dCAS-VP64-Blast (Addgene #61425), MS2-P65-HSF1-Hygro (Addgene #61426), and MS2-sgRNA-Zeo) were generated, and mESCs were infected with a 1:1:1 mix of the viruses in the presence of 8 μg/mL polybrene (Sigma). Infected cells were selected for 48 h with 10 μg/mL blasticidin (Selleck), 250 μg/mL hygromycin (BIOBYING), and 250 μg/mL zeocin (InvivoGen).
For repressing endogenous Gm11549 expression (CRISPRi)84: sgRNAs were cloned into MS2-sgRNA-Zeo (Addgene #61427) using BsmBI restriction sites. Lentiviruses containing each of the plasmids (dCAS9-KRAB-Blast (Addgene #50919) and MS2-sgRNA-Zeo) were generated, and mESCs were infected with a 1:1 mix of the viruses in the presence of 8 μg/mL Polybrene. Infected cells were selected for 48 h with 10 μg/mL blasticidin and 250 μg/mL zeocin. All sgRNAs and PCR primers sequences are listed in Supplementary Tables 4 and 5.
RNA-seq and ChIP-seq
The RNA-seq library preparation and sequencing were performed as described44. For RNA-seq data analysis, we first evaluated RNA-seq reads quality using FastQC (version 0.10.1). Then, the reads were mapped to the mouse reference genome (mm10) using TopHat (version 2.0.10) with default parameters, and the mouse reference genome sequence was downloaded from Ensembl (Mus musculus GRCm38/mm10). Next, the gene expression levels (Reads Per Kilobase per Million mapped reads; RPKM) of annotation genes were estimated using Cufflinks (version 2.2.1) by providing the mouse genome annotation from GENCODE (version M4). Differential testing and log2 fold change calculations were performed using Cuffdiff (version 2.2.1), with the implementation of two biological replicates.
We analyzed the ChIP-seq data (TRIM33, SMAD2/3, SMAD4, FOXH1) as previously described21,44. Briefly, the data were trimmed using TrimGalore (version 0.6.1) and then aligned to mm10 using Bowtie2 (version 2.3.3). We chose MACS2 (version 2.1.4) to call the peaks using the “broad peaks” setting for SMAD2/3, SAMD4, TRIM33, and FOXH1. We subsequently generated bigwig files from the bam files using the Coverage function in deepTools (version 3.4.4). For visualization purposes, we normalized the data to 1X genome coverage (mm10). Representative track diagrams were generated using the Integrated Genomics Viewer software (version 2.8.13).
Subcellular fractionation assay
For RNA subcellular fractionation, EBs and monolayer differentiation samples were harvested. Nucleus and cytoplasm RNA fractions were extracted as previously described85. Cells were harvested and washed by 10 mL ice-cold PBS, then spun at 700 g for 3 min. The cell pellets were resuspended in 200 μL cold cytoplasmic lysis buffer (0.15 NP-40, 10 mM Tris pH 7.5, 150 mM NaCl), and incubated on ice for 5 min. The lysates were transferred onto 500 μL ice-cold sucrose buffer (10 mM Tris pH 7.5, 150 mM NaCl, 24% sucrose), spun at 12,000 g for 10 min. The supernatants were collected as the cytoplasmic fraction. The pellets were collected and used for nuclear RNA extraction. Ratios of interesting genes in different subcellular fractions were calculated by normalizing the expression levels of quantitative RT-PCR to the relative volume of each fraction used for RNA analysis. For protein subcellular fractionation, we used a previously published protocol (https://bio-protocol.org/e754).
Immunofluorescence
For the attached cells, samples were fixed in 4% PFA in PBS for 30 min at room temperature. The cells were permeabilized with 0.5% Triton X-100 in PBS (PBST) for 10 min at room temperature and blocked in 1% BSA in PBS for 1 h at room temperature. Then, the samples were incubated with primary antibodies in 1% BSA in PBST overnight at 4 oC. The next day, the coverslips were incubated with DyLight594-Goat Anti-Mouse IgG (H + L) (Huaxingbio,1:1000) for 1 h at room temperature. Then, samples were incubated with Hoechst 33342 (Thermo Fisher Scientific) for 5 min at room temperature followed by three PBS washes.
For EBs, EBs at day 3.5 were fixed in 4% PFA in PBS for 30 min at room temperature, incubated in a solution of 30% sucrose in PBS for 1–2 days, and processed for OCT embedding. EB sections were permeabilized for 30 min in 0.5% PBST, followed by 1% BSA in PBST for blocking. Then, samples were incubated overnight with the primary antibody for FOXA2 (Cell Signaling Technology, 8186) or T/Brachyury (Abcam, 209665) at a dilution of 1:200 at 4 oC, washed three times with PBS, followed by a 1 h incubation with IF 488-Goat Anti-Rabbit IgG (H + L) (Huaxingbio) or DyLight 488-Donkey Anti-Goat IgG (H + L) (Huaxingbio) at a dilution of 1:1000 in 1% BSA in PBST. Finally, samples were incubated with Hoechst 33342 for 5 min in room temperature followed by PBS washing three times.
For E7.5 embryo wholemount immunostaining, after dissection of E7.5 embryos, embryos were imaged wholemount using a Nikon fluorescence stereomicroscope to assess the level of chimerism based on GFP fluorescence prior to fixation. Embryos were fixed for 15 min at room temperature in 4% PFA (Electron Microscopy Sciences) then washed with PBS with 0.1% Triton X-100. Embryos were permeabilized with 0.5% PBST for 30 min at room temperature and blocked overnight at 4 °C in blocking buffer (0.1% PBST) with 5% horse serum (Solarbio) and 1% BSA. The next day, embryos were transferred to primary antibodies diluted in blocking buffer and incubated overnight at 4 °C. Primary antibodies were used at the following dilutions: T/Brachyury (1:200) and FOXA2 (1:200). The next day, embryos were washed twice in PBS with 0.1% Triton X-100 for 10 min at room temperature and blocked for approximately 6 h in blocking buffer followed by overnight incubation in blocking buffer with secondary antibodies (1:500) at 4 °C. Embryos were then washed three times for 10 min in PBS with 0.1% Triton X-100 at room temperature. The final wash contained 5 µg/mL Hoechst 33342.
For imaging, embryos were positioned in glass-bottom dishes (NEST) in PBS and imaged using a Nikon A1R HD25 confocal microscope equipped with 405 nm, 488 nm, and 594 nm solid lasers and analyzed by NIS-Elements Viewer 4.11.0 and Image J V1.50e. A 60x/100x oil immersion objective lens was used for captured images.
RNA fluorescence in situ hybridization (RNA FISH)
mESCs were set up to differentiation for 3 days and grown on coverslips in 12-well plates, briefly washed with PBS, and fixed with PBS/3.7% formaldehyde at room temperature for 10 min. Following fixation, cells were washed twice with PBS. The cells were then permeabilized in 70% ethanol for at least 1 h at 4 °C. Stored cells were briefly rehydrated with Wash Buffer (2 × SSC, 10% formamide) (Biosearch) before FISH. The Stellaris FISH Probes (Gm11549, Gapdh, and Malat1) were added to the hybridization buffer (2 × SSC, 10% formamide, 10% dextran sulfate) (Biosearch) at a final concentration of 250 nM. Hybridization was carried out in a humidified chamber at 37 °C overnight. The following day, the cells were washed twice with Wash Buffer at 37 °C for 30 min each. The third wash contained 5 µg/mL Hoechst33342 for nuclear staining.
Extracellular acidification rate (ECAR) and oxygen consumption rate (OCR)
ECAR and OCR were analyzed on an XF96 Extracellular Flux Analyzer (Agilent) according to the manufacturer’s recommendations. For Seahorse flux analysis, cells were plated in XF 96-well microplates (Agilent). After 24 h, the medium was completely replaced with XF Base medium (Aligent) (ECAR: XF Base Medium containing 4 mM L-glutamine. OCR: XF Base Medium containing 25 mM glucose (Sigma), 4 mM L-glutamine and 2 mM sodium pyruvate). Reagents were added at the following final concentrations: for ECAR measurement, glucose (25 mM), oligomycin (2 µM) (Abcam), 2-DG (100 mM). For OCR measurement, FCCP (1 or 2 µM) (Sigma), rotenone (1 µM) (Abcam), antimycin A (1 µM) (Sigma). Measured values were normalized to total protein amount quantified by a BCA quantification kit (Huaxingbio).
Glucose uptake measurement
Glucose consumption and lactate production were measured by LC-MS/MS. Cells were cultured in the regular medium. After 12 h, the medium was collected and added to 4 volumes of 100% methanol. Metabolites were extracted and analyzed by LC-MS/MS. Ultimate 3000 UHPLC (Dionex) coupled with HF orbitrap (Thermofisher) was used to perform LC separation. In negative mode, BEH Amide column (2.1 × 100 mm, Waters) was applied for separation with column temperature at 50 °C. The gradient is generated with flow rate at 400 μL/min as follows: 0 min, 2%B; 1 min, 2% B; 5.5 min, 10% B; 6.5 min, 80% B; 8.3 min, 2% B; 10.0 min, 2% B. Mobile phase A is prepared by dissolving 0.58 g of ammonium acetate in 50 mL of HPLC-grade water, then adding 950 mL of HPLC-grade acetonitrile. Adjust pH to 9.0 with ammonium hydroxide solution. Mobile phase B is prepared by dissolving 0.58 g of ammonium acetate in 500 mL HPLC-grade water, subsequently, adding 500 mL HPLC-grade acetonitrile and adjusting pH to 9.0 with ammonium hydroxide solution. Data with mass ranges of m/z 80–300 was acquired at negative ion mode with data-dependent MSMS acquisition. The full scan and fragment spectra were collected with resolution of 60,000 and 30,000 respectively. The detailed mass spectrometer parameters are as follows: spray voltage, 3.0 kV for negative; capillary temperature, 320 °C; heater temperature: 300 °C; sheath gas flow rate: 35; auxiliary gas flow rate:10. Metabolite identification was based on Tracefinder (Thermofisher) search with home-built database.
Also, glucose uptake measurement was performed using the Glucose Uptake-GloTM Assay (Promega) according to the manufacturer’s directions. In brief, the cell culture medium was removed, and plates were incubated in 1 mM of 2-DG in PBS for 10 min at room temperature. Following cell lysis and neutralization, samples were incubated in the 2DG6P detection reagent for 1 h and analyzed with a luminometer.
Metabolites extraction and metabolomic analysis
Day 3 EBs were collected in a 15 mL tube and washed gently with PBS buffer 3 times. The tubes were placed on dry ice and 2 mL of 80% (vol/vol) methanol (pre-chilled to −80 °C) was added. The tubes were centrifuged at 14,000 g for 20 min at 4−8 °C and the metabolite-containing supernatant was transferred to a new 15 mL tube on dry ice. A Speedvac (Thermo Fisher Scientific) was used to dry the pellet using no heat (room temperature). Targeted metabolomics was implemented with a TSQ Quantiva Triple Quadrupole mass spectrometer (Thermo Fisher Scientific). Mobile phase A was prepared by using 10 mM tributylamine, 15 mM acetate in HPLC-grade water, Mobile phase B was HPLC-grade 100% methanol. This analysis focused on TCA cycle, glycolysis pathway, pentose phosphate pathway, amino acids, and purine metabolism. In this experiment, we used a 25-minute gradient from 5% to 90% mobile B. Positive-negative ion switching mode was performed for data acquisition. Cycle time was set as 1 s and a total of 138 ion pairs were included. The resolution for Q1 and Q3 are both 0.7 FWHM. The source voltage was 3500 v for positive and 2500 v for negative ion mode. The source parameters are as follows: capillary temperature: 320 °C; heater temperature: 300 °C; sheath gas flow rate: 35; auxiliary gas flow rate: 10. Tracefinder 3.2 (Thermo Fisher Scientific) was applied for metabolite identification and peak integration.
For multivariate model analysis, between-group differences in metabolite abundance were assessed by Welch’s two-sample t-test. Hierarchical clustering (HCL) and unsupervised principal components analysis (PCA) were implemented to assess data quality and detect sample outliers. Raw data from the metabolic analysis were normalized to the median of each sample, respectively. Missing values were assumed to be below the limit of detection and were imputed with half of the minimum of the whole dataset. The heatmap of metabolites in glycolysis, TCA cycle, and pentpentose phosphate pathway were constructed in RStudio (V1.2.1335).
Bimolecular fluorescence complementation (BiFC)
BiFC assays were performed as described previously86. Venus fluorescence protein was selected as the reporter for complementation. Mouse NEMEP including full length, ΔN, ΔC, and ΔTMD coding sequences were cloned into NheI- and XbaI- digested pcDNA3.1 vector with the amino- (YN, amino acid 1–173) termini of Venus. And the mouse GLUT1/3 coding sequences were cloned into NheI- and XbaI- digested pcDNA3.1 vector with the carboxyl- (YC, amino acids 155–239). HEK293T cells with co-transfection of YC and YN for 48 h were analyzed for fluorescence by FACS and FlowJo v7.6. For flow-based sorting of the GFP positive cells, parental cells without plasmid transfection were used to define negative cell populations and set gates for analysis. For flow studies of BiFC, we use the same cells without plasmid transfection for defining negative cell populations and set gates for analysis.
Quantitative reverse transcription-PCR analysis
RNA extraction, reverse transcription, and quantitative real-time polymerase chain reaction (qPCR) were performed as described19. The sequence of primers used in qPCR analysis is listed in Supplementary Table 6.
Immunoprecipitation and western blot
Whole cell lysates were prepared from exponentially-growing cells. After centrifugation at 3000 g for 5 min, cell pellets were lysed in whole cell lysis (WCL) buffer (10 mM Tris-HCl pH8.0, 150 mM NaCl, 1% Triton X-100, 5 mM EDTA) supplemented with protease inhibitors cocktail and phosphatase inhibitor cocktail (Biotool). The lysates were incubated on ice for 30 min and vortexed in 5 min intervals, at finally the lysates were cleared by centrifugation at 12,000 g for 10 min at 4 °C. The soluble material was collected and incubated with Anti-Flag Affinity Gel (Biotool) or GFP-Trap Agarose (ChromoTek) for 3 h. Beads were then washed 4 times with WCL buffer, captured protein complexes were boiled in SDS sample buffer and analyzed by SDS-PAGE followed by immunoblotting. The blots were blocked with 5% non-fat powdered milk in TBST (0.1% Tween-20 in TBS) at room temperature for 1 h and incubated with primary antibodies diluted in blocking buffer overnight at 4 °C. Membranes were incubated with the appropriate secondary antibodies conjugated with horseradish peroxidase for 2 h at room temperature. Enhanced chemiluminescence system (GE healthcare, RPN2108) was used for Western blot detection. The primary antibodies used in this paper targeted the following proteins: FLAG (Sigma-Aldrich, F3165, 1:3000), GAPDH (ZSGB-BIO, TA-08, 1:5000), GFP (EASYBIO, BE2001, 1:2000), Histone H3 (Abcam, ab10799, 1:10000),SMAD2/3 (CST, 8685, 1:1000), SMAD4 (Santa Cruz, sc-7966, 1:1000), CDH1 (CST, 3195, 24E10, 1:1000), GLUT1 (Beyotime, AF1015, 1:1000), GLUT3 (Abclonal, A4137, 1:1000), HA (Millipore, 04–902, 1:2000), Phospho-SMAD2/3 (CST, 8828, 1:1000), SMAD2 (CST, 5339, 1:3000), TRIM33 (Bethyl Laboratories, A301–060A, 1:1000).
Ribosome profiling
2.5 × 105 mESCs were seeded on 15-cm dish and cultured without LIF for 3 days. Cells were then treated with cycloheximide at a final concentration of 100 µg/mL for 10 min in the 37 °C 5% CO2 incubator. Aspirate the media, wash the plates with ice-cold PBS containing 100 µg/mLcycloheximide (PBS/CHX). Then scrap and collect the cells with ice-cold PBS/CHX into 15 mL tubes. The cells were centrifuged at 500 g for 5 min at 4 °C. Then pellet was subsequently lysed in the polysome lysis buffer (10 mM HEPES-pH 7.4, 5 mM MgCl2, 150 mM KCl, 1% NP40, 100 μg/ml cycloheximide, 1 mM DTT, Rnase inhibitor (200 u/ml) and Protease inhibitor) and incubated on ice for 10 min. Lysates were centrifuged at 13,000 g for 10 min at 4 °C, and supernatant was carefully transferred to a new 1.5 mL tube. Prepare a 15–45% linear sucrose gradient (sucrose buffer containing 10 mM HEPES-pH7.4, 5 mM MgCl2, 150 mM KCl, 100 μg/mL cycloheximide, 1 mM DTT, and sucrose.) using a Gradient Master (Gradient108, Biocomp company). Lysates were loaded on a 15–45% linear sucrose gradient and centrifuged at 40,000 g for 2 h at 4 °C in a Beckman SW41 rotor and subsequently fractionated using a fraction collector (Gradient108, Biocomp company). Polysome profiles were measured using an absorbance detector connected to the fraction collector and measuring absorbance at 260 nm. RNA was extracted with Trizol. For RNA analysis from polysome fractions, qPCR were done as described19. The quantity of each fraction was first determined using a standard curve, and then expression was normalized to RNA contained in fraction 1.
Quantification and statistical analysis
The statistical tests used in this study are indicated in the respective figure legends. In general, data with two groups were analyzed by Student unpaired t-test to determine statistically significant effects. Data with multiple groups were analyzed by one-way, two-way ANOVA to determine statistically significant effects.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We thank Drs. Jianbin Wang, Jie Na, Suneng Fu, Peng Jiang, Ye-Guang Chen, Qinghua Tao, Wei Wu, and Nieng Yan for reagents, and helpful discussions. We also thanks Technology Center for Protein Sciences in Tsinghua University: Xianbin Meng, Chongchong Zhao, Xiaolin Tian, and Dr. Haiteng Deng at Protein Chemistry Facility for mass spectrometry analysis; Xueying Wang, Yupei Jiao and Dr. Xiaohui Liu at Facility Center of Metabolomics and Lipidomics for targeted metabolomic experiment analysis and LC-MS/MS analysis; Yanli Zhang and Yalan Chen at Imaging Core Facility for assistance of using Nikon A1R HD25. We would like to thank Pengcheng Jiao and Jiaojiao Ji at Center of Biomedical Analysis for flow-cytometry analysis; Jing Zhang and Dr. Zai Chang from Laboratory Animal Research Center for generating chimera embryos in Tsinghua University. We thank Ping Wu for establishing CRISPRi and CRISPRa cell lines. The work was supported by National Natural Science Foundation of China (NSFC:91540108) and Tsinghua-Peking Center for Life Sciences.
Source data
Author contributions
H.F. and X.K. established most cell lines. H.F., X.K., and N.W. performed cell differentiation assays and qPCR. K.K. supervised N.W. for cell differentiation assays. X.K. performed the RACE assay. T.W. performed ribosome profiling. and H.F. and T.W. performed Seahorse Extracellular Flux Analysis and glucose uptake experiment. H.F. prepared RNA-seq and metabolomics samples. K.Y., J.C., and Y.Y. analyzed the RNA-seq and ChIP-seq data under the supervision of Q.X. and Z.L. Y.Y. helped to perform affinity purification assay. H.F. performed affinity purification assay, chimera embryos dissection, immunostaining, immunoblotting, and co-IP experiment. Q.X. supervised the study. H.F. and Q.X. wrote the manuscript; all authors contributed to data interpretation and discussion.
Peer review
Peer review information
Nature Communications thanks Yifeng Zhou and the other anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Data availability
The RNA-seq data used in this study are available in the NCBI’s Gene Expression Omnibus database under accession code GSE157073, GSE115169, GSE70486. The ChIP-seq data used in this study are available in the NCBI’s Gene Expression Omnibus database under accession code GSE125116. The mass spectrometry data generated in this study are provided in Supplementary Data 1. Source data are provided with this paper.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-022-31762-x.
References
- 1.Zhang J, Nuebel E, Daley GQ, Koehler CM, Teitell MA. Metabolic regulation in pluripotent stem cells during reprogramming and self-renewal. Cell Stem Cell. 2012;11:589–595. doi: 10.1016/j.stem.2012.10.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Oginuma, M. et al. Intracellular pH controls WNT downstream of glycolysis in amniote embryos. Nature584, 98–101 (2020). [DOI] [PMC free article] [PubMed]
- 3.Cliff TS, et al. MYC controls human pluripotent stem cell fate decisions through regulation of metabolic flux. Cell Stem Cell. 2017;21:502–516.e509. doi: 10.1016/j.stem.2017.08.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Zhang J, et al. Metabolism in pluripotent stem cells and early mammalian development. Cell Metab. 2018;27:332–338. doi: 10.1016/j.cmet.2018.01.008. [DOI] [PubMed] [Google Scholar]
- 5.Kelly A, West JD. Genetic evidence that glycolysis is necessary for gastrulation in the mouse. Dev. Dyn. 1996;207:300–308. doi: 10.1002/(SICI)1097-0177(199611)207:3<300::AID-AJA7>3.0.CO;2-L. [DOI] [PubMed] [Google Scholar]
- 6.Wang D, et al. A mouse model for Glut-1 haploinsufficiency. Hum. Mol. Genet. 2006;15:1169–1179. doi: 10.1093/hmg/ddl032. [DOI] [PubMed] [Google Scholar]
- 7.Schmidt S, et al. Essential role of glucose transporter GLUT3 for post-implantation embryonic development. J. Endocrinol. 2009;200:23–33. doi: 10.1677/JOE-08-0262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Pantaleon M, Kaye PL. Glucose transporters in preimplantation development. Rev. Reprod. 1998;3:77–81. doi: 10.1530/ror.0.0030077. [DOI] [PubMed] [Google Scholar]
- 9.Leppens-Luisier G, Urner F, Sakkas D. Facilitated glucose transporters play a crucial role throughout mouse preimplantation embryo development. Hum. Reprod. 2001;16:1229–1236. doi: 10.1093/humrep/16.6.1229. [DOI] [PubMed] [Google Scholar]
- 10.Arhin SK, et al. Multiple facilitated glucose transporters SLC2As are required for normal mouse preimplantation embryo development. Am. J. Transl. Res. 2019;11:3412–3425. [PMC free article] [PubMed] [Google Scholar]
- 11.Arnold SJ, Robertson EJ. Making a commitment: cell lineage allocation and axis patterning in the early mouse embryo. Nat. Rev. Mol. Cell Biol. 2009;10:91–103. doi: 10.1038/nrm2618. [DOI] [PubMed] [Google Scholar]
- 12.Derynck R, Akhurst RJ. Differentiation plasticity regulated by TGF-beta family proteins in development and disease. Nat. Cell Biol. 2007;9:1000–1004. doi: 10.1038/ncb434. [DOI] [PubMed] [Google Scholar]
- 13.van der Kraan PM. The changing role of TGFbeta in healthy, ageing and osteoarthritic joints. Nat. Rev. Rheumatol. 2017;13:155–163. doi: 10.1038/nrrheum.2016.219. [DOI] [PubMed] [Google Scholar]
- 14.Batlle E, Massague J. Transforming growth factor-beta signaling in immunity and cancer. Immunity. 2019;50:924–940. doi: 10.1016/j.immuni.2019.03.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zhu J, Thompson CB. Metabolic regulation of cell growth and proliferation. Nat. Rev. Mol. Cell Biol. 2019;20:436–450. doi: 10.1038/s41580-019-0123-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Pouponnot C, Jayaraman L, Massague J. Physical and functional interaction of SMADs and p300/CBP. J. Biol. Chem. 1998;273:22865–22868. doi: 10.1074/jbc.273.36.22865. [DOI] [PubMed] [Google Scholar]
- 17.Feng XH, Zhang Y, Wu RY, Derynck R. The tumor suppressor Smad4/DPC4 and transcriptional adaptor CBP/p300 are coactivators for smad3 in TGF-beta-induced transcriptional activation. Genes Dev. 1998;12:2153–2163. doi: 10.1101/gad.12.14.2153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Wotton D, Lo RS, Lee S, Massague J. A Smad transcriptional corepressor. Cell. 1999;97:29–39. doi: 10.1016/S0092-8674(00)80712-6. [DOI] [PubMed] [Google Scholar]
- 19.Xi Q, He W, Zhang XH, Le HV, Massague J. Genome-wide impact of the BRG1 SWI/SNF chromatin remodeler on the transforming growth factor-beta transcriptional program. J. Biol. Chem. 2008;283:1146–1155. doi: 10.1074/jbc.M707479200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Massague J. TGFbeta signalling in context. Nat. Rev. Mol. Cell Biol. 2012;13:616–630. doi: 10.1038/nrm3434. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Luo M, et al. H3K18ac primes mesendodermal differentiation upon nodal signaling. Stem Cell Rep. 2019;13:642–656. doi: 10.1016/j.stemcr.2019.08.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Brennan J, et al. Nodal signalling in the epiblast patterns the early mouse embryo. Nature. 2001;411:965–969. doi: 10.1038/35082103. [DOI] [PubMed] [Google Scholar]
- 23.Gaarenstroom T, Hill CS. TGF-beta signaling to chromatin: How Smads regulate transcription during self-renewal and differentiation. Semin Cell Dev. Biol. 2014;32:107–118. doi: 10.1016/j.semcdb.2014.01.009. [DOI] [PubMed] [Google Scholar]
- 24.David CJ, Massague J. Contextual determinants of TGFbeta action in development, immunity and cancer. Nat. Rev. Mol. Cell Biol. 2018;19:419–435. doi: 10.1038/s41580-018-0007-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Xi Q, et al. A poised chromatin platform for TGF-beta access to master regulators. Cell. 2011;147:1511–1524. doi: 10.1016/j.cell.2011.11.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kim SW, et al. Chromatin and transcriptional signatures for Nodal signaling during endoderm formation in hESCs. Dev. Biol. 2011;357:492–504. doi: 10.1016/j.ydbio.2011.06.009. [DOI] [PubMed] [Google Scholar]
- 27.Charney RM, et al. Foxh1 occupies cis-regulatory modules prior to dynamic transcription factor interactions controlling the mesendoderm gene program. Dev. Cell. 2017;40:595. doi: 10.1016/j.devcel.2017.02.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Wilusz JE, Sunwoo H, Spector DL. Long noncoding RNAs: functional surprises from the RNA world. Genes Dev. 2009;23:1494–1504. doi: 10.1101/gad.1800909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Quinn JJ, Chang HY. Unique features of long non-coding RNA biogenesis and function. Nat. Rev. Genet. 2016;17:47–62. doi: 10.1038/nrg.2015.10. [DOI] [PubMed] [Google Scholar]
- 30.Ransohoff JD, Wei Y, Khavari PA. The functions and unique features of long intergenic non-coding RNA. Nat. Rev. Mol. Cell Biol. 2018;19:143–157. doi: 10.1038/nrm.2017.104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Wahlestedt C. Targeting long non-coding RNA to therapeutically upregulate gene expression. Nat. Rev. Drug Discov. 2013;12:433–446. doi: 10.1038/nrd4018. [DOI] [PubMed] [Google Scholar]
- 32.Fatica A, Bozzoni I. Long non-coding RNAs: new players in cell differentiation and development. Nat. Rev. Genet. 2014;15:7–21. doi: 10.1038/nrg3606. [DOI] [PubMed] [Google Scholar]
- 33.Huarte M. The emerging role of lncRNAs in cancer. Nat. Med. 2015;21:1253–1261. doi: 10.1038/nm.3981. [DOI] [PubMed] [Google Scholar]
- 34.Daneshvar K, et al. DIGIT is a conserved long noncoding RNA that regulates GSC expression to control definitive endoderm differentiation of embryonic stem cells. Cell Rep. 2016;17:353–365. doi: 10.1016/j.celrep.2016.09.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Chen YG, Satpathy AT, Chang HY. Gene regulation in the immune system by long noncoding RNAs. Nat. Immunol. 2017;18:962–972. doi: 10.1038/ni.3771. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Kopp F, Mendell JT. Functional classification and experimental dissection of long noncoding RNAs. Cell. 2018;172:393–407. doi: 10.1016/j.cell.2018.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Makarewich CA, Olson EN. Mining for micropeptides. Trends Cell Biol. 2017;27:685–696. doi: 10.1016/j.tcb.2017.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Nichols J, et al. Complementary tissue-specific expression of LIF and LIF-receptor mRNAs in early mouse embryogenesis. Mech. Dev. 1996;57:123–131. doi: 10.1016/0925-4773(96)00531-X. [DOI] [PubMed] [Google Scholar]
- 39.Daheron L, et al. LIF/STAT3 signaling fails to maintain self-renewal of human embryonic stem cells. Stem Cells. 2004;22:770–778. doi: 10.1634/stemcells.22-5-770. [DOI] [PubMed] [Google Scholar]
- 40.Niwa H, Ogawa K, Shimosato D, Adachi K. A parallel circuit of LIF signalling pathways maintains pluripotency of mouse ES cells. Nature. 2009;460:118–122. doi: 10.1038/nature08113. [DOI] [PubMed] [Google Scholar]
- 41.Murry CE, Keller G. Differentiation of embryonic stem cells to clinically relevant populations: lessons from embryonic development. Cell. 2008;132:661–680. doi: 10.1016/j.cell.2008.02.008. [DOI] [PubMed] [Google Scholar]
- 42.James D, Levine AJ, Besser D, Hemmati-Brivanlou A. TGFbeta/activin/nodal signaling is necessary for the maintenance of pluripotency in human embryonic stem cells. Development. 2005;132:1273–1282. doi: 10.1242/dev.01706. [DOI] [PubMed] [Google Scholar]
- 43.Yeo C, Whitman M. Nodal signals to Smads through Cripto-dependent and Cripto-independent mechanisms. Mol. Cell. 2001;7:949–957. doi: 10.1016/S1097-2765(01)00249-0. [DOI] [PubMed] [Google Scholar]
- 44.Wang Q, et al. The p53 family coordinates wnt and nodal inputs in mesendodermal differentiation of embryonic stem cells. Cell Stem Cell. 2017;20:70–86. doi: 10.1016/j.stem.2016.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Inman GJ, et al. SB-431542 is a potent and specific inhibitor of transforming growth factor-beta superfamily type I activin receptor-like kinase (ALK) receptors ALK4, ALK5, and ALK7. Mol. Pharm. 2002;62:65–74. doi: 10.1124/mol.62.1.65. [DOI] [PubMed] [Google Scholar]
- 46.Peng G, et al. Molecular architecture of lineage allocation and tissue organization in early mouse embryo. Nature. 2019;572:528–532. doi: 10.1038/s41586-019-1469-8. [DOI] [PubMed] [Google Scholar]
- 47.Peng G, et al. Spatial transcriptome for the molecular annotation of lineage fates and cell identity in mid-gastrula mouse embryo. Dev. Cell. 2016;36:681–697. doi: 10.1016/j.devcel.2016.02.020. [DOI] [PubMed] [Google Scholar]
- 48.Tam PP, Rossant J. Mouse embryonic chimeras: tools for studying mammalian development. Development. 2003;130:6155–6163. doi: 10.1242/dev.00893. [DOI] [PubMed] [Google Scholar]
- 49.Deng D, Yan N. GLUT, SGLT, and SWEET: Structural and mechanistic investigations of the glucose transporters. Protein Sci. 2016;25:546–558. doi: 10.1002/pro.2858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Bae SS, Cho H, Mu J, Birnbaum MJ. Isoform-specific regulation of insulin-dependent glucose uptake by Akt/protein kinase B. J. Biol. Chem. 2003;278:49530–49536. doi: 10.1074/jbc.M306782200. [DOI] [PubMed] [Google Scholar]
- 51.Kohn AD, Summers SA, Birnbaum MJ, Roth RA. Expression of a constitutively active Akt Ser/Thr kinase in 3T3-L1 adipocytes stimulates glucose uptake and glucose transporter 4 translocation. J. Biol. Chem. 1996;271:31372–31378. doi: 10.1074/jbc.271.49.31372. [DOI] [PubMed] [Google Scholar]
- 52.Wu L, Derynck R. Essential role of TGF-beta signaling in glucose-induced cell hypertrophy. Dev. Cell. 2009;17:35–48. doi: 10.1016/j.devcel.2009.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Kitagawa T, Masumi A, Akamatsu Y. Transforming growth factor-beta 1 stimulates glucose uptake and the expression of glucose transporter mRNA in quiescent Swiss mouse 3T3 cells. J. Biol. Chem. 1991;266:18066–18071. doi: 10.1016/S0021-9258(18)55237-6. [DOI] [PubMed] [Google Scholar]
- 54.Inoki K, Haneda M, Maeda S, Koya D, Kikkawa R. TGF-β1 stimulates glucose uptake by enhancing GLUT1 expression in mesangial cells. Kidney Int. 1999;55:1704–1712. doi: 10.1046/j.1523-1755.1999.00438.x. [DOI] [PubMed] [Google Scholar]
- 55.Hua W, Ten Dijke P, Kostidis S, Giera M, Hornsveld M. TGFbeta-induced metabolic reprogramming during epithelial-to-mesenchymal transition in cancer. Cell Mol. Life Sci. 2020;77:2103–2123. doi: 10.1007/s00018-019-03398-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.D’Souza RC, et al. Time-resolved dissection of early phosphoproteome and ensuing proteome changes in response to TGF-beta. Sci. Signal. 2014;7:rs5. doi: 10.1126/scisignal.2004856. [DOI] [PubMed] [Google Scholar]
- 57.Zanet J, Chanut-Delalande H, Plaza S, Payre F. Small peptides as newcomers in the control of drosophila development. Curr. Top. Dev. Biol. 2016;117:199–219. doi: 10.1016/bs.ctdb.2015.11.004. [DOI] [PubMed] [Google Scholar]
- 58.Pauli A, Valen E, Schier AF. Identifying (non-)coding RNAs and small peptides: challenges and opportunities. Bioessays. 2015;37:103–112. doi: 10.1002/bies.201400103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Pueyo JI, et al. Hemotin, a regulator of phagocytosis encoded by a small ORF and conserved across metazoans. PLoS Biol. 2016;14:e1002395. doi: 10.1371/journal.pbio.1002395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Andrews SJ, Rothnagel JA. Emerging evidence for functional peptides encoded by short open reading frames. Nat. Rev. Genet. 2014;15:193–204. doi: 10.1038/nrg3520. [DOI] [PubMed] [Google Scholar]
- 61.Saghatelian A, Couso JP. Discovery and characterization of smORF-encoded bioactive polypeptides. Nat. Chem. Biol. 2015;11:909–916. doi: 10.1038/nchembio.1964. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.van Heesch S, et al. The translational landscape of the human heart. Cell. 2019;178:242–260.e229. doi: 10.1016/j.cell.2019.05.010. [DOI] [PubMed] [Google Scholar]
- 63.Sousa ME, Farkas MH. Micropeptide. PLoS Genet. 2018;14:e1007764. doi: 10.1371/journal.pgen.1007764. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Plaza S, Menschaert G, Payre F. In search of lost small peptides. Annu Rev. Cell Dev. Biol. 2017;33:391–416. doi: 10.1146/annurev-cellbio-100616-060516. [DOI] [PubMed] [Google Scholar]
- 65.Anderson DM, et al. A micropeptide encoded by a putative long noncoding RNA regulates muscle performance. Cell. 2015;160:595–606. doi: 10.1016/j.cell.2015.01.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Matsumoto A, et al. mTORC1 and muscle regeneration are regulated by the LINC00961-encoded SPAR polypeptide. Nature. 2017;541:228–232. doi: 10.1038/nature21034. [DOI] [PubMed] [Google Scholar]
- 67.D’Lima NG, et al. A human microprotein that interacts with the mRNA decapping complex. Nat. Chem. Biol. 2017;13:174–180. doi: 10.1038/nchembio.2249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Jackson R, et al. The translation of non-canonical open reading frames controls mucosal immunity. Nature. 2018;564:434–438. doi: 10.1038/s41586-018-0794-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Stein CS, et al. Mitoregulin: A lncRNA-encoded microprotein that supports mitochondrial supercomplexes and respiratory efficiency. Cell Rep. 2018;23:3710–3720.e3718. doi: 10.1016/j.celrep.2018.06.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Makarewich CA, et al. MOXI is a mitochondrial micropeptide that enhances fatty acid beta-oxidation. Cell Rep. 2018;23:3701–3709. doi: 10.1016/j.celrep.2018.05.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Bhatta A, et al. A mitochondrial micropeptide is required for activation of the Nlrp3 inflammasome. J. Immunol. 2020;204:428–437. doi: 10.4049/jimmunol.1900791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Moussaieff A, et al. Glycolysis-mediated changes in acetyl-CoA and histone acetylation control the early differentiation of embryonic stem cells. Cell Metab. 2015;21:392–402. doi: 10.1016/j.cmet.2015.02.002. [DOI] [PubMed] [Google Scholar]
- 73.Martyn I, Kanno TY, Ruzo A, Siggia ED, Brivanlou AH. Self-organization of a human organizer by combined Wnt and Nodal signalling. Nature. 2018;558:132–135. doi: 10.1038/s41586-018-0150-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Lawson KA, Pedersen RA. Cell fate, morphogenetic movement and population kinetics of embryonic endoderm at the time of germ layer formation in the mouse. Development. 1987;101:627–652. doi: 10.1242/dev.101.3.627. [DOI] [PubMed] [Google Scholar]
- 75.Mergenthaler P, Lindauer U, Dienel GA, Meisel A. Sugar for the brain: the role of glucose in physiological and pathological brain function. Trends Neurosci. 2013;36:587–597. doi: 10.1016/j.tins.2013.07.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Cahill GF., Jr. Fuel metabolism in starvation. Annu Rev. Nutr. 2006;26:1–22. doi: 10.1146/annurev.nutr.26.061505.111258. [DOI] [PubMed] [Google Scholar]
- 77.Chu GC, Dunn NR, Anderson DC, Oxburgh L, Robertson EJ. Differential requirements for Smad4 in TGFbeta-dependent patterning of the early mouse embryo. Development. 2004;131:3501–3512. doi: 10.1242/dev.01248. [DOI] [PubMed] [Google Scholar]
- 78.Dunn NR, et al. Mice exclusively expressing the short isoform of Smad2 develop normally and are viable and fertile. Genes Dev. 2005;19:152–163. doi: 10.1101/gad.1243205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Keller GM. In vitro differentiation of embryonic stem cells. Curr. Opin. Cell Biol. 1995;7:862–869. doi: 10.1016/0955-0674(95)80071-9. [DOI] [PubMed] [Google Scholar]
- 80.Bain G, Kitchens D, Yao M, Huettner JE, Gottlieb DI. Embryonic stem cells express neuronal properties in vitro. Dev. Biol. 1995;168:342–357. doi: 10.1006/dbio.1995.1085. [DOI] [PubMed] [Google Scholar]
- 81.Ying QL, Stavridis M, Griffiths D, Li M, Smith A. Conversion of embryonic stem cells into neuroectodermal precursors in adherent monoculture. Nat. Biotechnol. 2003;21:183–186. doi: 10.1038/nbt780. [DOI] [PubMed] [Google Scholar]
- 82.Ran FA, et al. Genome engineering using the CRISPR-Cas9 system. Nat. Protoc. 2013;8:2281–2308. doi: 10.1038/nprot.2013.143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Konermann S, et al. Genome-scale transcriptional activation by an engineered CRISPR-Cas9 complex. Nature. 2015;517:583–588. doi: 10.1038/nature14136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Kearns NA, et al. Cas9 effector-mediated regulation of transcription and differentiation in human pluripotent stem cells. Development. 2014;141:219–223. doi: 10.1242/dev.103341. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Yin Y, et al. Opposing roles for the lncRNA haunt and its genomic locus in regulating HOXA gene activation during embryonic stem cell differentiation. Cell Stem Cell. 2015;16:504–516. doi: 10.1016/j.stem.2015.03.007. [DOI] [PubMed] [Google Scholar]
- 86.Kerppola TK. Design and implementation of bimolecular fluorescence complementation (BiFC) assays for the visualization of protein interactions in living cells. Nat. Protoc. 2006;1:1278–1286. doi: 10.1038/nprot.2006.201. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The RNA-seq data used in this study are available in the NCBI’s Gene Expression Omnibus database under accession code GSE157073, GSE115169, GSE70486. The ChIP-seq data used in this study are available in the NCBI’s Gene Expression Omnibus database under accession code GSE125116. The mass spectrometry data generated in this study are provided in Supplementary Data 1. Source data are provided with this paper.