Skip to main content
iScience logoLink to iScience
. 2024 Feb 29;27(4):109355. doi: 10.1016/j.isci.2024.109355

Quantitative proteome dynamics across embryogenesis in a model chordate

Alexander N Frese 1,2,3, Andrea Mariossi 1,2,3, Michael S Levine 1,2,, Martin Wühr 1,2,4,∗∗
PMCID: PMC10951915  PMID: 38510129

Summary

The evolution of gene expression programs underlying the development of vertebrates remains poorly characterized. Here, we present a comprehensive proteome atlas of the model chordate Ciona, covering eight developmental stages and ∼7,000 translated genes, accompanied by a multi-omics analysis of co-evolution with the vertebrate Xenopus. Quantitative proteome comparisons argue against the widely held hourglass model, based solely on transcriptomic profiles, whereby peak conservation is observed during mid-developmental stages. Our analysis reveals maximal divergence at these stages, particularly gastrulation and neurulation. Together, our work provides a valuable resource for evaluating conservation and divergence of multi-omics profiles underlying the diversification of vertebrates.

Subject areas: Animals, Embryology, Evolutionary developmental biology, Proteomics, Transcriptomics

Graphical abstract

graphic file with name fx1.jpg

Highlights

  • Resource of absolute concentration for ∼6,000 proteins in the Ciona egg

  • Comprehensive quantitative analysis of ∼7,000 proteins during Ciona development

  • Embryonic protein dynamics are evolutionarily more conserved than those of mRNA

  • Cross-species protein dynamic comparison supports an inverse hourglass model


Animals; Embryology; Evolutionary developmental biology; Proteomics; Transcriptomics

Introduction

Embryonic development progresses through a series of cellular states, each defined by distinct changes in mRNA and protein levels. Optimal cellular functionality depends on precise control of gene expression and correct protein concentrations.1,2,3 However, (1) accurately measuring protein concentrations and (2) understanding the mechanisms governing cellular proteostasis remain a significant challenge.

While transcriptomic studies often rely on mRNA levels to predict protein concentrations, the key determinants of cellular functionality and phenotype, numerous studies have reported weak correlations between the two, challenging their reliability as proxies for each other.4,5,6,7 This disparity is influenced by the stochastic nature of mRNA transcription, translation, and degradation and becomes particularly pronounced for dynamic cellular transitions during embryogenesis.8,9,10,11 Thus, mRNA levels are not necessarily predictive of protein concentrations, which prompts a shift toward applying more comprehensive proteome-wide analyses.

Proteomic methods provide an accurate measurement of protein abundance but have been historically limited by technical challenges.12 Recent advancements in quantitative multiplexed mass spectrometry (MS) have significantly enhanced the sensitivity and precision of these measurements, expanding our capacity to map the cellular proteome in detail.13,14,15,16,17,18 Applying these techniques to the study of vertebrate embryos still presents considerable challenges. The system’s complexity, high cell numbers, and substantial yolk content, which affects detection of moderate and low abundance proteins, have limited the coverage and scope of these analyses.4,19,20,21,22,23 Urochordates are the nearest extant relatives to vertebrates and share several morphological and genomic traits.24 In particular, Ciona has numerous experimental advantages like small size, low cell number, stereotyped cell lineages, rapid and comparatively simple development with experimental tractable embryogenesis, and a compact genome that is not complicated by the gene duplication events accompanying the advent of the vertebrates.25 Additionally, Ciona retains conservation of non-coding elements, macrosynteny, and microsynteny with chordates, making it an ideal model for studying the evolution of vertebrate developmental processes.26,27,28,29,30,31

While Ciona lacks the complex specializations and innovations characteristic of vertebrates, it has nonetheless advanced our understanding of the morphogenesis of basic chordate tissues such as the muscles, heart and notochord, as well as the evolution of key vertebrate processes such as neural crest.32,33,34,35,36,37,38,39,40 The assembly of the Ciona genome25 represented a significant landmark that enabled a variety of transcriptomics, epigenomics, and single-cell studies.41,42,43,44,45,46,47 Here, we extend these large-scale datasets through the use of quantitative proteomics methods.

The evolution of gene expression and its role in morphological innovations have been studied primarily by comparative transcriptomics.48,49,50,51 These studies point toward a 'phylotypic period’ in vertebrates, whereby gene expression is most similar across different species during mid-embryogenesis or pharyngula stage, the “hourglass” model.52 However, comparisons with non-vertebrate chordates such as tunicates and cephalochordates are not entirely consistent with the hourglass.53 This suggests potential divergent developmental pathways or an earlier onset of conservation as compared with vertebrates. For example, in amphioxus this conservation aligns with the earlier neurula stage.54 In fact, extending comparative analysis to invertebrates, suggests an inverse hourglass model with increased conservation during early and late developmental stages rather than in the middle of development.55,56,57 This model implies a bottleneck in developmental pathways, potentially influencing the emergence of species-specific traits. The effectiveness of these comparative analyses require careful consideration of phylogenetic distances, species diversity, embryonic stages, and gene sets compared.58 Several studies stress limitations of simplistic pairwise comparisons, robust testing of null hypotheses, and the challenge in balancing phylogenetic distances, which can be too short among closely related species or too extensive when the comparisons are made between vertebrates and invertebrates or across multiple phyla.51,53,58

A major limitation of the earlier studies is the reliance of transcriptome datasets to infer the dynamics of gene activities.59 Recent reports suggest significant disparities in mRNA and protein levels.4,5,6,60 In this study we re-examine similarity of embryos at various developmental stages with comparisons of both transcriptome and proteome datasets. Proteomic studies offer a novel perspective in cross-species comparisons by quantifying protein conservation patterns, which are the primary executors of most cellular functions.61

Here, we use state-of-the-art proteomics to quantify proteins in unfertilized Ciona eggs and to track proteomic changes throughout embryogenesis, revealing that the embryonic proteome accounts for at least half of the genome’s protein-coding capacity. We create a detailed genome-wide dataset that shows precise measurement of protein kinetics and their association to key developmental processes such as fertilization, maternal-to-zygotic transition (MZT), gastrulation, and the formation of larval tissues. Further, we integrated these data with corresponding transcriptome information and carried out inter-species comparisons between Ciona and Xenopus laevis, the African clawed frog. We discuss the implications of these studies with respect to the conservation and divergence of genetic activities during chordate evolution and reconsider the hourglass model of development.

Results and discussion

Adapting proteomics for the analysis of Ciona eggs and embryos

Mass spectrometry-based proteomics (MS) is a versatile tool for studying a variety of biological processes, although new model systems often require method adaptations. Key areas needing optimization include sample preparation and the reference proteome. Analyzing eggs and early embryos is often challenging due to the high yolk content. For instance, in Xenopus, yolk constitutes ∼90% of egg protein content, limiting the depth of proteomics analyzes.62 Researchers usually remove yolk through centrifugation after lysis of eggs or embryos.63,64,65 However, when we analyzed Ciona egg lysates via Coomassie-stained gels, we found no exceptionally dominant protein band (Figure S1A), allowing us to analyze Ciona samples by MS without yolk removal. Another concern in proteomics is the quality of the protein reference database. For widely used models such as humans, mice, or yeast, this is typically derived from the genome. However, the quality of the genome for non-canonical model organisms is often poor, thereby severely limiting the proteins that can be identified via MS. A better reference database can be generated based on mRNA-seq data.64,66 Accordingly, we first evaluated the quality of the latest Ciona genome by benchmarking it against a genome-free protein reference database, which we generated from available RNA-seq datasets (Figure S1B).39,67,68,69 Upon comparison, the RNA-seq based reference database clearly outperformed Uniprot70 and the previous genome annotations (KH-2013 and KY19),71 but increased peptide coverage by only 5% compared to the most recent KY21 annotation (Figures S1C and S1D).72 We decided to accept the modest decrease in identified peptides for the ease of annotation offered by the genome assembly and proceeded to use the KY21 genome as our primary reference for the remainder of this study.

Further examination of peptides identified using our genome-free database revealed mis-annotated gene coding sequences, mis-positioned intercistronic regions, and discrepancies in selenoprotein sequences present in the KY21 proteome (Figure S1E).73,74,75 We believe that our analysis is a step forward in improving the accuracy and completeness of the Ciona genome annotation and the potential of the proteome atlas to refine Ciona gene models and protein coding sequences. Collectively, our data reveals that the latest assembled Ciona genome, combined with the characteristics of its eggs and embryos, is highly suitable for proteomics studies, and supports Ciona’s potential as a valuable model system for proteomics investigation.

Absolute protein abundance measurements in the unfertilized egg

The mature egg contains an array of maternal proteins required for fertilization, transition to zygotic transcription, and the early stages of embryogenesis.76,77,78,79 Given that many of these proteins remain unidentified, incorporating a proteomic approach was the logical next step. We estimate the absolute concentrations of proteins in the unfertilized egg using MS1 precursor intensity in a deep label-free analysis.64 Altogether, we quantified the abundance of 6,102 proteins, after collapsing isoforms (Figure 1A; Table S1), thereby expanding the number of known proteins by an additional 5,058 entries compared to the previous proteomic investigation of the Ciona egg.80 Nearly 90% of identified proteins are supported by at least two peptides, and the mean sequence coverage is 21% (Figure S1F).

Figure 1.

Figure 1

Absolute proteomics of the Ciona egg

(A) Schematic of label-free proteomics utilized to determine absolute protein concentrations. Unfertilized Ciona eggs were lysed, and human proteins of known concentrations (UPS2) were added to the lysate as a reference standard. Following normalization as outlined in the materials and methods, we detect ∼195,000 peptides and estimate protein concentrations for ∼6,000 proteins.

(B) Table of selected proteins in the unfertilized egg including the top 5 most abundant and some transcription factors important to embryonic development.

(C) Histogram of all quantified proteins in the Ciona egg (gray) with superimposed kernel density estimates (KDE) of transcription factors (TFs - red) and signaling molecules (SMs - blue). Both TFs and SMs follow a distribution similar to the global egg proteome (black) but with a lower median concentration. The complete data is provided in Table S1.

(D) Stoichiometries of protein complexes. Concentrations of subunits from a shared protein complex display comparable values and show typically a statistically different distribution than the entire proteome (∗p < 0.01, two-way ANOVA with Tukey’s multiple-comparisons test).

As expected, the most abundant protein is Vitellogenin (yolk protein), followed by ATP synthase subunits, actin, and a 60S ribosomal subunit (Figure 1B).81 The analysis spans approximately eight orders of magnitude, covering 95 transcription factors (TFs) and 46 signaling molecules (SMs) (Figure 1C). The median protein concentration is 22 nM. In contrast, the median concentrations of TFs and SMs are lower, 5.4 nM and 3.5 nM, respectively. Most of them are distributed toward the lower end of the concentration curve, aligning with reports from other systems, where it has been noted that these molecules can exert significant biological effects even at low concentrations, particularly in driving dynamic cellular processes such as differentiation.82 Among the identified TFs in the egg are known maternal factors such as Gata.a, Prd-B/Prdtun2, and Zeb (also known as Zinc Finger (C2H2)-33 or Ci-ZF266).83 Among the SMs, known maternal factors include β-Catenin, Eph.a/Eph1, Eph.b/Eph2, Raf/Raf1, Tll/Tolloid, Notch, and Numb.83,84,85 The interaction of these known maternal deposits have been reported to be essential to establish the first distinct spatial domains of gene expression that launch the gene regulatory networks controlling embryogenesis.86 Alongside these molecules, the proteomic landscape is characterized by an abundance of kinases and phosphatases, common regulatory components controlling the cell cycle and proliferation. Proteins indicative of posterior end markers (PEM), which include germline determinants and positional cues for the axial development of the embryo, are conspicuous components of the maternal proteome.87,88 These findings suggest a preparatory state for fertilization and subsequent developmental cascades. Furthermore, in addition to Vitellogenin, we observe a notable enrichment of metabolic components, emphasizing the importance of energy and nutritional reserve components supplied by the egg for the early stages of development. These proteins ensure that Ciona embryos, which do not feed before metamorphosis, have the necessary resources for successful settlement.

We next asked whether different subunits within the same protein complex are found at expected stoichiometric ratios. To this end, we mapped the proteins identified in the egg to known stable complexes from the CORUM database (Figure 1D).89 We observed overall comparatively tight distributions of subunits in most macromolecular complexes, such as MCM (involved in genomic DNA replication),90 CCT (playing a significant role in protein folding in the eukaryotic cytosol),91 the HAUS complex (essential for mitotic spindle assembly),92 and Prefoldin (chaperone proteins regulating correct protein folding).93 For all the complexes for which we detect more than two subunits, the distribution is significantly different from the distribution of the entire dataset (p < 0.01, two-way ANOVA with Tukey’s multiple-comparisons test) (Figure 1D).

Altogether, the proteomics of the unfertilized egg highlights intricate networks that anticipate subsequent developmental processes such as fertilization, spatial patterning, and hatching. The consistency of values obtained for different subunits of stoichiometric protein complexes corroborates the reliability of our data, providing a robust platform for future studies.

A high-quality multi-omics atlas of Ciona development

We next measured the dynamics of protein and mRNA abundances as the egg develops into a swimming tadpole. For this relative comparison analysis we combined accurate multiplexed proteome analysis (TMTproC)14 with RNA-seq on matching samples at eight key developmental stages. These stages span early embryonic development and include the maternal/zygotic transition, gastrulation, neurulation, tail elongation, and hatching of swimming tadpoles (Figure 2A), thereby encompassing all of the important developmental processes. Moreover, the parallel sampling of both modalities facilitates a direct comparison between RNA and protein expression.

Figure 2.

Figure 2

Proteome and RNA analyses during Ciona embryogenesis

(A) Overview of the transcriptome and proteome time-course experiments. Staged embryos were collected at eight developmental stages, beginning with unfertilized egg (unfE), fertilized egg (fertE), 16-cell stage (cell-16), initial gastrula (iniG), late neurula (latN), middle tailbud II (midTII), late tailbud II (latTII), and hatching tadpole (larva). Each stage is represented by a unique color code, and abbreviation; both are kept consistent throughout the figures. Time indicates hours postfertilization (hpf).

(B) Number and overlap of identified protein-coding genes in the transcriptome and proteome datasets.

(C) Donut plot with the percentage of protein evidence categories from UniProt that are identified at the proteome level (9,419 entries). Evidence level: (1) protein evidence; (2) transcript evidence; (3) homology; (4) predicted.

(D) Histogram of Pearson correlations between RNA and corresponding protein dynamics throughout Ciona development (gray). The lines represent kernel density estimates (KDE) for all genes (black), transcription factors (red), and signaling molecules (blue). Notably, mRNA dynamics correlate poorly with protein dynamics. n = 7021 pairs.

(E) Example of high Pearson correlation between RNA and protein dynamics for the transcription factor Hox10.

(F) K-means clustering used to classify RNA (left) and protein (right) dynamics for each gene during Ciona development. The thickness of the lines scales with the number represented in each cluster, as indicated in the legend.

(G) GO term analysis used to discern the functional relevance of each of the clusters (indicated by matching colors) identified in F.

Using this framework, we detected 7,095 protein isoforms encoded by 7,057 genes (Figure 2A; Table S2), representing 38% of the protein-coding genes annotated in the latest Ciona genome assembly.72 This accounts for approximately 50% of the expressed genes captured in RNA-seq analyses (Figure 2B; Table S3). This protein number is more than 10-fold greater than that reported in an earlier study, which identified 695 proteins across three sampled stages using two-dimensional gel electrophoresis and MALDI-TOF/MS.81 Our proteome marks a significant advancement in the quality of the UniProt database, which reports experimental evidence at the protein level (PE1) for less than 1% (21 out of 17,311 records). We cover 55% of the redundant UniProt entries, of which four had prior evidence at the PE1 level. Importantly, we confirmed protein products for an additional 9,415 entries previously undocumented at the protein level, categorized under evidence levels PE2–4 (Figure 2C). The new proteome dataset significantly expands the known proteomic landscape of Ciona.

Descriptive analysis of proteomic data and RNA-seq atlas

For MS data, we applied a 1% false discovery rate with a target-decoy strategy94,95 (Figure S2A). We quantify a total of 62,471 peptides, the proteins with most identified peptides are Vitellogenin and Titin (Figure S2B). The median number of peptides per quantified protein is 5, with 84% of the proteome showing more than two peptides per protein (Figure S2B). The identified peptides correspond to 7,095 protein isoforms matching 7,057 unique proteins (Figure 1A). In 35 instances, the dataset enabled differentiation between 2 and 4 splice variants (Figure S2C). The poly(A) pulldown RNA-seq datasets cover an average of 10,727 ± 1,007 genes (mean ± s.d.), with high reproducibility of the biological replicates (Figures S3A–S3C). The number of detected genes steadily increases as development proceeds, reflecting an expanding gene expression repertoire (Figure S3D). However, post-zygotic genome activation (ZGA) at the 16-cell stage did not result in an increase in gene counts, likely due to the degradation of maternal mRNAs as previously observed in zebrafish development.96 The distribution of expression levels (transcripts per million, TPM) initially exhibited a bimodal pattern with peaks at very low and higher levels. As embryonic development proceeded, this distribution evolved into a more normal distribution (Figure S3E). These observations are consistent with the transition of bimodal distributions seen for homogeneous cell populations to a unimodal distribution for heterogeneous cell populations.97

Temporal dynamics and tissue-specific patterns in the proteome atlas

In order to extend our analysis and systematically identify proteins that may influence differentiation programs, we categorized the proteins into eight distinct clusters based on their activity at various stages (Figure S4A) and performed gene ontology (GO) enrichment analysis on each gene cluster (Table S2). Cluster 1 genes exhibited the most stable dynamics, with proteins involved in translation, RNA processing, cell division, DNA organization, ribonucleoprotein complex formation, ribosome biogenesis, and transfer RNA (tRNA) activity. These are indicative of housekeeping functions. Cluster 2 genes, most abundant in unfertilized eggs, rapidly degrade following fertilization and are enriched for mRNA processing, single fertilization proteins, and small GTPase-mediated signal transduction, aligning with spindle assembly roles post-fertilization. They also have an abundance of maternal ribosomes preparing embryos for future development. Proteins in Cluster 3, abundant in both fertilized and unfertilized eggs but rapidly degrading before MZT, are linked to cell division and protein degradation, facilitating rapid embryonic development during the first 4 h postfertilization (hpf). Notably, the Gata4 TF is an early determinant of dorsal-ventral patterning and it makes sense that it is a constituent of Cluster 3.86 Cluster 4 proteins, peaking during gastrulation and neurulation, are associated with cell division, translation elongation, embryonic organ development, and chromatin modification. This reflects the shift from maternal to zygotic production, high translational activity, cell division, and the onset of tissue differentiation. Clusters 5 to 8 exhibit a monotonous growth pattern during MZT, gastrulation, neurulation, and tailbud stages. In later stages, the focus shifts to energy generation, transport, metabolic processes, and tissue morphogenesis. These clusters are enriched with cofactors, coenzymes involved in metabolism, and actin filament organization, correlating with metabolic preparation for swimming tadpoles. Collectively, these analyses revealed proteome dynamics during development, mirroring various aspects of tissue differentiation and morphogenesis.

Next, we evaluated the utility of the proteome atlas as a tool to analyze the expression of tissue-specific marker genes, including those representing the major lineages/germ layers (Figure S4B). This revealed a series of staggered progression waves in protein expression across different tissue types. In line with existing literature,98 we observe that the onset of most tissue differentiation began with gastrulation at the 110-cell stage (epidermis, and endoderm). In the case of the notochord (Sec31b) and mesenchyme (Ci-Psl3), some markers emerge as early as the 16-cell stage, underscoring the unique aspects of Ciona embryogenesis where most cells are restricted to a single tissue fate by the start of gastrulation.99 Markers of differentiating neurons associated with the dorsal and lateral regions of the brain such as Synaptotagmin 1 (Syt),100 Cel3/4/5 (also known as Etr-1, Cel3.a),101 and Rlbp1 (also known as Cralbp)100 are also identified at relatively early stages of embryogenesis. For the muscle lineage, we observe multiple proteins expressed contemporaneously starting from the mid-tailbud II stage (Figure S4B).37 These examples highlight a developmental progression in protein expression patterns and how the proteome atlas effectively mirrors the establishment of definitive cellular phenotypes, in this case elongated muscles.

To further evaluate the utility of the proteome atlas, we explored aspects of temporal fate patterning, focusing on TFs and SMs that are critical for cell specialization during embryogenesis. The data cover approximately 40% of all annotated TFs and ∼60% of all SMs, kinases and phosphatases (Figure S4C). Principal component analysis (PCA) shows a smooth transition from one stage to the next, with the first two principal components accounting for over 80% of the proteome’s variance. A striking 'salt and pepper' pattern emerged when overlaying transcriptional regulators across the proteome’s development. The observed expression dynamics likely reflect a combination of tissue composition and protein accumulation, effectively separating early and late expression protein along a spatial developmental continuum (Figure S4D).

We also ranked protein changes across consecutive developmental stages to identify stage-specific proteins. This analysis highlights significant changes in protein abundance at three key stages: post-fertilization, the maternal-to-zygotic transition (MZT), and the onset of metamorphosis. Post-fertilization, the egg’s proteome exhibits substantial alterations of proteins involved in calcium signaling, mitochondrial function, and translation. The MZT phase shows a surge in proteins related to organogenesis. As swimming tadpoles transition toward metamorphosis there is an increase in proteins associated with tail reabsorption. Examples include the TF Hox10102 (Figure S4E).

Quantitative mRNA-protein expression landscapes

Cellular protein concentrations are modulated via transcriptional and translational mechanisms.103 By integrating transcriptomic and proteomic data from stage-specific embryos, we can explore the extent to which RNA signatures explain protein dynamics. First, we observe that protein and transcript expression vary significantly, spanning different orders of magnitude (Figure S5A). Moreover, consistent with existing literature,104,105 proteins encoded by low-abundance genes are underrepresented, indicating proteome coverage is not yet exhaustive (Figure S5B). We also notice strong variations in quantitative levels at each developmental stage, evident at both the protein and gene levels. There is little overlap in the rank order or even the identity of the most abundant proteins and mRNAs at any given stage (Figure S5C).

The overall correlation between the 7,021 mRNA and protein pairs is low, with a median Pearson correlation of −0.012 (Figure 2D, Table S5), similar to previous studies (Figure S5).4,5,6,7 Our approach assesses how mRNA and protein pairs change over the developmental timeline rather than a snapshot of a specific stage. Figure 2E illustrates an example of TF with high Pearson correlation between RNA-protein dynamics. Additionally, Figure S6 presents a selection of TFs known to play significant roles in the early development of Ciona.98

Using k-means co-clustering of mRNA and protein pairs, we identified 5 distinct cluster dynamics (Figure 2F). We found that the genes involved in DNA replication/repair, centriole elongation/replication, rRNA processing, and protein localization to the nucleus have maternally loaded RNA and the most static protein dynamics. Metabolic processes broadly span all of the clusters, implying that metabolic processes are not categorized by a specific dynamic pattern. Axon development, heart development, and muscle filament sliding/contraction genes are expressed at the transcript and protein level during the tailbud and larval stages of development. These data suggest that the genes in the more dynamic clusters are preferentially associated with organogenesis while the genes in the less dynamic clusters tend to drive housekeeping or cell cycle functions (Figure 2G).

In summary, we profiled Ciona’s proteome and transcriptome across key developmental stages, resulting in an atlas of 7,021 protein-mRNA pairs, underscoring the complementary nature of mRNA and protein data in understanding cellular mechanisms. The dataset shows how mRNA and protein profiles can diverge and decouple due to translational regulation, demonstrating that transcriptional changes can be modified or overridden. This atlas, enriched with existing genomic and epigenomic data, provides a basis for further exploring RNA-protein dynamics during embryogenesis and systematically assessing adaptive expression of both RNAs and proteins.

Conserved and divergent features of the Ciona and Xenopus proteomes

Embryogenesis progresses through distinct stages, but it remains unclear if the regulatory mechanisms guiding these transitions are conserved across species. In particular, how well are the protein dynamics of orthologues conserved over significant evolutionary distances? Is there a conservation of protein abundances in relation to the levels of their corresponding mRNAs? With these questions in mind, we compare the proteome of Ciona development with that of a vertebrate. We focused on the African clawed frog Xenopus laevis, which is very attractive for proteomics analysis4,5,63,64,106,107 resulting in one of the best characterized vertebrate proteomes throughout embryogenesis. Xenopus and Ciona diverged approximately 500–600 million years ago,108 providing a significant evolutionary distance for comparison (Figure 3A).

Figure 3.

Figure 3

Comparison of development between chordate and vertebrate

(A) Experimental design of the inter-species comparative developmental transcriptome and proteome time courses. Full circles highlight stages of development sampled for RNA-seq and proteomics. Mya, million years ago.

(B) K-means co-clustering of the dynamics of orthologs (3,325) between Ciona and Xenopus development. The thickness of the line scales with the number of proteins represented in each cluster. The number of proteins in each cluster are quantified in the legend. Xenopus proteome time series from Sonnett et al.106

(C) GO term analysis identifying the functional significance of each of the clusters from B. The color of the clusters in B is kept consistent.

(D) The log2 fold change (FC) protein correlation between Ciona and Xenopus TFs. Here, FC is defined as the ratio of relative protein abundance in the larva stage compared to the egg. Most TFs show similar behavior with the notable exception of Ybx.

(E) Relative protein dynamics of TFs Ybx, Smyd1, Tfap2-r.b, Arid3, and E2f4/5. Each exhibit large fold changes in both organisms. Colors are preserved in these five proteins from the plotting in D. These TFs are canonically important for organism development by regulating transcriptional activation during the cell cycle, early muscle development, ectoderm development, gene activation through chromatin remodeling, and Nodal signaling respectively. Ybx exhibits signs of being maternally deposited in Ciona, but not in Xenopus, suggesting functional evolutionary divergence of this ortholog from chordate to vertebrate. Xenopus illustrations © Natalya Zahn (2022).

We applied k-means clustering to classify 3,350 one-to-one orthologous protein pairs into 5 distinct clusters, using the frog proteome time series data from Sonnet et al.106 (Table S7), and we identified significant similarities in proteome dynamics between these two species (Figure 3B). More than half of the shared proteins are stably expressed in both species throughout development (blue cluster, Figures 3B and 3C). This cluster is enriched for proteins involved in DNA replication, spindle formation, and chromosome movements. Clusters that capture the activity of genes involved in rRNA processing, tRNA processing, and mRNA splicing via the spliceosome show an increase in expression throughout embryogenesis in both organisms. Genes involved in metabolic and catabolic processes also shared an increase in expression throughout embryogenesis in both organisms, however with a more pronounced increase in Ciona (Figures 3B and 3C). Basement membrane assembly and muscle differentiation genes have similarly high expression throughout embryogenesis in both organisms (Figures 3B and 3C), including those known to have roles in late development such as Lamα5 and Smyd1.109,110 These results highlight the similarities of orthologous protein dynamics during the development of these highly divergent species.

We next shifted our focus to the dynamics of orthologous TFs during development. We looked at the relative expression of these proteins in swimming tadpoles over their relative expression levels in the eggs of each organism (Figure 3D). Overall, TFs that showed the most pronounced changes in Ciona tended to also increase their expression in Xenopus. Notably, Smyd1, Tfap2-r.b, and Arid3, which are known transcriptional regulators of muscle,83,99,109 ectoderm/neural crest development,99 and chromatin remodeling,83 respectively, exhibited similar patterns of expression in both species (Figure 3E). Importantly, we observed TFs that showed different expression dynamics between the two species. The Y-box binding protein, Ybx, exhibited inverse behavior between the two organisms. In Ciona, Ybx mRNA83 and protein are maternally deposited, whereas in Xenopus, it is strictly expressed after fertilization and plays a crucial role in muscle and vascular development.111,112 Ybx is a highly conserved protein involved in transcriptional regulation and is a component of messenger ribonucleoprotein complexes.113 Notably, in zebrafish, both mRNA and protein are maternally deposited and are essential for activating maternal Nodal signaling.114 Understanding the underlying reasons for the differential behavior of Ybx in Ciona and Xenopus requires further investigation. Despite many similarities, there are numerous differences that probably reflect species-specific functions.

We have identified conserved and unique protein dynamics across Ciona and Xenopus through comparison for more than ∼3,000 orthologous proteins. Overall, we find strikingly high conservation of protein dynamics between the two organisms even though they are separated by ∼600 million years of evolution. This analysis therefore presents an exciting opportunity to shed light on conserved regulatory processes in chordate development.

An inverse hourglass model for proteome evolution between Ciona and Xenopus

Cross-species embryonic development is typically aligned at the transcriptome level.48,49,50,51,53,55 We therefore used developmental proteomes to establish stage correspondences between Ciona and Xenopus species throughout embryogenesis. We identified 7,636 one-to-one orthologs at the gene level (Tables S8 and S9).53,115 At the proteome level, we complemented the time series data from Sonnet et al.106 (comprising 3,350 one-to-one orthologs, Table S7) by using an additional independent proteome time series from Van Itallie et al.,107 which included 5,376 one-to-one protein pairs (Table S10).

Starting at the transcriptome level, we observed that 60% of the orthologs are commonly expressed in both species during the early stages, before gastrulation. This shared expression decreased to 55% during the mid-developmental transition (gastrulation and neurulation) and reached 50% in the late phase (tailbud, larva, juveniles), with the highest proportion detected in early development (Figure S7A). We next sought to determine how changes in gene expression mark different developmental stages. We found that gene expression patterns between the two species do not show abrupt changes between stages but rather change gradually and continuously throughout embryonic development. This indicates a single continuum of differentiation, rather than distinct subsets, with smooth transitions across consecutive stages. The greatest transcriptomic similarity occurs at hatching, when excluding Ciona metamorphosis stages (Figures 4A and S7B; Tables S8 and S9).

Figure 4.

Figure 4

The protein anti-hourglass model

(A) Similarity heatmaps showing Pearson similarity between the two species for each investigated time point. Developmental stages are color-coded as defined in Figure 3A. The black line follows the highest correlation of the Xenopus time-point for each Ciona stage (n = 3,350, Xenopus transcriptome from Hu et al.,53 and Session et al.,115. Xenopus proteome from Sonnett et al.106).

(B) Temporal divergence of gene (blue) and protein (red) expression from Xenopus embryogenesis to each Ciona stage. Maximal similarity is represented by the smallest distance from the center line, revealing a nested hourglass model in which the proteome exhibits more evident bottlenecks at early and later stages. Gray boxes outline these periods of minimal divergence. Regardless of stage, proteins show higher similarity between the two species' developmental mapping than RNA-seq, suggesting that protein dynamics are evolutionarily more conserved than mRNA dynamics (n = 3,350, Xenopus transcriptome from Hu et al.,53 and Session et al.,115. Xenopus proteome from Sonnett et al.106).

Comparison of the shared proteome reveals striking differences with the analysis of transcriptomes. The proteomes exhibit distinct phases of shared expression, one early and one late, which are divided by a sharp mid-developmental transition (Figure 4A). The two species showed increasing proteome divergence with each other as they undergo neurulation. This pattern is consistent with an inverse hourglass model with the highest divergence during gastrulation and neurulation (Figures 4A, S8, and S9). The early developmental phase may be subject to more functional constraints and less refractory to change, while the larval stage, crucial for forming a swimming tadpole in both species, shows overlapping protein functions and similar phenotypes.

The proteogenomic patterns revealed by this study remain consistent across various types of comparisons and are robust against different parameters used in constructing the correlation matrix (Pearson (r), Spearman (ρ), Cosine) (Figure S7C), and potential stage sampling biases (Figures S7B, S8, and S9). For example, extending the Ciona time series from 8 to 20 stages (from egg to juveniles, Table S853 and the Xenopus series to 17 distinct time points (from egg to swimming and feeding tadpoles, Table S9)53,115 again showed maximal transcriptomic similarity at hatching (Figure S7B). Similarly, when analyzing a different proteome dataset for inter-species comparison,106,107 the dual-phase pattern is still evident. This Xenopus time series included two additional time points beyond those previously analyzed, effectively spanning the first 120 (hpf) of embryogenesis (Figures S8 and S9).

To map stage transitions in the embryonic timeline, we classified stages with similar morphological events in both species, including cleavage, blastula formation, gastrulation, neurulation, tailbud and swimming larva. We determined the highest correlation points for each stage using both transcriptome and proteome data. By connecting these points (shown as a black line in Figure 4A), we assessed whether mRNA or protein expression better matched the known phenotypic stages. This analysis revealed that protein correlations more closely followed the established mapping of equivalent developmental stages (Figure 4A), indicating that proteomes provide a more accurate representation of embryonic stages compared to transcriptomes (Figure 4A).

Our results are consistent with an inverse hourglass model for protein conservation whereby protein activity is most divergent at mid-developmental stages and the molecular components that comprise early and late embryogenesis are more conserved (Figures 4B, S8, and S9). We hypothesize that this divergence might represent the distinct mechanisms of gastrulation and neurulation in the two species. In Ciona, gastrulation takes place via a cup-shaped gastrula driven by invagination of the endoderm, whereas in Xenopus, convergent extension of mesoderm and epidermal epiboly play important roles. Most importantly, Ciona differs temporally from its vertebrate cousin by specifying its axis at the neurula stage, rather than at gastrulation.116 In frog development, Stage 9 signifies the beginning of gastrulation. Maternal deposits and translation play a significant role in shaping early embryogenesis. It is likely that similar proteins and pathways are conserved across species for timing and initiating this crucial phase, as evidenced by the high conservation observed in the proteome during this period. However, as gastrulation begins, the dynamics of embryogenesis shift, the mechanisms underlying this process start to differ significantly among species, setting the stage for the zygotic genome to take over gradually. This divergence is reflected in low or negligible signals of conservation observed in the blastula stage transcriptome among different species. New genes need to be expressed becoming more diverse and species-specific to evolutionary adaptations. The highest similarity between the species proteomes is observed at the larval stage, likely due to shared structural and ecological needs of swimming larvae.

Throughout all stages, we noticed that the proteome correlations were always higher than the transcriptome correlations (Figures 4A and 4B). This suggests that protein behavior is more evolutionarily conserved over time than mRNA behavior, likely because proteins are directly responsible for carrying out functions.61,117 It is possible that post-transcriptional mechanisms, such as variations in translation or protein degradation rates, have evolved to offset differences in mRNA dynamics.

The proteome closely reflects an organism’s physical traits, offering a more accurate measure of developmental and evolutionary differences within chordates. This underscores the importance of proteomics for evolutionary studies across species. However, previous gene ontology analysis linked variations in the transcriptome to specific biological functions. Regulatory mechanisms, including post-transcriptional, translational, and protein-degradation processes, appear to compensate for mRNA levels dissimilarity, aligning protein abundances with evolutionarily preferred levels.61,118,119 This suggests a synergy between genetic drift and regulatory mechanisms in chordate evolution, focusing on key regulatory genes essential for developmental processes and post-translational regulation. Our study highlights the significance of the simple chordate Ciona in understanding chordate development, proving its worth as a model for future comparative research, particularly in studying proteome stability and its evolutionary implications.

Limitations of the study

Our analysis is subject to certain limitations. The proteome atlas identifies ∼15,000 expressed genes and ∼7,000 proteins. Nearly 40% of the proteome remains uncharacterized, likely missing proteins expressed during later stages, such as metamorphosis, which our embryo-centric analysis does not cover. It is also possible that a number of RNAs and proteins are exclusively expressed in juveniles or adults, representing another gap yet to be addressed. Additionally, the detection of certain proteins is challenged by their incompatibility with standard proteomics methods, including precipitation and digestion steps, or due to their low abundance.13,120 Our analysis, based on whole embryos, inherently reflects average protein levels across diverse cell types. Our study includes the analysis of different stages of Ciona embryogenesis, however we would like to point out that there is a comparative under-representation of metamorphosis and juvenile stages.

STAR★Methods

Key resources table

REAGENT or RESOURCE SOURCE IDENTIFIER
Biological samples

Ciona robusta formerly Ciona intestinalis type A San Diego, USA N/A

Chemicals, peptides, and recombinant proteins

Pierce Protease Inhibitor Mini Tablets, EDTA Free Thermo Scientific Cat#PI88666
Lysyl Endopeptidase, MS Grade (Lys-C) Wako Pure Chemical Cat#125-05061
Sequencing Grade Modified Trypsin Promega Cat#V5111
RNase A, DNase and protease-free Thermo Scientific Cat#EN0531
Trypsin Protease, MS Grade Thermo Scientific Cat#90305
TRI Reagent Sigma-Aldrich Cat#93289
TMTsixplex Isobaric Label Reagent Set Thermo Scientific Cat#90062
Sep-Pak C18 1 cc Vac Cartridge Waters Cat#WAT054955
Pierce C18 Spin Tips & Columns Thermo Scientific Cat#84850
TURBO DNase Invitrogen Cat#AM2238

Critical commercial assays

Quick Start Bradford Protein Assay Kit 1 Bio-Rad Cat#5000201
Proteomics Dynamic Range Standard Set Sigma-Aldrich Cat#232-650-8
RNA Clean & Concentrator Kit Zymo Cat#R1017
PrepX RNA-Seq for Illumina Library Kit Takara Bio Cat#640097
Pierce BCA Protein Assay Kits Thermo Scientific Cat#23225

Deposited data

Raw and analyzed RNA-seq data This paper GEO: GSE237005
Raw proteomics data This paper ProteomeXchange: PXD043619
Ciona bulk RNA-seq Reeves et al.39; Kaplan et al.67; Sharma et al.68; Wang et al.69 NCBI SRA PRJNA376667,PRJNA508201,PRJNA498494,PRJNA529900
KH Ciona Transcriptome ANISEED121 https://aniseed.fr/
Homo sapiens proteome Uniprot70 Proteome ID: UP000005640
Gallus gallus proteome Uniprot70 Proteome ID: UP000000539
Xenopus tropicalis proteome Uniprot70 Proteome ID: UP000008143
Danio rerio proteome Uniprot70 Proteome ID: UP000000437
Branchiostoma floridae proteome Uniprot70 Proteome ID: UP000001554
Strongylocentrotus purpuratus proteome Uniprot70 Proteome ID: UP000007110
Ciona robusta proteome Uniprot70 Proteome ID: UP000008144
KY21 Ciona proteome Satou et al.72 http://ghost.zool.kyoto-u.ac.jp/download_ht.html
Xenopus laevis v10.1 proteome NCBI NCBI RefSeq assembly GCF_017654675.1
Ciona time-series RNA-seq data Hu et al.53 NCBI SRA PRJDB3785
Xenopus laevis time-series RNA-seq data Hu et al.53; Session et al.115 NCBI SRA PRJDB3785, PRJNA296953
Xenopus laevis time-series proteomics data Sonnett et al.106 ProteomeXchange: PXD007915
UPS2 proteomics standards FASTA file Sigma-Aldrich https://www.sigmaaldrich.com/deepweb/assets/sigmaaldrich/marketing/global/fasta-files/ups1-ups2-sequences.fasta

Software and algorithms

Mass Spec Protein Reference Tool Wühr et al.64 https://kirschner.med.harvard.edu/tools/mz_ref_db.html
Python Python Software Foundation https://www.python.org
BLAST (version 2.10.1) Altschul et al.122 https://blast.ncbi.nlm.nih.gov/doc/blast-help/downloadblastdata.html; RRID:SCR_001653; RRID:SCR_001010
Trinity (version 2.11) Grabherr et al.123 https://github.com/trinityrnaseq/trinityrnaseq/releases; RRID:SCR_013048
SeqClean Dana-Farber Cancer Institute https://sourceforge.net/projects/seqclean/files/
RepeatMasker (version 4.1) Smit et al.124 https://www.repeatmasker.org/RepeatMasker/; RRID:SCR_012954
TGICL (version 2.1) Pertea et al.125 https://sourceforge.net/projects/tgicl/files/tgicl%20v2.1/
CAP3 Huang et al.126 https://faculty.sites.iastate.edu/xqhuang/cap3-assembly-program; RRID:SCR_007250
CD-HIT (version 4.8.1) Fu et al.127; Li et al.128 https://github.com/weizhongli/cdhit; RRID:SCR_007105
R (gProfiler, topGo) Kolberg et al.129; Alexa et al.130 RRID:SCR_006809; RRID:SCR_014798
FastQC (version 0.12.0) Babraham Bioinformatics https://github.com/s-andrews/FastQC; RRID:SCR_014583
Trimgalore (version 0.6.10) Babraham Institute https://github.com/FelixKrueger/TrimGalore; RRID:SCR_011847
Salmon Patro et al.131 https://github.com/COMBINE-lab/salmon; RRID:SCR_017036

Other

Genome annotation files, transcription factor and signaling molecules databases used for RNA-seq and proteomics analyses, alignment files used in orthology assignment and other additional files This paper https://github.com/andreamariossi/proteome_ciona
Adapted Ciona schematics Hotta et al.132 https://chordate.bpni.bio.keio.ac.jp/chordate/faba/1.4/top.html
Xenopus illustrations Xenbase, Zahn et al.133 https://www.xenbase.org/xenbase/zahn.do

Resource availability

Lead contact

Further information and requests should be directed to the lead contact, Martin Wühr (wuhr@princeton.edu).

Materials availability

Materials generated for this study are available on request from Martin Wühr (wuhr@princeton.edu).

Data and code availability

  • Data: The raw data associated with the RNA-seq experiments and gene expression matrices are available in GEO under the accession number: GSE237005. The mass spectrometry experiments presented in this study have been deposited to the ProteomeXchange Consortium (http://www.proteomexchange.org/). Embryo developmental proteome (deposited via the PRIDE partner repository) with accession number: PXD043619. Genome annotation files, transcription factor and signaling molecules databases used for RNA-seq and proteomics analyses, alignment files used in orthology assignment, and additional files are publicly available on GitHub (https://github.com/andreamariossi/proteome_ciona).

  • Code: All code to reproduce this study is publicly available on GitHub (https://github.com/andreamariossi/proteome_ciona).

  • Other items: Additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.

Experimental model and study participant details

Ciona handling and embryos collection

Wild type adult hermaphrodite Ciona robusta (formerly known as Ciona intestinalis Type A)134 were obtained from M-Rep located in San Diego, CA and maintained in artificial seawater (Instant Ocean) at 18°C, under continuous illumination. Dechorionation and in vitro fertilization procedures were conducted following the protocol described in.135 For each time point in the time series, embryos were staged and collected according to132 at approximately 18°C and a total of 150 embryos were placed in Trizol for RNA extraction, while approximately 3,000 embryos were rapidly frozen in liquid nitrogen for protein TMTproC sample preparation. All samples were then stored at −80°C until further use. For absolute mass spectrometry analysis, approximately 5,000 unfertilized dechorionated eggs were directly snap-frozen.

Method details

SNP prevalence between ciona batches

One concern is the presence of single nucleotide polymorphisms (SNPs), a characteristic feature of ascidian evolution,73,136 which can cause protein sequence polymorphisms and lead to incorrect peptide inference during the processing of MS data. We evaluated the potential influence of SNPs on peptide quantification accuracy. We obtained bulk RNA-seq data from two batches of 16-cell Ciona embryos. Each batch was assembled via Trinity, then translated into protein reference databases with the mass spec protein reference tool (https://kirschner.med.harvard.edu/tools/mz_ref_db.html).64 We reciprocally BLASTed each database against the other and found 16,037 shared proteins. These shared proteins were trypsin digested in silico. 98.8% of the resulting peptides were identical between these batches while only 1.2% were wholly unique to one batch or the other indicating minimal influence of intra-specific genetic variability on peptide recognition.

Generating protein reference database

The protein reference database, a FASTA file containing all potential proteins from the species under study, was used to generate in silico tryptic peptides and reference MS/MS spectra for peptide identification. 1,222,451,669 Ciona bulk RNA-seq reads from numerous studies39,67,68,69 were assembled de novo via Trinity (version 2.11) into 2,328,005 transcripts.123 The 55,974 transcripts making up the KH Ciona transcriptome (KHNCBI.Transcript.2018.fasta, retrieved from ANISEED)121 were integrated alongside our de novo transcripts. The transcripts were cleaned and trimmed via SeqClean (http://compbio.dfci.harvard.edu/tgi/software/), then masked for common repeat motifs via RepeatMasker (version 4.1).124 The masked transcripts were clustered via TGICL (version 2.1) and assembled via CAP3.125,126 The resulting contigs and singletons were searched against a database of model organism containing human (Homo sapiens), red junglefowl (Gallus gallus), western clawed frog (Xenopus tropicalis), zebrafish (Danio rerio), florida lancelet (Branchiostoma floridae), pacific purple sea urchin (Strongylocentrotus purpuratus), and urochordate (Ciona robusta) using BLASTX (version 2.10.1).122 The BLASTX report was parsed and the transcripts were translated into proteins. The translated proteins were processed to remove redundancies with a CD-HIT (version 4.8.1) threshold of 95%.127,128

Proteomics sample preparation

Samples were prepared by lysing frozen embryos in lysis buffer (50 mM HEPES pH 7.2, 2% SDS, and 1x protease in artificial saltwater) followed by clarification via centrifugation. Lysates were diluted to 2 μg/μL with 100 mM HEPES (pH 7.2). DTT was added to a concentration of 5 mM and samples incubated for 20 min at 60°C. After cooling to room temperature (RT), N-ethylmaleimide (NEM) was added to a concentration of 20 mM and samples incubated for 20 min at RT. 10 mM DTT was added and samples incubated for 10 min at RT to quench NEM. 200 μL of each sample were brought up to 2 mL with 800 μL MeOH, 400 μL chloroform, and 600 μL water. Samples were centrifuged at 20,000 g for 2 min at RT. Upper layer was discarded and 600 μL MeOH was added. Samples were centrifuged at 20,000 g for 2 min at RT. Supernatant was discarded and 500 μL MeOH was added.137 Samples were centrifuged at 20,000 g for 2 min at RT. Supernatant was discarded and the pellet was air dried. Pellet was resuspended in 6 M GuaCl, 10 mM EPPS pH 8.5 to ∼5 mg/mL.

For the label-free samples, UPS2 standards (Sigma-Aldrich) were added to a final concentration of 27 ng/μL in the 450 μg protein samples. Samples were diluted with 10 mM EPPS pH 8.5 to 2 M guanidine hydrochloride. Samples were digested overnight at RT in LysC (Wako) at a concentration of 20 ng/μL. Samples were further diluted with 10 mM EPPS pH 8.5 to 0.5 M guanidine hydrochloride. 20 ng/μL LysC and 10 ng/μL trypsin (Promega) were added to each sample and incubated for 16 h at 37°C. Peptide supernatant was cleared by ultracentrifugation at 100,000 g for 1 h at 4°C (Beckman Coulter, 343775), then vacuum-dried overnight.

For TMTpro-labeling, samples were digested with LysC and trypsin as above, then resuspended in 200 mM EPPS pH 8.0. pre-mixed TMTpro tags (8-plex Thermo Fisher Scientific 20 μg/μL in dry acetonitrile stored at −80°C) at a 5 μg TMTpro: 1 μg peptide ratio. To cover the eight developmental time series samples, tags are as follows: 126 - unfertilized egg; 128C – fertilized egg; 129N–16-cell; 130C–initial gastrula; 131N – late neurula; 131C – mid tailbud II; 133C – late tailbud II; 134N – larva. Samples were incubated for 2 h at RT. Reactions were quenched by addition of hydroxylamine (Sigma, HPLC grade) to a final concentration of 0.5% for 30 min at RT. Samples were pooled into a single tube, cleared by ultracentrifugation at 100,000 g for 1 h at 4°C (Beckman Coulter, 343775), then and vacuum-dried overnight.

For either label-free or TMTpro-labeled, samples were resuspended with 10 mM ammonium bicarbonate (pH 8.0) with 5% acetonitrile to 1 μg/μL. Samples were separated by medium pH reverse phase HPLC (Zorbax 300Extend C18, 4.6 × 250 mm column) into 96 fractions.14,138 The fractions were then pooled into 24 fractions,139 dried, and resuspended in HPLC grade water. Samples were then desalted via homemade stage tips with C18 material (Empore) and resuspended to 1 μg/μL in 1% formic acid.140

Drawings

Ciona schematics are adapted from FABA (FABA Four-dimensional Ascidian Body Atlas)132 and Xenopus illustrations from Xenbase (www.xenbase.org RRID:SCR_003280) and Natalya Zahn.133 Source icons with BioRender.com.

Quantification and statistical analysis

Proteomics analysis

Approximately 1 μg per sample was analyzed by LC-MS, as previously described.138 LC-MS experiments were analyzed on an nLC-1200 HPLC (Thermo Fisher Scientific) coupled to an Orbitrap Fusion Lumos MS (Thermo Fisher Scientific). Peptides were separated on an Aurora Series emitter column (25 cm × 75 μm ID, 1.6 μm C18) (Ionopticks), held at 60°C during separation by an in-house built column oven. Separation was achieved by applying a 12%–35% acetonitrile gradient in 0.125% formic acid and 2% DMSO over 90 min for fractionated samples. Electrospray ionization was enabled by applying a voltage of 2.6 kV through a MicroTee at the inlet of the microcapillary column. For the label-free samples, we used the Orbitrap Fusion Lumos with the label-free method with data-dependent acquisition (DDA) previously described.64 For the TMTpro samples, we used the Orbitrap Fusion Lumos with the TMTproC method previously described.14

Mass spectrometry data analysis was performed essentially as previously described106 with the following modifications. The raw MS files were analyzed using the GFY software licensed through Harvard University. MS2 spectra assignment was performed using the Sequest algorithm141 by searching the data against either our reference protein dataset described above, the KY21 Ciona proteome,141 or the Uniprot Ciona proteome.70

For label-free analysis, these proteomes were merged with the UPS2 proteomics standards FASTA file (Sigma-Aldrich) along with common contaminants. Peptides that matched multiple proteins were assigned to the proteins with the greatest number of unique peptides. To control for peptide false discovery rate, target-decoy search strategy was used where reverse sequences were searched in parallel with forward sequences.94 Filtering was performed using a linear discriminant analysis (LDA) that accounts for parameters from Sequest’s database search output, such as XCorr, deltaCorr, missed cleavages, charge state, peptide length, and the fraction of matched ions was also implemented to distinguish genuine peptide spectral matches (PSMs) from reverse hits. The data were then filtered to 0.5% FDR on the peptide level and 1% FDR on the protein level.95,142

Absolute protein concentration estimates in unfertilized egg

Protein concentration in the label-free egg sample was calculated by building a standard curve of MS signal to UPS2 standard concentration. The UPS2 known standard concentrations were obtained from Sigma Aldrich and concentrations were converted to log space. The MS signal area was also converted to log space and Thiel regression was performed to obtain a standard curve. Signal area was then converted to concentration and scaled to a total protein concentration of 2 mM. A cutoff of 0.01 μM was applied for low concentration protein. Information on known protein complexes was obtained from the CORUM Protein Complexes dataset.89 A two-way ANOVA, followed by a post-hoc Tukey HSD test, was applied to assess the distribution of protein concentrations.

Proteomics data processing

GFY output tables for TMTcPro MS were filtered for human protein contaminants, reversed sequences and proteins which were only identified based on modified peptides as previously described.14

Annotations and classifications of transcription factors, signaling molecules, kinases, and phosphatases are based on data merged from the Ghost website143 and.121,144 The proportional coverage of these families within our dataset was determined by counting the number of members that could be identified at the protein level.

K-means clustering was performed using the kmeans function in R with nstart = 100. The number of clusters was selected to 8 to capture overall protein dynamics. Further cluster increases did not reveal new cluster dynamics. GO enrichment analyses were used to assign categories to each cluster using gProfiler.129

Principal component analysis (PCA) was performed in R with prcomp function from the stats package. Annotations for families of transcription factors, signaling molecules, kinases, and phosphatases were then overlaid on the graphs.

For the calculation of cumulative abundance, proteins and genes were initially ranked from highest to lowest. The total expressed as a percentage is plotted against their rank order. The names or identifiers of the seven most abundant transcripts or proteins (rank 1 to 7) are listed in descending order for the respective stage.

To measure the similarity between the proteome and transcriptome datasets, Pearson’s correlation coefficient (r), Spearman’s rank correlation coefficient (ρ), and Cosine distance were calculated for each individual gene-protein pair across all stages. These coefficients were then plotted as histogram distributions.

Ciona and Xenopus protein orthologs

Reciprocal protein-protein BLAST (RHB) (BLASTP, version 2.10.1) was used to identify orthologs between Ciona and Xenopus.122 Ciona and Xenopus alternated as query and reference. For each BLASTP, the max target sequence was set to 1, e-value threshold was set to 0.01, and the matrix set to BLOSUM45. The query ID, reference ID, e-value, and bit score were logged for each match. “best-match” protein orthologs between Ciona and Xenopus based on the criteria of (1) lowest e-value and (2) highest bit score. Only proteins confirmed in both directions as “best-match” were used in the cross-species proteomic analysis (Table S6).

Comparative proteomics

The extent of conservation or divergence in protein expression among chordates and vertebrates was assessed by comparing the proteome of Xenopus laevis with that of Ciona. Two independent frog time series were reanalyzed: one comprising 8 time points from Sonnet et al.106 (Table S7) and another with 10 time points from Itallie et al.107 (Table S10). These series collectively cover frog embryogenesis comprehensively, overlapping at three stages (St1, St12, and St30). All three datasets were first subjected to median-based normalization. Then, the dynamics of proteins in each dataset were scaled to sum to 1 across the time series, allowing for comparison of expression across species. Correlation coefficients, including Pearson (r), and Spearman (ρ), were calculated using pairs of orthologs. These orthologs were identified based on RHB methods as explained earlier, for all pairwise combinations of developmental stages. To co-cluster the Ciona-Xenopus proteomes across developmental stages (data from Sonnet et al.106), we used k-means clustering. Each cluster was then assigned to a functional category, based on its overall gene expression and GO enrichment profiles.

RNA sequencing

For each of the eight embryonic stages, a total of 150 embryos were collected and stored at −80°C in Trizol (Thermo Fisher Scientific). We prepared two biological replicates, one replicate consisted of embryos from the same in vitro fertilization batch for proteomic analysis. The other replicate was collected from an independent developmental time course. Total RNA was isolated using the Clean and Concentrator Zymo kit (Zymo), with genomic DNA (gDNA) removal achieved through on-column treatment with Turbo DNase (Invitrogen) at room temperature for 10 min. The resulting RNA was re-suspended in 15 μL of DEPC-treated water and quantified using a NanodropTM and Qubit (Thermo Fisher Scientific), while its quality was assessed using a Bioanalyzer 2100 (Agilent Technologies). The RNA integrity number (RIN) values ranged between 8 and 10. cDNA libraries were prepared using the PrepX RNA-seq directional protocol (Takara Bio) following the manufacturer’s instructions and utilizing an Apollo 324 robot. For mRNA enrichment and separation from rRNA, the oligo dT-based mRNA isolation kit (Takara Bio) was employed. The libraries were sequenced on the NovaSeq platform (Illumina) at the Genomics Core Facility at Princeton University with a depth of 20–40 million paired-end strand-specific reads.

Quality assessment of raw and trimmed 61-bp paired reads was performed with FastQC (version 0.12.0). Trimgalore (version 0.6.10) was used to trim the raw RNA-seq reads, removing adapters and primer contamination and poor quality base call (Q < 25). Reads shorter than 30 nt after trimming were discarded. The trimmed RNA-seq reads were then mapped to the KY21 transcriptome using Salmon (v0.42.4, with parameters --libType A, --seqBias, --gcBias, --validateMappings).131 Details about alignment quality are given in Table S4. mRNA quantities are presented as transcripts per million (TPM), with a cutoff of 2 TPM as the lower limit for detection across all samples. This cutoff was determined based on the inspection of distribution density plot and corroborated by known markers visualized from in situ hybridization chain reaction (HCR) studies145 at the 16-cell stage, which is when the newly zygotic genes are activated. For each stage, RNA data from biologically independent experiments were pooled to estimate average gene expression.

For Ciona, the extended time-series RNA-seq data was obtained from Hu et al.53 (Table S8). For Xenopus leavis, data was sourced from Session et al.115 and Hu et al.53 (Table S9). Gene expression for each species was estimated using Salmon,131 with KY21 annotation for Ciona and Xenbase X. laevis v10.1 annotation for frog. Gene-level expression was obtained by summing up TPMs from all transcript isoforms per gene using tximport R package.146 For each stage, RNA data from biologically independent experiments were pooled to estimate average gene expression. A gene was considered expressed if it had a TPM ≥2.

To compare gene expression across embryonic stages between the two species, we utilized orthologs, as identified by reciprocal best hits (RBHs). To normalize the data for distinct expression levels and mitigate the impact of highly expressed genes, we applied quantile normalization using the preprocessCore package from Bioconductor.147 We employed several metrics to estimate gene expression divergence between the two species, including Pearson (r) and Spearman (ρ) correlations, and Cosine similarity.

Gene set enrichment analysis

Gene Ontology (GO) term enrichment analyses were conducted using the gProfiler and topGo functional annotation tools.129,130 For each cluster, genes were analyzed against a background list comprising all genes expressed across all time points. Enriched GO terms were identified in the categories of ‘molecular function’, ‘cellular component’, and ‘biological process’. A Benjamini-corrected P-value threshold of 0.01 was applied to determine significant enrichment.

Acknowledgments

We thank all members of the Wühr laboratory for helpful discussion, particularly to Felix Keber, Edward Cruz and Alex Johnson. We also thank members of the Levine laboratory, especially Pavan Choppakatla for insightful inputs. We thank Nicholas Treen for providing unfertilized Ciona eggs and Lillia Ryazanova for assistance in Ciona protein sample preparation. We thank Elizabeth Van Itallie for sharing Xenopus proteomics data. This study was funded by NIH grant (T32GM007388) to Princeton University, NIH grant (NS076542) to M.S.L., NIH grant (R35GM128813) to MW, Eric and Wendy Schmidt Transformative Technology Fund to M.W., Diekman collaboration fund to M.S.L. and M.W. and Princeton Catalysis Initiative to M.S..L and M.W.

Author contributions

Conceptualization, A.N.F., A.M., M.S.L., and M.W.; Methodology, A.N.F., A.M., M.S.L., and M.W.; Investigation, A.N.F. and A.M.; Writing – Original Draft, A.N.F., A.M., M.S.L., and M.W.; Writing – Review and Editing, A.M., M.S.L., and M.W.; Funding Acquisition, M.S.L. and M.W.; Supervision, M.S.L. and M.W.

Declaration of interests

The authors declare no competing interests.

Published: February 29, 2024

Footnotes

Supplemental information can be found online at https://doi.org/10.1016/j.isci.2024.109355.

Contributor Information

Michael S. Levine, Email: msl2@princeton.edu.

Martin Wühr, Email: wuhr@princeton.edu.

Supplemental information

Document S1. Figures S1–S9 and Table S4
mmc1.pdf (3.8MB, pdf)
Table S1. Ciona absolute protein abundance in unfertilized egg, related to Figure 1
mmc2.xlsx (279.3KB, xlsx)
Table S2. Ciona relative protein abundance time series, related to Figure 2
mmc3.xlsx (1.1MB, xlsx)
Table S3. Ciona TPM RNA-seq time series, related to Figure 2
mmc4.xlsx (2.9MB, xlsx)
Table S5. Ciona relative mRNA-protein dynamics, related to Figure 2
mmc5.xlsx (1.6MB, xlsx)
Table S6. Ciona-Xenopus one-to-one orthologs, related to Figure 3
mmc6.xlsx (161.3KB, xlsx)
Table S7. Ciona-Xenopus protein dynamics from Sonnett et al. 2018, related to Figure 3
mmc7.xlsx (1.1MB, xlsx)
Table S8. Ciona TPM RNA-seq time series from Hu et al. 2017, related to Figure 4
mmc8.xlsx (8.5MB, xlsx)
Table S9. Xenopus TPM RNA-seq time series from Session et al. 2016 and Hu et al. 2017, related to Figure 4
mmc9.xlsx (21.4MB, xlsx)
Table S10. Ciona-Xenopus protein dynamics from Itallie et al. 2021, related to Figure 4
mmc10.xlsx (717KB, xlsx)

References

  • 1.Hausser J., Mayo A., Keren L., Alon U. Central dogma rates and the trade-off between precision and economy in gene expression. Nat. Commun. 2019;10:68. doi: 10.1038/s41467-018-07391-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Li G.-W., Burkhardt D., Gross C., Weissman J.S. Quantifying Absolute Protein Synthesis Rates Reveals Principles Underlying Allocation of Cellular Resources. Cell. 2014;157:624–635. doi: 10.1016/j.cell.2014.02.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Teixeira F.K., Lehmann R. Translational Control during Developmental Transitions. Cold Spring Harb. Perspect. Biol. 2019;11:a032987. doi: 10.1101/cshperspect.a032987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Peshkin L., Wühr M., Pearl E., Haas W., Freeman R.M., Gerhart J.C., Klein A.M., Horb M., Gygi S.P., Kirschner M.W. On the Relationship of Protein and mRNA Dynamics in Vertebrate Embryonic Development. Dev. Cell. 2015;35:383–394. doi: 10.1016/j.devcel.2015.10.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Smits A.H., Lindeboom R.G.H., Perino M., van Heeringen S.J., Veenstra G.J.C., Vermeulen M. Global absolute quantification reveals tight regulation of protein expression in single Xenopus eggs. Nucleic Acids Res. 2014;42:9880–9891. doi: 10.1093/nar/gku661. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Vogel C., Marcotte E.M. Insights into the regulation of protein abundance from proteomic and transcriptomic analyses. Nat. Rev. Genet. 2012;13:227–232. doi: 10.1038/nrg3185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Wegler C., Ölander M., Wiśniewski J.R., Lundquist P., Zettl K., Åsberg A., Hjelmesæth J., Andersson T.B., Artursson P. Global variability analysis of mRNA and protein concentrations across and within human tissues. NAR Genom. Bioinform. 2020;2:lqz010. doi: 10.1093/nargab/lqz010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Raj A., van Oudenaarden A. Nature, Nurture, or Chance: Stochastic Gene Expression and Its Consequences. Cell. 2008;135:216–226. doi: 10.1016/j.cell.2008.09.050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Cai L., Friedman N., Xie X.S. Stochastic protein expression in individual cells at the single molecule level. Nature. 2006;440:358–362. doi: 10.1038/nature04599. [DOI] [PubMed] [Google Scholar]
  • 10.Sonneveld S., Verhagen B.M.P., Tanenbaum M.E. Heterogeneity in mRNA Translation. Trends Cell Biol. 2020;30:606–618. doi: 10.1016/j.tcb.2020.04.008. [DOI] [PubMed] [Google Scholar]
  • 11.Livingston N.M., Kwon J., Valera O., Saba J.A., Sinha N.K., Reddy P., Nelson B., Wolfe C., Ha T., Green R., et al. Bursting translation on single mRNAs in live cells. Mol. Cell. 2023;83:2276–2289.e11. doi: 10.1016/j.molcel.2023.05.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Aebersold R., Mann M. Mass spectrometry-based proteomics. Nature. 2003;422:198–207. doi: 10.1038/nature01511. [DOI] [PubMed] [Google Scholar]
  • 13.Pappireddi N., Martin L., Wühr M. A Review on Quantitative Multiplexed Proteomics. Chembiochem. 2019;20:1210–1224. doi: 10.1002/cbic.201800650. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Johnson A., Stadlmeier M., Wühr M. TMTpro Complementary Ion Quantification Increases Plexing and Sensitivity for Accurate Multiplexed Proteomics at the MS2 Level. J. Proteome Res. 2021;20:3043–3052. doi: 10.1021/acs.jproteome.0c00813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Thompson A., Schäfer J., Kuhn K., Kienle S., Schwarz J., Schmidt G., Neumann T., Johnstone R., Mohammed A.K.A., Hamon C. Tandem Mass Tags: A Novel Quantification Strategy for Comparative Analysis of Complex Protein Mixtures by MS/MS. Anal. Chem. 2003;75:1895–1904. doi: 10.1021/ac0262560. [DOI] [PubMed] [Google Scholar]
  • 16.Demichev V., Messner C.B., Vernardis S.I., Lilley K.S., Ralser M. DIA-NN: neural networks and interference correction enable deep proteome coverage in high throughput. Nat. Methods. 2020;17:41–44. doi: 10.1038/s41592-019-0638-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Ammar C., Schessner J.P., Willems S., Michaelis A.C., Mann M. Accurate label-free quantification by directLFQ to compare unlimited numbers of proteomes. Mol. Cell. Proteomics. 2023;22:100581. doi: 10.1016/j.mcpro.2023.100581. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.McAlister G.C., Nusinow D.P., Jedrychowski M.P., Wühr M., Huttlin E.L., Erickson B.K., Rad R., Haas W., Gygi S.P. MultiNotch MS3 Enables Accurate, Sensitive, and Multiplexed Detection of Differential Expression across Cancer Cell Line Proteomes. Anal. Chem. 2014;86:7150–7158. doi: 10.1021/ac502040v. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Lucitt M.B., Price T.S., Pizarro A., Wu W., Yocum A.K., Seiler C., Pack M.A., Blair I.A., FitzGerald G.A., Grosser T. Analysis of the Zebrafish Proteome during Embryonic Development. Mol. Cell. Proteomics. 2008;7:981–994. doi: 10.1074/mcp.m700382-mcp200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Gao Y., Liu X., Tang B., Li C., Kou Z., Li L., Liu W., Wu Y., Kou X., Li J., et al. Protein Expression Landscape of Mouse Embryos during Pre-implantation Development. Cell Rep. 2017;21:3957–3969. doi: 10.1016/j.celrep.2017.11.111. [DOI] [PubMed] [Google Scholar]
  • 21.Purushothaman K., Das P.P., Presslauer C., Lim T.K., Johansen S.D., Lin Q., Babiak I. Proteomics Analysis of Early Developmental Stages of Zebrafish Embryos. Int. J. Mol. Sci. 2019;20:6359. doi: 10.3390/ijms20246359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Sun L., Bertke M.M., Champion M.M., Zhu G., Huber P.W., Dovichi N.J. Quantitative proteomics of Xenopus laevis embryos: expression kinetics of nearly 4000 proteins during early development. Sci. Rep. 2014;4:4365. doi: 10.1038/srep04365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Abdulghani M., Song G., Kaur H., Walley J.W., Tuteja G. Comparative Analysis of the Transcriptome and Proteome during Mouse Placental Development. J. Proteome Res. 2019;18:2088–2099. doi: 10.1021/acs.jproteome.8b00970. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Delsuc F., Brinkmann H., Chourrout D., Philippe H. Tunicates and not cephalochordates are the closest living relatives of vertebrates. Nature. 2006;439:965–968. doi: 10.1038/nature04336. [DOI] [PubMed] [Google Scholar]
  • 25.Dehal P., Satou Y., Campbell R.K., Chapman J., Degnan B., De Tomaso A., Davidson B., Di Gregorio A., Gelpke M., Goodstein D.M., et al. The Draft Genome of Ciona intestinalis: Insights into Chordate and Vertebrate Origins. Science. 2002;298:2157–2167. doi: 10.1126/science.1080049. [DOI] [PubMed] [Google Scholar]
  • 26.Olinski R.P., Lundin L.-G., Hallböök F. Conserved Synteny Between the Ciona Genome and Human Paralogons Identifies Large Duplication Events in the Molecular Evolution of the Insulin-Relaxin Gene Family. Mol. Biol. Evol. 2006;23:10–22. doi: 10.1093/molbev/msj002. [DOI] [PubMed] [Google Scholar]
  • 27.Simakov O., Bredeson J., Berkoff K., Marletaz F., Mitros T., Schultz D.T., O’Connell B.L., Dear P., Martinez D.E., Steele R.E., et al. Deeply conserved synteny and the evolution of metazoan chromosomes. Sci. Adv. 2022;8:eabi5884. doi: 10.1126/sciadv.abi5884. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Kikuta H., Laplante M., Navratilova P., Komisarczuk A.Z., Engström P.G., Fredman D., Akalin A., Caccamo M., Sealy I., Howe K., et al. Genomic regulatory blocks encompass multiple neighboring genes and maintain conserved synteny in vertebrates. Genome Res. 2007;17:545–555. doi: 10.1101/gr.6086307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Doglio L., Goode D.K., Pelleri M.C., Pauls S., Frabetti F., Shimeld S.M., Vavouri T., Elgar G. Parallel Evolution of Chordate Cis-Regulatory Code for Development. PLoS Genet. 2013;9:e1003904. doi: 10.1371/journal.pgen.1003904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Sanges R., Hadzhiev Y., Gueroult-Bellone M., Roure A., Ferg M., Meola N., Amore G., Basu S., Brown E.R., De Simone M., et al. Highly conserved elements discovered in vertebrates are present in non-syntenic loci of tunicates, act as enhancers and can be transcribed during development. Nucleic Acids Res. 2013;41:3600–3618. doi: 10.1093/nar/gkt030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Prummel K.D., Hess C., Nieuwenhuize S., Parker H.J., Rogers K.W., Kozmikova I., Racioppi C., Brombacher E.C., Czarkwiani A., Knapp D., et al. A conserved regulatory program initiates lateral plate mesoderm emergence across chordates. Nat. Commun. 2019;10:3857. doi: 10.1038/s41467-019-11561-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Abitua P.B., Wagner E., Navarrete I.A., Levine M. Identification of a rudimentary neural crest in a non-vertebrate chordate. Nature. 2012;492:104–107. doi: 10.1038/nature11589. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Abitua P.B., Gainous T.B., Kaczmarczyk A.N., Winchell C.J., Hudson C., Kamata K., Nakagawa M., Tsuda M., Kusakabe T.G., Levine M. The pre-vertebrate origins of neurogenic placodes. Nature. 2015;524:462–465. doi: 10.1038/nature14657. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Stolfi A., Gainous T.B., Young J.J., Mori A., Levine M., Christiaen L. Early Chordate Origins of the Vertebrate Second Heart Field. Science. 2010;329:565–568. doi: 10.1126/science.1190181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Stolfi A., Ryan K., Meinertzhagen I.A., Christiaen L. Migratory neuronal progenitors arise from the neural plate borders in tunicates. Nature. 2015;527:371–374. doi: 10.1038/nature15758. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Horie R., Hazbun A., Chen K., Cao C., Levine M., Horie T. Shared evolutionary origin of vertebrate neural crest and cranial placodes. Nature. 2018;560:228–232. doi: 10.1038/s41586-018-0385-737. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Lemaire A., Cao C., Yoon P.H., Long J., Levine M. The hypothalamus predates the origin of vertebrates. Sci. Adv. 2021;7:eabf7452. doi: 10.1126/sciadv.abf7452. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Long J., Mariossi A., Cao C., Mo Z., Thompson J.W., Levine M.S., Lemaire L.A. Cereblon influences the timing of muscle differentiation in Ciona tadpoles. Proc. Natl. Acad. Sci. USA. 2023;120 doi: 10.1073/pnas.230998912039. e2309989120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Reeves W.M., Wu Y., Harder M.J., Veeman M.T. Functional and evolutionary insights from the Ciona notochord transcriptome. Development. 2017;144:3375–3387. doi: 10.1242/dev.156174. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Papadogiannis V., Pennati A., Parker H.J., Rothbächer U., Patthey C., Bronner M.E., Shimeld S.M. Hmx gene conservation identifies the origin of vertebrate cranial ganglia. Nature. 2022;605:701–705. doi: 10.1038/s41586-022-04742-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Cao C., Lemaire L.A., Wang W., Yoon P.H., Choi Y.A., Parsons L.R., Matese J.C., Wang W., Levine M., Chen K. Comprehensive single-cell transcriptome lineages of a proto-vertebrate. Nature. 2019;571:349–354. doi: 10.1038/s41586-019-1385-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Keller T.E., Han P., Yi S.V. Evolutionary Transition of Promoter and Gene Body DNA Methylation across Invertebrate–Vertebrate Boundary. Mol. Biol. Evol. 2016;33:1019–1028. doi: 10.1093/molbev/msv345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Zhang T., Xu Y., Imai K., Fei T., Wang G., Dong B., Yu T., Satou Y., Shi W., Bao Z. A single-cell analysis of the molecular lineage of chordate embryogenesis. Sci. Adv. 2020;6:eabc4773. doi: 10.1126/sciadv.abc4773. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Sladitschek H.L., Fiuza U.-M., Pavlinic D., Benes V., Hufnagel L., Neveu P.A. MorphoSeq: Full Single-Cell Transcriptome Dynamics Up to Gastrulation in a Chordate. Cell. 2020;181:922–935.e21. doi: 10.1016/j.cell.2020.03.055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Suzuki M.M., Mori T., Satoh N. The Ciona intestinalis cleavage clock is independent of DNA methylation. Genomics. 2016;108:168–176. doi: 10.1016/j.ygeno.2016.10.001. [DOI] [PubMed] [Google Scholar]
  • 46.Madgwick A., Magri M.S., Dantec C., Gailly D., Fiuza U.-M., Guignard L., Hettinger S., Gomez-Skarmeta J.L., Lemaire P. Evolution of embryonic cis-regulatory landscapes between divergent Phallusia and Ciona ascidians. Dev. Biol. 2019;448:71–87. doi: 10.1016/j.ydbio.2019.01.003. [DOI] [PubMed] [Google Scholar]
  • 47.Kubo A., Suzuki N., Yuan X., Nakai K., Satoh N., Imai K.S., Satou Y. Genomic cis-regulatory networks in the early Ciona intestinalis embryo. Development. 2010;137:1613–1623. doi: 10.1242/dev.046789. [DOI] [PubMed] [Google Scholar]
  • 48.Irie N., Kuratani S. Comparative transcriptome analysis reveals vertebrate phylotypic period during organogenesis. Nat. Commun. 2011;2:248. doi: 10.1038/ncomms1248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Yanai I., Peshkin L., Jorgensen P., Kirschner M.W. Mapping Gene Expression in Two Xenopus Species: Evolutionary Constraints and Developmental Flexibility. Dev. Cell. 2011;20:483–496. doi: 10.1016/j.devcel.2011.03.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Gerstein M.B., Rozowsky J., Yan K.-K., Wang D., Cheng C., Brown J.B., Davis C.A., Hillier L., Sisu C., Li J.J., et al. Comparative analysis of the transcriptome across distant species. Nature. 2014;512:445–448. doi: 10.1038/nature13424. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Chan M.E., Bhamidipati P.S., Goldsby H.J., Hintze A., Hofmann H.A., Young R.L. Comparative transcriptomics reveals distinct patterns of gene expression conservation through vertebrate embryogenesis. Genome Biol. Evol. 2021;13:evab160. doi: 10.1093/gbe/evab160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Uesaka M., Kuratani S., Irie N. The developmental hourglass model and recapitulation: An attempt to integrate the two models. J. Exp. Zool. B Mol. Dev. Evol. 2022;338:76–86. doi: 10.1002/jez.b.23027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Hu H., Uesaka M., Guo S., Shimai K., Lu T.-M., Li F., Fujimoto S., Ishikawa M., Liu S., Sasagawa Y., et al. Constrained vertebrate evolution by pleiotropic genes. Nat. Ecol. Evol. 2017;1:1722–1730. doi: 10.1038/s41559-017-0318-0. [DOI] [PubMed] [Google Scholar]
  • 54.Marlétaz F., Firbas P.N., Maeso I., Tena J.J., Bogdanovic O., Perry M., Wyatt C.D.R., de la Calle-Mustienes E., Bertrand S., Burguera D., et al. Amphioxus functional genomics and the origins of vertebrate gene regulation. Nature. 2018;564:64–70. doi: 10.1038/s41586-018-0734-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Levin M., Anavy L., Cole A.G., Winter E., Mostov N., Khair S., Senderovich N., Kovalev E., Silver D.H., Feder M., et al. The mid-developmental transition and the evolution of animal body plans. Nature. 2016;531:637–641. doi: 10.1038/nature16994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Wu L., Ferger K.E., Lambert J.D. Gene Expression Does Not Support the Developmental Hourglass Model in Three Animals with Spiralian Development. Mol. Biol. Evol. 2019;36:1373–1383. doi: 10.1093/molbev/msz065. [DOI] [PubMed] [Google Scholar]
  • 57.Bininda-Emonds O.R., Jeffery J.E., Richardson M.K. Inverting the hourglass: quantitative evidence against the phylotypic stage in vertebrate development. Proc. R. Soc. Lond. Ser. B Biol. Sci. 2003;270:341–346. doi: 10.1098/rspb.2002.2242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Dunn C.W., Zapata F., Munro C., Siebert S., Hejnol A. Pairwise comparisons across species are problematic when analyzing functional genomic data. Proc. Natl. Acad. Sci. USA. 2018;115:E409–E417. doi: 10.1073/pnas.1707515115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Yanai I. Development and Evolution through the Lens of Global Gene Regulation. Trends Genet. 2018;34:11–20. doi: 10.1016/j.tig.2017.09.011. [DOI] [PubMed] [Google Scholar]
  • 60.Schwanhäusser B., Busse D., Li N., Dittmar G., Schuchhardt J., Wolf J., Chen W., Selbach M. Global quantification of mammalian gene expression control. Nature. 2011;473:337–342. doi: 10.1038/nature10098. [DOI] [PubMed] [Google Scholar]
  • 61.Laurent J.M., Vogel C., Kwon T., Craig S.A., Boutz D.R., Huse H.K., Nozue K., Walia H., Whiteley M., Ronald P.C., Marcotte E.M. Protein abundances are more conserved than mRNA abundances across diverse taxa. Proteomics. 2010;10:4209–4212. doi: 10.1002/pmic.201000327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Goldberger R.F. Springer; 1980. Biological Regulation and Development, Molecular Organization and Cell Function. [DOI] [Google Scholar]
  • 63.Baxi A.B., Lombard-Banek C., Moody S.A., Nemes P. Proteomic Characterization of the Neural Ectoderm Fated Cell Clones in the Xenopus laevis Embryo by High-Resolution Mass Spectrometry. ACS Chem. Neurosci. 2018;9:2064–2073. doi: 10.1021/acschemneuro.7b00525. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Wühr M., Freeman R.M., Presler M., Horb M.E., Peshkin L., Gygi S., Kirschner M.W. Deep Proteomics of the Xenopus laevis Egg using an mRNA-Derived Reference Database. Curr. Biol. 2014;24:1467–1475. doi: 10.1016/j.cub.2014.05.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Gupta M., Sonnett M., Ryazanova L., Presler M., Wühr M. Quantitative Proteomics of Xenopus Embryos I, Sample Preparation. Methods Mol. Biol. 2018;1865:175–194. doi: 10.1007/978-1-4939-8784-9_13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Evans V.C., Barker G., Heesom K.J., Fan J., Bessant C., Matthews D.A. De novo derivation of proteomes from transcriptomes for transcript and protein identification. Nat. Methods. 2012;9:1207–1211. doi: 10.1038/nmeth.2227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Kaplan N.A., Wang W., Christiaen L. Initial characterization of Wnt-Tcf functions during Ciona heart development. Dev. Biol. 2019;448:199–209. doi: 10.1016/j.ydbio.2018.12.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Sharma S., Wang W., Stolfi A. Single-cell transcriptome profiling of the Ciona larval brain. Dev. Biol. 2019;448:226–236. doi: 10.1016/j.ydbio.2018.09.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Wang W., Niu X., Stuart T., Jullian E., Mauck W.M., Kelly R.G., Satija R., Christiaen L. A single-cell transcriptional roadmap for cardiopharyngeal fate diversification. Nat. Cell Biol. 2019;21:674–686. doi: 10.1038/s41556-019-0336-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.UniProt Consortium. Bateman A., Martin M.-J., Orchard S., Magrane M., Ahmad S., Alpi E., Bowler-Barnett E.H., Britto R., Bye-A-Jee H., et al. UniProt: the Universal Protein Knowledgebase in 2023. Nucleic Acids Res. 2022;51:D523–D531. doi: 10.1093/nar/gkac1052. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Satou Y., Nakamura R., Yu D., Yoshida R., Hamada M., Fujie M., Hisata K., Takeda H., Satoh N. A Nearly Complete Genome of Ciona intestinalis Type A (C. robusta) Reveals the Contribution of Inversion to Chromosomal Evolution in the Genus Ciona. Genome Biol. Evol. 2019;11:3144–3157. doi: 10.1093/gbe/evz228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Satou Y., Tokuoka M., Oda-Ishii I., Tokuhiro S., Ishida T., Liu B., Iwamura Y. A Manually Curated Gene Model Set for an Ascidian, Ciona robusta (Ciona intestinalis Type A) Zoolog. Sci. 2022;39:253–260. doi: 10.2108/zs210102. [DOI] [PubMed] [Google Scholar]
  • 73.Tsagkogeorga G., Cahais V., Galtier N. The Population Genomics of a Fast Evolver: High Levels of Diversity, Functional Constraint, and Molecular Adaptation in the Tunicate Ciona intestinalis. Genome Biol. Evol. 2012;4:740–749. doi: 10.1093/gbe/evs054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Santesmasses D., Mariotti M., Guigó R. Selenoproteins, Methods and Protocols. Methods Mol. Biol. 2018;1661:17–28. doi: 10.1007/978-1-4939-7258-6_2. [DOI] [PubMed] [Google Scholar]
  • 75.Satou Y., Hamaguchi M., Takeuchi K., Hastings K.E.M., Satoh N. Genomic overview of mRNA 5′-leader trans-splicing in the ascidian Ciona intestinalis. Nucleic Acids Res. 2006;34:3378–3388. doi: 10.1093/nar/gkl418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Messerschmidt D.M., de Vries W., Ito M., Solter D., Ferguson-Smith A., Knowles B.B. Trim28 Is Required for Epigenetic Stability During Mouse Oocyte to Embryo Transition. Science. 2012;335:1499–1502. doi: 10.1126/science.1216154. [DOI] [PubMed] [Google Scholar]
  • 77.Bultman S.J., Gebuhr T.C., Pan H., Svoboda P., Schultz R.M., Magnuson T. Maternal BRG1 regulates zygotic genome activation in the mouse. Genes Dev. 2006;20:1744–1754. doi: 10.1101/gad.1435106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Toralova T., Kinterova V., Chmelikova E., Kanka J. The neglected part of early embryonic development: maternal protein degradation. Cell. Mol. Life Sci. 2020;77:3177–3194. doi: 10.1007/s00018-020-03482-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Zhang H., Ji S., Zhang K., Chen Y., Ming J., Kong F., Wang L., Wang S., Zou Z., Xiong Z., et al. Stable maternal proteins underlie distinct transcriptome, translatome, and proteome reprogramming during mouse oocyte-to-embryo transition. Genome Biol. 2023;24:166. doi: 10.1186/s13059-023-02997-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Yamada L., Saito T., Taniguchi H., Sawada H., Harada Y. Comprehensive Egg Coat Proteome of the Ascidian Ciona intestinalis Reveals Gamete Recognition Molecules Involved in Self-sterility. J. Biol. Chem. 2009;284:9402–9410. doi: 10.1074/jbc.m809672200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Nomura M., Nakajima A., Inaba K. Proteomic profiles of embryonic development in the ascidian Ciona intestinalis. Dev. Biol. 2009;325:468–481. doi: 10.1016/j.ydbio.2008.10.038. [DOI] [PubMed] [Google Scholar]
  • 82.Gillespie M.A., Palii C.G., Sanchez-Taltavull D., Shannon P., Longabaugh W.J.R., Downes D.J., Sivaraman K., Espinoza H.M., Hughes J.R., Price N.D., et al. Absolute Quantification of Transcription Factors Reveals Principles of Gene Regulation in Erythropoiesis. Mol. Cell. 2020;78:960–974.e11. doi: 10.1016/j.molcel.2020.03.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Imai K.S., Hino K., Yagi K., Satoh N., Satou Y. Gene expression profiles of transcription factors and signaling molecules in the ascidian embryo: towards a comprehensive understanding of gene networks. Development. 2004;131:4047–4058. doi: 10.1242/dev.01270. [DOI] [PubMed] [Google Scholar]
  • 84.Ahn H.R., Kim G.J. The Ascidian Numb Gene Involves in the Formation of Neural Tissues. Dev. Reprod. 2012;16:371–378. doi: 10.12717/dr.2012.16.4.371. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Walton K.D., Croce J.C., Glenn T.D., Wu S.-Y., McClay D.R. Genomics and expression profiles of the Hedgehog and Notch signaling pathways in sea urchin development. Dev. Biol. 2006;300:153–164. doi: 10.1016/j.ydbio.2006.08.064. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Oda-Ishii I., Kubo A., Kari W., Suzuki N., Rothbächer U., Satou Y. A Maternal System Initiating the Zygotic Developmental Program through Combinatorial Repression in the Ascidian Embryo. PLoS Genet. 2016;12:e1006045. doi: 10.1371/journal.pgen.1006045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Yamada L. Embryonic expression profiles and conserved localization mechanisms of pem/postplasmic mRNAs of two species of ascidian, Ciona intestinalis and Ciona savignyi. Dev. Biol. 2006;296:524–536. doi: 10.1016/j.ydbio.2006.05.018. [DOI] [PubMed] [Google Scholar]
  • 88.Shirae-Kurabayashi M., Matsuda K., Nakamura A. Ci-Pem-1 localizes to the nucleus and represses somatic gene transcription in the germline of Ciona intestinalis embryos. Development. 2011;138:2871–2881. doi: 10.1242/dev.058131. [DOI] [PubMed] [Google Scholar]
  • 89.Tsitsiridis G., Steinkamp R., Giurgiu M., Brauner B., Fobo G., Frishman G., Montrone C., Ruepp A. CORUM: the comprehensive resource of mammalian protein complexes–2022. Nucleic Acids Res. 2023;51:D539–D545. doi: 10.1093/nar/gkac1015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Bochman M.L., Schwacha A. The Mcm Complex: Unwinding the Mechanism of a Replicative Helicase. Microbiol. Mol. Biol. Rev. 2009;73:652–683. doi: 10.1128/mmbr.00019-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Kubota H. Function and regulation of cytosolic molecular chaperone CCT. Vitam. Horm. 2002;65:313–331. doi: 10.1016/s0083-6729(02)65069-1. [DOI] [PubMed] [Google Scholar]
  • 92.Uehara R., Nozawa R.S., Tomioka A., Petry S., Vale R.D., Obuse C., Goshima G. The augmin complex plays a critical role in spindle microtubule generation for mitotic progression and cytokinesis in human cells. Proc. Natl. Acad. Sci. USA. 2009;106:6998–7003. doi: 10.1073/pnas.0901587106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Liang J., Xia L., Oyang L., Lin J., Tan S., Yi P., Han Y., Luo X., Wang H., Tang L., et al. The functions and mechanisms of prefoldin complex and prefoldin-subunits. Cell Biosci. 2020;10:87. doi: 10.1186/s13578-020-00446-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Elias J.E., Gygi S.P. Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry. Nat. Methods. 2007;4:207–214. doi: 10.1038/nmeth1019. [DOI] [PubMed] [Google Scholar]
  • 95.Cao W.X., Kabelitz S., Gupta M., Yeung E., Lin S., Rammelt C., Ihling C., Pekovic F., Low T.C.H., Siddiqui N.U., et al. Precise Temporal Regulation of Post-transcriptional Repressors Is Required for an Orderly Drosophila Maternal-to-Zygotic Transition. Cell Rep. 2020;31:107783. doi: 10.1016/j.celrep.2020.107783. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.White R.J., Collins J.E., Sealy I.M., Wali N., Dooley C.M., Digby Z., Stemple D.L., Murphy D.N., Billis K., Hourlier T., et al. A high-resolution mRNA expression time course of embryonic development in zebrafish. Elife. 2017;6:e30860. doi: 10.7554/elife.30860. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Hebenstreit D., Fang M., Gu M., Charoensawan V., van Oudenaarden A., Teichmann S.A. RNA sequencing reveals two major classes of gene expression levels in metazoan cells. Mol. Syst. Biol. 2011;7:497. doi: 10.1038/msb.2011.28. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Imai K.S., Levine M., Satoh N., Satou Y. Regulatory Blueprint for a Chordate Embryo. Science. 2006;312:1183–1187. doi: 10.1126/science.1123404. [DOI] [PubMed] [Google Scholar]
  • 99.Kobayashi K., Maeda K., Tokuoka M., Mochizuki A., Satou Y. Controlling Cell Fate Specification System by Key Genes Determined from Network Structure. iScience. 2018;4:281–293. doi: 10.1016/j.isci.2018.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Mochizuki Y., Satou Y., Satoh N. Large-scale characterization of genes specific to the larval nervous system in the ascidian Ciona intestinalis. Genesis. 2003;36:62–71. doi: 10.1002/gene.10199. [DOI] [PubMed] [Google Scholar]
  • 101.Satou Y., Takatori N., Yamada L., Mochizuki Y., Hamaguchi M., Ishikawa H., Chiba S., Imai K., Kano S., Murakami S.D., et al. Gene expression profiles in Ciona intestinalis tailbud embryos. Development. 2001;128:2893–2904. doi: 10.1242/dev.128.15.2893. [DOI] [PubMed] [Google Scholar]
  • 102.Kawai N., Ogura Y., Ikuta T., Saiga H., Hamada M., Sakuma T., Yamamoto T., Satoh N., Sasakura Y. Hox10-regulated endodermal cell migration is essential for development of the ascidian intestine. Dev. Biol. 2015;403:43–56. doi: 10.1016/j.ydbio.2015.03.018. [DOI] [PubMed] [Google Scholar]
  • 103.Buccitelli C., Selbach M. mRNAs, proteins and the emerging principles of gene expression control. Nat. Rev. Genet. 2020;21:630–644. doi: 10.1038/s41576-020-0258-4. [DOI] [PubMed] [Google Scholar]
  • 104.Wang D., Eraslan B., Wieland T., Hallström B., Hopf T., Zolg D.P., Zecha J., Asplund A., Li L.H., Meng C., et al. A deep proteome and transcriptome abundance atlas of 29 healthy human tissues. Mol. Syst. Biol. 2019;15:e8503. doi: 10.15252/msb.20188503. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Mergner J., Frejno M., List M., Papacek M., Chen X., Chaudhary A., Samaras P., Richter S., Shikata H., Messerer M., et al. Mass-spectrometry-based draft of the Arabidopsis proteome. Nature. 2020;579:409–414. doi: 10.1038/s41586-020-2094-2. [DOI] [PubMed] [Google Scholar]
  • 106.Sonnett M., Gupta M., Nguyen T., Wühr M. Quantitative Proteomics for Xenopus Embryos II, Data Analysis. Methods Mol. Biol. 2018;1865:195–215. doi: 10.1007/978-1-4939-8784-9_14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Itallie E.V., Kalocsay M., Wühr M., Peshkin L., Kirschner M.W. Transitions in the proteome and phospho-proteome during xenopus laevis development. bioRxiv. 2021 doi: 10.1101/2021.08.05.455309. Preprint at. [DOI] [Google Scholar]
  • 108.Delsuc F., Philippe H., Tsagkogeorga G., Simion P., Tilak M.-K., Turon X., López-Legentil S., Piette J., Lemaire P., Douzery E.J.P. A phylogenomic framework and timescale for comparative studies of tunicates. BMC Biol. 2018;16:39. doi: 10.1186/s12915-018-0499-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109.Izzi S.A., Colantuono B.J., Sullivan K., Khare P., Meedel T.H. Functional studies of the Ciona intestinalis myogenic regulatory factor reveal conserved features of chordate myogenesis. Dev. Biol. 2013;376:213–223. doi: 10.1016/j.ydbio.2013.01.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Veeman M.T., Nakatani Y., Hendrickson C., Ericson V., Lin C., Smith W.C. chongmague reveals an essential role for laminin-mediated boundary formation in chordate convergence and extension movements. Development. 2008;135:33–41. doi: 10.1242/dev.010892. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Hanel M.L., Wuebbles R.D., Jones P.L. Muscular dystrophy candidate gene FRG1 is critical for muscle development. Dev. Dyn. 2009;238:1502–1512. doi: 10.1002/dvdy.21830. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Wuebbles R.D., Hanel M.L., Jones P.L. FSHD region gene 1 (FRG1) is crucial for angiogenesis linking FRG1 to facioscapulohumeral muscular dystrophy-associated vasculopathy. Dis. Model. Mech. 2009;2:267–274. doi: 10.1242/dmm.002261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Lyabin D.N., Eliseeva I.A., Ovchinnikov L.P. YB-1 protein: functions and regulation. Wiley Interdiscip. Rev. RNA. 2014;5:95–110. doi: 10.1002/wrna.1200. [DOI] [PubMed] [Google Scholar]
  • 114.Kumari P., Gilligan P.C., Lim S., Tran L.D., Winkler S., Philp R., Sampath K. An essential role for maternal control of Nodal signaling. Elife. 2013;2:e00683. doi: 10.7554/elife.00683. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Session A.M., Uno Y., Kwon T., Chapman J.A., Toyoda A., Takahashi S., Fukui A., Hikosaka A., Suzuki A., Kondo M., et al. Genome evolution in the allotetraploid frog Xenopus laevis. Nature. 2016;538:336–343. doi: 10.1038/nature19840. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Winkley K.M., Kourakis M.J., DeTomaso A.W., Veeman M.T., Smith W.C. Tunicate gastrulation. Curr. Top. Dev. Biol. 2020;136:219–242. doi: 10.1016/bs.ctdb.2019.09.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Schrimpf S.P., Weiss M., Reiter L., Ahrens C.H., Jovanovic M., Malmström J., Brunner E., Mohanty S., Lercher M.J., Hunziker P.E., et al. Comparative Functional Analysis of the Caenorhabditis elegans and Drosophila melanogaster Proteomes. PLoS Biol. 2009;7:e1000048. doi: 10.1371/journal.pbio.1000048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Liu J., Robinson-Rechavi M. Adaptive Evolution of Animal Proteins over Development: Support for the Darwin Selection Opportunity Hypothesis of Evo-Devo. Mol. Biol. Evol. 2018;35:2862–2872. doi: 10.1093/molbev/msy175. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Roux J., Robinson-Rechavi M. Developmental Constraints on Vertebrate Genome Evolution. PLoS Genet. 2008;4:e1000311. doi: 10.1371/journal.pgen.1000311. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.Klont F., Bras L., Wolters J.C., Ongay S., Bischoff R., Halmos G.B., Horvatovich P. Assessment of Sample Preparation Bias in Mass Spectrometry-Based Proteomics. Anal. Chem. 2018;90:5405–5413. doi: 10.1021/acs.analchem.8b00600. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Brozovic M., Dantec C., Dardaillon J., Dauga D., Faure E., Gineste M., Louis A., Naville M., Nitta K.R., Piette J., et al. ANISEED 2017: extending the integrated ascidian database to the exploration and evolutionary comparison of genome-scale datasets. Nucleic Acids Res. 2018;46:D718–D725. doi: 10.1093/nar/gkx1108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122.Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/s0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  • 123.Grabherr M.G., Haas B.J., Yassour M., Levin J.Z., Thompson D.A., Amit I., Adiconis X., Fan L., Raychowdhury R., Zeng Q., et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 2011;29:644–652. doi: 10.1038/nbt.1883. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 124.Smit H., Green R. 2015. RepeatMasker Open-4.0. [Google Scholar]
  • 125.Pertea G., Huang X., Liang F., Antonescu V., Sultana R., Karamycheva S., Lee Y., White J., Cheung F., Parvizi B., et al. TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics. 2003;19:651–652. doi: 10.1093/bioinformatics/btg034. [DOI] [PubMed] [Google Scholar]
  • 126.Huang X., Madan A. CAP3: A DNA Sequence Assembly Program. Genome Res. 1999;9:868–877. doi: 10.1101/gr.9.9.868. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Fu L., Niu B., Zhu Z., Wu S., Li W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics. 2012;28:3150–3152. doi: 10.1093/bioinformatics/bts565. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 128.Li W., Godzik A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006;22:1658–1659. doi: 10.1093/bioinformatics/btl158. [DOI] [PubMed] [Google Scholar]
  • 129.Kolberg L., Raudvere U., Kuzmin I., Adler P., Vilo J., Peterson H. g:Profiler—interoperable web service for functional enrichment analysis and gene identifier mapping (2023 update) Nucleic Acids Res. 2023;51:W207–W212. doi: 10.1093/nar/gkad347. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 130.Alexa A., Rahnenfuhrer J. 2023. topGO: Enrichment Analysis for Gene Ontology. R package version. [DOI] [Google Scholar]
  • 131.Patro R., Duggal G., Love M.I., Irizarry R.A., Kingsford C. Salmon provides fast and bias-aware quantification of transcript expression. Nat. Methods. 2017;14:417–419. doi: 10.1038/nmeth.4197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 132.Hotta K., Mitsuhara K., Takahashi H., Inaba K., Oka K., Gojobori T., Ikeo K. A web-based interactive developmental table for the ascidian Ciona intestinalis, including 3D real-image embryo reconstructions: I. From fertilized egg to hatching larva. Dev. Dyn. 2007;236:1790–1805. doi: 10.1002/dvdy.21188. [DOI] [PubMed] [Google Scholar]
  • 133.Zahn N., James-Zorn C., Ponferrada V.G., Adams D.S., Grzymkowski J., Buchholz D.R., Nascone-Yoder N.M., Horb M., Moody S.A., Vize P.D., Zorn A.M. Normal Table of Xenopus development: a new graphical resource. Development. 2022;149:dev200356. doi: 10.1242/dev.200356. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 134.Brunetti R., Gissi C., Pennati R., Caicci F., Gasparini F., Manni L. Morphological evidence that the molecularly determined Ciona intestinalis type A and type B are different species: Ciona robusta and Ciona intestinalis. J. Zoolog. Syst. Evol. Res. 2015;53:186–193. doi: 10.1111/jzs.12101. [DOI] [Google Scholar]
  • 135.Christiaen L., Wagner E., Shi W., Levine M. Isolation of Sea Squirt (Ciona) Gametes, Fertilization, Dechorionation, and Development: Figure 1. Cold Spring Harb. Protoc. 2009 doi: 10.1101/pdb.prot5344. pdb.prot5344. [DOI] [PubMed] [Google Scholar]
  • 136.Abdul-Wajid S., Veeman M.T., Chiba S., Turner T.L., Smith W.C. Exploiting the Extraordinary Genetic Polymorphism of Ciona for Developmental Genetics with Whole Genome Sequencing. Genetics. 2014;197:49–59. doi: 10.1534/genetics.114.161778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 137.Wessel D., Flügge U.I. A method for the quantitative recovery of protein in dilute solution in the presence of detergents and lipids. Anal. Biochem. 1984;138:141–143. doi: 10.1016/0003-2697(84)90782-6. [DOI] [PubMed] [Google Scholar]
  • 138.Nguyen T., Costa E.J., Deibert T., Reyes J., Keber F.C., Tomschik M., Stadlmeier M., Gupta M., Kumar C.K., Cruz E.R., et al. Differential nuclear import sets the timing of protein access to the embryonic genome. Nat. Commun. 2022;13:5887. doi: 10.1038/s41467-022-33429-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 139.Edwards A., Haas W. Multiplexed Quantitative Proteomics for High-Throughput Comprehensive Proteome Comparisons of Human Cell Lines. Methods Mol. Biol. 2016;1394:1–13. doi: 10.1007/978-1-4939-3341-9_1. [DOI] [PubMed] [Google Scholar]
  • 140.Rappsilber J., Mann M., Ishihama Y. Protocol for micro-purification, enrichment, pre-fractionation and storage of peptides for proteomics using StageTips. Nat. Protoc. 2007;2:1896–1906. doi: 10.1038/nprot.2007.261. [DOI] [PubMed] [Google Scholar]
  • 141.Eng J.K., McCormack A.L., Yates J.R. An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J. Am. Soc. Mass Spectrom. 1994;5:976–989. doi: 10.1016/1044-0305(94)80016-2. [DOI] [PubMed] [Google Scholar]
  • 142.Savitski M.M., Wilhelm M., Hahne H., Kuster B., Bantscheff M. A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets[S] Mol. Cell. Proteomics. 2015;14:2394–2404. doi: 10.1074/mcp.m114.046995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 143.Satou Y., Kawashima T., Shoguchi E., Nakayama A., Satoh N. An Integrated Database of the Ascidian, Ciona intestinalis: Towards Functional Genomics. Zoolog. Sci. 2005;22:837–843. doi: 10.2108/zsj.22.837. [DOI] [PubMed] [Google Scholar]
  • 144.Nitta K.R., Vincentelli R., Jacox E., Cimino A., Ohtsuka Y., Sobral D., Satou Y., Cambillau C., Lemaire P. High-Throughput Protein Production and Purification, Methods and Protocols. Methods Mol. Biol. 2019;2025:487–517. doi: 10.1007/978-1-4939-9624-7_23. [DOI] [PubMed] [Google Scholar]
  • 145.Treen N., Chavarria E., Weaver C.J., Brangwynne C.P., Levine M. An FGF timer for zygotic genome activation. Genes Dev. 2023;37:80–85. doi: 10.1101/gad.350164.122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 146.Soneson C., Love M.I., Robinson M.D. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Res. 2015;4:1521. doi: 10.12688/f1000research.7563.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 147.Bolstad B. preprocessCore: A collection of pre-processing functions. 2023. https://bioconductor.org/packages/preprocessCore

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Figures S1–S9 and Table S4
mmc1.pdf (3.8MB, pdf)
Table S1. Ciona absolute protein abundance in unfertilized egg, related to Figure 1
mmc2.xlsx (279.3KB, xlsx)
Table S2. Ciona relative protein abundance time series, related to Figure 2
mmc3.xlsx (1.1MB, xlsx)
Table S3. Ciona TPM RNA-seq time series, related to Figure 2
mmc4.xlsx (2.9MB, xlsx)
Table S5. Ciona relative mRNA-protein dynamics, related to Figure 2
mmc5.xlsx (1.6MB, xlsx)
Table S6. Ciona-Xenopus one-to-one orthologs, related to Figure 3
mmc6.xlsx (161.3KB, xlsx)
Table S7. Ciona-Xenopus protein dynamics from Sonnett et al. 2018, related to Figure 3
mmc7.xlsx (1.1MB, xlsx)
Table S8. Ciona TPM RNA-seq time series from Hu et al. 2017, related to Figure 4
mmc8.xlsx (8.5MB, xlsx)
Table S9. Xenopus TPM RNA-seq time series from Session et al. 2016 and Hu et al. 2017, related to Figure 4
mmc9.xlsx (21.4MB, xlsx)
Table S10. Ciona-Xenopus protein dynamics from Itallie et al. 2021, related to Figure 4
mmc10.xlsx (717KB, xlsx)

Data Availability Statement

  • Data: The raw data associated with the RNA-seq experiments and gene expression matrices are available in GEO under the accession number: GSE237005. The mass spectrometry experiments presented in this study have been deposited to the ProteomeXchange Consortium (http://www.proteomexchange.org/). Embryo developmental proteome (deposited via the PRIDE partner repository) with accession number: PXD043619. Genome annotation files, transcription factor and signaling molecules databases used for RNA-seq and proteomics analyses, alignment files used in orthology assignment, and additional files are publicly available on GitHub (https://github.com/andreamariossi/proteome_ciona).

  • Code: All code to reproduce this study is publicly available on GitHub (https://github.com/andreamariossi/proteome_ciona).

  • Other items: Additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.


Articles from iScience are provided here courtesy of Elsevier

RESOURCES