Skip to main content
eLife logoLink to eLife
. 2024 Aug 23;13:RP93062. doi: 10.7554/eLife.93062

The role of heterochronic gene expression and regulatory architecture in early developmental divergence

Nathan D Harry 1, Christina Zakas 1,
Editors: Detlef Weigel2, Detlef Weigel3
PMCID: PMC11343563  PMID: 39177024

Abstract

New developmental programs can evolve through adaptive changes to gene expression. The annelid Streblospio benedicti has a developmental dimorphism, which provides a unique intraspecific framework for understanding the earliest genetic changes that take place during developmental divergence. Using comparative RNAseq through ontogeny, we find that only a small proportion of genes are differentially expressed at any time, despite major differences in larval development and life history. These genes shift expression profiles across morphs by either turning off any expression in one morph or changing the timing or amount of gene expression. We directly connect the contributions of these mechanisms to differences in developmental processes. We examine F1 offspring – using reciprocal crosses – to determine maternal mRNA inheritance and the regulatory architecture of gene expression. These results highlight the importance of both novel gene expression and heterochronic shifts in developmental evolution, as well as the trans-acting regulatory factors in initiating divergence.

Research organism: Other

Introduction

Small changes in development can result in vast morphological differentiation and divergence. Through the history of evolutionary developmental biology, researchers have proposed that changes in the timing of traits during development can produce most morphological changes (deBeer, 1930; Gould, 1985; Dobreva et al., 2022). Though this theory has been further refined in the field since Haeckel first coined the term ‘heterochrony’ in development (1875), the timing of developmental changes remains a prominent mechanism of diversification (Wray and McClay, 1989; Wray and Raff, 1991; Smith, 2002; Smith, 2003). At the molecular level, changes in gene expression timing (heterochronic genes) or expression amount (heteromorphic genes) can underlie major morphological differences (Erwin and Davidson, 2002). However, these heterochronic shifts are usually investigated on a per-gene basis to reveal the extent that morphological change is achieved through gene expression perturbation (as reviewed in Dobreva et al., 2022). Overall, the extent that heterochronic gene expression contributes to developmental differences at the molecular level has not been quantified. Therefore, the underlying regulatory changes that result in gene expression changes also remain elusive. In this study we quantify total gene expression over sequential developmental time to determine the extent that gene expression differences occur between divergent developmental and life-history modes. Furthermore, we use genetic crosses between the developmental morphs to quantify the mode of regulatory change that is responsible for these differences.

Changes in gene expression (heterochrony and heteromorphy) are now well established drivers of both interspecific (King and Wilson, 1975; Carroll, 1995; Wray et al., 2003; Fay and Wittkopp, 2008) and intraspecific (López-Maury et al., 2008; Hamann et al., 2021) differentiation. Knowing the extent that these biological processes are at play is fundamental to understanding developmental evolution (Gould, 1985; Raff and Wray, 1989; Smith, 2003; Vaglia and Smith, 2003; Carleton et al., 2008; Gunter et al., 2014; Willink et al., 2020). While somewhat subtle differences in gene expression timing and amount can drive morphological changes, morph-specific genes – where genes are simply turned on or off, and possibly gained or lost in one lineage – are also possible (Hilgers et al., 2018; Luna and Chain, 2021). Despite the importance of these mechanisms for developmental and evolutionary change, long-standing questions in the field remain: What are the extents to which modifications of gene expression change developmental programs and morphological outcomes? What are the molecular factors that regulate these changes and how do they act?

One difficulty in determining the genetic basis of gene expression shifts is that it typically involves parsing developmental programs across divergent taxa, and therefore requires querying numerous genetic differences that could have arisen by either selection or drift over millions of years. Creating hybrids across divergent species presents its own difficulties. Here, we determine the extent that gene expression differences contribute to developmental divergence over small evolutionary timescales using a model with two developmental modes within a single species. We examine the prevailing prediction that heterochrony is the primary driver of morphological change and developmental evolution (Raff and Wray, 1989; McNamara, 2012; Dobreva et al., 2022). While numerous interspecific gene expression studies have assessed the occurrence of both heterochronic genes and morph-specific genes (Ruvkun and Giusto, 1989; Capra et al., 2010; Simola et al., 2010; Schmitz et al., 2020), no study has assessed the contributions of these mechanisms to producing differences in developmental processes. By using an intraspecific model of developmental divergence where genetic crosses are possible, we determine the architecture of regulatory differences that drive gene expression and ultimately life-history differences.

Results

Study system and embryology

We compare the gene expression over developmental time for two morphs of a marine annelid, Streblospio benedicti, which has an intraspecific developmental dimorphism. There are two distinct developmental morphs which differ in their egg size, embryological development time, larval ecology, and morphology. These are either obligately feeding planktotrophic (PP) larvae or non-feeding lecithotrophic (LL) larvae. Despite these developmental differences, as adults they are morphologically indistinguishable outside of some reproductive traits and occupy the same environmental niches. The larval traits are heritable, meaning the differences in development are genetic and not plastic (Levin and Creed, 1986). The two morphs have been characterized extensively in terms of life-history and genetic differences (Levin, 1984; Levin and Creed, 1986; Gibson et al., 2010; Zakas and Rockman, 2014; Zakas et al., 2018; Zakas, 2022). Intriguingly, crosses between the morphs are viable with no obvious fitness effects, and these F1 offspring can have intermediate larval traits compared to the parentals (Levin and Creed, 1986; Zakas, 2022). Embryological differences between the two types (and thus egg sizes) have been briefly described (McCAIN, 2008) but here we detail the full time course of embryogenesis from the one-cell embryo through the larval phase in detail for both morphs, and present data from reciprocal F1 crosses (PL or LP) between the morphs in both directions across the same developmental period.

Spiralian animals have a famously conserved pattern of embryological cleavage (reviewed in Lyons et al., 2012), so unsurprisingly we find the two morphs proceed through the same developmental stages despite starting from eggs of different sizes (8× volume difference). Despite the similarity in embryo morphology, there are some notable differences between the two morphs: the absolute time between each stage is shifted such that the LL embryos take longer to reach an equivalent larval stage, which is expected as larger, more yolky cells can take longer to divide (McCAIN, 2008). At the swimming larval stage there are some notable morphological and behavioral differences (reviewed in Zakas, 2022, Figure 1), notably the PP larvae are obligately feeding and have feeding structures, while the LL are not. While the embryological stages may not be morphologically different other than size, it is reasonable to expect that they could be expressing different genes or changing the relative timing of expression to produce the morphological and behavioral differences seen in the larvae.

Figure 1. Embryology and development of S. benedicti.

Figure 1.

(A) Timeline of early development of the two morphs. The ‘swimming’ stage is the onset of swimming ability in both morphs. The ‘1-week’ old stage (7 days post fertilization) occurs approximately 1–4 days following larval release from maternal brood pouches under natural conditions. Planktotrophic (PP) larvae at this stage were not fed. Stages used in this study are marked with a red dot. (B) Adult male. The adults of both morphs are indistinguishable outside of reproductive traits. (C) Table of larval traits. Lecithotrophic (LL) offspring are facultatively feeding, meaning that feeding is not obligatory, but they are capable of feeding (Pernet and McArthur, 2006). Gut formation from Pernet and McHugh, 2010, where the three-band stage is our swimming stage and the five-band stage is shortly before our 1-week stage.

Total RNA expression analysis

We measured total gene expression from the six developmental stages using RNAseq (Figure 2) with at least four biological replicates per morph at each stage (Figure 2A). Using the full dataset, the first two principal component analysis (PCA) show most of the variance in total gene expression is due to developmental stage and morph (PC1; Figure 2B). As expected the LL individuals appear to transcriptionally fall behind the PP offspring at each morphologically defined, equivalent developmental stage. We expect this pattern as LL offspring develop more slowly to reach these first developmental stages but reach the juvenile stage more quickly in absolute time than PP offspring, which develop in the water column for a longer period (Figure 1). Notably, pre-gastrulation development is distinctly separated from post-gastrulation by the second principal component.

Figure 2. Differentially expressed (DE) genes across six developmental stages in the two developmental morphs.

(A) Sequenced libraries. Colors indicate morphs: green = PP (planktotrophic), orange = LL (lecithotrophic), purple = (F1) PL, pink = (F1) LP. Numbers in boxes are the sample’s sequencing depth in million reads. Total number of replicates for each time point are in bold. Samples for which all six stages were collected from the same cross are indicated as ‘complete’ with a check mark. (B) Principal component analysis of PP and LL samples. PC1 represents developmental time, whereas PC2 separates pre-gastrulation from post-gastrulation (C) DE between morphs over development. Black line is the percentage of total expressed genes that are DE at each stage, ribbon displays the relative mean and median log2 fold-change between morphs for those genes that are DE.

Figure 2.

Figure 2—figure supplement 1. Read mapping rate between samples.

Figure 2—figure supplement 1.

Mapping rates in all pairwise comparisons do not differ significantly. p-Values are shown above the plot indicating no significant differences between the mapping rates of any two groups.
Figure 2—figure supplement 2. Number of genes expressed at each stage in planktotrophic (PP) and lecithotrophic (LL) samples.

Figure 2—figure supplement 2.

As expected for an intraspecific comparison, most genes are conserved, having the same expression levels at all stages in both morphs. Only 36.2% of all expressed genes are significantly differentially expressed (DE) between PP and LL at any stage in this dataset. We find that early in development over a third of these DE genes are significantly different between the morphs, but these differences tend to be quite small in magnitude. At gastrulation the number of significant genes decreases to less than 5% of the total DE genes, however these remaining expression differences are much larger in magnitude (Figure 2C). It appears that the two morphs are more functionally distinct during early development, likely because of the different metabolic requirements imposed on them by the differences in maternal egg provisioning (Moran and McAlister, 2009; Zakas, 2022; Harry and Zakas, 2023).

Differential expression analysis

We use our RNAseq time course dataset to quantify the extent that modifications to gene expression timing and amount contribute to developmental differences. We define heteromorphic genes as homologous genes whose expression differs significantly between larval morphs, but the pattern of expression (expression profile) does not change (Figure 3). The significant difference in heteromorphic genes could be at only one or a few discrete time points. Heterochronic genes are those whose overall expression pattern and timing change between the developmental morphs, as discussed in the clustering algorithm below. We categorize genes that are only expressed in one morph over the full time course of development as morph-specific genes. These genes are specific to only one developmental type.

Figure 3. Classifications with examples of gene expression categories.

Figure 3.

Functional heterochrony is a change in the developmental stage at which a particular pattern of gene expression appears instead of a difference in absolute time.

To differentiate heterochronic shifts in gene expression from heteromorphic ones, we clustered all gene expression patterns from PP into a representative set of expression profiles using Mfuzz (v2.60.0) (no additional clusters were found when using LL). We selected six clusters based on the criteria that the average correlation between cluster pairs increased with each additional cluster, such that the final clusters generated sufficiently represented the diversity of gene expression patterns (Figure 4, Figure 4—figure supplement 1). We then assigned each gene to a cluster in the PP and LL datasets independently. A gene’s expression is considered heterochronic when it appears in one cluster for PP and a different cluster for LL.

Figure 4. Six gene expression profiles representing the patterns of expression.

The number of conserved genes and heteromorphic genes that match each cluster is listed to summarize overall trends in expression patterns that appear mostly similar in both morphs. Clusters 2 (early expression) and 5 (late expression) represent the most genes.

Figure 4.

Figure 4—figure supplement 1. Cluster number optimization.

Figure 4—figure supplement 1.

The correlation between the two most similar gene expression clusters (profiles) as cluster number increases. Threshold (solid line) at r=0.85.
Figure 4—figure supplement 2. Clusters with all genes shown plotted on top of cluster core genes.

Figure 4—figure supplement 2.

Red lines indicate genes with high cluster membership scores while blue lines indicate genes with lower membership scores. Genes with no cluster membership score greater than 0.5 were not assigned to a cluster and are not presented in this study’s results.

We identified 354 genes from our set of DE genes (45.9%) where both the PP and LL expression pattern matched the same cluster. (These are heteromorphic genes and are significantly DE in at least one developmental stage, but do not have different profiles of expression between PP and LL.) Approximately half of these are assigned to clusters 2 and 5. Cluster 2, which shows a pattern of maternal transcript degradation with no subsequent zygotic expression, is likely to contain genes associated with embryogenesis that are shared by both morphs. Cluster 5 shows a pattern of largely post-gastrulation zygotic gene expression. Based on this pattern, these genes are likely to be associated with shared larval features. Although there are some late-appearing heteromorphic genes that are associated with discrete larval differences: ‘chitin catabolic processes’, for example, are overexpressed in PP at the swimming stage (when they grow swimming chaetae) and are lowly expressed in LL which do not make swimming chaetae (see Supplementary file 1).

Next, we identified heterochronies, which we define as a gene that switches clusters – and therefore expression profiles – between PP and LL. There are 224 heterochronic genes, which is 29% of DE genes. As we reduced the complexity of the entire gene expression dataset into six clusters, some genes with similar profiles in both PP and LL were ultimately assigned to different clusters, generating a false positive. To ensure against these false positives, we filtered genes by Pearson correlation between sample means in PP and LL. Genes with a correlation r>0.85 were not counted as heterochronic.

Heterochronies are a change in the gene expression profile depending on developmental morph, but how different are the patterns of heterochronic changes? Parsimoniously, we would expect most heterochronies to have a similar overall shape that is simply shifted by a stage or two (like a switch from cluster 1 to cluster 2; Figure 4). Examining the six clusters above, it is obvious that some expression profiles have similar trajectories (clusters 5 and 6, for example, both show increasing expression post blastula), while others are essentially inverted expression patterns (clusters 1 and 4 for example). Clusters are ordered in a correlation heatmap (Figure 5A) from early gene expression to late gene expression. We find that most genes that switch expression profiles between PP and LL do so between the most similar clusters, and thus are near the diagonal. But where we see opposite profiles, the trend is early expression in PP genes compared to LL (top left corner). This is expected given the delay in embryogenesis for LL compared to PP. That said, many genes are expressed earlier in LL compared to PP (bottom right orange triangle in Figure 5A) but most of these are relatively small shifts.

Figure 5. Magnitude of expression profile changes.

Figure 5.

(A) Number of switches between each cluster. Clusters are arranged on the axes in order from high early developmental expression (left/bottom) to high late developmental gene expression (right/top). The diagonal represents genes which are assigned to the same cluster in both planktotrophic (PP) and lecithotrophic (LL) samples, boxes near the diagonal are switches between similar cluster profiles, and boxes furthest from the diagonal are extreme cluster switches. Genes expressed earlier in PP are above diagonal and earlier in LL are below. (B) Density plot of genes correlations between PP and LL. Heterochronic genes are yellow and heteromorphic genes in blue. Most heterochronies are minor changes in gene expression timing but the left-tailed distribution shows that few genes have large differences in timing.

We quantify the magnitude of change in gene expression between PP and LL independently of the cluster assignment by estimating the Pearson correlation directly. This shows that most heterochronies are minor changes in the timing of gene expression. There is a left-tailed distribution indicating that a few genes have a large difference in expression patterns (like a profile inversion) between PP and LL (Figure 5B).

Gene ontology (GO) enrichment tests show heterochronic genes that are expressed earlier in PP are functionally enriched for mesoderm specification, cell fate specification pathways, and several organogenesis pathways (others include BMP signaling, mesoderm specification, intestinal epithelial cell differentiation, and gene silencing (see Supplementary file 1)). Notably, heterochronic genes which are shifted earlier in LL are enriched for many metabolic functions. The shift in expression of metabolic genes is consistent with the life-history and feeding differences between the larvae, and it may be that most of what is differentiating a PP and LL larvae is the onset of feeding and gut-related developmental programs.

Morph-specific genes, by definition, have no expression in one morph and cannot be assigned to a cluster. To quantify morph-specific genes we use a data-adaptive flag method (DAFS) to calculate expression thresholds for each sample, below which genes are not considered as expressed. Using this approach, we identify 195 genes (25.3% of DE genes) which are only expressed in one morph (Figure 6). We find considerably more P-specific genes (150 genes in PP vs 45 genes in LL), which tend to be expressed in later developmental stages. These include genes which code for proteins such as Fibropellin, Forkhead-box, Crumbs-like, Notch-2, and many zinc-finger proteins. While the function of these genes during development remains to be determined, it is possible that they maintain PP-specific larval traits.

Figure 6. Morph-specific gene expression by stage.

Figure 6.

These genes are never expressed in the opposite morph, but may be expressed at one to six stages within a morph. Planktotrophic (PP) is green and lecithotrophic (LL) is orange.

Expression of functionally distinct categories of genes may evolve by different mechanisms. Morph-specific genes may be involved in very different functional processes than heterochronic genes in this system. GO enrichment tests for morph-specific genes expressed in PP found a few functions related to cell fate specification and signaling pathways (GO enrichment tests for biological process, p<0.05; no significant functional enrichments for LL-specific genes), but a distinct lack of genes involved in metabolic processes. This is because metabolic pathways genes shifted earlier in the development of LL offspring are still required in both offspring morphs. But the morph-specific genes are not required for development and may be modified by different evolutionary mechanisms. These findings also imply that PP embryos require a few specific cell types to produce a PP larval form that are reduced or lost in LL.

Overall, we find that heteromorphic changes in expression accounts for most gene expression differences, which is expected as this is the smallest change in gene expression of the three categories. Gene expression need only be significantly different at one stage to fit this category, and such differences may not necessarily translate into biological differences in development. Morph-specific and heterochronic changes make up very similar proportions of the remaining differences (Figure 7). This demonstrates that while true heterochronies are common, they are not the main driver of gene expression differences in development.

Figure 7. Distribution of the 772 genes with differentiated expression between morphs.

Figure 7.

Heteromorphy makes up nearly half of all differentiated genes, while heterochrony and morph-specific genes split the remainder nearly equally.

Gene expression of genetic crosses

Because we are using an intraspecific comparison, we can extend our analysis to understand the regulatory architecture behind these expression differences. We included RNAseq time course data for offspring from reciprocal crosses between the two morphs, meaning PP and LL parents were crossed in both directions alternating the role of the mother. Cross of the two developmental morphs to produce F1 offspring, which can have a range of intermediate traits, but typically closely resemble their mother’s phenotype (Zakas and Rockman, 2014). This is particularly useful to disentangle maternal effects, as both F1s (PL and LP; mother’s genotype is listed first) are heterozygotes, but they originate from different egg sizes and mothers with different genetic backgrounds. F1s allow us to identify the regulatory architecture underlying DE, and to assess the impact of maternal background on gene expression.

F1s have a general pattern of intermediate expression values but have more variability within replicates and clear cases of outliers (Figure 8A, Figure 8—figure supplement 1). They have considerably more variability across replicates, and some genes are misexpressed – where the F1 expression value is outside the range of the parental difference (Figure 8A). Misexpression is reported in hybridization studies and is thought to be the result of epistatic interactions between divergent genomes that reduce the fitness of offspring contributing to sympatric speciation (Norrström et al., 2011; Moran et al., 2021). Whether misexpression affects the fitness of the embryos in S. benedicti is unclear, although F1s typically develop normally in the lab and no systematic reduction in viability has been observed. In previous studies of gene expression in eggs of the two morphs, we saw a similar pattern where F1 eggs typically have intermediate expression values compared to parental genotypes, but high levels of misexpression (Harry and Zakas, 2023). For genes that are DE between PP and LL samples, we find that F1 gene expression patterns are split between those matching the maternal and paternal gene expression pattern, with slightly more genes matching the paternal expression at each stage (Figure 8B). We expect most transcripts in the first and second stages of development to be maternally derived, so the high proportion of genes matching paternal expression patterns is somewhat unexpected.

Figure 8. Differential gene expression including F1 offspring.

(A) Principal component analysis (PCA) of the top 500 most variable genes, including F1 (PL or LP) offspring. F1s have more misexpression in early stages but converge after the swimming stage and become intermediate to planktotrophic (PP) and lecithotrophic (LL) samples. (B) Plot relating F1 expression back to parental types. Dots indicate the number of DEGs between PP and LL samples at each stage. Purple bars represent the number of those DEGs for which the F1 gene expression is more like its maternal parent, and gray bars represent the number of those DEGs for which the F1 gene expression is more similar to its paternal parent.

Figure 8.

Figure 8—figure supplement 1. Principal component analysis (PCA) of all gene expression labeled by family (crossID).

Figure 8—figure supplement 1.

Figure 8—figure supplement 2. Genes with parental effects.

Figure 8—figure supplement 2.

Most genes match maternal expression patterns.

We find that most genes (>8000 genes on average, out of 14,327 genes expressed) have extremely conserved expression patterns among PP, LL, and F1 embryos at each developmental stage. The high variability and misexpression of F1s negatively impacts our statistical power by introducing variation and results in fewer genes being confidently assigned gene regulatory mechanisms. As a result, almost no genes can be identified as having a dominant or additive inheritance pattern, while 150 genes are identified as overdominant over the course of development. Notably, very few morph-specific genes are expressed at a significant level in either F1s (between 0 [PP] and 3 [LL] genes), strongly suggesting negative regulation of expression that acts in trans for these genes.

Regulatory architecture

We leverage the F1 offspring to dissect the regulatory architecture underlying developmental gene expression differences; we use allele-specific expression patterns in F1 offspring – tracking the expression of the maternal or paternal allele – to assign gene regulatory differences as either cis- or trans-acting modifications (or both; Davidson and Peter, 2015). Typically this approach is used in hybrids (Wittkopp et al., 2004; Tirosh et al., 2009; McManus et al., 2010; Coolon et al., 2014; Wang et al., 2020), but we have adapted it to intraspecific F1 offspring to test whether early divergence is consistent with predictions in the literature (Harry and Zakas, 2023).

We determine the regulatory architecture underlying DE by calculating P and L allele-specific expression in reads from F1 samples and assigned primary regulatory modes according to established criteria (Wittkopp et al., 2004; Graze et al., 2009; Coolon et al., 2014; Wang et al., 2020). As this is an intraspecific comparison with little fixed genomic differences between the morphs, not all genes have alleles that can be assigned parentage; however, a set of ~2000 DE genes have morph-differentiating SNPs enabling the parental assignment of F1 transcript reads. The regulatory modes of these DE genes are assigned as ‘cis’, ‘trans’, ‘cis+trans’, ‘cis × trans’, or ‘compensatory’ (see Supplementary file 2 for formal criteria). Of this set, misexpression in F1 offspring causes many genes to receive ambiguous assignments. As a result, only 143 genes have informative regulatory mode assignments. While this is a small number, we expect that the genes we can classify are proportionally representative of the remaining regulatory architecture.

The primary mode of regulatory change throughout development is trans-acting (Figure 9). No differences are caused by purely cis-acting regulatory modifications, though we did find some cis+trans and cis-×-trans interactions. We found that the number of genes with compensatory regulatory modifications increases sharply after gastrulation, indicating that gene expression may be more tightly controlled past that point. Studies across Drosophila species, for example, have indicated that maternal effects are regulated differently than zygotic effects, but in this case there are larger trans-acting factors in early (maternally controlled) developmental stages (Cartwright and Lott, 2020). We do not see such a clear effect; trans-acting factors make up most of the regulatory architecture throughout development. Interestingly, this result is different from previous studies of egg mRNA expression of this species, where cis- and trans-regulatory modifications were found at similar rates for maternal gene expression (Harry and Zakas, 2023). Comparison across the egg (maternal) and embryo (maternal and zygotic) regulatory landscape shows greater cis-acting regulation in the maternally expressed genes. These results demonstrate the possibility that maternal and zygotic gene regulatory architecture evolve through distinct mechanisms and on separate timescales.

Figure 9. Distribution of regulatory mode over time.

Figure 9.

The majority of regulatory changes act in trans, and compensatory changes emerge after gastrulation.

We tested the possibility that the reciprocal F1s generated in the study (for the three stages that have both LP and PL samples) might have different regulatory mode changes due to parental effects. Splitting our analyses into separate PL and LP parental arrangements did not yield significantly different results, but treating them separately meant fewer genes could be assigned a mode.

We also investigated parent-of-origin effects on gene expression using F1s. These occur when an allele’s expression is at least partially dependent on which parent contributed that allele. For instance, maternal effects may involve polymorphisms that affect the development of an offspring only when contributed as part of the maternal genome (and which impose no developmental variation when contributed by the paternal genome). These effects are a form of epigenetic inheritance which can allow for complex, adaptive maternal-zygote interactions such as certain metabolic genes being specifically activated in the offspring to match the nutrient content of its mother’s egg. Due to animal constraints where LL mothers produce far fewer offspring per clutch (10–40 on average), we were only able to collect samples for F1 offspring with L mothers (LP) for three developmental stages (16-cell, gastrula, and swimming). All six time points were sampled for F1s with PP mothers (PL). To find parental effects (maternal or paternal), we conducted a differential expression analysis contrasting the PL samples to the LP samples. Tests for differential expression show that many significant differences in gene expression between the F1 morphs arise at early stages (ostensibly due to the presence of maternal transcripts inherited through the egg) and then drop sharply after gastrulation (Figure 10; Figure 8—figure supplement 2). This indicates that while a few parental effects on gene expression may persist throughout development, most parental effects are limited to early embryogenesis, which is consistent with expectations for maternal transcription degradation.

Figure 10. Genes with parental effects.

Figure 10.

Early developmental stages have significantly more genes, likely due to maternal transcript differences. Maternal transcripts are likely degraded around the gastrula stage and any remaining parental effects observed are likely due to other mechanisms.

Discussion

By comparing gene expression at the relative embryological stages of the two morphs, we determine the extent that expression changes contribute to life-history differences. LL offspring take longer to reach the same embryonic developmental stages as PP in absolute time, although the stages are similar, and morphological differences are not observed until later development. Numerous morphological and life-history differences occur between the morphs at the swimming larval stage (reviewed in Zakas, 2022), but here we detail differences in the embryology as well. Evolutionarily, we expect that LL is the more derived developmental mode, which likely arose from a maternal increase in egg size followed by adaptive changes in the genome that increased LL fitness (reviewed in Zakas, 2022). This generates predictions about the types of genes that we expect to change in patterns and magnitude. For example, we may expect that genes associated with feeding processes, growth, and cell cycle and specification initiate faster in the smaller, PP embryo (Wray and Lowe, 2000; Strathmann, 2000; Lowe et al., 2002; Segers et al., 2012; Figueroa et al., 2021). The non-obligatory-feeding LL larvae have a delay in mesoderm formation and through-gut development (Pernet, 2003; Gibson et al., 2010) so we would expect a delay in LL for mesoderm and gut specification, which we do see in the GO terms for the genes expressed earlier in PP.

The pattern of differential expression we observe over development – where numerous but small expression differences occur early followed by a few genes with large expression differences (Figure 2C) – is consistent with the importance of gastrulation to canalize development. Gastrulation is the ‘phylotypic stage’ at which different phyla are the most morphologically and molecularly similar to each other during their development (Slack et al., 1993; Duboule, 1994; Richardson, 1995; Irie and Kuratani, 2011; Irie and Kuratani, 2014; Macchietto et al., 2017). Furthermore, adults of both types are extremely similar, so we would not expect persistent, large-scale gene expression differences at these late stages. The few genes that remain significantly DE post gastrulation are candidates for maintaining morphological and life-history differences between the morphs. (Chitin biosynthesis genes, for example, remain significantly different at later stages, and upregulated in PP.) However, these genes’ expression levels might converge much later in development as both morphs reach adulthood.

We see similar amounts of heterochronic and morph-specific genes. Heterochronic genes, and their pathways, are ideal candidates for modifying larval and life-history traits. Morph-specific genes are of particular interest in S. benedicti as previous genetic and transcriptomic work suggests there is very little differentiation of the two morphs at the genomic level (Zakas et al., 2018; Zakas et al., 2022). While it is possible that genomic rearrangements (Janssen et al., 2001), gene duplications (Long et al., 2003), or co-options (Linksvayer and Wade, 2005) underlie the expression of morph-specific genes, given previous findings of limited total sequence differentiation between morphs (Zakas et al., 2018; Zakas et al., 2022), we find these possibilities unlikely. The morph-specific genes are more likely driven by regulatory differences in enhancing or silencing elements (as in Zhao et al., 2017; Matlosz et al., 2022).

We expect that incipient species that are early in their divergence have more genetic regulatory changes due to trans-acting factors; this is because trans-acting factors may be more pleiotropic and initiate numerous developmental changes parsimoniously. Essentially a few trans-acting regulatory modifications, such as changes to transcription factors, could account for most of the developmental and phenotypic differences between morphs. As species divergence time increases, cis-acting elements can arise and refine gene expression of individual genes (Wittkopp et al., 2004; Coolon et al., 2014; Cartwright and Lott, 2020). These regulatory architecture trends have been reported in other species, such as the urchin Heliocidaris spp. where a PP and LL species have diverged in the same genus (Israel et al., 2016; Wang et al., 2020). However, no studies to date have examined this regulatory architecture in the context of the short evolutionary window of divergent morphs within the same species. This suggests the life-history differences we see in this early evolutionary divergence are driven by a few developmentally upstream trans-acting factors with pleiotropic effects.

We quantify the relative contribution of heteromorphic, heterochronic, and morph-specific gene expression pattern modifications in the evolution of a developmental dimorphism. We find that heterochronic and morph-specific genes contribute similarly to gene expression differentiation. Additionally, we find that the regulatory architecture of differential expression is predominantly trans-acting modifications, supporting the hypothesis that early gene expression evolution occurs by few, highly pleiotropic trans-acting regulatory modifications.

Methods

Animal rearing and sample collection

We use lab-reared male and female S. benedicti originally sampled from Newark Bay Bayonne, New Jersey (PP), and Long Beach, California (LL). All experimental procedures and growth incubations are carried out at 20°C unless otherwise noted. To produce offspring for sampling, we crossed virgin females with one male for each test cross (Cross ID in Figure 2A). PP offspring were fed small quantities of our lab’s standard feeding algae after the swimming stage had been reached to avoid starvation. As clutch sizes can be quite small, we took advantage of the multiple broods that can be produced by a single mating event. We used three to five consecutive broods per female where all offspring were full sibs. This is necessary as embryo number is limited (10–40 embryos per clutch for LL and 100–400 for PP) and 10–40 pooled embryos are required to produce sufficient input mRNA. In this study brood number and developmental stage are confounded within sample groups (Cross IDs).

Embryo timeline construction

To build a timeline of development, we removed embryos from the maternal brood pouch and observed development in a Petri dish in artificial seawater in an incubator at 20°C (previous work indicates development is normal outside the brood pouch). We selected six distinct time points that we confirmed with cell counts (nuclei) using Hoechst: 16 cells, blastula (64 cells), and gastrula (124 cells). Time to each development stage was averaged over at least four clutch observations. Later embryo and larval stages were identified by morphological differences (appearance of ciliated band trochophores and eye development). Six stages capture a broad scope of developmentally critical periods while having enough timing separation to be distinct (each embryonic time point was ~12 hr apart). Images were captured using a ×40 objective on a Zeiss Axio inverted microscope with an indicator for scale which was subsequently used to scale images to the same relative size in Figure 1.

RNAseq

Embryos were collected from a single female, and 10–40 were processed for total mRNA extraction at each time point. If the number of embryos was insufficient for all time points, we used multiple broods. We used a minimum of 20 LL embryos for stages 16-cell, blastula, gastrula, and 10 LL embryos for stages trochophore, swimming, 1 week. PP females have large clutches, so embryos were divided equally (~40 embryos/clutch) among the six stages. With this approach we could not collect all time points for the same females, but there are at least two complete sets and at least four replicates per stage for each morph. The same sampling stages were used when collecting F1 samples in this experiment (Figure 2A).

Embryo RNA extraction used the Arcturus PicoPure kit including the DNAse step. We used a Qubit RNA kit to measure RNA yields for pilot data, but once established we bypassed this step to maximize RNA yields. Libraries were constructed with the NEB UltraII Stranded RNA library prep kit (cat# E7760S) for Illumina. Libraries were sequenced on two lanes of 150 bp on the Illumina NovaSeq.

Sequencing read quality trimming and mapping

We used TrimGalore (cutadapt) (Martin, 2011) and FastP (Chen et al., 2018) for quality assessment and trimming. Reads were mapped to a reference of all transcript sequences (transcripts extracted with GFFread, Pertea and Pertea, 2020) that are annotated in the S. benedicti reference genome (Zakas et al., 2022) using Salmon 1.10 (Patro et al., 2017) using the default scoring parameters. The reference genome for S. benedicti is a chromosome-level assembly with >99% of genes occurring in the first 11 chromosomal scaffolds (Zakas et al., 2022). Individual sample transcript expression quantification estimates were summarized to the gene-level using the Tximport R package (Soneson et al., 2015).

Library diagnostics

As the S. benedicti genome is from P individuals (Zakas et al., 2022), we checked the mapping rate of P and L samples (as well as both F1s) at each developmental stage to ensure similar mapping rates (Figure 2—figure supplement 1). We calculated mapping by dividing the sum of expression estimates for all genes by the sequencing depth for each sample individually, and then averaging samples from each developmental stage by morph to produce an average mapping rate (Figure 2—figure supplement 1). We find LL and PP mapping rates do not differ significantly (two-sided t-test; p=0.1118), nor do the F1 samples’ mapping rates differ from any other group (Figure 2—figure supplement 2). While this does not eliminate the possibility of mapping bias, these results indicate that missing data would not substantially change our results.

Samples which received fewer than 2 million total reads were excluded from further analyses since our expected sequencing depth was between 20 and 40 million reads (six samples total). Samples were normalized using the DESeq2 R package (Love et al., 2014) and expression estimates were transformed using the variance stabilizing transformation function to perform PCA using the function prcomp (R Development Core Team, 2020), which is plotted for the first two principal components with ggplot2 (Wickham, 2016; Figure 2B).

Differential expression

We used DESeq2 to determine significance in expression differences (Love et al., 2014) using the variable grouping method recommended by the DESeq2 manual. We used two factors for characterization: developmental stage and genotype (P, L, PL, LP) which generate a factor level for each stageHead2/genotype combination (i.e. ‘16-cell_Lecithotroph’) which we called ‘multiFactor’ for DESeq2 as ‘design = ~multiFactor’. The Benjamini-Hochberg false discovery rate (FDR) algorithm was used with p-values between factor levels of ‘multiFactor’ to reduce the incidence of false positives for differential expression. Throughout this study, the threshold for significant gene expression differences is an FDR-adjusted p-value of 0.05 or less, and an expression fold-change greater than twofold.

Expression profile clustering

To summarize expression clusters we used the Mfuzz 2.58 R package (Futschik and Carlisle, 2005; Kumar and E Futschik, 2007). Clusters are based on mean expression estimates from P samples. Normalization was (DESeq2, Love et al., 2014), log2-transformed, and then filtered for genes with low variability/expression and standardized using the Mfuzz functions ‘filter.std’ and ‘standardize’, respectively. Mfuzz requires a priori cluster numbers. To capture the representative sample of expression patterns, we found the highest number of clusters for which Pearson correlations of pairs of cluster centroids (the single most cluster-representative gene for each cluster) did not exceed 0.85 (r<0.85) for any pair of clusters. This resulted in six clusters. The fuzzifier coefficient (m) required by Mfuzz was estimated to be 1.71 by the function ‘mestimate’ (Futschik and Carlisle, 2005; Kumar and E Futschik, 2007). Following cluster generation with the P expression data, the L expression data was mapped onto the clusters using the Mfuzz function ‘membership’ so both morphs could be compared. Clusters were plotted using the function ‘mfuzz.plot2’ (Figure 4, Figure 4—figure supplement 2).

Expression classification

To identify genes as heterochronic we first identify genes that have different expression patterns between the morphs and then filter those genes based on their Pearson correlation coefficients. Genes for which the PP and LL expression are on different clusters are categorized as heterochronic. Some of these genes fall onto different clusters but still have relatively similar expression profiles. Therefore, we also filter by a measure of the Pearson correlation coefficient. The threshold is based on the distribution of correlation coefficients of all PP to LL genes’ expression in the dataset (Figure 5B). Based on the distribution, a threshold of r<0.85 was selected to filter genes which would be classified as switching clusters but are similar enough in their expression between the two morphs to be removed. This filter disqualified 38 genes.

Morph-specific expression

To categorize morph-specific expression we use a gene expression thresholding approach. Spurious gene expression resulting in low levels of estimated expression (few RNAseq reads) is categorically and functionally distinct from robust gene expression (Hebenstreit et al., 2011). We establish meaningful expression thresholds based on the assumption that genes with robust expression will have values which are normally distributed. Individual sample thresholds were calculated using a DAFS (George and Chang, 2014). Thresholds were used to identify genes which are expressed at significant levels in only PP or only LL offspring. For example, if a gene is expressed above its threshold in three or more (out of five) P samples at one or more developmental stages and in no more than one LL sample at any developmental stage, then that gene was categorized as morph-specific (Table 1).

Table 1. Logical table showing criterion for morph-specific expression classification of genes based on number of samples in each group that have gene expression greater than their expression level threshold as assessed by data-adaptive flag method (DAFS).

An allowance for a single mismatched replicate is made in this criterion. This is because we do not believe that a single incidence within a morph (across all developmental stages) is a functionally significant and biologically relevant level of gene expression in this case.

PP samples above expression threshold LL samples above expression threshold Classification
≤1 ≥3 Lecithotrophic-specific expression
≥3 ≤1 Planktotrophic-specific expression

F1 expression time-series collection

We concurrently sequenced RNA libraries from F1 offspring (Figure 2A). We generated F1 offspring from reciprocal crosses P×L and L×P (both mother-father directions) using the same sample collection method as stated above. However, we only sequenced samples from three of the developmental stages for the F1s in the LP direction. All analyses were performed as above.

Mode of inheritance and F1 misexpression

Using F1 data, we classified the mode of inheritance for each gene according to established criteria and differential expression tests from DESeq2 (Coolon et al., 2014; Harry and Zakas, 2023; Wang et al., 2020). Genes which were significantly DE between reciprocal F1 offspring (PL vs LP) at any developmental stage were not considered for this analysis. The remaining genes were classified as either (1) conserved, (2) additive, (3) dominant for one genotype, or (4) misexpressed (over/under-dominant).

Parent of origin effects

DE genes between PL and LP offspring are different due to parental effects. DE was detected with DESeq2 with the contrasts ‘c(“multiFactor”, “PL_sixteencell”, “LP_sixteencell”)’, ‘c(“multiFactor”, “PL_gastrula”, “LP_gastrula”)’, and ‘c(“multiFactor”, “PL_swimming”, “LP_swimming”)’ similar to PP and LL samples. We evaluate these at three of the six developmental stages. The direction of the parental effect was determined by matching the expression change with the direction of each identified gene’s expression in PP and LL samples (where those genes were DE in PP and LL samples).

Mode of regulatory change

To measure allele-specific expression, we assigned sequencing reads from F1 samples to either P or L parentage by identifying fixed SNPs within the transcript sequences of the parental types. We used HyLiTE (Duchemin et al., 2015) to identify SNPs and assign reads as a P or L allele. We then categorized genes’ regulatory mode according to established empirical methods (Wittkopp et al., 2004; Graze et al., 2009; Coolon et al., 2014; Harry and Zakas, 2023; Wang et al., 2020) for each developmental stage independently. This required three comparisons of each gene’s expression which we performed using DESeq2: (1) the contrast of PP to LL samples, (2) the contrast of P alleles to L alleles within F1 samples, and (3) a ratio of the differential expression of PP:LL to the differential expression of P:L alleles. For (3) we use a special contrast in DESeq2, applying the design (~Geno × Ori) where Geno identifies reads as either a P or L allele and Ori identifies the reads as originating from the parentals or F1 samples. DE genes were categorized as either in ‘cis’, ‘trans’, ‘cis+trans’, or ‘cis × trans’ (Wittkopp et al., 2004; Graze et al., 2009; Coolon et al., 2014; Harry and Zakas, 2023; Wang et al., 2020).

Acknowledgements

Sarah Cole for help with collecting embryos, animal rearing, constructing the sampling timeline. Thanks to Matthew Rockman and Greg Wray and members of the Zakas lab for comments on the manuscript. This work was supported by National Institutes of Health NIGMS grant 5R35GM142853 to C Zakas.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Christina Zakas, Email: czakas@ncsu.edu.

Detlef Weigel, Max Planck Institute for Biology Tübingen, Germany.

Detlef Weigel, Max Planck Institute for Biology Tübingen, Germany.

Funding Information

This paper was supported by the following grant:

  • National Institutes of Health 5R35GM142853 to Christina Zakas.

Additional information

Competing interests

No competing interests declared.

Author contributions

Resources, Data curation, Formal analysis, Investigation, Visualization, Methodology, Writing – original draft, Writing – review and editing.

Conceptualization, Funding acquisition, Investigation, Visualization, Methodology, Writing – original draft, Project administration, Writing – review and editing.

Additional files

Supplementary file 1. Gene ontology (GO) enrichment results spreadsheet.
elife-93062-supp1.xlsx (549.4KB, xlsx)
Supplementary file 2. Mode of regulatory change classifications.
elife-93062-supp2.docx (110.5KB, docx)
MDAR checklist

Data availability

All reads are submitted to NCBI under BioProject: PRJNA1008044. All analyses and datasets are available at https://github.com/NathanDHarry/Harry-Zakas-TC, (copy archived at Harry, 2024) a GitHub repository which contains estimated transcript expression data generated by Salmon 1.10 and an R script file containing annotated code used to conduct these analyses.

The following dataset was generated:

Harry ND, Zakas C. 2024. Streblospio benedicti RNAseq developmental time-series. NCBI BioProject. PRJNA1008044

References

  1. Capra JA, Pollard KS, Singh M. Novel genes exhibit distinct patterns of function acquisition and network integration. Genome Biology. 2010;11:R127. doi: 10.1186/gb-2010-11-12-r127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Carleton KL, Spady TC, Streelman JT, Kidd MR, McFarland WN, Loew ER. Visual sensitivities tuned by heterochronic shifts in opsin gene expression. BMC Biology. 2008;6:22. doi: 10.1186/1741-7007-6-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Carroll SB. Homeotic genes and the evolution of arthropods and chordates. Nature. 1995;376:479–485. doi: 10.1038/376479a0. [DOI] [PubMed] [Google Scholar]
  4. Cartwright EL, Lott SE. Evolved Differences in cis and trans Regulation Between the Maternal and Zygotic mRNA Complements in the Drosophila Embryo. Genetics. 2020;216:805–821. doi: 10.1534/genetics.120.303626. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–i890. doi: 10.1093/bioinformatics/bty560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Coolon JD, McManus CJ, Stevenson KR, Graveley BR, Wittkopp PJ. Tempo and mode of regulatory evolution in Drosophila. Genome Research. 2014;24:797–808. doi: 10.1101/gr.163014.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Davidson EH, Peter IS. In: Genomic Control Process. Davidson EH, Peter IS, editors. Oxford: Academic Press; 2015. Chapter 4 - genomic control processes in adult body part formation; pp. 133–200. [DOI] [Google Scholar]
  8. deBeer GR. Embryology and Evolution. Humana.Mente. 1930;5:482–484. doi: 10.1017/S0031819100017964. [DOI] [Google Scholar]
  9. Dobreva MP, Camacho J, Abzhanov A. Time to synchronize our clocks: Connecting developmental mechanisms and evolutionary consequences of heterochrony. Journal of Experimental Zoology Part B. 2022;338:87–106. doi: 10.1002/jez.b.23103. [DOI] [PubMed] [Google Scholar]
  10. Duboule D. Temporal colinearity and the phylotypic progression: a basis for the stability of a vertebrate Bauplan and the evolution of morphologies through heterochrony. Development. 1994;1994:135–142. doi: 10.1242/dev.1994.Supplement.135. [DOI] [PubMed] [Google Scholar]
  11. Duchemin W, Dupont P-Y, Campbell MA, Ganley ARD, Cox MP. HyLiTE: accurate and flexible analysis of gene expression in hybrid and allopolyploid species. BMC Bioinformatics. 2015;16:8. doi: 10.1186/s12859-014-0433-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Erwin DH, Davidson EH. The last common bilaterian ancestor. Development. 2002;129:3021–3032. doi: 10.1242/dev.129.13.3021. [DOI] [PubMed] [Google Scholar]
  13. Fay JC, Wittkopp PJ. Evaluating the role of natural selection in the evolution of gene regulation. Heredity. 2008;100:191–199. doi: 10.1038/sj.hdy.6801000. [DOI] [PubMed] [Google Scholar]
  14. Figueroa Á, Brante A, Cárdenas L. RNA-Seq reveals divergent gene expression between larvae with contrasting trophic modes in the poecilogonous polychaete Boccardia wellingtonensis. Scientific Reports. 2021;11:14997. doi: 10.1038/s41598-021-94646-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Futschik ME, Carlisle B. Noise-robust soft clustering of gene expression time-course data. Journal of Bioinformatics and Computational Biology. 2005;3:965–988. doi: 10.1142/s0219720005001375. [DOI] [PubMed] [Google Scholar]
  16. George NI, Chang CW. DAFS: a data-adaptive flag method for RNA-sequencing data to differentiate genes with low and high expression. BMC Bioinformatics. 2014;15:92. doi: 10.1186/1471-2105-15-92. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Gibson G, MacDonald K, Dufton M. Morphogenesis and phenotypic divergence in two developmental morphs of Streblospio benedicti (Annelida, Spionidae) Invertebrate Biology. 2010;129:328–343. doi: 10.1111/j.1744-7410.2010.00213.x. [DOI] [Google Scholar]
  18. Gould SJ. Ontogeny and Phylogeny. Harvard University Press; 1985. [Google Scholar]
  19. Graze RM, McIntyre LM, Main BJ, Wayne ML, Nuzhdin SV. Regulatory divergence in Drosophila melanogaster and D. simulans, a genomewide analysis of allele-specific expression. Genetics. 2009;183:547–561. doi: 10.1534/genetics.109.105957. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Gunter HM, Koppermann C, Meyer A. Revisiting de Beer’s textbook example of heterochrony and jaw elongation in fish: calmodulin expression reflects heterochronic growth, and underlies morphological innovation in the jaws of belonoid fishes. EvoDevo. 2014;5:8. doi: 10.1186/2041-9139-5-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Hamann E, Pauli CS, Joly-Lopez Z, Groen SC, Rest JS, Kane NC, Purugganan MD, Franks SJ. Rapid evolutionary changes in gene expression in response to climate fluctuations. Molecular Ecology. 2021;30:193–206. doi: 10.1111/mec.15583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Harry ND, Zakas C. Maternal patterns of inheritance alter transcript expression in eggs. BMC Genomics. 2023;24:191. doi: 10.1186/s12864-023-09291-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Harry ND. Harry-zakas-TC. swh:1:rev:ac6935374df4ed51591c99f533a1b517e7ec154dSoftware Heritage. 2024 https://archive.softwareheritage.org/swh:1:dir:97683ab2fd92ba74f6952645fafc471ee968ab54;origin=https://github.com/NathanDHarry/Harry-Zakas-TC;visit=swh:1:snp:30a3be52b2a93ede289f68b27fd7472d74fe8e7b;anchor=swh:1:rev:ac6935374df4ed51591c99f533a1b517e7ec154d
  24. Hebenstreit D, Fang M, Gu M, Charoensawan V, van Oudenaarden A, Teichmann SA. RNA sequencing reveals two major classes of gene expression levels in metazoan cells. Molecular Systems Biology. 2011;7:497. doi: 10.1038/msb.2011.28. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Hilgers L, Hartmann S, Hofreiter M, von Rintelen T. Novel genes, ancient genes, and gene co-option contributed to the genetic basis of the radula, a molluscan innovation. Molecular Biology and Evolution. 2018;35:1638–1652. doi: 10.1093/molbev/msy052. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Irie N, Kuratani S. Comparative transcriptome analysis reveals vertebrate phylotypic period during organogenesis. Nature Communications. 2011;2:248. doi: 10.1038/ncomms1248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Irie N, Kuratani S. The developmental hourglass model: a predictor of the basic body plan? Development. 2014;141:4649–4655. doi: 10.1242/dev.107318. [DOI] [PubMed] [Google Scholar]
  28. Israel JW, Martik ML, Byrne M, Raff EC, Raff RA, McClay DR, Wray GA. Comparative developmental transcriptomics reveals rewiring of a highly conserved gene regulatory network during a major life history switch in the sea urchin genus heliocidaris. PLOS Biology. 2016;14:e1002391. doi: 10.1371/journal.pbio.1002391. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Janssen PJ, Audit B, Ouzounis CA. Strain-specific genes of Helicobacter pylori: distribution, function and dynamics. Nucleic Acids Research. 2001;29:4395–4404. doi: 10.1093/nar/29.21.4395. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. King MC, Wilson AC. Evolution at two levels in humans and chimpanzees. Science. 1975;188:107–116. doi: 10.1126/science.1090005. [DOI] [PubMed] [Google Scholar]
  31. Kumar L, E Futschik M. Mfuzz: A software package for soft clustering of microarray data. Bioinformation. 2007;2:5–7. doi: 10.6026/97320630002005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Levin LA. Multiple patterns of development Instreblospio Benedicti webster (spionidae) from three coasts of North America. The Biological Bulletin. 1984;166:494–508. doi: 10.2307/1541157. [DOI] [Google Scholar]
  33. Levin LA, Creed EL. Effect of temperature and food availability on reproductive responses of Streblospio benedicti (Polychaeta: Spionidae) with planktotrophic or lecithotrophic development. Marine Biology. 1986;92:103–113. doi: 10.1007/BF00392752. [DOI] [Google Scholar]
  34. Linksvayer TA, Wade MJ. The evolutionary origin and elaboration of sociality in the aculeate Hymenoptera: maternal effects, sib-social effects, and heterochrony. The Quarterly Review of Biology. 2005;80:317–336. doi: 10.1086/432266. [DOI] [PubMed] [Google Scholar]
  35. Long M, Betrán E, Thornton K, Wang W. The origin of new genes: glimpses from the young and old. Nature Reviews. Genetics. 2003;4:865–875. doi: 10.1038/nrg1204. [DOI] [PubMed] [Google Scholar]
  36. López-Maury L, Marguerat S, Bähler J. Tuning gene expression to changing environments: from rapid responses to evolutionary adaptation. Nature Reviews. Genetics. 2008;9:583–593. doi: 10.1038/nrg2398. [DOI] [PubMed] [Google Scholar]
  37. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Lowe CJ, Issel-Tarver L, Wray GA. Gene expression and larval evolution: changing roles of distal-less and orthodenticle in echinoderm larvae. Evolution & Development. 2002;4:111–123. doi: 10.1046/j.1525-142x.2002.01066.x. [DOI] [PubMed] [Google Scholar]
  39. Luna SK, Chain FJJ. Lineage-specific genes and family expansions in dictyostelid genomes display expression bias and evolutionary diversification during development. Genes. 2021;12:1628. doi: 10.3390/genes12101628. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Lyons DC, Perry KJ, Lesoway MP, Henry JQ. Cleavage pattern and fate map of the mesentoblast, 4d, in the gastropod Crepidula: a hallmark of spiralian development. EvoDevo. 2012;3:21. doi: 10.1186/2041-9139-3-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Macchietto M, Angdembey D, Heidarpour N, Serra L, Rodriguez B, El-Ali N, Mortazavi A. Comparative transcriptomics of steinernema and caenorhabditis single embryos reveals orthologous gene expression convergence during late embryogenesis. Genome Biology and Evolution. 2017;9:2681–2696. doi: 10.1093/gbe/evx195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.Journal. 2011;17:10. doi: 10.14806/ej.17.1.200. [DOI] [Google Scholar]
  43. Matlosz S, Sigurgeirsson B, Franzdóttir SR, Pálsson A, Jónsson ZO. DNA methylation differences during development distinguish sympatric morphs of Arctic charr (Salvelinus alpinus) Molecular Ecology. 2022;31:4739–4761. doi: 10.1111/mec.16620. [DOI] [PubMed] [Google Scholar]
  44. McCAIN ER. Poecilogony as a tool for understanding speciation: Early development ofStreblospio benedictiandStreblospio gynobranchiata(Polychaeta:Spionidae) Invertebrate Reproduction & Development. 2008;51:91–101. doi: 10.1080/07924259.2008.9652259. [DOI] [Google Scholar]
  45. McManus CJ, Coolon JD, Duff MO, Eipper-Mains J, Graveley BR, Wittkopp PJ. Regulatory divergence in Drosophila revealed by mRNA-seq. Genome Research. 2010;20:816–825. doi: 10.1101/gr.102491.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. McNamara KJ. Heterochrony: The evolution of development. Evolution. 2012;5:203–218. doi: 10.1007/s12052-012-0420-3. [DOI] [Google Scholar]
  47. Moran AL, McAlister JS. Egg size as a life history character of marine invertebrates: Is it all it’s cracked up to be? The Biological Bulletin. 2009;216:226–242. doi: 10.1086/BBLv216n3p226. [DOI] [PubMed] [Google Scholar]
  48. Moran BM, Payne C, Langdon Q, Powell DL, Brandvain Y, Schumer M. The genomic consequences of hybridization. eLife. 2021;10:e69016. doi: 10.7554/eLife.69016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Norrström N, Getz WM, Holmgren NMA. Selection against accumulating mutations in niche-preference genes can drive speciation. PLOS ONE. 2011;6:e29487. doi: 10.1371/journal.pone.0029487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C. Salmon provides fast and bias-aware quantification of transcript expression. Nature Methods. 2017;14:417–419. doi: 10.1038/nmeth.4197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Pernet B. Persistent ancestral feeding structures in nonfeeding annelid larvae. The Biological Bulletin. 2003;205:295–307. doi: 10.2307/1543293. [DOI] [PubMed] [Google Scholar]
  52. Pernet B, McArthur L. Feeding by larvae of two different developmental modes in Streblospio benedicti (Polychaeta: Spionidae) Marine Biology. 2006;149:803–811. doi: 10.1007/s00227-006-0266-8. [DOI] [Google Scholar]
  53. Pernet B, McHugh D. Evolutionary changes in the timing of gut morphogenesis in larvae of the marine annelid Streblospio benedicti. Evolution & Development. 2010;12:618–627. doi: 10.1111/j.1525-142X.2010.00446.x. [DOI] [PubMed] [Google Scholar]
  54. Pertea G, Pertea M. GFF Utilities: GffRead and GffCompare. F1000Research. 2020;9:ISCB Comm J-304. doi: 10.12688/f1000research.23297.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Raff RA, Wray GA. Heterochrony: Developmental mechanisms and evolutionary results. Journal of Evolutionary Biology. 1989;2:409–434. doi: 10.1046/j.1420-9101.1989.2060409.x. [DOI] [Google Scholar]
  56. R Development Core Team . Vienna, Austria: R Foundation for Statistical Computing; 2020. https://www.r-project.org [Google Scholar]
  57. Richardson MK. Heterochrony and the phylotypic period. Developmental Biology. 1995;172:412–421. doi: 10.1006/dbio.1995.8041. [DOI] [PubMed] [Google Scholar]
  58. Ruvkun G, Giusto J. The Caenorhabditis elegans heterochronic gene lin-14 encodes a nuclear protein that forms a temporal developmental switch. Nature. 1989;338:313–319. doi: 10.1038/338313a0. [DOI] [PubMed] [Google Scholar]
  59. Schmitz JF, Chain FJJ, Bornberg-Bauer E. Evolution of novel genes in three-spined stickleback populations. Heredity. 2020;125:50–59. doi: 10.1038/s41437-020-0319-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Segers F, Berishvili G, Taborsky B. Egg size-dependent expression of growth hormone receptor accompanies compensatory growth in fish. Proceedings. Biological Sciences. 2012;279:592–600. doi: 10.1098/rspb.2011.1104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Simola DF, Francis C, Sniegowski PD, Kim J. Heterochronic evolution reveals modular timing changes in budding yeast transcriptomes. Genome Biology. 2010;11:R105. doi: 10.1186/gb-2010-11-10-r105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Slack JMW, Holland PWH, Graham CF. The zootype and the phylotypic stage. Nature. 1993;361:490–492. doi: 10.1038/361490a0. [DOI] [PubMed] [Google Scholar]
  63. Smith KK. Sequence heterochrony and the evolution of development. Journal of Morphology. 2002;252:82–97. doi: 10.1002/jmor.10014. [DOI] [PubMed] [Google Scholar]
  64. Smith KK. Time’s arrow: heterochrony and the evolution of development. The International Journal of Developmental Biology. 2003;47:613–621. [PubMed] [Google Scholar]
  65. Soneson C, Love MI, Robinson MD. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Research. 2015;4:1521. doi: 10.12688/f1000research.7563.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Strathmann RR. Functional design in the evolution of embryos and larvae. Seminars in Cell & Developmental Biology. 2000;11:395–402. doi: 10.1006/scdb.2000.0192. [DOI] [PubMed] [Google Scholar]
  67. Tirosh I, Reikhav S, Levy AA, Barkai N. A yeast hybrid provides insight into the evolution of gene expression regulation. Science. 2009;324:659–662. doi: 10.1126/science.1169766. [DOI] [PubMed] [Google Scholar]
  68. Vaglia JL, Smith KK. Early differentiation and migration of cranial neural crest in the opossum, Monodelphis domestica. Evolution & Development. 2003;5:121–135. doi: 10.1046/j.1525-142x.2003.03019.x. [DOI] [PubMed] [Google Scholar]
  69. Wang L, Israel JW, Edgar A, Raff RA, Raff EC, Byrne M, Wray GA. Genetic basis for divergence in developmental gene expression in two closely related sea urchins. Nature Ecology & Evolution. 2020;4:831–840. doi: 10.1038/s41559-020-1165-y. [DOI] [PubMed] [Google Scholar]
  70. Wickham H. Ggplot2: Elegant Graphics for Data Analysis. Springer; 2016. [DOI] [Google Scholar]
  71. Willink B, Duryea MC, Wheat C, Svensson EI. Changes in gene expression during female reproductive development in a color polymorphic insect. Evolution; International Journal of Organic Evolution. 2020;74:1063–1081. doi: 10.1111/evo.13979. [DOI] [PubMed] [Google Scholar]
  72. Wittkopp PJ, Haerum BK, Clark AG. Evolutionary changes in cis and trans gene regulation. Nature. 2004;430:85–88. doi: 10.1038/nature02698. [DOI] [PubMed] [Google Scholar]
  73. Wray GA, McClay DR. Molecular heterochronies and heterotopies in early echinoid development. Evolution; International Journal of Organic Evolution. 1989;43:803–813. doi: 10.1111/j.1558-5646.1989.tb05178.x. [DOI] [PubMed] [Google Scholar]
  74. Wray GA, Raff RA. The evolution of developmental strategy in marine invertebrates. Trends in Ecology & Evolution. 1991;6:45–50. doi: 10.1016/0169-5347(91)90121-D. [DOI] [PubMed] [Google Scholar]
  75. Wray GA, Lowe CJ. Developmental regulatory genes and echinoderm evolution. Systematic Biology. 2000;49:28–51. doi: 10.1080/10635150050207375. [DOI] [PubMed] [Google Scholar]
  76. Wray GA, Hahn MW, Abouheif E, Balhoff JP, Pizer M, Rockman MV, Romano LA. The evolution of transcriptional regulation in eukaryotes. Molecular Biology and Evolution. 2003;20:1377–1419. doi: 10.1093/molbev/msg140. [DOI] [PubMed] [Google Scholar]
  77. Zakas C, Rockman MV. Dimorphic development in Streblospio benedicti: genetic analysis of morphological differences between larval types. The International Journal of Developmental Biology. 2014;58:593–599. doi: 10.1387/ijdb.140088mr. [DOI] [PubMed] [Google Scholar]
  78. Zakas C, Deutscher JM, Kay AD, Rockman MV. Decoupled maternal and zygotic genetic effects shape the evolution of development. eLife. 2018;7:e37143. doi: 10.7554/eLife.37143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Zakas C. In: Current Topics in Developmental Biology. Goldstein B, Srivastava M, editors. Academic Press; 2022. Chapter seventeen - streblospio benedicti: A genetic model for understanding the evolution of development and life-history; pp. 497–521. [DOI] [PubMed] [Google Scholar]
  80. Zakas C, Harry ND, Scholl EH, Rockman MV. The Genome of the Poecilogonous Annelid Streblospio benedicti. Genome Biology and Evolution. 2022;14:evac008. doi: 10.1093/gbe/evac008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Zhao J, Zhou Y, Li X, Cai W, Hua H. Silencing of juvenile hormone epoxide hydrolase gene (Nljheh) enhances short wing formation in a macropterous strain of the brown planthopper, Nilaparvata lugens. Journal of Insect Physiology. 2017;102:18–26. doi: 10.1016/j.jinsphys.2017.08.012. [DOI] [PubMed] [Google Scholar]

eLife assessment

Detlef Weigel 1

This important study examines the extent to which distinct developmental pathways that result in alternative morphs correlate with transcriptome differences in a marine annelid, Streblospio benedicti. The strengths of the study include the experimental design and dense temporal sampling, which together provide convincing evidence that the two morphs can be clearly distinguished at the transcriptome level, despite relatively modest overall differences. The work will be of particular interest to students of the evolution of development.

Reviewer #1 (Public Review):

Anonymous

Summary:

Overall, this study provides a meticulous comparison of developmental transcriptomes between two sub-species of the annelid Streblospio benedicti. Different lineages of S. benedicti maintain one of two genetically programmed alternative life histories, the ancestral planktotrophic or derived lecithotrophic forms of development. This contrast is also seen at the inter-species level in many marine invertebrate taxa, such as echinoderms and molluscs. The authors report relatively (surprisingly?) modest differences in transcriptomes overall, but also find some genes whose expression is essentially morph-specific (which they term "exclusive").

Strengths:

The study is based on dense and appropriately replicated sampling of early development. The tight clustering of each stage/morph combination in PCA space suggests the specimens were accurately categorized. The similar overall trajectories of the two morphs was surprising to me for two stage: (1) the earliest stage (16-cell), at which we might expect maternal differences due to the several-fold difference in zygote size, and (2) the latest stage (1-week), where there appears to be the most obvious morphological difference. This is why we need to do experiments!

The examination of F1 hybrids was another major strength of the study. It also produced one of the most surprising results: though intermediate in phenotype, F1 embryos have the most distinct transcriptomes, and reveal a range of fixed, compensatory differences in the parental lines. Further, the F1 lack expression of nearly all transcripts identified as morph-specific in the pure parental lines. Since the F1 larvae present intermediate traits combining the features of both morphs, this implies that morph-specific transcripts are not actually necessary for morph-specific traits. This is interesting and somewhat counter to what one might naively expect.

Weaknesses:

Overall I really enjoyed this paper, and in its revised form it addresses some concerns I had in the first version. I still see a few places where it can be tightened and made more insightful.

Reviewer #2 (Public Review):

Anonymous

The manuscript by Harry and Zakas determined the extent to which gene expression differences contribute to developmental divergence by using a model that has two distinct developmental morphs within a single species. Although the authors did collect a valuable dataset and trends in differential expression between the two morphs of S. benedicti were presented, we found limitations about the methods, system, and resources that the authors should address.

We have two major points:

(1) Background information about the biological system needs to be clarified in the introduction of this manuscript. The authors stated that F1 offspring can have intermediate larval traits compared to the parents (Line 81). However, the authors collected F1 offspring at the same time as the mother in the cross. If offspring have intermediate larval traits, their developmental timeline might be different than both parents and necessitate the collection of offspring at different times to obtain the same stages as the parents. Could the authors (1) explain why they collected offspring at the same time as parents given that other literature and Line 81 state these F1 offspring develop at intermediate rates, and (2) add the F1 offspring to Figure 1 to show morphological and timeline differences in development?

Additionally, the authors state (Lines 83-85) that they detail the full-time course of embryogenesis for both the parents and the F1 crosses. However, we do not see where the authors have reported the full-time course for embryogenesis of the F1 offspring. Providing this information would shape the remaining results of the manuscript.

(2) We have several concerns about the S. benedicti genome and steps regarding the read mapping for RNA-seq:

The S. benedicti genome used (Zakas et al. 2022) was generated using the PP morph. The largest scaffolds of this assembly correspond to linkage groups, showing the quality of this genome. The authors should point out in the Methods and/or Results sections that the quality of this genome means that PP-specific gene expression can be quantified well. However, the challenges and limitations of mapping LL-specific expression data to the PP genome should be discussed.

It is possible that the authors did not find exclusive gene expression in the LL morph because they require at least one gene to be turned on in one morph as part of the data-cleaning criteria. Because the authors are comparing all genes to the PP morph, they could be missing true exclusive genes responsible for the biological differences between the two morphs. Did they make the decision to only count genes expressed in one stage of the other morph because the gene models and mapping quality led to too much noise?

The authors state that the mapping rates between the two morphs are comparable (Supplementary Figure 1). However, there is a lot of variation in mapping the LL individuals (~20% to 43%) compared to the PP individuals. What is the level of differentiation within the two morphs in the species (pi and theta)? The statistical tests for this comparison should be added and the associated p-value should be reported. The statistical test used to compare mapping rates between the two morphs may be inappropriate. The authors used Salmon for their RNA alignment and differential expression analysis, but it is possible that a different method would be more appropriate. For example, Salmon has some limitations as compared to Kallisto as others have noted. The chosen statistical test should be explained, as well as how RNA-seq data are processed and interpreted.

What about the read mapping rate and details for the F1 LP and PL individuals? How did the offspring map to the P genome? These details should be included in Supplementary Figure 1. Could the authors also provide information about the number of genes expressed at each stage in the F1 LP and PL samples in S Figure 2? How many genes went into the PCA? Many of these details are necessary to evaluate the F1 RNA-seq analyses.

Generally, the authors need to report the statistics used in data processing more thoroughly. The authors need to report the statistics used to (1) process and evaluate the RNA-seq data and (2) determine the significance between the two morphs (Supplementary Figures 1 and 2).

eLife. 2024 Aug 23;13:RP93062. doi: 10.7554/eLife.93062.3.sa3

Author response

Nathan D Harry 1, Christina Zakas 2

The following is the authors’ response to the original reviews.

Reviewer #1 (Public Review):

Summary:

Overall, this study provides a meticulous comparison of developmental transcriptomes between two sub-species of the annelid Streblospio benedicti. Different lineages of S. benedicti maintain one of two genetically programmed alternative life histories, the ancestral planktotrophic or derived lecithotrophic forms of development. This contrast is also seen at the inter-species level in many marine invertebrate taxa, such as echinoderms and molluscs. The authors report relatively (surprisingly?) modest differences in transcriptomes overall but also find some genes whose expression is essentially morph-specific (which they term "exclusive").

Strengths:

The study is based on a dense and appropriately replicated sampling of early development. The tight clustering of each stage/morph combination in PCA space suggests the specimens were accurately categorized. The similar overall trajectories of the two morphs were surprising to me for two stages: (1) the earliest stage (16-cell), at which we might expect maternal differences due to the several-fold difference in zygote size, and (2) the latest stage (1-week), where there appears to be the most obvious morphological difference. This is why we need to do experiments!

The examination of F1 hybrids was another major strength of the study. It also produced one of the most surprising results: though intermediate in phenotype, F1 embryos have the most distinct transcriptomes, and reveal a range of fixed, compensatory differences in the parental lines.

Weaknesses:

Overall I really enjoyed this paper, but I see a few places where it can be tightened and made more insightful. These relate to better defining the basis for "exclusive" expression (regulation or gene presence/absence?), providing more examples of how specific genes related to trophic mode behave, and placing the study in the context of similar work in other phyla.

As suggested, we changed the term “exclusive expression” to “morph-specific” expression throughout the paper to clarify which genes are only expressed in one morph. We also added references to similar work in other phyla such as recent work on lecithotrophic and planktotrophic development in species of Heliocidaris sea urchins in the 4th paragraph of the discussion. We added additional data about the F1 hybrids in “Gene expression of Genetic Crosses” section and the new Figure 8B. We find that gene expression in F1 offspring is divided between matching the maternal and paternal gene expression patterns, with slightly more genes matching paternal expression.

Reviewer #2 (Public Review):

The manuscript by Harry and Zakas determined the extent to which gene expression differences contribute to developmental divergence by using a model that has two distinct developmental morphs within a single species. Although the authors did collect a valuable dataset and trends in differential expression between the two morphs of S. benedicti were presented, we found limitations about the methods, system, and resources that the authors should address.

We have two major points:

(1) Background information about the biological system needs to be clarified in the introduction of this manuscript. The authors stated that F1 offspring can have intermediate larval traits compared to the parents (Line 81). However, the authors collected F1 offspring at the same time as the mother in the cross. If offspring have intermediate larval traits, their developmental timeline might be different than both parents and necessitate the collection of offspring at different times to obtain the same stages as the parents. Could the authors (1) explain why they collected offspring at the same time as parents given that other literature and Line 81 state these F1 offspring develop at intermediate rates, and (2) add the F1 offspring to Figure 1 to show morphological and timeline differences in development?

Additionally, the authors state (Lines 83-85) that they detail the full-time course of embryogenesis for both the parents and the F1 crosses. However, we do not see where the authors have reported the full-time course for embryogenesis of the F1 offspring. Providing this information would shape the remaining results of the manuscript.

(2) We have several concerns about the S. benedicti genome and steps regarding the read mapping for RNA-seq:

The S. benedicti genome used (Zakas et al. 2022) was generated using the PP morph. The largest scaffolds of this assembly correspond to linkage groups, showing the quality of this genome. The authors should point out in the Methods and/or Results sections that the quality of this genome means that PP-specific gene expression can be quantified well. However, the challenges and limitations of mapping LL-specific expression data to the PP genome should be discussed.

It is possible that the authors did not find exclusive gene expression in the LL morph because they require at least one gene to be turned on in one morph as part of the data-cleaning criteria. Because the authors are comparing all genes to the PP morph, they could be missing true exclusive genes responsible for the biological differences between the two morphs. Did they make the decision to only count genes expressed in one stage of the other morph because the gene models and mapping quality led to too much noise?

The authors state that the mapping rates between the two morphs are comparable (Supplementary Figure 1). However, there is a lot of variation in mapping the LL individuals (~20% to 43%) compared to the PP individuals. What is the level of differentiation within the two morphs in the species (pi and theta)? The statistical tests for this comparison should be added and the associated p-value should be reported. The statistical test used to compare mapping rates between the two morphs may be inappropriate. The authors used Salmon for their RNA alignment and differential expression analysis, but it is possible that a different method would be more appropriate. For example, Salmon has some limitations as compared to Kallisto as others have noted. The chosen statistical test should be explained, as well as how RNA-seq data are processed and interpreted.

What about the read mapping rate and details for the F1 LP and PL individuals? How did the offspring map to the P genome? These details should be included in Supplementary Figure 1. Could the authors also provide information about the number of genes expressed at each stage in the F1 LP and PL samples in S Figure 2? How many genes went into the PCA? Many of these details are necessary to evaluate the F1 RNA-seq analyses.

Generally, the authors need to report the statistics used in data processing more thoroughly. The authors need to report the statistics used to (1) process and evaluate the RNA-seq data and (2) determine the significance between the two morphs (Supplementary Figures 1 and 2).

(1) We clarified in the methods that F1 embryos are collected at the same stage (not absolute time) as the parental types. So the “16-cell” stage is comparable across planktotrophic, lecithotrophic and F1 offspring regardless of absolute time taken to reach that stage (which differs by ~3 hours- Figure 1).

Figure 2A details every time point collected for all crosses. As mentioned in the methods, we were unable to collect two timepoints for one set of crosses (LP) due to limited tissue. However, we still cover the full development time from “16 cell” through “swimming larvae” stages, which is the full larval development time.

(2) We appreciate the reviewer's concerns regarding the mapping to the reference genome. The S. benedicti genome is a largely complete and contiguous chromosome-length genome which we have now highlighted in the manuscript. However, the reference is only for the planktotrophic morph. So it is certainly possible that there could be mapping bias for lecithotrophic reads or F1 reads, as we point out in the discussion. While some bias is certainly possible, it is unlikely to be driving major differences in the results. We performed several tests to demonstrate this:

(1) We conducted two-sided T-tests of the mapping rates between all sample groups in our dataset (PP, LL, PL, LP) to determine if there were significant differences in mapping rates among the populations. No significant differences were found. The specific results of these statistical tests are included in the updated manuscript in supplementary figure 1 and are as follows:

Author response table 1.

Comparison (Read Mapping Rates) p-value of two sided T-test
PP versus LL 0.1118
PP versus PL 0.453
PL versus LP 0.418
LP versus LL 0.908

(2) In response to the comment about sequence level divergence affecting mapping rate, we estimated pi (nucleotide diversity within a population) and dxy (genomic divergence between two populations) based on the sampled transcriptomic data of our Planktotrophic and Lecithotrophic populations. We used PIXY (Korunes, K.L. and Samuk, K., 2021) with its standard settings to estimate these values, with variant call files in bcf format produced with bcftools - one for all planktotrophic samples and one for all lecithotrophic samples in our dataset. We found that across regions of the transcriptome, the difference in pi between Planktotrophs and Lecithotrophs was between 0.11% and 4.2%. Genomic divergence across the transcriptome is also relatively minor: estimates of dxy ranged from 0.0049 to 0.0076. Given that these estimates show relatively modest differences in nucleotide diversity and overall sequence divergence, we maintain that it is unlikely that they significantly impact the results described in this study. From what we have seen in the literature, these values are not outside of other population studies that are mapping to a species reference derived from one population.

We added the mapping rates of all samples in the Supplement (SFig. 1) as requested. We added the number of genes expressed at each stage in the Supplement (SFig. 2) as requested. We have also provided further details and figures (Fig 8B) on read mapping rates and statistics used in data processing, including those for F1 RNA-seq data.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Harry ND, Zakas C. 2024. Streblospio benedicti RNAseq developmental time-series. NCBI BioProject. PRJNA1008044

    Supplementary Materials

    Supplementary file 1. Gene ontology (GO) enrichment results spreadsheet.
    elife-93062-supp1.xlsx (549.4KB, xlsx)
    Supplementary file 2. Mode of regulatory change classifications.
    elife-93062-supp2.docx (110.5KB, docx)
    MDAR checklist

    Data Availability Statement

    All reads are submitted to NCBI under BioProject: PRJNA1008044. All analyses and datasets are available at https://github.com/NathanDHarry/Harry-Zakas-TC, (copy archived at Harry, 2024) a GitHub repository which contains estimated transcript expression data generated by Salmon 1.10 and an R script file containing annotated code used to conduct these analyses.

    The following dataset was generated:

    Harry ND, Zakas C. 2024. Streblospio benedicti RNAseq developmental time-series. NCBI BioProject. PRJNA1008044


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES