Abstract
Single-cell RNA-seq has been established as a reliable and accessible technique enabling new types of analyses, such as identifying cell types and studying spatial and temporal gene expression variation and change at single-cell resolution. Recently, single-cell RNA-seq has been applied to developing embryos, which offers great potential for finding and characterising genes controlling the course of development along with their expression patterns. In this study, we applied single-cell RNA-seq to the 16-cell stage of the Ciona embryo, a marine chordate and performed a computational search for cell-specific gene expression patterns. We recovered many known expression patterns from our single-cell RNA-seq data and despite extensive previous screens, we succeeded in finding new cell-specific patterns, which we validated by in situ and single-cell qPCR.
Subject terms: Bioinformatics, Genome-wide analysis of gene expression, Embryogenesis
Introduction
Cell types can be characterised at high resolution with single-cell RNA-seq. Apparently homogenous groups of cells can be clustered to identify novel and rare subtypes1–5 and cells undergoing differentiation at different rates can be ordered and analysed6–9. This offers great potential for studying developing embryos with their high diversity of cell types10–20. Making sense of this diversity is a goal of developmental biology and an important first step is identifying and characterising the relatively few genes controlling the course of development and the subsets of cells they are expressed in, or in other words, their gene expression patterns.
We applied single-cell RNA-seq (scRNA-seq) to the 16-cell stage of Ciona, a simple chordate, and looked for gene expression patterns using a novel computational approach. Zygotic expression begins around the 8-cell stage21,22 and many gene expression patterns, both maternal and zygotic, are known from comprehensive in situ screens in Ciona23 (Fig. 1b, Supplementary Table 1). We identified many of these from our scRNA-seq data. In addition, we found unknown gene expression patterns, which we validated by in situ and single-cell qPCR.
Results
Ciona development follows an invariant lineage allowing cells to be unambiguously identified at the 16-cell stage24–26, whether before sequencing as in our work (Fig. 1a) or afterwards based on expression, as in recent work of the same stage27. We collected cells from four 16-cell stage embryos, each from different individuals fertilised on different days. The left and right side of the embryo at this stage are thought to be symmetrical, but to avoid any potential bilateral variation in our small sample, we collected all eight cell types from the right half of each embryo. We multiplexed the cells of each embryo (or batch) and sequenced them on Illumina MiSeq, as well as on three lanes across two Illumina HiSeq 2500 runs, leading to scRNA-seq measurements for 32 cells (Supplementary Table 2).
Our cells were sequenced with relatively high coverage (around 20 million reads per cell on Illumina HiSeq and 4.5 million per cell on MiSeq). Although we had only four replicates per cell type, we produced a reliable dataset.
Assessment of technical variability and reproducibility
Our results show limited technical variation within each batch: the expression levels in different cell types from the same embryo are well correlated for embryos 2, 3 and 4. They are, in fact, more similar to each other than the same cell types are across different individuals (Fig. 2a–b;→, Supplementary Fig. 1). Although we cannot separate out the sources of cross-embryo variation, this result is consistent with a previous report showing that maternal mRNA levels vary significantly between unfertilized eggs from different individuals28. It is also worth noting that very little of the variation between embryos is from the sequencing run. This is consistent with previous results showing high correlation between expression measurements from tens of millions of reads per cell and those from lower coverage of a million or fewer reads29,30.
This embryo batch effect is further demonstrated by a Principal Components Analysis (Fig. 2b), which shows a similar result with the cell types of embryos 2, 3 and 4 being close to each on the first two components (which explain 56% of the variance) and the cell types of embryo 1 being more spread out. Embryo 1 belonged to a healthy developmental batch (Supplementary Table 2) and we suggest that its outlier status is due to technical variability introduced during library preparation, specifically cDNA synthesis and amplification.
The close clustering of cells from the same embryo, as well as their high correlation, suggests that our technical variability is low, leading to reproducibility within each batch (or embryo). A confirmation of the reproducibility of our results is the tight distribution of genes detected across samples within embryos (Supplementary Fig. 1). Genes were considered detected when the measured count was greater than zero. These results show that slightly more genes were detected on HiSeq than MiSeq, but that the median difference for each embryo is less than 10%. This can be compared with a previous result showing a reduction of genes detected of around 39% when lowering sequence coverage to less than a million reads per cell29. As before, embryo 1 showed more variability across samples than the other embryos.
Further, our data can be compared to previously published data for the 16-cell stage that was generated using gene expression microarrays from many pooled single cells28. We found good agreement with our scRNA-seq expression patterns for the key genes shown in their paper (Fig. 2c).
Normalisation of counts
After producing a dataset of counts, we normalised our data. Since we were interested in gene expression differences between different cell types, we did not normalize across genes (such as by GC content or transcript length), but only for sequencing depth by dividing by the total number of reads per sample. More complex normalisation steps could be considered31, but this simple approach suffices for our data. It gives a natural measure of expression for each gene, namely the proportion it contributes to the total, which we assume is independent of the total number of reads.
We then made use of a suitable transformation of proportions, the arcsine of the square root (referred to here as the φ transformation), namely:
where kij is the count for the ith gene and jth sample and Nj is the total number of counts for the jth sample. The difference between φi of two samples can be interpreted as an effect size index for proportions, namely Cohen’s h32.
In practice, the arcsine function in the φ transformation is largely redundant because most genes are expressed at low proportions. At these low values, the arcsine transformation is close to identity, meaning that a square root transformation of proportions performs equivalently in many cases.
Finding putative gene expression patterns
We then took a simple and effective approach to find gene expression patterns. Instead of grouping cells into a single set of clusters, we clustered the gene expression pattern of each gene separately (Fig. 3). For each gene, we grouped the different cell types into two classes of ON and OFF expression. We took advantage of our known cell types and calculated the Euclidean distance between vectors of replicate measurements for each cell and performed single-linkage hierarchical clustering of these (bottom-up, agglomerative clustering of the cells). The resulting two top-level clusters determined the ON/OFF pattern of each cell type for each gene (Fig. 3).
We then ranked these results, by taking an approach that does not require parametric estimation of variation or dispersion. We calculated our cluster reliability score as the difference between the first quartile of the ON cluster and the third quartile of the OFF cluster, which we call the Transquartile Range (TQR). The TQR is larger when the difference in cluster means is larger, but it penalizes higher variation for a given difference in means.
Ranking cell-specific gene expression patterns
We applied this approach and produced a list of ranked genes and their expression patterns. We examined our results for 77 genes (Supplementary Table 1) with known in situ patterns28,33–35. Our top 35 results matched the known in situ patterns (Supplementary Fig. 2a), except for KH.L152.12, which is validated below as a new pattern by in situ and qPCR (see below). However, the lower ranked results did not correspond well (Supplementary Fig. 2b).
There were a few reasons for this. For example, clustering did not produce a correct pattern for Lefty (Supplementary Fig. 2b). The cell types, A5.2 and B5.1, are normally considered part of the Lefty pattern, but they have intermediate expression in our scRNA-seq results and, in this instance, the clustering algorithm places them in the OFF cluster. For a few other genes, e.g. DPOZ (KH.C12.589) and Dlx.b, the clustering is correct, but the TQR is low. For a few other genes, no reads were mapped, e.g. Sox7/17/18, Fringe 2 and KH.C13.22, but in most cases where our scRNA-seq data does not agree with published in situ patterns, our expression measurements were low or relatively uniform across the eight cells and thus the algorithm functions correctly in attributing lower score to these results. Many of these genes are expressed maternally in the Ciona embryo and are de-adenylated during the maternal-to-zygotic transition36,37; thus they might not be easily detected by our RNA-seq protocol, which amplifies from the poly(A) tail of mRNA. In contrast, in situ hybridisation can detect localised maternal signal in the cytoplasm, which could explain the discrepancy. It is also possible that some in situ results are false positives.
Validating cell-specific gene expression patterns
Using these known in situ results, we assessed that the top 40 ranked results from all genes were likely to be reliable and focused on these for further validation (Fig. 4). We found 12 distinct patterns in the top 40, which included all known patterns as well as three potentially new patterns (highlighted in orange in Fig. 4). Ten patterns are currently known at this stage in Ciona28,33,38–42, and although we matched only nine of these directly (Fig. 1b), the tenth pattern has a single known case, Tfap2-r.b (AP-2-like2), which we did pick up, but without expression in A5.2. This agrees with previous observations that expression is not consistent in this cell across embryos28,33. Further, it is in agreement with the average over many embryos as measured by microarray28.
We validated one of the potentially new patterns, namely the pattern for KH.L152.12, by in situ in biological duplicates and single-cell qPCR in triplicate (Fig. 5e). This observation does not necessarily negate previously published ones for KH.L152.12, but rather highlights the value of our algorithm in identifying new patterns from scRNA-seq data comprising just four embryos. We could not validate the second pattern for KH.C1.933 that has relatively ubiquitous expression, although with apparently reduced expression in b5.4.
In addition to recovering all known patterns and validating one novel pattern, we found at least 28 genes with known in situ patterns. Of the 12 patterns we found, the pattern with expression in the B5.2 cell only was the most represented. This is also the most frequent pattern in known in situ patterns, i.e. postplasmic/PEM RNAs35. The majority of our results for B5.2 are confirmed by previous in situ datasets, but we identified new B5.2-specific genes, such as KH.C13.98 and KH.C12.212, confirming their expression by in situ and single-cell qPCR (Fig. 5a).
We also validated other classes of uncharacterized genes, namely KH.S1497.1, which expresses specifically in the animal hemisphere, and KH.C11.529 on the anterior side (Fig. 5c–d). These results are particularly striking, since it was expected that no further zygotic gene expression patterns would be found at this stage in Ciona23.
We also looked more widely in the top 60 (Fig. 4, Supplementary Fig. 3) and validated additional genes, KH.C8.450, KH.C9.289 and KH.C4.260, by single-cell qPCR and in situ hybridisation (Fig. 5a-b). The first is another example of B5.2 expression, whereas the last two genes are expressed in all cells except B5.2, a pattern known previously from Hes.a43. While developing our approach, we also applied the TQR to the microarray data of this stage28 and looked for reliable B5.2 cell-specific expression using both datasets. As a result, we also found and validated KH.L60.2 and KH.C14.501 (ranked 88 and 2403 in our data; see Supplementary Table 7).
In conclusion, we have recovered many known patterns, as well as patterns and genes that had not been detected previously despite extensive in situ screens. These results open up opportunities for further research into developmental patterning in Ciona. In addition, we have demonstrated that single-cell RNA-seq is a viable alternative to extensive in situ screens, offering a promising approach for finding genes with cell-specific expression in less well-studied organisms.
Methods
Study design
We isolated cells from five 16-cell stage Ciona embryos, each on a different day (Supplementary Table 2). Early ascidian embryos are thought to be bilaterally symmetrical, but to avoid any potential bilateral variation in our small sample, we collected eight cells from the right side of each embryo. The cells were collected individually in batches of eight cells from the same embryo on the same day, with sequencing libraries prepared in parallel, barcoded and then sequenced together. This means that biological variation between embryos and technical variation between batches cannot be distinguished. The advantage of this design is that it minimizes technical variation between cell types of the same embryo and controls for confounding technical and biological variation between embryos. Averaging across the cell types of different batches reduces this unwanted variation, maintaining cell-specific variation. Our results show that cells from the same embryo are more similar to each other than the same cell types are across individuals, with a similar number of genes detected per cell type (Fig. 2a-b, Supplementary Fig. 1).
Preparation of Ciona embryos
Ciona intestinalis type A, recently designated Ciona robusta44,45, adults were obtained from Maizuru Fisheries Research Station (Kyoto University) and Misaki Marine Biological station (The University of Tokyo) under the National Bio-Resource Project for Ciona. They were maintained in an aquarium in our laboratory at Okinawa Institute of Science and Technology Graduate University under constant light (Calcitrans, Nisshin Marinetech Co., Ltd.) for three days apart from a few hours of darkness a day with feeding to induce spawning of the old eggs. After this, the Ciona were maintained under constant light to induce oocyte maturation. Eggs and sperm were obtained surgically from the gonoducts. Embryos were dechorionated after insemination using a solution of 0.07% actinase and 1.3% sodium thioglycolate. Eggs were reared to reach the 16-cell stage in Millipore-filtered seawater (MFSW) at about 18 °C. Embryos from each insemination batch were kept to check the ratio that developed into morphologically normal tailbud. We only used embryos from batches where more than 70% developed normally to tailbud (10 hours post fertilization at 18 °C) (see Supplementary Table 2 for embryo batch information).
Naming of cells
In Ciona, cells are named using the nomenclature of Conklin24: the animal side is prefixed with a lowercase letter (a or b) and the vegetal with an uppercase letter (A or B); the anterior with A or a and the posterior with B or b. The initial letter is followed by a number that indicates the embryo stage since fertilization, with individual cells numbered according to their lineage. At the 16-cell stage, the animal domain corresponds to a5.3, a5.4, b5.3 and b5.4, the vegetal domain to A5.1, A5.2, B5.1 and B5.2, and postplasmic RNAs are localized to B5.2.
Isolation of single cells at the 16-cell stage
At a defined point in development of the 16-cell embryo i.e., at the stage immediately after compaction of the embryo (2.5~2.6 hours post fertilization), the embryo was transferred to 4 °C to slow its development. Each blastomere was isolated with a fine glass needle in a mannitol solution (0.77 M mannitol: MFSW, 9:1) under a stereo microscope at 4 °C regulated by a thermo plate (Tokai Hit Co., Ltd.) and its identity noted. Isolated blastomeres were picked up and transferred immediately with a mouth pipet into a lysis buffer46 for reverse transcription.
Library preparation
We followed the single-cell library preparation method of Tang et al.46,47 with some modification. We added ERCC spike-in RNA (Thermo Fisher scientific, 4456740, 1:80000) to each lysis buffer and applied 14 and then 9 cycles of PCR amplification after second strand synthesis. Amplified cDNA was purified with MinElute PCR Purification kit (28006, QIAGEN) and QIAquick PCR Purification Kit (28106, QIAGEN) after each PCR reaction respectively and its concentration measured with Qubit® 2.0 Fluorometer (Q32866, Life Technologies) to have more than 150 ng total yield of cDNA. The quality of the amplified cDNA and distribution of DNA fragment size were confirmed by Agilent 2100 Bioanalyzer (Agilent Technologies) with High Sensitivity DNA Kit (5067–4626, Agilent) to consist mainly of 1.0–1.5 kb fragments.
Amplified cDNAs were sheared using sonication Covaris S2 System to produce DNA of 300 bp on average. The settings were as follows: Duty cycle: 20%, Intensity: 5, Cycles per burst: 200, Power mode - Frequency sweeping, Treatment time: 90 seconds, Temperature: 12 °C.
NEB Next® ChIP-Seq Library Prep Master Mix Set for Illumina® (E6240, NEB) was applied to sheared cDNA for preparation of the library for the Illumina platform. NEBNext® Multiplex Oligos for Illumina (E7335, E7500, NEB Next Multiplex Oligos for Illumina, NEB) were combined to introduce an index and adaptor to the double-stranded DNA. After extraction of the 300 bp fraction of adaptor-ligated DNA by E-Gel Size Select 2% Agarose (G661002, Invitrogen), DNA was amplified with individual index primers using PCR with 19 cycles.
The amplified DNA fragment composition was purified with Agencourt AMPure XP twice (A63881, Beckman) and again checked by Qubit (> 60 ng of cDNA in total yield) and by Bioanalyzer to ensure that the fragment size was sharply distributed around 300 bp (on average, about 320 bp with a standard deviation of 40). The concentration of fragments with appropriate index adapters was quantified by KAPA Library Quantification Kits (KAPA Library Quantification Kits, Illumina GA/Universal, KK4825, Genetics) to ensure that the final libraries had adapters for both ends and their concentration was at least 20 pM.
Data generation and quality checking
Libraries were sequenced on Illumina’s (San Diego, CA) MiSeq benchtop sequencer and Illumina HiSeq2500. Libraries were prepared with different index primers and sequenced on MiSeq using paired 150 nt reads (No. MS-102–2002, MiSeq Reagent Kit v2) with eight multiplexed samples per run with the standard Illumina protocols. The same libraries were sequenced on an Illumina HiSeq 2500 with 150 bp paired end reads (No. PE-402-4001 and FC-402-4001, TruSeq Rapid Cluster - Paired-End and SBS Kits) with 16 multiplexed samples per lane following standard Illumina protocols. Our results from using HiSeq and MiSeq were similar (Supplementary Figs. 1, 3 and 4).
The resulting reads were aligned using Bowtie48 version 2.2.6 to the Ciona KH genome assembly49,50, downloaded from Ghost (http://ghost.zool.kyoto-u.ac.jp/download_kh.html). Reads were mapped using local alignment (–local), with other settings at their default. We did not trim or filter reads, but instead made use of local alignment to find the optimal match. This had the additional benefit that we did not need to split up reads to handle transcripts spanning more than one intron, as is done, for example, in TopHat51. Gene counts were calculated from the resulting alignment files using htseq-count52 with the non-stranded option and mode “intersection-nonempty” against the KH gene models (version 2013) downloaded from Ghost.
We assessed our samples for mapping quality. We excluded one embryo from subsequent analysis since it had oligo-dT primer sequence in more than 50% of its read pairs; the remaining four embryos had less than 1% of read pairs affected. All remaining samples mapped well to the genome (Supplementary Table 3) and a uniform number of genes were detected (about 60%), although embryo 1 had noticeably fewer detected genes for some of its cells.
Pattern discovery
Hierarchical clustering to determine candidate patterns was performed with ClusteringComponents in Mathematica 10.4 with the Agglomerate method and Euclidean distance function. This is equivalent to hclust in R with the single linkage method. The Transquartile Range (TQR) was calculated as the difference between the first quartile of the ON cluster and the third quartile of the OFF cluster. The quantile method for the TQR used linear interpolation equivalent to type 5 in the R quantile function (the hydrologist method). The resulting patterns and TQR score are listed in Supplementary Table 7.
Single-cell qPCR analysis
cDNA was reverse transcribed from all cells of one embryo per gene replicate using the same protocol we used for single-cell RNA-seq46,47. Quantitative PCR was performed using a StepOnePlus PCR machine (Applied Biosystems) with the SYBR green method (No. RR820B, Takara). Each gene was measured with either two or three replicates, except KH.L152.12, which had four. We did not get reliable measurements for KH.S1497.1. The qPCR measures for the cell types of each embryo were scaled between 0 and 1 and then averaged for each cell type across replicates. If there was insufficient target mRNA, it was first amplified using primers covering a wider region of the target gene than those used for single-cell qPCR. Amplification of a specific product in each reaction was confirmed by determining a dissociation curve and comparing with the relevant standard plasmid to estimate the copy number in each cell type. The primers for single-cell qPCR analysis, the IDs for the cDNA clones and the resulting data are listed in Supplementary Tables 4–6.
In situ hybridisation
Whole-mount in situ hybridisation was carried out as previously described with minor modification53. Dig-labeled antisense RNA probes were synthesized in vitro from cDNAs from the Ciona cDNA project54. mRNA expression was visualized using the NBT/BCIP system (Roche, No. 11681451001) and detected on a Zeiss Axio Imager Z1 microscope using Differential Interference Contrast (DIC). The images were acquired with Axiovision SE64 release 4.9.1. Contrast and brightness were adjusted for some images using Adobe Photoshop. The IDs for the cDNA clones are shown in Supplementary Table 5.
Microarray processing
Previously published microarray data28 was processed with the limma R package55. Background was corrected using normexp and arrays were normalised with the quantile method.
Gene models and names
Gene names for the KH 2013 gene models were downloaded from Ghost (http://ghost.zool.kyoto-u.ac.jp/TF_KH.html and http://ghost.zool.kyoto-u.ac.jp/ST_KH.html) and supplemented with names from Prodon et al.35.
Supplementary information
Acknowledgements
We thank the staff of the Maizuru Fisheries Research Station of Kyoto University and Misaki Marine Biological Station of the University of Tokyo for collecting and cultivating Ciona under the National BioResource Project (NBRP) of MEXT, Japan, and RIKEN BRC for providing Ciona EST clones through the NBRP. We thank Vladimir Benes and Dinko Pavlinic in the Genomics Core Facility at the European Molecular Biology Laboratory (EMBL) for initial advice on the library preparation protocol and the members of the OIST DNA Sequencing Section for their support in running our samples on their Illumina MiSeq and HiSeq machines. We also thank Sylvain Guillot for his technical support, Filipe Tavares-Cadete for early feedback on the method, Kenji Kobayashi for a helpful comment on the validation experiment and Yutaka Satou advising the microarray data analysis. This work was supported by core funding from OIST to the Genomics & Regulatory Systems and Marine Genomics Units.
Author contributions
N.M.L. and N.S. established and supervised the project. All authors contributed to the study design. R.S. and T.N. optimized the experimental protocols, collected and prepared the samples and sequencing libraries. R.S. designed and performed the in situ and qPCR analysis. G.R.I. conceived and designed the normalization, gene testing and pattern discovery method and performed the bioinformatics analysis. G.R.I., R.S. and N.M.L. wrote the paper and all authors edited and approved the final manuscript.
Data availability
RNA-seq data have been deposited in the ArrayExpress database at EMBL-EBI (www.ebi.ac.uk/arrayexpress) under accession number E-MTAB-6117.
Code availability
Software is available at https://github.com/ilsley/Ciona16.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These author contributed equally: Garth R. Ilsley and Ritsuko Suyama.
Supplementary information
is available for this paper at 10.1038/s41598-020-61591-1.
References
- 1.Björklund ÅK, et al. The heterogeneity of human CD127+ innate lymphoid cells revealed by single-cell RNA sequencing. Nat. Immunol. 2016;17:451–460. doi: 10.1038/ni.3368. [DOI] [PubMed] [Google Scholar]
- 2.Grün D, et al. Single-cell messenger RNA sequencing reveals rare intestinal cell types. Nat. 2015;525:251–255. doi: 10.1038/nature14966. [DOI] [PubMed] [Google Scholar]
- 3.Jaitin DA, et al. Massively Parallel Single-Cell RNA-Seq for Marker-Free Decomposition of Tissues into Cell Types. Sci. 2014;343:776–779. doi: 10.1126/science.1247651. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Kiselev VY, et al. SC3: consensus clustering of single-cell RNA-seq data. Nat. Meth. 2017;14:483–486. doi: 10.1038/nmeth.4236. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Usoskin D, et al. Unbiased classification of sensory neuron types by large-scale single-cell RNA sequencing. Nat. Neurosci. 2015;18:145–153. doi: 10.1038/nn.3881. [DOI] [PubMed] [Google Scholar]
- 6.Mojtahedi M, et al. Cell Fate Decision as High-Dimensional Critical State Transition. PLOS Biol. 2016;14:e2000640. doi: 10.1371/journal.pbio.2000640. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Olsson A, et al. Single-cell analysis of mixed-lineage states leading to a binary cell fate choice. Nat. 2016;537:698–702. doi: 10.1038/nature19348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Richard A, et al. Single-Cell-Based Analysis Highlights a Surge in Cell-to-Cell Molecular Variability Preceding Irreversible Commitment in a Differentiation Process. PLOS Biol. 2016;14:e1002585. doi: 10.1371/journal.pbio.1002585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Trapnell C, et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat. Biotech. 2014;32:381–386. doi: 10.1038/nbt.2859. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Biase FH, Cao X, Zhong S. Cell fate inclination within 2-cell and 4-cell mouse embryos revealed by single-cell RNA sequencing. Genome Res. 2014;24:1787–1796. doi: 10.1101/gr.177725.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Cao J, et al. Comprehensive single-cell transcriptional profiling of a multicellular organism. Sci. 2017;357:661–667. doi: 10.1126/science.aam8940. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Deng Q, Ramsköld D, Reinius B, Sandberg R. Single-Cell RNA-Seq Reveals Dynamic, Random Monoallelic Gene Expression in Mammalian Cells. Sci. 2014;343:193–196. doi: 10.1126/science.1245316. [DOI] [PubMed] [Google Scholar]
- 13.Goolam M, et al. Heterogeneity in Oct4 and Sox2 Targets Biases Cell Fate in 4-Cell Mouse Embryos. Cell. 2016;165:61–74. doi: 10.1016/j.cell.2016.01.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hashimshony T, Wagner F, Sher N, Yanai I. CEL-Seq: single-cell RNA-Seq by multiplexed linear amplification. Cell Rep. 2012;2:666–673. doi: 10.1016/j.celrep.2012.08.003. [DOI] [PubMed] [Google Scholar]
- 15.Ibarra-Soria X, et al. Defining murine organogenesis at single-cell resolution reveals a role for the leukotriene pathway in regulating blood progenitor formation. Nat. Cell Biol. 2018;20:127–134. doi: 10.1038/s41556-017-0013-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Karaiskos N, et al. The Drosophila embryo at single-cell transcriptome resolution. Sci. 2017;358:194–199. doi: 10.1126/science.aan3235. [DOI] [PubMed] [Google Scholar]
- 17.Scialdone A, et al. Resolving early mesoderm diversification through single-cell expression profiling. Nat. 2016;535:289–293. doi: 10.1038/nature18633. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Tintori SC, Osborne Nishimura E, Golden P, Lieb JD, Goldstein B. A Transcriptional Lineage of the Early C. elegans Embryo. Developmental Cell. 2016;38:430–444. doi: 10.1016/j.devcel.2016.07.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Xue Z, et al. Genetic programs in human and mouse early embryos revealed by single-cell RNA sequencing. Nat. 2013;500:593–597. doi: 10.1038/nature12364. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Yan L, et al. Single-cell RNA-Seq profiling of human preimplantation embryos and embryonic stem cells. Nat. Struct. Mol. Biol. 2013;20:1131–1139. doi: 10.1038/nsmb.2660. [DOI] [PubMed] [Google Scholar]
- 21.Shirae-Kurabayashi M, Matsuda K, Nakamura A. Ci-Pem-1 localizes to the nucleus and represses somatic gene transcription in the germline of Ciona intestinalis embryos. Dev. 2011;138:2871–2881. doi: 10.1242/dev.058131. [DOI] [PubMed] [Google Scholar]
- 22.Lamy C, Rothbächer U, Caillol D, Lemaire P. Ci-FoxA-a is the earliest zygotic determinant of the ascidian anterior ectoderm and directly activates Ci-sFRP1/5. Dev. 2006;133:2835–2844. doi: 10.1242/dev.02448. [DOI] [PubMed] [Google Scholar]
- 23.Satou Y, Imai KS. Gene regulatory systems that control gene expression in the Ciona embryo. Proc. Jpn. Academy, Ser. B. 2015;91:33–51. doi: 10.2183/pjab.91.33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Conklin EG. The organization and cell-lineage of the ascidian egg. J. Acad. Nat. Sci. Phila. 1905;13:1–119. [Google Scholar]
- 25.Lemaire P. Unfolding a chordate developmental program, one cell at a time: Invariant cell lineages, short-range inductions and evolutionary plasticity in ascidians. Developmental Biol. 2009;332:48–60. doi: 10.1016/j.ydbio.2009.05.540. [DOI] [PubMed] [Google Scholar]
- 26.Nishida H. Specification of embryonic axis and mosaic development in ascidians. Dev. Dyn. 2005;233:1177–1193. doi: 10.1002/dvdy.20469. [DOI] [PubMed] [Google Scholar]
- 27.Treen N, Heist T, Wang W, Levine M. Depletion of Maternal Cyclin B3 Contributes to Zygotic Genome Activation in the Ciona Embryo. Curr. Biol. 2018;28:1150–1156.e4. doi: 10.1016/j.cub.2018.02.046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Matsuoka T, Ikeda T, Fujimaki K, Satou Y. Transcriptome dynamics in early embryos of the ascidian, Ciona intestinalis. Developmental Biol. 2013;384:375–385. doi: 10.1016/j.ydbio.2013.10.003. [DOI] [PubMed] [Google Scholar]
- 29.Pollen AA, et al. Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex. Nat. Biotech. 2014;32:1053–1058. doi: 10.1038/nbt.2967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Shalek AK, et al. Single-cell RNA-seq reveals dynamic paracrine control of cellular variation. Nat. 2014;510:363–369. doi: 10.1038/nature13437. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Vallejos CA, Risso D, Scialdone A, Dudoit S, Marioni JC. Normalizing single-cell RNA sequencing data: challenges and opportunities. Nat. Methods. 2017;14:565–571. doi: 10.1038/nmeth.4292. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Cohen, J. Statistical Power Analysis for the Behavioral Sciences. (Routledge, 1988).
- 33.Imai KS, Hino K, Yagi K, Satoh N, Satou Y. Gene expression profiles of transcription factors and signaling molecules in the ascidian embryo: towards a comprehensive understanding of gene networks. Dev. 2004;131:4047–4058. doi: 10.1242/dev.01270. [DOI] [PubMed] [Google Scholar]
- 34.Miwata K, et al. Systematic analysis of embryonic expression profiles of zinc finger genes in Ciona intestinalis. Developmental Biol. 2006;292:546–554. doi: 10.1016/j.ydbio.2006.01.024. [DOI] [PubMed] [Google Scholar]
- 35.Prodon F, Yamada L, Shirae-Kurabayashi M, Nakamura Y, Sasakura Y. Postplasmic/PEM RNAs: A class of localized maternal mRNAs with multiple roles in cell polarity and development in ascidian embryos. Dev. Dyn. 2007;236:1698–1715. doi: 10.1002/dvdy.21109. [DOI] [PubMed] [Google Scholar]
- 36.Tadros W, Lipshitz HD. The maternal-to-zygotic transition: a play in two acts. Dev. 2009;136:3033–3042. doi: 10.1242/dev.033183. [DOI] [PubMed] [Google Scholar]
- 37.Li L, Zheng P, Dean J. Maternal control of early mouse development. Dev. 2010;137:859–870. doi: 10.1242/dev.039487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Bertrand V, Hudson C, Caillol D, Popovici C, Lemaire P. Neural Tissue in Ascidian Embryos Is Induced by FGF9/16/20, Acting via a Combination of Maternal GATA and Ets Transcription Factors. Cell. 2003;115:615–627. doi: 10.1016/S0092-8674(03)00928-0. [DOI] [PubMed] [Google Scholar]
- 39.Hamaguchi M, Fujie M, Noda T, Satoh N. Microarray analysis of zygotic expression of transcription factor genes and cell signaling molecule genes in early Ciona intestinalis embryos. Development, Growth Differ. 2007;49:27–37. doi: 10.1111/j.1440-169X.2007.00902.x. [DOI] [PubMed] [Google Scholar]
- 40.Hudson C, Yasuo H. Patterning across the ascidian neural plate by lateral Nodal signalling sources. Dev. 2005;132:1199–1210. doi: 10.1242/dev.01688. [DOI] [PubMed] [Google Scholar]
- 41.Imai KS, Levine M, Satoh N, Satou Y. Regulatory Blueprint for a Chordate Embryo. Sci. 2006;312:1183–1187. doi: 10.1126/science.1123404. [DOI] [PubMed] [Google Scholar]
- 42.Shi W, Levine M. Ephrin signaling establishes asymmetric cell fates in an endomesoderm lineage of the Ciona embryo. Dev. 2008;135:931–940. doi: 10.1242/dev.011940. [DOI] [PubMed] [Google Scholar]
- 43.Satou Y, Kawashima T, Shoguchi E, Nakayama A, Satoh N. An Integrated Database of the Ascidian, Ciona intestinalis: Towards Functional Genomics. Zool. Sci. 2005;22:837–843. doi: 10.2108/zsj.22.837. [DOI] [PubMed] [Google Scholar]
- 44.Hoshino Z, Tokioka T. An unusually robust Ciona from the northeastern coast of Honsyu Island, Japan. Publ. Seto Mar. Biol. Lab. 1967;15:275–290. doi: 10.5134/175476. [DOI] [Google Scholar]
- 45.Pennati R, et al. Morphological Differences between Larvae of the Ciona intestinalis Species Complex: Hints for a Valid Taxonomic Definition of Distinct Species. PLOS ONE. 2015;10:e0122879. doi: 10.1371/journal.pone.0122879. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Tang F, et al. RNA-Seq analysis to capture the transcriptome landscape of a single cell. Nat. Protoc. 2010;5:516–535. doi: 10.1038/nprot.2009.236. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Tang F, et al. mRNA-Seq whole-transcriptome analysis of a single cell. Nat. Meth. 2009;6:377–382. doi: 10.1038/nmeth.1315. [DOI] [PubMed] [Google Scholar]
- 48.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat. Meth. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Dehal P, et al. The Draft Genome of Ciona intestinalis: Insights into Chordate and Vertebrate Origins. Sci. 2002;298:2157–2167. doi: 10.1126/science.1080049. [DOI] [PubMed] [Google Scholar]
- 50.Satou Y, et al. Improved genome assembly and evidence-based global gene model set for the chordate Ciona intestinalis: new insight into intron and operon populations. Genome Biol. 2008;9:R152. doi: 10.1186/gb-2008-9-10-r152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Kim D, et al. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;14:R36. doi: 10.1186/gb-2013-14-4-r36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Anders S, Pyl PT, Huber W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinforma. 2015;31:166–169. doi: 10.1093/bioinformatics/btu638. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Wada S, Katsuyama Y, Yasugi S, Saiga H. Spatially and temporally regulated expression of the LIM class homeobox gene Hrlim suggests multiple distinct functions in development of the ascidian, Halocynthia roretzi. Mechanisms Dev. 1995;51:115–126. doi: 10.1016/0925-4773(95)00359-9. [DOI] [PubMed] [Google Scholar]
- 54.Satou Y, et al. A cDNA resource from the basal chordate Ciona intestinalis. Genes. 2002;33:153–154. doi: 10.1002/gene.10119. [DOI] [PubMed] [Google Scholar]
- 55.Ritchie ME, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43:e47–e47. doi: 10.1093/nar/gkv007. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
RNA-seq data have been deposited in the ArrayExpress database at EMBL-EBI (www.ebi.ac.uk/arrayexpress) under accession number E-MTAB-6117.
Software is available at https://github.com/ilsley/Ciona16.