Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2017 Sep 8;7:11030. doi: 10.1038/s41598-017-10704-4

Complete sequence-based pathway analysis by differential on-chip DNA and RNA extraction from a single cell

D van Strijp 1, R C M Vulders 1, N A Larsen 2, J Schira 3, L Baerlocher 3, M A van Driel 1, M Pødenphant 4, T S Hansen 2, A Kristensen 4, K U Mir 5, T Olesen 2, W F J Verhaegh 1, R Marie 4, P J van der Zaag 1,
PMCID: PMC5591254  PMID: 28887473

Abstract

We demonstrate on-chip, differential DNA and RNA extraction from a single cell using a microfluidic chip and a two-stage lysis protocol. This method enables direct use of the whole extract, without additional washing steps, reducing sample loss. Using this method, the tumor driving pathway in individual cells from a colorectal cancer cell line was determined by applying a Bayesian computational pathway model to sequences obtained from the RNA fraction of a single cell and, the mutations driving the pathway were determined by analyzing sequences obtained from the DNA fraction of the same single cell. This combined functional and mutational pathway assessment of a single cell could be of significant value for dissecting cellular heterogeneity in tumors and analyzing single circulating tumor cells.

Introduction

Accumulation of genetic changes leads to inter-cellular heterogeneity within tumours producing sub-populations of cells that have different potentials for metastasis and drug resistance1. Molecular differences between cells can be investigated by transcriptome2, 3 and genome4, 5 sequencing. We have developed a novel microfluidic device and a two-stage cell lysis method to extract and sequence separately both RNA and DNA from the same single cell. We applied this to cells from colorectal cancer cell lines to show, using a Bayesian computational pathway model6, that Wnt transcriptional pathway activity could be derived from the obtained single cell RNA profile. Whole-genome sequence data from the same cell revealed which genes in the Wnt pathway were mutated and hence candidate drivers of the aberrant signalling. Such single-cell resolved molecular phenotype to genotype correlations have implications in cancer management, including prognosis, prediction and personalized treatment.

Sequencing tumours in routine clinical practice reveals a multitude of variants, whose relevance for the individual patient is difficult to assess. This has led to discussions about driver and passenger mutations1 and with the premise that cancer is a clonal evolutionary process of somatic mutations7, the current focus is on known actionable mutations8. Unfortunately, a direct relationship does not always exist between DNA mutations and the activated molecular pathways driving tumor growth. In medulloblastoma, a direct relationship exists between mutations in CTNNB1 (coding for ß-catenin) and Wnt activity. In liver carcinoma such a clear relationship between CTNNB1 mutations and the Wnt pathway activation does not exist6, while in breast cancer, a mutation in PIK3CA is not always predictive of PI3K inhibitor response9. Epigenetic changes may also activate molecular pathways. We propose to identify driver mutations by determining both the genotype and the molecular phenotype10 of a tumor6, 11, 12: mRNA patterns will reveal which pathway is activated and DNA mutations will reveal where within the pathway the deregulation occurs. Thus, druggable mutations that might be the root cause of the activated pathway can be found. An ability to conduct such genotype and molecular phenotype analysis on single cells provides the resolution needed to address heterogeneity within a tumour and the sensitivity for a similar analysis on circulating tumour cells (CTCs).

Two recent approaches describe analysing the genome and transcriptome from the same single cell: DR-seq (gDNA-mRNA sequencing)13 and G&T-seq (genome and transcriptome sequencing)14. Both are bench protocols with multiple handling steps, not ideal in a diagnostic setting and risking sample loss. We report a two-step lysis procedure on cells captured in picoliter traps in a microfluidic chip, to extract consecutively the RNA and DNA from a single cell. This simplifies processing of the extracts and enables the use of conventional, commercial kits for single cell RNA and DNA amplification on the separated nucleic acid fractions. After sequencing both extracts, we assess complete pathway activity at both geno- and phenotypical level in single cells using a Bayesian network-based computational pathway model previously applied only to bulk samples6, 12.

Results

To sequence both RNA and DNA from the same single cell, we use a microfluidic chip to process cells and their content, plus a two-stage lysis procedure where the plasma membrane is lysed first to release the cytoplasmic RNA and then the nuclear membrane is lysed to release the DNA. A microfluidic chip is mounted in a set-up capable of affecting cell flow and monitoring individual cells and their processing (Fig. 1). Figure 1a gives a schematic diagram of the system, while Fig. 1b shows pictures. The system provides bright field and fluorescence in-situ monitoring of the cells and allows them to be processed in the chip by stepwise flow of reagents. The movement of the cells and reagent liquids in the chip is controlled by a pressure-driven system. A heating and cooling stage enables thermal cycling of the chip for nucleic acid amplification. Figure 1c shows the layout and a picture of the chips used. The pressure-driven microfluidics enables a valveless, passive, inexpensive, disposable chip to perform all necessary functions. Figure 1d shows how PBS buffer applied from the other inlet channels I1 and I2 pushes the cells, flowing from inlet S to the waste chamber, in the direction of the traps, each connected to individual outlets (Oi with i being 1 to 8).

Figure 1.

Figure 1

Single cell microfluidics instrument and chip. (a) Diagram and (b) picture of the set-up for operating the single cell platform. The system is a modified Philips BioCell FluidScope and provides bright field (BF) and epifluorescence (FL) in-situ monitoring of cells and their processing, a stage to mount microfluidic chips, focus automation and pressure-driven flow control (for details see Methods). A separate heating and cooling module in the set-up (see lower half Fig. 1b) enables nucleic acid amplification within the chip. The lower half of (b) shows also the position in which the microfluidic chip is mounted and the gasket through which pressure-driven flow is applied. (c) Diagram and picture of the single-use injection-molded microfluidic chip. In the diagram S labels the sample input, I1 and I2 the two buffer inputs, Oi the in total 8 outputs and W the waste chamber. The insert shows a blow-up of the channel and two of the 8 traps. (d) Schematic diagram of how cells are aligned in the main channel by flow-pinching and delivered to the traps downstream.

Figure 2a shows various steps in the two-stage lysis protocol for separate RNA and DNA extraction using the microfluidic chip. First calcein-stained cells are captured in pl volume traps. Calcein fluorescence allows monitoring the presence and movement of these viable cells. Next the first lysis buffer (0.5x TBE containing 0.5% (v/v) Triton X-100 to which the DNA intercalating YOYO-1 dye is added) is applied. This buffer lyses the cell membrane, releasing the cytosol contents into the trap outlets filled with 10–20 μl nuclease-free H2O, leaving the nucleus with the DNA in the trap. This process can be monitored as a time-lapse series (Fig. 2b), where the top row shows the bright-field, the middle row the fluorescent, and the bottom row the merged images. Through pseudo-colors, the calcein (green) and YOYO-1 (blue) fluorescence have been highlighted. The first column shows the situation with a trapped cell (see arrow). After application of the first lysis buffer, the fluorescence due to the calcein disappears (second column). After ≃3 minutes the fluorescence signal reappears as the YOYO-1 dye stains the DNA (third column). The cytosol content of each separate cell is lysed into and collected from a separate outlet and processed off-chip to synthesize and amplify cDNA. Subsequently, the nuclear lysis is performed with a second buffer (0.5x TBE containing 0.5% (v/v) Triton X-100 and Protease K). This causes the disappearance of the DNA from the traps into the outlets; see fourth column Fig. 2b. After reagent addition for whole genome amplification (WGA), the system is put at 30 °C for on-chip WGA. The final step is a 60 °C heat kill to end the amplification process. Subsequently, each sample is collected and sent with the corresponding cDNA sample for sequencing.

Figure 2.

Figure 2

Two-stage lysis. (a) Diagram of the process flow: a single cell is trapped and its membrane is lysed releasing the mRNA that is collected. Next, the DNA is stained with fluorescent YOYO-1 dye to confirm its presence in the trap before it is lysed and genomic DNA can be collected in the outlet. (b) Bright field and fluorescence and merged time-lapse images from the instrument, during trapping, cell membrane lysis (loss of cytosol dye) and staining of the nuclear DNA retained in the cell trap. The scale bar corresponds to 15 μm. The white arrow in bright-field image shows the position of the trapped cell. In the merged image the fluorescence is shown in colour.

Functional activity of the canonical Wnt pathway was analysed using a Bayesian network that represents its transcriptional program6, 12 with three types of nodes: (a) a transcription complex (TC), (b) Wnt target genes, and (c) expression level measurements of the target genes (Fig. 3a). The model quantitatively describes the relation between the TC and the target genes, and between each target gene and its associated measurements. Once such a model has been constructed from prior knowledge about which genes and expression levels to include, the model can be used on newly-analyzed samples by entering their measured expression values and applying Bayesian reasoning to infer the probability that the TC must have been active6, 15. The initial Bayesian Wnt pathway model, including 34 target genes, constructed using microarray data6, 16, is adapted to RNA-sequencing data here. The model was first calibrated on RNA-seq data from samples with known Wnt activity status based on analysis of microarray data (see GSE 24795) and literature17, using log2 RPKM measurements (reads per kilobase per million mapped reads): two Wnt active LS180 and SW1222 samples and two Wnt inactive RKO samples, shown as the first group in Fig. 3c.

Figure 3.

Figure 3

Complete Wnt pathway analysis results. (a) The structure of the Bayesian network used to model the transcriptional activity of a signal transductional pathway. (b) The panel shows putative driver mutations identified in the genes CTNNB1, KRAS, and PIK3CA for these samples, using the DNA derived from the same single cell for which the odds on Wnt activity are displayed in the corresponding columns below (c). (c) Results from Wnt pathway model analysis for the samples studied. Each sample result is represented by a bar with different colours for each cell line, as indicated. The y-axis shows the inferred odds of the Wnt pathway being active (on) versus inactive (off) on a logarithmic scale. For calibration, the RNA extracted from four colon cancer cell line experiments were used, which were known to be Wnt active (LS180, orange; SW1222 yellow) or inactive (RKO, grey) based on analysis of microarray data (see GSE 24795) and literature17. Subsequently the model was verified for LS174T (green) and a CC20 (black) samples known to be Wnt (in)active based on analysis of microarray data (see GSE 24795) and the literature18. Finally, the Wnt activity odds were inferred for the single cells processed: LS174T (green) and RKO (grey). All cells were correctly classified as Wnt active or inactive. (d) Cluster plot of the gene expression for the single cell data for LS174T and RKO cells including the expression data for bulk reference samples (LS180, SW122, RKO, LS174T and CC20) across the 34 genes constituting the Wnt profile, which are listed on the right hand-side6. The colour scale, as also shown on the left side, indicates the degree to which a gene’s expression is above (red) or below (blue) its threshold value set to separate the active from inactive calibration samples. See the main text for how the variation in the individual gene expressions translate into the resulting odds for the Wnt transcription complex activity i.e. Wnt pathway being (in)active.

Subsequently, the calibrated model was used on newly-analysed samples to infer the probability of TC activity. The model correctly classified CC20 and additional LS174T samples as Wnt inactive and Wnt active, respectively (labelled’verification’ in Fig. 3c), given previous microarray (see GSE24795) and literature data18. This verification also showed that the amount of RNA input (1 or 10 ng), did not influence the results (first two’verification’ samples in Fig. 3c). No differences in reported pathway activity were found for the LS174T cell lines, whether purified RNA extracted from bulk samples, or cells themselves (e.g. crude lysate) were directly used as input. This is relevant because in the single cell experiments, cells will serve as input rather than purified RNA. Finally, there was no difference in predicted pathway activity between cell line samples from the Weatherall Institute (Oxford) or those purchased from the ATCC; even when 3–4 cells were used as input (data not shown). Finally, RNA-seq measurements of single cells were analysed (‘single cell’ group in Fig. 3c). All seven analysed single LS174T cells (green) were classified to be Wnt active, showing that the Bayesian model can determine Wnt activity in single cells. These single cells were not synchronized in order to assess the power of our approach for determining pathway activity in a heterogeneous pool of cells. Figure 3d shows a cluster plot of the probability that the genes in the Bayesian Wnt model are up- or down-regulated compared to the threshold values set in the calibration step. Although RNA expression levels of individual genes varied between cells (see Fig. 3d), the model robustly infers Wnt pathway activity as it uses a panel of 34 target genes.

Some genes, up-regulated in LS174T verification samples with RNA as input (first four green bars), were not up-regulated when 1000 cells (with a similar RNA content) and single cells were used. These genes are generally expressed at low levels making them prone to disturbances in the amplification process, due to the presence of crude cell content. Still, from the calculated odds for the TC to be present in these samples (odds ratio ~1:1000), the Wnt pathway is active, despite some of these genes no longer being upregulated. The Bayesian Wnt model was constructed to give more weight to up-regulated genes in test samples, by a factor of approximately five, than non-up-regulated genes (see the probability Tables in Eq. (1)6) because of the intrinsic noise in biological systems. A number of confounding factors exist that may lower the expression of a gene even though the pathway and its transcription complex are active19. Bayesian modeling is very suited to deal with this noisiness. As a negative biological control we analysed single RKO cells (grey), known to be Wnt inactive based on microarray data (see GSE24795). As Fig. 3c shows, all 6 cells studied were inferred to be Wnt inactive. Our pathway model thus correctly classified all 7 Wnt active and 6 Wnt inactive validation samples (Fisher’s exact test p = 0.0006).

For the same cells, DNA isolated and amplified on chip was sequenced. Figure 4 shows the Lorenz (a) and coverage graphs (b) obtained for this data. The Lorenz graph in Fig. 4a displays the cumulative fraction of total reads as a function of the cumulative fraction of the genome covered. The advantage of a Lorenz graph is that it provides a good representation of the coverage diversity even at low coverage. In our initial experiments the percentage of non-covered bases was high and varying. After improving the temperature regulation, the percentage of non-covered bases could be reduced to 10.6 ± 0.9% for an average read depth of 16.2 ± 1.0, which is close to the 8.4% non-covered bases obtained for a bulk sample (see Supplementary Table S1), which is important for driver mutation identification. Moreover, this also compares favourably to previous single cell sequencing methods. For the multiple annealing and looping-based amplification cycles (MALBAC) method, coverages of 85 up to 93% (≥1x) were achieved20. Other (bench-based) single cell RNA and DNA sequencing methods yield a coverage between 62–80% (at 0.6–2.5x) for DR-seq13 and 67.2 ± 8.1% for G&T-seq using multiple-displacement amplification (MDA)14. Note that in some publications these numbers are provided in comparison to the bulk coverage20. Therefore our results show the advantage of the on-chip DNA amplification for single cell DNA analysis to improve coverage. Figure 4b shows the coverage graph for these data i.e. the % of the genome covered vs the coverage depth. This graph is useful as it shows the percentage of the bases that achieve the 20 or 30x coverage typically used for SNP calling. Note that further sequencing will shift the fraction of the bases with sufficient coverage, but it will not improve the fraction of the bases covered, as this is determined by the initial loss of DNA going into the first amplification step.

Figure 4.

Figure 4

Lorenz and coverage graphs. (a) Lorenz graph and (b) coverage graph of the DNA sequencing coverage for the single cells samples LS174T (green lines) and a bulk LS174T sample (black) shown as reference.

From a mutation analysis of the DNA sequencing data for samples with good coverage, C ≥ 70% of the genome, the known driver mutation in CTNNB1 (c.134 C > T) for the Wnt pathway in the LS174T cell line was found, see Fig. 3b. This CTNNB1 mutation leads to an amino acid change S45F, which alters potential GSK-3 β phosphorylation sites of β-catenin, thereby hampering its degradation and hence leading to activation of the TCF4/β-catenin transcription complex driving the canonical Wnt pathway21. Also other known mutations of LS174T cells in KRAS (c.35 G > A) and PIK3CA (c.3140 A > G) were found, see Fig. 3b. A mutation analysis of RKO single cells showed wild type CTNNB1 in 5 out of 5 cells (one had insufficient coverage), as expected for RKO and matching the Wnt pathway inactivity. The RKO single cell DNA analysis showed wild type KRAS in 5 out of 5 cells, and mutated PIK3CA in 5 out of 5 cells. The latter mutation (c.3140 A > G) induces activity of the PI3K pathway in the RKO cells, and also in the LS174T cells, as indicated by an inactive FOXO transcription factor, even though we found FOXO family members are expressed.

Discussion

Two other approaches have been published for analysing both the genome and transcriptome from the same single cell, namely DR-seq13 and G&T-seq14 recently. These are bench protocols which can be automated by using a robotic workstation14. By contrast, the approach we have presented here is an integrated approach, in which the two crucial process steps, RNA and DNA separation and single cell DNA amplification, have been performed in a microfluidic chip. A physical method of RNA and DNA separation is implemented using a microfluidic chip and a two-stage lysis protocol. Compared to the DR-seq approach, the use of a physical method to separate the gDNA from the RNA has three advantages. First, there is no need for masking-out the coding sequences when mapping the gDNA fraction to the genome13, implying that mutations in genes which are not expressed or are suppressed can also be found. Secondly, the whole cytosol and nuclear extract is used, while in the DR-seq approach the sample is split in half13. Thirdly, conventional, commercial kits for single cell RNA and DNA amplification can be used, as the two nucleic acids fractions are separated, whereas DR-seq requires an additional in-vitro transcription step in the RNA sample preparation13. The G&T-seq method does use physical means for separating the RNA and DNA by using oligo-dT coated magnetic beads to extract the mRNA. Here the advantage of our approach is that the processing is done within a microfluidic chip, which limits handling, transfer and separation steps and thereby reduces sample loss and contamination. This is particularly important, when processing the DNA and RNA content from a single cell13. Note that the G&T-seq protocol involves a number of handling steps to separate the RNA and DNA. To maximize gDNA recovery, the beads capturing the RNA have to be washed four times and the eluate of each wash has to be added to the original supernatant with the gDNA. Finally, a microfluidic cartridge in which processing is done may also bring cost benefits through reduction of labor and use of standard commercial kits, which should outweigh the costs of the dedicated instrument required.

The RNA-seq data have been analyzed based on a Bayesian model for the canonical Wnt pathway, known to be active in colorectal cancer (cell lines)16. Similar Bayesian models are being developed for the other tumor driving pathways. Once these additional models become available, one could determine which tumor driving pathways are active from the measured gene expression profiles, without prior knowledge which pathway may be active. Since tumors tend to be heterogeneous and comprised of (sub)clones, this work would enable a systematic approach to find these clones, by first establishing which tumor driving pathways are active and then finding the mutations that drive them, using single cell resolution. This approach can be seen as an example of an integrative network modelling approach (see refs 22 and 23). Being able to see which tumor driving signalling pathways are active as the disease progresses, may provide new insights into tumor evolution, and may enable the selection of targeted drug combinations to treat cancer patients more effectively.

Conclusion

We have presented a method using a microfluidic chip and a two-stage lysis protocol, to extract both the RNA and the DNA from the same single cell. An important advantage of this approach is that the DNA and RNA content from a single cell can be processed directly after lysis, owing to the picoliter cell trap volume, without the need for additional washing and handling steps that increase the risk of removing some of the nucleic acid content. Moreover, in contrast to the G&T method, our method will also extract RNAs without a poly-A tail24, and thus could enable the partitioning of any RNAs between cytosol and nucleus25, 26.

Sequencing of each fraction enables determination of pathway activity from the cytosol faction and the mutations responsible for driving this pathway from the nuclear fraction. This ability to predict pathway activity indicates that the Bayesian pathway-modelling approach, previously only applied to bulk samples, can be extended to single cells. Identifying active pathways and their driver mutations and weighing their contribution by single cell analysis of a heterogeneous tumour could be important for guiding targeted therapy. Pathway analysis by differential on-chip DNA and RNA extraction could also be a very promising approach for analysis of individually captured circulating tumour cells.

Methods

System

The set-up used to monitor and control single cells in the microfluidic chip and enabling movement, trapping, RNA/DNA extraction and amplification comprises the following modules:

  • A microscopy module enabling epi-fluorescence and bright-field imaging

  • A temperature module

  • A flow control module

  • Microfluidic chip adaptor and tube connector

  • XY translation and focus adjustment unit

Each module is operated through a common graphical user interface program (GUI) and characterized as follows (see Fig. 1):

a) Microscopy module: The main elements of the module are the following components: Microscope objective (Wild Heerbrugg: 10X, NA 0.45). Excitation light source for fluorescence microscopy: A LED with center wavelength at 470 nm (Thorlabs model M470L3) equipped with collimator lenses. Fluorescence filters (Semrock, barrier filter model 733–527/645–25, dichroic mirror model 733–474/23–25, excitation filter model 733–495/605-Di01-25 × 36). CMOS imaging sensor (Fairchild, CIS1910) with 1920 × 1080 pixels, and pixel size 6.5 μ m, using a home-built image board. The bright-Field illuminator delivers light from a LED with center wavelength of 505 nm.

b) XY translation and focus adjustment unit: The XY translation is managed by moving the microfluidic chip in the Y-direction and illuminator-objective arm in the X-direction. The z-translation for focussing is performed by moving the objective along the optical axis.

c) Flow control module: The control of moving and trapping the cells in the microfluidic chip is achieved by applying minor elevated air pressure of typically 5–10 mbar onto the inlet channels. The pressure is delivered by an external pressure-based flow control module (Fluigent, MFCSTM–EZ 4C). This provides 4 pneumatic, independently controllable channels that are connected to the tube connector of the chip adaptor, see Fig. 1a and b.

d) Temperature module: The heating and cooling of the microfluidic chip is done by three Peltier elements. The temperature profile used for the nucleic acid amplification within the chip is regulated by PID control of the Peltier elements. Temperature feedback is provided by a thermo-couple situated inside the heating /cooling plate that the microfluidic chip is clamped on (see Fig. 1). Due to a temperature difference between this plate and the sample volume, the plate temperature is elevated by a calibrated offset.

Device fabrication

30 μm deep microchannels were created by reactive ion etching in silicon and replicated in nickel by electroplating. The nickel master was replicated in TOPAS 5013 (Topas, Germany) by injection molding using a tool creating twelve holes for fluidic connections. The device was sealed by UV-assisted thermal bonding of a 150 μm thick TOPAS 5013 foil (Topas, Germany).

Cell culture

The colorectal cancer cell line samples came from the Weatherall Institute of Molecular Medicine (Oxford, U.K.), which maintains a large collection of colorectal cancer cell lines. The cell lines CC20, LS174T, LS180, and SW1222, were cultured in DMEM (Gibco, Life Technologies). RKO was cultured in RPMI (Gibco, Life Technologies) all media were supplemented with 10% FBS (Gibco, Life Technologies), 2 mM Glutamax (Life Technologies) and 1% pen/strep (Gibco, Life Technologies). Cells were grown at 37 °C in a humidified incubator containing 10% CO2. Prior and after experiments, the cells were tested for Mycoplasma contamination (Agilent). For verification purposes an additional LS174T cell line sample was purchased directly from the ATCC (Manassas, VA, USA).

Cell staining

Freshly prepared cells were detached with 0.25% trypsin containing 0.04% EDTA (Gibco). Upon detachment the cells were taken up in medium and spun down, the cells were washed once with PBS and finally resuspended in PBS containing 1 μM calcein-AM (Sigma Aldrich). Cells were incubated for 30 minutes at room temperature prior to loading them onto the chip. The on-chip nuclear staining was achieved during lysis of the cellular membrane by applying a lysis buffer consisting of 0.5x TBE (Sigma Aldrich) containing 0.5 % Triton-X100 (Sigma Aldrich) and 1 μM YOYO-1 (ThermoFischer Scientific). (See main text and Fig. 2).

RNA extraction

RNA was extracted and purified from the cell line samples described in the cell culture section using a commercial kit (VERSANT Sample Preparation 1.0 Reagents kit) according to manufacturer’s instructions. The cells were trypsinized, washed once with PBS and resuspended in PBS. RNA was extracted from ≈105 cells and stored at −80 °C.

Sample preparation

The RNA fraction of a single cell was converted to cDNA and amplified using the SMART-Seq v4 Ultra Low Input RNA Kit for Sequencing (Takara/ Clontech) according to manufacturer’s instructions27. For the calibration and verification of the pathway model also higher inputs of cells or purified RNA were used for cDNA synthesis and amplification using the same kit. The gDNA fraction of a single cell was amplified on chip using the REPLI-g Single Cell kit (Qiagen).

Sequencing

Sequencing libraries were prepared using the Nextera libray preparation kit from inputs as low as 10 ng. The Nextera XT DNA library preparation kit was used when the input was only ≥1 ng of cDNA or DNA. After cluster formation on the Illumina cBot using the HiSeq Paired End Cluster Kit v4, sequencing was done on the Illumina HiSeq. 2500, using 2 × 125 bp paired-end sequencing with the HiSeq SBS kit v4 (250 cycles).

Bioinformatic analysis

The DNA sequencing reads were aligned to the reference human genome GRCh37 release 60 using the Burrows-Wheeler alignment tool version 0.7.5a28. Reference coverage statics were computed with the BEDtool version 2.9.0 on the bam files.

For alignment of the RNA sequencing data the pipeline proposed by Fonseca et al.29 was used (version 0.4.2, mapper = tophat2 (v2.0.10), quantification method = htseq. 2 (v0.6.1p1)). Determination of the gene expression levels was described in reads per kbase per million mapped reads, RPKM.

Electronic supplementary material

Acknowledgements

The authors gratefully acknowledge funding from the European Commission under the Seventh Framework Programme (FP7/2007–2013) under grant agreement number 278204 (Cell-0-Matic). The authors thank R. Kragt for the design of the heating stage, Walter Bodmer and Kamila Koprowska for providing the colon cancer cell lines used in this study, and Charles Cantor and Anja van de Stolpe for stimulating discussions.

Author Contributions

D.v.S., R.V., P.J.Z. and R.M. performed the experiments, N.A.L., T.S.H. and T.O. designed and built the instrument, R.M. and M.P. designed the chips, J.S. and L.B. led the sequencing, M.v.D. and W.V. performed the RNA and pathway analysis, P.J.Z., R.M., K.M., and A.K. conceived and oversaw the project, P.J.Z., W.V., D.v.S, R.M., and K.M. drafted the manuscript with topical input from all other authors.

Competing Interests

D.v.S., R.V., N.A.L., M.v.D., W.V. and P.J.Z. are, and T.S.H. and T.O. have been, Philips employees and may own company stock. J.S. was and L.B. is employee of Fasteris. K.M. is the founder of XGenomes.

Footnotes

Electronic supplementary material

Supplementary information accompanies this paper at doi:10.1038/s41598-017-10704-4

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Vogelstein B, et al. Cancer Genome Landscapes. Science. 2013;339:1546–1558. doi: 10.1126/science.1235122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Hashimshony T, Wagner F, Sher N, Yanai I. CEL-Seq: Single-cell RNA-Seq by multiplexed linear amplification. Cell Rep. 2012;2:666–673. doi: 10.1016/j.celrep.2012.08.003. [DOI] [PubMed] [Google Scholar]
  • 3.Ramsköld D, et al. Full-length mRNA-seq from single-cell levels of RNA and individual circulating tumor cells. Nat. Biotech. 2012;30:777–782. doi: 10.1038/nbt.2282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Navin N, et al. Tumor evolution inferred by single cell sequencing. Nature. 2011;472:90–95. doi: 10.1038/nature09807. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Xu X, Yin X, Bao L, Tang A, et al. Single cell exosome sequencing reveals single-nucleotide mutation characteristics of a kidney tumor. Cell. 2012;148:886–895. doi: 10.1016/j.cell.2012.02.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Verhaegh W, et al. Selection of personalized Patient therapy through the use of knowledge-based computational models that identify tumor-driving signal transduction pathways. Cancer Res. 2014;74:2936. doi: 10.1158/0008-5472.CAN-13-2515. [DOI] [PubMed] [Google Scholar]
  • 7.Ding L, et al. Clonal evolutionl in relapsed acute myeloid leukaemia revealed by whole-genome sequnecing. Nature. 2012;418:506–510. doi: 10.1038/nature10738. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Guinney J, et al. The consensus molecular subtypes of colorectal cancer. Nat. Medicine. 2015;21:1350–1356. doi: 10.1038/nm.3967. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Mayer, I. A. et al. SU2C Phase Ib study of the PI3K-alpha inhibitor BYL719 (apelisib) with letrozole in ER+/HER2- metastatic breast cancer. Cancer Res. 75 (15 Suppl) Abstract nr. CT232 (2015).
  • 10.Zhang JD, Küng E, Boess F, Certa U, Ebeling M. Pathway reporter genes define molecular phenotypes of human cells. MBC Genomics. 2015;16:342. doi: 10.1186/s12864-015-1532-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hanahan D, Weinberg RA. Hallmarks of Cancer: The Next Generation. Cell. 2011;144:646–674. doi: 10.1016/j.cell.2011.02.013. [DOI] [PubMed] [Google Scholar]
  • 12.Verhaegh W, van de Stolpe A. Knowledge-based computational models. OncoTarget. 2014;5:5196–5197. doi: 10.18632/oncotarget.2276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Dey SS, et al. Integrated genome and transcriptome sequencing of the same cell. Nat. Biotech. 2015;33:285–289. doi: 10.1038/nbt.3129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Macaulay IC, et al. G&T-seq: parallel sequencing of single cell genomes and transcriptomes. Nat. Methods. 2015;12:519–522. doi: 10.1038/nmeth.3370. [DOI] [PubMed] [Google Scholar]
  • 15.Pearl, J. Reverend Bayes on interference engines: A distributed hierarchical approach. AAAI proceedings 133–136 (1982).
  • 16.Clevers H, Nusse R. Wnt/β Catenin signalling and disease. Cell. 2012;149:1192–1205. doi: 10.1016/j.cell.2012.05.012. [DOI] [PubMed] [Google Scholar]
  • 17.Pai P, et al. MUC4 is negatively regulated through the Wnt β-catenin pathway via the Notch effector Hath1 in colorectal cancer. Genes Cancer. 2016;7:154–168. doi: 10.18632/genesandcancer.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.van der Flier LG, et al. The Intestinal Wnt/TCF Signature. Gasteroenterology. 2007;132:628–632. doi: 10.1053/j.gastro.2006.08.039. [DOI] [PubMed] [Google Scholar]
  • 19.Lin SY, et al. Beta-catenin, a novel prognostic marker for breast cancer: its role in cyclin D1 expression and cancer progression. Proc. Natl. Acad. Sci. USA. 2000;68:6779–88. doi: 10.1073/pnas.060025397. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Zhong CH, Lu S, Chapman AR, Xie XS. Genome wide detection of single nucleotide and copy number variation of a single human cell. Science. 2012;338:1622–1626. doi: 10.1126/science.1229164. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Morin PJ, et al. Activation of beta-catenin-Tcf signaling in colon cancer by mutations in beta-catenin or APC. Science. 1997;275:1787–90. doi: 10.1126/science.275.5307.1787. [DOI] [PubMed] [Google Scholar]
  • 22.Wang E, et al. Cancer systems biology in the genome sequencing era: Part1, dissecting and modelling of tumor clones and their networks. Sem. Can. Bio. 2013;23:279–285. doi: 10.1016/j.semcancer.2013.06.002. [DOI] [PubMed] [Google Scholar]
  • 23.Wang E, et al. Cancer systems biology in the genome sequencing era: Part2, evolutionary dynamics of tumor clones networks and drug reistance. Sem. Can. Bio. 2013;23:286–292. doi: 10.1016/j.semcancer.2013.06.001. [DOI] [PubMed] [Google Scholar]
  • 24.Yang L, Duff MO, Graveley BR, Carmichael GG, Chen L-L. Genomewide characterization of non-polyadenylated RNAs. Gen. Biology. 2011;12:R16. doi: 10.1186/gb-2011-12-2-r16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Kato, M., Han., T. W., Xie, S., Shi, K. et al. Cell-free formation of RNA granules: low complexity sequence domains form dynamic fibers within hydrogels. Cell149 753–767. [DOI] [PMC free article] [PubMed]
  • 26.Han, T. W., Kato, M., Xie, S., Wu, L. C. et al. Cell-free formation of RNA granules: bound RNAs identify features and components of cellular assemblies. Cell149 768–779. [DOI] [PubMed]
  • 27.Picelli S, et al. Full-length RNA-seq from single cells using Smart-seq. 2. Nature Protocols. 2014;9:171–181. doi: 10.1038/nprot.2014.006. [DOI] [PubMed] [Google Scholar]
  • 28.Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinform. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Fonseca NA, Marioni J, Brazma A. RNA-Seq gen profiling - A systematic emprical comparison. PLOS One. 2014;9:e107026. doi: 10.1371/journal.pone.0107026. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES