Skip to main content
eLife logoLink to eLife
. 2022 Feb 23;11:e73971. doi: 10.7554/eLife.73971

Hydrop enables droplet-based single-cell ATAC-seq and single-cell RNA-seq using dissolvable hydrogel beads

Florian V De Rop 1,2, Joy N Ismail 1,2, Carmen Bravo González-Blas 1,2, Gert J Hulselmans 1,2, Christopher Campbell Flerin 1,2,3, Jasper Janssens 1,2, Koen Theunis 1,2,3, Valerie M Christiaens 1,2, Jasper Wouters 1,2, Gabriele Marcassa 1,4, Joris de Wit 1,4, Suresh Poovathingal 1,†,, Stein Aerts 1,2,†,
Editors: Naama Barkai5, Naama Barkai6
PMCID: PMC8993220  PMID: 35195064

Abstract

Single-cell RNA-seq and single-cell assay for transposase-accessible chromatin (ATAC-seq) technologies are used extensively to create cell type atlases for a wide range of organisms, tissues, and disease processes. To increase the scale of these atlases, lower the cost and pave the way for more specialized multiome assays, custom droplet microfluidics may provide solutions complementary to commercial setups. We developed HyDrop, a flexible and open-source droplet microfluidic platform encompassing three protocols. The first protocol involves creating dissolvable hydrogel beads with custom oligos that can be released in the droplets. In the second protocol, we demonstrate the use of these beads for HyDrop-ATAC, a low-cost noncommercial scATAC-seq protocol in droplets. After validating HyDrop-ATAC, we applied it to flash-frozen mouse cortex and generated 7996 high-quality single-cell chromatin accessibility profiles in a single run. In the third protocol, we adapt both the reaction chemistry and the capture sequence of the barcoded hydrogel bead to capture mRNA, and demonstrate a significant improvement in throughput and sensitivity compared to previous open-source droplet-based scRNA-seq assays (Drop-seq and inDrop). Similarly, we applied HyDrop-RNA to flash-frozen mouse cortex and generated 9508 single-cell transcriptomes closely matching reference single-cell gene expression data. Finally, we leveraged HyDrop-RNA’s high capture rate to analyze a small population of fluorescence-activated cell sorted neurons from the Drosophila brain, confirming the protocol’s applicability to low input samples and small cells. HyDrop is currently capable of generating single-cell data in high throughput and at a reduced cost compared to commercial methods, and we envision that HyDrop can be further developed to be compatible with novel (multi) omics protocols.

Research organism: Drosophila melanogaster, Human, Mouse

eLife digest

Scientists are now able to determine the order of chemical blocks, or nucleic acids, that make up the genetic code. These sequencing tools can be used to identify which genes are active within a biological sample. They do this by extracting and analysing open chromatin (regions of DNA that are accessible to the cell’s machinery), or sequences of RNA (the molecular templates cells use to translate genes into working proteins).

Initially, most sequencing tools could only provide an ‘averaged-out’ profile of the genes activated in bulk pieces of tissue which contain multiple types of cell. However, advances in technology have led to new methods that can extract and analyse open chromatin or RNA from individual cells.

First, the cells are separated, via a technique called microfluidics, into tiny droplets of water along with a single bead that carries a unique barcode. The cell is then broken apart inside the droplet and the barcode within the bead gets released and attaches itself to the genetic material extracted from the cell. All the genetic material inside the droplets is then pooled together and sequenced. Researchers then use the barcode tags to identify which bits of RNA or DNA belong to each cell.

Single-cell sequencing has many advantages, including being able to pinpoint precise genetic differences between healthy and abnormal cells, and to create cell atlases of whole organisms, tissues and microbial communities. But existing methods for extracting chromatin are very expensive, and there were no openly available tools for processing thousands of cells at speed. Furthermore, while several single-cell RNA sequencing tools are already freely available, they are not very sensitive or practical to use.

Here, De Rop et al. have developed a new open-source platform called HyDrop that overcomes these barriers. The method entails a new type of barcoded bead and optimised elements of existing microfluidics protocols using open-source reagents. These changes created a more user-friendly workflow and increased sensitivity of sequencing at no additional cost.

De Rop et al. used their new platform to screen the RNA and open chromatin of thousands of individuals cells from the brains of mice and flies. HyDrop outperformed other open-source methods when working in RNA-sequencing mode. It also provides the first open-source tool for sequencing open chromatin in single cells. Further improvements are expected as researchers tweak the platform, which for now provides an affordable alternative to existing methods.

Introduction

In the past 5 years, droplet microfluidics have been applied extensively to partition single cells and sequence their nucleic acid content. Two methods that pioneered the field, inDrop (Klein et al., 2015) and Drop-seq (Macosko et al., 2015), both rely on the same working principle: individual cells are rapidly encapsulated into a nanoliter droplet together with a barcoded bead. Barcoding primers carried by the beads inside the emulsion are then used to index each individual cell’s mRNA. This process occurs either inside the droplet, where the cell’s mRNA is reverse transcribed using barcoded primers released by the barcoded bead (inDrop) or after emulsion breaking, where the cellular mRNA is anchored onto barcodes carried by resin beads (Drop-seq). The thousands of single-cell transcriptomes can then be processed for next-generation sequencing in a pooled manner. The high throughput and low per-cell reagent consumption associated with this approach have allowed researchers to profile gene expression of tens of thousands of single cells (Kalish et al., 2020; Yap et al., 2021; Karaiskos et al., 2017) at an affordable cost. However, the labor requirements of current open-source droplet microfluidic protocols combined with their limited sensitivity have hindered widespread adoption. Furthermore, academic development of droplet-microfluidic single-cell sequencing technology after inDrop and Drop-seq has been limited. To our knowledge, only one noncommercial droplet-based single-cell assay for transposase accessible chromatin (scATAC-seq) protocol has been published so far (Chen et al., 2019). Despite its elegant conceptual solution to droplet-based combined scATAC and scRNA-seq, the SNARE-seq protocol is still labor intensive, and the use of resin beads leads to reduced cell capture rates. These resin beads are loaded at dilute concentrations to prevent microfluidic obstruction, and as a result, many droplets are empty, leading to a low cell capture rate (~2%), and reagent waste (Zhang et al., 2019). Instead, deformable hydrogel beads can be stacked and loaded into droplets at a fixed rate without the risk of microfluidic failure, thereby increasing cell capture rate to >50% (Zhang et al., 2019; Abate et al., 2009). Several commercial solutions have emerged since inDrop and Drop-seq (Zheng et al., 2017; Sarkar et al., 2018), and their application has been used to generate hundreds of thousands (Davie et al., 2018; Svensson et al., 2020) - and recently millions (Datlinger et al., 2021; Ren et al., 2021) - of single-cell transcriptomes at a high sensitivity. However, the high cost of these commercial protocols remains prohibitive for many research applications, and their fixed nature limits custom protocol development.

In order to increase the sensitivity and user-friendliness of open-source microfluidic protocols, and to provide a more flexible and open platform, we developed HyDrop, a new hydrogel-based droplet microfluidic method for high-throughput scRNA-seq or scATAC-seq. We adapted inDrop’s original isothermal extension bead barcoding protocol (Zilionis et al., 2017) to a linear amplification workflow to generate more uniformly barcoded beads. Next, we applied a custom hydrogel bead production process similar to a recently published protocol (Wang et al., 2020) to generate dissolvable beads, improving barcoded primer release and diffusion. We also optimized Drop-seq’s pooled template-switching reverse transcription strategy for application inside the cell/bead emulsion, and optimized the assay’s sensitivity by testing several different cDNA library preparation strategies. The combination of these adaptations resulted in a significantly increased sensitivity for HyDrop-RNA compared to inDrop and Drop-seq at no additional cost, and in a more user-friendly workflow. Additionally, the dissolution of the hydrogel beads stabilizes the cell/bead emulsion during linear amplification thermocycling. This change allowed us to implement to our knowledge the first open-source single-cell ATAC-seq in droplets using hydrogel beads. We applied both technologies to mouse cerebral cortex and generated single-cell ATAC-seq and single-cell RNA-seq data that is highly concordant with reference data.

Results

Generation of dissolvable hydrogel beads with barcodes for scATAC-seq and scRNA-seq

We generated barcoded hydrogel beads that can dissolve and release their embedded barcoded oligonucleotide. Polyacrylamide beads incorporating disulfide cross-linkers and short oligonucleotide polymerase chain reaction (PCR) priming sites were generated by droplet microfluidics similar to a recently published method (Wang et al., 2020). A custom droplet microfluidic chip (Figure 3—figure supplement 1) was employed to produce beads of approximately 50 μm diameter. These hydrogel beads were then barcoded using a modified three-round split-pool linear amplification synthesis method (Klein et al., 2015; Zilionis et al., 2017), resulting in 96 × 96 × 96 (884,736) barcode possibilities (see Supplementary file 1 – ‘Molecular sequence description of HyDrop bead barcoding’ for an in-depth visualization of the nucleic acid sequences in every step). The terminal sequence used in the final round of barcoding can be varied depending on the assay the beads will be used for (see Methods, Figure 1a). A sequence complementary to the Tn5 transposase adapter was used to capture tagmented chromatin fragments in scATAC-seq and a unique molecular identifier (UMI) (Kivioja et al., 2011)+ poly (dT) sequence was used to capture and count poly (A)+ mRNA in scRNA-seq (protocols described further). The barcoded beads were stored in a glycerol-based freezing buffer at –80°C in order to prevent loss of primers over time (Figure 1—figure supplement 1).

Figure 1. Technical overview of HyDrop barcoded bead production and quality control.

(a) Split-pool process for barcoding of dissolvable hydrogel beads for single-cell RNA-sequencing (scRNA-seq) and single-cell assay for transposase-accessible chromatin (scATAC-seq). Unbarcoded hydrogel beads are sequentially distributed over 96 wells, sub-barcoded using linear amplification polymerase chain reaction (PCR), repooled, and re-distributed two more times to generate 96 × 96 × 96 (884,736) possible barcode combinations. Different 3-prime terminal capture sequences are possible depending on the oligonucleotide sequence appended in the last step. (b) Semiquantitative assessment of bead primer incorporation by fluorescence in-situ hybridisation (FISH) using a fluorescein amidite (FAM) probe after every sub-barcoding step shows that bead fluorescence uniformity is retained throughout the barcoding process. (c) FISH with FAM probes complementary to only one of 96 sub-barcode possibilities shows that approximately 1/96 beads exhibit fluorescence for a selected sub-barcode probe. Fluorescence signal is overlaid with a brightfield image at 50% transparency to indicate positions of nonfluorescent beads (see Figure 1—figure supplements 26 for additional quality control experiments and full images).

Figure 1.

Figure 1—figure supplement 1. Hydrogel bead integrity after being frozen and thawed.

Figure 1—figure supplement 1.

Beads were frozen overnight in Tris–EDTA–Tween (TET) buffer or 30% glycerol buffer and thawed at room temperature. Rough edges in the TET buffer sample indicate freezing/thawing damage.
Figure 1—figure supplement 2. Fluorescence signal of freshly barcoded beads.

Figure 1—figure supplement 2.

Beads were incubated with probes complementary to acrydite primer or capture site (poly-T or Tn5 adapter site) before and after barcoding.
Figure 1—figure supplement 2—source data 1. Count, mean diameter, and mean intensity of linear amplification and isothermal extension beads.
Beads were incubated in fluorescent probes complementary to basal acrydite primer or poly-A or ATAC capture site. Absolute size of 1 SD is added in mean diameter and mean intensity columns.
Figure 1—figure supplement 3. Brightfield images of barcoded beads incubated with FAM oligonucleotide probe complementary to one out of 96 possible sub-barcodes.

Figure 1—figure supplement 3.

Segmented fluorescence masks of positive beads overlaid over brightfield image of same beads to visualize positives.
Figure 1—figure supplement 3—source data 1. Total counts and fluorescence-positive counts of beads incubated with fluoresceine amidite (FAM) oligonucleotide probe complementary to 1 out of 96 barcode possibilities.
Figure 1—figure supplement 4. Fluorescence signal of nondissolvable 50 uM HyDrop-RNA beads.

Figure 1—figure supplement 4.

Unbarcoded beads were incubated with probes complementary to the acrydite primer. Images taken immediately after 15- and 50-min incubation in 50 mM of dithiothreitol (DTT), and a short wash using quality control (QC) buffer.
Figure 1—figure supplement 4—source data 1. Count, mean diameter, and mean background-adjusted intensity of beads incubated in 50 mM of DTT for 15 and 50 min as well as negative control.
Absolute size of 1 SD is added in mean diameter and mean intensity columns.
Figure 1—figure supplement 5. HyDrop-ATAC emulsion before and after PCR, with dissolvable and non-dissolvable barcoded hydrogel beads.

Figure 1—figure supplement 5.

(a) HyDrop emulsions made with nondissolvable and dissolvable beads. Picture taken within 5 min after the emulsion was made. (b) HyDrop-ATAC emulsions made with nondissolvable and dissolvable beads before and after thermocycling. Picture taken within 5 min after the thermocycling was completed.
Figure 1—figure supplement 6. HyDrop-RNA bead fluorescence intensity and barnyard plots for acrydite primer concentrations of 50, 12 and 6 uM.

Figure 1—figure supplement 6.

(a) Fluorescence signal of 50, 12, and 6 μM primer concentration beads. Unbarcoded beads were incubated with probes complementary to acrydite primer. (b) Purity plots for HyDrop-RNA species-mixing experiments performed using 50, 12, and 6 uM beads. Cells were assigned to a species if >95% of their unique molecular identifier (UMIs) belonged to transcripts mapping to that species’ genome.

We validated the extension of the hydrogel bead primers using fluorescent probes complementary to the beads 3-prime terminal sequence (Zilionis et al., 2017; Figure 1b and Figure 1—figure supplement 2) and the sub-barcode purity using fluorescent probes complementary to 1 of the 96 sub-barcode possibilities (Figure 1c, Figure 1—figure supplements 2 and 3, Tables S12). These experiments showed that there was no significant loss of primers or mixing of barcodes throughout the barcoding process, and that the beads are uniform in size and primer content. Additional testing revealed that our modified linear amplification barcoding method produced more uniformly barcoded beads compared to the isothermal amplification protocol described in inDrop (Figure 1—figure supplement 2). Furthermore, the disulfide moieties incorporated in both the bead’s polymer matrix and oligonucleotide linker are cleaved when exposed to reducing conditions, such as dithiothreitol (DTT). This chemical method of release is more user-friendly compared to the UV-mediated (Klein et al., 2015; Zilionis et al., 2017) primer release as the beads do not have to be shielded from light. In addition to improved primer release compared to nondissolvable beads (Figure 1—figure supplement 4, Table S3), dissolvable beads did not disrupt the emulsion during the thermocycling (Figure 1—figure supplement 5). We hypothesize that this effect is due to the lower physical stress generated by the dissolved hydrogel as opposed to the solid hydrogel. This added benefit allowed us to implement scATAC-seq (see further) on HyDrop. Finally, by varying the concentration of the acrydite primer during bead synthesis, lower or higher amounts of cleavable barcoded primers could be generated. We found that when the concentration of acrydite primer incorporated in the hydrogel matrix was high (50 μM, similar to InDrop), excess unreacted barcodes could not be sufficiently filtered out in further downstream steps. These primers were then carried over to subsequent reactions, leading to random barcoding of free fragments after droplet merging, and subsequently to cell-mixed expression or chromatin accessibility profiles. The bead primer concentration with an optimal balance between sensitivity and library purity was found to be 12 µM for both scATAC-seq and scRNA-seq (Figure 1—figure supplement 6).

Implementation and accuracy assessment of HyDrop-ATAC

We implemented a new open-source protocol for single-cell ATAC-seq using HyDrop’s dissolvable barcoded hydrogel beads. Nuclei were Tn5 tagmented in bulk and coencapsulated with HyDrop-ATAC beads. Pitstop, a selective small molecule clathrin inhibitor, was supplemented during nuclei extraction and tagmentation to increase nucleus permeability to Tn5 (Mulqueen et al., 2019). Inside the droplet, the hydrogel beads dissolve and release their uniquely barcoded primers inside the droplet due to the presence of DTT carried by the nuclei/PCR mix. Subsequent thermocycling of the emulsion denatures the Tn5 protein complex and releases accessible chromatin fragments within the droplet. These fragments were then linearly amplified and cell indexed by the bead’s barcoded primers after which the emulsion was broken and the indexed ATAC fragments were pooled, PCR amplified, and sequenced (Figure 2, see – Supplementary file 2 ‘Molecular sequence description of HyDrop-ATAC’ for an in-depth visualization of the nucleic acid sequences in every step).

Figure 2. Schematic overview of HyDrop-ATAC.

Figure 2.

Nuclear membrane is visualized in salmon, water droplet is visualized in blue. Nuclei were Tn5 tagmented in bulk and co-encapsulated with HyDrop-ATAC beads, where the hydrogel beads dissolve and release their uniquely barcoded primers. Thermocycling of the emulsion releases accessible chromatin fragments which are then linearly amplified and cell indexed by the bead’s barcoded primers within the droplet. The emulsion is then broken and the indexed ATAC fragments are pooled, PCR amplified, and sequenced.

In order to coencapsulate beads and nuclei with a high capture rate and minimal microfluidic complexity, we developed a custom microfluidic chip (Figure 3a and b). The chip design features one inlet for beads, one inlet for cells or nuclei, and one inlet for the emulsion carrier oil. This configuration is slightly more convenient to operate compared to inDrop’s four channel setup. Several layers of passive filters near the inlet ports mitigate dust and debris buildup during droplet generation to prevent obstruction of the channels. Beads and nuclei were loaded via a tip reservoir to reduce nonlinear flow behavior and the potential accumulation of cells/nuclei and hydrogel beads associated with narrow tubes (Sinha et al., 2019; Hur et al., 2011; Figure 3b). Due to the stability of all flows and the deformable nature of the hydrogel beads, >90% occupancy of hydrogel beads in droplets could be achieved (Abate et al., 2009; Figure 3c), resulting in a final cell recovery of ~65%.

Figure 3. Microfluidics setup employed in HyDrop emulsion generation.

(a) 3d rendering of HyDrop chip design. The design has three inlets: one each for carrier oil, barcoded hydrogel beads and cell/reaction mix (see Figure 3—figure supplement 1 for full design). Passive filters at each inlet prevent dust and debris from entering the droplet generating junction. (b) Diagram and snapshot of cell/bead encapsulation into microdroplets. (c) Microfluidic chip setup on the Onyx integrated microfluidic instrument. Cells and beads are loaded into pipette tips and plugged into a HyDrop chip. Flow of oil and aqueous phases is achieved by Onyx displacement syringe pumps.

Figure 3.

Figure 3—figure supplement 1. Microfluidic chip designs for cell/bead encapsulation and hydrogel bead generation - oil, monomer mix, cell suspensions, and beads can be pumped in through their respective inlets.

Figure 3—figure supplement 1.

At the flow focusing point, aqueous phases (cells/beads/monomer mix) are emulsified by collision with the oil phase at the flow-focusing point. Emulsion is collected at the outlet channel.

To assess the purity of scATAC-seq libraries generated by HyDrop-ATAC, we performed two mixed species experiments. First, we generated single-nuclei ATAC-seq libraries from a 50:50 mixture of human breast cancer (MCF-7) and a mouse melanoma cell line generated previously (Dankort et al., 2009). We developed a custom preprocessing and mapping pipeline for HyDrop-ATAC data (Dankort et al., 2009). After filtering the cell barcodes for a minimum transcription start sites (TSS) enrichment score of 7 and unique fragment count of 1000, we recovered 1353 cells from an input of 2000, and a median of 2705 unique fragments per cell. About 88% of all barcode reads were paired with the whitelist when one mismatch was allowed along the entire length of the 30 bp barcode. An additional 1.16% and 1.13% of HyDrop barcodes could be assigned to the whitelist when two and three mismatches were allowed, respectively. The remaining 9% of barcode sequences could not be identified and are most likely a result of frameshift due to incorporation errors in the synthetic oligonucleotide production process. Notably, only 39.7% of barcode reads could be matched with the whitelist in a public inDrop dataset (Kalish et al., 2020), when one mismatch was allowed. When prefiltered cell barcode sequences were used as a whitelist for a public mouse retina Drop-seq dataset (Macosko et al., 2015), 49.5% of reads could be matched. We identified 98.4% of cells as either human or mouse at a minimum purity of 95% fragments mapping to either species (Figure 4a). Next, we generated libraries from a mixture of MCF-7/PC-3/Mouse cortex (45:45:10) to evaluate whether two human cell types can be distinguished. A spike-in of 10%mouse cells was used as an internal control. We recovered 2602 human cells, 466 mouse cells, and 93 species doublets after filtering for 95% species purity. Clustering human cells (together with the MCF-7 cells from the first species mixing experiment to evaluate batch effects) recovered two distinct populations, each exhibiting specific ATAC-seq peaks near MCF-7 or PC-3 marker genes (Figure 4—figure supplement 1a). Aggregated reads per cluster showed typical ATAC-seq profiles concordant with public bulk ATAC-seq data (Consortium, 2012; Figure 4b, Figure 4—figure supplement 1).

Figure 4. Validation of HyDrop-ATAC on mixed species cell lines.

(a) Scatterplot of the number of unique fragments detected in a 50:50 mixture of human Michigan Cancer Foundation-7 (MCF-7) and mouse melanoma cells colored by local density estimation. (b) reads per genome coverage (RPGC)-normalized aggregate genome tracks comparing HyDrop-ATAC and bulk ATAC-seq profiles of human MCF-7 and prostate cancer-3 (PC-3) cell lines around the estrogen receptor 1 (ESR1) locus, scaled to maximum of all four samples. Aggregate enrichment profile of reads around transcription start site, see Figure 4—figure supplement 1 for clustering of these cells and count correlation with public data. Supplementary source data files available for figure (a).

Figure 4.

Figure 4—figure supplement 1. UMAP and count correlations within bulk ATAC-seq data for MCF-7 and PC-3 HyDrop-ATAC.

Figure 4—figure supplement 1.

(a) MCF-7 and PC-3 HyDrop-ATAC uniform manifold approximation and embedding (UMAP) colored by imputed gene activity for vimentin (VIM) and estrogen receptor 1 (ESR1) – VIM is a marker gene for PC-3 cells, ESR1 is a marker gene for MCF-7 cells. (b) Correlation of counts in peak regions of public bulk ATAC-seq and HyDrop-ATAC data regions were derived from public data, and counts within these regions were correlated for both the HyDrop and bulk data. Color scale encodes each count’s local Gaussian kernel density estimation score.

Application of HyDrop-ATAC on flash-frozen mouse cortex recapitulates cerebral cellular heterogeneity and cell-type specific accessibility profiles

To evaluate the performance of HyDrop-ATAC on primary tissue, we generated single-cell libraries from snap-frozen, dissected adult mouse brain cortex. Libraries were sequenced to approximately 75% duplication rate. After filtering for a minimum of 1000 unique nuclear fragments, a TSS enrichment score of 5, and removing 506 cells (6%) detected as doublets by Scrublet (Wolock et al., 2019), we recovered a total of 7996 single nuclei. Cells passing the filters had a median of 4148 fragments per cell, a median TSS enrichment score of 13, and a median of 53% of fragments in peaks, reflecting high-quality cells and low levels of background signal (Figure 5a–d). Even though the number of unique fragments per cell (~4 K) is lower than that of commercial methods (e.g. 17–20 K per cell for 10× Genomics, see Methods), HyDrop-ATAC achieves comparable results in terms of TSS enrichment and fraction of reads in peaks scores. We used cisTopic (Bravo González-Blas et al., 2019) to reduce the dimensionality of the dataset and the Leiden algorithm (Traag et al., 2019) to cluster cells (Figure 5e). Cell annotation using the aggregated ATAC signal around several neocortex markers (Yao et al., 2021; Zeisel et al., 2018) recovered 19 distinct cell types, similar to previously published scATAC-seq mouse cortex data (Preissl et al., 2018; Li et al., 2020; Figure 5—figure supplement 1). For example, we identified oligodendrocyte precursors and mature oligodendrocytes, marked by exclusive accessibility nearby Sox10 and Pdgfra2, respectively. Within ganglionic eminence-derived interneurons, we were able to further distinguish medial ganglionic eminence-derived subtypes with specific ATAC-seq signal near either Vip or Lamp5, and caudal ganglionic eminence-derived subtypes with accessibility near either Sst or Pvalb. Finally, HyDrop-ATAC data revealed distinct cell-type specific differentially accessible regions (Figure 6a and b).

Figure 5. Application of HyDrop-ATAC on flash-frozen mouse cortex recapitulates cellular heterogeneity.

Aggregate enrichment of ATAC fragments near transcription start sites (TSS) (a), TSS enrichment per barcode (b), fraction of reads in peaks (FRIP) per barcode, (c) and duplication rate per barcode (d) in mouse cortex HyDrop-ATAC data. A minimum TSS enrichment of 5 and a unique number of fragments of 1000 are used as cut-off values to separate cells from background (red lines). Cells are colored by local density estimation. (e) UMAP projection of 7996 mouse cortex nuclei annotated with cell type inferred by accessibility near marker genes. Abbreviations: microglia (MGL), mature oligodendrocytes (MOL), oligodendrocyte precursors (OPC), astrocytes (AST), endothelial cells (END), piriform cortex neurons (PIR), caudal and medial ganglionic eminence derived neurons (CGE, MGE), layers 2–6 intratelencephalic (IT), L5 extratelencephalic (ET), L5/6 near projecting excitatory neurons (NP), L6 corticoencephalic (CT), and deep L6 excitatory neurons (L6b). See Figure 5—figure supplement 1 for cluster marker gene activities. Supplementary source data files available for figures b, c, and d.

Figure 5.

Figure 5—figure supplement 1. Heatmap of mouse cortex HyDrop-ATAC gene activity – gene activity was imputed by normalized accessibility within a 10 kb window around the gene.

Figure 5—figure supplement 1.

Figure 6. Differentially accessible regions (DAR) between cell types recovered by HyDrop-ATAC on mouse cerebral cortex.

Figure 6.

(a) Aggregate accessibility of top 1000 DAR per cluster. (b) Row-scaled, counts-per-million (CPM)-normalized aggregate genome track covering the top one DAR for each cluster.

Implementation of HyDrop-RNA as a hybrid method between inDrop and Drop-Seq

We next implemented a new scRNA-seq assay using barcoded bead primers carrying a 3-prime poly (dT) sequence. Single cells or nuclei were resuspended in a reverse transcriptase mix and coencapsulated into microdroplets with 3-prime poly (dT) HyDrop beads. The same microfluidic chip design was used for both HyDrop-RNA and HyDrop-ATAC. Cells are lysed inside the droplets upon contact with the lysis buffer in which the barcoded beads were suspended. Simultaneously, barcoded primers were released from the hydrogel bead after exposure to DTT present in the reverse transcriptase mix. Reverse transcription inside the emulsion generates thousands of barcoded single-cell cDNA libraries in parallel. The emulsion was then broken and the single-cell transcriptome libraries were processed further in a pooled manner (Figure 7, see Supplementary file 3).

Figure 7. Schematic overview of HyDrop-ATAC.

Figure 7.

Single cells or nuclei are resuspended in a reverse transcriptase mix and coencapsulated into microdroplets with 3-prime poly (dT) HyDrop beads. Cells are lysed inside the droplets and barcoded primers (BC) are released from the hydrogel bead. Reverse transcription inside the emulsion using a template-switching oligo (TSO) generates thousands of barcoded single-cell cDNA libraries in parallel which are processed further in a pooled manner after breaking the emulsion.

‘Molecular sequence description of HyDrop-RNA’ for an in-depth visualization of the nucleic acid sequences in every step, similarly to the InDrop protocol (Zilionis et al., 2017). Although both assays are based on hydrogel beads, HyDrop-RNA differs significantly from InDrop. HyDrop-RNA employs a template switching oligo (TSO) reverse transcription technique (similar to Drop-seq), rather than an in vitro transcription/random hexamer priming workflow. This change simplifies and speeds up the protocol significantly with no reduction in sensitivity.

To improve the assay’s sensitivity, we interrogated the impact of several alterations to the protocol’s reaction chemistry by testing them on a 50:50 human-mouse (human melanoma, mouse melanoma) mixture. We first investigated the use of pooled exonuclease I treatment after reverse transcription to remove unused barcode primers. We reasoned that these unused barcoded primers could potentially prime transcripts during the subsequent bulk inverse-suppressive polymerase chain reaction (IS-PCR), leading to a loss of purity of transcripts associated with a given barcode. As evident from the increase in pure cells detected, we found that exonuclease I treatment indeed improved assay purity (Figure 8—figure supplement 1). We then tested the implementation of a locked nucleic acid (LNA) in the TSO, as it has been shown to increase assay sensitivity due to increased stability of the TSO-cDNA complex (Picelli et al., 2014). We also investigated the addition of guanosine triphosphate (GTP) and polyethylene glycol (PEG) to the in-droplet reverse transcription reaction. Both the addition of PEG as a molecular crowding agent and GTP as a stabilizer of the TSO complex has been shown to improve assay sensitivity in SMART-seq3 (Hagemann-Jensen et al., 2020). Finally, we wondered whether a second strand synthesis library construction approach could outperform the TSO/IS-PCR approach. In order to test this, we compared both alkaline hydrolysis and enzymatic treatment (RNAse H) to remove the RNA strand from the first strand product, and evaluated the performance of both Bst 2.0 DNA polymerase and Klenow (exo-) (Hughes et al., 2020; Stickels et al., 2021) fragment for second strand synthesis. We found that the classical TSO and IS-PCR protocol supplemented with GTP/PEG performed best, yielding a median of 2110 UMIs and 1325 genes per cell with a species purity of 90.1% (Figure 8a and Figure 8—figure supplement 1). Accordingly, the GTP/PEG method was used in all further HyDrop-RNA experiments. Applying this protocol to a 50:50 human-mouse (MCF-7, mouse melanoma) mixture recovered 1235 human and 846 mouse cells with 169 species doublets at a cutoff of 95% species purity, with a median of 1439 UMIs and 904 genes per cell (Figure 8b).

Figure 8. Validation and benchmarking of HyDrop-RNA on species mixed cell line samples and mouse cerebral cortex.

(a) Comparison of unique molecular identifier (UMI) and gene count of HyDrop-RNA with and without Exo I treatment postdroplet merging, with the use of a locked nucleic acid (LNA) template switching oligo (TSO) and with GTP/PEG, BST2.0, and Klenow fragment library preparation. Inner lines represent Q1, median, and Q3. See Figure 8—figure supplement 1 for species purity plots of these experiments. (b) Scatterplot of human and mouse UMIs detected in a 50:50 mixture of human MCF-7 and mouse melanoma cells colored by local density estimation. (c) Comparison of UMI and gene count of public inDrop mouse cortex data, public Drop-seq mouse retina data, and HyDrop-RNA mouse cortex data. Inner lines represent Q1, median, and Q3. See Figure 8—figure supplement 2 for additional quality comparison including commercial methods. Supplementary source data files available for figures a, b, and c.

Figure 8.

Figure 8—figure supplement 1. HyDrop-RNA species-mixing purity plots for several different protocol versions.

Figure 8—figure supplement 1.

Protocol versions include non-Exo I treated, Exo I treated, template switching oligo- locked nucleic acid (TSO-LNA), Klenow fragment, Bst2.0, and GTP/PEG HyDrop-RNA libraries. UMI, unique molecular identifier.
Figure 8—figure supplement 2. Collection of quality control metrics for inDrop, Drop-seq, HyDrop-RNA, and 10× datasets on mouse brain cells.

Figure 8—figure supplement 2.

‘% Reads with valid barcodes’ is calculated after allowing one mismatch to the whitelist. ‘% Duplicates’ is calculated on mapped transcripts using unique molecular identifiers (UMIs). ‘% Genome’ and ‘% Transcriptome‘ are the percentages of all raw reads mapping to the reference genome and transcriptome, respectively. ‘% Raw reads as UMIs in cells’ is the sum of UMIs in filtered cells divided by the total number of sequenced reads. Supplementary source data files available.

HyDrop-RNA on flash-frozen mouse cortex recovers cerebral cell types

We then used HyDrop-RNA to generate 9508 single-nuclei transcriptomes from snap-frozen mouse cortex in a single experiment. At a sequencing saturation of approximately 60% duplicates, we achieve a median of 3389 UMIs and 1658 genes per cell before, and 3404 UMIs and 1662 genes per cell after doublet filtering (Figure 8c), compared to the median of 1920 UMIs and 1321 genes detected by inDrop snRNA-seq on mouse auditory cortex neurons (Kalish et al., 2020) and the median of 1071 UMIs and 763 genes detected by Drop-seq on mouse retina neurons (Macosko et al., 2015). Both datasets also exhibited a lower read alignment efficiency compared to HyDrop, with Drop-seq mapping 52%/21%, and inDrop mapping 48%/28% of reads to the mouse genome/transcriptome, while HyDrop reaches 88%/55%. HyDrop’s ‘sequencing efficiency’ is higher than both inDrop and Drop-seq’s: 7.44% of HyDrop’s sequenced reads end up as mapped transcripts with a UMI, whereas this metric is 3.24% for the inDrop dataset and 4.46% for the Drop-seq dataset (Figure 8—figure supplement 2). 10× Chromium v2 gene expression reference data reports a median number of genes of 775–2679 and a median number of UMIs of 1127–6570 on E18 and adult mouse brain nuclei, and a sequencing efficiency of 2.54%–20.9% (see methods). Comparing the top percluster differentially expressed genes with markers from the Allen Brain Atlas SMART-seq data (Yao et al., 2021) revealed 30 distinct populations corresponding to previously identified cell types (Figure 9a-d, Figure 9—figure supplement 1). In addition to the major neuronal and glial populations previously detected in our HyDrop-ATAC experiment, we detect a small population of vascular leptomeningeal cells and layer two intratelencephalic neurons from the medial entorhinal area (L2 IT ENTm). We also detect both D1 and D2 medium spiny neurons as a result of residual striatal tissue and layer 3 Scnn1a + neurons from the retrosplenial and anterior cingulate area (L3 RSP ACA), concordant with Atlas SMART-seq data (Yao et al., 2021).

Figure 9. Application of HyDrop-RNA on flash-frozen mouse cerebral cortex.

Mouse cortex UMAP colored by log-scaled unique molecular identifier counts of Cux1, Rorb, Foxp2 (a), Aqp4, Sox10, Selplg (b), Chodl, Sst, and Pvalb (c). Colors are scaled to minimum and maximum values. (d) UMAP projection of 9507 mouse cortex nuclei annotated with cell type inferred by marker gene expression. Abbreviations: microglia (MGL), mature oligodendrocytes (MOL), oligodendrocyte precursors (OPC), astrocytes (AST), endothelial cells (END), piriform cortex neurons (PIR), caudal and medial ganglionic eminence derived neurons (CGE, MGE), layers 2–6 intratelencephalic (IT), pyramidal tract (PT), near projecting excitatory neurons (NP) and corticoencephalic (CT) neurons, layer two intratelencephalic medial entorhinal area neurons (L2 IT ENTm), L2/3 intratelencephalic area prostriata neurons (L2/3 IT APr), layer three intratelencephalic entorhinal neurons (L3 IT ENTl), layer retrosplenial and anterior cingulate area neurons (L3 RSP ACA), deep L6 excitatory neurons (L6b), D1 and D2 medium spiny neurons (MSN), and vascular leptomeningeal cells (VLMC). See Figure 9—figure supplement 1 for expression of top differentially accessible genes within these clusters.

Figure 9.

Figure 9—figure supplement 1. Expression of HyDrop-RNA mouse cortex top three differentially expressed genes from each cluster.

Figure 9—figure supplement 1.

Dot sizes denote fraction of cluster in which the gene (row) was expressed. Dot color encodes the column min/max-scaled expression of the gene (row).

HyDrop-RNA on FAC-sorted Drosophila neurons confirms high capture rate even on small input samples

To assess HyDrop-RNA’s performance on low cell input samples, we also performed the protocol on approximately 1500 FAC-sorted neurons from the Drosophila brain. We dissected brains in which mCherry expression was driven in specific cell populations by a Gal4 driver line (R74G01-Gal4) and used mCherry-positive cells as input for HyDrop-RNA (Figure 10a). Of the 1500 cells obtained after FACS sorting, we recovered 973 fly brain cells with a median of 1307 UMIs and 640 genes (Figure 10b). In comparison, in-house Drop-seq performed on the entire fly brain recovered a median of 579 UMIs and 289 genes per cell at a deeper sequencing saturation (Davie et al., 2018). Annotation of the 973 single-cell transcriptomes obtained by HyDrop confirmed the presence of all three Kenyon cell subtypes alongside T1 and Tm1 neurons, as expected from our stainings and previous reports (Konstantinides et al., 2018). Surprisingly, we also detected a small population of Mi1 neurons (Figure 10c-e, Figure 10—figure supplement 1).

Figure 10. Application of HyDrop-RNA on FAC-sorted Drosophila neurons.

(a) Confocal maximum intensity projection of R74G01-Gal4> UAS-mCherry brain. (b) Violin plot of unique molecular identifiers (UMIs) and genes detected in nuclei derived from FAC-sorted fly neurons. Inner lines represent Q1, median, and Q3. Fly neuron UMAP colored by log-scaled UMI counts of Mef2, ab, DAT (c) and Tk, Awh, ey (d). Colors are scaled to minimum and maximum values. (e) UMAP of 973 FAC-sorted Drosophila neurons annotated with cell types inferred by marker gene expression (KC, Kenyon cells; Tm1, transmedullary neuron; Mi1, medullary intrinsic neuron). See Figure 10—figure supplement 1 for expression of marker genes within these clusters. Supplementary source data file available for figure b.

Figure 10.

Figure 10—figure supplement 1. Expression of marker genes of HyDrop-RNA on FAC-sorted fly brain cells.

Figure 10—figure supplement 1.

Dot sizes denote fraction of cluster in which the gene (row) was expressed. Dot color encodes the column min/max-scaled expression of the gene (row).

Discussion

We developed HyDrop, an open-source platform for single-cell RNA- or single-cell ATAC-seq using hydrogel beads. We applied HyDrop to generate thousands of mouse, human, and Drosophila single-cell gene expression and chromatin accessibility profiles to demonstrate the protocol’s applicability to a variety of different biological samples. HyDrop-ATAC and HyDrop-RNA experiments on mouse and fly tissues recapitulated the cellular heterogeneity of these complex samples and strongly agreed with reference datasets (Davie et al., 2018; Yao et al., 2021; Preissl et al., 2018; Li et al., 2020). We found HyDrop-RNA to outperform its open-source predecessors (Klein et al., 2015; Macosko et al., 2015; Chen et al., 2019) both in terms of sensitivity and user-friendliness. Moreover, at a per-cell library cost of <$0.03 (Supplementary file 5) it does so at a significantly lower cost compared to commercial droplet-microfluidic alternatives (Zheng et al., 2017; Satpathy et al., 2019; Lareau et al., 2019).

Optimization and modification of HyDrop’s reaction chemistry and bead composition may lead to further improvements in sensitivity. For example, we consider that benchmarking additional bead barcoding strategies, such as direct on-bead DNA synthesis or barcode ligation (Delley and Abate, 2020) instead of linear amplification may further improve overall barcode quality. Ligation strategies would also allow the incorporation of oligonucleotide modifications in the barcode’s capture sequence, a possible necessity for adaptation of HyDrop to novel assays. For example, this change would permit HyDrop’s extension to protocols such as s3-ATAC (Mulqueen et al., 2021), which leverages the use of uracil bases and uracil-intolerant polymerases to double the amount of usable fragments generated by Tn5. The current 96 Kalish et al., 2020 design could also be changed to 384 Macosko et al., 2015 (like inDrop) or even 768 Macosko et al., 2015. This two-step barcoding approach would result in shorter barcode sequences, allowing for cost-saving in sequencing and increased flexibility when pooling HyDrop libraries together with other library types such as 10× Genomics 3’ gene expression libraries. However, rescaling the barcodes in this manner would also necessitate the acquisition of a larger pool of sub-barcodes - from three 96 well plates to 8 and 16 plates respectively - and reduce the total barcode complexity from 884,736 to 147,456 and 589,824. Finally, one of the main challenges of implementing custom microfluidics systems is the considerable price of accurate stepper motor syringe pumps (~$2000 per independent channel) and the need for trained personnel. However, several simplified alternatives have recently emerged (Langer et al., 2018; Kim et al., 2016), and we believe their application with HyDrop could further reduce the start-up costs and complexity of single-cell experiments.

We envision that HyDrop’s reduction in both cost and labor will accelerate the scaling of large-scale atlasing efforts and bring the benefits of single-cell sequencing to smaller projects. Additionally, HyDrop’s dissolvable bead synthesis and linear amplification barcoding toolkit could potentially be exploited to produce more complex beads incorporating multiple capture sequences, leading to implementation of single-cell (multi) omics assays such as the capture of accessible chromatin, (m)RNA, and surface epitopes (Stoeckius et al., 2017) or intracellular proteins (Rivello et al., 2020) from single cells.

Materials and methods

Microfluidic droplet generator manufacturing

Microfluidic droplet generators were produced using standard SU-8 lithography and polydimethylsiloxane (PDMS) lithography according to well-established protocols (Xia and Whitesides, 1998). Briefly, the designs for droplet generators were made in AutoCAD R2014 and the designs are printed onto a chrome mask using a laser writer. The SU-8 lithography is performed on a 4-inch silicon wafer using SU 8–2050 (Microchem) negative photoresist using UV aligner (EVG-620). As per manufacturers’ recommendation, spin coating of the wafer with SU8 was performed at 500 rpm (ramp 100 rpm/s) - 10 s and 2000 rpm (ramp: 300 rpm/s) - 30 s, to achieve a feature height of 70–80 μm. For preparing the PDMS chip, a mixture of PDMS monomer and cross-linker (Dow Corning SYLGARD 184) was prepared at a ratio of 10:1 and mixed thoroughly. The mixture was degassed in vacuum for 45 min and poured onto an SU-8 master template and baked at 80°C for 4 hr. Inlet ports were cut using a 1-mm biopsy needle after which the chips were exposed to high-voltage plasma for 30 s and bonded onto a glass slide. 5 μL of 2% Trichloro (1 H,1H,2H,2H-perfluorooctyl)silane in hydrofluoroether (HFE) was injected into each channel, incubated for 10 min at room temperature and excess oil was removed by applying pressurized air. Chips were finally baked at 100°C for 2 hr (more detailed methods for photolithography and PDMS lithography in https://doi.org/10.17504/protocols.io.bvpin5ke).

Barcoded hydrogel bead manufacturing and storage

Dissolvable hydrogel beads are synthesized similar to a previously published protocol Ren et al., 2021 and barcoded according to a modified inDrop protocol (Zilionis et al., 2017). For synthesizing 2–3 mL batch of beads, 2 mL of Bead Monomer Mix (6% acrylamide, 0.55% bisacryloylcystoylamine, 10% Tris-buffered saline with EDTA and Triton X-100 (TBSET) (10 mM Tris-HCl pH 8, 137 mM NaCl, 2.7 mM KCl, 10 mM EDTA, 0.1% Triton X-100), 12 μM acrydite primer, 0.6% ammonium persulfate) was encapsulated into 50 μm diameter droplets in HFE-7500 Novac oil with EA-008 surfactant (RAN Biotech). 1 mL aliquots of the resulting emulsion was layered with 400 μL of mineral oil and incubated at 65°C for 14 hr. Excess mineral oil and the emulsion oil were removed and 2–3 washes with 1 mL of droplet breaking solution (20% PFO in HFE) were performed. Beads were pelleted at 5000 xg, 4°C for 30 s and washed twice in 1 mL of 1% SPAN-80 in hexane. Beads were sequentially washed in TBSET until all hexane phase was removed.

Beads were washed twice in Bead Wash Buffer (10 mM Tris-HCl pH 8, 0.1% Tween-20), twice in PCR Wash Buffer (10 mM Tris-HCl pH8, 50 mM KCl, 1.5 mM MgCl2, 0.1% Tween-20). The subsequent liquid handling in the 96-well plate is performed using Hamilton microlab STAR robot. 22.5 μL of beads were distributed to a 96-well plate. 2.5 μL of 100 μM sub-barcode primer and 25 μL of KAPA HiFi Hotstart master mix (Roche) were added to each well and the plate was thermocycled (95°C 3 min, 5 cycles of 98°C 20 s, 38°C 4 min, 72°C 2 min, 1 cycle of 98°C 1 min, 38°C 10 min, 72°C 4 min, followed by a final hold on 4°C) with intermittent vortexing during every annealing step. 50 μL of STOP-25 (10 mM Tris-HCl pH 8, 25 mM EDTA, 0.1% Tween-20, 100 mM KCl) was added to each well to deactivate the polymerase and its contents were pooled. Remaining beads in wells were washed out with 100 μL of STOP-25 and the beads were rotated at room temperature for 30 min. Beads were then washed with STOP-10 (10 mM Tris-HCl pH 8, 10 mM EDTA, 0.1% Tween-20, 100 mM KCl) and rotated for 10 min in denaturation solution (150 mM NaOH, 85 mM BRIJ-35). Beads were washed twice in denaturation solution and twice more in neutralization solution (100 mM Tris-HCl pH 8, 10 mM EDTA, 0.1% Tween-20, 100 mM NaCl). The sub-barcoding step was repeated twice more for a total of three sub-barcodes.

Hydrogel beads were sequentially filtered using a 70 μm strainer (Falcon). For both the HyDrop-ATAC and RNA beads were pelleted at 300 xg, 4°C and resuspended in 5 mL of Bead Freezing Buffer (150 mM NaCl, 125 mM Tris-HCl pH 7, 10 mM MgCl2, 4% Tween-20, 0.75% Triton X-100, 30% glycerol, 0.3% BSA). Beads were pelleted at 300 xg, 4°C and resuspended in 5 mL of Bead Freezing Buffer a second time and incubated at 4°C for at least 3 hr. Beads were pelleted at 1000 xg, 4°C and the pellet was aliquoted into 35 μL aliquots and stored at –80°C for long-term storage (further method details in https://doi.org/10.17504/protocols.io.b4cyqsxw).

Hydrogel bead fluorescence in situ hybridization quality control

Bead QC was performed as described previously (Zilionis et al., 2017). Briefly, 10 μL of hydrogel beads were resuspended in 1 mL of hybridization buffer (5 mM Tris-HCl pH 8.0, 5 mM EDTA, 0.05% Tween-20, 1 M KCl) and centrifuged for 1 min at 1000 xg. The wash was repeated once more, and 960 μL of the supernatant was removed. 4 μL of 200 μM specific FAM probe was added depending on which part of the barcode needed testing (see supplementary oligonucleotide table). The beads were incubated at room temperature in the dark for 30 min. Beads were washed thrice in QC buffer and visualized under a Zeiss Axioplan two microscope and a Zeiss Colibri seven light source (300 ms exposure time, 80% light source intensity). Occasional punctate intensities (diameter ~1 μm) were observed in quality control performed on 10× beads and are thought to be dye precipitates or crystals.

Fluorescence images were analyzed using opencv2 (Bradski, 2015). Briefly, foreground and background were determined using Otsu’s method. Circles were detected in the foreground using Hough’s algorithm. The mean grayscale intensity of the background was subtracted from the grayscale intensity at the center of each detected circle. The center intensity method was favored over a volumetric intensity measurement over the entire bead as slight differences in focus between images can affect intensity near the edge of the bead. For the 1/96 sub-barcode experiment, both fluorescence and brightfield images were taken of the same beads incubated with 1 out of 96 possible sub-barcodes. Total number of beads was counted manually from the brightfield image, while positive beads were counted using the image analysis tool from the fluorescence images. A mask of the positive beads from the fluorescence image was overlaid onto the brightfield image to mark positive beads for visualization. Image analysis tool code and raw data of all used images available at https://github.com/aertslab/hydrop_data_analysis, (copy archieved at swh:1:rev:059bf5a7779dc2894670ecf5f820c14bceb68493; Rop, 2022 and https://doi.org/10.5281/zenodo.6415968).

Cell culture and cell dissociation

MCF-7 cells (NCI-DTP Cat# MCF7, RRID:CVCL_0031) were cultured in RPMI1640 (ThermoFisher 11875093) medium supplemented with 10% FBS (ThermoFisher 10270–106), 1% penicillin/streptomycin (Life Technologies 15140122), and 10 ug/mL insulin (Sigma Aldrich I9278) and passaged twice per week. PC-3 cells (NCI-DTP Cat# PC-3, RRID:CVCL_0035) were cultured in RPMI1640 medium supplemented with 10% FBS and 1% penicillin/streptomycin and passaged twice per week. Mouse melanoma cells were cultured in DMEM (ThermoFisher 13345364) supplemented with 10% FBS and 1% penicillin/streptomycin and passaged once per week. MM087 melanoma cells were cultured in F-10 Nutrient mix supplemented with 10% FBS and 1% penicillin/streptomycin and passaged once per week. All cell lines tested negative for mycoplasma prior to use. The identity of PC-3, MCF-7, and mouse melanoma cell lines were correctly identified since their aggregate scATAC-seq data correlated highly with public data (see Figure 4b, Figure 4—figure supplement 1 for PC-3 and MCF-7, mouse melanoma data not shown). The identity of our MM087 was confirmed by SNP comparison with the MM087 genome for our manuscript ‘Robust gene expression programs underlie recurrent cell states and phenotype switching in melanoma’ (Wouters et al., https://doi.org/10.1038/s41556-020-0547-3).

Cells were washed in PBS and dissociated into single-cell suspensions by adding 1.5 mL of 0.05% trypsin (Life Technologies 25300054) and waiting for 5–7 min. The single-cell suspension was centrifuged at 500 rcf for 5 min at 4°C and the resulting pellet was resuspended in PBS. This PBS wash was repeated once more and the single-cell suspension was processed further.

Fly rearing and cell dissociation

GMR74G01-Gal4 (BDSC Cat# 39868, RRID:BDSC_39868) and UAS-mCherry (BL#38425) flies were obtained from Bloomington Drosophila Stock Center. The resulting cross-strain was raised on standard cornmeal-agar medium at 25°C at a 12 hr light/dark cycle. Fifty adult brains were dissected in DPBS and transferred to a tube containing 100 μL of cold DPBS solution. Samples were centrifuged at 500 rcf for 1 min and the supernatant was replaced by 50 μL of dispase (3 mg/mL, Sigma-Aldrich, D4818, 2 mg) and 75 μL of collagenase I (100 mg/mL, Invitrogen, 17100–017). Brains were dissociated in a Thermoshaker (Grant Bio PCMT) at 500 rpm for 2 hr at 25°C, with pipette mixing every 15 min. Cells were subsequently washed with 1000 μL of cold DPBS solution and centrifuged at 500 rcf for 5 min at 4°C and resuspended in 250 μL of DPBS with 0.04% BSA. Cell suspensions were passed through a 10 μM pluriStrainer (ImTec Diagnostics, 435001050). Cells were sorted based on viability and mCherry positivity using the Sony MA900 cell sorter. Sorted cells were collected into Eppendorf tubes precoated with 1% BSA.

Cell line nuclei extraction

A pellet of one million dissociated cells or fewer was incubated on ice in 200 μL of ATAC lysis buffer (1% BSA, 10 mM Tris-HCl pH 7.5, 10 mM NaCl, 0.1% Tween-20, 0.1% NP-40, 3 mM MgCl2, 70 μM Pitstop in DMSO, 0.01% digitonin) for 5–7 min. 1 mL of ATAC nuclei wash buffer (1% BSA, 10 mM Tris-HCl pH 7.5, 0.1% Tween-20, 10 mM NaCl, 3 mM MgCl2) was added and the nuclei were pelleted at 500 xg, 4°C for 5 min. The resulting pellet was resuspended in 100 μL of ice-cold PBS and filtered with a 40 μm strainer (Flowmi).

Mouse cortex dissection

All animal experiments were conducted according to the KU Leuven ethical guidelines and approved by the KU Leuven Ethical Committee for Animal Experimentation (approved protocol numbers ECD P183/2017). Mice were maintained in a specific pathogen-free facility under standard housing conditions with continuous access to food and water. Mice used in the study were 57 days old and were maintained on 14 hr light, 10 hr dark light cycle from 7 to 21 hr. In this study, cortical brain tissue from female P57 BL/6Jax was used. Animals were anesthetized with isoflurane, and decapitated. Cortices were collected, divided in four equal quadrants along the dorsoventral and anterior-posterior axis, and immediately snap frozen in liquid nitrogen. For HyDrop-ATAC, a ventral/posterior quadrant from the left hemisphere was used. For HyDrop-RNA, a dorsal/anterior quadrant was used from the left hemisphere of a second mouse.

Snap-frozen mouse cortex nuclei extraction

For the preparation of nuclei for scRNA-seq, we used a modified protocol from the recently published single-nuclei preparation toolbox Delley and Abate, 2020 to extract nuclei from snap-frozen mouse cortex samples. Briefly, a ~1 cm3 frozen piece of mouse cortex tissue was transferred to 0.5 mL of ice-cold homogenization buffer (Salt-tris solution - 146 mM NaCl, 10 mM Tris 7.5, 1 mM CaCl2, 21 mM MgCl2, 250 mM sucrose, 0.03% Tween-20, 0.01% BSA, 25 mM KCl, 1 mM 2-mercaptoethanol, 1× complete protease inhibitor, 0.5 U/ul of RNAse In Plus (Promega)) in a Dounce homogenizer mortar and thawed for 2 min. The tissue was homogenized with 10 strokes of pestle A and 5 strokes of pestle B until a homogeneous nuclei suspension was achieved. The resulting homogenate was filtered through a 70-μm cell strainer (Corning). The homogenizer and the filter are rinsed with an additional 500 μL of homogenization buffer. The tissue material was pelleted at 500 xg and the supernatant was discarded. The tissue pellet was resuspended in a homogenization buffer without Tween-20. An addition 1.65 ml of homogenization buffer was topped up and mixed with 2.65 ml of gradient medium (75 mM sucrose, 1 mM CaCl2, 50% Optiprep, 5 mM MgCl2, 10 mM Tris 7.5, 1 mM 2-mercaptoethanol 1× complete protease inhibitor, 0.5 U/ul of RNAse In Plus (Promega)). 4 mL of 29% iodoxanol cushion was prepared with a diluent medium (250 mM sucrose, 150 mM KCl, 30 mM MgCl2, 60 mM Tris 8) and was loaded into an ultracentrifuge tube. 5.3 mL of sample in homogenization buffer + gradient medium was gently layered on top of the 29% iodoxanol cushion. Sample was centrifuged at 7700 xg, 4°C for 30 min and the supernatant was gently removed without disturbing the nuclei pellet. Nuclei were resuspended in 100 μL of nuclei buffer (1% BSA in PBS + 1 U/ul of RNAse Inhibitor).

For the preparation of nuclei for ATAC-seq, we used a slightly modified protocol to extract nuclei from snap-frozen mouse cortex samples. Briefly, a ~1 cm3 frozen piece of mouse cortex tissue was transferred to 1 mL of ice-cold homogenization buffer (320 mM sucrose, 10 mM NaCl, 3 mM Mg (OAc), 10 mM Tris 7.5, 0.1 mM EDTA, 0.1% IGEPAL-CA360, 1× complete protease inhibitor and 1 mM DTT) in a Dounce homogenizer mortar and thawed for 2 min. The tissue was homogenized with 10 strokes of pestle A and 5 strokes of pestle B until a homogeneous nuclei suspension was achieved. The resulting homogenate was filtered through a 70 μm cell strainer (Corning). 2.65 mL of ice-cold gradient medium was added to 2.65 mL of homogenate and mixed well. 4 mL of 29% iodoxanol cushion (129.2 mM sucrose, 77.5 mM KCl, 15.5 mM MgCl, 31 mM Tris-HCl pH 7.5, 29% iodoxanol) was loaded into ultracentrifuge tube. 5.3 mL of sample in homogenization buffer + gradient medium was gently layered on top of the 29% iodoxanol cushion. Sample was centrifuged at 7700 xg, 4°C for 30 min and the supernatant was gently removed without disturbing the nuclei pellet. Nuclei were resuspended in 100 μL of nuclei buffer (1% BSA in PBS).

HyDrop-ATAC library preparation

A total of 50,000 nuclei were resuspended in 50 μL of ATAC reaction mix (10% DMF, 10% Tris-HCl pH 7.4, 5 mM MgCl2, 5 ng/μL Tn5, 70 μM Pitstop in DMSO, 0.1% Tween-20, 0.01% digitonin) and incubated at 37°C for 1 hr. 100 μL of 0.1% BSA in PBS was added and the nuclei were pelleted at 500 xg, 4°C for 5 min and resuspended in 40 μL of 0.1% BSA in PBS.

Tagmented nuclei were added to 100 μL of PCR mix (1.3× Phusion HF Buffer, 15% Optiprep, 1.3 mM dNTPs, 39 mM DTT, 0.065 U/μL Phusion HF polymerase, 0.065 U/μL Deep Vent polymerase, 0.013 U/μL ET SSB). PCR mix was coencapsulated with 35 μL of freshly thawed HyDrop-ATAC beads in HFE-7500 Novac oil with EA-008 surfactant (RAN Biotech) on the Onyx microfluidics platform (Droplet Genomics). The resulting emulsion was collected in aliquots of 50 μL total volume and thermocycled according to the linear amplification program (72°C 15 min, 98°C 3 min, 13 amplification cycles of [98°C 10 s, 63°C 30 s, 72°C 1 min], followed by a final hold on 4°C). 125 μL of recovery agent (20% PFO in HFE), 55 μL of GITC buffer (5 M GITC, 25 mM EDTA, 50 mM Tris-HCl pH 7.4) and 5 μL of 1 M DTT was added to each separate aliquot of 50 μL thermocycled emulsion and incubated on ice for 5 min. 5 μL of Dynabeads was added to the aqueous phase and incubated for 10 min. Dynabeads were pelleted on a Nd magnet and washed twice with 80% EtOH. Elution was performed in 50 μL of EB-DTT-Tween (10 mM DTT, 0.1% Tween-20 in EB (10 mM Tris-HCl, pH 8.5)). A 1× Ampure bead purification was performed according to manufacturer’s recommendations. Elution was performed in 30 μL of EB-DTT (10 mM DTT in EB). Eluted library was further amplified in 100 μL of PCR mix (1× KAPA HiFi, 1 μM index i7 primer, 1 μM index i5 primer). Final library was purified in a 0.4×–1.2× double-sided Ampure purification and eluted in 25 μL of EB-DTT (10 mM DTT in EB). User-friendly protocol available on https://www.protocols.io/view/hydrop-atac-v1-0-bxsbpnan. A detailed cost analysis of HyDrop-ATAC can be found in supplementary sheet 2.

HyDrop-RNA single-cell library preparation

For a recovery of 2000 cells, 3795 cells were resuspended in 85 μL of RT mix (1× Maxima RT Buffer, 0.9 mM dNTPs, 25 mM DTT, 1.3 mM GTP, 15% Optiprep, 1.3 U/μL RNAse inhibitor, 15 U/μL Maxima hRT, 12.5 μM TSO, 4.4% PEG-8000). RT mix was co-encapsulated with 35 μL of freshly thawed HyDrop-RNA beads in RAN oil on the Onyx microfluidics platform. The resulting emulsion was collected in aliquots of 50 μL total volume and thermocycled according to the RT program (42°C for 90 min, 11 cycles of [50°C for 2 min, 42°C for 2 min], 85°C for 5 min, followed by a final hold on 4°C). 125 μL of recovery agent (20% PFO in HFE), 55 μL of GITC Buffer (5 M GITC, 25 mM EDTA, 50 mM Tris-HCl pH 7.4) and 5 μL of 1 M DTT was added to each separate aliquot of 50 μL thermocycled emulsion and incubated on ice for 5 min. 99 μL of Ampure XP beads was added to the aqueous phase and incubated for 10 min. Ampure beads were pelleted on a Nd magnet and washed twice with 80% EtOH. Elution was performed in 30 μL of EB-DTT-Tween (10 mM DTT, 0.1% Tween-20 in EB). Exonuclease treatment was performed by adding 4 μL of 10× NEBuffer 3.1, 4 μL of Exo I, and 2 μL of dH2O to 30 μL of eluted library. The Exo I reaction mix was incubated at 37°C for 5 min, 80°C for 1 min for heat inactivation followed by a final hold at 4°C. 2 μL of 1 M DTT was added and a 0.8× Ampure XP purification was performed according to manufacturer’s recommendations. cDNA was eluted in 40.5 μL of EB-DTT (10 mM DTT in EB) and added to ISPCR mix (40 μL library, 50 μL 2× KAPA HiFi, 10 μL 10 μM TSO-P primer). PCR cycling was performed according to the ISPCR program (95°C for 3 min, 13 cycles of [98°C for 20 s, 63°C for 20 s, 72°C for 3 min.], 72 °C for 5 min) followed by a final hold at 4°C. 2 μL of 1 M DTT was added and a 0.6× Ampure XP purification was performed according to manufacturer’s recommendations. cDNA was eluted in 28.5 μL of EB-DTT. Final sequencing library was prepared according to the following customized NEB Ultra II FS protocol (NEB E7805S). 80 ng of amplified cDNA was fragmented in Ultra II fragmentation mix (26 μL of amplified cDNA, 7 μL of NEBNext Ultra II FS Reaction Buffer, 2 μL of NEBNext Ultra II FS Enzyme Mix) on the following thermocycling program: 37°C for 10 min, 65°C for 30 min, and a final hold at 4°C. 15 μL of EB was added and a 0.8× Ampure purification was performed according to manufacturer’s recommendation and eluted in 35 μL. Fragmented library was adapter ligated in NEBNext Ultra II adapter ligation mix (35 μL of fragmented library, 30 μL of NEBNext Ultra II Ligation Master Mix, 1 μL of NEBNext Ligation Enhancer, 2.5 μL of NEBNext Adapter for Illumina) at 20°C for 15 min, with 4°C final hold. 28.5 μL of EB was added and a 0.8× Ampure purification was performed according to manufacturer’s recommendation and eluted in 30 μL. Eluted library was amplified in PCR master mix (50 μL 2× KAPA HiFi, 10 μL 10 μM Hy-i7 primer, 10 μL 10 μM Hy-i5 primer, 30 μL eluted library) in the following thermocycling program: 95°C for 3 min, 13 cycles of (98°C for 20 s, 64°C for 30 s, 72°C for 30 s), 72°C for 5 min, and a final hold at 4°C. Sequencing-ready library was purified using a 0.8× Ampure purification and eluted in 30 μL of EB. User-friendly protocol available on https://doi.org/10.17504/protocols.io.b4xwqxpe.

HyDrop-RNA optimization trials

We performed six trials on a 50:50 mixture of human melanoma (MM087) and mouse melanoma (MMel). Trials were performed as described in the general HyDrop-RNA protocol, but with the following changes. All trials, except for the GTP/PEG trial, were performed using the following RT reaction mix (1.6× Maxima h-RT buffer, 1.6 mM dNTPs, 47 mM DTT, 15% Optiprep, 1.6 U/μL RNAse Inhibitor, 15.7 U/μL Maxima hRT, 12.5 μM TSO). For the exocondition, the exonuclease I treatment was skipped. For all other conditions the exonuclease I treatment was performed as described above. For the TSO-LNA trial, an LNA TSO was used instead of the regular TSO. For the GTP/PEG trial, all steps were performed as described in the main protocol.

For the Klenow fragment second strand synthesis trial, the purified first strand product was treated with 1 μL of Escherichia coli RNase H (NEB M0297S). The mixture was incubated at 37°C for 30 min after which the enzyme was inactivated using 10 mM EDTA. The single stranded product was purified using 1.2× Ampure XP bead purification (BD sciences) and eluted in 25 μL of EB buffer. dN-SMRT primer was added to the single strand product to a final concentration of 2.5 μM and the mixture was denatured by incubation at 95°C for 5 min. The sample was then allowed to cool to room temperature and incorporated in the Klenow enzyme mix (1× Maxima h-RT buffer, 1 mM dNTP, 1 U/μL of Klenow Exo-; NEB M0212L) was added to the single strand library. The Klenow enzyme mix was incubated at 37°C for 60 min. The second strand reaction was stopped by heating the product at 85°C for 5 min. The sample was purified using 1× Ampure XP and eluted in 40 μL of EB buffer. The purified second strand product was amplified with ISPCR primers as described above.

For the BST 2.0 polymerase second strand synthesis trial, the purified first strand product was treated with 1 μL of E. coli RNase H (NEB M0297S). The mixture was incubated at 37°C for 30 min after which the enzyme was inactivated using 10 mM EDTA. The single stranded product was purified using 1.2× Ampure XP bead purification (BD sciences) and eluted in 25 μL of EB buffer. dN-SMRT primer was added to the single strand product to a final concentration of 2.5 μM and the mixture was denatured by incubation at 95°C for 5 min. The sample was then allowed to cool to room temperature and incorporated in the Bst 2.0 enzyme mix (1× Isothermal amplification buffer, 1 mM dNTP, 1 U/μL of Bst 2.0 DNA polymerase; NEB M0537L) was added to the denatured library and the mixture was incubated at 55°C for 10 min and 60°C for 45 min. The second strand reaction was stopped by heating the product at 85°C for 5 min. The sample was purified using 1× Ampure XP and eluted in 40 μL of EB buffer. The purified second strand product was amplified with ISPCR primers as described above.

Sequencing

HyDrop-ATAC libraries were sequenced on Illumina NextSeq500 or NextSeq2000 systems using 50 cycles for read 1 (ATAC paired-end mate 1), 52 cycles for index 1 (barcode), 10 cycles for index 2 (sample index), and 50 cycles for read 2 (ATAC paired-end mate 2).

HyDrop-RNA libraries were sequenced on Illumina NextSeq2000 systems using 50 cycles for read 1 (3-prime cDNA), 10 cycles for index 1 (sample index, custom i7 read primer), 10 cycles for index 2 (sample index), and 58 cycles for read 2 (barcode+ UMI, custom read two primer).

HyDrop-ATAC data processing

Barcode reads were trimmed to exclude the intersub-barcode linear amplification adapters using a mawk script. Next, the VSN scATAC-seq pre-processing pipeline (Waegeneer et al., 2021) was used to map the reads to the reference genome and generate a fragments file for downstream analysis. Here, barcode reads were compared to a whitelist (of 884,736 valid barcodes), and corrected, allowing for a maximum 1 bp mismatch. Uncorrected and corrected barcodes were appended to the fastq sequence identifier of the paired end ATAC-seq reads. Reads were mapped to the reference genome using bwa-mem with default settings, and the barcode information was added as tags to each read in the bam file. Duplicate-marking was performed using samtools markdup. In the final step of the pipeline, fragment files were generated using Sinto (https://github.com/timoast/sinto). For mixed species data, cells were filtered for a minimum of 1000 unique fragments and a minimum TSS enrichment of 7. For mouse cortex data, higher level analysis such as clustering and differential accessibility were performed using cisTopic (Bravo González-Blas et al., 2019). In brief, cells were filtered for a minimum of 1000 unique fragments and a minimum TSS enrichment of 5. Fragments overlapping mouse candidate cis-regulatory regions (ENCODE Project Consortium et al., 2020) were counted, and the resulting matrix was filtered for potential cell doublets using a Scrublet (Wolock et al., 2019) threshold of 0.35. Cells were Leiden clustered based on the cell-topic probability matrix generated by an initial cisTopic LDA incorporating 51 topics, at a resolution of 0.9 with 10 neighbours. A consensus peak set was generated from per-cluster peaks and used to recount fragments. Cells were filtered using the same filtering parameters and a new model with 50 topics was trained. Cells were again Leiden clustered based on the cell-topic probability matrix generated by the second LDA, at a resolution of 0.9 with 10 neighbours. Region accessibility was imputed based on binarized topic-region and cell-topic distributions. Gene activity was imputed based on Gini index-weighted imputed accessibility in a 10 kb up/downstream decaying window around each gene including promoters. Leiden clusters were annotated based on imputed gene accessibility around marker genes (Yao et al., 2021; Zeisel et al., 2018). Differentially accessible regions were called using one-versus-all Wilcoxon rank-sum tests for each cell type, with an adjusted p-value of 0.05 and log2FC of 1.5. RPGC-normalized aggregate genome coverage bigwigs were generated from BAM files using DeepTools (Ramírez et al., 2016). Per-cluster genome coverage tracks were generated using pyBigWig.

HyDrop-RNA data processing

Barcode reads were trimmed to exclude the intersub-barcode linear amplification adapters using a mawk script. Reads were then mapped and cell-demultiplexed using STARsolo (Kaminow et al., 2021) in CB_UMI_Complex mode. The resulting STARsolo-filtered count matrices were further analyzed using Scanpy (Wolf et al., 2018). In short, cells were filtered on expression of a maximum of 4000 genes, and a maximum of 1% UMIs from mitochondrial genes. Genes were filtered on expression in a minimum of three cells. Potential cell doublets were filtered out using a Scrublet (Wolock et al., 2019) threshold of 0.25. The filtered expression matrix was scaled to total counts and log-normalized. Total counts and mitochondrial reads were regressed out and UMAP embedding was performed after PCA. Cells were annotated and fine tuned based on differential gene expression of marker genes sourced from either the Davie et al. Drosophila brain atlas (Davie et al., 2018) or the Allen Brain RNA-seq Database (Yao et al., 2021).

Raw inDrop (SRR10545068 to SRR10545079) and Drop-seq (SRR1853178 to SRR1853184) sequencing data was downloaded from SRA. Data was sampled to the HyDrop-RNA mouse cortex sample sequencing depth (52,738 reads per cell) and mapped to mouse reference genome using STARsolo, allowing one mismatch in the cell barcodes (like in our HyDrop-RNA and HyDrop-ATAC analyses). Full analysis process is documented on the Hydrop GitHub repository (https://github.com/aertslab/hydrop_data_analysis/tree/main/HyDrop-RNA_publicdata_comparison). Public reference 10× single-cell ATAC-seq data was sourced from https://support.10xgenomics.com/single-cell-atac/datasets (‘Flash frozen cortex, hippocampus, and ventricular zone from embryonic mouse brain (E18)’, Fresh cortex from adult mouse brain (P50)’). Public reference 10× single-cell gene expression data was sourced from https://support.10xgenomics.com/single-cell-gene-expression/datasets (‘1 k Brain Nuclei from an E18 Mouse’, ‘2 k Brain Nuclei from an Adult Mouse ( > 8 weeks)’). Public PC-3 and MCF-7 ATAC-seq data was sourced from ENCODE (ENCFF772EFK: doi:10.17989/ENCSR422SUG, ENCFF024FNF: doi:10.17989/ENCSR499ASS).

Data was visualized using a combination of Seaborn (Waskom, 2021) and Matplotlib (Hunter, 2007). A vector image representing mouse head and cortex was sourced from SciDraw (Kennedy, 2020).

Acknowledgements

We thank Andrew Adey for his great advice on Tn5 and scATAC-seq. We also thank Sebastián Najle, Céline Vallot, and Arnau Sebé-Pedrós for many discussions on droplet microfluidics, Frederik Ceyssens and the KU Leuven Nanocenter for their support with microfabrication, Ghanem Ghanem for his kind donation of the MM087 melanoma lines, Jean-Christophe Marine for his kind donation of the mouse melanoma lines, and Koen De Wispelaere for his assistance during his master internship.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Suresh Poovathingal, Email: suresh.poovathingal@kuleuven.be.

Stein Aerts, Email: stein.aerts@kuleuven.be.

Naama Barkai, Weizmann Institute of Science, Israel.

Naama Barkai, Weizmann Institute of Science, Israel.

Funding Information

This paper was supported by the following grants:

  • H2020 European Research Council 724226_cis- CONTROL to Stein Aerts.

  • KU Leuven C14/18/092 to Stein Aerts.

  • Fonds Wetenschappelijk Onderzoek G0B5619N to Stein Aerts.

  • Michael J. Fox Foundation for Parkinson's Research ASAP-000430 to Christopher Campbell Flerin.

  • Aligning Science Across Parkinson’s ASAP-000430 to Christopher Campbell Flerin.

  • Foundation Against Cancer 2016-070 to Stein Aerts.

  • Stichting Tegen Kanker to Jasper Wouters.

  • Belgian Cancer Society to Jasper Wouters.

  • Fonds Wetenschappelijk Onderzoek to Florian V De Rop, Carmen Bravo González-Blas, Jasper Janssens.

  • VIB Tech Watch to Suresh Poovathingal.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review and editing.

Investigation, Writing – review and editing.

Formal analysis, Software, Writing – review and editing.

Data curation, Software.

Software, Writing – original draft, Writing – review and editing.

Software.

Methodology.

Resources.

Writing – review and editing.

Resources.

Resources.

Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Validation, Writing – original draft, Writing – review and editing.

Conceptualization, Funding acquisition, Methodology, Project administration, Supervision, Writing – original draft, Writing – review and editing.

Ethics

All animal experiments were conducted according to the KU Leuven ethical guidelines and approved by the KU Leuven Ethical Committee for Animal Experimentation (approved protocol numbers ECD P037/2016, P014/2017, and P062/2017). All use of cell lines was approved by the KU Leuven Ethical Committee for Research under project number S63316.

Additional files

Supplementary file 1. "Molecular sequence description of HyDrop bead barcoding": visual description of progression of nucleotide sequences involved in every step of the barcoding process.
elife-73971-supp1.docx (19.2KB, docx)
Supplementary file 2. "Molecular sequence description of HyDrop-ATAC": visual description of progression of nucleotide sequences involved in every step of HyDrop-ATAC.
elife-73971-supp2.docx (24.2KB, docx)
Supplementary file 3. "Molecular sequence description of HyDrop-RNA": visual description of progression of nucleotide sequences involved in every step of HyDrop-RNA.
elife-73971-supp3.docx (25.7KB, docx)
Supplementary file 4. "Reagents and oligonucleotide list": list of all reagents and oligonucleotides used in HyDrop.
elife-73971-supp4.xlsx (25.9KB, xlsx)
Supplementary file 5. "Price calculation for HyDrop bead barcoding and HyDrop-ATAC": detailed price calculation for HyDrop bead barcoding and HyDrop-ATAC.
elife-73971-supp5.xlsx (18.2KB, xlsx)
Transparent reporting form
Source data 1. Source data files for Figure 4a, Figure 5b–d, Figure 8a–c, Figure 8—figure supplement 2, and Figure 10b.
elife-73971-data1.zip (51.7MB, zip)

Data availability

The data discussed in this publication have been deposited in NCBI's Gene Expression Omnibus and are accessible through GEO Series accession number GSE175684 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE175684) and on SCope (https://scope.aertslab.org/#/HyDrop/*/welcome). Source data files have been provided for Figures 2a, 3a, 3b, 3c and 3i, and can be regenerated data analysis tutorials for HyDrop. Data analysis tutorials for HyDrop are available on GitHub (https://github.com/aertslab/hydrop_data_analysis; copy archieved at swh:1:rev:059bf5a7779dc2894670ecf5f820c14bceb68493).

The following dataset was generated:

De Rop F, Ismail J, Bravo González-Blas C, Hulselmans G, Flerin C, Janssens J, Christiaens V, Poovathingal S, Aerts S. 2021. HyDrop: droplet-based scATAC-seq and scRNA-seq using dissolvable hydrogel beads. NCBI Gene Expression Omnibus. GSE175684

The following previously published datasets were used:

Macosko EZ. 2015. Drop-Seq analysis of P14 mouse retina single-cell suspension. NCBI Gene Expression Omnibus. GSE63472

Kalish BT, Greenberg ME. 2020. inDrop scRNA-seq of mouse primary auditory cortex. NCBI Gene Expression Omnibus. GSE140883

10x Genomics 2019. 10x scATAC-seq on flash frozen cortex, hippocampus, and ventricular zone from embryonic mouse brain (E18) 10x Genomics Resources. atac_v1_E18_brain_flash_5k

10x Genomics 2019. 10x scATAC-seq on fresh cortex from adult mouse brain (P50) 10x Genomics Resources. atac_v1_adult_brain_fresh_5k

10x Genomics 2017. 10x scRNA-seq on 1k brain nuclei from an E18 mouse. 10x Genomics Resources. nuclei_900

10x Genomics 2017. 10x scRNA-seq on 2k brain nuclei from an adult mouse (>8 weeks) 10x Genomics Resources. nuclei_2k

ENCODE Project Consortium 2021. ATAC-seq from MCF-7 (ENCSR422SUG) NCBI Gene Expression Omnibus. GSE169929

ENCODE Project Consortium 2021. ATAC-seq from PC-3 (ENCSR499ASS) NCBI Gene Expression Omnibus. GSE170337

References

  1. Abate AR, Chen CH, Agresti JJ, Weitz DA. Beating Poisson encapsulation statistics using close-packed ordering. Lab on a Chip. 2009;9:2628–2631. doi: 10.1039/b909386a. [DOI] [PubMed] [Google Scholar]
  2. Bradski G. Automated Calibration of RF On-Wafer Probing and Evaluation of Probe Misalignment Effects Using a Desktop Micro-Factory. Journal of Computer and Communications. 2015;4:122–125. doi: 10.4236/jcc.2016.43009. [DOI] [Google Scholar]
  3. Bravo González-Blas C, Minnoye L, Papasokrati D, Aibar S, Hulselmans G, Christiaens V, Davie K, Wouters J, Aerts S. cisTopic: cis-regulatory topic modeling on single-cell ATAC-seq data. Nature Methods. 2019;16:397–400. doi: 10.1038/s41592-019-0367-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Chen S, Lake BB, Zhang K. High-throughput sequencing of the transcriptome and chromatin accessibility in the same cell. Nature Biotechnology. 2019;37:1452–1457. doi: 10.1038/s41587-019-0290-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Consortium EP. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. doi: 10.1038/nature11247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Dankort D, Curley DP, Cartlidge RA, Nelson B, Karnezis AN, Damsky WE, Jr, You MJ, DePinho RA, McMahon M, Bosenberg M. Braf(V600E) cooperates with Pten loss to induce metastatic melanoma. Nature Genetics. 2009;41:544–552. doi: 10.1038/ng.356. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Datlinger P, Rendeiro AF, Boenke T, Senekowitsch M, Krausgruber T, Barreca D, Bock C. Ultra-high-throughput single-cell RNA sequencing and perturbation screening with combinatorial fluidic indexing. Nature Methods. 2021;18:635–642. doi: 10.1038/s41592-021-01153-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Davie K, Janssens J, Koldere D, De Waegeneer M, Pech U, Kreft Ł, Aibar S, Makhzami S, Christiaens V, Bravo González-Blas C, Poovathingal S, Hulselmans G, Spanier KI, Moerman T, Vanspauwen B, Geurs S, Voet T, Lammertyn J, Thienpont B, Liu S, Konstantinides N, Fiers M, Verstreken P, Aerts S. A Single-Cell Transcriptome Atlas of the Aging Drosophila Brain. Cell. 2018;174:982–998. doi: 10.1016/j.cell.2018.05.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Delley CL, Abate AR. Modular barcode beads for microfluidic single cell genomics. Cell Biology. 2020;1:e292326. doi: 10.1101/2020.09.10.292326. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. ENCODE Project Consortium. Moore JE, Purcaro MJ, Pratt HE, Epstein CB, Shoresh N, Adrian J, Kawli T, Davis CA, Dobin A, Kaul R, Halow J, Van Nostrand EL, Freese P, Gorkin DU, Shen Y, He Y, Mackiewicz M, Pauli-Behn F, Williams BA, Mortazavi A, Keller CA, Zhang X-O, Elhajjajy SI, Huey J, Dickel DE, Snetkova V, Wei X, Wang X, Rivera-Mulia JC, Rozowsky J, Zhang J, Chhetri SB, Zhang J, Victorsen A, White KP, Visel A, Yeo GW, Burge CB, Lécuyer E, Gilbert DM, Dekker J, Rinn J, Mendenhall EM, Ecker JR, Kellis M, Klein RJ, Noble WS, Kundaje A, Guigó R, Farnham PJ, Cherry JM, Myers RM, Ren B, Graveley BR, Gerstein MB, Pennacchio LA, Snyder MP, Bernstein BE, Wold B, Hardison RC, Gingeras TR, Stamatoyannopoulos JA, Weng Z. Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature. 2020;583:699–710. doi: 10.1038/s41586-020-2493-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Hagemann-Jensen M, Ziegenhain C, Chen P, Ramsköld D, Hendriks G-J, Larsson AJM, Faridani OR, Sandberg R. Single-cell RNA counting at allele and isoform resolution using Smart-seq3. Nature Biotechnology. 2020;38:708–714. doi: 10.1038/s41587-020-0497-0. [DOI] [PubMed] [Google Scholar]
  12. Hughes TK, Wadsworth MH, Gierahn TM, Do T, Weiss D, Andrade PR, Ma F, de Andrade Silva BJ, Shao S, Tsoi LC, Ordovas-Montanes J, Gudjonsson JE, Modlin RL, Love JC, Shalek AK. Second-Strand Synthesis-Based Massively Parallel scRNA-Seq Reveals Cellular States and Molecular Features of Human Inflammatory Skin Pathologies. Immunity. 2020;53:878–894. doi: 10.1016/j.immuni.2020.09.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Hunter JD. Matplotlib: A 2D Graphics Environment. Computing in Science & Engineering. 2007;9:90–95. doi: 10.1109/MCSE.2007.55. [DOI] [Google Scholar]
  14. Hur SC, Henderson-MacLennan NK, McCabe ERB, Di Carlo D. Deformability-based cell classification and enrichment using inertial microfluidics. Lab on a Chip. 2011;11:912–920. doi: 10.1039/c0lc00595a. [DOI] [PubMed] [Google Scholar]
  15. Kalish BT, Barkat TR, Diel EE, Zhang EJ, Greenberg ME, Hensch TK. Single-nucleus RNA sequencing of mouse auditory cortex reveals critical period triggers and brakes. PNAS. 2020;117:11744–11752. doi: 10.1073/pnas.1920433117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Kaminow B, Yunusov D, Dobin A. GenomicSuperSignature: interpretation of RNA-seq experiments through robust, efficient comparison to public databases. Bioinformatics (Oxford, England) 2021;1:e55. doi: 10.1101/2021.05.05.442755. [DOI] [Google Scholar]
  17. Karaiskos N, Wahle P, Alles J, Boltengagen A, Ayoub S, Kipar C, Kocks C, Rajewsky N, Zinzen RP. The Drosophila embryo at single-cell transcriptome resolution. Science (New York, N.Y.) 2017;358:194–199. doi: 10.1126/science.aan3235. [DOI] [PubMed] [Google Scholar]
  18. Kennedy A. mouse brain silhouette. Zenodo. 2020 https://zenodo.org/record/3925919
  19. Kim B, Hahn YK, You D, Oh S, Choi S. A smart multi-pipette for hand-held operation of microfluidic devices. The Analyst. 2016;141:5753–5758. doi: 10.1039/c6an00681g. [DOI] [PubMed] [Google Scholar]
  20. Kivioja T, Vähärautio A, Karlsson K, Bonke M, Enge M, Linnarsson S, Taipale J. Counting absolute numbers of molecules using unique molecular identifiers. Nature Methods. 2011;9:72–74. doi: 10.1038/nmeth.1778. [DOI] [PubMed] [Google Scholar]
  21. Klein AM, Mazutis L, Akartuna I, Tallapragada N, Veres A, Li V, Peshkin L, Weitz DA, Kirschner MW. Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells. Cell. 2015;161:1187–1201. doi: 10.1016/j.cell.2015.04.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Konstantinides N, Kapuralin K, Fadil C, Barboza L, Satija R, Desplan C. Phenotypic Convergence: Distinct Transcription Factors Regulate Common Terminal Features. Cell. 2018;174:622–635. doi: 10.1016/j.cell.2018.05.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Langer K, Bremond N, Boitard L, Baudry J, Bibette J. Micropipette-powered droplet based microfluidics. Biomicrofluidics. 2018;12:044106. doi: 10.1063/1.5037795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Lareau CA, Duarte FM, Chew JG, Kartha VK, Burkett ZD, Kohlway AS, Pokholok D, Aryee MJ, Steemers FJ, Lebofsky R, Buenrostro JD. Droplet-based combinatorial indexing for massive-scale single-cell chromatin accessibility. Nature Biotechnology. 2019;37:916–924. doi: 10.1038/s41587-019-0147-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Li YE, Preissl S, Hou X, Zhang Z, Zhang K, Fang R, Qiu Y, Poirion O, Li B, Liu H, Wang X, Han JY, Lucero J, Yan Y, Kuan S, Gorkin D, Nunn M, Mukamel EA, Behrens MM, Ecker J, Ren B. An Atlas of Gene Regulatory Elements in Adult Mouse Cerebrum. Neuroscience. 2020;1:e87585. doi: 10.1101/2020.05.10.087585. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Macosko EZ, Basu A, Satija R, Nemesh J, Shekhar K, Goldman M, Tirosh I, Bialas AR, Kamitaki N, Martersteck EM, Trombetta JJ, Weitz DA, Sanes JR, Shalek AK, Regev A, McCarroll SA. Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets. Cell. 2015;161:1202–1214. doi: 10.1016/j.cell.2015.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Mulqueen RM, DeRosa BA, Thornton CA, Sayar Z, Torkenczy KA, Fields AJ, Wright KM, Nan X, Ramji R, Steemers FJ, O’Roak BJ, Adey AC. Improved Single-Cell ATAC-Seq Reveals Chromatin Dynamics of in Vitro Corticogenesis. bioRxiv. 2019 doi: 10.1101/637256. [DOI]
  28. Mulqueen RM, Pokholok D, O’Connell BL, Thornton CA, Zhang F, O’Roak BJ, Link J, Yardımcı GG, Sears RC, Steemers FJ, Adey AC. High-content single-cell combinatorial indexing. Nature Biotechnology. 2021;39:1574–1580. doi: 10.1038/s41587-021-00962-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Picelli S, Faridani OR, Björklund AK, Winberg G, Sagasser S, Sandberg R. Full-length RNA-seq from single cells using Smart-seq2. Nature Protocols. 2014;9:171–181. doi: 10.1038/nprot.2014.006. [DOI] [PubMed] [Google Scholar]
  30. Preissl S, Fang R, Huang H, Zhao Y, Raviram R, Gorkin DU, Zhang Y, Sos BC, Afzal V, Dickel DE, Kuan S, Visel A, Pennacchio LA, Zhang K, Ren B. Single-nucleus analysis of accessible chromatin in developing mouse forebrain reveals cell-type-specific transcriptional regulation. Nature Neuroscience. 2018;21:432–439. doi: 10.1038/s41593-018-0079-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Ramírez F, Ryan DP, Grüning B, Bhardwaj V, Kilpert F, Richter AS, Heyne S, Dündar F, Manke T. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Research. 2016;44:W160–W165. doi: 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Ren X, Wen W, Fan X, Hou W, Su B, Cai P, Li J, Liu Y, Tang F, Zhang F, Yang Y, He J, Ma W, He J, Wang P, Cao Q, Chen F, Chen Y, Cheng X, Deng G, Deng X, Ding W, Feng Y, Gan R, Guo C, Guo W, He S, Jiang C, Liang J, Li Y-M, Lin J, Ling Y, Liu H, Liu J, Liu N, Liu S-Q, Luo M, Ma Q, Song Q, Sun W, Wang G, Wang F, Wang Y, Wen X, Wu Q, Xu G, Xie X, Xiong X, Xing X, Xu H, Yin C, Yu D, Yu K, Yuan J, Zhang B, Zhang P, Zhang T, Zhao J, Zhao P, Zhou J, Zhou W, Zhong S, Zhong X, Zhang S, Zhu L, Zhu P, Zou B, Zou J, Zuo Z, Bai F, Huang X, Zhou P, Jiang Q, Huang Z, Bei J-X, Wei L, Bian X-W, Liu X, Cheng T, Li X, Zhao P, Wang F-S, Wang H, Su B, Zhang Z, Qu K, Wang X, Chen J, Jin R, Zhang Z. COVID-19 immune features revealed by a large-scale single-cell transcriptome atlas. Cell. 2021;184:1895–1913. doi: 10.1016/j.cell.2021.01.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Rivello F, van Buijtenen E, Matuła K, van Buggenum J, Vink P, van Eenennaam H, Mulder KW, Huck WTS. Single-cell intracellular epitope and transcript detection revealing signal transduction dynamics. Molecular Biology. 2020;1:e08120. doi: 10.1101/2020.12.02.408120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Rop FD. hydrop_data_analysis. swh:1:rev:059bf5a7779dc2894670ecf5f820c14bceb68493Software Heritage. 2022 https://archive.softwareheritage.org/swh:1:dir:28fb90ef390d0028cdc38102244567996855e4fd;origin=https://github.com/aertslab/hydrop_data_analysis;visit=swh:1:snp:8c8b341705393049010ae1d01233a0aee531d82b;anchor=swh:1:rev:059bf5a7779dc2894670ecf5f820c14bceb68493
  35. Sarkar A, Mei A, Paquola ACM, Stern S, Bardy C, Klug JR, Kim S, Neshat N, Kim HJ, Ku M, Shokhirev MN, Adamowicz DH, Marchetto MC, Jappelli R, Erwin JA, Padmanabhan K, Shtrahman M, Jin X, Gage FH. Efficient Generation of CA3 Neurons from Human Pluripotent Stem Cells Enables Modeling of Hippocampal Connectivity In Vitro. Cell Stem Cell. 2018;22:684–697. doi: 10.1016/j.stem.2018.04.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Satpathy AT, Granja JM, Yost KE, Qi Y, Meschi F, McDermott GP, Olsen BN, Mumbach MR, Pierce SE, Corces MR, Shah P, Bell JC, Jhutty D, Nemec CM, Wang J, Wang L, Yin Y, Giresi PG, Chang ALS, Zheng GXY, Greenleaf WJ, Chang HY. Massively parallel single-cell chromatin landscapes of human immune cell development and intratumoral T cell exhaustion. Nature Biotechnology. 2019;37:925–936. doi: 10.1038/s41587-019-0206-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Sinha N, Subedi N, Wimmers F, Soennichsen M, Tel J. A Pipette-Tip Based Method for Seeding Cells to Droplet Microfluidic Platforms. Journal of Visualized Experiments. 2019;1:e48. doi: 10.3791/57848. [DOI] [PubMed] [Google Scholar]
  38. Stickels RR, Murray E, Kumar P, Li J, Marshall JL, Di Bella DJ, Arlotta P, Macosko EZ, Chen F. Highly sensitive spatial transcriptomics at near-cellular resolution with Slide-seqV2. Nature Biotechnology. 2021;39:313–319. doi: 10.1038/s41587-020-0739-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Stoeckius M, Hafemeister C, Stephenson W, Houck-Loomis B, Chattopadhyay PK, Swerdlow H, Satija R, Smibert P. Simultaneous epitope and transcriptome measurement in single cells. Nature Methods. 2017;14:865–868. doi: 10.1038/nmeth.4380. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Svensson V, da Veiga Beltrame E, Pachter L. A curated database reveals trends in single-cell transcriptomics. Database. 2020;2020:baaa073. doi: 10.1093/database/baaa073. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Traag VA, Waltman L, van Eck NJ. From Louvain to Leiden: guaranteeing well-connected communities. Scientific Reports. 2019;9:5233. doi: 10.1038/s41598-019-41695-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Waegeneer MD, Flerin CC, Davie K. vib-singlecell-nf/vsn-pipelines. 1.01Zenodo. 2021 https://zenodo.org/record/4468513
  43. Wang Y, Cao T, Ko J, Shen Y, Zong W, Sheng K, Cao W, Sun S, Cai L, Zhou YL, Zhang XX, Zong C, Weissleder R, Weitz D. Dissolvable Polyacrylamide Beads for High-Throughput Droplet DNA Barcoding. Advanced Science (Weinheim, Baden-Wurttemberg, Germany) 2020;7:1903463. doi: 10.1002/advs.201903463. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Waskom ML. seaborn: statistical data visualization. Journal of Open Source Software. 2021;6:3021. doi: 10.21105/joss.03021. [DOI] [Google Scholar]
  45. Wolf FA, Angerer P, Theis FJ. SCANPY: large-scale single-cell gene expression data analysis. Genome Biology. 2018;19:15. doi: 10.1186/s13059-017-1382-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Wolock SL, Lopez R, Klein AM. Scrublet: Computational Identification of Cell Doublets in Single-Cell Transcriptomic Data. Cell Systems. 2019;8:281–291. doi: 10.1016/j.cels.2018.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Xia Y, Whitesides GM. SOFT LITHOGRAPHY. Annual Review of Materials Science. 1998;28:153–184. doi: 10.1146/annurev.matsci.28.1.153. [DOI] [Google Scholar]
  48. Yao Z, van Velthoven CTJ, Nguyen TN, Goldy J, Sedeno-Cortes AE, Baftizadeh F, Bertagnolli D, Casper T, Chiang M, Crichton K, Ding SL, Fong O, Garren E, Glandon A, Gouwens NW, Gray J, Graybuck LT, Hawrylycz MJ, Hirschstein D, Kroll M, Lathia K, Lee C, Levi B, McMillen D, Mok S, Pham T, Ren Q, Rimorin C, Shapovalova N, Sulc J, Sunkin SM, Tieu M, Torkelson A, Tung H, Ward K, Dee N, Smith KA, Tasic B, Zeng H. A taxonomy of transcriptomic cell types across the isocortex and hippocampal formation. Cell. 2021;184:3222–3241. doi: 10.1016/j.cell.2021.04.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Yap E-L, Pettit NL, Davis CP, Nagy MA, Harmin DA, Golden E, Dagliyan O, Lin C, Rudolph S, Sharma N, Griffith EC, Harvey CD, Greenberg ME. Bidirectional perisomatic inhibitory plasticity of a Fos neuronal network. Nature. 2021;590:115–121. doi: 10.1038/s41586-020-3031-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Zeisel A, Hochgerner H, Lönnerberg P, Johnsson A, Memic F, van der Zwan J, Häring M, Braun E, Borm LE, La Manno G, Codeluppi S, Furlan A, Lee K, Skene N, Harris KD, Hjerling-Leffler J, Arenas E, Ernfors P, Marklund U, Linnarsson S. Molecular Architecture of the Mouse Nervous System. Cell. 2018;174:999–1014. doi: 10.1016/j.cell.2018.06.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Zhang X, Li T, Liu F, Chen Y, Yao J, Li Z, Huang Y, Wang J. Comparative Analysis of Droplet-Based Ultra-High-Throughput Single-Cell RNA-Seq Systems. Molecular Cell. 2019;73:130–142. doi: 10.1016/j.molcel.2018.10.020. [DOI] [PubMed] [Google Scholar]
  52. Zheng GXY, Terry JM, Belgrader P, Ryvkin P, Bent ZW, Wilson R, Ziraldo SB, Wheeler TD, McDermott GP, Zhu J, Gregory MT, Shuga J, Montesclaros L, Underwood JG, Masquelier DA, Nishimura SY, Schnall-Levin M, Wyatt PW, Hindson CM, Bharadwaj R, Wong A, Ness KD, Beppu LW, Deeg HJ, McFarland C, Loeb KR, Valente WJ, Ericson NG, Stevens EA, Radich JP, Mikkelsen TS, Hindson BJ, Bielas JH. Massively parallel digital transcriptional profiling of single cells. Nature Communications. 2017;8:14049. doi: 10.1038/ncomms14049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Zilionis R, Nainys J, Veres A, Savova V, Zemmour D, Klein AM, Mazutis L. Single-cell barcoding and sequencing using droplet microfluidics. Nature Protocols. 2017;12:44–73. doi: 10.1038/nprot.2016.154. [DOI] [PubMed] [Google Scholar]

Editor's evaluation

Naama Barkai 1

De Rop et al. introduce a flexible microfluidics-based single-cell genomics technology that expands and improves previously existing custom droplet-based scRNA-seq protocols (inDrops and Drop-seq) in interesting directions: better data quality, simplified workflow, high-cell recovery, and flexibility towards other single-cell applications.

Decision letter

Editor: Naama Barkai1
Reviewed by: Klaas Mulder

Our editorial process produces two outputs: (i) public reviews designed to be posted alongside the preprint for the benefit of readers; (ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.

Decision letter after peer review:

Thank you for submitting your article “HyDrop: droplet-based single-cell ATAC-seq and single-cell RNA-seq using dissolvable hydrogel beads” for consideration by eLife. Your article has been reviewed by 2 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Naama Barkai as the Senior Editor. The following individual involved in review of your submission has agreed to reveal their identity: Klaas Mulder (Reviewer #3).

The reviewers are excited about your paper, we will be happy to publish it, but please address the suggestions made below.

Reviewer #1 (Recommendations for the authors):

The paper is clearly written and it Includes enough methodological details, for example explaining the rationale behind some of the modifications benchmarked and later selected in the HyDrop protocol. Moreover, the authors exemplify the application of HyDrop to different contexts (e.g. low-input samples). For all these reasons, I think this is an outstanding candidate for publication in eLife.

Reviewer #2 (Recommendations for the authors):

Specific comments for the authors:

1) The introduction (line 38) contains the following statement "Primers carried or released by the bead allow each individual cell’s mRNA to be indexed inside the droplet".

This is indeed the case for the inDrop method, however is incorrect for Drop-seq, where the RT reaction takes place after emulsion breaking. Therefore, this sentence does not represent both methods accurately. However, this distinction is important to make as these are the main methods HyDrop is compared to.

2) Figure 1 and Figure 1—figure supplement 2. The produced hydrogel beads do not seem perfectly monodisperse and uniform in their fluorescent signal after barcode production. It might be good to mention this in the text and briefly discuss whether this will potentially impact the detection of heterogeneity in the cell population after sequencing, or not.

3) Figure 1—figure supplement 6c, the 6uM primer beads are concluded to be the best option for the HyDropRNA application. The images of these beads show some punctate intensities. Would the authors like to comment on this briefly? This could raise some questions for the readers, potentially leading to doubts about how to best adopt the method.

4) Figure 4—figure supplement 1b, please indicate in the legend what the colors encode.

5) The linear amplification method to add the barcodes to the beads (and in the scATAC workflow) is non-standard, would the authors be able to provide an estimation of the rate (or proportions) of errors introduced into the barcodes during this process? The authors state that 88% of the detected barcodes are in the ‘'whitelist’' (1 mismatch allowed). What is the distribution of mismatches in the 12% of the barcode that were discarded? Would that be of interest to estimate the error rate of the linear amplification step?

6) On the conclusion stated on line 116 on the optimal bead primer concentration: it seems that adding half the concentration of primer (6 instead of 12 uM) leads to 10x less reads. Which concentration would they in the end advice and in which context? (not mentioned clearly).

7) Line 258: This sentence is confusing. Some points they elaborate on (use og GTP/PEG for molecular crowding) others they just refer to other papers (use of LNA). Maybe cut into different sentences with explanation and referral to supplemental figure? The overall conclusion can still be linked to the main figure.

8) Figure 8c, It seems from the methods section that the filtering of the InDrop and Drop-seq data is not the same? What happens you perform the same filtering as with your HyDrop analysis?

9) Line 284: the header states HyDrop-ATAC, whereas the experiment and text are on HyDrop-RNA.

10) Line 327. Were the discussed neuronal cell populations also detected in the in-house generated Drop-seq experiments? It would be good to include a mention/discussion of this in the text.

eLife. 2022 Feb 23;11:e73971. doi: 10.7554/eLife.73971.sa2

Author response


Reviewer #2 (Recommendations for the authors):

Specific comments for the authors:

1) The introduction (line 38) contains the following statement: "Primers carried or released by the bead allow each individual cell’s mRNA to be indexed inside the droplet".

This is indeed the case for the inDrop method, however is incorrect for Drop-seq, where the RT reaction takes place after emulsion breaking. Therefore, this sentence does not represent both methods accurately. However, this distinction is important to make as these are the main methods HyDrop is compared to.

We thank the reviewer for pointing out this mistake. We have clarified and modified the text to the following:

“Barcoding primers carried by the beads inside the emulsion are then used to index each individual cell’s mRNA. This process occurs either inside the droplet, where the cell’s mRNA is reverse transcribed using barcoded primers released by the barcoded bead (inDrop) or after emulsion breaking, where the cellular mRNA is anchored onto barcodes carried by resin beads (Drop-seq).”

2) Figure 1 and Figure 1—figure supplement 2. The produced hydrogel beads do not seem perfectly monodisperse and uniform in their fluorescent signal after barcode production. It might be good to mention this in the text and briefly discuss whether this will potentially impact the detection of heterogeneity in the cell population after sequencing, or not.

While the small images in Figure 1 and Figure 1 supplement 2 indeed give the impression that the beads are heterodisperse in both size and fluorescent signal, data generated by our automated image analysis script indicates that the diameter of our beads vary by < 5% of the mean, and the intensity varies by <8% of the mean. To make this more clear to the reader, we have appended the absolute values of 1 standard deviation to both intensity and bead diameter in the supplementary data tables. For more details, the script, fluorescence images and resulting tables are available in GitHub: https://github.com/aertslab/hydrop_data_analysis/tree/main/manuscript_beadqc.

3) Figure 1—figure supplement 6c, the 6uM primer beads are concluded to be the best option for the HyDropRNA application. The images of these beads show some punctate intensities. Would the authors like to comment on this briefly? This could raise some questions for the readers, potentially leading to doubts about how to best adopt the method.

Indeed, some intense spots are present in some micrographs shown in the manuscript. However, we are almost certain that these small spots are dye precipitates or crystal aggregates arising from the primer suspension. We have also observed this while performing the same QC method on 10x genomics beads. For example, Author response image 1 is a QC image of 10x RNA beads.

Author response image 1.

Author response image 1.

In order to clarify this, we have added the following line to the methods:“Occasional punctate intensities (diameter ~1 μm) were observed in quality control performed on both HyDrop and 10X Genomics beads and are thought to be either dye precipitate (crystals aggregates in the oligonucleotide-fluorophore suspension) auto-fluorescent particles.”

4) Figure 4—figure supplement 1b, please indicate in the legend what the colors encode.

Thank you for pointing this out. We have added the following line to the figure caption:

“Color scale encodes each count’s local Gaussian kernel density estimation score.”

This should clarify the legend scores to the reader.

5) The linear amplification method to add the barcodes to the beads (and in the scATAC workflow) is non-standard, would the authors be able to provide an estimation of the rate (or proportions) of errors introduced into the barcodes during this process? The authors state that 88% of the detected barcodes are in the ‘'whitelist’' (1 mismatch allowed). What is the distribution of mismatches in the 12% of the barcode that were discarded? Would that be of interest to estimate the error rate of the linear amplification step?

Thank you for this interesting suggestion. We have calculated the distribution of mismatches in the remaining 12% of unidentified barcode sequences up to a hamming distance of 3. This analysis showed that an additional 1.16% and 1.13% of all barcodes can be assigned to the whitelist when 2 and 3 mismatches are allowed. When hamming distance is increased to 4 and beyond, matching of a barcode to more than 1 whitelist entry becomes possible and obfuscates results. We believe that the remaining ~10% of unidentified barcode sequences are a result of incorporation errors in the oligonucleotide production process. Importantly, in order to limit cost of the barcode plates we opted for standard desalting purification when purchasing our barcode oligonucleotides. Here, the error in base incorporation is about: 1 in 1000 bases (https://eu.idtdna.com/pages/support/faqs/why-does-my-sequence-data-have-errors) which is far more frequent than the reported error rate of the polymerase we use (Kapa HiFi has an error rate of around 1 in 3.6M bases, https://rochesequencingstore.com/wp-content/uploads/2017/10/KAPA-HiFi-PCRKit_KR0368-%E2%80%93-v13.19.pdf) and results in a frame shift. These frameshifted barcodes produce a barcode read that cannot be identified using simple hamming distance based error correction.

We have added the following text to the manuscript:

“An additional 1.16% and 1.13% of HyDrop barcodes could be assigned to the whitelist when 2 and 3 mismatches were allowed respectively. The remaining 9% of barcodes sequences could not be identified and are most likely a result of frame-shift due to incorporation errors in the synthetic oligonucleotide production process. Notably, only 39.7% of barcode reads could be matched with the whitelist in a public inDrop dataset (3), when 1 mismatch was allowed. When pre-filtered cell barcode sequences were used as a whitelist for a public mouse retina Drop-seq dataset (2), 49.5% of reads could be matched.”

6) On the conclusion stated on line 116 on the optimal bead primer concentration: it seems that adding half the concentration of primer (6 instead of 12 uM) leads to 10x less reads. Which concentration would they in the end advice and in which context? (not mentioned clearly).

This was indeed not mentioned strongly enough. We are continuously evaluating the optimal concentration of primers needed for bead synthesis of different HyDrop assays. Based on our experience we have found that the 12 μm HyDrop beads provide the most sensitive performance on both the HyDrop assays. Therefore, we used (and continue to use) 12 µM primer concentration to produce barcoded hydrogel beads for both HyDrop-RNA and HyDrop-ATAC experiments.

We have clarified line 116 to:

“The bead primer concentration with an optimal balance between assay sensitivity and library purity was found to be 12 µM for both scATACseq and scRNA-seq (Figure 1 – Figure supp. 6).”

7) Line 258: This sentence is confusing. Some points they elaborate on (use og GTP/PEG for molecular crowding) others they just refer to other papers (use of LNA). Maybe cut into different sentences with explanation and referral to supplemental figure? The overall conclusion can still be linked to the main figure.

We thank the reviewer for pointing out that this part of the manuscript was not clearly written. We have rewritten the paragraph to:

“To improve the assay’s sensitivity, we interrogated the impact of several alterations to the protocol’s reaction chemistry by testing them on a 50:50 human-mouse (human melanoma, mouse melanoma) mixture. We first investigated the use of pooled exonuclease I treatment after reverse transcription to remove unused barcode primers. We reasoned that these unused barcoded primers could potentially prime transcripts during the subsequent bulk ISPCR, leading to a loss of purity of transcripts associated with a given barcode. As evident from the increase in pure cells detected, we found that exonuclease I treatment indeed improved assay purity (Figure supp. 1). We then tested the implementation of a locked nucleic acid (LNA) in the TSO, as it has been shown to increase assay sensitivity due to increased hybridization stability of the TSO-cDNA dimer (30). We also investigated the addition of GTP/PEG to the in-droplet reverse transcription reaction. Both the addition of PEG as a molecular crowding agent and GTP have been shown to improve assay sensitivity in SMART-seq3 (31). Finally, we wondered whether a second strand synthesis library construction approach could outperform the TSO/ISPCR approach. In order to test this, we compared both alkaline hydrolysis and enzymatic treatment (RNAse H) to remove the RNA strand from the first strand product, and evaluated the performance of both Bst 2.0 DNA polymerase and Klenow (exo-) fragment for second strand synthesis (32,33). We found that the classical TSO and ISPCR protocol supplemented with GTP/PEG performed best, yielding a median of 2,110 UMIs and 1,325 genes per cell with a species-purity of 90.1% (Figure 8a, Figure 8 – Figure supp. 1).”

8) Figure 8c, It seems from the methods section that the filtering of the InDrop and Drop-seq data is not the same? What happens you perform the same filtering as with your HyDrop analysis?

We thank the reviewer for commenting on this filtering issue. In our original analysis, the sensitivities of Drop-seq and inDrop were deduced from the expression matrices provided as supplementary material to the respective publications, and these contained only the cells as filtered by the authors. In the Macosko et al., (Drop-seq) manuscript, these cells were selected based on visual detection of a drop in the UMI barcode rank plot. In the Kalish et al., (inDrop) manuscript, an arbitrary filter of barcodes containing > 500 UMIs was used. As opposed to opting for manual filtering methods, HyDrop employs STARsolo’s automated cell filtering algorithm (identical to 10X Genomics cellranger’s cell filtering algorithm) which is adaptive to the distribution of UMIs in cell barcodes. We believe that this automated filter is the best approach for HyDrop, as it is independent of human judgment, matches our expected cell recovery, and clearly distinguishes background from true cells in the barcode rank plots.

We decided to re-analyze the published InDrop and Drop-seq datasets starting from the raw fastq files, using STARsolo for cell filtering. Surprisingly, both for the inDrop and Drop-seq data, the STARsolo filtering method yielded very different results of cell filtering compared to the reported results. STARsolo identified only 29k cells in the

Drop-seq’s 49k cells dataset, and 21k in inDrop’s 31k cells dataset. This discrepancy likely results from the lack of a clear separation between cells and non-cells in inDrop and Drop-seq’s UMI/barcode-rank, which makes it difficult for STARsolo to filter true cells from background. If we were to reduce the number of cells in the inDrop and Dropseq samples by such large margins, then the fraction of raw reads that end up as unique transcript counts in barcodes identified as cells, would fall too low.

As an alternative comparison of the three data sets, we decided to retain the number of cells reported in the Macosko et al., manuscript and sampled the sequencing data to HyDrop’s depth (52738 reads/cell). For the inDrop data, this constraint led to a reduction of the number of cells from 31293 to 27094 and resulted in a minimum UMI cutoff of 577 instead of the original 500 for the Kalish et al., inDrop data. As a result, the dataset’s median genes/UMIs increased from 1097/1521 to 1321/1920. Note that we also allowed 1 mismatch in Drop-seq’s barcodes (like with inDrop and HyDrop), a step which is not performed in the original Drop-seq analysis. This analysis is transparently reported step-by-step in our GitHub repository at: https://github.com/aertslab/hydrop_data_analysis/tree/main/HyDropRNA_publicdata_comparison.

We have also adjusted the figure and adapted the main text as following:

“We then used HyDrop-RNA to generate 9,508 single nuclei transcriptomes from snap-frozen mouse cortex in a single experiment. At a sequencing saturation of approximately 60% duplicates and with 86% of reads mapping to genome and 55% of reads mapping to transcriptome, we achieve a median of 3,389 UMIs and 1,658 genes per cell before, and 3,404 UMIs and 1,662 genes per cell after doublet filtering (Figure 8c), compared to the median of 1,920 UMIs and 1,321 genes detected by inDrop snRNA-seq on mouse auditory cortex neurons (3) and the median of 1,071 UMIs and 763 genes reported by Drop-seq on mouse retina neurons (2). Both datasets also exhibited a lower read alignment efficiency, with Drop-seq mapping 52%/21%, and inDrop mapping 48%/28% of reads to the mouse genome/transcriptome. Importantly, HyDrop’s “sequencing efficiency” is higher than both inDrop and Drop-seq’s: 7.44% of HyDrop’s sequenced reads end up as mapped transcripts with a unique molecular identifier, whereas this metric is 3.24% for the inDrop dataset and 4.46% for the Drop-seq dataset (Figure 8 —figure supplement 2). 10x Chromium v2 gene expression reference data reports a median number of genes of 775-2,679 and a median number of UMIs of 1,127-6,570 on E18 and adult mouse brain nuclei (see methods).”.

We also added the following to the methods section:

“Raw inDrop (SRR10545068 to SRR10545079) and Drop-seq (SRR1853178 to SRR1853184) sequencing data was downloaded from SRA. Data was sampled to the HyDrop-RNA mouse cortex sample sequencing depth (52738 reads per cell) and mapped to mouse reference genome using STARsolo, allowing 1 mismatch in the cell barcodes (like in our HyDrop-RNA and HyDrop-ATAC analyses). Full analysis process is documented on the Hydrop GitHub repository (https://github.com/aertslab/hydrop_data_analysis/tree/main/HyDropRNA_publicdata_comparison).”

9) Line 284: the header states HyDrop-ATAC, whereas the experiment and text are on HyDrop-RNA.

We thank the reviewer for this remark and have corrected it in the final manuscript.

10) Line 327. Were the discussed neuronal cell populations also detected in the in-house generated Drop-seq experiments? It would be good to include a mention/discussion of this in the text.

Previous analysis of these drop-seq experiments did not yield such a granularity in detected cell types, separating mostly glia and neurons. We believe that this difference can be explained partially due to the lower data quality, but mostly due to the difference in sample: the HyDrop runs were performed on specifically FAC-sorted neuronal populations, whereas the Drop-seq runs were performed on unsorted whole brain. We clarified in the text that the cell populations found by HyDrop after FAC sorting a specific subset are not expected to be found in the Drop-seq whole-brain dataset as follows:

“In comparison, in-house Drop-seq performed on fly brain neurons the entire fly brain recovered a median of 579 UMIs and 289 genes per cell at a deeper sequencing saturation (11).”

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. De Rop F, Ismail J, Bravo González-Blas C, Hulselmans G, Flerin C, Janssens J, Christiaens V, Poovathingal S, Aerts S. 2021. HyDrop: droplet-based scATAC-seq and scRNA-seq using dissolvable hydrogel beads. NCBI Gene Expression Omnibus. GSE175684
    2. Macosko EZ. 2015. Drop-Seq analysis of P14 mouse retina single-cell suspension. NCBI Gene Expression Omnibus. GSE63472
    3. Kalish BT, Greenberg ME. 2020. inDrop scRNA-seq of mouse primary auditory cortex. NCBI Gene Expression Omnibus. GSE140883
    4. 10x Genomics 2019. 10x scATAC-seq on flash frozen cortex, hippocampus, and ventricular zone from embryonic mouse brain (E18) 10x Genomics Resources. atac_v1_E18_brain_flash_5k
    5. 10x Genomics 2019. 10x scATAC-seq on fresh cortex from adult mouse brain (P50) 10x Genomics Resources. atac_v1_adult_brain_fresh_5k
    6. 10x Genomics 2017. 10x scRNA-seq on 1k brain nuclei from an E18 mouse. 10x Genomics Resources. nuclei_900
    7. 10x Genomics 2017. 10x scRNA-seq on 2k brain nuclei from an adult mouse (>8 weeks) 10x Genomics Resources. nuclei_2k
    8. ENCODE Project Consortium 2021. ATAC-seq from MCF-7 (ENCSR422SUG) NCBI Gene Expression Omnibus. GSE169929
    9. ENCODE Project Consortium 2021. ATAC-seq from PC-3 (ENCSR499ASS) NCBI Gene Expression Omnibus. GSE170337

    Supplementary Materials

    Figure 1—figure supplement 2—source data 1. Count, mean diameter, and mean intensity of linear amplification and isothermal extension beads.

    Beads were incubated in fluorescent probes complementary to basal acrydite primer or poly-A or ATAC capture site. Absolute size of 1 SD is added in mean diameter and mean intensity columns.

    Figure 1—figure supplement 3—source data 1. Total counts and fluorescence-positive counts of beads incubated with fluoresceine amidite (FAM) oligonucleotide probe complementary to 1 out of 96 barcode possibilities.
    Figure 1—figure supplement 4—source data 1. Count, mean diameter, and mean background-adjusted intensity of beads incubated in 50 mM of DTT for 15 and 50 min as well as negative control.

    Absolute size of 1 SD is added in mean diameter and mean intensity columns.

    Supplementary file 1. "Molecular sequence description of HyDrop bead barcoding": visual description of progression of nucleotide sequences involved in every step of the barcoding process.
    elife-73971-supp1.docx (19.2KB, docx)
    Supplementary file 2. "Molecular sequence description of HyDrop-ATAC": visual description of progression of nucleotide sequences involved in every step of HyDrop-ATAC.
    elife-73971-supp2.docx (24.2KB, docx)
    Supplementary file 3. "Molecular sequence description of HyDrop-RNA": visual description of progression of nucleotide sequences involved in every step of HyDrop-RNA.
    elife-73971-supp3.docx (25.7KB, docx)
    Supplementary file 4. "Reagents and oligonucleotide list": list of all reagents and oligonucleotides used in HyDrop.
    elife-73971-supp4.xlsx (25.9KB, xlsx)
    Supplementary file 5. "Price calculation for HyDrop bead barcoding and HyDrop-ATAC": detailed price calculation for HyDrop bead barcoding and HyDrop-ATAC.
    elife-73971-supp5.xlsx (18.2KB, xlsx)
    Transparent reporting form
    Source data 1. Source data files for Figure 4a, Figure 5b–d, Figure 8a–c, Figure 8—figure supplement 2, and Figure 10b.
    elife-73971-data1.zip (51.7MB, zip)

    Data Availability Statement

    The data discussed in this publication have been deposited in NCBI's Gene Expression Omnibus and are accessible through GEO Series accession number GSE175684 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE175684) and on SCope (https://scope.aertslab.org/#/HyDrop/*/welcome). Source data files have been provided for Figures 2a, 3a, 3b, 3c and 3i, and can be regenerated data analysis tutorials for HyDrop. Data analysis tutorials for HyDrop are available on GitHub (https://github.com/aertslab/hydrop_data_analysis; copy archieved at swh:1:rev:059bf5a7779dc2894670ecf5f820c14bceb68493).

    The following dataset was generated:

    De Rop F, Ismail J, Bravo González-Blas C, Hulselmans G, Flerin C, Janssens J, Christiaens V, Poovathingal S, Aerts S. 2021. HyDrop: droplet-based scATAC-seq and scRNA-seq using dissolvable hydrogel beads. NCBI Gene Expression Omnibus. GSE175684

    The following previously published datasets were used:

    Macosko EZ. 2015. Drop-Seq analysis of P14 mouse retina single-cell suspension. NCBI Gene Expression Omnibus. GSE63472

    Kalish BT, Greenberg ME. 2020. inDrop scRNA-seq of mouse primary auditory cortex. NCBI Gene Expression Omnibus. GSE140883

    10x Genomics 2019. 10x scATAC-seq on flash frozen cortex, hippocampus, and ventricular zone from embryonic mouse brain (E18) 10x Genomics Resources. atac_v1_E18_brain_flash_5k

    10x Genomics 2019. 10x scATAC-seq on fresh cortex from adult mouse brain (P50) 10x Genomics Resources. atac_v1_adult_brain_fresh_5k

    10x Genomics 2017. 10x scRNA-seq on 1k brain nuclei from an E18 mouse. 10x Genomics Resources. nuclei_900

    10x Genomics 2017. 10x scRNA-seq on 2k brain nuclei from an adult mouse (>8 weeks) 10x Genomics Resources. nuclei_2k

    ENCODE Project Consortium 2021. ATAC-seq from MCF-7 (ENCSR422SUG) NCBI Gene Expression Omnibus. GSE169929

    ENCODE Project Consortium 2021. ATAC-seq from PC-3 (ENCSR499ASS) NCBI Gene Expression Omnibus. GSE170337


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES