Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2025 Jan 22.
Published in final edited form as: Adv Mater Technol. 2023 Dec 14;9(2):2301155. doi: 10.1002/admt.202301155

Magnetically functionalized hydrogels for high-throughput genomic applications

Evan Lammertse 1, Siran Li 2, Jude Kendall 3, Catherine Kim 4, Patrick Morris 5, Nissim Ranade 6, Dan Levy 7, Michael Wigler 8, Eric Brouzes 9,10,11,12
PMCID: PMC11029686  NIHMSID: NIHMS1953677  PMID: 38645306

Abstract

Single-cell genomics has revolutionized tissue analysis by revealing the genetic program of individual cells. The key aspect of the technology is the use of barcoded beads to unambiguously tag sequences originating from a single cell. The generation of unique barcodes on beads is mainly achieved by split-pooling methods, which are labor-intensive due to repeated washing steps. Towards the automation of the split-pooling method, we developed a simple method to magnetize hydrogel beads. We show that these hydrogel beads provide increased yields and washing efficiencies for purification procedures. They are also fully compatible with single-cell sequencing using the BAG-Seq workflow. Our work opens the automation of the split-pooling technique, which will improve single-cell genomic workflows.

Keywords: droplet microfluidics, single-cell genomics, split-pooling, automation, magnetic separation, BAG-Seq

1. Introduction

Single-cell sequencing is commonly used to investigate the genetic heterogeneity of cell populations. Identifying cell subpopulations within tissues can lead to the discovery of new cell types[1] and the deciphering of tumor evolution.[2] Droplet microfluidics has proven to be an invaluable method for single-cell sequencing due to its ability to isolate cells, modular nature, and high throughput.[3, 4] Droplet-based single-cell sequencing relies on encapsulating single cells with single barcoded beads in individual droplets.[5, 6] The barcoded bead captures the nucleic acids upon cell lysis. The sequences of the captured molecules are then appended to the barcode by either elongation or reverse transcription for DNA or RNA, respectively. The physical linkage, or tagging, assures the identification of the capturing bead, enabling the aggregation of sequences originating from the same single cell.

Barcodes are typically generated by split-pooling, which sequentially and randomly adds a short sequence to the growing barcode on beads.[6, 7] These beads can be solid[6] or polyacrylamide-based;[5, 7] the latter can be encapsulated without the limitation imposed by Poisson’s statistics.[8] Our recent BAG-Seq method also employs barcoding by split-pooling.[9] We encapsulate single cells into droplets containing functionalized oligomers, cells are lysed, and droplets are subsequently polymerized into balls of acrylamide gel (BAGs). The polymerizing gel network captures the cellular nucleic acids and oligonucleotides. Notably, the gel porosity allows reagents and polymerases to access nucleic acids, which enables direct split-and-pool barcoding of individual single-cell BAGs (scBAG).

Barcoding by split-pooling is thus central to single-cell genomics; however, it remains labor-intensive and would greatly benefit from automation because of its repetitive washes and buffer exchanges. The workflow is based on centrifugation, which presents inherent obstacles to its automation, such as bulkiness, complexity, and moving parts. In contrast, magnetic separation is an attractive approach. It is commonly employed in commercial automated nucleic acid purification systems, such as the BioRobot series[10] (Qiagen, Germany), KingFisher System[11] (Thermo-Fisher Scientific, USA), and MagNA Pure series[12] (Roche, Germany). Magnetization of BAGs would enable bulk separation and the automation of the split-pooling method. It would reduce labor costs, increase reproducibility, and improve sample processing throughput.

The formation of hydrogel beads benefits from droplet microfluidics, which offers precise control over gel particle size[13] and shape.[14] Particle homogeneity assures response uniformity to external stimuli such as magnetic fields. This aspect is particularly critical for efficient automation. Few examples of embedding magnetic particles in hydrogel beads using droplet microfluidics exist. They typically encapsulate small magnetic nanoparticles (5–20 nm)[13] or ferrofluids[14, 15] in various gel formulations and particle geometries. While basic properties are commonly reported for these magnetic microgels, a comprehensive characterization of their bulk separation performance and a practical application are lacking.

In this work, we produced magnetic BAGs (Mag-BAGs) by physically embedding 1 μm commercially available superparamagnetic microbeads within the gel matrix. We quantified the effect of the magnetic bead and monomer concentrations, BAG size, bead and buffer types, and PCR thermocycling on the bulk separation of Mag-BAGs. We identified key parameters and predictors of efficient magnetic separation. We directly compared the Mag-BAG collection yield with centrifugation after multiple wash cycles in both microtubes and microplates. Finally, we developed and validated Mag-BAG-Seq by analyzing a mixture of SKN-1 human fibroblast and SK-BR-3 human breast cancer cell lines via single-cell DNA sequencing. This demonstrates the compatibility of Mag-BAGs with the BAG-seq workflow. Mag-BAGs are compatible with commercial magnetic separation devices and suitable for single-cell encapsulation, genomic library preparation, and single-cell sequencing. This work paves the way for efficient automation of the labor-intensive split-pooling in general and single-cell sequencing using the BAG-seq workflow in particular.

2. Results and Discussion

2.1. Mag-BAG generation.

We generate Mag-BAGs with a droplet generator that includes a co-flow for two aqueous solutions.[9] The first solution contains the single-cell suspension. The second solution contains the hydrogel component, the cell lysis buffer, and the magnetic beads. The oil phase contains the PEG-PFPE surfactant[16] dissolved at 2% weight in fluorinated oil HFE7500 and TEMED 0.4% (v/v) to catalyze the polyacrylamide polymerization. After encapsulation, we incubate Mag-BAGs overnight at 50 °C for gelation (Fig. 1). We manipulate Mag-BAGs in aqueous buffers and process them via the split-pool method[5, 6] using a series of washing and centrifugation steps.

Fig. 1. Mag-BAG workflow.

Fig. 1.

A co-flow droplet generator is used to encapsulate the combination of a single-cell suspension with functionalized hydrogel components, lysis buffer, and magnetic beads. After incubation, the hydrogel forms, capturing genomic DNA via Acrydite primers and physically embedding 1 μm magnetic beads. The Mag-BAGs can be separated on a magnetic stand to create and maintain a flat pellet during aspiration. In contrast, centrifugation interrupts the workflow and creates pellets concentrated at the bottom of microtubes and wells.

We optimized gelation conditions by embedding magnetic beads in different acrylamide gel concentrations by varying the monomer concentration (%T). We first observed that hydrogel beads are reliably formed above 3.8 %T (Fig. S1). Second, we investigated the impact of the monomer concentration on the separation performance. For each sample, we prepared a 500 μL 20% (v/v) suspension of Mag-BAGs in 6X SSC buffer, recorded a video of the separation, and extracted the kinetics and separation timescales (Fig. S1e). We established that the separation time was inversely correlated with the monomer concentration. The separation was complete at 300 s for 6.2 %T, which we used in all subsequent experiments (Supplemental Information 1.1).

Bead variety strongly affects the separation performance of Mag-BAGs, as demonstrated by head-to-head comparison of 1 μm and 500 nm magnetic beads. 500 nm beads sediment more rapidly, which creates difficulties in maintaining a uniform concentration of beads over the Mag-BAGs population. Depending on the buffer, the smaller beads also tended to remain aggregated after magnetization. Finally, they provide a lower magnetic force per unit weight than the 1 μm beads (Fig. S2, Supplemental Information 1.2).

In this report, we used streptavidin-functionalized beads because they are readily available and would not interfere with nucleic acids. Experiments proved that their performance is similar to un-functionalized beads provided by the manufacturer (Fig. S3ab, Supplemental Information 1.3). This result allows multi-functionalizing Mag-BAGs to capture additional molecules with capture moieties tethered to the magnetic beads. We also observed that the magnetic separation of Mag-BAGs was not affected by the high salt concentration of the 6X SSC buffer (Fig. S3cd, Supplemental Information 1.4) or 40 cycles on a thermocycler (Fig. S3ef, Supplemental Information 1.5). We noticed that thermocycling could induce a magnetization loss for other magnetic beads. This observation further emphasizes the need to select the appropriate bead type for specific applications. We used the 1 μm superparamagnetic beads for all other experiments because of their better overall performance. Those results also indicate that Mag-BAGs embedded with the 1 μm superparamagnetic beads used here are compatible with various bead functionalization, buffer types, and thermocycling.

2.2. Effect of weight bead concentration and Mag-BAG size on separation rate and pellet shape.

BAG-seq automation requires a fast separation rate and a pellet shape that enables efficient supernatant aspiration. Here, we investigated the effect of bead count and Mag-BAG size on these metrics. We generated a series of Mag-BAGs with 100 μm, 65 μm, and 45 μm diameters, each at five different bead weight concentrations doubling from 0.14 to 2.3 mg/mL.

The separation timescale at fixed Mag-BAG size decreased with increasing magnetic bead concentration. For 100 μm Mag-BAGs, the separation was complete within 60 s at 2.30 and 1.15 mg/mL (Fig. 2ac, green and red). Below 0.59 mg/mL, the separation timescale increased substantially with decreasing concentrations (Fig. 2d).

Fig. 2. Higher bead concentration reduces separation time and narrows the pellet shape at a constant Mag-BAG diameter.

Fig. 2

a) Video frames showing bulk separation progress at t=60 s of 2.3 (red), 1.15 (green), 0.58 (blue), 0.29 (gold), and 0.14 (purple) mg/mL samples. At 1.15 and 2.3 mg/mL, magnetic separation is complete within 60 seconds. Scalebar = 5mm. b) The Last frame of videos analyzed in panel (b) shows the final pellet shapes. Higher bead concentrations yield flatter vertical pellets. Scalebar = 5mm. The asterisks indicate where Mag-BAGs accumulate due to the effect of gravity. c) Bulk separation progress of 100 μm Mag-BAGs (6.2 %T) at 0.14, 0.29, 0.58, 1.15, and 2.3 mg/mL bead concentration. Curves and error bars represent the mean and standard deviation of 3 video replicates. d) Timescale τ of the separation curves as a function of bead concentration and BAG size. τ decreases with increasing bead concentration indicating a faster separation with increasing bead concentration. τ becomes independent of Mag-BAG size at higher bead concentrations. Curves and error bars represent the mean and standard deviation of at least 3 video replicates.

The separation timescale at fixed magnetic bead concentration increased with decreasing Mag-BAG sizes (Fig. 3d). At 1.15 mg/mL, the separation was only complete at 240 s for the 45 μm case compared to 60 s for the 100μm case (in Fig. 3ac, gold). Notably, the separation timescale became less sensitive to the Mag-BAG diameter with increasing bead concentration (Fig. 3d). Indeed, at 2.3 mg/mL, the timescale became independent of the Mag-BAG diameter, indicating a regime where the magnetic force largely dominated the drag force.

Fig. 3. Larger Mag-BAG diameter reduces separation time at constant bead concentration.

Fig. 3

a) Bulk separation progress of 100 μm, 65 μm, and 45 μm diameter Mag-BAGs at 1.15 mg/mL bead concentration. Curves and error bars represent the mean and standard deviation of 3 video replicates. b) Video frames showing bulk separation progress at t=60 s of 100 μm (gold), 65 μm (blue), and 45 μm (purple) Mag-BAG samples. Final separation time increases from about 60 s for the 100 μm case to about 150 s and 300 s for the 65 μm and 45 μm cases, respectively. Scalebar = 5mm. c) The Last frame of videos analyzed in panel (b) shows the final pellet shapes. Pellet shape is unaffected by BAG size at 1.15 mg/mL bead concentration. Scalebar = 5mm. d) timescale τ as a function of bead concentration and Mag-BAG size. τ becomes independent of Mag-BAG size at higher bead concentrations (above 2.3 mg/mL). Curves and error bars represent the mean and standard deviation of at least 3 video replicates.

Indeed, magnetic and viscous drag forces scale differently with the Mag-BAG diameter. On the one hand, the magnetic force is proportional to the number of embedded magnetic beads and scales with the bead weight concentration (Supplemental Information 1.6). Thus, increasing the bead concentration increases the magnetic force and decreases the timescale. At fixed concentration, the bead count per Mag-BAG scales linearly with the Mag-BAG volume or the cube of its diameter. On the other hand, the Stokes drag force on a spherical Mag-BAG scales linearly with the diameter Fd=3πμDν (4), where μ is the dynamic viscosity of the buffer, v is the separation velocity of the Mag-BAG, and D is the Mag-BAG diameter. Thus, the increase in magnetic force with increasing diameter exceeds the corresponding increase in the viscous drag force, resulting in a decrease in timescale.

Reducing the Mag-BAG size is necessary for specialized applications such as fluorescence-activated cell sorting, designed to sort particles in the 10–30 μm range efficiently. We could use a higher concentration of magnetic beads to maintain a suitable separation timescale for smaller Mag-BAGs. We also noted that for decreasing BAG size, a fixed bead count per BAG increased the relative effect of the magnetic force and thus made more efficient use of the beads in achieving a desired timescale. More generally, smaller Mag-BAGs offer cost reduction at scale by lowering reagent consumption per droplet and processing cost per cell. The most cost-effective design would minimize Mag-BAG size and bead concentration for the appropriate timescale.

The pellet shape is important because it affects the washing efficiency and yield, ease of use, and robustness to variation in pipette position. The ideal pellet is pulled flat against the tube wall without sedimented BAGs. This shape allows for optimal distance between the pipette tip and the pellet. It enables complete supernatant aspiration with minimal pellet disturbance and increases washing efficiency. In automated applications, this shape makes the separation more robust to position variability of the pipette tip by removing the pellet from the tip’s vertical path. For 100 μm Mag-BAGs, we achieved the ideal shape for 2.3 and 1.15 mg/mL magnetic bead concentrations (Fig. 2b, red and green). At lower concentrations, hydrogel particles accumulated at the bottom of the tube due to the effect of gravity (Fig 2b, asterisks in blue, gold, and purple). At a constant bead concentration of 1.15 mg/mL, the pellets for 100 μm, 65 μm, and 45 μm Mag-BAGs displayed an ideal shape (Fig. 3b). Smaller Mag-BAGs could generate the ideal pellet shape at a lower bead count. Their smaller sizes led to a reduced sedimentation rate and a reduced drag force, allowing them to move sideways faster.

2.3. Magnetic separation offers improved yield over centrifugation after repeated wash cycles.

We compared the separation yield of Mag-BAGs (100 μm diameter, 2.3 mg/mL bead concentration) by magnetic separation and centrifugation. We measured the yield as the percent volume loss of a sedimented Mag-BAG layer in 500 μL 20% (v/v) suspensions after ten wash cycles (two split-pool steps require fifteen wash cycles). We considered two use cases: i) 1.5mL microcentrifuge tubes and ii) 96-well PCR plates. We used both 6X SSC and PCR buffers because both apply to different stages of the BAG-seq workflow.[9] Also, the higher viscosity of 6X SSC generates higher local shear forces, which could increase the loss of BAGs during buffer aspiration. We evaluated N=3 replicate samples for each separation method, buffer, and container type combination.

The magnetic separation resulted in greater yield for wash cycles performed in 1.5mL tubes, and the buffer types did not make a difference. Using a two-way ANOVA, we found a statistically significant difference in percent volume loss between magnetic separation and centrifugation (F(1)=20.804, p<0.001) but not between 6X SSC and PCR buffers (F(1)=2.035, p=0.192) or for the interaction between separation and buffer types (F(1)=0.792, p=0.399). Critically, we aspirate 400 μL from 500 μL of a 20% v/v BAG suspension, leaving no remainder supernatant. Thus, we conclude that magnetic separation offers improved yield over centrifugation with either buffer using 1.5 mL tubes. Magnetic separation improves washing yield by allowing a greater volume to be aspirated thanks to the pellet shape produced by a side magnet configuration.

The 96-well PCR plate format provided a better yield than microtubes for both centrifugation and magnetic separation (Figs. 4b,d). In 96-well PCR plates, magnetic separation performed better than centrifugation overall. A two-way ANOVA found no statistically significant difference in percent volume loss between separation methods (F(1)=4.384, p=0.07), buffer type (F(1)=1.038, p=0.338), or their interaction (F(1)=0.183, p=0.680). The key benefit to magnetic separation is that centrifugation exhibited a higher variability between repeats in these conditions. We attribute this variability to the reduced precision in positioning the multichannel pipette tips relative to the BAG pellet located at the bottom of the well. Aspiration from 96-well plates is more prone to accidental disturbances of the BAG pellet when using centrifugation. The advantage of the pellet shape with magnetic separation was particularly apparent when using 96-well plates. We can apply the tips of the multichannel pipette along the opposite wall of the undisturbed Mag-BAG pellets. In addition, magnetic separation maintains the pellet’s aggregation force during aspiration, contributing to a higher pellet integrity.

Fig. 4. Magnetic separation offers improved yield over centrifugation after repeated wash cycles (100 μm diameter, 2.3 mg/mL bead concentration).

Fig. 4

a, c) Regions of interest (ROIs) indicating BAG pellet boundaries before and after 10 separation/wash cycles performed in (a) 1.5mL microcentrifuge tubes and (b) 96-well PCR plates. BAGs (no magnetic beads) were separated via centrifugation (top), and Mag-BAGs (2.3 mg/mL 1 μm magnetic beads) via magnetic separation (bottom). Washes were performed with 6X SSC (left) and PCR buffers (right). b,d) Percent volume loss for each condition of suspension buffer and separation method performed in (b) 1.5mL microcentrifuge tubes and (d) 96-well PCR plates. The buffer type does not significantly affect the recovery efficiency in all cases. However, magnetic separation exhibits a significantly (p < 0.001) higher recovery efficiency than centrifugation in 1.5mL microtubes. Recovery was also higher for magnetic separation than centrifugation in 96-well PCR plates, but the difference was not statistically significant (p = 0.07).

2.4. Magnetization does not affect BAG-seq sequencing data quality.

We would expect the mere incorporation of magnetic beads into BAGs, thus creating Mag-BAGs, and the use of magnetic versus centrifugal separation would not produce differences in the data quality arising from the BAG-seq workflow. Nevertheless, we tested the two methods for our most frequent application: detection of segmental amplifications or deletions of the genome in cancer and normal cells. We compared the data from BAGs and Mag-BAGs using a similar workflow (see Materials and Methods).

In brief, both BAG and Mag-BAG protocols used in this experiment are based on the original BAG-seq workflow,[9] where Acrydite primers captured the genomic content of each single cell copolymerized into the gel ball matrix. In addition, for the Mag-BAGs protocol, we added 1 μm magnetic beads into droplets. The only other difference between these protocols is that we used magnetic separation for the Mag-BAGs and a standard centrifugation step for the BAGs during the pool-and-split steps. Two rounds of pool-and-split steps were carreid out for each protocol. After the genomic DNA copies are captured and replicated, they are cut using the enzyme NLAIII to achieve a universal 3 ‘overhang sequence CATG. Each round introduces 96 different cell barcodes, generating 96*96 = 9216 cell barcode varieties. The first pool-and-split also introduces four bases of random sequences, the varietal tag, which, combined with the mapped sequence, produces a template-specific barcode unique to each template.

After sequencing using Illumina NovaSeq, we analyzed the sequencing data using a pipeline described in the original BAG-seq paper.[9] We extracted the cell barcode and template-specific tag information from the fastq files. Subsequently, we mapped the read pairs to the human reference genome using HISAT2.[17] Reads with the same varietal tag, mapped sequence, and cell barcode were identified as originating from the same template.

The cells were a mixture of SKN-1, a cultured human skin fibroblast, and SK-BR-3, a cultured human female breast cancer. The genome of cancerous SK-BR-3 cells has a characteristic CNV profile, while the SKN-1 cells are normal fibroblast cells with a diploid male genome. Each cell type has unique single nucleotide variants (SNVs), allowing their identification and validation of methodologies (see Materials and Methods). Whereas we would ordinarily go through three rounds of pool and split for creating BAG tags, in these experiments, we only used two rounds, increasing BAG tag “collisions.” However, that does not affect the comparison of methods. We used already established protocols by our group for copy number analysis.

We compared the sequence data from the two methods by the following criteria: number of templates per BAG for a given read depth; fragment length distribution of templates; extent of diffusion between BAGs of cell type-specific SNVs; correctness of subpopulation identification; and consistency of the copy number profiles obtained by the two methods. By all these criteria, the two methods are within 10–20% of each other, if not indistinguishable.

We have two libraries, one from BAGs, A, and one from Mag-BAGs, B. After provisional cell type assignment is determined by majority of either SKN-1 or SK-BR-3 SNVs, we determine which Mag-BAG tags and BAG tags represent captured cells using the 2nd derivative of smoothed ranked read count curves for SKN-1 and SK-BR-3 cells separately, and present them in their respective cumulative distribution plots (Fig. 5a). The resulting data set for Mag-BAGs has 3,472 tags with cells totaling 281,835,031 reads; and for BAGs 2,583 tags with cells totaling 341,867,689 reads. In order to facilitate comparison between the libraries, we downsampled the number of tags with cells in the Mag-BAG library to match that in the BAG library. Then we downsampled the total number of reads in the BAG library to match the total read count in the downsampled Mag-BAG data set, resulting in two libraries, A’ and B’, each with 2,583 tags with cells totaling 207,763,305 reads.

Fig. 5. Comparisons between BAG and Mag-BAG performance from a single-cell SKN-1/SK-BR-3 mixture experiment.

Fig. 5.

a) Cumulative distribution plots indicating the cut-offs (red line) for SKN-1 and SK-BR-3 cells for both protocols. The SKN-1 and SK-BR-3 cells were separately plotted and distinguished by abundant SNPs for each BAG-barcode. b) The three populations in each condition indicating SKN-1 (blue), SK-BR-3 (red), and collisions (green). c) After downsampling to the same number of cells and total reads for both conditions, the distribution of the number of templates per BAG is shown in a histogram. d) The histogram displays the size distribution of the templates based on the mapping information for all mapped read pairs in each condition. e) The heatmaps showing the copy number profiles of randomly-sampled 100 SKN-1 and 100 SK-BR-3 cells for each condition. f) The aggregated copy number profiles for all the SK-BR-3 cells for each condition showing very similar profiles.

We identified tags with a single cell type using thresholds of greater than 98% SK-BR-3 SNVs or greater than 70% SKN-1 SNVs. The other tags are considered “mixed” tags (Fig. 5b). In these data sets, we have for Mag-BAGs: 1,883 SK-BR-3 tags, 245 SKN-1 tags, and 455 mixed tags. For BAGs, we have 2,270 SK-BR-3 tags, 102 SKN-1 tags, and 211 mixed tags. The variation in the proportion of SKN-1 cells undoubtedly represents user pipetting variability. Both BAGs and Mag-BAGs have a high collision rate, as seen in the population with “mixed” SNVs, and this is due to performing only two rounds of pool-and-split. The ratio of SNVs for threshold identifiers is skewed because the SK-BR-3 cells have a higher ploidy, there are more SK-BR-3 cells than SKN-1, and there are about fourfold more SK-BR-3 SNVs detected than SKN-1 SNVs.

First, we computed the number of templates (template tags) per BAG for the two protocols, restricted to the downsampled libraries, A’ and B’. These are shown as distributions in Figure 5c. The distributions are very similar, and the median number of templates was 35014 per BAG and 33123 per Mag-BAG, within 10%. This result demonstrates similar capture and tagging from individual genomes, irrespective of the protocol. Second, we examined the size distribution of the templates for each protocol. They are virtually indistinguishable, as seen in Figure 5d. This result demonstrates that the biochemical processing of cellular DNA (priming, elongation, and cleavage) proceeds equivalently in the two environments. Third, we examined the diffusion rates in the two libraries (see Materials and Methods), that is, the contamination of one BAG with genomic fragments from another BAG. To do this, we computed the ratio of SK-BR-3 to SKN-1 SNVs within the BAGs identified as primarily SKN-1 for each BAG or Mag-BAG and then took the mean. The mean ratios are 0.128 and 0.122, from BAGs and Mag-BAGs, respectively, within 5% of each other. This result demonstrates that the diffusion of lysed cellular nuclear DNA between the BAGs for the two protocols is similar.

Next, we clustered BAGs by their genomic copy number features and derived the copy number profiles of each cluster for each of the two protocols. To cluster by genomic features, we first used the known copy number profile of the two components, the normal diploid and the tumor cell line. We used the boundaries of the SK-BR-3 copy number profile as the boundaries of the bins used in clustering. We took 100 cells from each SKN-1 and SK-BR-3 BAGs and combined them similarly for the Mag-BAGs. Reads are assigned to each bin for each BAG. We use a multinomial expectation maximization algorithm (EM) that will be described in another manuscript to cluster the populations (Fig. 5e). In each case, the clustering reproduced the identity determined by SNVs, without any error of assignment. We derived the copy number from each cluster, compared them for the two protocols, and saw only very minor differences (Fig. 5f).

We conclude that magnetic beads in Mag-BAGs do not affect the ability to distinguish between the two cell lines using BAG-seq. Mag-BAG is suitable for the general application of BAG-seq and split-pooling. These results support the purpose of our work, which is to enable us to robotize the BAG protocol. Magnetic field separation should be easier to robotize than centrifugation or filtration. Currently, the protocol is manual and labor intensive, particularly the pool-and-split steps.

Consequently, we are limited in the number of unique BAGs we can produce, and the process is prone to user error. Our objective is to reduce labor, increase reproducibility, and increase the number of cells that can be uniquely tagged from tens of thousands to millions through larger splits at the pool-and-split stage. The ultimate applications are in the analysis of tumor biopsies or the analysis of circulating blood components in healthy persons and patients with leukemia, blood dyscrasias, infections, or inflammation.

3. Conclusion

In this work, we developed magnetized balls of acrylamide gel (Mag-BAGs) by physically embedding commercially available superparamagnetic microparticles. We achieved consistent and repeatable BAG magnetization that allows bulk separation and future automation of critical workflows such as split-pooling. Magnetization, separation performance, and molecular applications strongly depend upon the physical characteristics of the magnetic particles. We identified 1 μm superparamagnetic particles that perform robust separation for typical buffer exchange and thermocycling. Larger BAGs and higher bead concentrations result in faster separation due to the differential scaling relationships of the magnetic force and viscous drag with the Mag-BAG radius. Above a critical bead concentration, the separation timescale becomes independent of the Mag-BAG size. Here, we reach the maximum utility of increased bead count per BAG. We can thus optimize the total bead content per Mag-BAG to achieve a desired separation rate and pellet profile to reduce bead and reagent costs.

Mag-BAG washing via magnetic separation offers improved yield to centrifugation in both 1.5mL tubes and 96-well plates, proving more robust to user and system variability. Mag-BAGs are fully compatible with single-cell sequencing workflows. We used Mag-BAGs to efficiently distinguish cells from a mixed cell population using the BAG-Seq protocol. Mag-BAGs exhibited similar sequencing performance as non-magnetized BAGs.

In conclusion, we increased the functionality of hydrogel beads by embedding magnetic beads using a straightforward method. These Mag-BAGs provide increased yields and washing efficiencies for purification procedures. Our work enables automation to replace the labor-intensive methods required for split-pooling, and we expect it will have a lasting impact on single-cell genomic workflows.

4. Experimental section

4.1. Microfluidic designs

Microfluidic droplets containing a bead suspension and a gel monomer solution were generated using a 2-layer co-flow design (see the CAD file in Supplemental Information). For the BAG magnetization experiments, three different microfluidic circuits were designed for nominal BAG diameters of 100 μm, 65 μm, and 45 μm. A schematic of the circuits is shown in Fig. S4. Relevant channel dimensions are tabulated in Table S1. For the scDNA experiment, a commercially sourced Drop-seq device (Nanoshift, Emeryville, CA, USA) was used.[18]

4.2. Microfabrication

Microfluidic devices were fabricated in PDMS (Sylgard 184 Silicone Elastomer Kit, Dow Corning, Midland, MI) using soft lithography.[19] We used an established protocol to create a double-layer mold.[20] We designed the microfluidic circuits using the AutoCAD software (AutoDesk, San Francisco, CA, USA) and had them printed onto mylar masks (CAD/Art services, Bandon, OR, USA). We spincoated a negative photoresist (SU-8, Kayaku Advanced Materials, Westborough, MA, USA) onto 3” silicon wafers. We used a UV mask aligner (500W UV illuminator, Newport, Irvine, CA) to pattern the designs onto the photoresist. We developed the structures using PGMEA (Kayaku Advanced Materials, Inc). To facilitate PDMS release, we silanized the mold surfaces via overnight vapor deposition of a fluorinated silane compound (1H, 1H, 2H, 2H-Perfluorooctyltrichlorosilane, Gelest, Morrisville, PA). We mounted the mold onto a custom acrylic jig to ensure consistent PDMS slab thickness. PDMS was mixed with crosslinker at a 10:1 (polymer:crosslinker) weight ratio, degassed under vacuum, and poured before curing for 1 hour at 65 °C. We punched the inlet and outlet ports with a biopsy tool (Syneo, West Palm Beach, FL, USA). PDMS slabs were bonded to 50 mm x 75 mm glass slides spin-coated with a 150 μm thick layer of PDMS to ensure the same surface properties of all four channel walls. Bonding was achieved after surface activation via 1 minute exposure to oxygen plasma (PDC-32G, Harrick Plasma, Ithaca, NY). To ensure preferential wetting of fluorinated oil, channels are surface-treated with a solution at 5% (v/v) of (heptadecafluoro-1,1,2,2-tetrahydrodecyl)trichlorosilane (Gelest) in HFE-7500 (3M, St. Paul, MN, USA) for 5 minutes right after bonding. The solution was then flushed with pure HFE-7500 oil.

4.3. Fluid compositions

Fluid compositions are based on the scDNA BAG-seq protocol of Li et al,[9] with slight modifications for investigating BAG magnetization. Beads were added to the water component volume in the aqueous phase 2. For optimization experiments (i.e., without cells), we replaced the Linker TG primer and Proteinase K components with equal volumes of water. Otherwise, fluid compositions were taken from the scDNA BAG-seq protocol. [9]

The oil phase consisted of 0.4% (v/v) TEMED (1610801, Bio-Rad, Hercules, CA, USA) in HFE-7500 (3M) that contained 2% (w/w) PFPE-PEG surfactant.[21] TEMED in the oil phase catalyzed the polymerization of the acrylamide gel within droplets. Aqueous phase 1 consisted of 0.05% (w/v) BSA in 1X PBS.

For scDNA applications, the aqueous phase 2 was prepared in 500 μL batches containing 90 μL Acrylamide/Bis 19:1, 40% w/v (A9926, Sigma-Aldrich, St. Louis, MO), 64.5 μL Acrylamide 40% w/v (A4058, Sigma-Aldrich, St. Louis, MO), 135.5 μL bead suspension in H2O, 80 μL 500 μM Linker TG primer /5ACryd//iSp18/TGTGTTGG GTGTGTTTGGKKKKKKKGKKKKKKKKNN, (Integrated DNA Technologies, Coralville, IA, USA), 25 μL EDTA 0.5M (AM9260G, Thermo Fisher Scientific, Waltham, MA, USA), 50 μL 1M Tris pH 7.5 (15567–027, Invitrogen), 5 μL 20% (w/v) Sarkosyl (L7414, Sigma-Aldrich) in H2O, 10 μL Proteinase K (P4850, Sigma-Aldrich), 10 μL 0.1M DTT (707265ML, Thermo Fisher Scientific, Waltham, MA, USA), and 30 μL 10% (w/v) ammonium persulfate (09913–100G, Sigma-Aldrich) in H2O.

We investigated three types of superparamagnetic beads: 1 μm streptavidin-functionalized beads (S1420S, New England Biolabs, Ipswich, MA, USA), 1 μm blank beads (i.e. non-functionalized, but otherwise identical to S1420S from New England Biolabs, provided as a special request), and 500 nm azide-functionalized beads (MGB-AZD-10–10, Luna Nanotech, Markham, ON, Canada). Beads were washed three times in wash buffer (0.1M PBS pH7.4) using a magnetic separation stand (Z5332, Promega) before being resuspended in H2O.

4.4. Experimental setup

We injected solutions using 1 mL glass syringes (Gastight #1001, Hamilton, Reno, NV, USA), mounted on syringe pumps (NE-300, New Era Pump Systems, Farmingdale, NY, USA). The fluids were delivered to the devices via PEEK tubing (0.254 mm ID, 0.787 mm OD, Zeus, Orangeburg, SC, USA). We monitored on-chip droplet generation using an inverted brightfield microscope (Diaphot-TMD, Nikon, Tokyo, Japan) equipped with a camera (XCD-V60, Sony, Tokyo, Japan). Droplet and BAG images were captured with 10x (Plan Ph1 10/0.3 DL 160/0.17, Nikon) and 20x (Plan Ph2 20/0.4 160/1.2 ELWD, Nikon) objective lenses using a custom LabVIEW software (National Instruments, Austin, TX, USA).

4.5. BAG generation

When generating BAGs for the magnetization and yield experiments, the aqueous phases 1 and 2 were pre-mixed at 1:1 (v/v) into a 1mL total volume and injected using a single syringe. Oil and aqueous phases were injected at 468.2 μL/hr and 323.1 μL/hr, respectively. When using 500 nm beads, these flow rates were doubled to reduce sedimentation in the syringe (see Section 2.3). The emulsion was collected into 1.5mL microcentrifuge tubes containing 300 μL mineral oil (330779, Sigma Aldrich) for a minimum of 1 hour.

4.6. Incubation and BAG release

After collection, samples were incubated at 50°C overnight to ensure complete polymerization of the gel matrix. After incubation, the HFE-7500 oil layer at the bottom of the sample tube was aspirated with a 22G needle and replaced with a solution of 2% (w/w) PFPE-PEG surfactant[21] in FC-40 fluorinated oil (F9755, Sigma Aldrich). Samples were then incubated at 95°C for 12 minutes, at 55oC for 1 hour, and at room temperature for 10 minutes. We used a 22G needle to remove both the bottom layer of FC-40 oil and the top layer of mineral oil. The dense emulsion layer was then resuspended in 600 μL 6X SSC buffer. We added 150 μL of 1H,1H,2H,2H-Perfluorooctanol (370533, Sigma Aldrich) and manually shook the tube for 10 seconds to break the droplet interfaces and release the balls of acrylamide gel (BAGs). The tube was centrifuged at 1000 rcf for 1.5 minutes before removing the bottom oil and top buffer layers with a 22G needle. We again washed the BAGs using 600 μL 6X SSC buffer and centrifuged at 1000 rcf for 1.5 minutes.

4.7. Sample preparation

500 μL 20% (v/v) Mag-BAG suspensions in 1.5mL microcentrifuge tubes were prepared for recording magnetic separation videos. We ensured equal volumes of the sedimented BAG layer (100 μL) for each sample. Before separation, the BAG suspension was agitated via pipette aspiration to break up BAG clusters and ensure a uniform dispersion.

4.8. Magnetic separation video capture

Mag-BAGs in suspension were separated on a two-tube magnetic separation stand (Z5332, Promega) using the default side-magnet configuration. The separation was recorded using a USB microscope/camera (#1061, Adafruit, NY, NY, USA). A dark backdrop was placed behind the tube, and a 3.75” square LED backlight (LED-SP, Amscope, Irvine, CA, USA) was placed between the magnetic stand and the backdrop (Fig. S5a). The backlight was covered with a mylar mask with a 0.6 mm wide transparent slit to illuminate the sample with a vertically structure light (Fig. S5b). Videos were recorded in a dark room using the AMCap software, producing 640×480 pixel RBG videos in .avi format. The setup allowed for a clear contrast between the suspended Mag-BAGs and the background buffer during separation. Videos are available upon request.

4.9. Video post-processing and separation curve generation

The procedure for generating separation progress curves from video is outlined in Fig. S6. The separation progress was monitored by comparing the mean gray value of the Region Of Interest (ROI) that defines the final pellet and the rest of the solution (void area). Video .avi files were imported to ImageJ/Fiji[22] as a virtual time stack and cropped to a rectangle, bounding the tube tip, the meniscus bottom, and the side edges. The red channel’s 8-bit greyscale .tif stack was generated to improve the contrast between the separated Mag-BAG pellet and the “void area”. “Enhance Contrast” was then applied with the “Normalize” setting and 20% pixel saturation. The average background grey value inside the void area was then subtracted from the image before applying “Enhance Contrast” again with the same settings.

ROIs bounding the pellet and void areas were established using the last slice of the time stack: 1) the polygon tool was used to manually draw an ROI that defined the entire interior perimeter of the solution (including the pellet); 2) a ROI was drawn to define the void area with the same method; 3) these two ROIs were combined with the “XOR” command, and then the “Split” command was used to generate the pellet ROI. The resulting pellet and void area ROIs formed two subdivisions of the full tube interior.

A custom ImageJ macro measured the mean grey value inside these ROIs and calculated the difference between these values (the “raw contrast” C) for each slice in the time stack. These contrast values were then normalized to the minimum and maximum values:

Cnorm=CCminCmaxCmin (1)

The evolution of Cnorm across the time stack was an exponential asymptote from 0, reflecting the homogenous distribution of Mag-BAGs in suspension, to 1, reflecting the complete magnetic separation of Mag-BAGs into a pellet along the side of the tube. We thus modelled Cnorm(t) as:

Cnormt=1A(etτ) (2)

We used MATLAB’s fit() function with “fitType” parameter “exp1” to fit A and τ to (1Cnorm) from the time-array of Cnorm. Here, τ is the separation timescale; we used this parameter as a metric of separation time performance (see Results).

4.10. Post-thermocycling separation experiment

We recorded a baseline magnetic separation video of a 500 μL 20% (v/v) BAG suspension in 1X standard Taq reaction buffer (B9014S, New England Biolabs). Each sample was then split into two PCR microtubes to accommodate their lower volume. Each tube was then subjected to the following thermocycle sequence: step 1) 1X of 72°C for 300s, step 2) 94°C for 30s, step 3) 40 cycles of 94 °C for 10s, followed by 63°C for 30s, followed by 72°C for 60s. After thermocycling, the two tubes for each concentration were recombined into a 1.5mL microcentrifuge tube to which 300 μL of 1X PCR buffer was added to restore the original 500 μL 20% (v/v) sample volume and concentration. Post-thermocycling separation videos were then recorded.

4.11. Magnetic separation yield experiment (1.5mL microcentrifuge tubes)

To compare the magnetic separation yield performance of Mag-BAGs against centrifugation-based separation in 1.5mL microcentrifuge tubes, we generated 100 μm BAGs and Mag-BAGs at 2.3 mg/mL bead concentration. For each sample, we prepared four sets of 3 replicates 500 μL 20% v/v samples: i) bead-free BAGs in 6X SSC buffer, ii) Mag-BAGs in 6X SSC buffer, iii) bead-free BAGs in 1X PCR buffer, and iv) Mag-BAGs in 1X PCR buffer.

We compared the separation performance by measuring the percent volume loss of the BAG pellets between a pre-separation baseline and the remaining volume after 10 wash cycles. We first recorded the baseline volume after concentrating Mag-BAGs with a bottom-magnet and allowing BAGs to sediment. For the bead-free BAG samples, we separated BAGs by centrifuging at 1000rcf for 1.5 minutes. For the Mag-BAG samples, we separated BAGs for 30 seconds on a magnetic separation stand (25332, Promega) with the default side-magnet configuration.

To aspirate 400 μL of supernatant, we placed sample tubes on the separation stand and monitored them live using the video-capture setup. Using the video feed as a guide, we carefully inserted a 1mL micropipette into the tube to the 0.3mL mark and then slowly aspirated 400 μL of supernatant, slowly lowering the pipette tip as needed to ensure that the tip stays submerged while also not disturbing the BAG pellet. We then resuspended in 400 μL of fresh buffer and repeated the cycle. After 10 separation/wash cycles, we recorded the post-separation volume.

4.12. Magnetic separation yield experiment (96-well PCR plates)

Using the same conditions as the separation in 1.5mL microcentrifuge tubes, we performed the separation directly in a 96-well plate using a magnetic separator designed for PCR plates (12331D, Invitrogen) and a centrifuge.

After recording the baseline BAG pellet volumes, we diluted each sample to 1mL (10% v/v) and split them into 8 wells of a 96-well PCR plate. For each cycle, we placed the 96-well PCR plate on the magnetic separator for 2 minutes (for the Mag-BAGs) or centrifuged the plate at 1000rcf for 2 minutes (for the bead-free BAGs). We aspirated 85 μL of buffer from the top of each well simultaneously using an 8-channel multipipette and then replaced it with 85 μL of the appropriate buffer. We pressed the pipette tips against the wall of the wells opposite the magnet-side wall, slowly lowered the tips just below the fluid surface, and gently aspirated while slowly lowering the tips to keep pace with the descending fluid surface. After 10 cycles, we recombined the wells from each sample’s row into a 1.5mL microcentrifuge tube, centrifuged these tubes at 1000rcf for 1.5 minutes, and aspirated 500 μL of supernatant buffer to restore the starting 500 μL 20% (v/v) sample volume and concentration. Finally, we recorded the post-separation pellet volume and estimated the percent volume loss.

4.13. Estimating BAG pellet volume loss

To estimate the percent volume loss of the BAG pellets, we imported the pre- and post-separation video .avi files recorded by AMCap to ImageJ/Fiji for volume analysis. Using the polygon tool, we manually drew ROIs around the sedimented BAG layers in the last frame of each video. We then estimated the baseline and post-separation pellet volumes as volumes of revolution by applying Pappus’ second centroid theorem:

V=A*d (3)

Where A is the area to be revolved around the axis of revolution, and d is the distance traveled by the centroid of that area over the course of a 360° revolution about the axis. We used a custom ImageJ macro to estimate the volume of revolution for each ROI. First, a blank image was created with height and width equal to that of the ROI’s bounding box. We then defined a new ROI on this image with a flat top surface, using the points that defined the original ROI but replacing the vertical coordinate for each point that defined the top surface with the averaged value among all these points. We then defined the revolution axis as a vertical line down the middle of the image and split the ROI into left and right halves about this line using the “XOR” and “Split” commands. We then estimated the BAG pellet volume as the average of the two volumes of revolution calculated for each half using eq. (3) above, with d=2π*r where r is the distance of the half-ROI’s centroid from the axis of revolution.

4.14. Single-cell DNA (scDNA) sequencing experiment

We assessed the use of Mag-BAGs in a single-cell DNA (scDNA) sequencing experiment. We compared the performance to bead-free BAGs evaluated in a previous paper[9] using human cells from normal skin fibroblast (SKN-1) and a breast cancer cell line (SK-BR-3). The bead-free BAGs (“no mag beads” condition) and Mag-BAGs (“mag beads” condition) are generated and processed separately but concurrently by the same operator.

4.14.1. Sample preparation, droplet generation, and BAG release.

1 million total cells, from a mixture of two cell sources, were suspended in a 1mL solution containing 850 μL aqueous phase 1 (0.05% (w/v) BSA in 1X PBS) and 150 μL OptiPrep (D1556, Sigma Aldrich) to reduce cell sedimentation in the syringe and inlet tubing. For this experiment only, the oil phase consisted of 5% (v/v) surfactant (008-Fluorosurfactant, RAN Biotechnologies, Beverly, MA, USA) in HFE-7500. For generating Mag-BAGs, a final bead concentration of 1.15mg/mL was used; 288 μL of the bead stock volume was washed 3 times in 1X PBS pH 7.4, then resuspended in the H2O volume component of aqueous phase 2. For this experiment, we encapsulated the cell suspension (aqueous phase 1) and aqueous phase 2 into microfluidic droplets using a Drop-seq[18] device (Nanoshift, Emeryville, CA, USA). We used flow rates of 650 μL/hr for both aqueous phases and 2,800 μL/hr for the oil phase. Droplets were collected for about 15 minutes into a 1.5mL microcentrifuge tube containing 300 μL mineral oil. We incubated the BAGs overnight at 37°C.

4.14.2. Linear extension

The BAG tubes were incubated at 95 °C for 12 minutes, followed by 55 °C for 30 minutes and room temperature for 10 minutes before the oil layers were removed. BAGs were successively washed with 6X SSC buffer and 1X NEBuffer 2 (B7002S, New England Biolabs), before being resuspended in a 1mL solution containing 830 μL H2O, 100 μL NEBuffer 2, 60 μL 10mM dNTP (11814362001, Sigma Aldrich), and 10 μL DNA Polymerase I (M0210M, New England Biolabs). The suspension was incubated at room temperature for 1.5 hours and then at 37°C for 30 minutes under rotation. The reaction was stopped using a series of STOP-X buffers, where X is the concentration of EDTA in mM; these buffers also contained 10 mM Tris pH 8.0, 0.1% Tween-20, and 0.1m KCl. After incubation, the BAG suspension was mixed with 5mL of STOP-25 for 2 minutes. We spun down the BAG suspension, removed the supernatant, resuspended in 1mL STOP-10, passed through a 150 μm cell strainer, and then transferred to a new 1.5mL microcentrifuge tube for each condition.

4.14.3. Exonuclease treatment

After linear extension, we used a 3’ exonuclease treatment to cleave nucleotides from unused linker TG primers. BAGs were successively washed with STOP-1 and 1X Exonuclease I buffer (B0293S, New England Biolabs), before being resuspended in an 800 μL solution containing 680 μL H2O, 80 μL 10X Exonuclease I buffer, and 40 μL Exonuclease I enzyme (M0293S, New England Biolabs). This suspension was incubated at 37°C for 1.5 hours under rotation. To stop the reaction, BAGs were washed twice in STOP-25, and once in STOP-10.

4.14.4. 3’ CATG overhang cutting

DNA was then cleaved using NLAIII restriction enzymes (R0125L, New England Biolabs) to generate 3’CATG overhangs. BAGs were washed succesively with STOP-1 and 1X rCutSmart buffer (B6004S, New England Biolabs). BAGs were resuspended in a 1mL solution containing 840mL H2O, 100 μL 10X rCutSmart buffer, and 60 μL NLAIII enzyme and incubated rotating at 37°C for 1.5 hours. BAGs were centrifuged at 800rcf, the supernatant was removed, and BAGs were resuspended in a 1mL solution containing 880 μL H2O, 100 μL 10X rCutSmart buffer, and 20 μL NLAIII enzyme, and then incubated at 37°C for 1.5 hours under rotation. After incubation, BAGs were washed twice with STOP-25 and once with STOP-10.

4.14.5. First split-and-pool

In this step, the UMI and the first BAG barcodes were added to the 3’ CATG overhangs through a ligation-and-extension reaction. First, BAGs were washed 2X in HBW buffer (10 mM Tris pH 8.0, 1 mM EDTA, and 0.1% v/v Tween-20). The BAG pellet was then resuspended in fresh HBW buffer to 350 μL total volume and mixed with 770 μL of 2X Quick Ligation Reaction Buffer (B2200S, New England Biolabs) and 110 μL 10mM dNTP. We then used an 8-head multi pipette to dispense 11 μL of this BAG suspension into each well of a 96-well PCR plate. Into each well, we added 4 μL of well-specific 10 μM LNA (4bp) primers, briefly centrifuged the plate, inverted and incubated for 5 minutes at room temperature, incubated for 5 minutes at 50°C, moved plates onto a 4°C cold plate and kept inverted for 10 minutes, and then rotated at 4 °C for 10 minutes. We prepared a solution on ice containing 220 μL H2O, 302.5 μL 2X Quick Ligation Reaction Buffer, 82.5 μL Klenow DNA polymerase (3’ -> 5’ exo-, M0212L, New England Biolabs), and 55 μL Quick Ligase (M2200S, New England Biolabs) and then used a multipipette to dispense 6 μL of that solution to each well of the PCR plate (on cold rack) and mixed each well via pipette aspiration. The PCR plates were incubated under rotation at 4°C for 10 minutes, then at 10 °C for 10 minutes, then at room temperature for 20 minutes, and finally at 37°C for 20 minutes. The reaction was stopped by adding 100 μL STOP-25 to each well and incubating for 5 minutes at room temperature. Next, the bead-free BAGs (“no mag beads” condition) were separated via centrifugation (900rcf for 2 minutes) while Mag-BAGs (“beads” condition) were separated for 10 minutes on a 96-well magnetic separation rack (12331D, Invitrogen). 85 μL of supernatant was removed from the top of each well via multipipette while carefully avoiding disruption of BAGs. The remaining volume in each well was then pooled into a solution basin preloaded with 5mL of STOP-25. BAGs were then transferred to a 15mL conical tube, washed 2X in STOP-10, and resuspended in 800 μL STOP-10.

4.14.6. Second split-and-pool

This step added a second BAG barcode to DNA strands via well-specific PCR primers, creating 962=9,216 unique barcodes to identify individual BAGs. BAGs were first washed 2X in HBW buffer. A universal primer/BAG suspension was prepared containing 10 μL of 100 μM TG primer (TGTGTTGGGTGTGTTT*G*G, Integrated DNA Technologies) and 880 μL of BAG suspension using H2O. To each well of a 96-well PCR plate, we dispensed 9 μL of this suspension and 1 μL of 10 μM well-specific barcode primers. On ice, we added 10 μL NEBNext Ultra II Q5 Master Mix (M0544S, New England Biolabs) to each well. Samples were amplified using the following thermocycle sequence: 95°C for 2min, 98°C for 15s, 12X of [98°C for 15s, 63°C for 1min, 65°C for 1min], 65°C for 5min, 4°C infinite. Finally, we purified the resulting amplified DNA using AMPure XP magnetic beads (A63881, Bechman Coulter, Pasadena, CA, USA).

4.14.7. Final sequencing library preparation

A final PCR reaction solution was prepared containing 20 μL NEBNext Ultra II Q5 Master Mix (M0544S, New England Biolabs), 1 μL of 10 μM “P5-TG” primer (AATGATACGGCGACCACCGAGATCTACAC GGAGATGTG TGTGTTGGGTGTGTTT*G*G, Illumina, San Diego, CA), and 1 μL of 10 μM N70x Nextera primer (FC-131–1096, Illumina) and added to the DNA samples after resuspension in H2O to 40 μL. Samples were amplified via PCR using the following thermocycling sequence: 95°C for 20s, 4X of [98°C for 15s, 62°C for 40s, 72°C for 40s], 3X of [98°C for 10s, 67°C for 20s, 72°C for 40s], 72°C for 4min, 4°C infinite. After PCR, the product was purified using AMPure XP magnetic beads and then sequenced using the Illumina platform with a custom Read 1 sequencing primer (GGAGATGTG TGTGTTGGGTGTGTTTGG).

4.15. Genomics data-analysis workflow

4.15.1. Mapping reads to human reference genome

The single-cell data analysis pipeline largely follows the methodology described in the previous paper.[9] We performed pair-end read mapping to the human reference genome hg19 using HISAT2. Reads with the same varietal tag (UMI), genomic mapping position, and cell barcode were identified as originating from the same template.

4.15.2. Identifying BAGs that contain cells

To determine which BAG tags represent cells rather than cellular debris or background diffusion, the read counts for each tag were sorted from highest to lowest, and the log of the read counts was plotted on the y-axis. This curve was smoothed using the loess function in R with the span parameter set to 0.1. The second-order finite central difference

δ2[f]x=fxk2fx+fx+k (4)

was computed for each point and the point with the minimum value was selected as the threshold read count above which BAG tags were deemed to represent a cell. For SKN-1 and SK-BR-3 BAGs, k was set to 30 and 100, respectively.

4.15.3. Cell-specific SNPs and cell-identity classification

Using whole-genome-sequencing (WGS) libraries of SKN-1 and SK-BR-3, we examined positions with a sequencing depth greater than 30 in both libraries, from which we searched for the cell-source-specific SNPs that satisfied the following rule: for one library, the alternative allele frequency exceeded 50% of the reference allele frequency, while for the other library, the alternative allele frequency was 0. In all, we identified 225,037 SK-BR-3-specific SNPs and 245,614 SKN-1-specific SNPs. These SNPs specific to each cell source were applied to the single-cell libraries.

4.15.4. Downsampling libraries for yield comparison

The Mag-BAG and BAG libraries had different numbers of cells and reads. To compare the efficiency of capture and tagging between libraries, we downsampled the BAG tags in the Mag-BAG library to the same number of tags as the BAG library. We then downsampled the reads in the BAG library to match the number of reads in the sampled tags in the Mag-BAG library. The two libraries used These downsampled data sets to compare reads and tags per BAG.

4.15.5. Comparing genomic insert lengths between the two libraries

To compare genomic insert length distributions in the two libraries, we sampled 350,000 read pairs from the bam files that were marked as primary mappings and proper pairs.

4.15.6. procedure for clustering and copy number profile comparison

For clustering based on copy number, we take 100 cells each from SKN-1 and SK-BR-3 from a single protocol. We use the known copy number profile from a previously done whole genome analysis of the cell lines for binning. We use the constant copy number segment boundaries of the SK-BR-3 as bin boundaries to get 64 bins. Even though the sizes of the bins vary greatly, genomic locations where the copy number changes in any cell will occur only near the bin boundaries. We assign reads to each bin in each BAG, and use a multinomial expectation maximization algorithm on unique read counts in bins. The details of the algorithm will be described in another manuscript at a later time. In each case, the clustering reproduces the identity determined by SNVs without assignment error. We compute the copy number profiles for the SK-BR-3 clusters in both protocols by normalizing the unique read counts in each bin by the average read counts in SKN-1 clusters in each protocol. We see only minor differences in the copy number profiles derived in this manner for SK-BR-3 from both protocols.

Supplementary Material

SI

Acknowledgements

We want to thank Dr. Phenix-Lan Quan for revising the manuscript. This work was supported by grants to M. Wigler from The Breast Cancer Research Foundation (BCRF-22-174) and the Simons Foundation, Life Sciences Founders Directed Giving-Research (519054), and an award from the National Science Foundation (NSF)-CBET (No. 1705578) to E. Brouzes.

Footnotes

Supporting Information

Supporting Information is available from the Wiley Online Library or from the author.

This paper is the accepted author manuscript and may not exactly replicate the final, authoritative version of the article. The final article will be available via its DOI: 10.1002/admt.202301155.

Contributor Information

Evan Lammertse, Department of Biomedical Engineering, Stony Brook University, Stony Brook, NY, 11794, USA.

Siran Li, Cold Spring Harbor Laboratories, Cold Spring Harbor, NY 11724, USA.

Jude Kendall, Cold Spring Harbor Laboratories, Cold Spring Harbor, NY 11724, USA.

Catherine Kim, Cold Spring Harbor Laboratories, Cold Spring Harbor, NY 11724, USA.

Patrick Morris, Cold Spring Harbor Laboratories, Cold Spring Harbor, NY 11724, USA.

Nissim Ranade, Cold Spring Harbor Laboratories, Cold Spring Harbor, NY 11724, USA.

Dan Levy, Cold Spring Harbor Laboratories, Cold Spring Harbor, NY 11724, USA.

Michael Wigler, Cold Spring Harbor Laboratories, Cold Spring Harbor, NY 11724, USA.

Eric Brouzes, Department of Biomedical Engineering, Stony Brook University, Stony Brook, NY, 11794, USA; Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794, USA.; Cancer Center, Stony Brook School of Medicine, Stony Brook, NY 11794, USA. Institute for Engineering Driven Medicine, Stony Brook University, Stony Brook, NY 11794, USA.

References

  • [1].Villani A-C, Satija R, Reynolds G, Sarkizova S, Shekhar K, Fletcher J, Griesbeck M, Butler A, Zheng S, Lazo S, Jardine L, Dixon D, Stephenson E, Nilsson E, Grundberg I, McDonald D, Filby A, Li W, Jager PLD, Rozenblatt-Rosen O, Lane AA, Haniffa M, Regev A, Hacohen N, Science, 2017,356, eaah4573. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [2].Navin N, Kendall J, Troge J, Andrews P, Rodgers L, McIndoo J, Cook K, Stepansky A, Levy D, Esposito D, Muthuswamy L, Krasnitz A, McCombie WR, Hicks J, Wigler M, Nature, 2011,472, 90–94. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [3].Clausell-Tormos J, Griffiths AD, Merten CA, Lab Chip, 2010,10, 1302–1307. [DOI] [PubMed] [Google Scholar]
  • [4].Brouzes E, Medkova M, Savenelli N, Marran D, Twardowski M, Hutchison JB, Rothberg JM, Link DR, Perrimon N, Samuels ML, Proc Natl Acad Sci U S A, 2009,106, 14195–14200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [5].Macosko Evan Z., Basu A, Satija R, Nemesh, Shekhar K, Goldman M, Tirosh I, Bialas Allison R., Kamitaki N, Martersteck Emily M., Trombetta John J., Weitz David A, Sanes Joshua R., Shalek Alex K., Regev A, McCarroll Steven A., Cell, 2015,161, 1202–1214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [6].Klein Allon M., Mazutis L, Akartuna I, Tallapragada N, Veres A, Li V, Peshkin L, Weitz David A., Kirschner Marc W, Cell, 2015,161, 1187–1201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [7].Zilionis R, Nainys J, Veres A, Savova V, Zemmour D, Klein AM, Mazutis L, Nat Protoc, 2017,12, 44–73. [DOI] [PubMed] [Google Scholar]
  • [8].Abate AR, Chen CH, Agresti JJ, Weitz DA, Lab Chip, 2009,9, 2628–2631. [DOI] [PubMed] [Google Scholar]
  • [9].Li S, Kendall J, Park S, Wang Z, Alexander J, Moffitt A, Ranade N, Danyko C, Gegenhuber B, Fischer S, Robinson BD, Lepor H, Tollkuhn J, Gillis J, Brouzes E, Krasnitz A, Levy D, Wigler M, Genome Res, 2020,30, 49–61. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [10].Kishore R, Reef Hardy W, Anderson VJ, Sanchez NA, Buoncristiani MR, Journal of Forensic Sciences, 2006,51, 1055–1061. [DOI] [PubMed] [Google Scholar]
  • [11].Ware SA, Desai N, Lopez M, Leach D, Zhang Y, Giordano L, Nouraie M, Picard M, Kaufman BA, J Biol Chem, 2020,295, 15677–15691. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Chranioti A, Aga E, Margari N, Kottaridi C, Pappas A, Panayiotides I, Karakitsos P, Infectious Diseases in Obstetrics and Gynecology, 2011,2011, 931281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Kim J-W, Utada AS, Fernández-Nieves A, Hu Z, Weitz DA, Angewandte Chemie International Edition, 2007,46, 1819–1822. [DOI] [PubMed] [Google Scholar]
  • [14].Hwang DK, Dendukuri D, Doyle PS, Lab Chip, 2008,8, 1640–1647. [DOI] [PubMed] [Google Scholar]
  • [15].Chen C-H, Abate AR, Lee D, Terentjev EM, Weitz DA, Adv Mater, 2009,21, 3201–3204. [Google Scholar]
  • [16].Holtze C, Rowat AC, Agresti JJ, Hutchison JB, Angile FE, Schmitz CH, Koster S, Duan H, Humphry KJ, Scanga RA, Johnson JS, Pisignano D, Weitz DA, Lab on a chip, 2008,8, 1632–1639. [DOI] [PubMed] [Google Scholar]
  • [17].Kim D, Paggi JM, Park C, Bennett C, Salzberg SL, Nat Biotechnol, 2019,37, 907–915. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [18].Habib N, Avraham-Davidi I, Basu A, Burks T, Shekhar K, Hofree M, Choudhury SR, Aguet F, Gelfand E, Ardlie K, Weitz DA, Rozenblatt-Rosen O, Zhang F, Regev A, Nature methods, 2017,14, 955–958. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [19].Duffy DC, McDonald JC, Schueller OJA, Whitesides GM, Anal. Chem, 1998,70, 4974–4984. [DOI] [PubMed] [Google Scholar]
  • [20].Sauzade M, Li L, Bakowski T, Strey HH, Brouzes E, Biomicrofluidics, 2020,14, 014116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [21].Scanga R, Chrastecka L, Mohammad R, Meadows A, Quan PL, Brouzes E, RSC advances, 2018,8, 12960–12974. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [22].Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez J-Y, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A, Nature methods, 2012,9, 676–682. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

SI

RESOURCES