Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Feb 18.
Published in final edited form as: Nat Methods. 2020 Jul 6;17(8):833–843. doi: 10.1038/s41592-020-0880-2

ZipSeq : Barcoding for Real-time Mapping of Single Cell Transcriptomes

Kenneth H Hu 1, John P Eichorst 1,2, Chris S McGinnis 3, David M Patterson 3, Eric D Chow 4, Kelly Kersten 1, Stephen C Jameson 5, Zev J Gartner 3, Arjun A Rao 1,2, Matthew F Krummel 1,2
PMCID: PMC7891292  NIHMSID: NIHMS1664825  PMID: 32632238

Abstract

Spatial transcriptomics seeks to integrate single-cell transcriptomic data within the 3-dimensional space of multicellular biology. Current methods use glass substrates pre-seeded with matrices of barcodes or fluorescence hybridization of a limited number of probes. We developed an alternative approach, called ‘ZipSeq’, that uses patterned illumination and photocaged oligonucleotides to serially print barcodes (Zipcodes) onto live cells within intact tissues, in real-time and with on-the-fly selection of patterns. Using ZipSeq, we mapped gene expression in three settings: in-vitro wound healing, live lymph node sections and in a live tumor microenvironment (TME). In all cases, we discovered new gene expression patterns associated with histological structures. In the TME, this demonstrated a trajectory of myeloid and T cell differentiation, from periphery inward. A combinatorial variation of ZipSeq efficiently scales in number of regions defined, providing a pathway for complete mapping of live tissues, subsequent to real-time imaging or perturbation.

INTRODUCTION:

Single cell RNA Sequencing (scRNA-Seq) combined with other multimodal analyses such as surface epitope labeling and repertoire analysis have revealed previously unappreciated heterogeneity within cell populations. This approach has been especially useful in immunology given the diversity of immune cell types and the microenvironments they experience. 1. Yet scRNA-Seq studies lose information on the spatial context where a given single cell transcriptome was localized 2. Conventional microscopy localizes cells and molecules in space, however, it is limited in the parameters it can detect. Even with high dimensional imaging techniques (MIBI, CODEX), probes must be selected a priori 3,4. In order to better understand specifically how cellular transcriptional heterogeneity is influenced by the local environment and vice versa in a discovery-based, unbiased approach, it becomes necessary to link high-dimensional scRNA-Seq data to the spatial dimensions and real-time phenotypical analyses that microscopy affords.

To couple conventional microscopy of live tissues with transcriptomics, we needed a means to demarcate multiple manually defined regions-of-interest (ROI’s) in real-time, in both mouse and human tissues. While excellent for some applications, we decided against grid-based approaches57, which average together gene expression within a region and are defined prior to imaging on fixed tissue sections. Instead, we sought to use existing scRNA-Seq workflows and to develop a method for ‘printing’ a DNA barcode onto live cells in a spatially defined manner, which can be read-out during sequencing 8. We accomplished this by initially coating a base DNA oligo onto cells in a tissue and –through photocaging—we could control hybridization of subsequent DNA strands in a light- and thus spatially-restricted manner 9,10. The resultant barcodes then provided a connection to those user defined regions.

RESULTS:

We generated a photo-uncaging system that allowed light-based printing of DNA barcodes onto the surface of cells. A double-stranded piece of DNA was attached to cells, either by a high affinity antibody (e.g. anti-CD45, for pan-immune cells) or via stable lipid insertion (e.g. lignoceric acid) into the membrane 11. This double stranded “anchor strand” contained a 16 bp overhang sequence (termed “01”) that is blocked at four sites along its length using 6-nitropiperonyloxylmethyl (NPOM) conjugated to thymidine, and thus is unable to participate in base-pairing 9. These photocages have been historically used to precisely control gene expression in optogenetics approaches 12. However, following local illumination with 365 nm light to release the cages, a readout oligonucleotide strand, termed a “Zipcode” or ‘ZC’ strand can hybridize to O1 (Fig. 1A). This annealed Zipcode terminates in a polyA sequence and an Illumina Read 2 Sequence which then allows for poly-dT based amplification during library construction 8. We first demonstrated light-based control of hybridization for two readout strands, by marking two separated populations of primary mouse CD4 and CD8 T cells and demonstrating excellent concordance of populations and the associated Zipcodes (Fig. S1AF). Cells with either a mix of Zipcodes or ‘incorrectly’ assigned could partially be explained by doublets detected computationally13 (Fig. S1F) combined with <100% cell purity following purification from lymphoid tissues. We note that de-hybridization and re-hybridization during potential tissue dissociation onto another cell represents a negligible effect (Fig. S1G, H). Additionally, we show that our workflow of UV illumination, oligonucleotide addition and washing does not adversely affect cell viability of mouse primary lymphocytes (Fig. S1IK)

Figure 1:

Figure 1:

(a) Schematic of oligonucleotide sequences and labeling moieties used in this paper. Both lipid and antibody are covalently conjugated to an ‘anchor’ sequence. Meanwhile a caged strand consisting of 4 photocaging groups on an overhang sequence linked to the reverse complement of the anchor strand can hybridize with the Ab or lipid DNA conjugate prior to labeling cells. Readout strand or ‘Zipcodes’ consist of a reverse complement sequence to the caged overhang sequence 1 or 2, followed by a partial Illumina Small RNA Read 2 sequence for downstream amplification. In addition, each Zipcode strand bears a 8bp barcode and a 28 polyA sequence for capture by poly dT primers during reverse transcription. (b) A microscope light path for simultaneous imaging and photo-uncaging of a sample. Spatially directed photo-uncaging is accomplished through directing light from a mercury arc-lamp onto a Mosaic DMD with an 800 × 600 micromirror array in plane with the sample. The sample can be simultaneously imaged using epi-fluorescent excitation. In the imaging software, a user defined ROI is converted into a mask which is reflected in the micromirror array. This spatially patterned light is then directed through the microscope and objective onto the sample. An example mask is shown with the resulting illumination pattern visualized on a mirrored slide. (c) Illustration of proof-of-concept demonstrating ability to spatially control hybridization of fluorescently labelled oligonucleotides. Briefly, a monolayer of primary mouse CD8 T cells was plated, labeled w/ the anti-CD45 Ab conjugated anchor strand (here without internal Cy5 modification) hybridized to a caged overhang (O1) strand. The leftmost square ROI of 300 μm size was illuminated with 365 nm light and the first oligonucleotide with single stranded O1’ conjugated to Cy5 was added and allowed to hybridize. Following wash steps, the process was repeated 2x at other positions with distinct O1’-fluorophore combinations (TAMRA and FAM) resulting in 3 defined regions. Scale bar = 200 μm (d) Schematic for workflow for labeling two regions of interest in a tissue section beginning with labeling of cells in a dish/in tissue with appropriate labeling moiety hybridized to a strand bearing the photocaged ssDNA overhang. Manual delineation of an ROI is followed by a pulse of UV illumination. Addition of a readout strand or Zipcode 1 allows for labeling of uncaged overhangs i.e. cells within ROI. Following washout of this Zipcode 1, the process is repeated for ROI #2. Cells are then harvested or dissociated from tissue, (FACS-)sorted for labelled cells, and passed to the 10X Chromium Controller for encapsulation and reverse transcription.

We used a digital micromirror device (DMD) to control the spatial pattern of 365 nm light in a plane conjugate with the image plane of a conventional Zeiss widefield microscope (Fig. 1B). The nature of the DMD allows individual pixels to be illuminated down to approximately the resolution limit of the objective used.

To demonstrate spatial printing of a collection of Zipcodes, we plated CD8 mouse T cells, labeled with caged anti-CD45 Ab-DNA anchor strands in a chamber slide and used complementary O1’ strands that were labeled with one of three fluorophores to visually track spatially-controlled annealing. Following three rounds of: (1. Patterned illumination 2. Zipcode addition 3. Washing), we obtained clear delineation of 3 regions, showing the linear scaling of resolution and number of rounds (Fig. 1C). Individual cells that floated through the imaging field are likely the cause of deviations from the expected labeling scheme. Taking these two proofs of concept together, a schematic of an idealized workflow is depicted in Fig. 1D. Note that an internal fluorophore such as Cy5 can be incorporated into the anchor sequence or a terminal Zipcode, allowing for enrichment of all Ab-oligonucleotide labeled cells (ZC1-n or ZC unlabeled which are sorted computationally) before encapsulation.

Defining Spatially-Segregated Motility and Cell Division Programs in Wound Healing

We applied ZipSeq to study spatially-defined transcriptional programs in a well defined model of wound healing in a monolayer of NIH/3T3 fibroblasts. 12 hours after ‘wounding’, we performed live-imaging of the wound edge (Fig. 2A). We then used the lignoceric acid conjugated photocaged anchor strand (Fig. 1A) to first label all cells and then illuminated a band 0-200 μm from the wound edge (“Front”) and added Zipcode 1. Following a brief incubation, we washed this oligonucleotide out, and illuminated another band 200-400 μm away from the wound edge (“Rear”) followed by Zipcode 2 addition (Fig. 2B). We then dissociated the monolayer into single cells and subjected the cells to a 10X scRNA-Seq pipeline.

Figure 2: ZipSeq Mapping of a Live Cell Monolayer Following Wounding.

Figure 2:

(a) Experimental setup. NIH/3T3 cells were plated 48 hours prior to imaging and allowed to reach confluency. 12 hours prior to imaging, a pipette tip was used to cleanly scrape away a band. The wound was imaged after 12 hours and ROI’s defined. Cells were then labeled with a lipid-oligo conjugate and then uncaged in a series of vertical bands alternating with Zipcode addition. Harvested cells were then passed into the modified 10X workflow. (b) Brightfield of wound 12 hours post-wounding with ROI’s overlaid. Two vertical bands of 200 μm width were drawn with increasing distance from the wound edge (0-200 μm) and (200-400 μm), referred to as ‘front’ and ‘rear’ respectively for illumination and zipcoding. Scale bar = 200 μm. (c) UMAP representation of Zipcode labeled cells with majority Zipcode identity overlaid. UMAP was calculated using the top 10 principal components. (n=160 cells, nFront=67, nRear=93) (Mean nUMI = 17.3k, Mean nGene = 3550, Cutoffs: nGene > 1000, percent.mito < 0.15) (d) UMAP representation of labeled cells with cluster overlay. Clusters calculated using Seurat’s built-in SNN based clustering algorithm. (e) Percentage of cells belonging to either Front or Rear populations within each cluster as defined in (d). (f) Volcano plot from differential expression analysis between Front and Rear cells. Colored points represent genes with an adjusted p-value (Bonferroni corrected) <0.05. (g) Feature plots overlaid on UMAP representation for 3 selected genes from DE analysis enriched in either Front (Acta2, Cav1, and Tagln) or Rear cells (Stmn1, Cenpa, H2afv). Color scale indicates Log-Normalized gene read counts. (h) Hits from DE analysis were passed through GO analysis. Significantly enriched biological processes shown with -Log(p-value) (Bonferroni corrected). Violin plots for (i) S phase and (j) G2M phase signature score for Front and Rear cells. Gene lists used are shown in Methods. (k) Assignment to cell cycle phase (S, G2M or G1 phase) based on the signature scores calculated in (i) and (j). Immunofluorescence imaging of fixed NIH/3T3 cells 12 hours post-wounding stained for either (l) ACTA2 or (m) STMN1. Fire LUT from ImageJ applied. Zoomed in insets shown for indicated regions. Scale bar = 100 μm. (N) Line plot with quantification of mean fluorescence intensity vs. distance from edge. IF images from (l and m) were first masked for pixels belonging to cells vs. background. Then in-cell pixels within vertical bands stepping away from the wound were averaged to create the indicated line-scan profiles with a smoothed fit applied.

During analysis, we removed the cells with low Zipcode counts (i.e. from neither illuminated zone) from the analysis and used the ratio of Zipcode 1 to 2 counts to determine whether a cell derived from the Front region versus the Rear. We then performed unsupervised, nearest neighbor clustering combined with UMAP dimensional reduction to visualize labeled cells in transcriptional space and identified three clusters (Fig. 2C). When we overlaid the ZipSeq region calls, a clear partitioning between Front and Rear cells was observed in transcriptional space (Fig. 2D), with cluster 2 highly enriched for Front cells and clusters 0 and 1 relatively enriched for Rear cells (Fig. 2E).

Differential expression analysis between Front and Rear cells (Fig. 2F) identified collections of genes associated with cell motility (e.g. Tagln, Acta2, and Cav1) enriched in Front cells and genes associated with cell division (e.g.H2afv, Cenpa and Stmn1) for Rear cells. We displayed the expression level of single genes from (Fig. 2F) onto the UMAP representation and found that each gene had slightly different patterns, indicating additional heterogeneity of the programs at this resolution (Fig. 2G). Gene ontology (GO) analysis of biological processes using significantly differentially expressed genes supported the broad segregation of cell division (Rear) and motility (Front) genes in these two regions (Fig. 2H). Furthermore, using gene signatures for S and G2M phases of the cell cycle14, we observed that the Rear population exhibited significantly higher signature scores for S and G2M phases (Fig. 2I and J). Cluster 1 was especially enriched for cells in S or G2M phase relative to cluster 0 and 2 which were largely in G0/G1 phase (Fig. 2K). Finally, we validated these findings, using antibodies detecting the proteins encoded by genes discovered by ZipSeq. Staining for ACTA2 was broadly enriched near the front (Fig. 2L), tapering after peaking around 100 μm from the wound edge (Fig. 2N) whereas staining for STMN1 demonstrated enrichment (Fig. 2M) approximately 300 μm from the front (Fig. 2N) consistent with the regions we defined.

Mapping Cortex vs. Medulla in a Live Lymph Node

We next applied ZipSeq to learn examine gene expression in entire mouse lymph nodes (LN), which have well-characterized spatial organization of cell types and gene-expression. We targeted two regions for ZipSeq discovery; the ‘Outer’ cortex extending from the tissue edge to the T-B margin and an ‘Inner’ region largely comprising the deep T cell zone and the medulla. We first stained live LN sections from an adult C57BI/6 mouse with fluorescently labeled anti-CD3ε and anti-B220 antibodies to delineate these regions by widefield microscopy (Fig. 3AB).

Figure 3:

Figure 3:

(a) Schematic of workflow for lymph node study. A lymph node was taken from a C57BI/6 mouse and sectioned. Following this, the section was stained for B220 and CD3ε along with the anti-CD45 Ab conjugated anchor strand (with internal Cy5 modification) hybridized to a caged strand. The section was imaged and ROI’s were illuminated prior to Zipcode 1 or 2 addition. Tissue was then dissociated and labeled live cells (Cy5+) were sorted for 10X encapsulation, (b) Composite stitched image of lymph node section used with B220 marked in green and CD3ε in red to delineate inner and outer regions used for Zipcoding in subsequent study. Scale bar = 400 μm. (c) tSNE dimensional reduction of sorted live, Cy5+ cells following 10X scRNA-Seq workflow. Assigned regional id based on ZC1:ZC2 counts overlaid. Immune cell populations were identified using known expression markers on Immgen. (n=7019 cells)(Mean nUMI = 4.1k, Mean nGene = 1057. Cutoffs used: nGene > 400, percent.mito < 0.15) (d) Regional distributions of major immune cell populations as identified in (c). Asterisks denote significance of enrichment with color indicating direction (Inner vs. Outer). ***: p<0.0001 **: p<0.001 and *: p<0.01 (e) Volcano plot showing differential gene expression analysis within the CD4 T cell subpopulation. Colored points represent genes with a p-value < 0.05 (Bonferroni adjusted). (f) Same as in e for the B cell population. (g) Immunofluorescence imaging of fixed lymph node section taken from a GFP-KLF2 reporter mouse. Section was stained for GFP, CD4 and B220. Dotted line represents demarcation between inner and outer regions used during quantification. Scale bar = 200 μm. Zoomed in insets show representative fields within inner and outer regions. (h) Mean fluorescence intensity of GFP-KLF2 signal intensity within CD4 T and B cells in IF image from (g) either inner or outer region. (n=84,152,51,42 for B outer, B inner, T outer, T inner respectively) (i) Mean fluorescence intensity of S100A6 signal within CD4 T and B cells found in outer and inner regions of the lymph node in IF image from (j). Bee swarm plots represent intensities of individual cells with bars denoting standard error. *** p-value < 0.0001, ** p-value < 0.001 * p-value < 0.05 by Wilcoxon’s rank sum test. (n=81,60,31,52 for T Inner, T outer, B Inner, B outer) (j) Fixed frozen lymph node section stained for CD4, B220, and S100A6. Zoomed-in insets show representative fields from outer and inner regions, Scale bar = 200 μm.

We also labeled immune cells within the section with an anti-CD45 based caged anchor strand that was also conjugated to Cy5 to allow for purification of immune cells prior to encapsulation. Using the B220:CD3ε signal as a guide, we first printed Zipcodes to the Outer region and, following washing, printed a second Zipcode to the inner region. The lymph node section was then dissociated and live labeled CD45+ cells were sorted out based on Cy5+ signal and encapsulated. Following merging of Zipcode (ZC) and cDNA counts, we separated populations based on dominance of ZC1 and ZC2 (Fig. S2). In parallel, we performed tSNE dimensional reduction and identified major immune cell populations using compiled expression signatures on Immgen. We then overlaid ZC identity onto the tSNE projection. On first inspection, this revealed clear enrichment of “Outer” cells in the B cell cluster and “Inner” cells for T cell populations (Fig. 3C and D), consistent with expectations and fluorescence imaging.

Using the ZC spatial information, we performed differential gene expression analysis within populations based on position. Within the CD4 T cell population, we identified Ccr7, Klf2 and Klf6 preferentially expressed in cells found in the Inner region and calcium binding proteins S100a6/4 and transcription factor Rora preferentially expressed in the Outer region (Fig. 3E). Performing a similar analysis in B cells identified Klf2 and Fosb in B cells found in the Inner vs. Outer region (Fig. 3F). Given the appearance of Klf2 as an “inner” enriched gene in both of these analyses, we validated its spatial expression pattern using immunofluorescent imaging of lymph node sections (Fig. 3G) from a previously generated KLF2-GFP reporter mouse15. Using B220 and CD4 staining to identify B and CD4 T cells respectively, we found that there was indeed more KLF2-GFP expression in both B and CD4 T cells in the interior (Fig. 3H). We similarly validated that T cells found near and in B cell follicles expressed more S100A6 than those found deeper in the T cell zone (Fig. 3I and J). We noted a more modest difference in S100A6 in B cells (Fig. 3I). We also observed this particular protein in cells comprising high endothelial venules (HEV), which had been excluded from Zipcode analysis due to our selection of an anti-CD45-based anchor.

Mapping immune cell differentiation in relation to position within tumors

To map variations in immune cell expression state within a live tumor, we derived cell lines from spontaneous tumors arising in the PyMT chOVA mouse breast cancer model, in which mCherry and ovalbumin (OVA) were co-expressed under the MMTV promoter, along with the Polyoma middle T antigen (PyMT) 16. We orthotopically injected these into the inguinal mammary fat pad of female C57Bl/6 mice and 10 days later, we adoptively transferred 2 million CD8 T cells isolated from an OT-I UBC-GFP mouse. We allowed the T cells to expand for 4 days in lymph nodes and traffic to tumors. We then harvested the tumors and sectioned them into ~150 μm thick slices (Fig. 4A) and observed dense clusters of GFP OTI T cells in the tumor margin with more dispersed cells in the interior (Fig. 4B).

Figure 4:

Figure 4:

(a) Schematic of experimental setup. 200k PyMT-ChOVA tumor cells were injected into the inguinal mammary fatpad of 8-week-old female C57Bl/6 mice. After 14 days, 2e6 CD8 T cells from a GFP OTI mouse were adoptively transferred. Following 4 more days, tumor was harvested, sectioned, imaged, and labelled with anti-CD45 Ab conjugated anchor strand (with internal Cy5 modification) hybridized to a caged strand. The section was imaged and ROI’s were illuminated prior to Zipcode 1 or 2 addition as denoted in (b). Imaging of 180 μm live tumor section used for scRNA-Seq in following experiments. Red channel denotes mCherry signal from PyMT-ChOVA tumor cells and green channel denotes adoptively transferred GFP OTI T cells. ROI’s used for Zipcode labeling shown overlaid. Scale bar = 400 μm. (c) UMAP representation of sorted live Cy5+ cells following 10X scRNA-Seq workflow. Cells below nUMI and ZC count threshold or above mitochondrial percentage threshold were filtered out. Assigned regional id based on ZC1:ZC2 counts overlaid. Large scale populations annotated based on similarity to known markers on Immgen. (n=4916 cells) (Mean nUMI = 22.5k, Mean nGene = 3939, Mean nZC = 6564, Cutoffs: nGene > 500, percent mitochondrial < 0.15) (d) Stacked bar charts denoting regional distributions of major immune cell populations from data in (c). (e) UMAP dimensional reduction on Monocyte/Macrophage population subset with regional identity as determined by ZC1:ZC2 ratio. (n=3144 cells) (Mean nUMI = 25k, Mean nGene = 4239, Mean nZC = 5268, Cutoffs: nGene = 500, percent mitochondrial = 0.1) (f) Feature plot of UMAP representation in (e) with normalized gene expression denoted by color scale for Ly6c2 as a marker for monocytes and C1qc as tumor-associated macrophage (TAM) marker. (g) UMAP representation of monocyte/macrophage population with state identity calculated from Monocle pseudotime analysis in (h) overlaid. Arrows represent differentiation trajectory from the monocyte population to the terminal macrophage populations. Each major state is annotated with a selected marker genes. (h) DDR Tree dimensional reduction of monocyte/macrophage population as computed by Monocle with state identities overlaid. Arrows denote increasing pseudotime with the Ly6CHi, Ccr2Hi state designated as the root. (i) DDR Tree dimensional reduction of monocyte/macrophage population plotted with regional localization overlaid. Pie charts represent regional distributions (marginal vs. interior) for each state. (j) UMAP dimensional reduction on cells within the T cell clusters expressing at least one GFP transcript. Regional identity as determined by ZC1:ZC2 ratio overlaid. (n=265 cells)(Mean nUMI = 24.7k, Mean nGene = 4083, Mean nZC = 12k, Cutoffs: nGene > 500, percent mitochondrial < 0.15) (k) UMAP representation with gene expression signature scores overlaid. Exhaustion vs. naive gene signature scores ere calculated for the Gfp+ T cell subpopulation (cells within lymphoid clusters with at least one Gfp transcript) and these scores overlaid on the UMAP representation. Violin plot represents this score distribution based on regional identity. Bottom row represent similar quantification of a terminal vs. stem-like exhausted signature score. (l) Volcano plot showing top differentially expressed genes in the Gfp+ T cell subpopulation based on regional identity. Colored points represent genes with a p-value < 0.05 (Bonferroni adjusted).

We defined “margin” vs. “interior” regions based on the GFP and Cherry signals and used anti-CD45-Cy5 labeled anchor strands and light-based to label them with Zipcodes 1 and 2, respectively (Fig. 4B). After dissociation of the tumors into a single cell suspension, we sorted out Cy5+ cells and encapsulated them for scRNA-Seq using our modified 10X scRNA-Seq workflow. Dimensional reduction and nearest neighbor clustering on RNA levels from these cells revealed clusters of T lymphocytes and monocytes/macrophages (Fig. 4C) and smaller populations of neutrophils, dendritic cells, and NK cells. Several of these populations displayed distinct regional distributions within the sample. For example, lymphocytes and neutrophils skewed towards the marginal region while macrophage populations were more frequently in the interior, matching observations made in a subcutaneous colorectal cancer tumor model using whole-volume imaging and tissue clearing 17 (Fig. 4D)

Sub-sampling only the monocyte/macrophage cluster (Fig. 4E) we found that that “margin“ cells appeared differentially enriched on the left hand side of the UMAP projection where prototypical monocyte genes like Ly6c2 were also enriched. Conversely “interior” cells were predominantly found on the right hand side where terminal tumor-associated macrophage (TAM) markers such as C1qc and Apoe were enriched (Fig. 4E and F) 18. Exploring this further, we generated a pseudotime trajectory using Monocle19 with the Ly6cHi, Ccr2Hi state as the root state (State 1) (Fig. 4H). We observed differentiation of several TAM states (States 2-4) from the root monocyte state as pseudotime advanced with graded changes in gene expression such as loss of Ly6c2 and Ccr2 expression and gain of other TAM defining markers (Fig. S3). These terminal TAM states could be defined by expression of marker genes (Fig. 4G and Fig. S4) consistent with previously described TAM markers20. When we overlaid regional localization onto our pseudotime trajectory, we observed the regional localization of cells shift from margin to interior as pseudotime progressed from State 1 to terminal states 2,3 and 4 (Fig.4I). The terminal states exhibited their own differences in regional localization with State 2 more marginal vs. States 3/4 (Fig.4I) and differentially expressed genes based on localization (Fig.S5). One common feature amongst each ‘arm’ was an enrichment for monocyte markers Ly6c2, Ccr2 and Plac8 in marginal cells further supporting a gradient of differentiation concurrent with infiltration depth21.

Focusing on the antigen-specific (Gfp-expressing) OTI T cells, we also observed regional variation in the distribution of recently arrived effector CD8 T cells in transcriptional space (Fig. 4J). We observed a clear enrichment for genes previously associated with exhaustion (vs. naive) in interior localized antigen specific T cells (Fig. 4K)22,23. Given that T cell exhaustion represents a graded process, we also applied a terminal vs. stem-like exhaustion signature and observed a clear increase in terminal vs. stem-like exhaustion score for interior cells (Fig. 4K) 24. Similarly, when we performed differential expression analysis for marginal vs interior cells in this GFP+ subset, the most significant gene hits were enriched in those defining earlier differentiation (e.g. Tcf7 and Siamf6) in margin T cells25 and more committed exhaustion (e.g. Id2 and Pdcd1) in the interior T cells (Fig. 4L) 23,24. When we applied signatures for cell cycle phase as shown previously in Fig. 2, we found no significant difference in the proportion of actively cycling cells amongst margin vs. interior GFP-expressing T cells (data not shown).

Increased resolution in lymph node reveals spatial patterns of gene expression

To increase the number of regions that could be labeled, we devised two variations of ZipSeq. Instead of a terminating Zipcode sequence, we used DNA duplex strand bearing an orthogonal NPOM caged overhang (O2) sequence to effectively swap the potential binding site from O1 to O2 upon illumination in the first round. Thus, multiple regions can be defined at a time, through downstream addition of Zipcodes bearing either a complement to overhangs O1 or O2. This approach can theoretically be scaled up through the use of additional orthogonal overhang sequences, resulting in definition of 2N regions using N+1 rounds of illumination and oligonucleotide addition. We demonstrate our ability to print a series of strands using two fluorescently labeled oligonucleotides as seen in Fig. 5A to generate 4 distinct regions.

Figure 5:

Figure 5:

(a) Schematic of oligonucleotide design used for defining 4 regions by adding a second layer of caged oligonucleotides. A secondary oligonucleotide duplex bearing an orthogonal caged O2 overhang can hybridize to the uncaged O1 sequence. With a combination of Zipcode strands with either an overhang region O1’ or O2’, 4 distinct regions can be defined by 4 Zipcode species. Workflow illustrates the ability to define 4 such regions through fluorescence tagging of these oligonucleotide strands. Scale bar = 100 μm. (b) B cells extracted from C57Bl/6 mice were labeled with CFSE and CD8 T cells were extracted from a CD2-RFP mouse and adoptively transferred into a C57Bl/6 mouse. Lymph nodes were harvested, sectioned and labeled with anti-CD45 Ab conjugated anchor strand (with internal Cy5 modification) hybridized to a caged strand. Following imaging, 4 regions were defined with a unique Zipcode 1-4 as overlaid onto the micrograph. These 4 regions were generated using the sequence of illumination and oligonucleotide additions shown. Scale bar = 400 μm. (c) tSNE dimensional reduction of sorted live, Cy5+ cells from LN section shown in (b) with regional identity overlaid following 10X workflow. Major immune populations are annotated. (n=5489 cells) (Mean nUMI = 3.9k, mean nGene = 1273, Cutoffs: nGene > 400, percent.mito < 0.15) (d) Bar chart illustrating distribution of cells in each of the 4 regions for selected immune cell populations in (c). (e) Plots of mean scaled gene expression levels within the CD4 T cell cluster as a function of regional assignment for selected genes. (f) Plots of average scaled gene expression levels within the CD4 T cell cluster as a function of regional assignment for Klf2 and two similar and dissimilar genes as calculated by cross-correlation score. Genes with significantly different expression levels and a logFC threshold of 0.4 between at least one pair of regions were considered for analysis. Cross-correlation scores were calculated between the averaged scaled expression levels of these genes and the reference gene. (g) Similar analysis with S100a6 as reference. (n=171/262/141/49 cells) (h) Schematic of second design iteration. An anchoring moiety is conjugated to an anchor sequence which is hybridized to an oligonucleotide with a NPOM caged overhang sequence. Each additive Zipcode duplex block consists of a Zipcode strand with a complement to the overhang sequence, a universal hybridization sequence, an 8 nt barcode, and a 28 nt polyA sequence. Meanwhile, a strand with a caged overhang is hybridized to the Zipcode sequence through the universal hybridization sequence. In this way, Zipcode blocks are added on in a combinatorial manner, defining 2N populations based on presence or absence of a given Zipcode block. (i) Demonstration of combinatorial spatial barcoding of a field of cells in an exponentially scaling manner. Conjugate labeled CD8 T cell were plated and subjected to a 3X sequence of illumination patterns and Zipcode block additions bearing distinct fluorophores resulting in 8 regions with distinct fluorophore combinations. Scale bar = 50 μm.

Using this design, we sought to test our method for its ability to detect gradients of genes, such as those for chemokine receptors like CCR7, in the lymph node. We adoptively transferred B cells (CFSE-labeled) and CD8 T cells (RFP-labeled) into a C57Bl/6 mouse before harvesting and sectioning the inguinal LN which revealed T and B zones (Fig.5B). The sequence of illumination patterns and oligonucleotide additions to generate 4 concentric regions of labels (ZC1-4) is illustrated in Fig.5B. Following dissociation and sequencing, we observed 4 groupings of cells with dominant ZC counts for 1-4 while filtering out cells that were ambiguous with no clear ZC dominance(Fig. S6). When we overlaid these regional identities onto our tSNE representation, we observed a strong enrichment for regions 3,4 in the B cell cluster vs T cell clusters (Fig.5C). We noted that the degree of region 4 enrichment in B cells vs. T cells (>50% vs. 4%) (Fig.5D) was greatly increased compared to our 2-region lymph node study in Figure 3. In addition, when examining rarer immune cell subpopulations, we observed the natural killer cell population enriched specifically at the interface between T and B cell zones (region 3) as has been previously observed26. (Fig.5D).

Definition of 4 concentric regions should permit us to identify the existence of gradients of gene expression in space. As predicted, within T cell clusters, expression of Ccr7 and Klf2 steadily decrease moving outwards towards the B cell zone while S100a6 expression increases, matching our findings from Figure 3 and published results27(Fig. 5E). In contrast, Cxcl13, a chemokine very highly associated with B cells zones, was significantly expressed in region 4 alone (Fig. 5E). This type of analysis allowed us to compare patterns of gene expression across space using genes such as Klf2 and S100a6 as references, and calculating cross correlation scores with all other expressed genes in our dataset, identifying the most ‘similar’ and ‘dissimilar’ genes. For example, Klf2 shares similar profiles with Nkg7 and Actn1 (with which it shares a similar differentiation trajectory in vitro)28 while being most dissimilar to Rora and Nfkb1. Meanwhile S100a6 shares similarity to its frequent binding partner S100a4 as well as the costimulatory molecule Icos while being most dissimilar to Syngr2 and Mrpl14 (Fig. 5F,G).

This approach, while allowing for definition of exponentially increasing number of regions, also requires a similar scaling of distinct orthogonal caged sequences which could become cost-prohibitive, so we devised a second variation. Here, each coding segment consists of a Zipcode ‘block’ which is a duplex of a polyA, barcode, a universal hybridization region, and an overhang sequence O. This strand is pre-hybridized to a strand with the universal hybridization region and a caged overhang sequence O’ (Fig. 5H). This schematic necessitates synthesis of only a single caged sequence species and N distinct barcoded polyA strands yielding potentially 2N regions after N rounds of illumination and addition. Using an in-tube validation experiment, we were able to observe 4 separate populations of cells using two rounds of illumination and Zipcode addition (Fig. S7). Merging of cell type identity and Zipcode combination showed good agreement with the experimental scheme. To spatially visualize this approach, we used 3 separate illumination and addition steps with three distinct Zipcode blocks, each bearing a distinct fluorophore. This yielded 8 (23) distinct color combinations, each defining a grid position (Fig. 5I). Because the resolution of this method is theoretically diffraction limited, we repeated this using smaller regions, approximately 20 μm per side, demonstrating the capability to define areas on the order of cell neighborhoods(Fig. S8).

DISCUSSION:

Here we introduce ZipSeq, an approach that allows for on-demand barcoding of cells within defined regions during microscopy. We demonstrate that our approach is compatible with live tissue sections and with multiple cell and tissue types, depending on anchoring moiety. It precludes the need for genetically encoded photoactivatable proteins 29 so is applicable to human tissues and allows for definition of multiple regions at once. ZipSeq plugs into the commercially available 10X workflow 30, and is theoretically compatible with many other scRNA-Seq methodologies 3133 making its wider adoption feasible, requiring only caged oligonucleotides and a photo-patterning module.

Using this approach, we demonstrate the ability in both an in vitro wound healing model and ex-vivo tissue sections (lymph node and tumor) to assign single cell transcriptomes to regions defined concurrently with fluorescence imaging. In the wound healing model, our approach identified distinct transcriptional programs activated in fibroblasts as a function of distance from the wound edge with several targets validated through immunofluorescence. We found a migratory cell state enriched at the leading edge (0-200 μm) and a proliferative state enriched at the rear (200-400 μm). The generalized spatial segregation between migration and proliferation has been previously observed in multiple cell types such as epithelial cells and keratinocytes during wound healing 34,35 with overexpression of Acta2 and Serpine1 described at the leading edge in prior wound healing studies36,37. It will be informative to repeat this study at higher resolution at different timepoints to observe how these spatial patterns of expression might evolve or whether patterns exist in other axes or at other scales.

In the context of lymph nodes, this method reports spatially dependent gene expression validated by previous works including Klf2, Ccr7 and S100a6 expression 15,38,39. With increasing region number, this method permits identification of genes that map ‘similarly’ or ‘dissimilarly’ to a known gene over space, some of which are known, while others are not. Additionally, in the context of a tumor model, this method allows the progression of myeloid and T cell differentiation to be mapped to physical infiltration depth. The myeloid differentiation in particular is consistent with recruited monocytes receiving local cues that skew differentiation trajectories as they arrive in the tumor margins, as described previously in several tumor models 40,41.

Alongside macrophage differentiation, we also observed genes associated with T cell exhaustion upregulated in tumor specific CD8 T cells when comparing tumor interior with margins. Notably Tcf1, a major factor in maintenance of a stem-like exhausted phenotype, was enriched in marginal T cells vs. interior 24,42. Comparison with imaging data taken prior to barcoding suggests that the dispersed, deeper infiltrating antigen-specific T cells we observed (Fig. 5B) are further along the exhaustion pathway compared to the T cells at the edges. The mechanistic link between depth and commitment towards exhaustion bears further investigation. We also noted the enhanced expression of chemokines/receptors in interior vs. margin CD8 T cells associated with increased trafficking and infiltration of T cells into the tumor such as Ccl4/5 and receptor Ccr5 (Fig. 4L)4345. Our work highlights a number of previously unappreciated spatially variable gene expression patterns in the TME that require further high resolution transcriptional profiling approaches such as fluorescent in-situ hybridization (FISH) or CODEX to correlate with.

Many spatial transcriptomics approaches use prefabricated grids of barcoded poly-dT’s or barcoded beads bearing poly-dT’s to capture all transcripts within the grid position57. While these approaches offer excellent spatial resolution, it will be difficult to apply them directly to tissue following live imaging or perturbations. The relatively low read depth offered by several of these approaches (~1e4 reads per 150 μm grid square for Spatial Transcriptomics and 2e2 per 10 μm bead for SLIDE-Seq), would further complicate analysis of expression patterns in rarer cell types or those with low levels of RNA as their transcripts become diluted out during the capture step. Because ZipSeq plugs into droplet-based scRNA-Seq workflows, it has potential to tap into greater read depth per cell, generating true single cell transcriptomes without the need for deconvolution of a pool of transcripts derived from multiple cells. Another advantage of ZipSeq is the potential to easily integrate with other multimodal measurements such as concurrent surface epitope labeling using CITE-Seq, ATAC-seq or single cell immune repertoire sequencing8.

ZipSeq however faces limitations, most notably in spatial resolution. We propose that ZipSeq is currently most effective in questions based around microanatomical features observed during imaging that can guide definition of ROI’s for barcoding. To make a layered barcoding printing scheme achievable and cost-effective, we demonstrated the ability to add on layers of secondary caged oligonucleotides to increase the number of definable regions. With this increased spatial resolution, we can describe the segregation of different cell states in finer detail and detect genes with sharply defined spatial expression in an unbiased fashion.

While many of our studies focused on the immune compartment, alteration of anchoring moiety will potentially expand application to diverse multicellular models. By using a panel of oligonucleotide conjugated antibodies directed against various surface markers, spatial information gained from our approach could theoretically be combined with surface epitope profiling. We speculate that lipid anchored oligonucleotides may not be as efficient in binding to cells in tissue contexts given the observed sequestration of lipid modified oligos by protein. Thus we note lipid anchored oligonucleotides may best be suited for in vitro models. Combined with live imaging of cells prior to Zipcoding, time-dependent cell behaviors such as motility can be used to define regions of interest (e.g. transcriptional states of cells in low and high motility zones, zones defined with measures of hypoxia, etc.). In summary, ZipSeq represents a novel approach to mapping scRNA-Seq data from conventional scRNA-Seq workflows using on-demand light-controlled hybridization of DNA barcodes onto cells. We propose that ZipSeq will strengthen our capability to link spatial heterogeneity in multicellular systems to transcriptional heterogeneity of the constituent cell populations.

Methods:

Oligonucleotide list:

Amine modified anchor strand for conjugation to antibody: (Fig. 1, S1)
Amine-C6 linker - ATCGTTTTTTTTTTTTGATGTTGACGG
Amine-modified anchor strand with internal Cy5 fluorophore for conjugation to antibody (Fig. 3,4)
Amine-C6 linker - ATCCAG\iCy5\TTTTTTTTTTTTGATGTTGACGG
Caged Strand hybridized to anchor strand prior to labeling cells with overhang region O1 (NPOM modified thymidines highlighted in red): (Fig. 1, S1, 2,3,4,5)
5’-CGATCTGTGGTTGCTACCCGTCAACATCAAAAAAAAAAAA-3’
Blocking Strand (O1) for neutralizing carried over Zipcodes: (All figures)
5’-CGATCTGTGGTTGCTAC-3’
Second layer oligonucleotide (contains overhang region O1’) (Fig. 5)
GTAGCAACCACAGATCGCATGAGTCGAATCTCCCAC
Second layer oligonucleotide (contains overhang region O1’) with AF488 (Fig. 5)
AF488-TTGTAGCAACCACAGATCGCATGAGTCGAATCTCCCAC
Second layer NPOM caged oligonucleotide with overhang region O2 (NPOM modified thymidines highlighted in red): Fig. 5
5’-CCTAGATCATGCAGTTCCGTGGGAGATTCGACTCATG-3’
Zipcode 1 w/ complementary overhang O1’ (Fig. S1, 2, 3, 4, 5)
GTAGCAACCACAGATCGCACCCGAGAATTCCATGATGCAAAAAAAAAAAAAAAAAAAAAAAAAAAA
Zipcode 2 w/ complementary overhang O1’ (Fig. S1, 2, 3, 4, 5)
GTAGCAACCACAGATCGCACCCGAGAATTCCAAGCCATAAAAAAAAAAAAAAAAAAAAAAAAAAAA
Zipcode 3 w/ complementary overhang O2’ (Fig. 5)
GAACTGCATGATCTAGGCACCCGAGAATTCCATCAACGAAAAAAAAAAAAAAAAAAAAAAAAAAAA
Zipcode 4 w/ complementary overhang O2’ (Fig. 5)
GAACTGCATGATCTAGGCACCCGAGAATTCCATACGTGAAAAAAAAAAAAAAAAAAAAAAAAAAAA
Fluorescent oligonucleotide for flow readout with complementary overhang O1’:
GTAGCAACCACAGATCGTATA-Cy5
Adaptor Strands for lipid anchored oligonucleotide: (Fig. 2)
GATGCTTCACGGGATATTTTTTTTTTTTGATGTTGACGG
TATCCCGTGAAGCTTGAGTGGAATCCCTTGGCACCCGAGAATTCCA
Lignoceric acid conjugated anchor strand (Fig. 2)
Lignoceric acid - GTAACGATCCAGCTGTCACTTGGAATTCTCGGGTGCCAAGG
Co-anchor strand (Fig. 2)
AGTGACAGCTGGATCGTTAC - Palmitic acid
Design version 2 amine modified anchor strand (Fig. 5, S7)
Amine-C6 linker - TTTTTCACCCGAGAATTCCAC
Design version 2 universal caged strand (Fig. 5, S7)
CGATCTGTGGTTGCTACGTGGAATTCTCGGGTG
Design version 2 generic Zipcode strand (Fig. 5, S7)
GTAGCAACCACAGATCGCACCCGAGAATTCCACNNNNNNNAAAAAAAAAAAAAAAAAAAAAAAAAAAA
Zipcode 1 strand w/complementary overhang O1 w/ terminal Cy5 (Fig. 1, 5, S7)
GTAGCAACCACAGATCGCACCCGAGAATTCCACTGATGCAAAAAAAAAAAAAAAAAAAAAAAAAAAACy5
Zipcode 2 strand w/complementary overhang O1 w/ terminal TAMRA (Fig. 1, 5, S7)
GTAGCAACCACAGATCGCACCCGAGAATTCCACAGCCATAAAAAAAAAAAAAAAAAAAAAAAAAAAATAMRA
Zipcode 3 strand w/complementary overhang O1 w/ terminal FAM (Fig. 1, 5, S7)
GTAGCAACCACAGATCGCACCCGAGAATTCCACCTCTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAFAM

All oligonucleotides save for the caged strand were ordered from IDT with HPLC purification. NPOM caged strand was ordered as a custom synthesis from BioSynthesis. Details on properties of NPOM-caged dT’s can be found here: https://www.biosyn.com/oligonucleotideproduct/light-activated%20npom-caged-dt-modified-oligonucleotid.aspx

Reagents:

Nuclease Free Bovine Serum Albumin purchased from VWR (VWRV0332-25G)

Single stranded salmon sperm DNA purchased from Abcam (ab229278)

10X v2 and v3 kits were purchased from 10X Genomics. SPRI selection beads from Beckman Coulter (B23317). Collagenase I and IV were purchased from Worthington Biosciences (LS004194) and (LS004186) respectively. 2X Kapa HiFi HotStart Master Mix purchased from Roche (KK2601)

PBS (minus Calcium/Magnesium), RPMI 1640 and DMEM were ordered from Gibco.

Antibodies:

LEAF-purified anti-CD45 antibody (30F-11) (#103164), PE-anti-CD3ε (145-2C11) (#100307), and FITC-anti-CD45R(B220) (RA3-6B2) (#103205) purchased from Biolegend.

For IF, Abs for targets included anti-STMN1 (Abcam 52630), anti-S100A6 (Invitrogen PA5-16590), anti-ACTA22 (Sigma 1A4)

Conjugation of Anchor oligonucleotide w/ Antibody

The Thunder-Link Plus kit (Expedeon)(425-0300) was used to conjugate the amine modified anchor strand to an anti-mCD45 antibody (clone 30F-110) at a molar ratio of 1:5 Ab:oligo and allowed to conjugate overnight at room temperature prior to conjugate purification according to instructions.

Lipid conjugated oligonucleotide:

Synthesis of the lipid conjugated anchor oligo was performed as in11. In order to hybridize the caged oligonucleotide species, two adaptor sequences were also pre-hybridized with the anchor and caged strands (sequences shown above)

Mouse Strains:

Experiments were performed in 6-8 week old female C57BI/6J mice from JAX (#000664). For adoptive transfer, CD8 T cells were derived from a CD2-dsRed mouse (MGI# 5296821) and a OTI (C57BL/6-Tg(TcraTcrb)1100Mjb/J) (#003831) crossed with a UBC-GFP (C57BL/6-Tg(UBC-GFP)30Scha/J) (#004353).

In-tube validation of zipcoding:

The caged strand was pre-hybridized to the anti-CD45 Ab-anchor conjugate by adding a 1:1 molar ratio of caged to anchor strand and incubating at 37C for 15 minutes and allowing to cool to RT. CD4 and CD8 T cells were isolated from a mouse using CD4 and CD8 negative selection kits respectively (StemCell Technologies #19852 #19853) and were labelled with the caged strand hybridized to the Ab-oligo conjugate. In the first round, CD4 T cells were illuminated with 365 nm light and the first Zipcode strand added to both populations. Following a 4 min. incubation and 3 washes with S2 blocking strand (0.1 μM), the CD8 population was then illuminated and Zipcode 2 added and allowed to hybridize. Following a series of washes, the cells were pooled and encapsulated using a 10X v2 3’ kit with a target cell # of 4k.

For the in-tube validation experiment for Design 2 shown in Fig. S7, bone-marrow derived dendritic cells were isolated. LB27.4 cells obtained from ATCC (HB-99).

Microscopy:

Imaging and photo-uncaging was performed with a customized Zeiss Axiovert 200M inverted microscope fitted with a Mosaic digital micromirror device from Andor. Illumination for wide-field imaging was provided by a Lambda DG-4. For photo-uncaging, light from a mercury arc lamp was passed through a 365 nm bandpass filter then directed into the DMD. Imaging was performed with a 20X objective from Zeiss (Plan-NEOFLUAR). For imaging, we used a Photometrics Evolve 512. Metamorph (Molecular Devices) was used to control the microscope and illumination by the Mosaic DMD with a custom Visual Basics program (available on Github see link below) to provide a user interface for manual ROI delineation across multiple fields of view.

On-Scope validation of spatially controlled DNA hybridization:

Isolated primary mouse CD4 T cells were plated in a Labtek chamber slide coated with anti-CD3ε antibody to adhere cells. Cells were then labeled with Ab-oligo conjugate hybridized to the caged strand. Following 2x washes and blocking w/ ssDNA to prevent nonspecific DNA-well interactions, vertical bands of 200 μm width were illuminated with 365 nm light from a mercury arc-lamp through the Mosaic. A fluorescently labeled Zipcode strand was flowed into the well and incubated for 5 minutes for hybridization. Following 3x washes with media, the sequence was repeated 2x for 2 other fluorescently labeled Zipcode strands. The illuminated regions were then imaged.

For the combinatorial design shown in Fig. 5H and Fig. S11, Zipcode strands 1-3 were conjugated to fluorophores Cy5, TAMRA and FAM. These Zipcode strands were then hybridized to the complementary caged oligonucleotide strand to form 3 Zipcode blocks. Similar to the workflow above, a series of illumination, Zipcode addition, washing, and blocking generated 3 patterns of Zipcode hybridization overlaid, creating 8 distinct color combinations. The grids used in Fig. 5h were designed to generate 200 μm × 200 μm squares. In fig. S11, the grid squares were scaled down to push the limits of DMD resolution to roughly 20 μm × 20 μm squares.

Wound Healing study:

NIH/3T3 cells were ordered from ATCC (CRL-1658) and cultured in complete growth medium (DMEM+10%FCS+55mM BME+PSG). 2 days prior to labeling and scRNA-Seq, a monolayer of 40k cells was plated in a well of a 8-well LabTek chambered coverglass. Cells were allowed to settle and fill in the well until 12 hours prior to imaging and labeling.

A pipette tip was used to introduce a scratch around 0.8mm in width in the monolayer. Cells were washed once with fresh growth medium to remove debris or floater cells and incubated for a further 12 hours. Cells were then transferred to the microscope and the regions of interest we delineated. Growth medium was washed out and replaced with serum-free phenol red free DMEM with ssDNA to block nonspecific DNA binding in further steps. The anchor-caged strand lipid tag was added and allowed to sit for 10 minutes followed by the co-anchor strand. Following 3 washes, the desired ROI’s were illuminated using an 800 ms pulse of 365nm light.

The first Zipcode strand was added to the monolayer and allowed to hybridize for 5 minutes. Following 2 washes, a blocking strand was added at a lower concentration for 5 minutes to prevent residual Zipcode strand from binding in the following cycle. After 2 another 2 washes, the second ROI was illuminated and the steps repeated with the second Zipcode strand.

After the last series of washes, the medium was removed and Accutase (Stem Cell #07920) was added to detach the cells. After 5minutes of incubation at RT, cells were harvested and washed with cold PBS+0.04%BSA as recommended by 10X protocol. Encapsulation was performed for a target cell # of 8k using the v2 chemistry.

Lymph node study:

Inguinal lymph nodes were harvested from 8 week old C57BI/6 female mice and embedded while live in 2% agarose. Using a Leica Vibratome or Precisionary VF 310-0Z Vibratome, the lymph nodes were sectioned into 150 um thick slices and affixed to a LabTek chamber slide (Thermo Fisher #155409) using Vetbond (3M) applied to the agarose ‘rim’. Sections were then incubated with anti-B220 FITC and anti-CD3ε PE along with the anti-CD45 conjugated NPOM caged anchor strand bearing an internal Cy5 modification for 1 hour at 4C. Following 3 washes, the sections were imaged on the scope and ROI’s delineated and illuminated with pulses of 365 nm light with 1000 ms. duration. The first Zipcode was added at a 1 μM concentration in RPMI and allowed to incubate 10 minutes. Following 3 washes, the blocking strand was added at 0.25 μM concentration and incubated a further 5 minutes. Following a series of 3 washes, the process was repeated for the second Zipcode. Following a final blocking step, the section was mechanically disrupted and strained over a 100 μm nylon strainer and sorted for live, Cy5+ cells. Sorted cells were washed with cold PBS+0.04%BSA and encapsulated following 10X guidelines using a v2 3’ kit with a target cell # of 8k.

For the 4-region lymph node study, naïve CD8 T cells were purified from a hCD2-dsRed mouse and 4e6 were adoptively transferred into a 6 week old B6 mouse. Meanwhile, B cells were purified from a B6 mouse using an EasySep B cell negative selection kit (StemCell Technologies #19854) and labelled with CFSE before being transferred at 4e6 per mouse. 3 days following transfers, the mouse was sacrificed and the inguinal lymph nodes extracted for sectioning and study. Following sorting, cells were encapsulated as described above using a 10X v2 3’ kit with a target cell # of 8k

Tumor study:

For orthotopically injected PyMTChOVA models, the PyMTChOVA breast cancer cell line was generated from de novo mammary tumors in the PyMTChOVA breast cancer mouse model16. Briefly, mammary tumors were harvested in ice cold PBS and mechanically minced into small fragments. Cells and tissue fragments were cultured in DMEM + 10% FCS and added Penicillin, Streptomycin and Glutamine. After 7-10 days tissue fragments and debris were washed out with ice cold PBS and attached cells were allowed to grow to confluency. Cells were cultured in growth medium for an additional 3-5 passages to generate the established PyMTChOVA mammary tumor cell line. Tumor cell line was kept frozen and thawed directly from stock as needed prior to injection to avoid unnecessary passages. 200k cells were injected in Matrigel (Corning #356238) into the inguinal fat pad of 8-week-old female C57Bl/6 mice.

C57BL/6-Tg(UBC-GFP)30Scha/J mice were crossed to OT-1 mice to generate a GFP OT-1 mouse strain. Lymph nodes were harvested from a 6-week-old GFP OT-1 mouse and CD8 T cells were isolated using an EasySep mouse CD8 T cell negative selection kit (StemCell Technologies). 14 days post tumor injection, 5e6 CD8 T cells from a GFP OTI mouse were adoptively transferred through retro-orbital injection. 4 days following, the mouse was sacrificed, and the tumor harvested for sectioning on a Leica Vibratome into 150 μm thick slices. As before in the lymph node study, the tissue was embedded live into 2% agarose for sectioning. Sections were blocked with ssDNA and BSA in RPMI for 30’ at 4C then stained with the anti-CD45 Ab conjugated to NPOM caged anchor strand with internal Cy5 for 1 hr at 4C. Following washes, sections were affixed to an Ibidi μ-Slide 8 well using Vetbond. Sections were then imaged with the desired channels, and spatially Zipcoded as described above using pulses of 365 nm light of 1000 ms. duration. Following the final block and wash step, tissue sections were diced finely and incubated with a Collagenase I and IV blend in RPMI and incubated for 30’ at RT. The resulting suspension was mechanically agitated by pipetting and then strained on a 100 μm strainer. As before, live Cy5+ cells were sorted out on a FACSAria II, washed in PBS + 0.04% BSA and then encapsulated following 10X specifications for v3 3’ chemistry with target cell number of 8k.

Library Construction:

cDNA library construction following GEM formation was performed as directed by 10X using v2/3 3’ chemistry depending on experiment. Following the published CITE-Seq protocol, an additive primer (partial Read 2 small RNA) was spiked into the cDNA amplification reaction. During the post-cDNA amplification SPRI cleanup step, given that Zipcode reads are significantly shorter (<200 bp), these reads were separated from cDNA reads by decanting the supernatant. cDNA reads bound to SPRI beads were processed as recommended in the 10X use guide. Meanwhile the supernatant containing Zipcode reads was saved and underwent two successive 3X SPRI cleanup steps.. This library was then amplified using primers from CITE-Seq protocol8. Following fragment analysis on the BioAnalyzer and library quantification by qPCR, the Zipcode library was mixed with the associated cDNA library at a 1:10 molar ratio and sequenced on either Illumina HiSeq Rapid Run mode (all studies save for Fig. 4) or NovaSeq SP (Fig.4 studies) using 10X recommended sequencing parameters based on kit version (v2 vs. v3)

Processing of raw sequencing reads:

Raw read files were processed using CellRanger bcl2fastq to separate cDNA and ZC libraries. cDNA libraries were processed using standard CellRanger count function. For PyMT tumor studies that made use of alignment to GFP transgene, the 10X provided mm10 reference was modified through addition of artificial genes made up of fluorophore coding sequences. ZC libraries were counted using a Python script made available by CITE-Seq using a whitelist of cells provided by the CellRanger count function determined by a minimum nUMI threshold8. The two outputs were then merged in Seurat for further analysis.

Data Analysis:

Cells with a high mitochondrial read count % (above 10% assumed to be dead or dying cells) were filtered out. Cells with low counts for cDNA were filtered out based on the presence of a local peak at the low end of the distribution. Read counts were normalized using log-normalization, scaled and centered and nUMI and mitochondrial percentage regressed out. PCA was performed and the top 10 of these PC’s were used to inform the dimensional reduction by Seurat built-in tSNE or UMAP (using the Python implementation of umap-learn package). Meanwhile, nearest neighbor clustering using these PC’s was performed using Seurat’s built-in FindClusters function at a specified resolution of 0.8. In order to call Zipcode identities, Zipcode counts were normalized and the ratios of these normalized counts used to gate cells as belonging to one identity or another with ambiguous cells (between gates) filtered from analysis. For the 4-region lymph node study, k-means clustering on normalized Zipcode counts was used to generate 5 clusters corresponding to ZC1-4 dominance along with a centrally localized ambiguous population.

For differential gene expression analysis, we used Seurat’s built-in FindMarkers function which implements testing based on the Wilcoxon rank sum test along with a Bonferroni correction to adjust p-values. The testing was restricted to genes expressed in at least 10% of cells.

For Monocle analysis, we directly imported the Seurat object of interest into monocle and used the clusters pre-defined in Seurat to get a list of differentially expressed genes between clusters19. The top 800 genes were then used in the DDRTree dimensional reduction and pseudotime ordering.

For signature analysis, curated lists of genes from literature were passed into Single Cell Signature Explorer to generate scores 46. Sources for gene lists used for signature scores derived from:

  • S and G2M phase: from built-in gene lists in Seurat

  • Exhaustion vs. Naïve: 23

  • Terminal vs. stem-like exhaustion: 24

For analyzing spatial gradient profiles of genes in the 4-region lymph node study, we used the mean scaled and normalized gene expression for a given gene in each region. In order to find the most similar and dissimilar gene profiles, we first filtered out genes expressed in <10% of cells for all regions. We then calculated a cross-correlation score compared to a reference gene and ranked these scores from lowest correlation (−1) to highest correlation (1).

CC=i(xix_)(yiy_)var(x)*var(y)

For gene x and y where i represents regions 1-4.

For doublet detection, DoubletFinder v1 was used as described in (C.McGinnis et al. 2019 Cell Systems) https://github.com/chris-mcginnis-ucsf/DoubletFinder. pK parameter was determined using a parameter sweep as described with the nExp calculated from the number of cells belonging to each cluster determined by Seurat. Immunofluorescence: For IF study on KLF2-GFP reporter mouse lymph nodes, sections were prepared, stained and imaged from PE-primed mouse on d14 as in 15. For IF of S100a6, lymph nodes from 8 week old C57BI/6 mice were embedded in optimal cutting temperature compound (OCT)(Sakura #4583) and cryosectioned into 10 μm thick slices. Slices were fixed with 4% PFA and permeabilized by TritonX-100. Following blocking with 5% goat serum, primary antibodies were added and incubated at 4C overnight. Following washes, slices were then incubated with secondary Abs for 1 hr at RT, washed and then incubated for 5 minutes at RT with DAPI. Following another wash, slices were mounted with Vectashield (Vector Labs H-1900)and a coverslip, then imaged.

For IF studies on wound healing monolayers, NIH/3T3 fibroblasts were prepared in LabTek chambered slides as before and 12 hours post-wounding were fixed with 4% PFA, permeabilized with TritonX-100 and blocked with 5% goat serum.

Quantification of LN immunofluorescence: Analysis was performed within Fiji. Depending on the cell type of interest (CD4 T or B cells), CD4 or B220 signal was used to identify cells within the tissue. Identified cells were selected randomly based on this signal and MFI of target channel was calculated within this mask. Wilcoxon rank-sum test (two-tailed) used for comparison of MFI’s between populations.

For wound healing, binary masks for cell area were generated through thresholding of images and applied to target channel. Columns of 10 pixel width were taken progressively from wound edge and pixels in the mask were averaged to generate a column fluorescence intensity average.

Statistical Testing:

Wilcoxon rank-sum test (two-tailed) used for comparison of MFI’s between populations for Immunofluorescence data. In order to see if regional enrichment for a given immune cell population from scRNA-Seq data was significant, we used a hypergeometric distribution to determine the probability of drawing a regional composition with equal to or greater enrichment at random. This probability was directly reported as the p-statistic.

Data Availability:

For all scRNA-Seq studies described here, transcript counts as determined by CellRanger count function as well as raw Zipcode fastq’s/counts as well as the modified genome for transgene alignment during CellRanger count can be found on GEO under accession#: GSE145502

Code Availability:

Visual Basics code for custom Metamorph User Program for delineation of multi-FOV spanning ROI’s for Mosaic illumination can be found on Github:https://github.com/BIDCatUCSF/VB-Plugin-for-Patterned-Illumination

Python script used to generate ZC counts from fastq available from CITE-Seq8 (https://hoohm.github.io/CITE-seq-Count/)

Supplementary Material

Supplementary Figure 1

Supplementary Figure 1: In-tube validation that photo-uncaging combined with Zipcode hybridization can distinguish cell populations. Purified CD4 and CD8 T mouse T cells were labeled with a caged anti-CD45 Ab-DNA conjugate (without internal fluorophore modification) and then spatially confined to separate tubes. ‘CD4 T cells’ were uniquely illuminated prior to addition of the first Zipcode (ZC1) to both tubes, followed by a washing step. ‘CD8 T cells’ were then illuminated and a second Zipcode (ZC2) was added to both tubes followed by washing. The contents of both tubes were pooled and introduced into the 10X scRNA-Seq pipeline. Nearest neighbor clustering of the populations sorted CD4 and CD8 T cell signatures from each other. We observed good (95%) correlation between dominant Zipcode identity and assignment to CD4 or CD8 clusters. We believe this error rate stems from the fact that the starting isolated populations were also approximately 95% pure by flow cytometry. (a) Flow cytometry plots of purified ‘CD4’ (population 1) and ‘CD8’ (population 2) populations used for the following study. CD4 and CD8 staining were used to define single positive cells with the % of cells in the single positive gate displayed. (b) Workflow for validation study. Population 1 was photo-uncaged w/ 365 nm light illumination prior to addition of Zipcode 1 while population 2 was illuminated prior to Zipcode 2 addition as a simulation of the workflow in tissue. (c) Histogram of (ZC1−ZC2)/(ZC1+ZC2) showing the bimodal distribution of cells with a ZC1 and ZC2 majority population as well as a small population centered about 0, suggesting presence of mixed doublets. Line at 0 denotes demarcation of cells into either ZC1 or ZC2 dominant population. (d) UMAP dimensional reduction of live cells with dominant Zipcode ID determined from (c) overlaid (blue: ZC1, red: ZC2) (n=1107) (nUMI = 11k, nGene = 2442, Cutoffs: nGene > 500 percent.mito < 0.15) (e) Overlay of normalized gene expression levels for selected genes that define CD4 and CD8 T cells populations. (f) Proportion of cells assigned to either ZC1 or ZC2 dominant populations as shown in (c) belonging to either cluster 1 or 2. Doublets were computationally excluded and the proportions following are also displayed. Mixing of Zipcodes during population pooling/tissue dissociation quantified. Briefly, two populations of mouse lymphocytes were labeled with Zipcode strand conjugated to TAMRA or FAM fluorophores. Cells were then pooled in media conditions mimicking tissue dissociation steps and incubated at RT for the times denoted before flow cytometric analysis. (g) Histograms of TAMRA signal denoting the two populations pooled as well as quantification of gMFI over time of these two starting populations. (h) Similarly for FAM signal. Primary mouse CD4 lymphocytes were isolated from mouse lymph nodes and labeled with anti-CD45 Ab-DNA conjugate hybridized to a fluorescently labeled Zipcode strand. Cells were then subjected to 0/1/2/3 rounds of UV illuminations, Zipcode additions and wash steps meant to mimic the workflow described for sequential Zipcode labeling of cells. Additionally, a group of cells was left untouched at 4C during this process. Following this process, cells were labeled with Zombie NIR Live/Dead stain as well as DAPI to assay cell viability. (i) Flow cytometric gating strategy showing how viability was determined from a Zombie NIR and DAPI population. (j) Fraction of singlet cells determined to be viable for cells undergoing 0 to 3 rounds of illumination and labeling. ‘4c’ denotes untouched sample sitting on ice (k) TAMRA gMFI of viable cell populations for the conditions shown.

Supplementary Figure 3

Supplementary Figure 3: Heatmap of gene expression (color) over pseudotime (columns) for most significantly branch-dependent genes (rows) as determined by BEAM analysis. Genes are hierarchically clustered to illustrate genes that share expression patterns across branch point. (a) For the first branch point (from state 1 to either state 2 or 3/4) and (b) for second branch point differentiating states 3 and 4.

Supplementary Figure 4

Supplementary Figure 4: Feature plots of monocyte/macrophage population for selected genes that define terminal states 2-4. (a) UMAP dimensional reduction of monocyte/macrophage population with state identities overlaid. (b) Genes that are expressed by both states 2 and 3 over state 4. Despite the differentiation trajectory, States 2 and 3 shared overexpression of Arg1 and angiogenesis factors (Lgals3, Vegfa) while under-expressing MHCII (H2-Aa and H2-Ab1) compared to State 4. (c) State 2 diverged from State 3 through expression of C3ar1, Ptgs2, Cxcl2, Fcgr1 (d) Genes overexpressed in state 3. State 3 was defined by overexpression of genes such as Nos2, Timp2, and Mmp12. (e) Genes overexpressed in state 4. State 4 was defined by expression of H2-Aa, C1qc/a, Apoe and Cx3cr1. Color scale indicates log-normalized transcript counts.

Supplementary Figure 2

Supplementary Figure 2: (a) Plot of Zipcode counts for Zipcode 1 and 2 on a logarithmic scale for cells detected using Cell Ranger defined cutoff. Regional calls were made based on dominance of ZC1 or ZC2 using k-means clustering to identify three groups. Cells within the cluster termed as Outer (Blue) had relatively more ZC1 counts and inversely, cells within the Inner (Red) cluster were dominated by ZC2 counts. Cells in-between were determined to be ‘ambiguous’ and excluded from downstream analysis.

Supplementary Figure 5

Supplementary Figure 5: (a-c) Differential Gene expression analysis between margin and interior cells for each of the terminal macrophage states (2-4) defined through Monocle analysis as shown in Figure 4G. Volcano plots show genes with p-value (Bonferroni corrected) > 0.05 colored blue for margin enriched and red for interior enriched.

Supplementary Figure 6

Supplementary Figure 6: (a) Raw counts for Zipcodes 1-4 from live cells as defined by CellRanger were normalized to the total Zipcode count for each cell (i.e. % of ZC). Principal components were calculated using these 4 parameters. Plots show PC1 vs PC2 with the normalized ZC count for 1-4 overlaid. (b) k-means clustering was used to identify 5 clusters based on the normalized ZC counts in order to identify ZC1-4 dominant populations. Color overlays correspond to the regional call used in Figure 5.

Supplementary Figure 8

Supplementary Figure 8: 8 region Zipcode delineation using combinatorial design with 20-micron resolution. (a) Microscope image of field of mouse T cells following 3 rounds of illumination w/ patterns shown and addition of three distinct Zipcode species with conjugated fluorophores. Experimental setup was similar to Figure 5I. Each grid square is approximately 20 × 20 μm. (b) Quantification of mean fluorescence intensity for each channel (Zipcode) on cell surfaces. Briefly, for each grid square, cell membranes were defined using the union of masks generated by thresholding in each channel. Mean channel intensity was calculated for pixels within this combined mask and background intensity for each channel was subtracted. Bar charts are shown normalized to maximal MFI in all 8 grid squares. Scale bar = 20 μm.

Supplementary Figure 7

Supplementary Figure 7: In-tube validation of combinatorial Zipcode labeling strategy (a) Isolated primary mouse immune cells of the designated types were labeled with the caged anti-CD45 anchor conjugate. Each population was then subjected to the specified series of illuminations and Zipcode additions with Zipcodes 1 and 2 used to determine identity and Zipcode 3 which was added onto all cells as a reference for normalization during analysis. Cell were then pooled and run through the modified 10X pipeline in order to recover both Zipcode and cDNA single cell libraries. (BMDC: Bone Marrow Derived Dendritic Cell, BMDM: Bone Marrow Derived Macrophages) (b) TSNE Plot showing major immune cell populations annotated. (n=6778 cells) (nUMI = 11.3k, nGene = 2633, Cutoffs: nGene = 500, percent.mito = 0.15) (c) Same tSNE plot with combinatorial Zipcode id overlaid. Black dots represent ambiguous Zipcode calls. (d) Plot of log-transformed, normalized Zipcode counts for ZC1 vs ZC2 (normalized to total ZC1-3 count) with Zipcode combination id’s overlaid. Clusters were determined using density based clustering algorithm DBSCAN on normalized ZC1 and ZC2 counts. (e) Bar charts showing proportion of cells with a particular Zipcode combination for a given cell type before and after doublet removal. Doublets were determined using DoubletFinder (ndoublet=419). Colors consistent with those used in (b).

ACKNOWLEDGEMENTS

We thank the Biological Imaging Development Center at UCSF for help with microscopy data collection and instrumentation. We also thank the Parnassus Flow Cytometry Core for flow cytometry instrumentation and the Institute for Human Genomics at UCSF for sequencing and bioinformatics support. In addition, we thank the Computational Biology and Informatics (CBI) core at the UCSF Helen Diller Family Comprehensive Cancer Center (HDFCCC) for computing resources. This work was supported from NIH/NCI grants P30DK063720 and 1R01CA197363, by the Parker Institute for Cancer Immunotherapy (PICI) as well as NIH T32 training grant (5T32CA177555-02)

Footnotes

COMPETING FINANCIAL INTERESTS

K.H.H. and M.F.K. are listed on a patent application regarding the ZipSeq approach.

References

  • 1.Galli SJ, Borregaard N, Wynn TA. Phenotypic and functional plasticity of cells of innate immunity: Macrophages, mast cells and neutrophils. Nat Immunol. 2011;12(11):1035–1044. doi: 10.1038/ni.2109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Stegle O, Teichmann SA, Marioni JC. Computational and analytical challenges in single-cell transcriptomics. Nat Rev Genet. 2015;16(3):133–145. doi: 10.1038/nrg3833 [DOI] [PubMed] [Google Scholar]
  • 3.Goltsev Y, Samusik N, Kennedy-Darling J, et al. Deep Profiling of Mouse Splenic Architecture with CODEX Multiplexed Imaging. Cell. 2018;174(4):968–981.e15. doi: 10.1016/j.cell.2018.07.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Angelo M, Bendall SC, Finck R, et al. Multiplexed ion beam imaging of human breast tumors. Nat Med. 2014;20(4):436–442. doi: 10.1038/nm.3488 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Ståhl PL, Salmén F, Vickovic S, et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science (80- ). 2016;353(6294):78–82. doi: 10.1126/science.aaf2403 [DOI] [PubMed] [Google Scholar]
  • 6.Vickovic S, Eraslan G, Salmén F, et al. High-definition spatial transcriptomics for in situ tissue profiling. Nat Methods. 2019;16(10):987–990. doi: 10.1038/s41592-019-0548-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Rodriques SG, Stickels RR, Goeva A, et al. Slide-seq: A scalable technology for measuring genome-wide expression at high spatial resolution. Science (80- ). 2019;363(6434):1463–1467. doi: 10.1126/science.aaw1219 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Stoeckius M, Hafemeister C, Stephenson W, et al. Simultaneous epitope and transcriptome measurement in single cells. Nat Methods. 2017;14(9):865–868. doi: 10.1038/nmeth.4380 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Govan JM, Young DD, Lusic H, Liu Q, Lively MO, Deiters A. Optochemical control of RNA interference in mammalian cells. Nucleic Acids Res. 2013;41(22):10518–10528. doi: 10.1093/nar/gkt806 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Ando H, Furuta T, Tsien RY, Okamoto H. Photo-mediated gene activation using caged RNA/DNA in zebrafish embryos. Nat Genet. 2001;28(4):317–325. doi: 10.1038/ng583 [DOI] [PubMed] [Google Scholar]
  • 11.McGinnis CS, Patterson DM, Winkler J, et al. MULTI-seq: sample multiplexing for single-cell RNA sequencing using lipid-tagged indices. Nat Methods. 2019;16(7):619–626. doi: 10.1038/s41592-019-0433-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Deiters A Light activation as a method of regulating and studying gene expression. Curr Opin Chem Biol. 2009;13(5–6):678–686. doi: 10.1016/j.cbpa.2009.09.026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.McGinnis CS, Murrow LM, Gartner ZJ. DoubletFinder: Doublet Detection in Single-Cell RNA Sequencing Data Using Artificial Nearest Neighbors. Cell Syst. 2019;8(4):329–337.e4. doi: 10.1016/j.cels.2019.03.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Kowalczyk MS, Tirosh I, Heckl D, et al. Single-cell RNA-seq reveals changes in cell cycle and differentiation programs upon aging of hematopoietic stem cells. Genome Res. 2015;25(12):1860–1872. doi: 10.1101/gr.192237.115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Lee JY, Skon CN, Lee YJ, et al. The Transcription Factor KLF2 Restrains CD4+ T Follicular Helper Cell Differentiation. Immunity. 2015;42(2):252–264. doi: 10.1016/j.immuni.2015.01.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Engelhardt JJ, Boldajipour B, Beemiller P, et al. Marginating Dendritic Cells of the Tumor Microenvironment Cross-Present Tumor Antigens and Stably Engage Tumor-Specific T Cells. Cancer Cell. 2012;21(3):402–417. doi: 10.1016/j.ccr.2012.01.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Stoltzfus CR, Filipek J, Gern BH, et al. CytoMAP: a spatial analysis toolbox reveals features of myeloid cell organization in lymphoid tissues. bioRxiv. 2019:769877. doi: 10.1101/769877 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Afik R, Zigmond E, Vugman M, et al. Tumor macrophages are pivotal constructors of tumor collagenous matrix. J Exp Med. 2016;213(11):2315–2331. doi: 10.1084/jem.20151193 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Trapnell C, Cacchiarelli D, Grimsby J, et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat Biotechnol. 2014;32(4):381–386. doi: 10.1038/nbt.2859 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Yang M, McKay D, Pollard JW, Lewis CE. Diverse functions of macrophages in different tumor microenvironments. Cancer Res. 2018;78(19):5492–5503. doi: 10.1158/0008-5472.CAN-18-1367 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Blecher-Gonen R, Bost P, Hilligan KL, et al. Single-Cell Analysis of Diverse Pathogen Responses Defines a Molecular Roadmap for Generating Antigen-Specific Immunity. Cell Syst. 2019;8(2):109–121.e6. doi: 10.1016/j.cels.2019.01.001 [DOI] [PubMed] [Google Scholar]
  • 22.Maimela NR, Liu S, Zhang Y. Fates of CD8+ T cells in Tumor Microenvironment. Comput Struct Biotechnol J. 2019;17:1–13. doi: 10.1016/j.csbj.2018.11.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Wherry EJ, Ha SJ, Kaech SM, et al. Molecular Signature of CD8+ T Cell Exhaustion during Chronic Viral Infection. Immunity. 2007;27(4):670–684. doi: 10.1016/j.immuni.2007.09.006 [DOI] [PubMed] [Google Scholar]
  • 24.Im SJ, Hashimoto M, Gerner MY, et al. Defining CD8 + T cells that provide the proliferative burst after PD-1 therapy. Nature. 2016;537(7620):417–421. doi: 10.1038/nature19330 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Jadhav RR, Im SJ, Hu B, et al. Epigenetic signature of PD-1+ TCF1+ CD8 T cells that act as resource cells during chronic viral infection and respond to PD-1 blockade. Proc Natl Acad Sci U S A. 2019;116(28):14113–14118. doi: 10.1073/pnas.1903520116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Garrod KR, Wei SH, Parker I, Cahalan MD. Natural killer cells actively patrol peripheral lymph nodes forming stable conjugates to eliminate MHC-mismatched targets. Proc Natl Acad Sci U S A. 2007;104(29):12081–12086. doi: 10.1073/pnas.0702867104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Okada T, Cyster JG. CC Chemokine Receptor 7 Contributes to Gi-Dependent T Cell Motility in the Lymph Node. J Immunol. 2007;178(5):2973–2978. doi: 10.4049/jimmunol.178.5.2973 [DOI] [PubMed] [Google Scholar]
  • 28.Lee MS, Hanspers K, Barker CS, Korn AP, McCune JM. Gene expression profiles during human CD4+ T cell differentiation. Int Immunol. 2004;16(8):1109–1124. doi: 10.1093/intimm/dxh112 [DOI] [PubMed] [Google Scholar]
  • 29.Medaglia C, Giladi A, Stoler-Barak L, et al. Spatial reconstruction of immune niches by combining photoactivatable reporters and scRNA-seq. Science (80- ). 2017;358(6370):1622–1626. doi: 10.1126/science.aao4277 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Zheng GXY, Terry JM, Belgrader P, et al. Massively parallel digital transcriptional profiling of single cells. Nat Commun. 2017;8. doi: 10.1038/ncomms14049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Macosko EZ, Basu A, Satija R, et al. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell. 2015;161(5):1202–1214. doi: 10.1016/j.cell.2015.05.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Picelli S, Faridani OR, Björklund ÅK, Winberg G, Sagasser S, Sandberg R. Full-length RNA-seq from single cells using Smart-seq2. Nat Protoc. 2014;9(1):171–181. doi: 10.1038/nprot.2014.006 [DOI] [PubMed] [Google Scholar]
  • 33.Meador JP, Lech JJ, Rice SD, et al. Massively Parallel Single-Cell. 2014;(February). [Google Scholar]
  • 34.Safferling K, Sütterlin T, Westphal K, et al. Wound healing revised: A novel reepithelialization mechanism revealed by in vitro and in silico models. J Cell Biol. 2013;203(4):691–709. doi: 10.1083/jcb.201212020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Park S, Gonzalez DG, Guirao B, et al. Tissue-scale coordination of cellular behaviour promotes epidermal wound repair in live mice. Nat Cell Biol. 2017;19(3):155–163. doi: 10.1038/ncb3472 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Rockey DC, Weymouth N, Shi Z. Smooth muscle α actin (Acta2) and myofibroblast function during hepatic wound healing. PLoS One. 2013;8(10). doi: 10.1371/journal.pone.0077166 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Simone TM, Higgins CE, Czekay R-P, et al. SERPINE1: A Molecular Switch in the Proliferation-Migration Dichotomy in Wound-“Activated” Keratinocytes. Adv Wound Care. 2014;3(3):281–290. doi: 10.1089/wound.2013.0512 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Weber JP, Fuhrmann F, Feist RK, et al. ICOS maintains the T follicular helper cell phenotype by down-regulating kruppel-like factor 2. J Exp Med. 2015;212(2):217–233. doi: 10.1084/jem.20141432 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Zemmour D, Zilionis R, Kiner E, Klein AM, Mathis D, Benoist C. Single-cell gene expression reveals a landscape of regulatory T cell phenotypes shaped by the TCR. Nat Immunol. 2018;19(3):291–301. doi: 10.1038/s41590-018-0051-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Movahedi K, Laoui D, Gysemans C, et al. Different tumor microenvironments contain functionally distinct subsets of macrophages derived from Ly6C(high) monocytes. Cancer Res. 2010;70(14):5728–5739. doi: 10.1158/0008-5472.CAN-09-4672 [DOI] [PubMed] [Google Scholar]
  • 41.Franklin RA, Liao W, Sarkar A, et al. The cellular and molecular origin of tumor-associated macrophages. Science (80- ). 2014;344(6186):921–925. doi: 10.1126/science.1252510 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Wang Y, Hu J, Li Y, et al. The transcription factor Tcf1 preserves the effector function of exhausted CD8 T cells during chronic viral infection. Front Immunol. 2019;10(FEB):1–14. doi: 10.3389/fimmu.2019.00169 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Parsonage G, MacHado LR, Hui JWY, et al. CXCR6 and CCR5 localize T lymphocyte subsets in nasopharyngeal carcinoma. Am J Pathol. 2012;180(3):1215–1222. doi: 10.1016/j.ajpath.2011.11.032 [DOI] [PubMed] [Google Scholar]
  • 44.González-Martín A, Mira E, Mañes S. CCR5 in cancer immunotherapy. Oncoimmunology. 2012;1(1):106–108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.De Oliveira CE, Gasparoto TH, Pinheiro CR, et al. CCR5-dependent homing of T regulatory cells to the tumor microenvironment contributes to skin squamous cell carcinoma development. Mol Cancer Ther. 2017;16(12):2871–2880. doi: 10.1158/1535-7163.MCT-17-0341 [DOI] [PubMed] [Google Scholar]
  • 46.Pont F, Tosolini M, Fournié JJ. Single-Cell Signature Explorer for comprehensive visualization of single cell signatures across scRNA-seq datasets. Nucleic Acids Res. 2019;47(21). doi: 10.1093/nar/gkz601 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Figure 1

Supplementary Figure 1: In-tube validation that photo-uncaging combined with Zipcode hybridization can distinguish cell populations. Purified CD4 and CD8 T mouse T cells were labeled with a caged anti-CD45 Ab-DNA conjugate (without internal fluorophore modification) and then spatially confined to separate tubes. ‘CD4 T cells’ were uniquely illuminated prior to addition of the first Zipcode (ZC1) to both tubes, followed by a washing step. ‘CD8 T cells’ were then illuminated and a second Zipcode (ZC2) was added to both tubes followed by washing. The contents of both tubes were pooled and introduced into the 10X scRNA-Seq pipeline. Nearest neighbor clustering of the populations sorted CD4 and CD8 T cell signatures from each other. We observed good (95%) correlation between dominant Zipcode identity and assignment to CD4 or CD8 clusters. We believe this error rate stems from the fact that the starting isolated populations were also approximately 95% pure by flow cytometry. (a) Flow cytometry plots of purified ‘CD4’ (population 1) and ‘CD8’ (population 2) populations used for the following study. CD4 and CD8 staining were used to define single positive cells with the % of cells in the single positive gate displayed. (b) Workflow for validation study. Population 1 was photo-uncaged w/ 365 nm light illumination prior to addition of Zipcode 1 while population 2 was illuminated prior to Zipcode 2 addition as a simulation of the workflow in tissue. (c) Histogram of (ZC1−ZC2)/(ZC1+ZC2) showing the bimodal distribution of cells with a ZC1 and ZC2 majority population as well as a small population centered about 0, suggesting presence of mixed doublets. Line at 0 denotes demarcation of cells into either ZC1 or ZC2 dominant population. (d) UMAP dimensional reduction of live cells with dominant Zipcode ID determined from (c) overlaid (blue: ZC1, red: ZC2) (n=1107) (nUMI = 11k, nGene = 2442, Cutoffs: nGene > 500 percent.mito < 0.15) (e) Overlay of normalized gene expression levels for selected genes that define CD4 and CD8 T cells populations. (f) Proportion of cells assigned to either ZC1 or ZC2 dominant populations as shown in (c) belonging to either cluster 1 or 2. Doublets were computationally excluded and the proportions following are also displayed. Mixing of Zipcodes during population pooling/tissue dissociation quantified. Briefly, two populations of mouse lymphocytes were labeled with Zipcode strand conjugated to TAMRA or FAM fluorophores. Cells were then pooled in media conditions mimicking tissue dissociation steps and incubated at RT for the times denoted before flow cytometric analysis. (g) Histograms of TAMRA signal denoting the two populations pooled as well as quantification of gMFI over time of these two starting populations. (h) Similarly for FAM signal. Primary mouse CD4 lymphocytes were isolated from mouse lymph nodes and labeled with anti-CD45 Ab-DNA conjugate hybridized to a fluorescently labeled Zipcode strand. Cells were then subjected to 0/1/2/3 rounds of UV illuminations, Zipcode additions and wash steps meant to mimic the workflow described for sequential Zipcode labeling of cells. Additionally, a group of cells was left untouched at 4C during this process. Following this process, cells were labeled with Zombie NIR Live/Dead stain as well as DAPI to assay cell viability. (i) Flow cytometric gating strategy showing how viability was determined from a Zombie NIR and DAPI population. (j) Fraction of singlet cells determined to be viable for cells undergoing 0 to 3 rounds of illumination and labeling. ‘4c’ denotes untouched sample sitting on ice (k) TAMRA gMFI of viable cell populations for the conditions shown.

Supplementary Figure 3

Supplementary Figure 3: Heatmap of gene expression (color) over pseudotime (columns) for most significantly branch-dependent genes (rows) as determined by BEAM analysis. Genes are hierarchically clustered to illustrate genes that share expression patterns across branch point. (a) For the first branch point (from state 1 to either state 2 or 3/4) and (b) for second branch point differentiating states 3 and 4.

Supplementary Figure 4

Supplementary Figure 4: Feature plots of monocyte/macrophage population for selected genes that define terminal states 2-4. (a) UMAP dimensional reduction of monocyte/macrophage population with state identities overlaid. (b) Genes that are expressed by both states 2 and 3 over state 4. Despite the differentiation trajectory, States 2 and 3 shared overexpression of Arg1 and angiogenesis factors (Lgals3, Vegfa) while under-expressing MHCII (H2-Aa and H2-Ab1) compared to State 4. (c) State 2 diverged from State 3 through expression of C3ar1, Ptgs2, Cxcl2, Fcgr1 (d) Genes overexpressed in state 3. State 3 was defined by overexpression of genes such as Nos2, Timp2, and Mmp12. (e) Genes overexpressed in state 4. State 4 was defined by expression of H2-Aa, C1qc/a, Apoe and Cx3cr1. Color scale indicates log-normalized transcript counts.

Supplementary Figure 2

Supplementary Figure 2: (a) Plot of Zipcode counts for Zipcode 1 and 2 on a logarithmic scale for cells detected using Cell Ranger defined cutoff. Regional calls were made based on dominance of ZC1 or ZC2 using k-means clustering to identify three groups. Cells within the cluster termed as Outer (Blue) had relatively more ZC1 counts and inversely, cells within the Inner (Red) cluster were dominated by ZC2 counts. Cells in-between were determined to be ‘ambiguous’ and excluded from downstream analysis.

Supplementary Figure 5

Supplementary Figure 5: (a-c) Differential Gene expression analysis between margin and interior cells for each of the terminal macrophage states (2-4) defined through Monocle analysis as shown in Figure 4G. Volcano plots show genes with p-value (Bonferroni corrected) > 0.05 colored blue for margin enriched and red for interior enriched.

Supplementary Figure 6

Supplementary Figure 6: (a) Raw counts for Zipcodes 1-4 from live cells as defined by CellRanger were normalized to the total Zipcode count for each cell (i.e. % of ZC). Principal components were calculated using these 4 parameters. Plots show PC1 vs PC2 with the normalized ZC count for 1-4 overlaid. (b) k-means clustering was used to identify 5 clusters based on the normalized ZC counts in order to identify ZC1-4 dominant populations. Color overlays correspond to the regional call used in Figure 5.

Supplementary Figure 8

Supplementary Figure 8: 8 region Zipcode delineation using combinatorial design with 20-micron resolution. (a) Microscope image of field of mouse T cells following 3 rounds of illumination w/ patterns shown and addition of three distinct Zipcode species with conjugated fluorophores. Experimental setup was similar to Figure 5I. Each grid square is approximately 20 × 20 μm. (b) Quantification of mean fluorescence intensity for each channel (Zipcode) on cell surfaces. Briefly, for each grid square, cell membranes were defined using the union of masks generated by thresholding in each channel. Mean channel intensity was calculated for pixels within this combined mask and background intensity for each channel was subtracted. Bar charts are shown normalized to maximal MFI in all 8 grid squares. Scale bar = 20 μm.

Supplementary Figure 7

Supplementary Figure 7: In-tube validation of combinatorial Zipcode labeling strategy (a) Isolated primary mouse immune cells of the designated types were labeled with the caged anti-CD45 anchor conjugate. Each population was then subjected to the specified series of illuminations and Zipcode additions with Zipcodes 1 and 2 used to determine identity and Zipcode 3 which was added onto all cells as a reference for normalization during analysis. Cell were then pooled and run through the modified 10X pipeline in order to recover both Zipcode and cDNA single cell libraries. (BMDC: Bone Marrow Derived Dendritic Cell, BMDM: Bone Marrow Derived Macrophages) (b) TSNE Plot showing major immune cell populations annotated. (n=6778 cells) (nUMI = 11.3k, nGene = 2633, Cutoffs: nGene = 500, percent.mito = 0.15) (c) Same tSNE plot with combinatorial Zipcode id overlaid. Black dots represent ambiguous Zipcode calls. (d) Plot of log-transformed, normalized Zipcode counts for ZC1 vs ZC2 (normalized to total ZC1-3 count) with Zipcode combination id’s overlaid. Clusters were determined using density based clustering algorithm DBSCAN on normalized ZC1 and ZC2 counts. (e) Bar charts showing proportion of cells with a particular Zipcode combination for a given cell type before and after doublet removal. Doublets were determined using DoubletFinder (ndoublet=419). Colors consistent with those used in (b).

Data Availability Statement

For all scRNA-Seq studies described here, transcript counts as determined by CellRanger count function as well as raw Zipcode fastq’s/counts as well as the modified genome for transgene alignment during CellRanger count can be found on GEO under accession#: GSE145502

RESOURCES