Abstract
A holistic understanding of tissue and organ structure and function requires the detection of molecular constituents in their original three-dimensional (3D) context. Imaging mass cytometry (IMC) enables simultaneous detection of up to 40 antigens and transcripts using metal-tagged antibodies but has so far been restricted to two-dimensional imaging. Here we report the development of 3D IMC for multiplexed 3D tissue analysis at single-cell resolution and demonstrate the utility of the technology by analysis of human breast cancer samples. The resulting 3D models reveal cellular and microenvironmental heterogeneity and cell-level tissue organization not detectable in two dimensions. 3D IMC will prove powerful in the study of phenomena occurring in 3D space such as tumor cell invasion and is expected to provide invaluable insights into cellular microenvironments and tissue architecture.
Tissues and organs are complex ecosystems consisting of numerous cell types arranged in a manner that is inextricably related to function. Understanding tissue function and pathology thus requires knowledge of constituent cells and their states, extracellular matrix proteins and vasculature in the context of their native 3D arrangement. Historically, tissues have been studied using microscopy and recently developed methods have enabled various types of 3D tissue analysis (Supplementary Table 1). Confocal 3D microscopy enables analysis of tissue sections at subcellular resolution but is limited to a depth of about 100 μm1. Multi-photon confocal and light-sheet microscopes allow for 3D reconstructions of up to 1-mm tissue depth at single-cell resolution2,3. As these 3D microscopy methods rely on fluorescent reporters that show high spectral overlap, the number of epitopes that can be measured simultaneously is limited.
To enable multiplexed tissue analysis, cyclic immunostaining and chromogenic approaches have been used4–7 and such methods have also been implemented in 3D8,9. In addition to fluorescence-based approaches, mass spectrometry-based imaging of epitopes and transcripts is becoming broadly used. In mass spectrometry-based technologies, mass tags, such as a molecule of a defined mass or metal isotopes, are used as reporters on affinity reagents10,11,12. IMC allows simultaneous detection of up to 40 antigens13 and nucleic acid sequences14 in formalin-fixed paraffin-embedded (FFPE) tissues15, in frozen tissue sections16 and in cultured cells17. Currently, however, none of these methods combines multiplex detection of many targets with 3D tissue imaging, which is necessary for visualization of single cells together with larger structures such as blood vessels.
Here we describe an extension of IMC to the analysis of tissues in 3D. With the 3D IMC method, the volume and depth of a tissue that can be analyzed is limited mainly by the measurement time. The full pipeline, from sample processing to cell-level computational analysis of a complete 3D model, can be performed in 1 week15,18. We demonstrate how 3D IMC enables the study of tumor architecture by combining the analysis of tissue volumes with single-cell information. We show that spatial heterogeneity of marker expression and preferential cell–cell interactions become apparent with 3D models and that spatially contained events could be captured within a single 3D model. Overall, we demonstrate that the detailed models generated with 3D IMC facilitate comprehensive, single-cell-resolution analysis of cellular microenvironments and tissue architecture.
Results
Generation of 3D models from IMC data
Our 3D IMC approach relies on serial sectioning of a tissue cylinder punched from a paraffin-embedded tissue. We optimized our sample processing methods and settled on using an ultramicrotome with a diamond knife designed for FFPE sectioning to minimize deformations that are known to be caused during tissue cutting, handling of thin slices, and further experimental procedures19. We chose to cut 2-μm-thick sections to provide a compromise between capturing single cells across multiple slices and the difficulty of handling ultra-thin slices, thereby making the approach accessible to a broader user base. Tissue sectioning is followed by tissue hydration and heat-induced epitope retrieval (Extended Data Fig. 1). After acquiring two-dimensional (2D) IMC data, we assemble 3D tissue models and derive single-cell marker profiles using a computational workflow that we established based on publicly available tools (Fig. 1a). Data acquisition for 0.5 mm3 currently takes about 1 week.
The 3D IMC approach enables the study of the spatial distribution of around 40 epitopes in the context of 3D tissue architecture. To exemplify the 3D IMC workflow, we reconstructed 3D models from different breast cancer samples. For the main model with a final size of 652 × 488 × 304 μm3, we used an HER2+ ductal breast carcinoma sample (Fig. 1b). We cut 152 consecutive slices from the tissue sample and stained the sections with a breast-cancer-centric panel of antibodies (Supplementary Table 2 and Extended Data Fig. 2a) designed to reveal basal and luminal cells, vascularization, immune cell infiltration, proliferation, apoptosis, hypoxia, cell signaling and collagen deposition. After data acquisition, we converted the files to TIFF format and aligned consecutive images to achieve the final model from the overlapping area of the images (Extended Data Fig. 2b and Supplementary Videos 1 and 2). The raw voxel-level data rendering of this sample shows regions positive for pan-cytokeratin (panCK) intertwined with blood vessels and stromal mass (Fig. 1b and Supplementary Video 3). The stromal compartment seems heterogeneous in cell composition throughout the volume analyzed, expressing common immune cell markers, including CD20, CD8a, CD3 and CD68 (Fig. 1b). Thus, with 3D IMC, spatial arrangement of multiple markers can be analyzed throughout the tissue volume by evaluating distributions of the raw voxel-level data.
Derivation of phenotypic single-cell data from 3D IMC images
To study the tumor ecosystem in more depth, we extracted single-cell information from the overlapping area of the images using a 3D watershed segmentation (Fig. 2a,b). Segmentation assigns each cell a unique identifier enabling the computation of antibody signal statistics and spatial data (such as volume and direct neighbor interactions) for each single cell. The antibody signal statistics, hereafter referred to as marker expression, is calculated as the mean intensity of ion counts over each single-cell 3D mask for every channel in the antibody panel. We performed the 3D watershed segmentation on the nuclear channel stained with iridium after applying a set of digital image processing steps (Methods). The segmentation results were evaluated visually by overlaying the single-cell mask with the iridium signal (Fig. 2b,c). The image processing steps were chosen such that the final mask was minimally affected by the intensity variation between the slices and that it would yield similar average cell diameters for the x–y and x–z directions (Extended Data Fig. 3a,b). Final statistics for each cell were collated in a cell data catalog for easy export and downstream single-cell analysis (Extended Data Fig. 3c). The iridium-channel-dependent segmentation enables the segmentation procedure to be applied to IMC data regardless of the antibody panel.
To assess whether the derived marker expression for single cells captures expected biological information and aligns with the raw voxel data, we used unsupervised Phenograph clustering to assign a cell phenotype to each single cell (Fig. 3a)20. The antibodies in our panel were chosen to enable identification of known tumor, immune, and stromal cell phenotypes. We identified expected cell phenotypes based on differential expression of key markers such as epithelial cells positive for panCK (such as clusters 1–7 and 10–13), stromal cells positive for collagen and SMA (clusters 17 and 22), B cells positive for CD20 and CD45 (cluster 25), T cells positive for CD8a, CD3 and CD45 (cluster 18), macrophages positive for CD68 and CD45 (cluster 30) and endothelial cells positive for von Willebrand factor (vWF) and CD31 (cluster 21). Additionally, the segmented data captured the expected average larger size of tumor cells compared to B cells and T cells (Extended Data Fig. 3d). The segmentation results can be evaluated by overlaying raw voxel data with a cell phenotype mask, where each voxel for every cell is assigned its cluster label (Figs. 1b and 3b and Supplementary Videos 4 and 5). Highlighting the value of 3D data, the location of different cell phenotypes can be easily visualized, for example we observed a clear tendency of CD45+CD3+ T cells (cluster 13 and 18) to cluster around the vWF+CD31+ endothelial cells (cluster 21) (Fig. 3c and Supplementary Video 6). Similarly, we observed spatial separation of different subsets of CD68+ cells (clusters 30 and 28) (Extended Data Fig. 3e). These distinct patterns would be much more difficult to recognize in 2D. In addition to assigning each cell its phenotype based on aggregated marker expression via clustering, the mean antibody signal statistics for each individual cell can be evaluated together with its spatial location using an expression mask, where each voxel for every cell is assigned the derived marker expression value. We observed that tumor cells in this model showed varying expression levels of luminal cell markers (cytokeratins 8/18 and 19), basal cell markers (cytokeratin 5 and SMA) and other breast cancer epithelial markers such as HER2 and CD44 (Fig. 3d and Supplementary Videos 7 and 8). This shows that 3D IMC captures marker expression heterogeneity in tumor cells as expected from our previous large-scale analyses of breast cancer cohorts13,21,22. Reassuringly, applying our pipeline with the same segmentation approach and with the same antibody panel to a second breast carcinoma sample from the same patient yielded similar cell phenotypes (Extended Data Fig. 4a–d and Supplementary Table 3). These analyses demonstrate that 3D IMC enables a detailed analysis of expected cellular phenotypes in a 3D tissue context and identifies cellular populations that differ in location within the tissue volume.
3D IMC reveals cellular and microenvironmental relationships
As tissues are 3D structures, we expected that measurements of distances and intercellular proximity in 3D should better capture the modeled tissue than 2D IMC. We assessed this by comparing various quantitative measures in data obtained from the main HER2-positive ductal breast carcinoma sample described above. We first measured distances between different cell phenotypes, determined with the initial clustering step and the nearest blood vessel in all 2D sections and in the reconstructed 3D model (Fig. 4a). The average distances for different cell phenotypes to the closest blood vessel in 3D were always shorter than when measured in 2D (Fig. 4b). This was as expected, because in single 2D planes the features above and below the plane cannot be detected. To test whether cell–cell interactions differed when analyzed in 3D versus 2D, we determined which cell phenotypes were directly touching for all the cells in each section in 2D and for the entire 3D model. The 3D interaction analysis yielded a different cell interaction picture than 2D analysis. For example, there were more homotypic interactions among all cell types in 2D than in 3D and fewer heterotypic interactions among cluster 25 (B cells) and cluster 18 (T cells) in 2D (Fig. 4c). Plausible explanations for these differences are under-sampling in the 2D space and the existence of interactions that occur in constrained directions (such as lymphocytes tethered to blood vessels). These two analyses suggest that a 3D model yields quantitatively different measures of cellular and microenvironmental relationships than a 2D image and that the differences are expected or plausible given the 3D tissue architecture.
We also reasoned that 3D models would reveal spatial arrangements of protein expression that might not be visible on 2D images. In our main HER2+ ductal breast carcinoma 3D model, we observed a panCK+ luminal region and a patchy and discontinuous pattern of cytokeratin 5 (CK5) in the basal layer, although SMA, another basal marker, had uniform expression (Fig. 5a,b). The patchy basal layer is only clearly visible in the 3D reconstruction rather than in the 2D image (Fig. 5a,b and Supplementary Video 9). We observed a similar discontinuous pattern of CK5 in the second breast carcinoma sample reconstructed from the same HER2+ ductal breast patient sample using the 3D IMC workflow (Fig. 5c). The second model is composed of large epithelial tumor compartments and a stromal compartment with a CD8a+ region (Fig. 5d and Supplementary Video 10). Understanding these different phenotypes in the basal layer may shed light on the early steps of invasion in ductal carcinomas. Additionally, with cell-level marker expression data in the second breast carcinoma model, we observed in 3D that pS6+ cells line the epithelial tumor compartments (Fig. 5e). In 2D, the distribution of pS6+ cells seems disordered and the distance to the SMA+ cell area is unclear (Fig. 5f). In the 3D reconstruction, however, it is apparent that pS6+ cells colocalize with SMA+ cells (Fig. 5f and Supplementary Video 11). These examples illustrate that spatial cell-level organization become apparent in 3D, underscoring the value of our approach for studying tissue architecture.
Spatial invasion-associated events captured by 3D models
3D reconstructions could be particularly powerful in analysis of phenomena that proceed through 3D space. Reconstruction of the main HER2+ ductal breast carcinoma 3D model revealed a structure that might resembles a microinvasive lesion and a stream of potentially invasive tumor cells (Fig. 6a). The distal part of the tumor (up to a depth of 80 μm along the z axis) contained no putative invading cells and the tumor cell compartments had smooth tumor–stroma boundaries (Fig. 6a). Toward the center of the tumor (about a depth of 120 μm along the z axis) and through the proximal area (160 μm and deeper), putative invasive panCK+ epithelial cells potentially emerging from a microprotrusion were detected (Fig. 6a,b and Supplementary Video 12). In the 3D model, these potentially invasive cells could be differentiated from the bulk tumor mass with confidence, as it is possible to determine whether a given cluster of epithelial cells is completely disconnected from the tumor network or not. This is impossible in a single or few 2D sections. We observed that these cells differed from the tumor bulk in the expression level for key epithelial cell markers (Fig. 6c and Supplementary Video 13). The putative invasive cells had lower expression of epithelial markers such as CK8/18, CK19 and HER2 and higher expression of CD138 and pS6 than other epithelial cells in the whole model (Extended Data Fig. 5a). We also observed that CK5+ basal cells and stromal cells in the immediate microenvironment (50-μm radius from each putative invasive cell) also had increased levels of pS6 expression compared to other CK5+ basal and stromal cells in this model (Fig. 6d and Extended Data Fig. 5b).
We reconstructed two additional 3D models to further illustrate the capabilities of 3D IMC for analyzing spatially confined events. These samples were identified by pathologists as lymphovascular invasion sites at the periphery of two different HER2+ breast carcinomas and as such should capture metastasis-associated events. Presence of lymphovascular invasion in breast cancer has been associated with worse patient outcome23 and various genes have been associated with a higher likelihood of the event24,25, but they are challenging to analyze on account of their 3D topology. Multiplexed 3D IMC enabled us to evaluate cell subsets and their 3D spatial distribution in such lymphovascular clusters. For instance, in one of the models (Fig. 6e, Supplementary Video 14 and Supplementary Table 4), a large panCK+ tumor cell cluster is visible inside a blood vessel that is positive for vWF, CD31 and vimentin. Similarly, we captured a HER2+ and E/P-cadherin+ tumor cell cluster inside a lymph vessel (SMA+, CD31− and vimentin−) and single E-cadherin+ cells in the surrounding blood vessels (positive for CD31 and vimentin) (Fig. 6f, Supplementary Video 15 and Supplementary Table 4), in the second lymphovascular invasion model. Clustering-based phenotypic analysis revealed higher expression of HER2 and lower expression of TMEM173 in the large lymph vessel-invaded cluster compared to the single blood vessel-invaded cells (Extended Data Fig. 5c). Future analysis could involve an analysis of cell phenotypes inside the lymphovascular clusters in comparison to the whole tumor mass. Confident assignment of the properties of potentially invasive tumor cell clusters and their associations with metastases will require further study, but these three examples highlight the suitability of multiplexed 3D IMC for studying spatial events such as cell invasion during metastasis.
Conclusions
Here we demonstrate a multiplexed, single-cell resolution, 3D analysis strategy for FFPE tissues. 3D IMC will enable detailed visualization and analysis of cell lineage and communication mechanisms in the context of native tissue architecture for any tissue type. Studies using multiplexed 2D imaging methods have shown a correlation between spatial cell communities and clinical outcomes in breast and colorectal cancer21,26,27 and difference in cell interaction patterns has been linked to disease progression in diabetes28. We have demonstrated that measurements of the spatial neighborhood can differ between 3D and 2D images. As 3D measurements are expected to better capture tissue composition compared to 2D representation, 3D IMC could prove especially beneficial for studying the effect of microenvironment on tumor cell biology. Additionally, 3D methods have been indispensable for studying tumor invasion into blood vessels in pancreatic cancer patient samples29 or identifying tumor budding in samples from patients with pancreatic, colorectal, lung and breast cancer30 because neither of the phenomena can be represented clearly in a single 2D image. 3D IMC would be suitable for capturing such events and thanks to the multiplexed nature of IMC, our method would facilitate a detailed understanding of cell characteristics associated with invasive events.
Additional computational and technical developments will further improve the utility of 3D IMC. For instance, acquisition time is long (4 full days for each tumor sample). Development of hardware with higher ablation speed will allow shorter acquisition times or, alternatively, better resolution. As the amount of 3D IMC data increases, better 3D cell segmentation approaches based on machine learning methods could also be implemented31, which could provide more accurate cell shape representations and reduce unclear cell phenotypes. Furthermore, newly developed visualization and analysis tools, including virtual reality applications, to augment the amount of data that can be simultaneously visualized in 3D32,33, will further facilitate analysis of such detailed 3D datasets. With our accessible computational pipeline based on publicly available tools and a growing set of well-validated antibodies among IMC users, we envision that 3D IMC could be increasingly employed by different laboratories. In conclusion, the 3D IMC approach advances the study of the spatial distribution of proteins in the context of 3D tissue architecture.
Methods
Statement of research integrity
This study complies with all relevant ethical regulations and was approved by the ethics committee of the Canton Zurich (PB_2016-00811).
Breast cancer tissue samples
We used archived material from the tissue biobank at the University Hospital Zurich. Regions of interest in different breast carcinoma samples were identified in hematoxylin and eosin-stained sections from FFPE tissue blocks by a pathologist and 1-mm or 1.5-mm diameter cylindrical punches were obtained. The main and the second breast carcinoma 3D models were generated from one patient sample. The blood and the lymph vessel lymphovascular invasion models were generated from two different patient samples. Another sample from a patient with breast carcinoma was used for comparing different antigen retrieval conditions
Antibody testing and validation
Antibody panels were designed using AirLab34. All antibodies (Supplementary Tables 2–4 and Extended Data Fig. 1b) were conjugated to metals using the MaxPar X8 Multimetal Labeling Kit (Fluidigm) according to the manufacturer’s instructions. Before testing antibodies, manufacturer’s website and antibody databases were used for choosing antibodies for targets of interest. Antibodies were initially tested using immunofluorescence staining without metal conjugation in lymph node, spleen and breast cancer FFPE sections. Antibodies that revealed expression patterns consistent with the literature were chosen for metal conjugation. After the conjugation, another round of testing was undertaken using IMC with breast cancer FFPE sections and antibodies that showed staining patterns consistent with the literature and with sufficient signal intensity were utilized. In this step, thanks to multiplexing capabilities of IMC, often already validated antibodies were used in addition to the antibodies being tested to validate cell type specific staining (epithelial and different immune cells). Breast cancer FFPE tissues were used for titrating antibodies for final IMC measurements.
Sample sectioning with ultramicrotome
For generating a 3D IMC model, first a tissue punch from an FFPE sample was re-cast into an EPON silicon mold by first putting the sample into a container of molten paraffin for a few minutes to clear the remaining paraffin and then placing the tissue into the silicon mold with the cylinder axis along the axis of the mold. After which, the mold was filled with fresh paraffin and placed at 4 °C. Once the paraffin was solidified (about 1 h), a tissue block was mounted in an ultramicrotome capsule holder and a diamond trimming knife was used to trim a trapezium shape with tissue exposed in the salient block face. Trimming was required to enable adherence of consecutive sections into ribbons that then could be collected on a microscopy slide. A Histo-Jumbo Diamond knife (Diatome) with a plastic-sealed hole in the bottom was used for cutting sections. A needle connected to a syringe filled with ultrapure water was inserted through the bottom hole, a Superfrost histological glass slide was placed in a boat of the knife and then the knife boat was filled with ultrapure water until the water was level with the knife edge. With step length set to 2 μm, consecutive slices were cut. The water level in the boat was reduced using the syringe to collect ribbons on the microscopy slides. These slides were then dried on a warm metal plate (30 °C) for 2 h or left overnight at room temperature to tether tissues to the glass slide.
Tissue preparation and antigen retrieval
Tissue sections were dewaxed in xylene for at least 2 h and again in fresh xylene at least 1 h and then in xylene:ethanol (1:1) for 10 min. Samples were rehydrated in a graded series of alcohol (ethanol:deionized water 100:0, 90:10, 80:20, 70:30, 50:50 and 0:100; 10 min each). After that, slides were put into to Tris-EDTA buffer at pH 9.2 followed by heat-induced epitope retrieval in a pressure cooker (Medite) for 30 min at 95 °C or for 80 min at 80 °C. Samples were left to cool in the Tris-EDTA buffer for 20 min followed by cooling in Tris-buffered saline (TBS) at room temperature for at least 20 min and then blocked with blocking buffer containing 3% BSA in TBS or TBS with 1% horse serum and 1% FcBlock (Miltenyi Biotech) for about 1 h. Samples were then stained with antibody mix (diluted in blocking buffer) and incubated overnight at 4 °C. Samples were washed three times in TBS for 5–10 min each wash before incubating for 2–5 min with iridium intercalator diluted in TBS followed by three washes in TBS (5 min each wash) or by a 10-s wash in PBS if ruthenium staining was used in the next step. For ruthenium counterstaining15, ruthenium tetroxide (0.5% solution from Electron Microscopy Sciences) was diluted 1:1,000 in PBS. Samples were incubated for 3–5 min and then samples were washed in ultrapure water. Samples were then air dried before IMC acquisition. All samples were subjected to these steps with the following differences: the samples for the main and the second breast carcinoma model were subjected to HIER antigen retrieval for 80 min at 80 °C and were counterstained with ruthenium; the lymphovascular invasion models were subjected to HIER for 30 min at 95 °C without the ruthenium counterstaining step.
Imaging mass cytometry
Data acquisition was performed on a Helios time-of-flight mass cytometer (CyTOF) coupled to a Hyperion Imaging System (Fluidigm). Selected areas for ablation were larger than the actual area of interest to account for loss of overlapping areas among sections due to cumulative rotation.
The selected area ablated per section was around 1 mm2. Laser ablation was performed at a resolution of approximately 1 μm with a frequency of 200 Hz (or 400 Hz for lymphovascular invasion models) with estimated acquisition time of 1 mm2 h−1. To ensure performance stability, the machine was calibrated daily with a tuning slide spiked with five metal elements (Fluidigm). All data were collected using the commercial Fluidigm CyTOF software v.01.
Data processing
MCD files from the Fluidigm Hyperion imaging system were converted into OME-TIFFs using publicly available IMC data preprocessing pipeline (https://github.com/BodenmillerGroup/ImcSegmentationPipeline) based on imctools v.1.0. OME-TIFFs were then converted to single-channel 16-bit TIFF stacks and if necessary zero-padding was added to make the images the same size across a 3D stack. For the main breast carcinoma model, some images that were acquired in multiple acquisition attempts required reassembly. The 152 slices analyzed corresponded to 168 IMC acquisitions with 10 cases where a section was contained in two or three different IMC images. For the main breast cancer model slice 16 is missing and for the second breast carcinoma model slice 59 is missing; missing slices were omitted from downstream analysis.
Image registration
The following Fiji-ImageJ2-linux64 v.1.0 (ref. 35) plugins accessed via Jython scripting were used for image registration: Register Virtual Stack Slices was used to automatically extract SIFT features to calculate a transformation matrix based on a chosen registration model and Transform Virtual Stack Slices was applied to align images from all the channels (https://github.com/fiji/register_virtual_stack_slices/). For an image stack alignment for any 3D IMC model, first a single-channel stack containing images stained for smooth muscle acting were histogram equalized per image and using Register Virtual Stack Slices plugin a similarity (translation + rotation + isotropic scaling) model was applied to this stack to find a transformation matrix for the alignment. The Register Virtual Stack Slices plugin was run with the following default parameters: maxOctaveSize = 1,200, minInlierRatio = 0.05 (RANSAC true matches ratio), featuresModelIndex = RIGID, interpolate = true, rod = 0.86 (closest neighbor distance ratio). The parameter for maxEpsilon = 10 (maximal allowed alignment error in pixels) was determined empirically by calculating a cross-correlation for a small region in the middle of the aligned iridium channel image stack and choosing the parameter with highest cross-correlation value. Once a transformation matrix was determined, then the Transform Virtual Stack Slices plugin was used for separate image stacks for each of the channels present in the antibody panel. Finding the best alignment approach and applying the transformations to all the channels takes about 2 h depending on the number of slices in the model.
Single-cell segmentation
After aligning images across all channels, the stack of raw iridium images was used as a starting point for the input to 3D watershed algorithm36. Segmentation using the nuclear channel facilitates the independence of this pipeline from the antibody panel as antibody panels differ between models. All the steps for segmentation were performed in Fiji-ImageJ2-linux64 v.1.0 (ref. 35) unless custom Python scripts relying heavily on the NumPy package37 are specified. The segmentation pipeline involved multiple commonly used digital image processing steps as described below. The different steps and parameters used were determined empirically by visually assessing the overlap between the cell mask and the nuclear iridium signal (Extended Data Fig. 3b,c). The following steps resulted in single-cell segmentation:
Mean artificial images were generated, where each artificial image was calculated using custom Python script as the mean of iridium channel between a preceding and a subsequent image resulting in matching x–y–z pixel resolution of 1 μm. The image stack composed of measured and artificial images was then used for all the downstream segmentation steps, but not for single-cell data measurements.
¾e raw iridium channel intensity counts for 16-bit images were clipped to value 50 to remove bright outlier pixels followed by histogram equalization using the square root of histogram values with 0.3% noise parameter to enhance details of the image without extreme effects of classical histogram equalization. As histogram equalization enhances noise, the background pixels were removed. The minimum cutoff value was determined by eye for each model separately (for the main breast carcinoma model, the cutoff was 11,467; for the second breast carcinoma model it was 14,862; for the first lymphovascular invasion model (with blood-vessel-invaded tumor cell cluster) it was 20,243 and for the second lymphovascular invasion model (with lymph-vessel-invaded tumor cell cluster) it was 33,760).
Due to varying intensity of iridium staining on some of the images, further pixel normalization across each image was required before implementation of the watershed algorithm. CLAHE, a local histogram equalization algorithm that enhances local contrast38), was applied (parameter block size = 29, maximum slope 10,256 bins, fast = false). Subsequently, local contrast enhancement was applied that used means and standard deviations of local image regions to normalize pixel values (https://github.com/axtimwalde/mpicbg/blob/master/mpicbg/src/main/java/mpicbg/ij/integral/NormalizeLocalContrast.java); the block size was 10 and s.d. sigma value was 3.
After pixel value normalization, a median 3D filter with kernel size of 3 × 3 × 3 was followed by 3D Gaussian blur (sigma = 1) to integrate pixel values across the 3D stack.
The difference of Gaussians for each image in the stack (between sigma 2 and sigma 3) was used as an input to the watershed segmentation algorithm. This approach was chosen as the difference of Gaussian filters is commonly used for blob detection algorithms including nuclei segmentation.
An implementation of watershed called h-watershed (available from https://github.com/mpicbg-scicomp/Interactive-H-Watershed/) without interactive window was used for 3D nuclear segmentation (background parameter = 500 for 16-bit images and 90% flooding threshold). The output from this step was a stack of cell masks equivalent to the input stack where each object is marked with a unique identifier.
Images containing cell masks for the artificial mean images were removed and only masks for true images were kept for next steps, including the final single-cell measurements.
Further post-processing of the segmentation mask obtained from h-watershed was performed using Python scikit-image 1.5.2 package39. The following steps were implemented: (1) objects with size smaller or equal to 10 voxels were removed; (2) for rare objects that were disconnected on an x–y plane but connected in z plane, the smaller part of the object was removed; (3) all remaining object labels were expanded by 1 pixel on x–y plane; (4) objects smaller than 21 voxels were removed; and (5) objects with raw mean iridium signal <1 were removed.
This final cell mask was then cut so that only the overlapping area for the whole 3D stack was analyzed. Dimensions for choosing the overlapping area were determined by eye and differed for each model.
Image preprocessing for this segmentation pipeline takes about 30 min. The 3D watershed implementation takes a few minutes for smaller models (16–30 slices) and a few hours for larger models (92–152 slices). Post-processing of the segmentation mask takes about 30 min. This segmentation workflow greatly benefited from a visualization tool easily launched from Jupyter notebooks via Python called napari v.0.4.1rc2 (ref. 40).
Single-cell data measurements
The final single-cell masks obtained from the segmentation pipeline were used to measure raw data for each marker for each cell. Mean intensity for each channel was measured as implemented in the skimage. measure.regionprops function. Before measuring mean intensity per object, hot pixels caused by artifacts of antibody staining were removed for each channel image stack analogously to our standard 2D IMC segmentation pipeline based on CellProfiler plugins. The hot pixels were given the value of the maximum local neighbor intensity (kernel size 3 × 3). Hot pixels were classified as any pixels that are 50 units higher than the difference between the raw data image and local maximum intensity. After measuring the mean intensities for each object for each channel, the mean intensities were compensated to reduce the rare ‘spillover’ between metal isotopes using the compensation matrix described previously41. The final output from this step was exported as a csv matrix for downstream single-cell analysis.
Single-cell clustering
Single-cell marker expression data was clustered with Python3 implementation of Phenograph20 (https://github.com/jacoblevine/PhenoGraph) using Manhattan distance and ten nearest neighbors. Before clustering, marker expression was range normalized to the 99th percentile across all cells for each channel separately to give equal weight to each marker. The z-scored cluster means were displayed as heat map using the Python Scanpy package42.
Assessment of single-cell segmentation results
The following multipronged strategy was used for assessing the performance of the segmentation approach. (1) Visually assess whether individual cell masks overlapped with direct measurements of a nuclear signal stained with iridium and whether individual cell masks were separated if clear separation was visible in the nuclear signal. (2) Visually assess overlap between raw voxel-level data and the final segmentation masks for expected cell types, such as epithelial cells in blood vessels and immune cells (Supplementary Video 4). (3) Check that the final segmentation mask is not affected by intensity variation between the sequential slices in the rare cases where such variation occurs that most segmented objects extend over multiple slices as expected by calculating the number of objects that are only present on one slice for each slice in the model (except on the first and last slice of the image deck) (Extended Data Figs. 3a and 4b). (4) Check that the 3D watershed algorithm uses information from z direction and the final objects are not biased in x–y direction compared to z direction by measuring the average axis length in x–y direction and x–z direction for each object present in the model (Extended Data Figs. 3b and 4c). (5) Check whether the resulting cell volume for cells belonging to different phenotypic clusters captured the expected average larger size of tumor cell phenotypes in comparison to immune cell phenotypes (Extended Data Figs. 3d and 4d). (6) Assess whether the unsupervised clustering method of the resulting segmented data yielded the expected phenotypes based on the antibody panel, such as CD20+ B cells, CD8+ T cells, CD68+ macrophages, vWF+CD31+ endothelial cells, CK5+ basal cells and different clusters of cytokeratin-positive cells in accordance with biological knowledge of the respective marker expression of these cell types. The clustering method also separated a group of rare objects that are high for all the markers due to experimental artifacts such as folds in the paraffin.
Distance measurements to closest blood vessel
First, the center of each segmented cell in 3D or separately for each slice in the model for 2D analysis was determined by using skimage.measure.regionprops.centroid function in Python using the single-cell mask TIFF stack the main Her2+ breast carcinoma model. For 2D analysis, the 3D segmentation mask for each individual 2D slice was used. Second, blood vessel mask was created with Fiji-ImageJ2-linux64 v.1.0 using an image of vWF+CD31 channel by first normalizing the 16 bit raw data TIFF image, then applying Gaussian blur (sigma 1) and using Renjyi entropy filter to generate a binary vessel mask image. Third, for each cell centroid the Euclidean distance to the closest blood vessel mask pixel was calculated. Because in the 3D single-cell mask TIFF stack objects occur over several slices, for the 2D calculation the minimum distance over all the slices that a cell occurs was taken as the final distance measurement. All the measurements for each cell were then average over phenotypic clusters determined in the previous step.
Measurement of direct neighbors
Direct touching neighbors were measured for the main Her2+ breast carcinoma model using skimage region adjacency graph function for each object in the whole 3D model and in each individual 2D slice using the single-cell mask TIFF stack. For 2D analysis, the final 3D segmentation mask for each individual 2D slice was used. For each cell the proportion of neighboring cells belonging to each phenotypic cluster (determined earlier with Phenograph clustering) was determined in 3D or for each 2D slice using a respective region adjacency graph. First, for each cell the number of cells belonging to each cluster among its directly touching neighbors was counted. Second, from the previous step, all the cells were separated based on the cell type (cluster) assignment and then for each cell type (cluster), the number of cells belonging to each of the cluster was counted (for example, cell type 1 (cluster 1) had a cell type 2 (cluster 2) as a directly touching neighbor in the model for 500 times). Third, for each cell type (cluster) the total number of neighbors was calculated by summing the cell counts across the different clusters and then the proportions were determined by dividing the per-cluster cell counts (step 2) by the total number of neighbors. These steps were repeated for each individual 2D slice. Finally, the difference in the 3D and 2D proportions was calculated by subtracting 2D proportions from 3D proportions.
Analysis of putative invasive cells
The putative invasive cells were determined in the main Her2+ breast carcinoma model by manually selecting and recording the cell label for cells that were detached from the main tumor bulk and were present in the putative invasive region and had mean expression of pan-cytokeratin >0.2 (the expression levels were range normalized to 99th percentile across all the cells for each marker in the antibody panel) using napari v.0.4.1rc2 (ref. 40). A total of 763 putative invasive cells were found. The microenvironment was determined by finding the cell label for all other cells that were within 50-pixel radius from the centroid of each invasive cell. Cell phenotypes were based on the Phenograph clustering conducted in the previous step. A cell was considered to be epithelial if it belonged to clusters 1–7, 11, 20, 23, 24, 31–35 or 37; basal if it belonged to cluster 19, stromal if it belonged to clusters 8–10, 12, 14, 15, 17, 22, 26, 27 or 29; or ‘other’ in all remaining cases.
3D data visualization
Agave v.1.0.0.1 (https://www.allencell.org/software-and-code.html) was used for 3D voxel or object rendering. Before voxel rendering 16-bit TIFF stacks were created for channels of interest. Each image was first histogram equalized, then background pixels were removed and a Gaussian blur filter with radius 1 was applied. Fiji-ImageJ2-linux64 v.1.0 was used to create image stacks for Agave rendering. Supplementary Videos 2–14 were created using Agave v.1.0.0.1 by taking series of images after rotating the model a fixed degree and then assembling the images into an animation.
Statistics and reproducibility
No statistical method was used to predetermine sample size; however, reproducibility of the method has been demonstrated on four different samples. No data were excluded from the analyses. The experiments were not randomized. The Investigators were not blinded to allocation during experiments and outcome assessment15,20,34–42.
Reporting Summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Extended Data
Supplementary Material
Acknowledgements
We thank S. Engler and A. Jacobs (University of Zurich), A. B. Sobottka-Brillout and S. Dettwiler (Institute of Pathology and Molecular Pathology, University Hospital Zurich) and F. Prutek (Tissue Biobank, University Hospital Zurich) for technical support. We thank R. A. Wepf (University of Queensland), H. Gnaegi (Diatome AG) and U. Ziegler, A. Käch and G. Barmettler (University of Zurich) for their invaluable guidance with the paraffin ultramicrotomy procedure. We also thank J. Windhager and V. Zanotelli for their advice on computational analysis. L.K. is supported by a Cancer Research UK Grand Challenge grant. The work of B.B. was supported by the Swiss National Science Foundation R’Equip grant 316030-139220, an SNSF Assistant Professorship grant PP00P3-144874, the European Research Council (ERC) grant under the European Union’s Seventh Framework Programme (FP/2007-2013)/ERC grant no. 336921 and under the European Union’s Horizon 2020 Program ERC Consolidator grant no. 866074, the Cancer Research UK IMAXT Grand Challenge grant and a National Institutes of Health HubMAP grant.
Footnotes
Author contributions
R.C. and B.B. conceived the study. R.C. designed experimental 3D IMC workflow and L.K. and R.C. designed 3D IMC computational workflow, performed all experiments and analyzed the data. A.P. and A.O. assisted with experimentation. P.S. and H.M. contributed clinical samples and scientific guidance. L.K., R.C., N.d.S. and B.B. wrote the manuscript. All authors read and edited the manuscript.
Competing interests
The authors declare no competing interests.
Peer review information Nature Cancer thanks Fritz Koning and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Reprints and permissions information is available at www.nature.com/reprints.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Cancer Grand Challenges IMAXT Consortium:
H. R. Ali, M. Al Sa’d, S. Alon, S. Aparicio, G. Battistoni, S. Balasubramanian, R. Becker, B. Bodenmiller, E. S. Boyden, D. Bressan, A. Bruna, Marcel Burger, C. Caldas, M. Callari, I. G. Cannell, H. Casbolt, N. Chornay, Y. Cui, A. Dariush, K. Dinh, A. Emenari, Y. Eyal-Lubling, J. Fan, A. Fatemi, E. Fisher, E. A. González-Solares, C. González-Fernández, D. Goodwin, W. Greenwood, F. Grimaldi, G. J. Hannon, S. Harris, C. Jauset, J. A. Joyce, E. D. Karagiannis, T. Kovačević, L. Kuett, R. Kunes, A. Küpcü Yoldaş, D. Lai, E. Laks, H. Lee, M. Lee, G. Lerda, Y. Li, A. McPherson, N. Millar, C. M. Mulvey, I. Nugent, C. H. O’Flanagan, M. Paez-Ribes, I. Pearsall, F. Qosaj, A. J. Roth, O. M. Rueda, T. Ruiz, K. Sawicka, L. A. Sepúlveda, S. P. Shah, A. Shea, A. Sinha, A. Smith, S. Tavaré, S. Tietscher, I. Vázquez-García, S. L. Vogl, N. A. Walton, A. T. Wassie, S. S. Watson, J. Weselak, S. A. Wild, E. Williams, J. Windhager, C. Xia, P. Zheng, and X. Zhuang
Data availability
All the data required for the 3D models from this study including IMC high-dimensional tiff images, single-cell masks and single-cell data are available at https://doi.org/10.5281/zenodo.4752030. Source data have been provided as Source Data files. All other data supporting the findings of this study are available from the corresponding author on reasonable request.
Code availability
All code that was used for preprocessing images and analyzing single-cell data is available at https://github.com/BodenmillerGroup/3D_IMC_publication.
References
- 1.Coutu D, Kokkaliaris K, Kunz L, Schroeder T. Multicolor quantitative confocal imaging cytometry. Nat Methods. 2018;15:39–46. doi: 10.1038/nmeth.4503. [DOI] [PubMed] [Google Scholar]
- 2.Li W, Germain RN, Gerner MY. Multiplex, quantitative cellular analysis in large tissue volumes with clearing-enhanced 3D microscopy (Ce3D) Proc Natl Acad Sci USA. 2017 doi: 10.1073/pnas.1708981114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Mano T, et al. Whole-brain analysis of cells and circuits by tissue clearing and light-sheet microscopy. J Neurosci. 2018 doi: 10.1523/JNEUROSCI.1677-18.2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Gerdes MJ, et al. Highly multiplexed single-cell analysis of formalin-fixed, paraffin-embedded cancer tissue. Proc Natl Acad Sci USA. 2013;110:11982–11987. doi: 10.1073/pnas.1300136110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gerdes MJ, et al. Single-cell heterogeneity in ductal carcinoma in situ of breast. Mod Pathol. 2018;31:406–417. doi: 10.1038/modpathol.2017.143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Goltsev Y, et al. Deep profiling of mouse splenic architecture with CODEX multiplexed imaging. Cell. 2018;174:968–981. doi: 10.1016/j.cell.2018.07.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Schubert W, et al. Analyzing proteome topology and function by automated multidimensional fluorescence microscopy. Nat Biotechnol. 2006;24:1270–1278. doi: 10.1038/nbt1250. [DOI] [PubMed] [Google Scholar]
- 8.Murray E, et al. Simple, scalable proteomic imaging for high-dimensional profiling of intact systems. Cell. 2015;163:1500–1514. doi: 10.1016/j.cell.2015.11.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wang X, et al. Three-dimensional intact-tissue sequencing of single-cell transcriptional states. Science. 2018;361:6400. doi: 10.1126/science.aat5691. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Angelo M, et al. Multiplexed ion beam imaging of human breast tumors. Nat Med. 2014;20:436–442. doi: 10.1038/nm.3488. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Thiele H, et al. 2D and 3D MALDI-imaging: conceptual strategies for visualization and data mining. 2D and 3D MALDI-imaging: conceptual strategies for visualization and data mining. Biochim Biophys Acta Proteins Proteom. 2014;1844:117–137. doi: 10.1016/j.bbapap.2013.01.040. [DOI] [PubMed] [Google Scholar]
- 12.Coskun AF, et al. Nanoscopic subcellular imaging enabled by ion beam tomography. Nat Commun. 2021;12:1. doi: 10.1038/s41467-020-20753-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Giesen C, et al. Highly multiplexed imaging of tumor tissues with subcellular resolution by mass cytometry. Nat Methods. 2014;11:417–422. doi: 10.1038/nmeth.2869. [DOI] [PubMed] [Google Scholar]
- 14.Schulz D, et al. Simultaneous multiplexed imaging of mRNA and proteins with subcellular resolution in breast cancer tissue samples by mass cytometry. Cell Syst. 2018;6:25–36. doi: 10.1016/j.cels.2017.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Catena R, Montuenga LM, Bodenmiller B. Ruthenium counterstaining for imaging mass cytometry. J Pathol. 2018;244:479–484. doi: 10.1002/path.5049. [DOI] [PubMed] [Google Scholar]
- 16.Chang Q, Ornatsky O, Hedley D. Staining of frozen and formalin-fixed, paraffin-embedded tissues with metal-labeled antibodies for imaging mass cytometry analysis. Curr Protoc Cytom. 2017;82:12–47. doi: 10.1002/cpcy.29. [DOI] [PubMed] [Google Scholar]
- 17.Bouzekri A, Esch A, Ornatsky O. Multidimensional profiling of drug-treated cells by imaging mass cytometry. FEBS Open Bio. 2019;9:1652–1669. doi: 10.1002/2211-5463.12692. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Schapiro D, et al. histoCAT: analysis of cell phenotypes and interactions in multiplex image cytometry data. Nat Methods. 2017;14:873. doi: 10.1038/nmeth.4391. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Sy S, Ang LC. Microtomy: cutting formalin-fixed, paraffin-embedded sections. Biobanking. 2019 doi: 10.1007/978-1-4939-8935-5_23. [DOI] [PubMed] [Google Scholar]
- 20.Levine JH, et al. Data-driven phenotypic dissection of AML reveals progenitor-like cells that correlate with prognosis. Cell. 2015;162:184–197. doi: 10.1016/j.cell.2015.05.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Jackson HW, et al. The single-cell pathology landscape of breast cancer. Nature. 2020;578:615–620. doi: 10.1038/s41586-019-1876-x. [DOI] [PubMed] [Google Scholar]
- 22.Raza Ali H, et al. Imaging mass cytometry and multiplatform genomics define the phenogenomic landscape of breast cancer. Nat Cancer. 2020;1:163–175. doi: 10.1038/s43018-020-0026-6. [DOI] [PubMed] [Google Scholar]
- 23.Song YJ, et al. The role of lymphovascular invasion as a prognostic factor in patients with lymph node-positive operable invasive breast cancer. J Breast Cancer. 2011;14:198. doi: 10.4048/jbc.2011.14.3.198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Klahan S, et al. Identification of genes and pathways related to lymphovascular invasion in breast cancer patients: a bioinformatics analysis of gene expression profiles. Tumor Biol. 2017 doi: 10.1177/1010428317705573. [DOI] [PubMed] [Google Scholar]
- 25.Kurozumi S, et al. A key genomic subtype associated with lymphovascular invasion in invasive breast cancer. Br J Cancer. 2019;120:1129–1136. doi: 10.1038/s41416-019-0486-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Schürch CM, et al. Coordinated cellular neighborhoods orchestrate antitumoral immunity at the colorectal cancer invasive front. Cell. 2020;182:1341–1359. doi: 10.1016/j.cell.2020.07.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Keren L, et al. A structured tumor-immune microenvironment in triple negative breast cancer revealed by multiplexed ion beam imaging. Cell. 2018;174:1373–1387. doi: 10.1016/j.cell.2018.08.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Damond N, et al. A map of human type 1 diabetes progression by imaging mass cytometry. Cell Metab. 2019;29:755–768. doi: 10.1016/j.cmet.2018.11.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Hong SM, et al. Three-dimensional visualization of cleared human pancreas cancer reveals that sustained epithelial-to-mesenchymal transition is not required for venous invasion. Mod Pathol. 2020;33:639–647. doi: 10.1038/s41379-019-0409-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Bronsert P, et al. Cancer cell invasion and EMT marker expression: a three-dimensional study of the human cancer-host interface. J Pathol. 2014;234:410–422. doi: 10.1002/path.4416. [DOI] [PubMed] [Google Scholar]
- 31.Weigert M, Schmidt U, Haase R, Sugawara K, Myers G. Star-convex polyhedra for 3D object detection and segmentation in microscopy; Proc IEEE/CVF Winter Conference on Applications of Computer Vision, Snowmass Village, Colorado; 2020. pp. 3666–3673. [Google Scholar]
- 32.Santella A, et al. WormGUIDES: an interactive single cell developmental atlas and tool for collaborative multidimensional data exploration. BMC Bioinf. 2015;16:1–9. doi: 10.1186/s12859-015-0627-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Wu Y, et al. Spatially isotropic four-dimensional imaging with dual-view plane illumination microscopy. Nat Biotechnol. 2013;31:1032–1038. doi: 10.1038/nbt.2713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Catena R, Özcan A, Jacobs A, Chevrier S, Bodenmiller B. AirLab: a cloud-based platform to manage and share antibody-based single-cell research. Genome Biol. 2016;17:1–6. doi: 10.1186/s13059-016-1006-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Schindelin J, et al. Fiji: an open-source platform for biological-image analysis. Nat Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Vincent L, Soille P. Watersheds in digital spaces: an efficient algorithm based on immersion simulations. IEEE Trans Pattern Anal Mach Intell. 1991;13:583–598. [Google Scholar]
- 37.Harris CR, et al. Array programming with NumPy. Nature. 2020;585:357–362. doi: 10.1038/s41586-020-2649-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Zuiderveld K. Graphics Gems IV. Academic Press Professional; 1994. pp. 474–485. [Google Scholar]
- 39.van der Walt S, et al. scikit-image: image processing in Python. PeerJ. 2014;2:e453. doi: 10.7717/peerj.453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Napari Contributors. napari: a multi-dimensional image viewer for Python. Zenodo. 2019 doi: 10.5281/zenodo.3555620. [DOI] [Google Scholar]
- 41.Chevrier S, et al. Compensation of signal spillover in suspension and imaging mass cytometry. Cell Syst. 2018;23:612–620. doi: 10.1016/j.cels.2018.02.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Wolf F, Angerer P, Teis F. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 2018;19:15. doi: 10.1186/s13059-017-1382-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All the data required for the 3D models from this study including IMC high-dimensional tiff images, single-cell masks and single-cell data are available at https://doi.org/10.5281/zenodo.4752030. Source data have been provided as Source Data files. All other data supporting the findings of this study are available from the corresponding author on reasonable request.
All code that was used for preprocessing images and analyzing single-cell data is available at https://github.com/BodenmillerGroup/3D_IMC_publication.