Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2021 Apr 19;17(4):e1008887. doi: 10.1371/journal.pcbi.1008887

MAUI (MBI Analysis User Interface)—An image processing pipeline for Multiplexed Mass Based Imaging

Alex Baranski 1, Idan Milo 2, Shirley Greenbaum 1, John-Paul Oliveria 1,3, Dunja Mrdjen 1, Michael Angelo 1,*, Leeat Keren 1,2,*
Editor: Dina Schneidman-Duhovny4
PMCID: PMC8084329  PMID: 33872301

Abstract

Mass Based Imaging (MBI) technologies such as Multiplexed Ion Beam Imaging by time of flight (MIBI-TOF) and Imaging Mass Cytometry (IMC) allow for the simultaneous measurement of the expression levels of 40 or more proteins in biological tissue, providing insight into cellular phenotypes and organization in situ. Imaging artifacts, resulting from the sample, assay or instrumentation complicate downstream analyses and require correction by domain experts. Here, we present MBI Analysis User Interface (MAUI), a series of graphical user interfaces that facilitate this data pre-processing, including the removal of channel crosstalk, noise and antibody aggregates. Our software streamlines these steps and accelerates processing by enabling real-time and interactive parameter tuning across multiple images.

Author summary

High-dimensional Imaging technologies allow to simultaneously measure the expression levels of dozens of proteins in biological tissue, providing insight into single-cell phenotypes and organization in situ. Imaging artifacts, resulting from the sample, assay or instrumentation complicate downstream analyses and require correction by domain experts. Here, we present MAUI, a series of graphical user interfaces that facilitate this data pre-processing, including the removal of channel crosstalk, noise and antibody aggregates. MAUI accelerates and automates these steps, such that reproducible, high-quality data will be the input for subsequent stages of analysis.


This is a PLOS Computational Biology Software paper.

Design and implementation

A major goal of biological research and medical diagnosis is to understand how cellular phenotypes at a single cell level relate to the multi-cellular structures in which they reside. This requires the ability to quantify the spatial distribution of multiple proteins or mRNA transcripts across large regions of intact tissue. Recently, several new approaches have been introduced to allow this quantification. These can be broadly grouped to those that target mRNA, by either in situ hybridization [1,2] or sequencing [35] and those that target proteins, primarily using chromogen-, fluorescent- or mass-tagged antibodies [611]. These technologies vary in the number of measured parameters, size of images, speed of acquisition, resolution and compatibility with clinical specimens [1214]. However, the data generated by these methods is ultimately similar, consisting of a set of registered images, one for each measured parameter. For all of these, imaging artifacts, resulting from the sample, assay or instrumentation complicate downstream analyses. While some properties of these artifacts are unique to a specific technology, others are common across several modalities. Here we present MAUI (MBI Analysis User Interface), a graphical user interface that wraps a series of image processing methods to eliminate imaging artifacts in Mass Based Imaging (MBI). Although MAUI was designed for MBI methods, such as Multiplexed Ion Beam Imaging by time of flight (MIBI-TOF) and Imaging Mass Cytometry (IMC), we also demonstrate its utility for fluorescent-based imaging.

We have previously described Multiplexed Ion Beam Imaging by time of flight (MIBI-TOF), a type of MBI method that uses secondary ionization mass spectrometry to image tissues stained with dozens of antibodies conjugated to heavy metal tags [1517]. With MBI, tissue sections are stained with a cocktail of antibodies specific for the proteins of interest, each tagged with an isotopically-enriched elemental reporter. In MIBI-TOF, a primary ion beam scans across the sample to liberate the elemental reporters as secondary ions that are subsequently detected via orthogonal time of flight mass spectrometry. Ultimately, the result is a set of images, one for each antibody in the cocktail, where the intensity of signal in each pixel represents the number of ions (proportional to the amount of antibody and therefore target protein) detected in that physical location of the tissue (Fig 1).

Fig 1. Workflow of Multiplexed Ion Beam Imaging User Analysis Interface (MAUI).

Fig 1

Tissue specimens are stained using a mixture of antibodies labeled with elemental mass tags. The samples are rasterized by a primary ion beam that releases the metals of the bound antibodies as secondary ions, which are recorded by a Time of Flight Mass Spec. This results in an n-dimensional image depicting protein expression in the field. Image processing includes removal of cross-channel contamination, noise and aggregates.

While different factors can contribute to imaging artifacts, they can be broadly grouped into three classes. The first is the presence of channel crosstalk, whereby signal from one channel is detected at variable amounts in another channel. This can result from reporter impurities (e.g. Erbium 152 containing one percent of Erbium 151), peak tailing of high intensity reporters, polyatomic reporter adducts, or the presence of exposed gold slide [17]. The second is the presence of non-specific signal superimposed on top of the biological structures targeted by the antibody. This can typically be attributed to off-target antibody binding, high abundance organic adducts or instrumental noise. Depending on the intensity and spatial distribution of these sources, it includes signals that appear random, as well as dim patterns that correlate with histological features. The third is the occasional presence of aggregates, which show up as small bright specks of signal and are presumed to result from antibody aggregation. Notably, many of these issues are not specific to MIBI-TOF, and are encountered when using other highly multiplexed imaging modalities, including imaging mass cytometry (IMC) [6] or fluorescent approaches such as co-detection by indexing (CODEX) [9].

We have developed a series of algorithms that enables the removal of these imaging artifacts to facilitate downstream analysis [16]. The pre-processing proceeds in three stages, one for each of the types of artifacts described above: the removal of (1) channel crosstalk, (2) nonspecific staining, and (3) aggregates. Since the conditions under which these artifacts occur can vary between experiments, many parameters of the algorithms used are currently tuned by hand and evaluated by eye, leveraging the expert knowledge of pathologists and biologists. To facilitate this we present here MAUI, a graphical user interface that allows for real-time feedback between parameter adjustment and pre-processing output, drastically improving the data quality and efficiently reducing the time for further downstream analyses (Fig 2).

Fig 2. Processing MIBI data using MAUI results in signal similar to Immunohistochemistry.

Fig 2

Shown are serial sections from human colon stained for CD8 by IHC (top left) or MIBI (top right). White arrows indicate cytotoxic T cells in the lamina propria of the gut. Red arrows indicate imaging artifacts including background and cross-channel contamination (bottom left). After processing by MAUI (bottom right) the CD8 signal mirrors the expected histological staining pattern by IHC. Scalebar equals 40μm.

Results

We will demonstrate and discuss the functionalities of MAUI for MIBI data. Examples for similar analysis of IMC and CODEX are shown in S1 and S2 Figs respectively.

Data structure

The input to the software is either a multipage TIFF or a folder of TIFF files. Each page in the multipage TIFF or each image in the folder, respectively, depicts the signal for a pre-specified mass range, which corresponds to a single conjugated antibody. Each processing step with MAUI will add an additional folder of TIFFs to the dataset, using as input the results of the previous processing step.

Channel crosstalk

Channel crosstalk refers to scenarios where the signal of a source channel contaminates a target channel or is correlated with such contamination. Channel crosstalk in MBI can occur for a variety of reasons, some of which we will detail below. The first predominant cause for channel crosstalk is ionic contamination. In addition to the predominating monoatomic ion (X+, where X = m/z), metal reporters used for antibody labeling can also produce relatively small amounts of metal hydrides (XH+), oxides (XO+), and hydroxides (XOH+) [17] with masses of X+1, X+16, and X+17 respectively (Fig 3A). These polyatomic metal adducts can potentially contaminate heavier mass reporters. For example, 149Sm typically produces a metal oxide at 165 amu (149+16) with 4% the intensity of the monoatomic peak that can contaminate signal arising from an antibody tagged with 165Ho [17]. Alternatively, in early versions of the instrument we observed that bare regions of the slide often resulted in high-intensity signal across multiple mass channels. Since this signal is derived from the slide, it has a near-perfect overlap with components of the slide coating, e.g. gold (197Au), tantalum (181Ta) and silicon (28Si) channels (Fig 3B). The current generation of instruments does not suffer from this form of signal contamination (S3A Fig).

Fig 3. Removal of cross-channel contamination.

Fig 3

(A-C) Shown are examples for different sources of cross-channel contamination. In all cases a target channel (top row) is contaminated by another channel (middle row) or is correlated with such contamination. MAUI allows to remove this contamination from the target image (bottom row). (D) The first step for removal of cross-channel contamination is to cap, blur and threshold the contaminating channel to generate a binary mask of the contamination. The second is to remove signal where the mask is positive. (E-H) Examples of the resulting image when varying the algorithm parameters, including the Contaminant Cap (E), Gaussian Radius (F), Threshold (G), and the Remove Value (H). (I) The GUI enables to rapidly evaluate these parameters across many points and channels.

Channel crosstalk can also occur from isotopic impurities in the metal stocks used for conjugation. Many elements naturally exist as a mixture of isotopes, and while these have been enriched to high purity for conjugation, they still have varying amounts of contaminating isotopes of the same element. For example, the element samarium has 7 naturally occurring isotopes with an abundance ranging from 3.08% (144Sm) to 26.74% (152Sm). 152Sm can be purified to >98% by electromagnetic enrichment, but will still contain small amounts of the other samarium isotopes. Finally, channel crosstalk can result from systematic, off-target staining, resulting from the antibodies or metals used. This often presents as dim staining of biological structures that the antibody is not expected to bind to. For example, indium and aluminum preferentially localize to nuclei, and therefore these channels may present nonspecific nuclear staining (Figs 3C and S3B). In all these instances the contaminating signal in one channel mirrors real signal in another channel. Thus, one can use the latter to clean the signal in a target channel.

We designed MAUI to facilitate cleaning of a target channel using the signal of a contaminating source channel. The pipeline is divided into three parts, easily visualized in the user interface (UI). First, the contaminating channel is processed in several steps to generate a binary mask, whereby pixels that are positive for the signal will have a value of one, and pixels that are negative for the signal will have a value of zero. In this step, the user can tune several parameters to influence the resulting mask. Once users are satisfied with the mask, they can evaluate its performance in cleaning different target channels, a step that also involves user-defined parameters. These two steps can be iterated until the user is satisfied with the result. Finally, the user can apply the selected channels and parameters to remove the contamination in multiple images.

A detailed mathematical description of the contamination-removal algorithm is presented in the Methods section. Briefly, the first step of the algorithm is to generate a binary mask of the contaminating channel. Due to the sparse nature of mass spectrometry data, it is necessary to preprocess the contaminating channel before using it to clean the target channel. This includes capping the signal, smoothing it and binarizing it to generate a mask (Fig 3D). Each of these steps requires the user to input parameters that can be easily tuned and evaluated for their effect on the resulting mask. In the capping step, the user inputs a cap parameter, which sets the maximum value of the contaminating channel. Counts greater than the cap value will be set to the cap. This parameter allows for the homogenization of the signal intensity of the contaminating channel, by controlling overly bright signal (Fig 3E). The cap should be proportional to the expected intensity of the contaminating channel. Reducing its value will increase the number of pixels that are classified as positive for contaminant signal in the final mask.

Next, the capped contaminant signal is blurred using a radial Gaussian filter. The user inputs a Gaussian Radius parameter to set the standard deviation of the filter. This step allows for the homogenization and smoothing of the signal in neighboring pixels (Fig 3F). Increasing the radius increases the blurring and will most likely result in more positive pixels in the mask. We have found that a radius of one pixel was suitable in most cases.

Then, the capped and blurred image is normalized between zero and one. The Threshold parameter, which also takes values between zero and one, is then applied to generate a binary mask of the contaminating signal. Any pixel in the normalized contaminant channel above the threshold is set to 1, and any value below the threshold is set to 0. A higher threshold will reduce the number of positive pixels in the mask (Fig 3G). MAUI allows the user to view the original contaminant signal, while easily changing the algorithm parameters and evaluating the resulting mask. It also plots a histogram depicting the frequency of intensity values in the normalized contaminant image to facilitate the user in picking a threshold that adequately captures the signal in the contaminating channel.

Once a binary mask of the contaminating channel has been generated, the user can now evaluate how this mask performs in cleaning any other channel using the Evaluation Parameters UI box. Cleaning is performed by subtracting a fixed value from all pixels in the target channel that were positive in the contaminant channel mask (Fig 3H). The value to remove is indicated by the user-defined parameter Remove Value. After removal, any value in the target channel that is below zero is set to zero. The user is given the option to remove a specific value, rather than, for example, zeroing out these pixels, because sometimes these pixels also have real signal superimposed with the contaminant. It is possible to zero out these pixels by choosing an arbitrarily large Remove Val. MAUI gives the user the option to evaluate all the parameters in cleaning different target channels and in different images.

Finally, once the users are satisfied with the chosen channels and parameters, they can use them to remove contamination in multiple images (Fig 3I). MAUI will create a new folder and populate it with the contamination-removed versions of the loaded data. These can be used as input for further preprocessing, including additional rounds of contamination removal. The software will also save a log file documenting the parameters used, to allow reproducibility of the analysis.

Noise removal

Image noise is traditionally defined as unwanted variation in image information, which deteriorates image quality. In antibody-based visualization of tissues it can result from various factors, ranging from instrumentation to tissue quality to nonspecific binding of antibodies, which generate weak signal that may follow histological patterns. For our purposes, we define noise as any signal outside of the biological structures the stain is expected to mark, and that does not specifically overlap with any of the other channels in the panel. Therefore, it is not possible to remove it by the contamination-removal technique described above.

Here, we apply a K-Nearest-Neighbor (KNN) approach to noise filtering [18]. The approach takes advantage of the relatively low density of noise counts as compared to signal counts (high signal to noise ratio). It is based on the assumption that the presence of the intended biological target at a location will increase the odds of the antibody binding at that location, and that higher ion counts in and near a pixel indicate a higher probability of the associated biological signal being present at that location. We find this approach particularly suitable for MIBI data due to the sparse nature of the data.

The denoising algorithm consists of two stages: first, the count density near each pixel is estimated using an intensity-normalized version of KNN, and second, the signal from low-density pixels is filtered. KNN estimates density by calculating the distances from a specific pixel x(i,j) in the image to the closest k non-zero pixels. The shorter these distances are, the higher the density in that location. The intensity-normalized version performs similarly, but takes into account the count value in each positive pixel. Thus, instead of measuring the distances to the k closest positive pixels, it will calculate the distance to the k closest individual ion counts, whereby several counts can be found in the same pixel.

For example, in Fig 4A we demonstrate the calculation of the 5-nearest-neighbors of the central pixel of the image, with a value if x(i,j) = 2. We think about this value as constituting two separate events. Thus, the first neighbor is inside of the same pixel, with a distance of zero. The next nearest neighbor is one pixel to the right, with a distance of one. The next three nearest-neighbors are two pixels away above, each of these with a distance of two (Fig 4B).

Fig 4. Removal of noise using a K-Nearest-Neighbor (KNN) algorithm.

Fig 4

(A-B) Signal density at each pixel is determined using its K nearest neighbor counts. Shown is an example of the 5 nearest counts (A) and their distances (B) to the central pixel. (C) An example of staining of Histone H3 with noise surrounding the nuclear signal. (D) Shown is the frequency of the average distance to the 25 nearest counts for all pixels in (C). Signal is characterized by short distances (high density), whereas noise is characterized by long distances (low density). The red line is the chosen threshold for denoising, and pixels with large distances to their neighbors will be set to zero. (E) Histone H3 signal after denoising. (F) The average KNN-distribution for Histone H3, same as in (D), after denoising. (G-H) Examples of the resulting image of Histone H3 when varying the algorithm parameters, including the K-value (G) and Threshold (H). (I-J) Examples of the resulting image of a membranous protein, CD8, when varying the algorithm parameters, including the K-value (I) and Threshold (J). (K) The GUI enables to rapidly evaluate these parameters across many points and channels.

We define the ADK as the average distance to the k nearest neighbors. The average distance to the 5-nearest-neighbors in the example above is (0+1+2+2+2)/5 = 1.4. For a given value of k, we then calculate the ADK for every pixel x(i,j) in the image. Fig 4C depicts an image of Histone H3 (HH3), while Fig 4D depicts the corresponding distribution of ADK values for all positive pixels in the image, using a k value of 25. This is a bimodal distribution in which low ADK values correspond to high density of counts and are therefore attributable to signal, whereas high ADK values correspond to low density of counts, and therefore are considered as noise. The actual denoising involves picking a threshold used to separate “noise” from “signal”. Pixels with higher count densities (and therefore lower ADK values) are considered to be true signal and are left unchanged, while pixels with lower count densities (and therefore higher ADK values) are considered to be noise and their count value is set to zero. For example, using a threshold of 1.5 on the HH3 image shown in Fig 4C, any pixel with an ADK above 1.5 is set to zero, resulting in the image and corresponding distribution of ADK values shown in Fig 4E and 4F, correspondingly. A full mathematical derivation of the process is detailed in the Methods section.

MAUI is designed to facilitate, streamline and expedite the process of denoising using the intensity-normalized KNN approach for multiple images at a time (Fig 4G–4K). The k-value and threshold are user-defined parameters, which can be easily adjusted for each marker at the discretion of the domain-expert. MAUI generates real-time feedback as the k-value and threshold are adjusted, making it fast and easy to intuit which value ranges are most effective for denoising a particular dataset. When a given marker is selected on a given image, the ADK distribution for the k-value is plotted, along with an indication of the current threshold value. Easy scrolling between loaded images allows rapid evaluation of these properties on different images. The raw data is plotted, along with the denoised data and the difference between the two. Because the dynamic range of MIBI images can be quite skewed (significant biological information can be contained within ion-count differences of 1 or 2 counts, while outliers of 10 or 20 counts are possible), a display cap parameter is included for easy visual display of the data. We find that both the ADK histogram and the images are useful tools in guiding users’ choice of parameters.

Notably, current experience with this approach has demonstrated that the algorithm is quite robust to the specific value of k and for the most part does not need to be changed given the same imaging conditions. On the other hand, the threshold is dependent on the distributions of signal and noise for each target and is usually defined for each channel separately. Once appropriate k and threshold values have been selected for each channel, the entire dataset can be easily denoised using these parameters. MAUI will create a new folder and populate it with the noise-removed versions of the loaded data. These can be used as input for further processing. The software will also save a log file documenting the parameters used, to increase reproducibility.

Aggregate removal

Aggregates are concentrated areas of high counts that have been reported in both MIBI and IMC data [16,19]. These spots are uncorrelated with any biological structures and are thought to result from conglomerations of antibodies. Aggregates can interfere with downstream analysis, particularly of weak channels, where they may be falsely interpreted as positive staining (Fig 5A). A more detailed discussion of aggregate identification is presented in the Methods section.

Fig 5. Removal of aggregates.

Fig 5

(A) An overview of the aggregate-removal process. Left: An image of denoised Tryptase signal. Aggregates are evident in the top right corner. The image is blurred with a Gaussian filter and then binarized. Objects are extracted from the binary image, and any object small enough is considered an aggregate and removed. (B-C) Examples of the resulting image of Tryptase when varying the algorithm parameters, including the Threshold (B,C) and Blur radius (D-E). (F) The GUI enables to rapidly evaluate these parameters across many points and channels.

The procedure for removing aggregates takes advantage of their small size, much smaller than real signal of biological structures, and involves finding small continuous areas of positive counts. First, the image is blurred using a Gaussian filter, and then the blurred image is thresholded using a cutoff of zero to create a binary mask. The blurring has the effect of merging patches of signal that are near each other, usually indicative of them being part of a larger structure, such as a cell or vessel. Then, once a mask has been generated, all connected objects in the binary mask and their sizes are identified. Any object below a size threshold selected by the user is considered to be an aggregate, and the values of the counts in its constituent pixels are set to zero (Fig 5A).

MAUI allows the user to pick for each channel in the data the radius of the Gaussian blur as well as the size threshold for objects to be considered aggregates, listing the chosen parameters for each channel in a list (Fig 5B and 5C). MAUI will graphically display in real time the raw data and the aggregate-removed data, resulting from the currently chosen parameters. Empirically, we find that mild blurring (e.g. radius = 1) and a single threshold are often adequate across channels and datasets. Once an appropriate set of parameters has been chosen, MAUI allows to easily apply these parameters to all images in the dataset. Just as with the contamination removal and noise removal steps, MAUI will create a new folder and populate it with the aggregate-removed versions of the loaded data, as well as a log file recording the parameters.

Availability and future directions

High-dimensional imaging technologies are poised to revolutionize tissue histology. These new modalities need specialized computational pipelines to facilitate pre-processing such that clean, high-quality data will be the input for subsequent stages of analysis. Here, we surveyed some of the common issues associated with data collected with multiplexed immunohistochemistry imaging platforms. We have described the simple computational solutions developed in our lab to address these issues and make MBI data suitable for subsequent analysis. MAUI, the graphical user interface that wraps these solutions, has substantially improved the speed and ease in which biologists and clinicians in our lab interact with the data and perform these labor and computationally-intensive steps in a reproducible and efficient manner. MAUI can be downloaded from https://github.com/angelolab/MAUI. We expect future developments in assays, instrumentation and computation to further enhance the quality of multiplexed-imaging data, making it more accessible to the large scientific community.

Methods

Contaminant removal

For a given channel of MBI data f(x,y) (1) the user finds an associated contaminating channel h(x,y) (2). The user then picks a cap value c which is used to create a capped version of the contaminating channel hcapped(x,y) (3). The user then picks a standard deviation σ for a Gaussian blur g(x,y) (4), which is convolved with hcapped to create a blurred version hblurred (5). This image is then rescaled to have a maximum of 1, producing an image hrescaled (6). The user then picks a threshold value t which is used to binarize hrescaled, producing a binary mask that can be used for contaminant removal, hmask (7). Finally, the user selects a removal value v. At any pixel location (x,y) where hmask(x,y) is 1, we remove v from f(x,y), producing fremoved (8). Anywhere fremoved<0 is set to zero, since negative values of counts are nonsensical. This produced the final contaminant-removed MBI data, fclean (9).

f(x,y)=TargetMBIdata 1
h(x,y)=ContaminatingMBIdata 2
hcapped(x,y)={h(x,y),h(x,y)cc,h(x,y)>c 3
g(x,y)=12πσ2ex2+y22σ2 4
hblurred=g*hcapped 5
hrescaled=hblurredmax(x,y)[hblurred(x,y)] 6
hmask(x,y)={1,hblurred(x,y)t0,hblurred(x,y)<t 7
fremoved(x,y)={f(x,y),hmask(x,y)=0f(x,y)v,hmask(x,y)=1 8
fclean(x,y)={fremoved(x,y),fremoved(x,y)00,fremoved(x,y)<0 9

Noise removal

We let f(x,y) be the function that maps pixels in an image to ion counts on a particular channel of MBI data (10). We define a function C(x,y) which takes in a single pixel (x,y) and outputs a list with f(x,y)copies of f(x,y) (11). We let this function be defined on lists of pixels (a,b,c,…) such that the output is the concatenation of the outputs of the function on each of the individual pixels in the input list (12). We then define another function N(x,y) which outputs a list of all pixels in the image ordered by their Euclidean distance (denoted by ‖∙‖) from (x,y) (13). We then let N(x,y)=C(N(x,y)) (14) and take only the first K entries, defining NK(x,y), which we call the K-nearest neighbors of (x,y) (15). We then calculate the average distance to the K-nearest neighbors of each pixel (x,y) denoted by AK(x,y) (16). To clean the data, the user picks a threshold tn, which is used to separate “noise” counts from “signal” counts. Any pixel with an average distance to its K-nearest neighbors more than the threshold is treated as noise and zeroed-out, otherwise its value is left unchanged (17).

f(x,y)=MBIdata,x[1,w],y[1,h] 10
C((x,y))=((x,y),(x,y),(x,y))s.t.|C((x,y))|=f(x,y) 11
C((a,b,c,))=C(a)C(b)C(c) 12
N(x,y)=((x,y)1,(x,y)2,(x,y)w×h)s.t.(x,y)iN(x,y), 13
(x,y)i+1(x,y)>(x,y)i(x,y)
N(x,y)=C(N(x,y)) 14
NK(x,y)=((x,y)i|i[1,K]) 15
AK(x,y)=1Ki=1K(x,y)i(x,y) 16
fclean(x,y)={f(x,y),A_K(x,y)tn0,A_K(x,y)>tn 17

Aggregate removal

We let f(x,y) be MBI data, as in (10). The user picks a standard deviation σ for a Gaussian filter g (18), which is convolved with our data to produce fblurred (19). We then binarize this image to create fmask (20).

We use a built-in function in MATLAB called regionprops, here denoted as fobject, which takes in a binary mask and outputs all connected sets of pixels with a value of 1 (thus any set of pixels in the input with a value of 1 which are connected to each other will be considered a single object). All such objects are returned as outputs of the function (21). The user defines a threshold ta for the size of aggregate objects, where objects with fewer pixels than ta are counted as aggregates (22). Finally, all pixels in the data the user is cleaning that are part of an aggregate object are zeroed-out (23).

g(x,y)=12πσ2ex2+y22σ2 18
fblurred=g*f 19
fmask(x,y)={0,fblurred(x,y)01,fblurred(x,y)>0 20
fobject(M)={o1,o2,on}s.t.oiisanobjectinM 21
faggregate(M)={ofobject(M)||o|<ta} 22
fclean(x,y)={f(x,y),ofaggregate(M)s.t.(x,y)o0,ofaggregate(M)s.t.(x,y)o 23

Judging whether large clumps of signal is true signal vs aggregation can range from easy to non-trivial depending on the antibody, tissue and aggregation state. Currently, this process depends to a large extent on experts with domain knowledge. We commonly employ the following guidelines to decide if a signal is real or an aggregate: (A) Familiarity with the expected staining pattern of the antibody. Whereas for some antibodies we expect small, punctate staining, which is difficult to differentiate from aggregates (e.g. Granzyme B), most of the antibodies that we use stain larger cellular structures. We use knowledge on the expected staining patterns, both intra-cellularly (e.g. nuclear vs. membrane), cellularly (e.g. expected coexpression of markers) and histologically (e.g. expected staining in some anatomical locations and not others) to decide whether a “clump of signal” is real or not. While there may be ambiguous cases, for the most part it is quite clear. (B) Comparison between different tissues. In our experience, once an antibody vial begins to aggregate, aggregates will be observed in all tissues stained with that vial. As such, if one suspects that signal is actually an aggregate, it is useful to see whether aggregates are present in this channel across additional images. (C) Design of specific experiments to check for aggregates. When one is uncertain whether an antibody is aggregated, it is possible to design specific staining experiments to address this. For example, staining a negative control tissue that is not expected to express the target protein. Alternatively, one can stain serial sections and see whether the signal repeats between sections.

Supporting information

S1 Fig. Analysis of IMC data using MAUI.

The top left panel shows an image of CD68 from an IMC dataset (Jackson et al., Nature 2020). Staining intensity is shown as a heatmap from blue (low) to high (yellow). White arrows denote real CD68 staining, as validated by coexpression of CD45. Colored arrows denote various imaging artifacts including cross-channel contamination (red), noise (green) and aggregates (orange). Each row in the image shows one stage of processing by MAUI, including removal of cross channel contamination (top), denoising (middle) and removal of aggregates (bottom) as detailed. The final image is shown in the bottom right. Scalebar equals 30μm.

(JPEG)

S2 Fig. Analysis of CODEX data using MAUI.

The top left panel shows an image of CD8 from a CODEX dataset (Schurch et al., Cell 2020). Staining intensity is shown as a heatmap from blue (low) to high (yellow). White arrows denote real CD8 staining, as validated by coexpression of CD3 and CD45. Colored arrows denote various imaging artifacts including cross-channel contamination (red), noise (green) and aggregates (orange). Each row in the image shows one stage of processing by MAUI, including removal of cross channel contamination (top), denoising (middle) and removal of aggregates (bottom) as detailed. The final image is shown in the bottom right. Scalebar equals 50μm.

(JPEG)

S3 Fig. Underlying reasons for cross channel contamination.

(A) Shown is an image for the mass range of 128–132, which has no labeled signal in it (left column) for the first (top) and second (bottom) generations of the instrument. In the first-generation instruments this background signal (right column) mirrors the bare slide and therefore the gold channel (middle column). In the second-generation instruments (bottom), this slide-specific background is mitigated and the background only has salt-and-pepper noise. (B) MIBI-TOF staining of human FFPE tonsil with free 115Indium and antibodies for Histone H3 and CD45 shows nuclear localization of 115Indium. Scalebar equals 10μm.

(JPEG)

Data Availability

The data can be found at https://github.com/angelolab/MAUI.

Funding Statement

IM is funded by the European Union’s Horizon 2020 research and innovation programme under the Marie Sk?odowska-Curie grant agreement No 890733. SG is supported by the Bill and Melinda Gates Foundation OPP1113682. JPO was supported by a Canadian Institutes of Health Research (CIHR) Postdoctoral Fellowship and a Banting Postdoctoral Fellowship. DM is supported by the Swiss National Science Foundation Early Postdoc Mobility Fellowship: P2ZHP3_181563 and Novartis Postdoctoral Fellowship: #18B067. LK is the Fred and Andrea Fallek President's Development Chair and is supported by the Azrieli Foundation, Enoch Foundation Research Fund, European Research Council Horizon 2020 starting grant 948811, Damon Runyon-Dale F. Frey Award 131248 and Israel Science Foundation personal grant 2481/20. This research was partially supported by the Israeli Council for Higher Education (CHE) via the Weizmann Data Science Research Center. MA was supported by the Department of Defence grant W81XWH2110143, the National Institute of Health grants 1-DP5-OD019822, 5R01CA22952903, 1U24CA22430901, 5U54CA20997105, 1R01AG056287, 1R01AG057915, 1U24CA224309, and the Bill and Melinda Gates Foundation. MA is a consultant and shareholder in IonPath Inc. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Moffitt J.R., Bambah-Mukku D., Eichhorn S.W., Vaughn E., Shekhar K., Perez J.D., et al. Molecular, spatial, and functional single-cell profiling of the hypothalamic preoptic region. Science 2018; 362, eaau5324. 10.1126/science.aau5324 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Zhu Q., Shah S., Dries R., Cai L., and Yuan G.-C. Identification of spatially associated subpopulations by combining scRNAseq and sequential fluorescence in situ hybridization data. Nat. Biotechnol. 2018; 36, 1183. 10.1038/nbt.4260 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Ke R., Mignardi M., Pacureanu A., Svedlund J., Botling J., Wählby C., et al. In situ sequencing for RNA analysis in preserved tissue and cells. Nat. Methods 2013; 10, 857–860. 10.1038/nmeth.2563 [DOI] [PubMed] [Google Scholar]
  • 4.Ståhl P.L., Salmén F., Vickovic S., Lundmark A., Navarro J.F., Magnusson J., S. et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science 2016; 353, 78–82, 10.1126/science.aaf2403 [DOI] [PubMed] [Google Scholar]
  • 5.Wang X., Allen W.E., Wright M.A., Sylwestrak E.L., Samusik N., Vesuna S., et al. Three-dimensional intact-tissue sequencing of single-cell transcriptional states. Science 2018; 361, eaat5691. 10.1126/science.aat5691 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Chevrier S., Crowell H.L., Zanotelli V.R.T., Engler S., Robinson M.D., and Bodenmiller B. Compensation of Signal Spillover in Suspension and Imaging Mass Cytometry. Cell Syst. 2018; 6, 612–620.e5. 10.1016/j.cels.2018.02.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Gerdes M.J., Sevinsky C.J., Sood A., Adak S., Bello M.O., Bordwell A., et al. Highly multiplexed sin- gle-cell analysis of formalin-fixed, paraffin-embedded cancer tissue. Proc. Natl. Acad. Sci. USA 2013; 110, 11982–11987. 10.1073/pnas.1300136110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Gerner M.Y., Kastenmuller W., Ifrim I., Kabat J., Germain R.N. Histo-cytometry: a method for highly multiplex quantitative tissue imaging analysis applied to dendritic cell subset microanatomy in lymph nodes. Immunity 2012; 37(2):364–76. 10.1016/j.immuni.2012.07.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Goltsev Y., Samusik N., Kennedy-Darling J., Bhate S., Hale M., Vazquez G., et al. Deep Profiling of Mouse Splenic Architecture with CODEX Multiplexed Imaging. Cell 2018; 174, 968–981.e15. 10.1016/j.cell.2018.07.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Gut G., Herrmann M.D., and Pelkmans L. Multiplexed protein maps link subcellular organization to cellular states. Science 2018; 361, eaar7042. 10.1126/science.aar7042 [DOI] [PubMed] [Google Scholar]
  • 11.Saka S.K., Wang Y., Kishi J.Y., Zhu A., Zeng Y., Xie W., et al. Immuno-SABER enables highly multi-plexed and amplified protein imaging in tissues. Nat. Biotechnol. 2019; 37, 1080–1090. 10.1038/s41587-019-0207-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Hartmann F.J., Bendall S.C. Immune monitoring using mass cytometry and related high-dimensional imaging approaches. Nat Rev Rheumatol 2020; 16, 87–99. 10.1038/s41584-019-0338-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Taube J.M., Akturk G., Angelo M. et al. on behalf of the Society for Immunotherapy of Cancer (SITC) Pathology Task Force. The Society for Immunotherapy of Cancer statement on best practices for multiplex immunohistochemistry (IHC) and immunofluorescence (IF) staining and validation. Journal for ImmunoTherapy of Cancer 2020;8:e000155. 10.1136/jitc-2019-000155 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Tan W., Nerurkar S. N., Cai H. Y., Ng H., Wu D., Wee Y., Lim J., Yeong J., & Lim T. Overview of multiplex immunohistochemistry/ immunofluorescence techniques in the era of cancer immunotherapy. Cancer communications (London, England) 2020; 40(4), 135–153. 10.1002/cac2.12023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Angelo M., Bendall S.C., Finck R., Hale M.B., Hitzman C., Borowsky A.D., et al. Multiplexed ion beam imaging of human breast tumors. Nat. Med. 2014; 20, 436–442. 10.1038/nm.3488 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Keren L., Bosse M., Marquez D., Angoshtari R., Jain S., Varma S., et al. A Structured Tumor-Immune Microenvironment in Triple Negative Breast Cancer Revealed by Multiplexed Ion Beam Imaging. Cell 2018; 174. 10.1016/j.cell.2018.08.039 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Keren L., Bosse M., Thompson S., Risom T., Vijayaragavan K., McCaffrey E., et al. MIBI-TOF: A multiplexed imaging platform relates cellular phenotypes and tissue structure. Sci. Adv. 2019; 5, eaax5851. 10.1126/sciadv.aax5851 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Mastin G.A. Adaptive filters for digital image noise smoothing: An evaluation. Comput. Vision, Graph. Image Process. 1985; 31, 103–121. [Google Scholar]
  • 19.Jackson H.W., Fischer J.R., Zanotelli V.R.T., Ali H.R., Mechera R., Soysal S.D., et al. The single-cell pathology landscape of breast cancer. Nature 2020; 1–6. 10.1038/s41586-019-1876-x [DOI] [PubMed] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1008887.r001

Decision Letter 0

Dina Schneidman-Duhovny

21 Dec 2020

Dear Dr. Keren,

Thank you very much for submitting your manuscript "MAUI (MBI Analysis User Interface) - An image processing pipeline for Multiplexed Mass Based Imaging" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations.

Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. 

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Dina Schneidman

Software Editor

PLOS Computational Biology

***********************

A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately:

[LINK]

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: Baranski and co-workers here present an interactive graphical user interface adaptation of the image processing methods originally described in Keren et al, 2018 Cell. Specifically, the software, Mass Based Imaging Analysis User Interface (MAUI), streamlines image processing from Multiplexed Ion Beam Imaging (MIBI) developed by the Angelo lab.

In addition to the MAUI software, the important work presented here also details and shows convincing examples of the results of each step performed in preliminary MIBI image processing. The authors indicate that the work can potentially extrapolate to other forms of mass spectrometry imaging, and is certainly appropriate for the scope of the journal, given the momentum in adopting various multiplexed imaging platforms in the scientific community (including the methods related to SIMS and the IMC) and the lack of alternative software that enables image cleanup without the use of scripting languages.

I have a number of minor suggestions and comments that I hope the authors can address, in addition to showcasing broad utility of MAUI.

1. In Results 2.2, the authors state tantalum as 182Ta. It would appear that 181Ta is the stable isotope of tantalum.

2. Will the authors be able to clarify how channel 89Y would cause contaminating signals in channel 115In? It does not appear to fall into the typical +1, +16 and +17 contaminants.

3. It is unclear whether MAUI would allow multiple rounds of sequential contamination removal (eg removing a -1 and then a -16 on the same channel). If so, would the authors be able to elaborate further in the text?

4. In Results 2.3, the authors state that the KNN approach here “has been shown to have superior performance to other filters”. It would be more convincing to have some representative images for other methods of denoising, including DBSCAN, another commonly used density-based clustering method.

5. It is unclear how aggregation removal would be judged by a user. IE how would one decide whether large “clumps” of signal is true signal vs aggregation? It would also be useful to elaborate how the signal density of aggregates will be taken into consideration for the clean up.

Additionally (and optionally), it would certainly strengthen the manuscript to see examples of how MAUI processing of publicly available IMC images or other mass-spec imaging data (such as ToF-SIMS, nano-SIMS or OrbiSIMS).

Thank you for your work in putting together a very nice piece of software for the community.

Reviewer #2: The authors developed an MBI Analysis User Interface (MAUI) that facilitates the processing of image data, including the removal of channel crosstalk, noise, and antibody aggregates, and provides a software resource for efficient user data processing of data acquired by Multiplexed Ion Beam Imaging by time of flight (MIBI-TOF).

Recent developments and advancements in MIBI-TOF and Imaging Mass Cytometry (IMC) have provided innovative multiplex molecular imaging methods in the life sciences and have attracted the interest of many scientists in this field. There is no doubt that authors' contributions are important, especially for advanced lifescience researchers who are trying to adopt them.

However, after careful perusal of this manuscript, I believe that several issues need to be clarified. My main comments are as follows.

1) There is no doubt that high-dimensional imaging techniques using mass spectrometry are becoming increasingly important for histopathology, with several different instruments and methods available, including MIBI and IMC. However, since there are many non-specialist readers (especially researchers in the life sciences) who are interested in this field, I would like to see more detailed description of the instruments and methods that are currently available for users to choose from. Moreover, the instruments and methods that can be covered by MAUI should be clearly described.

2) The reviewer is interested in whether a comparison with fluorescence microscopy images using fluorescent secondary antibodies would be useful in validating images acquired with MIBI-TOF and processed with MAUI.

In particular, it was felt that comparison with fluorescence images could clearly show whether the inherent problems in mass spectrometry caused by channel crosstalk are solved by this software. On the other hand, the problems of non-specific signals and antibody aggregation are considered to be common problems in fluorescent immunostaining methods, and it should be discussed whether it is reasonable to solve these problems with this software in MIBI-TOF images.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Sizun Jiang

Reviewer #2: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see http://journals.plos.org/ploscompbiol/s/submission-guidelines#loc-materials-and-methods

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1008887.r003

Decision Letter 1

Dina Schneidman-Duhovny

17 Mar 2021

Dear Dr. Keren,

We are pleased to inform you that your manuscript 'MAUI (MBI Analysis User Interface) - An image processing pipeline for Multiplexed Mass Based Imaging' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Dina Schneidman

Software Editor

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: Dear Authors,

Thank you for the very nicely written revision comments and edits. These revisions full address my concerns and I recommend publication without reservations.

I have a minor comment on a typo in Figure S2 (Vimenting -> Vimentin) which may not be addressed in subsequent copy editing as it shows up as an image and not text.

Congratulations on a very robust study!

Reviewer #2: My comments have been addressed and the MS is improved. I have no additional comments.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: None

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Sizun Jiang

Reviewer #2: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1008887.r004

Acceptance letter

Dina Schneidman-Duhovny

13 Apr 2021

PCOMPBIOL-D-20-01128R1

MAUI (MBI Analysis User Interface) - An image processing pipeline for Multiplexed Mass Based Imaging

Dear Dr Keren,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Andrea Szabo

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Analysis of IMC data using MAUI.

    The top left panel shows an image of CD68 from an IMC dataset (Jackson et al., Nature 2020). Staining intensity is shown as a heatmap from blue (low) to high (yellow). White arrows denote real CD68 staining, as validated by coexpression of CD45. Colored arrows denote various imaging artifacts including cross-channel contamination (red), noise (green) and aggregates (orange). Each row in the image shows one stage of processing by MAUI, including removal of cross channel contamination (top), denoising (middle) and removal of aggregates (bottom) as detailed. The final image is shown in the bottom right. Scalebar equals 30μm.

    (JPEG)

    S2 Fig. Analysis of CODEX data using MAUI.

    The top left panel shows an image of CD8 from a CODEX dataset (Schurch et al., Cell 2020). Staining intensity is shown as a heatmap from blue (low) to high (yellow). White arrows denote real CD8 staining, as validated by coexpression of CD3 and CD45. Colored arrows denote various imaging artifacts including cross-channel contamination (red), noise (green) and aggregates (orange). Each row in the image shows one stage of processing by MAUI, including removal of cross channel contamination (top), denoising (middle) and removal of aggregates (bottom) as detailed. The final image is shown in the bottom right. Scalebar equals 50μm.

    (JPEG)

    S3 Fig. Underlying reasons for cross channel contamination.

    (A) Shown is an image for the mass range of 128–132, which has no labeled signal in it (left column) for the first (top) and second (bottom) generations of the instrument. In the first-generation instruments this background signal (right column) mirrors the bare slide and therefore the gold channel (middle column). In the second-generation instruments (bottom), this slide-specific background is mitigated and the background only has salt-and-pepper noise. (B) MIBI-TOF staining of human FFPE tonsil with free 115Indium and antibodies for Histone H3 and CD45 shows nuclear localization of 115Indium. Scalebar equals 10μm.

    (JPEG)

    Attachment

    Submitted filename: MAUI_pbp.pdf

    Data Availability Statement

    The data can be found at https://github.com/angelolab/MAUI.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES