DeepION: A Deep Learning-Based Low-Dimensional Representation Model of Ion Images for Mass Spectrometry Imaging

Lei Guo; Chengyi Xie; Rui Miao; Jingjing Xu; Xiangnan Xu; Jiacheng Fang; Xiaoxiao Wang; Wuping Liu; Xiangwen Liao; Jianing Wang; Jiyang Dong; Zongwei Cai

doi:10.1021/acs.analchem.3c05002

. 2024 Feb 20;96(9):3829–3836. doi: 10.1021/acs.analchem.3c05002

DeepION: A Deep Learning-Based Low-Dimensional Representation Model of Ion Images for Mass Spectrometry Imaging

Lei Guo ^†, Chengyi Xie ^‡,^§, Rui Miao ^∥, Jingjing Xu ^∥, Xiangnan Xu ^⊥, Jiacheng Fang ^‡, Xiaoxiao Wang ^‡, Wuping Liu ^∇, Xiangwen Liao ^†,^*, Jianing Wang ^‡,^*, Jiyang Dong ^∥,^*, Zongwei Cai ^‡,^§,^*

PMCID: PMC10918617 PMID: 38377545

Abstract

Mass spectrometry imaging (MSI) is a high-throughput imaging technique capable of the qualitative and quantitative in situ detection of thousands of ions in biological samples. Ion image representation is a technique that produces a low-dimensional vector embedded with significant spectral and spatial information on an ion image, which further facilitates the distance-based similarity measurement for the identification of colocalized ions. However, given the low signal-to-noise ratios inherent in MSI data coupled with the scarcity of annotated data sets, achieving an effective ion image representation for each ion image remains a challenge. In this study, we propose DeepION, a novel deep learning-based method designed specifically for ion image representation, which is applied to the identification of colocalized ions and isotope ions. In DeepION, contrastive learning is introduced to ensure that the model can generate the ion image representation in a self-supervised manner without manual annotation. Since data augmentation is a crucial step in contrastive learning, a unique data augmentation strategy is designed by considering the characteristics of MSI data, such as the Poisson distribution of ion abundance and a random pattern of missing values, to generate plentiful ion image pairs for DeepION model training. Experimental results of rat brain tissue MSI show that DeepION outperforms other methods for both colocalized ion and isotope ion identification, demonstrating the effectiveness of ion image representation. The proposed model could serve as a crucial tool in the biomarker discovery and drug development of the MSI technique.

1. Introduction

Mass spectrometry imaging (MSI) is a high-throughput molecular imaging technique that enables the spatial localization of thousands of biomolecules in tissue sections.¹ The capabilities of MSI in biochemical characterization with spatial information promote its extensive applications in biology, clinical medicine, pharmaceutical research, and environmental science.²⁻⁵ When MSI acquires a full mass spectrum at each pixel of a tissue section, it produces thousands of ion images. Each image depicts the spatial distribution of specific ions or groups of ions. Co-localization refers to the quantification of the spatial correlation between ion images. When paired images from ions show high spatial similarity, they are termed colocalized ions. Identifying these colocalized ions is vital to interpreting complex MSI data in a biological context.

The identification of a colocalization ion involves looking up a similar spatial distribution ion by using distance-based similarity metrics. Although the high spectral resolution and rich spatial information in MSI makes them suitable for determining the subtle metabolic differences between regions of tissue, their high dimensionality, complex spatial structure, and low signal-to-noise ratios (SNR) also pose challenges in data interpretation, including similarity measurements of ion images. While image representation has been an important topic in computer vision and pattern recognition, it conveys information about the images by mapping raw image data directly into abstract semantic representations. Therefore, a “meaningful representation” of an ion image (an ion image representation) which captures significant spectral and spatial information to project onto a dense vector would help to reveal the intrinsic features embedded in the data and simplify the recognition of colocalized ions.⁶ Unlike the task of rendering ion images from MSI data, which is achieved by many commercial and home-built software packages,⁷⁻⁹ the ion image representation aims to mine hyperspectral MSI data for obtaining colocalized or isotope ion images which have similar spatial distributions.

Several methods have been developed to generate ion image representations. Supervised methods such as ColocML rely heavily on high-quality annotated data which makes them sensitive to the experiment artifacts and noise and reduces generalization capacities.¹⁰ On the contrary, unsupervised methods that abstract the underlying spatial patterns of individual 2D ion images without manual annotations are more practical in learning ion image representation.

Unsupervised methods for ion image representation can be roughly divided into three categories based on similarity measurement (SIM),¹⁰⁻¹³ dimensionality reduction (DR),^14,15 and deep learning (DL)^16,17 strategies, respectively. The SIM-based methods utilize vector-based distance calculations to quantify the spatial similarity of ion images through reshaping the original size of the image into a vector. DR-based methods first map the high-dimensional features into a low-dimensional embedding space and then measure the distance between the embeddings of ion images. Obviously, either SIM- or DR-based methods do not take the spatial information on pixels into account, which might be prone to missing relevant, localized differences between ions. Recent advances in deep learning frameworks for image representation in computer vision have opened up new opportunities for learning effective ion image representation.¹⁸⁻²³ However, these DL-based methods are designed exclusively to handle the semantic relevant tasks of natural images, thereby being inadaptable for extracting image features of MSI data in the presence of plenty of noise and missing values. In addition, there is a special case in colocalization ions, i.e., isotope peaks natively from the same molecule (as shown in Figure S1). The ion images from isotope ions show not only the similarity in the spatial distribution but also the correlation of signal intensity. Therefore, discriminating the isotope ions from other colocalized ions of different molecules would assist the accurate and reliable molecular identification in MSI.

In this work, we propose DeepION, a new deep learning approach based on contrastive learning to generate effective ion image representations in a self-supervised manner without manual annotation. Two modes of DeepION, denoted as COL and ISO, are designed for the cases of regular colocalized ions from different molecules and isotope ions from the same molecule, respectively. As modality-specific data augmentation is critical to the performance of models trained by contrastive learning, we propose two data augmentation strategies especially for the COL and ISO mods by considering the characteristics of the Poisson distribution and missing patterns in MSI.²⁴ The MSI data sets from two sections of rat brain tissues are used to assess the capability of DeepION on ion colocalization in comparison with SIM-based methods including the Euclidean distance, cosine distance, Pearson correlation coefficient (PCC), structure similarity index measure (SSIM),²⁵ and determination coefficient R², DR-based methods including principal component analysis (PCA), t-distribution stochastic neighbor embedding (t-SNE), and uniform manifold approximation and projection (UMAP), and other DL-based methods including ResNet18²⁶ and SimSiam.²³

2. Materials

2.1. Sample Preparation and Data Acquisition

Four-week-old SD rats are housed in sterile individually ventilated cages with a 12 h light/dark cycle at 22 °C and 45% relative humidity. The rats are sacrificed, and brain tissues are dissected and stored at −80 °C prior to section preparation. Experimental protocols are approved by the Hong Kong Baptist University Committee on the Use of Human and Animal Subjects in Teaching and Research.²⁷

Rat brain tissue is cryosectioned at 10 μm using a CryoStar Nx70 cryostat (Thermal Fisher Scientific, Walldorf, Germany). The tissue cryosections are then transferred onto an indium tin oxide (ITO)-coated glass slide and placed in a vacuum chamber for half an hour before matrix application. Matrix solutions including 2,5-dihydroxybenzoic acid (DHB) and N-(1-naphthyl)ethylenediamine dihydrochloride (NEDC) are prepared at concentrations of 20 and 5 mg/mL in MeOH, respectively. A home-built pneumatic-assisted electrospray deposition system is used for matrix application at a flow rate of 15 μL/min for 13 cycles. MSI data are acquired at 50 μm spatial resolution using timsTOF fleX MALDI-2 (Bruker Daltonics, USA). A m/z range of 100 to 2000 was covered in both positive and negative ion modes. Raw MSI data exported from the instrument are about 4.4 GB (negative mode) and 4.7 GB (positive mode) in size.

2.2. Data Preprocessing

Data preprocessing, including peak picking, peak alignment, peak filtering, peak pooling, hotspot removal, and normalization, is conducted to generate ion images from raw MSI data. Here, peak picking and peak alignment are performed using SCiLs Lab software (Bruker Company, Germany). Peak filtering and peak pooling are performed using in-house Python scripts (details in our previous work^28,29). After that, the raw MSI data is converted to a three-dimensional matrix X_(M×N×H), where M and N are the horizontal and vertical numbers of pixels, respectively, and H represents the dimensionality of ions corresponding to the m/z bins. For each ion image, those signals with intensities higher than 99% maximum intensity are termed hotspots and are truncated to eliminate the influence of extreme high intensities of ions on spectral normalization and visual inspection. Then the intensities of signals on an ion image are normalized to the range of [0, 1]. Taking the ion image x as an example, normalization is performed as follows:

Finally, two MSI data sets containing 2165 images with a 192 × 425 shape (in negative mode) and 2263 images with a 198 × 422 shape (in positive mode) are obtained.

3. Methods

The workflow of DeepION is shown in Figure 1. Two modes of DeepION, i.e., the COL and ISO modes, are designed for ordinary colocalized ions from different molecules and isotope ions from the same molecule, respectively.

Schematic overview of DeepION consisting of four modules. (1) Data augmentation module. The original ion image x is first imported into the data augmentation module T to generate two augmented images x₁ and x₂, where the T_COL, including color jitter, filtering, Poisson noise, and a random missing value, is carried in the COL mode, while T_ISO introduces an additional process for the intensity-dependent missing value in ISO mode. (2) Encoder module. Two images x₁ and x₂ are propagated through a pair of ResNet18-based encoders f that share parameters and then output two 512-dimensional representation vectors r₁ and r₂, respectively. (3) Projection module g and prediction module q are used to avoid collapsing solutions during the optimization process of maximizing the similarity between two augmentations from the same image²³ and ensure the learning of the meaningful representation vectors r₁ and r₂. A contrasting loss is employed to maximize the similarity with a stop-gradient operation on h₂ to prevent the reaction from collapsing during training. (4) Dimensionality reduction module. This module is applied to further reduce the dimensionality of ion image representation to a 20-dimensional vector O for downstream tasks.

3.1. Data Augmentation Based on MSI Prior Knowledge

Data augmentation, which involves applying different transformations to original images to produce new training data for the DeepION model, is crucial for contrastive learning.¹⁸ It is recognized that when the training data closely mirrors the distribution of the test data for a task, the model performs better. To ensure that the learned ion image representations are capable of abstracting high-level features of MSI, a modality-specific data augmentation strategy is designed by incorporating the characteristics of MSI data (detailed descriptions seen in Material S1), including color jitter, filtering, Poisson noise, random missing value, and intensity-dependent missing value (Figure 2). Here, color jitter and filtering are commonly used in natural images to make the model insensitive to the brightness of images and concentrate on the critical features. The operations of Poisson noise and random missing value are carried out in the COL mode of DeepION, which aims to simulate the data distributions within MSI, whereas the intensity-dependent missing value operation is conducted additively in ISO mode by setting the missing ratio of ions proportional to their intensities.

In practical applications, the COL mode and ISO mode of DeepION whose differences lay in the augmentation strategy are flexibly used for the identification of colocalized ions and isotope ions, respectively.

3.2. Architecture of the DeepION Model

The proposed DeepION model is constructed based on the SimSiam model,¹⁸ which aims to extract high-level spatial features from the ion image to obtain a meaningful ion image representation for each ion image in a self-supervised manner. DeepION is composed with the data augmentation module, encoder module, projection module, prediction module, and dimensionality reduction module.

The data augmentation module on the DeepION architecture is applied to create modified copies of the existing ion images with some minor changes, facilitating self-supervised training of the DeepION model.

The encoder module is used to learn an ion image representation function f(•|θ^f) to extract the high-level features from ion images for downstream tasks. It consists of twin networks that share identical parameters and adopt the pretrained Resnet18 as the backbone network (Figure S2A). Taking the ion image x as an example, the encoder takes two augmented views x₁ and x₂ from ion image x as inputs. Then, two ion image representations r¹ (d = 512) and r² (d = 512) corresponding to the same image x are obtained as follows:

The target of projection is to learn a multilayer perceptron (MLP) function g(•|θ^g) to ensure that the econder module outputs a meaningful ion image representation. Here, the projection module consists of three fully connected (FC) layers (as shown in Figure S2B). Then, two outputs h₁ and h₂ are obtained as follows:

The target of prediction is to learn an MLP function to avoid the model producing the collapse results. It consists of two FC layers (as shown in Figure S2C). Then, outputs p₁ and p₂ are obtained as follows:

The target of dimensionality reduction is to generate a dense vector for similarity measurement and alleviate the adverse impact of the “curse of dimensionality” when the high-dimensional ion image representation r (d = 512) from the encoder is directly applied to the calculation. Since the UMAP algorithm has been demonstrated to perform better than other dimensionality reduction methods in many fields, we adopt it to obtain the denser ion image representation O as follows

where the reduced dimension is set to 20 by a trade-off between space–time complexity and information utilization. Then, to facilitate the calculation of similarity, a min–max scaler is adopted to adjust the range of features to [0, 1], as follows:

The output of the dimensionality reduction module is the low-dimensional ion image representation of an ion image that can be applied to downstream tasks.

3.3. Model Training

The Adam optimizer is adopted to train the encoder and prediction module, where the learning rate is set to 0.0003 and the momentum parameters β₁ and β₂ are set to 0.5 and 0.99, respectively. The initialization parameters of the DeepION model are set to follow the normal distribution N(0,0.02). Contrastive learning-based models often benefit from a large batch size so that the batch size is determined by the size allowed by the capacity of the GPU memory. The number of iterations is set to 100. Furthermore, to prevent the DeepION model from collapsing, we use the stop gradient operation during the model training. Specifically, the loss function of the DeepION model is as follows

where the encoder on x₂ receives no gradient from h² in the first term but receives gradients from h² in the second term (and vice versa for x₁).

3.4. Model Implementation

After model training, the output of the encoder is directly input into the dimensionality reduction module, and the output of the dimensionality reduction module is taken as the final output of the DeepION model. The DeepMSI model is implemented in Python using the PyTorch library, and the model is trained on a workstation equipped with a GPU Nvidia GTX 2080Ti graphics card.

4. Results and Discussion

The proposed DeepION is compared with SIM-based, DR-based, and the other DL-based methods for the task of identifying colocalized ions and isotope ions, respectively. The Euclidean distance is used to quantify the similarity between ion image representations generated by DR-based and DL-based methods. The performance of the algorithms is assessed by visual inspection and quantitative evaluations.

4.1. Colocalization Ion Searching

Two data sets of MSI ion images are employed to investigate the capabilities of DeepION on colocalized ions. Here, we select five representative ions to be query ions to obtain the most similar ion images via calculating the distance between ion image representations (Figure 3, Figure S3). For negative ion mode, the high expression of query ion m/z 213.902 is observed on the locations of the cortex and cerebellum, while query ion m/z 214.047 is highly expressed in the olfactory bulb. Intuitively, the colocalization of ion pairs should preserve the structural similarities on tissue between their ion images. By using the DeepION model, the target ions m/z 215.900, m/z 251.877, m/z 253.875, and m/z 249.878 are recognized to be colocalized with m/z 213.902 (Figure 3a), while the target ions m/z 338.912, m/z 340.910, m/z 303.943, and m/z 302.935 are colocalized with m/z 214.047 (Figure 3b). The colocalization ions of query ion m/z 213.902 from R², SSIM, PCA and UMAP (Figure S4) show partial consistency with DeepION by a visual inspection of the morphological structure of ion images. That is not the case for query ion m/z 214.047 (Figure S5), where misrecognized colocalized ions occurred with other methods. It is demonstrated that DeepION performs best in the task of discovering regular colocalized ions.

Identification of a colocalized ion for a representative query ion in negative mode. (a) m/z 213.902, (b) m/z 214.047, (c) m/z 701.509, and (d) m/z 772.634. A shorter distance indicates greater similar between the query ion and colocalized ion candidate.

Then, a benchmark data set of ion images, which were manually annotated by experienced MSI technicians, is introduced for quantitative evaluation. Here, 98 ion images are manually divided into 17 colocalized ion categories within this data set, and details of the data distribution are shown in Figure S6. The performance of ion image representation is quantified by constructing a linear classifier to predict the label of each image in which leave-one-out validation is conducted to obtain the classification accuracy, similar to previous work.^18−20,23 As shown in Table 1, ResNet18, SimSiam, and DeepION methods have higher accuracy than the other methods, which are attributed to the benefits of ion image representations from the inclusion of spatial information. In addition, the COL mode of the DeepION model achieves 86.73%, which outperforms the second-best SimSiam model by about 25%. The main difference between the DeepION model and SimSiam model lies in their data augmentation strategies. The SimSiam model uses a standard data augmentation designed for a natural image, while DeepION utilizes an MSI-specific scheme based on prior knowledge. These results demonstrate that the development of modality-specific data augmentation is critical to the performance of DeepION in the task of discovering colocalized ions.

Table 1. Comparison Results among Different Methods.

Categories	Metric	ACC (%)
SIM-based method	Euclidean	12.24
	Cosine	31.63
	PCC	37.76
	R²	30.61
	SSIM	23.47
DR-based method	PCA	42.86
	t-SNE	14.29
	UMAP	40.82
DL-based method	ResNet18	52.04
	SimSiam	62.24
	DeepION	86.73

Open in a new tab

4.2. Isotope Ion Identification

The ISO mode of the DeepION model is compared with two SIM-based methods, PCC and R², which are used to identify isotope ions in MATASPACE³⁰ and the rMSI package³¹ (details shown in Material S2). Figure 4 shows the colocalization results of monoisotopes (white node) and isotopes (black node) in which the edges represent the pair of colocalized monoisotope–isotope ions. Observing the results in the negative mode(Figure 4a), the relevant isotopes are accurately discovered for query monoisotpes m/z 302.935, m/z 699.493, m/z 718.534, and m/z 1544.847 using DeepION. Figure S7 and Table S1 show the colocalization isotope ions by PCC and R². Although PCC and R² could recognize the isotope for monisotopes with high SNR, they are both incapable of discovering the isotope m/z 1547.8702 for m/z 1544.8471 (PCC < 0.5, R² < 0), where their proportions of missing values are 88.62 and 70.96%. Other results of isotope discovery are displayed in Figures S8 and S9. DeepION with ISO mode attains better performance for isotope discovery during visual inspection, which still detects the isotopes with low SNR.

Isotope ion discovery for rat MSI data. (a) Negative ion mode. (b) Positive ion mode.

In addition to visual inspection, we also conduct the quantitative evaluation for the ISO mode of DeepION. Here, monoisotope–isotopes are manually picked out and annotated one by one. The proposed DeepION model obtains 249 monoisotope–isotope pairs with 75.90% accuracy in negative mode and obtains 442 pairs with 92.76% accuracy in positive mode. The isotope ions identified by the ISO mode of DeepION are listed in Tables S2 and S3, where the rows colored in white and red indicate true and false discovery, respectively. The colocalized ions that are identified to have similar spatial distributions but do not belong to the isotope are colored in blue, for example, m/z 245.882 and m/z 247.879, or m/z 362.642 and m/z 364.639 (as see Figure S10). These probably happen when potential isotopic ions overlap with ions from the matrix or ions with similar m/z values that share identical spatial distributions.³² Additional chemical knowledge of molecules as a constraint of the model may further improve the performance of DeepION and facilitate more reasonable results. The proposed DeepION model introduces MSI domain knowledge to data augmentation to learn the characteristics of MSI data, which further improves the performance of DL-based methods in the task of discovering isotope ions.

5. Conclusions

Effective ion image representation means that ions with similar spatial distributions are close together in the embedded space, whereas ions with different spatial distributions are far away. It can facilitate the identification of colocalized ions and isotope ions. In this study, we present DeepION, a new DL-based method for ion image representation. The current results show that the DeepION model outperforms other previous methods in tasks of identifying colocalized ions and isotope ions.

The benefits of DeepION can be attributed to the following factor: (1) By introducing contrastive learning, the DeepION model targets the extraction of high-level spatial features from ion images to obtain a concise image representation in a self-supervised manner without manual annotation. The training objective of the model is to aggregate the augmented embeddings of the same sample and push away the embeddings from different samples. Figures 3 and 4 demonstrate the excellent abilities of DeepION in discovering colocalized ions and isotope ions, respectively. (2) By designing a novel data augmentation based on MSI domain knowledge, the DeepION model can further improve the image representational ability on the embedding space. The comparisons between SimSiam and DeepION prove the key roles of designed data augmentation in learning image representations of high-level molecular distribution features. (3) DeepION can be flexibly applied in the discovery of colocalized ions and isotope ions by implementing the COL mode and ISO mode, respectively. Figure S1 indicates the fine difference between colocalized ions and isotope ions so that it is necessary to design specific modes for two tasks. The proposed DeepION with COL mode outputs a similarity score for both colocalized ions and isotope ions, while the ISO mode gets only a similarity score on isotope ions.

In summary, the present results show the great potential of DeepION in ion image representation. The proposed method can not only be used to identify the colocalized ions but also can be applicable to isotope ions. Furthermore, the DeepION model is expected to be extended to multiple hyperspectral chemical imaging modalities, such as Raman and infrared microscopy. DeepION would be a promising tool for metabolite identification, biomarker discovery, and even metabolic flux analysis in spatially resolved metabolomics.

Acknowledgments

This work is supported by National Natural Science Foundation of China (82372087), the National Key Research Program of China (2017YFC1600505 and 2017YFE0191000), and the Natural Science Foundation of Fujian Province, China (2022Y0003).

Data Availability Statement

The source code of the DeepION model together with the data set for testing are available on a public repository (https://github.com/BioNet-XMU/DeepION).

Supporting Information Available

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.analchem.3c05002.

Data augmentation based on MSI prior knowledge; details of discovering isotope ions; difference between monoisotope–isotope ions and colocalized ions; architecture of each module in DeepION; identification of a colocalized ion for a representative query ion in positive mode; colocalization ion discovery for query ions m/z 213.902 and m/z 214.047 using different methods; data distribution of each colocalized ion category in the manually annotated data set; average mass spectra of four randomly selected regions; isotope ions identified in negative and positive modes; two examples that correspond to the blue marks in Table S2; isotope ion discovery for ions m/z 302.935, m/z 699.493, m/z 718.534, and m/z 1544.847 using PCC and R²; and isotope ions identified by the ISO mode of DeepION in the rat brain data set under negative and positive ion modes (PDF)

Author Contributions

^# L.G. and C.X. contributed equally to this work.

The authors declare no competing financial interest.

Supplementary Material

ac3c05002_si_001.pdf^{(811.9KB, pdf)}

References

Doerr A. Mass spectrometry imaging takes off. Nat. Methods. 2018, 15 (1), 32–32. 10.1038/nmeth.4546. [DOI] [Google Scholar]
Sun C. L.; Li T. G.; Song X. W.; Huang L. J.; Zang Q. C.; Xu J.; Bi N.; Jiao G. G.; Hao Y. Z.; Chen Y. H.; et al. Spatially resolved metabolomics to discover tumor-associated metabolic alterations. Proc. Natl. Acad. Sci. U. S. A. 2019, 116 (1), 52–57. 10.1073/pnas.1808950116. [DOI] [PMC free article] [PubMed] [Google Scholar]
Eberlin L. S.; Norton I.; Orringer D.; Dunn I. F.; Liu X. H.; Ide J. L.; Jarmusch A. K.; Ligon K. L.; Jolesz F. A.; Golby A. J.; et al. Ambient mass spectrometry for the intraoperative molecular diagnosis of human brain tumors. Proc. Natl. Acad. Sci. U. S. A. 2013, 110 (5), 1611–1616. 10.1073/pnas.1215687110. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zhang J.; Due Q. Q.; Song X. W.; Gao S. S.; Pang X. C.; Li Y.; Zhang R. P.; Abliz Z.; He J. M. Evaluation of the tumor-targeting efficiency and intratumor heterogeneity of anticancer drugs using quantitative mass spectrometry imaging. Theranostics 2020, 10 (6), 2621–2630. 10.7150/thno.41763. [DOI] [PMC free article] [PubMed] [Google Scholar]
Diao X.; Xie C. Y.; Xie G. S.; Song Y. Y.; Liang Y. S.; Li R. J.; Dong C.; Zhu L.; Wang J. N.; Cai Z. W. Mass Spectrometry Imaging Revealed Sulfatides Depletion in Brain Tissues of Rats Exposed in Real Air with High Fine Particulate Matter. Environ. Sci. Technol. Lett. 2022, 9, 856. 10.1021/acs.estlett.2c00615. [DOI] [Google Scholar]
Hu H.; Laskin J.. Emerging Computational Methods in Mass Spectrometry Imaging. Adv. Sci. 2022, 9 ( (34), ). DOI: 10.1002/advs.202203339. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ellin N. R.; Miranda-Quintana R. A.; Prentice B. M. Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry. bioRxiv. 2023, 10.1101/2023.07.27.550838. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zou Y. C.; Tang W. W.; Li B. Spatial segmentation of mass spectrometry imaging data featuring selected principal components. Talanta. 2023, 253 (1), 123958 10.1016/j.talanta.2022.123958. [DOI] [PubMed] [Google Scholar]
Bi S. W.; Wang M. J. C.; Pu Q. L.; Yang J. X.; Jiang N.; Zhao X. S.; Qiu S. Y.; Liu R. Q.; Xu R. J.; Li X.; Hu C. G.; Yang L.; Gu J.; Du D. Multi-MSIProcessor: Data Visualizing and Analysis Software for Spatial Metabolomics Research. Anal. Chem. 2024, 96, 339. 10.1021/acs.analchem.3c04192. [DOI] [PubMed] [Google Scholar]
Ovchinnikova K.; Stuart L.; Rakhlin A.; Nikolenko S.; Alexandrov T. ColocML: machine learning quantifies co-localization between mass spectrometry images. Bioinformatics 2020, 36 (10), 3215–3224. 10.1093/bioinformatics/btaa085. [DOI] [PMC free article] [PubMed] [Google Scholar]
McDonnell L. A.; van Remoortere A.; van Zeijl R. J. M.; Deelder A. M. Mass spectrometry image correlation: Quantifying colocalization. J. Proteome. Res. 2008, 7 (8), 3619–3627. 10.1021/pr800214d. [DOI] [PubMed] [Google Scholar]
Wüllems K.; Kölling J.; Bednarz H.; Niehaus K.; Hans V. H.; Nattkemper T. W.. Detection and visualization of communities in mass spectrometry imaging data. BMC Bioinformatics 2019, 20. DOI: 10.1186/s12859-019-2890-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Inglese P.; Correia G.; Pruski P.; Glen R. C.; Takats Z. Colocalization Features for Classification of Tumors Using Desorption Electrospray Ionization Mass Spectrometry Imaging. Anal. Chem. 2019, 91 (10), 6530–6540. 10.1021/acs.analchem.8b05598. [DOI] [PMC free article] [PubMed] [Google Scholar]
Smets T.; Waelkens E.; De Moor B. Prioritization of m/z Values in Mass Spectrometry Imaging Profiles Obtained Using Uniform Manifold Approximation and Projection for Dimensionality Reduction. Anal. Chem. 2020, 92 (7), 5240–5248. 10.1021/acs.analchem.9b05764. [DOI] [PubMed] [Google Scholar]
Alexandrov T.; Chernyavsky I.; Becker M.; von Eggeling F.; Nikolenkov S. Analysis and Interpretation of Imaging Mass Spectrometry Data by Clustering Mass-to-Charge Images According to Their Spatial Similarity. Anal. Chem. 2013, 85 (23), 11189–11195. 10.1021/ac401420z. [DOI] [PubMed] [Google Scholar]
Hu H.; Bindu J. P.; Laskin J. Self-supervised clustering of mass spectrometry imaging data using contrastive learning. Chem. Sci. 2021, 13 (1), 90–98. 10.1039/D1SC04077D. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zhang W. Q.; Claesen M.; Moerman T.; Groseclose M. R.; Waelkens E.; De Moor B.; Verbeeck N. Spatially aware clustering of ion images in mass spectrometry imaging data using deep learning. Anal. Bioanal. Chem. 2021, 413 (10), 2803–2819. 10.1007/s00216-021-03179-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
Chen T.; Kornblith S.; Norouzi M.; Hinton G.. A Simple Framework for Contrastive Learning of Visual Representations. 37th International Conference on Machine Learning 2020, 119. DOI: 10.48550/arXiv.2002.05709. [DOI]
He K. M.; Fan H. Q.; Wu Y. X.; Xie S. N.; Girshick R.. Momentum Contrast for Unsupervised Visual Representation Learning. 33th IEEE Conference on Computer Vision and Pattern Recognition 2020, 9726–9735. DOI: 10.48550/arXiv.1911.05722. [DOI]
Grill J. B.; Strub F.; Altché F.; Tallec C.; Richemond P. H.; Buchatskaya E.; Doersch C.; Pires A. B.; Guo Z. H. D.; Azar M. J.; Piot b.; Kavukcuoglu K.; Munos R.; Valko M. Bootstrap your own latent: A new approach to self-supervised Learning. arxiv 2020, 10.48550/arXiv.2006.07733. [DOI] [Google Scholar]
Liu X.; Zhang F. J.; Hou X. Y.; Mian Li; Wang Z. Y.; Zhang J.; Tang J. Self-supervisedLearning: Generativeor Contrastive. IEEE. T. Knowl. Data. En. 2021, 35 (1), 857–876. 10.1109/TKDE.2021.3090866. [DOI] [Google Scholar]
Van Gansbeke W.; Vandenhende S.; Georgoulis S.; Proesmans M.; Van Gool L. SCAN: Learning to Classify Images without Labels. Arxiv. 2020, 10.48550/arXiv.2005.12320. [DOI] [Google Scholar]
Chen X. L.; He K. M.. Exploring Simple Siamese Representation Learning. 2021 Ieee/Cvf Conference on Computer Vision and Pattern Recognition, 34th IEEE Conference on Computer Vision and Pattern Recognition 2021, 15745–15753. DOI: 10.1109/Cvpr46437.2021.01549. [DOI]
Siy P. W.; Moffitt R. A.; Parry R. M.; Chen Y. F.; Liu Y.; Sullards M. C.; Merrill A. H.; Wang M. D.. Matrix Factorization Techniques for Analysis of Imaging Mass Spectrometry Data. 8th IEEE International Conference on Bioinformatics and Bioengineering 2008. DOI: 10.1109/BIBE.2008.4696797. [DOI] [PMC free article] [PubMed]
Ekelöf M.; Garrard K. P.; Judd R.; Rosen E. P.; Xie D. Y.; Kashuba A. D. M.; Muddiman D. C. Evaluation of Digital Image Recognition Methods for Mass Spectrometry Imaging Data Analysis. J. Am. Soc. Mass. Spectrom. 2018, 29 (12), 2467–2470. 10.1007/s13361-018-2073-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
He K. M.; Zhang X. Y.; Ren S. Q.; Sun J.. Deep Residual Learning for Image Recognition. 29th IEEE Conference on Computer Vision and Pattern Recognition 2016, 770–778. DOI: 10.1109/Cvpr.2016.90. [DOI]
Xie C. Y.; Chen Y. Y.; Wang X. X.; Song Y. Y.; Shen Y. T.; Diao X.; Zhu L.; Wang J. N.; Cai Z. W. Chiral derivatization-enabled discrimination and on-tissue detection of proteinogenic amino acids by ion mobility mass spectrometry. Chem. Sci. 2022, 13 (47), 14114–14123. 10.1039/D2SC03604E. [DOI] [PMC free article] [PubMed] [Google Scholar]
Guo L.; Hu Z. X.; Zhao C.; Xu X. N.; Wang S. J.; Xu J. J.; Dong J. Y.; Cai Z. W. Data Filtering and Its Prioritization in Pipelines for Spatial Segmentation of Mass Spectrometry Imaging. Anal. Chem. 2021, 93 (11), 4788–4793. 10.1021/acs.analchem.0c05242. [DOI] [PubMed] [Google Scholar]
Guo L.; Dong J. Y.; Xu X. N.; Wu Z. C.; Zhang Y. B.; Wang Y. W.; Li P. F.; Tang Z.; Zhao C.; Cai Z. W. Divide and Conquer: A Flexible Deep Learning Strategy for Exploring Metabolic Heterogeneity from Mass Spectrometry Imaging Data. Anal. Chem. 2023, 95 (3), 1924–1932. 10.1021/acs.analchem.2c04045. [DOI] [PMC free article] [PubMed] [Google Scholar]
Palmer A.; Phapale P.; Chernyavsky I.; Lavigne R.; Fay D.; Tarasov A.; Kovalev V.; Fuchser J.; Nikolenko S.; Pineau C.; et al. FDR-controlled metabolite annotation for high-resolution imaging mass spectrometry. Nat. Methods. 2017, 14 (1), 57–60. 10.1038/nmeth.4072. [DOI] [PubMed] [Google Scholar]
Sementé L.; Baquer G.; García-Altares M.; Correig-Blanchar X.; Ràfols P.. rMSIannotation: A peak annotation tool for mass spectrometry imaging based on the analysis of isotopic intensity ratios. Anal. Chim. Acta 2021, 1171. [DOI] [PubMed] [Google Scholar]
Verbeeck N.; Caprioli R. M.; van de Plas R. Unsupervised machine learning for exploratory data analysis in imaging mass spectrometry. Mass. Spectrom. Rev. 2020, 39 (3), 245–291. 10.1002/mas.21602. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

ac3c05002_si_001.pdf^{(811.9KB, pdf)}

Data Availability Statement

The source code of the DeepION model together with the data set for testing are available on a public repository (https://github.com/BioNet-XMU/DeepION).

[ref1] Doerr A. Mass spectrometry imaging takes off. Nat. Methods. 2018, 15 (1), 32–32. 10.1038/nmeth.4546. [DOI] [Google Scholar]

[ref2] Sun C. L.; Li T. G.; Song X. W.; Huang L. J.; Zang Q. C.; Xu J.; Bi N.; Jiao G. G.; Hao Y. Z.; Chen Y. H.; et al. Spatially resolved metabolomics to discover tumor-associated metabolic alterations. Proc. Natl. Acad. Sci. U. S. A. 2019, 116 (1), 52–57. 10.1073/pnas.1808950116. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref3] Eberlin L. S.; Norton I.; Orringer D.; Dunn I. F.; Liu X. H.; Ide J. L.; Jarmusch A. K.; Ligon K. L.; Jolesz F. A.; Golby A. J.; et al. Ambient mass spectrometry for the intraoperative molecular diagnosis of human brain tumors. Proc. Natl. Acad. Sci. U. S. A. 2013, 110 (5), 1611–1616. 10.1073/pnas.1215687110. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref4] Zhang J.; Due Q. Q.; Song X. W.; Gao S. S.; Pang X. C.; Li Y.; Zhang R. P.; Abliz Z.; He J. M. Evaluation of the tumor-targeting efficiency and intratumor heterogeneity of anticancer drugs using quantitative mass spectrometry imaging. Theranostics 2020, 10 (6), 2621–2630. 10.7150/thno.41763. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref5] Diao X.; Xie C. Y.; Xie G. S.; Song Y. Y.; Liang Y. S.; Li R. J.; Dong C.; Zhu L.; Wang J. N.; Cai Z. W. Mass Spectrometry Imaging Revealed Sulfatides Depletion in Brain Tissues of Rats Exposed in Real Air with High Fine Particulate Matter. Environ. Sci. Technol. Lett. 2022, 9, 856. 10.1021/acs.estlett.2c00615. [DOI] [Google Scholar]

[ref6] Hu H.; Laskin J.. Emerging Computational Methods in Mass Spectrometry Imaging. Adv. Sci. 2022, 9 ( (34), ). DOI: 10.1002/advs.202203339. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref7] Ellin N. R.; Miranda-Quintana R. A.; Prentice B. M. Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry. bioRxiv. 2023, 10.1101/2023.07.27.550838. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref8] Zou Y. C.; Tang W. W.; Li B. Spatial segmentation of mass spectrometry imaging data featuring selected principal components. Talanta. 2023, 253 (1), 123958 10.1016/j.talanta.2022.123958. [DOI] [PubMed] [Google Scholar]

[ref9] Bi S. W.; Wang M. J. C.; Pu Q. L.; Yang J. X.; Jiang N.; Zhao X. S.; Qiu S. Y.; Liu R. Q.; Xu R. J.; Li X.; Hu C. G.; Yang L.; Gu J.; Du D. Multi-MSIProcessor: Data Visualizing and Analysis Software for Spatial Metabolomics Research. Anal. Chem. 2024, 96, 339. 10.1021/acs.analchem.3c04192. [DOI] [PubMed] [Google Scholar]

[ref10] Ovchinnikova K.; Stuart L.; Rakhlin A.; Nikolenko S.; Alexandrov T. ColocML: machine learning quantifies co-localization between mass spectrometry images. Bioinformatics 2020, 36 (10), 3215–3224. 10.1093/bioinformatics/btaa085. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref11] McDonnell L. A.; van Remoortere A.; van Zeijl R. J. M.; Deelder A. M. Mass spectrometry image correlation: Quantifying colocalization. J. Proteome. Res. 2008, 7 (8), 3619–3627. 10.1021/pr800214d. [DOI] [PubMed] [Google Scholar]

[ref12] Wüllems K.; Kölling J.; Bednarz H.; Niehaus K.; Hans V. H.; Nattkemper T. W.. Detection and visualization of communities in mass spectrometry imaging data. BMC Bioinformatics 2019, 20. DOI: 10.1186/s12859-019-2890-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref13] Inglese P.; Correia G.; Pruski P.; Glen R. C.; Takats Z. Colocalization Features for Classification of Tumors Using Desorption Electrospray Ionization Mass Spectrometry Imaging. Anal. Chem. 2019, 91 (10), 6530–6540. 10.1021/acs.analchem.8b05598. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref14] Smets T.; Waelkens E.; De Moor B. Prioritization of m/z Values in Mass Spectrometry Imaging Profiles Obtained Using Uniform Manifold Approximation and Projection for Dimensionality Reduction. Anal. Chem. 2020, 92 (7), 5240–5248. 10.1021/acs.analchem.9b05764. [DOI] [PubMed] [Google Scholar]

[ref15] Alexandrov T.; Chernyavsky I.; Becker M.; von Eggeling F.; Nikolenkov S. Analysis and Interpretation of Imaging Mass Spectrometry Data by Clustering Mass-to-Charge Images According to Their Spatial Similarity. Anal. Chem. 2013, 85 (23), 11189–11195. 10.1021/ac401420z. [DOI] [PubMed] [Google Scholar]

[ref16] Hu H.; Bindu J. P.; Laskin J. Self-supervised clustering of mass spectrometry imaging data using contrastive learning. Chem. Sci. 2021, 13 (1), 90–98. 10.1039/D1SC04077D. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref17] Zhang W. Q.; Claesen M.; Moerman T.; Groseclose M. R.; Waelkens E.; De Moor B.; Verbeeck N. Spatially aware clustering of ion images in mass spectrometry imaging data using deep learning. Anal. Bioanal. Chem. 2021, 413 (10), 2803–2819. 10.1007/s00216-021-03179-w. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref18] Chen T.; Kornblith S.; Norouzi M.; Hinton G.. A Simple Framework for Contrastive Learning of Visual Representations. 37th International Conference on Machine Learning 2020, 119. DOI: 10.48550/arXiv.2002.05709. [DOI]

[ref19] He K. M.; Fan H. Q.; Wu Y. X.; Xie S. N.; Girshick R.. Momentum Contrast for Unsupervised Visual Representation Learning. 33th IEEE Conference on Computer Vision and Pattern Recognition 2020, 9726–9735. DOI: 10.48550/arXiv.1911.05722. [DOI]

[ref20] Grill J. B.; Strub F.; Altché F.; Tallec C.; Richemond P. H.; Buchatskaya E.; Doersch C.; Pires A. B.; Guo Z. H. D.; Azar M. J.; Piot b.; Kavukcuoglu K.; Munos R.; Valko M. Bootstrap your own latent: A new approach to self-supervised Learning. arxiv 2020, 10.48550/arXiv.2006.07733. [DOI] [Google Scholar]

[ref21] Liu X.; Zhang F. J.; Hou X. Y.; Mian Li; Wang Z. Y.; Zhang J.; Tang J. Self-supervisedLearning: Generativeor Contrastive. IEEE. T. Knowl. Data. En. 2021, 35 (1), 857–876. 10.1109/TKDE.2021.3090866. [DOI] [Google Scholar]

[ref22] Van Gansbeke W.; Vandenhende S.; Georgoulis S.; Proesmans M.; Van Gool L. SCAN: Learning to Classify Images without Labels. Arxiv. 2020, 10.48550/arXiv.2005.12320. [DOI] [Google Scholar]

[ref23] Chen X. L.; He K. M.. Exploring Simple Siamese Representation Learning. 2021 Ieee/Cvf Conference on Computer Vision and Pattern Recognition, 34th IEEE Conference on Computer Vision and Pattern Recognition 2021, 15745–15753. DOI: 10.1109/Cvpr46437.2021.01549. [DOI]

[ref24] Siy P. W.; Moffitt R. A.; Parry R. M.; Chen Y. F.; Liu Y.; Sullards M. C.; Merrill A. H.; Wang M. D.. Matrix Factorization Techniques for Analysis of Imaging Mass Spectrometry Data. 8th IEEE International Conference on Bioinformatics and Bioengineering 2008. DOI: 10.1109/BIBE.2008.4696797. [DOI] [PMC free article] [PubMed]

[ref25] Ekelöf M.; Garrard K. P.; Judd R.; Rosen E. P.; Xie D. Y.; Kashuba A. D. M.; Muddiman D. C. Evaluation of Digital Image Recognition Methods for Mass Spectrometry Imaging Data Analysis. J. Am. Soc. Mass. Spectrom. 2018, 29 (12), 2467–2470. 10.1007/s13361-018-2073-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref26] He K. M.; Zhang X. Y.; Ren S. Q.; Sun J.. Deep Residual Learning for Image Recognition. 29th IEEE Conference on Computer Vision and Pattern Recognition 2016, 770–778. DOI: 10.1109/Cvpr.2016.90. [DOI]

[ref27] Xie C. Y.; Chen Y. Y.; Wang X. X.; Song Y. Y.; Shen Y. T.; Diao X.; Zhu L.; Wang J. N.; Cai Z. W. Chiral derivatization-enabled discrimination and on-tissue detection of proteinogenic amino acids by ion mobility mass spectrometry. Chem. Sci. 2022, 13 (47), 14114–14123. 10.1039/D2SC03604E. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref28] Guo L.; Hu Z. X.; Zhao C.; Xu X. N.; Wang S. J.; Xu J. J.; Dong J. Y.; Cai Z. W. Data Filtering and Its Prioritization in Pipelines for Spatial Segmentation of Mass Spectrometry Imaging. Anal. Chem. 2021, 93 (11), 4788–4793. 10.1021/acs.analchem.0c05242. [DOI] [PubMed] [Google Scholar]

[ref29] Guo L.; Dong J. Y.; Xu X. N.; Wu Z. C.; Zhang Y. B.; Wang Y. W.; Li P. F.; Tang Z.; Zhao C.; Cai Z. W. Divide and Conquer: A Flexible Deep Learning Strategy for Exploring Metabolic Heterogeneity from Mass Spectrometry Imaging Data. Anal. Chem. 2023, 95 (3), 1924–1932. 10.1021/acs.analchem.2c04045. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref30] Palmer A.; Phapale P.; Chernyavsky I.; Lavigne R.; Fay D.; Tarasov A.; Kovalev V.; Fuchser J.; Nikolenko S.; Pineau C.; et al. FDR-controlled metabolite annotation for high-resolution imaging mass spectrometry. Nat. Methods. 2017, 14 (1), 57–60. 10.1038/nmeth.4072. [DOI] [PubMed] [Google Scholar]

[ref31] Sementé L.; Baquer G.; García-Altares M.; Correig-Blanchar X.; Ràfols P.. rMSIannotation: A peak annotation tool for mass spectrometry imaging based on the analysis of isotopic intensity ratios. Anal. Chim. Acta 2021, 1171. [DOI] [PubMed] [Google Scholar]

[ref32] Verbeeck N.; Caprioli R. M.; van de Plas R. Unsupervised machine learning for exploratory data analysis in imaging mass spectrometry. Mass. Spectrom. Rev. 2020, 39 (3), 245–291. 10.1002/mas.21602. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

DeepION: A Deep Learning-Based Low-Dimensional Representation Model of Ion Images for Mass Spectrometry Imaging

Lei Guo

Chengyi Xie

Rui Miao

Jingjing Xu

Xiangnan Xu

Jiacheng Fang

Xiaoxiao Wang

Wuping Liu

Xiangwen Liao

Jianing Wang

Jiyang Dong

Zongwei Cai

Abstract

1. Introduction

2. Materials

2.1. Sample Preparation and Data Acquisition

2.2. Data Preprocessing

3. Methods

Figure 1.

3.1. Data Augmentation Based on MSI Prior Knowledge

Figure 2.

3.2. Architecture of the DeepION Model

3.3. Model Training

3.4. Model Implementation

4. Results and Discussion

4.1. Colocalization Ion Searching

Figure 3.

Table 1. Comparison Results among Different Methods.

4.2. Isotope Ion Identification

Figure 4.

5. Conclusions

Acknowledgments

Data Availability Statement

Supporting Information Available

Author Contributions

Supplementary Material

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases