Abstract
Background
Single-molecule localization microscopy is a super-resolution microscopy technique that allows for nanoscale determination of the localization and organization of proteins in biological samples. For biological interpretation of the data it is essential to extract quantitative information from the super-resolution data sets. Due to the complexity and size of these data sets flexible and user-friendly software is required.
Results
We developed SMoLR (Single Molecule Localization in R): a flexible framework that enables exploration and analysis of single-molecule localization data within the R programming environment. SMoLR is a package aimed at extracting, visualizing and analyzing quantitative information from localization data obtained by single-molecule microscopy. SMoLR is a platform not only to visualize nanoscale subcellular structures but additionally provides means to obtain statistical information about the distribution and localization of molecules within them. This can be done for individual images or SMoLR can be used to analyze a large set of super-resolution images at once. Additionally, we describe a method using SMoLR for image feature-based particle averaging, resulting in identification of common features among nanoscale structures.
Conclusions
Embedded in the extensive R programming environment, SMoLR allows scientists to study the nanoscale organization of biomolecules in cells by extracting and visualizing quantitative information and hence provides insight in a wide-variety of different biological processes at the single-molecule level.
Electronic supplementary material
The online version of this article (10.1186/s12859-018-2578-3) contains supplementary material, which is available to authorized users.
Keywords: Single-molecule localization, Microscopy, Image quantification, Image analysis, Super-resolution, R
Background
The revolutionary advancements in super-resolution microscopy techniques make it possible to study subcellular structures at nanoscale, using fluorescence microscopy. Single-molecule localization microscopy (SMLM) provides the highest spatial resolution that can be achieved with light microscopy today, with a lateral resolution between 10 and 20 nm [1, 2]. SMLM relies on detecting single fluorescent emitters, by separating spatially overlapping signals in time. By detecting and determining the position of individual fluorescent molecules, in densely labelled biological samples, with high precision, images can be reconstructed with a resolution an order of magnitude below the diffraction limit of the light microscope.
In many biological samples a multitude of macromolecular assemblies and protein complexes within one cell can be observed, such as DNA double strand break (DSB) foci [3, 4], nuclear pores [5], focal adhesions [6], virus particles [7] or neuronal spines [8]. Super-resolution microscopy is well suited to study those assemblies, since the increased resolution permits to investigate, at the single-molecule level, the internal composition and protein distribution of these nanoscale assemblies, which have typical diameters ranging from 100 nm up to 2 μm.
In contrast to regular microscopy data which consists of intensity values in a digital image format, SMLM data typically consists of Cartesian coordinates with corresponding localization precision. Therefore, regular image analysis tools do not directly apply to SMLM data. Numerous software packages for detection and localization of single-molecules from single-molecule localization data are available (reviewed and benchmarked in [9]), that allow reliable image reconstruction for SMLM. Additionally tools have been developed which allow more in-depth (3D) visualization of the localization data (PALMsiever [10], ViSP [11], PYME [12]), clustering (SR-Tesseler [13], 3DClusterVisu [14]) and extraction of quantitative information (SharpViSu [15], LAMA [16] and Grafeo [17]) (Table 1).
Table 1.
Programming environment | Visualization | Clustering/ segmentation | Quantification | GUI | Batch mode/Scriptable | Reference | |
---|---|---|---|---|---|---|---|
VisP | C++ | + | – | – | + | – | [11] |
PALMsiever | Matlab | + | – | – | + | + | [10] |
SR-Tesseler | C++ | + | Voronoi | + | + | – | [13] |
PYME | Python | + | – | – | + | + | [12] |
SharpViSu | Matlab | + | Ripley/Voronoi | + | + | – | [15] |
LAMA | Python | – | Ripley/DBSCAN | + | + | – | [16] |
3DClusterViSu | Matlab/Python | + | 3D Voronoi | + | + | + | [14] |
Grafeo | Matlab | + | Ripley/Voronoi | + | + | + | [17] |
SMoLR | R | + | KDE/DBSCAN | + | + | + | this paper |
Here, we present a versatile software package named SMoLR (Single Molecule Localization in R), that enables researchers to analyze large sets of single-molecule localization data in a quantitative way. The pointillist nature of the data gives possibilities for alternative types of analysis, for which the resourceful R programming language can be of great value [18]. With SMoLR we complement existing software, with a software package for analyzing larger data sets with localization data at once in the free open-source R environment.
Implementation
SMLM data consist of Cartesian coordinates of molecules and their respective precision along with all possible extra information that is desired in a specific experiment (i.e. time or frame of detection, channel, estimated number of photons detected etc.). The localization data together with these additional parameters can be imported into SMoLR in different formats obtained by different single-molecule localization software: ThunderSTORM [19], Zeiss ZEN software, SOSplugin [20] or plain text (Fig. 1). SMoLR is versatile and can be used in different ways, where one specifically useful way is to define Regions of Interest (ROIs) from the super-resolution images to analyze the organization of proteins in subcellular structures. Subsequently applying a single analysis to each ROI will result in quantitative information describing the distribution of proteins in a large number of structures.
Workflow
ROIs can be either manually or automatically selected in image analysis software such as ImageJ [21], the localization data of these ROIs can be imported in SMoLR (Fig. 1). Alternatively, ROIs can also be automatically selected using localization clustering functions in SMoLR. The localization data within the different ROIs is selected and stored in a list with localization data from the different ROIs. These objects can subsequently be analyzed by SMoLR at once, using single commands. To visually inspect the ROI data, we provide an interactive application which shows the ROIs in the full super-resolution image together with several statistical parameters (Additional file 1: Figure S1).
Visualization
SMLM data can be visualized in many ways. The most frequently used method is to plot Gaussian distributions for all localizations with standard deviations corresponding to the localization precision (Fig. 2a) [22]. However, with this method intensity values do not directly depend on the density of localizations, but also depend on localization precision. As an alternative approach we implemented a 2D-Kernel density estimation (KDE) method, in which the density of detections per area is normalized to the total number of localizations in the images (Fig. 2b). Therefore, this method is quantitative, making thresholding of the data at a given density of localizations per pixel possible. A third visualization method implemented in SMoLR is an adapted scatter plot that depicts the Cartesian coordinates and can add additional data using the size and color of the plotted points (Fig. 2c). This type of visualization can be used to easily assess the quality of the data and detect potential artefacts such as drift during image acquisition or incorrect grouping. Additionally, we provide a function that formats the single-molecule data in such a way that it can be used in the Spatial Point Pattern Analysis R package spatstat [23]. This opens up the possibility to also include spatstats’ wide range of visualization and clustering options in the analysis.
Clustering
Clustering of SMLM data is comparable to object segmentation in conventional image analysis. Similar to the analysis of objects from segmented images, features can be extracted from the clustered objects to describe the shape and spatial organization within the object. For SMLM data several different approaches for clustering have been proposed in literature, where some of the algorithms are useful to give a global description in the amount of observed clustering, such as Ripley’s K and its derivates, or the recently nonparametric descriptor, J0(r) for clustering density [24]. As previously mentioned, from within SMoLR, the R-package spatstat offers several of these clustering and correlation methods (Ripley-K function, linearized L-function and pair-correlation functions). However, in general, identification of individual clusters is preferred because this allows to analyze the size, shape and spatial distribution of the clusters. In SMoLR, multiple clustering algorithms are available. First, a clustering method based on the binary KDE image can be used to quantify the number of clusters in an image or region of interest (Fig. 2d). We incorporated functions from the EBImage package to calculate image features, such as shape and size, from single clusters [25]. These features together with descriptive statistics (number of localizations, mean position, mean precision, etc.) can be used to categorize individual clusters. Second, the Density Based Clustering Algorithm with Noise (DBSCAN) algorithm is integrated in SMoLR (Fig. 2e) [26, 27]. This frequently used algorithm allows clustering of data based on localization data only. From the defined clusters with localizations, statistics can be calculated such as the cluster area, convex hull and elongation. The earlier mentioned interactive application (Additional file 1: Fig. S1) at this point also allows to manually assess the features (obtained with KDE or DBSCAN clustering) within a data set. Additionally, all parameters can be used for exploration of the data set either manually or using multivariate analysis or machine learning algorithms. Although DBSCAN is able to define clusters and deal with noise, in literature alternative clustering algorithms have been proposed that work better for certain biological samples. Examples are Voronoi tessellation, Bayesian cluster identification and the use of a Gaussian-mixture model [13, 28–30]. A comparison of our KDE and DBSCAN implementations with clustering algorithms by Voronoi tessellation [13, 17] and Bayesian statistics [29] can be found in Additional file 2: Figure S2.
Particle averaging
Merging the localizations from a large number of individual SMLM images of single biological structures such as the nuclear pore complex, synaptonemal complex or viral particles proved to be a powerful tool to reconstruct ultrastructure [5, 31–33]. However, template free particle averaging is a computationally demanding procedure or requires expensive software [33]. Particle averaging also assumes that individual structures represent identical or at least highly similar structures. However, for some biological structures there might be quite some variation in the organization of the individual structures, although they can have certain features in common. We therefore implemented an alignment algorithm, as will be described below, based on extracted features from the individual images, which can be very informative to observe common features from the imaged structures.
Alignment of individual structures can be achieved using features that can be extracted with the SMoLR package (using pixel- or localization-based features). For example, the center of mass of clusters can be used to center the structures. In some cases, the clusters may have specific shapes that enable to rotate and overlay the individual ROIs. For example, elongated structures can be aligned using the major axis of the structure. The presence of multiple clusters within individual ROIs that can be distinguished from each other (for instance on the basis of shape, size or distance to the center of mass), provides another possibility to align structures by rotating the similar clusters towards the same point. The alignments can be averaged or overlaid, and subsequently used to visualize and extract common features from the individual images. This can be used to compare biological structures at different biological conditions or time points. Additionally, these alignments can reveal the relative location of different proteins within the structure, when aligning the structures using one protein as a reference.
The functions in SMoLR are developed based on 2D-localization data. However, 3D data can be visualized in the scatterplot of SMoLR visualizing the z-coordinate using color or size of the plotted points. In principle the DBSCAN algorithm is not limited to 2D data, however 3D clustering is not implemented directly in SMoLR.
Results
To show the use of SMoLR to analyze single-molecule localization data, we applied the functions of the SMoLR package on a previously published data set with images of proteins involved in DNA double strand break (DSB) repair [4]. Precise determination of spatiotemporal localization and organization of these proteins at the sites of damage and how these relate to specific and general protein functions can help to elucidate the mechanisms by which repair of the DSBs take place. In this example we examined two essential DSB repair proteins, the recombinase RAD51 and the tumor suppressor BRCA2. γ-Irradiated cells were immunostained for RAD51 and BRCA2 and imaged using direct stochastic optical reconstruction microscopy (dSTORM) [4]. Single foci were segmented and visualized using the three visualization techniques available in SMoLR (Fig. 2a-c). Subsequent clustering using KDE, DBSCAN and Voronoi tesselation (spatstat) (Fig. 2d-f) allowed for quantitative analysis of multiple foci including number of clusters per protein, per focus and cluster size versus number of localizations (Fig. 2g-h). These analyses can be extended using e.g. cluster shape, co-localization or relative distance between clusters.
In order to gain insight in the relative distribution of RAD51 and BRCA2 in DSBs we averaged their signal after alignment (centered and rotated) based on the elongated shape of the RAD51 clusters (Fig. 2i). This revealed a distinct pattern of protein distributions during DNA repair (explained in more detail in Sánchez et al., 2017).
Conclusions
Visualization and quantitative analysis of the localization of multiple proteins, below the diffraction limit, within macromolecular assemblies or small organelles, under different conditions and at multiple time points, provides the possibility to gain insight in the spatiotemporal organization of protein function during biological processes. In many situations, multiple similar structures are present within a cell and the recorded super-resolution image. By combining the presented methods and work flow to extract relevant features from the localization data, together with the powerful statistics available in R, it is possible to explore the variation in structures, determine common features describing the structures while at the same time comparing different conditions or proteins. Using feature-based alignment and rotational analysis these observed structural organizations can be verified, visualized and combined with simulations to get more insight. Altogether, the workflow presented in our SMoLR package allows researchers to delve deeper into their single-molecule localization data, beyond conventional image analysis.
Availability and requirements
Project name: SMoLR
Project home page: https://github.com/ErasmusOIC/SMoLR
Operating system(s): Platform independent
Programming language: R
Other requirements: R 3.4.0 or higher
License: LGPLv3
Any restrictions to use by non-academics: no.
Additional files
Acknowledgements
We would like to thank prof. dr. Claire Wyman and dr. Ihor Smal for helpful discussions.
Funding
This work has been supported by NWO-CW ECHO 104126 and STW Nanoscopy program.
Availability of data and materials
Software is available online at https://github.com/ErasmusOIC/SMoLR and additional example data https://github.com/ErasmusOIC/SMoLR_data .
The data sets analyzed are described in Sanchez et al. [24] are available from the corresponding author on request.
Abbreviations
- DBSCAN
Density-based spatial clustering of applications with noise
- DSB
Double strand break
- dSTORM
Direct stochastic optical reconstruction microscopy
- KDE
Kernel density estimation
- ROI
Region of interest
- SMLM
Single-molecule localization microscopy
- SMoLR
Single-molecule localization in R
Authors’ contributions
MWP, JAS and HMG developed the software. MWP, JAS, HMG, ZL, WMB, WAC, ABH gave input for software. JAS and ABH supervised the project. All authors read and approved the final manuscript.
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Maarten W. Paul, Email: m.w.paul@erasmusmc.nl
H. Martijn de Gruiter, Email: h.degruiter@erasmusmc.nl.
Zhanmin Lin, Email: z.lin@erasmusmc.nl.
Willy M. Baarends, Email: w.baarends@erasmusmc.nl
Wiggert A. van Cappellen, Email: w.vancappellen@erasmusmc.nl
Adriaan B. Houtsmuller, Email: a.houtsmuller@erasmusmc.nl
Johan A. Slotman, Email: j.slotman@erasmusmc.nl
References
- 1.Betzig E, Patterson GH, Sougrat R, Lindwasser OW, Olenych S, Bonifacino JS, et al. Imaging intracellular fluorescent proteins at nanometer resolution. Science. 2006;313:1642–1645. doi: 10.1126/science.1127344. [DOI] [PubMed] [Google Scholar]
- 2.Rust MJ, Bates M, Zhuang X. Sub-diffraction-limit imaging by stochastic optical reconstruction microscopy (STORM) Nat Methods. 2006;3:793–795. doi: 10.1038/nmeth929. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Reid DA, Keegan S, Leo-Macias A, Watanabe G, Strande NT, Chang HH, et al. Organization and dynamics of the nonhomologous end-joining machinery during DNA double-strand break repair. Proc Natl Acad Sci U S A. 2015;112:E2575–E2584. doi: 10.1073/pnas.1420115112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Sánchez H, Paul MW, Grosbart M, van Rossum-Fikkert SE, Lebbink JHG, Kanaar R, et al. Architectural plasticity of human BRCA2–RAD51 complexes in DNA break repair. Nucleic Acids Res. 2017;45:4507–4518. doi: 10.1093/nar/gkx084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Szymborska A, de Marco A, Daigle N, Cordes VC, Briggs JAG, Ellenberg J. Nuclear pore scaffold structure analyzed by super-resolution microscopy and particle averaging. Science. 2013;341:655–658. doi: 10.1126/science.1240672. [DOI] [PubMed] [Google Scholar]
- 6.Rossier O, Octeau V, Sibarita J-B, Leduc C, Tessier B, Nair D, et al. Integrins β1 and β3 exhibit distinct dynamic nanoscale organizations inside focal adhesions. Nat Cell Biol. 2012;14:1057–1067. doi: 10.1038/ncb2588. [DOI] [PubMed] [Google Scholar]
- 7.Laine RF, Albecka A, van de Linde S, Rees EJ, Crump CM, Kaminski CF. Structural analysis of herpes simplex virus by optical super-resolution imaging. Nat Commun. 2015;6:5980. doi: 10.1038/ncomms6980. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Dani A, Huang B, Bergan J, Dulac C, Zhuang X. Superresolution imaging of chemical synapses in the brain. Neuron. 2010;68:843–856. doi: 10.1016/j.neuron.2010.11.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Sage D, Kirshner H, Pengo T, Stuurman N, Min J, Manley S, et al. Quantitative evaluation of software packages for single-molecule localization microscopy. Nat Methods. 2015;12. 10.1038/nmeth.3442. [DOI] [PubMed]
- 10.Pengo T, Holden SJ, Manley S. PALMsiever: a tool to turn raw data into results for single-molecule localization microscopy. Bioinformatics. 2014;31:797–798. doi: 10.1093/bioinformatics/btu720. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.El Beheiry M, Dahan M. ViSP: representing single-particle localizations in three dimensions. Nat Methods. 2013;10:689–690. doi: 10.1038/nmeth.2566. [DOI] [PubMed] [Google Scholar]
- 12.Crossman DJ, Hou Y, Jayasinghe I, Baddeley D, Soeller C. Combining confocal and single molecule localisation microscopy: a correlative approach to multi-scale tissue imaging. Methods. 2015;88:98–108. doi: 10.1016/j.ymeth.2015.03.011. [DOI] [PubMed] [Google Scholar]
- 13.Levet F, Hosy E, Kechkar A, Butler C, Beghin A, Choquet D, et al. SR-Tesseler: a method to segment and quantify localization-based super-resolution microscopy data. Nat Methods. 2015;12:1065–1071. doi: 10.1038/nmeth.3579. [DOI] [PubMed] [Google Scholar]
- 14.Andronov L, Michalon J, Ouararhni K, Orlov I, Hamiche A, Vonesch J-L, et al. 3DClusterViSu: 3D clustering analysis of super-resolution microscopy data by 3D Voronoi tessellations. Bioinformatics. 2018;34:3004–12. 10.1093/bioinformatics/bty200. [DOI] [PubMed]
- 15.Andronov L, Lutz Y, Vonesch JL, Klaholz BP. SharpViSu: integrated analysis and segmentation of super-resolution microscopy data. Bioinformatics. 2016;32:2239–2241. doi: 10.1093/bioinformatics/btw123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Malkusch S, Heilemann M. Extracting quantitative information from single-molecule super-resolution imaging data with LAMA – LocAlization microscopy analyzer. Sci Rep. 2016;6:34486. doi: 10.1038/srep34486. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Haas KT, Lee M, Esposito A, Venkitaraman AR. Single-molecule localization microscopy reveals molecular transactions during RAD51 filament assembly at cellular DNA damage sites. Nucleic Acids Res. 2018:1–19. 10.1093/nar/gkx1303. [DOI] [PMC free article] [PubMed]
- 18.R Core Team . R: A Language and Environment for Statistical Computing. 2017. [Google Scholar]
- 19.Ovesny M, Křižek P, Borkovec J, Svindrych Z, Hagen GM. ThunderSTORM: a comprehensive ImageJ plugin for PALM and STORM data analysis and super-resolution imaging. Bioinformatics. 2014:1–2. 10.1093/bioinformatics/btu202. [DOI] [PMC free article] [PubMed]
- 20.Reuter M, Zelensky A, Smal I, Meijering E, van Cappellen WA, de Gruiter HM, et al. BRCA2 diffuses as oligomeric clusters with RAD51 and changes mobility after DNA damage in live cells. J Cell Biol. 2014;207:599–613. doi: 10.1083/jcb.201405014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez J-Y, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nat Methods. 2012;9(7):676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Nieuwenhuizen RPJ, Lidke KA, Bates M, Puig DL, Grünwald D, Stallinga S, et al. Measuring image resolution in optical nanoscopy. Nat Methods. 2013;10:557–562. doi: 10.1038/nmeth.2448. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Baddeley A, Turner R. spatstat: An R Package for Analyzing Spatial Point Patterns. J Stat Softw. 2005;12. 10.18637/jss.v012.i06.
- 24.Jiang S, Park S, Challapalli SD, Fei J, Wang Y. Robust nonparametric quantification of clustering density of molecules in single-molecule localization microscopy. PLoS One. 2017;12:1–15. doi: 10.1371/journal.pone.0179975. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Pau G, Fuchs F, Sklyar O, Boutros M, Huber W. EBImage--an R package for image processing with applications to cellular phenotypes. Bioinformatics. 2010;26:979–981. doi: 10.1093/bioinformatics/btq046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Ester M, Kriegel HP, Sander J, Xu X. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. Second Int Conf Knowl Discov Data Min. 1996:226–31.
- 27.Hahsler M. dbscan: Density Based Clustering of Applications with Noise (DBSCAN) and Related Algorithms. 2015. [Google Scholar]
- 28.Andronov L, Orlov I, Lutz Y, Vonesch J-L, Klaholz BP. ClusterViSu, a method for clustering of protein complexes by Voronoi tessellation in super-resolution microscopy. Sci Rep. 2016;6:24084. doi: 10.1038/srep24084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Rubin-Delanchy P, Burn GL, Griffié J, Williamson DJ, Heard NA, Cope AP, et al. Bayesian cluster identification in single-molecule localization microscopy data. Nat Methods. 2015;12:1072–1076. doi: 10.1038/nmeth.3612. [DOI] [PubMed] [Google Scholar]
- 30.Deschout H, Platzman I, Sage D, Feletti L, Spatz JP, Radenovic A. Investigating focal adhesion substructures by localization microscopy. Biophys J. 2017;113:2508–2518. doi: 10.1016/j.bpj.2017.09.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Van Engelenburg SB, Shtengel G, Sengupta P, Waki K, Jarnik M, Ablan SD, et al. Distribution of ESCRT machinery at HIV assembly sites reveals virus scaffolding of ESCRT subunits. Science. 2014;343:653–656. doi: 10.1126/science.1247786. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Schücker K, Holm T, Franke C, Sauer M, Benavente R. Elucidation of synaptonemal complex organization by super-resolution imaging with isotropic resolution. Proc Natl Acad Sci. 2015;112:2029–2033. doi: 10.1073/pnas.1414814112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Salas D, Le Gall A, Fiche J-B, Valeri A, Ke Y, Bron P, et al. Angular reconstitution-based 3D reconstructions of nanomolecular structures from superresolution light-microscopy images. Proc Natl Acad Sci. 2017;:201704908. doi:10.1073/pnas.1704908114. [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Software is available online at https://github.com/ErasmusOIC/SMoLR and additional example data https://github.com/ErasmusOIC/SMoLR_data .
The data sets analyzed are described in Sanchez et al. [24] are available from the corresponding author on request.