Abstract
Motivation
Classification of images is an essential task in higher-level analysis of biological data. By bypassing the diffraction limit of light, super-resolution microscopy opened up a new way to look at molecular details using light microscopy, producing large amounts of data with exquisite spatial detail. Statistical exploration of data usually needs initial classification, which is up to now often performed manually.
Results
We introduce nanoTRON, an interactive open-source tool, which allows super-resolution data classification based on image recognition. It extends the software package Picasso with the first deep learning tool with a graphic user interface.
Availability and implementation
nanoTRON is written in Python and freely available under the MIT license as a part of the software collection Picasso on GitHub (http://www.github.com/jungmannlab/picasso). All raw data can be obtained from the authors upon reasonable request.
Contact
jungmann@biochem.mpg.de
Supplementary information
Supplementary data are available at Bioinformatics online.
1 Introduction
Super-resolution fluorescence microscopy allows researchers to visualize structures and dynamics below the classical diffraction limit of light (Sahl et al., 2017). Stochastic super-resolution techniques use switching of fluorescent molecules between so-called dark and bright states in combination with single-molecule localization. The switching creates an apparent blinking of target molecules, which is recorded in a movie and fitted with sub-diffraction precision in post-processing, and the resulting spatial coordinates of localized fluorophores are combined into two-dimensional (2D) histograms to render a super-resolution image (Sauer and Heilemann, 2017). DNA-PAINT (Jungmann et al., 2010) uses the transient binding of dye-labeled DNA oligonucleotides (called ‘imager’ strands, freely diffusing in solution) to their target-bound complementary strands (called ‘docking’ strands) to create the necessary target ‘blinking’ for super-resolution (Fig. 1). DNA-PAINT is part of a large variety of techniques, which are enabled by the use of programmable interactions of DNA molecules: DNA Nanotechnology (Ramezani and Dietz, 2019). One of the most prominent approaches in structural DNA Nanotechnology is undoubtedly DNA origami (Rothemund, 2006). Here, a long single-stranded DNA molecule is ‘folded’ via self-assembly into almost arbitrary shapes and patterns using hundreds of short oligonucleotides. DNA origami enables the manufacturing of millions of nanoscopic structures with nanometer precision in a highly controlled and parallel fashion. These very properties of DNA origami structures and their nanoscale dimensions have led to a symbiotic relationship with super-resolution approaches: DNA origami either serves as tested for assaying new super-resolution approaches (Balzarotti et al., 2017; Jungmann et al., 2016; Schueder et al., 2019; Steinhauer et al., 2009), or super-resolution is used to characterize properties of DNA nanostructures (Johnson-Buck et al., 2013; Strauss et al., 2018). Super-resolution instrumentation, probe design and sample preparation methods are progressing at a rapid pace, enabling cost-efficient, molecular-scale resolution on a routine basis (Auer et al., 2018). Data analysis and post-processing software, however, are currently somewhat lacking behind, in most cases often still exclusively focusing on spot-detection and subsequent binning of localizations to visualize super-resolution data (Sage et al., 2019). Especially more advanced yet increasingly essential post-processing tasks such as particle classification in super-resolution data is often still performed manually. Only recently, the super-resolution community turned their attention to more automated as well as machine learning- and neuronal network-based analysis approaches (Belthangady and Royer, 2019; Danial and Garcia-Saez, 2019; Ouyang et al., 2018; von Chamier et al., 2019). Advances in deep learning are promising for the automation of algorithmic workflows such as detecting specific shapes or pattern, e.g. recognizing handwritten digits (Lecun et al., 1998). This is particularly exciting in the context of super-resolution microscopy applied to the ever-increasing complexity of DNA origami-based assays (Blanchard and Salaita, 2019). By combining super-resolution microscopy, DNA nanotechnology and deep learning, we here present a new software module, termed nanoTRON.
2 Implementation
nanoTRON was implemented in Python (v3.7 and higher) as a component of the Picasso software suite (Schnitzbauer et al., 2017). It deploys the multi-layer perceptron (MLP) of the Python machine learning framework sci-kit learn (Pedregosa et al., 2011). The software combines two of the most important workflows for model-based neural network-assisted data analysis: (i) user-friendly setup and training of artificial neural networks, (ii) classification and export of predicted data for subsequent analysis in a plug-and-play manner, see Supplementary Text S1 and S4. Super-resolution data sets can be loaded into nanoTRON for immediate classification and export, Supplementary Figure S2. The software allows the training of models for classifying of arbitrary patterns via the module ‘Train Model’ (Supplementary Fig. S2). Super-resolution data can be loaded, annotated and converted to 2D super-resolution images — gray-scale images — with a defined resolution (‘oversampling’, see Supplementary Figs S3 and S9). By rotation of every image in multiple steps, the training set can be augmented (Supplementary Fig. S3). nanoTRON supports MLPs up to three hidden layers. For the evaluation of the trained network, nanoTRON uses a train-test data split of 30% of the training set. The learning curve of the training and the confusion matrix generated from the test set visualize the performance of the trained neural network (Supplementary Fig. S2). An exemplary application with DNA origami (Supplementary Figs S1, S4–S7) is described in Supplementary Text S2. Additionally, we included a biological application with DNA origami and the nuclear pore complex (Schlichthaerle et al., 2019; Thevathasan et al., 2019) described in Supplementary Text S3 and visualized with Supplementary Figs S8 and S12.
3 Outlook
nanoTRON enables plug-and-play classification of super-resolution data using deep learning of arbitrary nanoscopic pattern. We expect nanoTRON to serve as important tool in the Picasso software collection, which due to the user-friendly design brings deep learning closer to biological researchers. We see nanoTRON as an instrument, which boosts the analysis of highly multiplexed biophysical assays, where e.g. automated detection and analysis of a plethora of barcoded structures (Lin et al., 2012) for high-content and high-throughput studies would become feasible.
Supplementary Material
Acknowledgements
We thank the Ries and Ellenberg Labs at EMBL for providing the GFP-modified cell line. We thank Florian Schueder for fruitful discussions. A.A. and S.S. acknowledge support from the DFG through the Graduate School of Quantitative Biosciences Munich (QBM).
Funding
This work was supported in part by the German Research Foundation (DFG) through the Emmy Noether Program [DFG JU2957/1-1], the Collaborative Research Center (SFB) 1032 [Nanoagents for spatiotemporal control of molecular and cellular reactions, Project A11], the European Research Council through an ERC Starting Grant [MolMap, Grant agreement number 680241] and an ERC Proof of Concept Grant [Resolve, Grant agreement number 790594] and the Max Planck Society.
Conflict of Interest: none declared.
References
- Auer A. et al. (2018) Nanometer-scale multiplexed super-resolution imaging with an economic 3D-DNA-PAINT microscope. Chemphyschem, 19, 3024–3034. [DOI] [PubMed] [Google Scholar]
- Balzarotti F. et al. (2017) Nanometer resolution imaging and tracking of fluorescent molecules with minimal photon fluxes. Science, 355, 606–612. [DOI] [PubMed] [Google Scholar]
- Belthangady C., Royer L.A. (2019) Applications, promises, and pitfalls of deep learning for fluorescence image reconstruction. Nat. Methods, 16, 1215–1225. [DOI] [PubMed] [Google Scholar]
- Blanchard A.T., Salaita K. (2019) Emerging uses of DNA mechanical devices. Science, 365, 1080–1081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Danial J.S.H., Garcia-Saez A.J. (2019) Quantitative analysis of super-resolved structures using ASAP. Nat. Methods, 16, 711–714. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Johnson-Buck A. et al. (2013) Multifactorial modulation of binding and dissociation kinetics on two-dimensional DNA nanostructures. Nano Lett., 13, 2754–2759. [DOI] [PubMed] [Google Scholar]
- Jungmann R. et al. (2010) Single-molecule kinetics and super-resolution microscopy by fluorescence imaging of transient binding on DNA origami. Nano Lett., 10, 4756–4761. [DOI] [PubMed] [Google Scholar]
- Jungmann R. et al. (2016) Quantitative super-resolution imaging with qPAINT. Nat. Methods, 13, 439–442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lecun Y. et al. (1998) Gradient-based learning applied to document recognition. Proc. IEEE, 86, 2278–2324. [Google Scholar]
- Lin C. et al. (2012) Submicrometre geometrically encoded fluorescent barcodes self-assembled from DNA. Nat. Chem., 4, 832–839. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ouyang W. et al. (2018) Deep learning massively accelerates super-resolution localization microscopy. Nat. Biotechnol., 36, 460–468. [DOI] [PubMed] [Google Scholar]
- Pedregosa F. et al. (2011) Scikit-learn: machine learning in Python. J. Mach. Learn. Res., 12, 2825–2830. [Google Scholar]
- Ramezani H., Dietz H. (2019) Building machines with DNA molecules. Nat. Rev. Genet. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rothemund P.W.K. (2006) Folding DNA to create nanoscale shapes and patterns. Nature, 440, 297–302. [DOI] [PubMed] [Google Scholar]
- Sage D. et al. (2019) Super-resolution fight club: assessment of 2D and 3D single-molecule localization microscopy software (vol 16, pg 387, 2019). Nat. Methods, 16, 561–561. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sahl S.J. et al. (2017) Fluorescence nanoscopy in cell biology. Nat. Rev. Mol. Cell Biol., 18, 685–701. [DOI] [PubMed] [Google Scholar]
- Sauer M., Heilemann M. (2017) Single-molecule localization microscopy in eukaryotes. Chem. Rev., 117, 7478–7509. [DOI] [PubMed] [Google Scholar]
- Schlichthaerle T. et al. (2019) Direct visualization of single nuclear pore complex proteins using genetically-encoded probes for DNA-PAINT. Angew Chem. Int. Edit., 58, 13004–13008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schnitzbauer J. et al. (2017) Super-resolution microscopy with DNA-PAINT. Nat. Protoc., 12, 1198–1228. [DOI] [PubMed] [Google Scholar]
- Schueder F. et al. (2019) An order of magnitude faster DNA-PAINT imaging by optimized sequence design and buffer conditions. Nat. Methods, 16, 1101–1104. [DOI] [PubMed] [Google Scholar]
- Steinhauer C. et al. (2009) DNA origami as a nanoscopic ruler for super-resolution microscopy. Angew Chem. Int. Ed. Engl., 48, 8870–8873. [DOI] [PubMed] [Google Scholar]
- Strauss M.T. et al. (2018) Quantifying absolute addressability in DNA origami with molecular resolution. Nat. Commun., 9, 1600. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thevathasan J.V. et al. (2019) Nuclear pores as versatile reference standards for quantitative superresolution microscopy. Nat. Methods, 16, 1045–1053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- von Chamier L. et al. (2019) Artificial intelligence for microscopy: what you should know. Biochem. Soc. Trans., 47, 1029–1040. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.