Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2021 Aug 11;11:16289. doi: 10.1038/s41598-021-95593-4

Deep learning-based single-shot phase retrieval algorithm for surface plasmon resonance microscope based refractive index sensing application

Kitsada Thadson 1, Sarinporn Visitsattapongse 1, Suejit Pechprasarn 2,3,
PMCID: PMC8357982  PMID: 34381103

Abstract

A deep learning algorithm for single-shot phase retrieval under a conventional microscope is proposed and investigated. The algorithm has been developed using the context aggregation network architecture; it requires a single input grayscale image to predict an output phase profile through deep learning-based pattern recognition. Surface plasmon resonance imaging has been employed as an example to demonstrate the capability of the deep learning-based method. The phase profiles of the surface plasmon resonance phenomena have been very well established and cover ranges of phase transitions from 0 to 2π rad. We demonstrate that deep learning can be developed and trained using simulated data. Experimental validation and a theoretical framework to characterize and quantify the performance of the deep learning-based phase retrieval method are reported. The proposed deep learning-based phase retrieval performance was verified through the shot noise model and Monte Carlo simulations. Refractive index sensing performance comparing the proposed deep learning algorithm and conventional surface plasmon resonance measurements are also discussed. Although the proposed phase retrieval-based algorithm cannot achieve a typical detection limit of 10–7 to 10–8 RIU for phase measurement in surface plasmon interferometer, the proposed artificial-intelligence-based approach can provide at least three times lower detection limit of 4.67 × 10–6 RIU compared to conventional intensity measurement methods of 1.73 × 10–5 RIU for the optical energy of 2500 pJ with no need for sophisticated optical interferometer instrumentation.

Subject terms: Optical sensors, Imaging and sensing, Microscopy, Interference microscopy, Nanophotonics and plasmonics

Introduction

Surface plasmon resonance (SPR) is a resonant oscillation of electrons on the surface of noble metals when the metal surface is illuminated by p-polarized light momentum matched to the resonant condition. For biosensing applications, the Kretschmann configuration1, as shown in Fig. 1a, is usually employed. The system consists of a glass prism, a coherent light source in red light or infrared wavelength, index matching oil, and a 40–50 nm gold-coated coverslip. The coupling between the incident light and the SPR appears as a dark band in the reflectance spectrum due to the coupling process’s loss mechanisms2. The minimum intensity position in reflectance spectra is called plasmonic dip, and the incident angle corresponding to the plasmonic dip position is called plasmonic angle, θp. The SPR is sensitive to the surrounding medium at around 200 nm from the metal surface due to its evanescent wave3. The SPR has been utilized and employed in a wide range of biomedical applications, including biomolecular interactions monitoring4,5, immunoassays6, DNA hybridization7, voltage sensing across thin membranes8, and binding kinetics9,10.

Figure 1.

Figure 1

(a) Schematic diagram of Kretschmann configuration, (b) simulated reflectance for SPR gold sensors with thicknesses dm of 30 nm, 35 nm, 40 nm, 45 nm, and 50 nm, and (c) phase responses in rad of the SPR sensors, when they were illuminated by p-polarized light at 633 nm. Solid curves for sample refractive index of 1.33 (water) and dashed curves for sample refractive index of 1.37 (liquid bovine serum albumin protein)11.

Figure 1b,c shows simulated SPR reflectance spectra and corresponding phase responses of gold thin films with different thicknesses when the gold films were illuminated with p-polarized light at 633 nm wavelength calculated using Fresnel’s equations and transfer matrix approach. It has been established that the SPR phase-detection provides a lower detection limit of 10–7 to 10–8 refractive index unit (RIU) compared to SPR intensity detection12 due to 3 reasons (1) the phase has a steeper response than the amplitude or intensity, as shown in Fig. 1c. (2) The phase is more tolerant to noise than the amplitude, and (3) the phase-detection allows easier signal processing. However, an interferometer is required to make a phase measurement13. Although optical interferometers provide a more sensitive measurement14, they require a more sophisticated and precise optical alignment15 and a well-controlled environment16, such as vibration isolation13, reference channel3, and temperature control unit17.

Recently, computational phase retrieval algorithms (PR) have been of interest to the optical science community. The phase retrieval algorithms18 have been adopted from X-ray phase retrieval19. The PR has opened up opportunities for new imaging modalities20, such as computational microscopy21,22 and superresolution microscopy23. Furthermore, the PR does not require an interferometric configuration. Therefore, this provides an opportunity to overcome challenges in optical interferometry and apply phase retrieval algorithms in optical phase imaging and optical phase measurement for sensing applications24.

Another recent advancement in computational imaging is deep learning (DL), utilizing artificial neural networks to improve several imaging techniques, such as classification25, segmentation26, and regression27. These data-driven algorithms have been used in optical phase imaging28 and superresolution microscopy29. However, there is still no quantitative measure to assess the performance and reliability of the phase or data recovered from the DL30.

Here, we propose a deep learning-based phase retrieval method with experimental verification for conventional optical microscope configuration and a theoretical framework to quantify output phases from the proposed method and compare its performance with measurement techniques in the literature. The SPR has been employed as an example for phase retrieval. The phase responses of the SPR are established3,31, and they cover a wide range of phase gradients, phase shifts from below 2π to 2π rad, phase positions, and phase transition directions due to the loss mechanism of the SPR2, as shown in Fig. 1c. Simulated data for different optical detection schemes are employed to (1) train the proposed DL-based phase retrieval method and (2) quantify phase profile, analyze the proposed DL-based phase retrieval method’s performance, and (3) compare the performance with well-known methods in the literature using the Monte Carlo-based shot noise model.

This paper demonstrates that optical phase imaging can be achieved through deep learning using pattern recognition of a single-shot intensity image captured under a conventional microscope configuration with no need for sophisticated interferometer instrumentation and a computational retrieval algorithm. To the best of the author’s knowledge, systematically quantifying the DL-based phase retrieval method and experimental validation of the proposed DL-based method have never been reported before in the literature.

Materials and methods

Surface plasmon microscope

The SPR phenomenon can be applied as an imaging technique called SPR microscopy (SPRM)2,32,33 or SPR imaging (SPRI)3436, as depicted in Fig. 2. The method is used for binding10 or sensing events monitoring on the other side of the metallic surface with high sensitivity. This method provides real-time detection and a label-free technique.

Figure 2.

Figure 2

Modified optical microscope for SPR phase imaging.

The phase information can provide higher sensitivity and high resolution based on optical interferometry to improve the imaging technique37,38. The sample defocus technique15 is also employed to measure the phase of the SPR through an embedded optical interferometer.

The conventional microscope system can be modified to image several planes, including the image plane and the Fourier plane. Figure 2 shows a modified SPR microscope system employed in this study to quantify the performance of the proposed DL-based method. In addition, the microscope has been implemented to capture BPF images to validate the proposed network and ensure that it is also feasible to recover the phase profile in the actual experimental context, although the network was trained using the simulated data. The optical system consists of a coherent light source of the He–Ne laser at 633 nm wavelength (10 mW He–Ne laser Thorlabs), an oil immersion high numerical aperture objective (1.49NA Olympus), the gold thin film sample prepared for 46 nm of uniform gold layer coated on a BK7 glass substrate coverslip (No.0 Sigma-Aldrich) using electron beam sputterer, one 50:50 beam splitter (non-polarizing beam splitter Thorlabs), lenses (Aspherical doublet lenses Thorlabs) for beam expansion and projection, tube lens (180 mm focal length Thorlabs), the polarizer (Thorlabs) for selecting the polarization in x-axis and y-axis, and two CCD cameras (8051M-USB Thorlabs). A range of incident angles sinθ0 illuminates a uniform SPR gold sensor on a glass substrate provided by an oil immersion objective lens with a 1.49 numerical aperture (NA), n0sinθ0. The first camera is for capturing a reference optical intensity in the microscope. The other camera images the back focal plane (BFP) of the objective lens.

Context aggregation network

Context Aggregation Network (CAN) architecture39 was employed for phase retrieval in this study. Generally, CAN is applied for image processing operations, such as denoising40, superresolution41, deblurring42, and image filtering processing43. The CAN network architecture shown in Fig. 3 has the input size the same as the output, and its hidden layers are highly adaptable.

Figure 3.

Figure 3

Shows the CAN network with a single BFP image input (256 pixels × 256 pixels), one BFP phase output (256 pixels × 256 pixels), and hidden layers.

The CAN network has been implemented for SPR phase retrieval with a single BFP image input (256 pixels × 256 pixels) and one BFP phase output (256 pixels × 256 pixels) shown in Fig. 3. Note that the phase in the BFP was retrieved instead of the image plane; this is to compare the SPR phase profile with conventional SPR measurement techniques reported in the literature, in which the SPR reflectance dip appears in the BFP. Of course, the proposed DL phase retrieval is not limited to only the BFP, but it is also applicable to the image plane. The completed details of the implemented CAN network are listed in Table 1 below.

Table 1.

Specification of the CAN64.

Layer Activations Learnable variables Descriptions
Image input 256 × 256 × 1 256 × 256 × 1 image
Convolutional 256 × 256 × 64 Weights 3 × 3 × 1 × 64, Bias 1 × 1 × 64 1 padding, 1 stride
Adaptive normalization Offset 1 × 1 × 64, Scale 1 × 1 × 64
Leaky ReLU Scale 0.2
Convolutional Weights 3 × 3 × 64 × 64, Bias 1 × 1 × 64 2 padding, 1 stride, 2 dilation
Adaptive normalization Offset 1 × 1 × 64, Scale 1 × 1 × 64
Leaky ReLU Scale 0.2
Convolutional Weights 3 × 3 × 64 × 64, Bias 1 × 1 × 64 4 padding, 1 stride, 4 dilation
Adaptive normalization Offset 1 × 1 × 64, Scale 1 × 1 × 64
Leaky ReLU Scale 0.2
Convolutional Weights 3 × 3 × 64 × 64, Bias 1 × 1 × 64 8 padding, 1 stride, 8 dilation
Adaptive normalization Offset 1 × 1 × 64, Scale 1 × 1 × 64
Leaky ReLU Scale 0.2
Convolutional Weights 3 × 3 × 64 × 64, Bias 1 × 1 × 64 16 padding, 1 stride, 16 dilation
Adaptive normalization 256 × 256 × 64 Offset 1 × 1 × 64, Scale 1 × 1 × 64
Leaky ReLU Scale 0.2
Convolutional Weights 3 × 3 × 64 × 64, Bias 1 × 1 × 64 32 padding, 1 stride, 32 dilation
Adaptive normalization Offset 1 × 1 × 64, Scale 1 × 1 × 64
Leaky ReLU Scale 0.2
Convolutional Weights 3 × 3 × 64 × 64, Bias 1 × 1 × 64 64 padding, 1 stride, 64 dilation
Adaptive normalization Offset 1 × 1 × 64, Scale 1 × 1 × 64
Leaky ReLU Scale 0.2
Convolutional Weights 3 × 3 × 64 × 64, Bias 1 × 1 × 64 128 padding, 1 stride, 128 dilation
Adaptive normalization Offset 1 × 1 × 64, Scale 1 × 1 × 64
Leaky ReLU Scale 0.2
Convolutional Weights 3 × 3 × 64 × 64, Bias 1 × 1 × 64 1 padding, 1 stride
Adaptive normalization Offset 1 × 1 × 64, Scale 1 × 1 × 64
Leaky ReLU Scale 0.01
Convolutional 256 × 256 × 1 Weights 1 × 1 × 64, Bias 1 × 1 0 padding, 1 stride
Regression Mean square error

The CAN uses an adaptive normalizer that can adapt its weights and biases. It can be computed with the adaptive momentum (Adam) optimizer44. The CAN model has ten layers; layers 1 to 8 are the dilation filters and padding functions, which their sizes increase exponentially. The output from each layer will be the same size. Layer 8 is for 128 dilation and padding functions meaning that the size of the receptive field in this layer is equal to the input image dimension. Layer 9 is for a 3 × 3 filter size, one dilation filter, and one padding function. The last convolutional layer transforms the output to the same channel size as the input using the convolutional layer with a 1 × 1 filter size and zero padding function. The last layer is the regression layer providing the predicted phase output.

The CAN model is trained under the MATLAB2019c environment with a single GPU NVIDIA GeForce GTX 1050. The network was trained with a 0.0001 learning rate and a training iteration of 50 epochs.

Simulated dataset for training, validation, and testing

Here, simulated data were employed to train the CAN network. It will be shown later that the trained network can be employed to analyze experimental data and provide the expected phase profile. The dataset simulation is based on Fresnel’s equation31 and the transfer matrix approach31,45, calculating the reflection and transmission coefficients (Fig. S1). Materials’ parameters, including the gold refractive index (nm) and the thickness (dm), the sample refractive index (ns), were varied to generalize the SPR phenomenon for network training accommodating for errors and discrepancies in experimental measurements. The parameters consist of random gold film thickness in a range of 20 to 60 nm (Fig. S2), random incident wavelength in a range of 550 to 650 nm, the refractive index of the gold46 nm of 0.18 + 3.44i with ± 10% error in real part and imaginary part, and random refractive index of the sample ns in a range of 1.0 to 1.4 as labeled in Fig. 2. The simulated BFP was cropped to only one quadrant (256 × 256 pixels), as shown in Fig. 4. All four quadrants carry redundant information due to symmetry in the BFP of the uniform sample. There were 1000 input and output image pairs in each dataset for training and validation. The dataset was further separated to 90% and 10% ratio for training and validation, respectively.

Figure 4.

Figure 4

Simulated data for n0 of 1.52, nm of 0.18 + 3.44i, dm of 45 nm, ns of 1.00, and l0 of 633 nm for (a) full BFP amplitude image (512 × 512 pixels) before cropping, (b) BFP amplitude input (256 pixels × 256 pixels), and (b) BFP phase output (256 pixels × 256 pixels).

The dataset consists of 2 types of simulated images, including the BFP image input data and the corresponding phase output of the BFP. The phase profiles of the BFP were employed as the label for supervised learning.

The dataset preparation process is shown in Fig. 5. Firstly, the amplitude of BFP and phase of BFP were computed using the Fresnel equations and the transfer matrix approach. These images were then read during CAN network training and validation.

Figure 5.

Figure 5

Dataset preparation flowchart.

For testing the networks, the gold thicknesses of 30 nm, 40 nm, 45 nm, and 50 nm, the incident wavelength of 633 nm, and the sample refractive index range from 1.00 (air) to 1.372 (liquid BSA-protein11) were excluded from the dataset for training and validation to test the performance of the trained network.

Monte Carlo based shot noise model

To estimate the phase noise of the BPF phase profile recovered from the DL-based method, the number of photons with associated shot noise in each camera pixel. The other noise sources, such as white background noise and interferences, were excluded from the consideration. From an electronic point of view, achieving the shot-noise limit is achievable, and the model can be a general performance indicator17. The shot noise model17 was employed to model the noise level using the Poisson distribution47. Shot noise occurs when a pixel of an imaging sensor is measured at a low light level, such as around the plasmonic dip. The shot noise is the baseline for low light intensity measurements, and it is a good indicator for the sensitivity and detection limits of measurements. The shot noise level is described by a square root of the energy detected in a digital camera pixel (E), and the signal to noise ratio of the photon energy captured (E) is also the square root of the photon energy captured (E)17. The test dataset explained in the “Materials and methods” section was shot noise added based on varied total photon energy in each image. The total energy of the image was varied from 90 to 2600 pJ to test the performance of the DL-based phase retrieval and the SPR dip measurement techniques. Note that camera quantum efficiency of 60% was taken into account in the shot noise computation17; the quantum efficiency is a typical value for a standard CCD camera48. Monte Carlo simulation49 was implemented to quantify the mean and variance of the SPR measurements for different noise levels and measurement techniques.

SPR measurement techniques

Deep learning-based method

The trained DL network was applied to recover the one quadrant phase image, then a phase line-scan (ϕ) across the pure p-polarization as shown in Fig. 6. was employed in the later step for determining the plasmonic angle, θp. Local gradients dϕ/dn0sinθ0 of the line scan were then computed, and the maximum local gradient position was identified. The polynomial degree 3 curve fitting was then computed around the highest dϕ/dn0sinθ0 gradients to locate the plasmonic angle from the retrieved phase profile. Note that for one BFP image, each quadrant of the BFP can be phase retrieved separately, leading to 4 output phase profiles, in which the plasmonic angles from the 4 phase outputs were then averaged and stored to compare with other measurement methods as shown in Fig. 6.

Figure 6.

Figure 6

Steps in determining plasmonic angle for the DL-based method.

Polynomial degree 3 curve fitting to BFP line-scan intensity

This method is a conventional SPR dip measurement method from the reflectance spectrum, as shown in Fig. 7. First, a line-scan curve was extracted from the pure p-polarization axis for SPR dip position measurement. Next, the minimum intensity in the line scan was fitted using a 3rd-degree polynomial curve fitting10 to locate the plasmonic angle. Similar to the dip position measurement explained in the earlier section, a BFP image can then separated into four quadrants leading to 4 plasmonic angles. The mean value of the four plasmonic angles was then computed to reduce the method's measurement error and fully utilize the image.

Figure 7.

Figure 7

Steps in determining plasmonic angle from a BFP intensity.

Azimuthal angle averaging

Although the pure p-polarization is only in the x-axis in the BFP, the other azimuthal angles (φ) also have a weaker plasmonic effect due to the interference between the p-polarization and the s-polarization. One approach to locate the plasmonic angle and average the noise is to rotate the BFP within the azimuthal angle φ of − 45 degrees to 45 degrees, as shown in Fig. 8. Here the azimuthal angle step size of 1 degree was employed to rotate the BFP image. The 91 line scans along the x-axis for each rotated BFP image were then stored and summed for noise cancellation. The 3rd-degree polynomial curve fitting10 was then applied to the summed line-scan to locate the plasmonic angle.

Figure 8.

Figure 8

Steps in SPR measurement for azimuthal angle averaging.

Quantitative performance parameters

Root mean square phase error of the predicted phase image (RMSE) is given as the minimum root mean square error between phase profile predicted from the DL-based method with an arbitrary phase offset (offset) and the phase profile computed using Fresnel equation (Fresnel) and the transfer matrix approach. The RMSE is expressed as shown in Eq. (1).

RMSE=j=1Ncolumni=1Nrow(DL(i,j)+offset(i,j)-Fresnel(i,j))2Npixels. 1

where DL is the phase profile recovered using the DL-based method, offset is a constant phase shift varying from 0 rad to 2π rad, Fresnel is the simulated phase profile based on Fresnel equations and the transfer matrix approach, Npixels is the total number of pixels; here, the size of the input and the output images are 256 × 256 pixels, which is 65,536 pixels.

Sensitivity (S) is given by the change in n0sinθsp over the change in sample refractive index (Δns) as expressed in Eq. (2).

S=Δn0sinθpΔns. 2

Limit of detection (LoD) is given by 3.3 times the standard deviation (3.3σ) of the measurements and expressed in its RIU. The 3.3σ value corresponds to 99.9% statistical confidence (α=0.001). The standard deviation was measured through noise simulation and the Monte Carlo model.

Results

Phase responses from the CAN network

The CAN network can recover the phase profile for all BFP images in the test dataset, as shown in Fig. 9. The phase can be recovered correctly from a single BFP image, which means the network has recognized the BFP pattern to predict the phase pattern. Figure 10 shows line scans of the phase responses along the x-axis in Fig. 9. The predicted phases agree with the phase profiles calculated using Fresnel equations, although the phases recovered from the CAN have some noise artifacts. The RMSE errors for single quadrant images were within the range of 0.035 rad (2 degrees) to 0.186 rad (10 degrees) average phase error as shown in Table. 2.

Figure 9.

Figure 9

Input/output for the CAN and corresponding phase profiles from Fresnel equations for (a) dm of 30 nm and ns of 1.00, (b) dm of 40 nm and ns of 1.00, (c) dm of 50 nm and ns of 1.00, (d) dm of 30 nm and ns of 1.33, (e) dm of 40 nm and ns of 1.33, and (f) dm of 50 nm and ns of 1.33.

Figure 10.

Figure 10

Shows line scans of the simulated phase and the predictive phase profiles in solid curves (the phase from Fresnel calculation) and dash curves (the phase recovered from the CAN). (a) ns of 1.00 and (b) ns of 1.33.

Table 2.

Root mean square phase error in rad of the test dataset.

Refractive index of the sample Gold thickness (nm)
30 40 50
1.00 0.126 0.078 0.076
1.33 0.189 0.136 0.035

Although the CAN network has been trained with simulated data, it can be applied to experimental BFP images. Figure 11a shows an experimental BFP image obtained from the optical microscope setup described in the “Materials and methods” section above and Fig. 2. Figure 11b shows the recovered phase profiles for each quadrant. Figure 11c,d shows simulated BFP intensity and the phase response for 46 nm of a uniform gold layer using Fresnel equations. The plasmonic uniform gold sample tested here was prepared by a sputterer equipped with a quartz microbalance to calibrate the thickness deposition during the sputtering process. The thickness of the gold sensor coated on a standard BK7 microscope coverslip was prepared for 46 nm. Figure 11e,f shows a comparison between the line scans BFP intensity shown in Fig. 11a,c, and phase recovered from the experimental result using the CAN and the Fresnel simulation for 46 nm of uniform gold. It can be seen that the phase recovered from the experimental result agrees with the theoretical phase. Thus, we are of a firm view that simulated data can be generated and utilized as training data; however, a significant concern is how well the simulated data can be generalized to represent the experiment. In this study, the SPR imaging for the gold sensor was generalized by simulation parameter randomization discussed in the “Materials and methods” section.

Figure 11.

Figure 11

Shows (a) experimental BFP for 46 nm of uniform gold layer coated on a BK7 glass substrate (b) recovered phase using the CAN. Note that for (b), each quadrant of the BFP phase was recovered separately using the CAN, (c) simulated BFP intensity using Fresnel equation for 46 nm of uniform gold, (d) simulated BFP phase using Fresnel equation, (e) BFP intensity line scan from the experimental result and the simulation using Fresnel equation for 46 nm of uniform gold, and (f) phase recovered from the experimental result using the CAN and the Fresnel simulation for 46 nm of uniform gold.

Sensing performance comparison

An essential property of SPR measurement is its capability to provide a quantitative measure of refractive index change in the sensing region. This section demonstrates that the DL-base method can enhance the detection limit for SPR measurement compared to the conventional SPR dip measurement techniques. For SPR refractive index sensing applications, the typical gold thickness is around 50 nm31; the 50 nm gold thickness case is investigated in this section.

The total photon energy of the BFP image for the 50 nm gold thickness was varied from 90 to 2600 pJ, and shot noise was added with the corresponding amount of shot noise for the different energy levels. Monte Carlo simulations were carried out to measure the mean and the standard deviation of the three SPR measurement methods described in the “Materials and methods” section. Figure 12. The three measurement methods showed probability density distributions when the sample refractive index was 1.33 and 1.34 at four different total photon energy levels in the BFP image. It can be noticed that the lower light levels gave a broader probability density function leading to an unsatisfactory SPR response. The figure also shows that there is a systematic error in the absolute plasmonic dip position measurement; meanwhile, the change in the plasmonic angle Δn0sinθp was the same for all the cases leading to the same sensitivity (S) level of 1.1773.

Figure 12.

Figure 12

Shows probability density functions of plasmonic dip positions for different photon energy levels of 500 pJ, 1000 pJ, and 2000 pJ calculated for 3 plasmonic dip measurement methods. (a) 3rd-degree polynomial curve fitting in intensity line scan image, (b) zoomed-in curves of (a) in the highlighted curves, (c) 45-degree azimuthal averaging, (d) zoomed-in curves of (c) in the highlighted curves, (e) the DL-based method, and (f) zoomed-in curves of (e) in the highlighted curves.

Figure 13 shows the detection limits (LOD) in RIU for the three measurement techniques for different incident light energy levels from 90 to 2600 pJ computed for the sample refractive index ns of 1.33. The LOD for the DL-based method was lower than the 3rd-degree polynomial curve fitting method. The azimuthal angle averaging approach was better than the DL-based method for low light level measurement below 120 pJ. The DL-based performed around 20% better than the azimuthal angle averaging method at a low light energy level of 620 pJ. The DL-based can achieve a similar LOD level to a typical SPR interferometer of 10–7 RIU when the total light energy level in the image was 11 nJ, corresponding to the camera well depth of 80,000 electrons. This camera well depth is within the range of the current state-of-the-art sCMOS50 and EMCCD51 technologies. For the typical camera well depth of 12,000 electrons (equivalent to 1.6 nJ), the DL-based method had 3.5 times lower LOD than the 3rd-degree polynomial curve fitting method and a similar LOD performance azimuthal angle averaging method, respectively. It is crucial to point out that although the LOD performance was similar for a typical well depth camera, the azimuthal angle averaging method does require a reasonable estimation of the image center pixel before applying the image rotation, whereas the other two methods do not. The DL-based method has provided a robust and noise-tolerant method utilizing all the pixels in the BFP for SPR measurement rather than analyzing only some regions around the SPR dip in the image with no need for additional optical components and sophisticated equipment.

Figure 13.

Figure 13

Shows (a) detection limits (LOD) in RIU for the three measurement techniques for different incident light energy levels from 90 to 2600 pJ computed for the sample refractive index ns of 1.33, (b) zoomed-in figure of (a) to compare the CAN method and the azimuthal angle averaging method.

Conclusion

The proposed deep learning-based single-shot phase retrieval method for conventional microscope configuration has been developed. The context aggregation neural network architecture was utilized to predicted phase response for a single-shot microscope image. The network relies on pattern recognition for retrieving the phase profile. We have demonstrated that the simulated data can be employed for training the network. Furthermore, the image dataset can be generalized by parameter randomization. Here, the surface plasmon resonance was employed as an example to quantify the proposed DL-based phase retrieval method. Single quadrant back focal plane images and their corresponding phase images were employed as the input and supervised output for the network. After the training, the network can accurately predict the phase profile of the simulated BFP test dataset and the experimental data with the root mean square phase error below 10 degrees. Here, we have also provided the theoretical framework to analyze the refractive index sensing performance of the proposed DL-based method compared to the SPR dip position measurement methods reported in the literature. For comparison, the 3rd order polynomial curve fitting and the azimuthal angle averaging approaches were simulated to compare the sensitivity and detection limit for different photon energy levels in the image. The sensitivity of the DL-based method is the same as the other intensity detection methods. For low light levels below 120 pJ, the detection limit of the azimuthal angle averaging approach was slightly better than the DL-based method. The detection limit of the azimuthal angle averaging approach outperformed the azimuthal angle averaging for the higher light energy. The detection limit of the DL-based method was 20% better than the azimuthal angle averaging at the light energy level of 620 pJ. For typical cameras with a well depth of 12,000 electrons, the DL-based method performed a 3.5 times better detection limit than the 3rd-degree polynomial curve fitting method. The proposed DL-based method allows us to recover the phase profile of SPR measurement using a conventional microscope configuration through a single-shot BFP image.

Supplementary Information

Acknowledgements

We would like to acknowledge resources and fruitful discussion, suggestions from the College of Biomedical Engineering, Rangsit University, Thailand, Shenzhen Key Laboratory for Micro-Scale Optical Information Technology, Nanophotonics Research Center, Institute of Microscale Optoelectronics, Shenzhen University, Shenzhen, China, and School of Engineering, KMITL, Thailand.

Author contributions

Conceptualization, S.V. and S.P.; methodology, S.V. and S.P.; software, K.T.; Validation, S.P., S.V. and K.T.; formal analysis, S.V. and S.P.; investigation, S.V. and S.P.; resources, S.V. and S.P.; data curation, K.T.; writing—original draft preparation, S.P., S.V. and K.T.; writing—review and editing, S.P., S.V. and K.T.; visualization, K.T.; project administration, S.V. and S.P.; funding acquisition, S.V. and S.P. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Research Institute of Rangsit University (RSU), by the National Natural Science Foundation of China (Research Fund for International Young Scientists) under Project 12050410256, and the School of Engineering of King Mongkut's Institute of Technology Ladkrabang (KMITL).

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-021-95593-4.

References

  • 1.Kretschmann E, Raether H. Notizen: Radiative decay of non radiative surface plasmons excited by light. Z. für Naturforschung A. 1968 doi: 10.1515/zna-1968-1247. [DOI] [Google Scholar]
  • 2.Pechprasarn S, Chow TWK, Somekh MG. Application of confocal surface wave microscope to self-calibrated attenuation coefficient measurement by Goos-Hänchen phase shift modulation. Sci. Rep. 2018;8:8547. doi: 10.1038/s41598-018-26424-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Ho AH-P, Kim D, Somekh MG. Handbook of Photonics for Biomedical Engineering. Springer; 2017. pp. 503–543. [Google Scholar]
  • 4.Ramanavièius A, Herberg FW, Hutschenreiter S, Zimmermann B, Lapënaitë I, Kaušaitë A, Ramanavièienë A. Biomedical application of surface plasmon resonance biosensors. Acta Medica Lituanica. 2005;12(3):1–9. [Google Scholar]
  • 5.Englebienne P, Hoonacker AV, Verhas M. Surface plasmon resonance: Principles, methods and applications in biomedical sciences. Spectroscopy. 2003;17:372913. doi: 10.1155/2003/372913. [DOI] [Google Scholar]
  • 6.Mullett WM, Lai EP, Yeung JM. Surface plasmon resonance-based immunoassays. Methods. 2000;22(1):77–91. doi: 10.1006/meth.2000.1039. [DOI] [PubMed] [Google Scholar]
  • 7.Hossain MB, Rana MM. DNA hybridization detection based on resonance frequency readout in graphene on Au SPR biosensor. J. Sens. 2016;2016:6070742. doi: 10.1155/2016/6070742. [DOI] [Google Scholar]
  • 8.Suvarnaphaet P, Pechprasarn S. Quantitative cross-platform performance comparison between different detection mechanisms in surface plasmon sensors for voltage sensing. Sensors. 2018;18:3136. doi: 10.3390/s18093136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Wrapp D, et al. Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science. 2020;367:1260. doi: 10.1126/science.abb2507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Tan H-M, Pechprasarn S, Zhang J, Pitter MC, Somekh MG. High resolution quantitative angle-scanning widefield surface plasmon microscopy. Sci. Rep. 2016;6:20195. doi: 10.1038/srep20195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.McNally KM, et al. Optical and thermal characterization of albumin protein solders. Appl. Opt. 1999;38:6661–6672. doi: 10.1364/AO.38.006661. [DOI] [PubMed] [Google Scholar]
  • 12.Kabashin AV, Patskovsky S, Grigorenko AN. Phase and amplitude sensitivities in surface plasmon resonance bio and chemical sensing. Opt. Express. 2009;17:21191–21204. doi: 10.1364/OE.17.021191. [DOI] [PubMed] [Google Scholar]
  • 13.Pechprasarn S, Zhang B, Albutt D, Zhang J, Somekh M. Ultrastable embedded surface plasmon confocal interferometry. Light Sci. Appl. 2014;3:e187. doi: 10.1038/lsa.2014.68. [DOI] [Google Scholar]
  • 14.Wang QJ, Chung Y-W. Encyclopedia of Tribology. Springer; 2013. pp. 2483–2488. [Google Scholar]
  • 15.Chow T, Lun D, Pechprasarn S, Somekh M. Defocus leakage radiation microscopy for single shot surface plasmon measurement. Meas. Sci. Technol. 2020 doi: 10.1088/1361-6501/ab7def. [DOI] [Google Scholar]
  • 16.McDonnell E, Deck L. Solutions for Environmentally Robust Interferometric Optical Testing. SPIE; 2020. [Google Scholar]
  • 17.Pechprasarn S, Somekh MG. Detection limits of confocal surface plasmon microscopy. Biomed. Opt. Express. 2014;5:1744–1756. doi: 10.1364/BOE.5.001744. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Shechtman Y, et al. Phase retrieval with application to optical imaging: A contemporary overview. IEEE Signal Process. Mag. 2015;32:87–109. doi: 10.1109/MSP.2014.2352673. [DOI] [Google Scholar]
  • 19.Burvall A, Lundström U, Takman PAC, Larsson DH, Hertz HM. Phase retrieval in X-ray phase-contrast imaging suitable for tomography. Opt. Express. 2011;19:10359–10376. doi: 10.1364/OE.19.010359. [DOI] [PubMed] [Google Scholar]
  • 20.Dhawan A, D’Alessandro B, Fu X. Optical imaging modalities for biomedical applications. IEEE Rev. Biomed. Eng. 2010;3:69–92. doi: 10.1109/RBME.2010.2081975. [DOI] [PubMed] [Google Scholar]
  • 21.Chen M, Tian L, Waller L. 3D differential phase contrast microscopy. Biomed. Opt. Express. 2016;7:3940–3950. doi: 10.1364/BOE.7.003940. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Tian L, et al. Computational illumination for high-speed in vitro Fourier ptychographic microscopy. Optica. 2015;2:904–911. doi: 10.1364/OPTICA.2.000904. [DOI] [Google Scholar]
  • 23.Descloux A, et al. Combined multi-plane phase retrieval and superresolution optical fluctuation imaging for 4D cell microscopy. Nat. Photonics. 2018;12:165–172. doi: 10.1038/s41566-018-0109-4. [DOI] [Google Scholar]
  • 24.Shen M, Chow TWK, Shen H, Somekh MG. Virtual optics and sensing of the retrieved complex field in the back focal plane using a constrained defocus algorithm. Opt. Express. 2020;28:32777–32792. doi: 10.1364/OE.404573. [DOI] [PubMed] [Google Scholar]
  • 25.Shpilman, A. et al. 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA), 1–6.
  • 26.Al-Kofahi Y, Zaltsman A, Graves R, Marshall W, Rusu M. A deep learning-based algorithm for 2-D cell segmentation in microscopy images. BMC Bioinform. 2018;19:365. doi: 10.1186/s12859-018-2375-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Yang W, et al. Deep learning for single image superresolution: A brief review. IEEE Trans. Multimedia. 2019;21:3106–3121. doi: 10.1109/TMM.2019.2919431. [DOI] [Google Scholar]
  • 28.Haan B, Rivenson Y, Wu Y, Ozcan A. Deep-learning-based image reconstruction and enhancement in optical microscopy. Proc. IEEE. 2019 doi: 10.1109/JPROC.2019.2949575. [DOI] [Google Scholar]
  • 29.Wang H, et al. Deep learning enables cross-modality superresolution in fluorescence microscopy. Nat. Methods. 2019;16:103–110. doi: 10.1038/s41592-018-0239-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Schmidt, P. & Biessmann, F. Quantifying Interpretability and Trust in Machine Learning Systems (2019).
  • 31.Suvarnaphaet P, Pechprasarn S. Enhancement of long-range surface plasmon excitation, dynamic range and figure of merit using a dielectric resonant cavity. Sensors. 2018;18:2757. doi: 10.3390/s18092757. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Zhang B, Pechprasarn S, Zhang J, Somekh MG. Confocal surface plasmon microscopy with pupil function engineering. Opt. Express. 2012;20:7388–7397. doi: 10.1364/OE.20.007388. [DOI] [PubMed] [Google Scholar]
  • 33.Shen M, Learkthanakhachon S, Pechprasarn S, Zhang Y, Somekh MG. Adjustable microscopic measurement of nanogap waveguide and plasmonic structures. Appl. Opt. 2018;57:3453–3462. doi: 10.1364/AO.57.003453. [DOI] [PubMed] [Google Scholar]
  • 34.Campbell CT, Kim G. SPR microscopy and its applications to high-throughput analyses of biomolecular binding events and their kinetics. Biomaterials. 2007;28:2380–2392. doi: 10.1016/j.biomaterials.2007.01.047. [DOI] [PubMed] [Google Scholar]
  • 35.Rothenhäusler B, Knoll W. Surface–plasmon microscopy. Nature. 1988;332:615–617. doi: 10.1038/332615a0. [DOI] [Google Scholar]
  • 36.Peterson AW, Halter M, Plant AL, Elliott JT. Surface plasmon resonance microscopy: Achieving a quantitative optical response. Rev. Sci. Instrum. 2016;87:093703. doi: 10.1063/1.4962034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Li T. Biomedical Optical Spectroscopy and Diagnostics. Optical Society of America; 2000. [Google Scholar]
  • 38.Chen S-J, Su Y-D, Lin C, Chang C-W. A Surface Plasmon Polariton Phase Microscope with a Subwavelength Grating Structure. SPIE; 2007. [Google Scholar]
  • 39.Chen, Q., Xu, J. & Koltun, V. 2017 IEEE International Conference on Computer Vision (ICCV), 2516–2525.
  • 40.Zhang L, et al. A separation–aggregation network for image denoising. Appl. Soft Comput. 2019;83:105603. doi: 10.1016/j.asoc.2019.105603. [DOI] [Google Scholar]
  • 41.Vedaldi A, Bischof H, Brox T, Frahm J-M, editors. Computer Vision—ECCV. Springer; 2020. pp. 335–351. [Google Scholar]
  • 42.Su, S. et al.2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 237–246.
  • 43.Xu Z, Vong C, Wong C, Liu Q. Ground plane context aggregation network for day-and-night on vehicular pedestrian detection. IEEE Trans. Intell. Transp. Syst. 2020 doi: 10.1109/TITS.2020.2991848. [DOI] [Google Scholar]
  • 44.Kingma, D. & Ba, J. Adam: A method for stochastic optimization. In International Conference on Learning Representations (2014).
  • 45.Pechprasarn S, et al. Grating-coupled Otto configuration for hybridized surface phonon polariton excitation for local refractive index sensitivity enhancement. Opt. Express. 2016;24:19517–19530. doi: 10.1364/OE.24.019517. [DOI] [PubMed] [Google Scholar]
  • 46.Johnson PB, Christy RW. Optical constants of the noble metals. Phys. Rev. B. 1972;6:4370–4379. doi: 10.1103/PhysRevB.6.4370. [DOI] [Google Scholar]
  • 47.Janesick J. Photon Transfer Noise Sources. SPIE; 2007. [Google Scholar]
  • 48.Katayama H, et al. Quantum efficiency of the CCD camera (XIS) for the ASTRO-E mission. Nucl. Instrum. Methods Phys. Res. Sect. A. 1999;436:74–78. doi: 10.1016/S0168-9002(99)00600-2. [DOI] [Google Scholar]
  • 49.Mooney CZ. Monte Carlo Simulation. Sage Publications, Inc.; 1997. [Google Scholar]
  • 50.Amico P, Beletic JW, Beletic JE, editors. Scientific Detectors for Astronomy. Springer; 2005. pp. 103–114. [Google Scholar]
  • 51.Denvir D, Conroy E. Electron-Multiplying CCD: The New ICCD. SPIE; 2003. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES