Journal of Biomedical Optics. 2021 Mar 25;26(3):036502. doi: 10.1117/1.JBO.26.3.036502

Neural network model assisted Fourier ptychography with Zernike aberration recovery and total variation constraint

Yongbing Zhang a,b, Yangzhe Liu a, Shaowei Jiang c, Krishna Dixit c, Pengming Song d, Xinfeng Zhang e, Xiangyang Ji f, Xiu Li a,*
PMCID: PMC8330837  PMID: 33768741

Abstract.

Significance: Fourier ptychography (FP) is a computational imaging approach that achieves high-resolution reconstruction. Inspired by neural networks, many deep-learning-based methods have been proposed to solve FP problems. However, FP reconstruction still suffers from optical aberration, which needs to be taken into account.

Aim: We present a neural network model for FP reconstruction that can properly estimate the aberration and achieve artifact-free reconstruction.

Approach: Inspired by the iterative reconstruction of FP, we design a neural network model that mimics the forward imaging process of FP via TensorFlow. The sample and aberration are treated as learnable weights and optimized through back-propagation. In particular, we parameterize the aberration with Zernike terms to reduce the degrees of freedom of pupil recovery and obtain a high-accuracy estimate. Owing to the auto-differentiation capabilities of the neural network, we additionally employ total variation regularization to improve the visual quality.

Results: We validate the performance of the reported method via both simulation and experiment. Our method exhibits higher robustness against sophisticated optical aberrations and achieves better image quality by reducing artifacts.

Conclusions: The forward neural network model can jointly recover the high-resolution sample and the optical aberration in iterative FP reconstruction. We hope our method can provide a neural-network perspective for solving iterative coherent or incoherent imaging problems.

Keywords: Fourier ptychographic microscopy, pupil recovery, neural network, optics

1. Introduction

In biomedical applications, it is desirable to obtain complex images with both high resolution and a wide field of view. Despite advancements in sophisticated mechanical scanning microscope systems and lensless microscopy setups, modifying conventional microscopes to obtain ideal high-resolution results has remained a hot topic of recent research. Inspired by ptychography,1,2 Fourier ptychography (FP) in particular is a simple and cost-effective analytical method for this purpose.3–7 As a recently proposed computational imaging method, FP integrates the principles of phase retrieval8,9 and aperture synthesis10,11 to achieve both high resolution and a large field of view (FOV). By introducing a programmable light-emitting diode (LED) array as an angle-varied coherent illumination source, higher-frequency information is shifted into the passband of the objective lens and detected by the image sensor. FP then stitches these images together in the Fourier domain to enlarge the system bandwidth via aperture synthesis. The lost phase information is further recovered via phase retrieval using intensity-only measurements. As such, FP can reconstruct a sample with both a high equivalent numerical aperture (NA) and a large FOV. FP has received increasing attention over the past few years, and numerous studies on the system setup and reconstruction algorithm have been proposed to correct various system aberrations,12,13 provide optimization methods that are robust against noise,14,15 report innovative system designs,16,17 and so on.18–20

Recently, convolutional neural networks (CNNs) have been proven to reliably provide inductive answers to inverse problems in computational imaging,11 and many biomedical imaging problems have been solved with CNNs, such as computed tomography,21 image super-resolution,22 and holography.11,23 The purpose of FP is to recover a high-resolution target from the acquired image sequence, so it is natural to introduce the idea of CNNs into this inverse problem and learn an underlying mapping from the low-resolution inputs to a high-resolution output.24–27 However, some issues are specific to biomedical applications. First, unlike natural-image applications, biomedical imaging problems rarely have access to large amounts of data; supervised CNNs, which rely on large datasets, therefore face an obstacle in obtaining training data. Second, computational imaging methods like FP are sensitive to system parameters: once the system setup is changed or system aberration is introduced, the performance of a deep-learning-based network degrades. To address the first dilemma, Zhang et al.27 generated datasets by simulation to train the network, but there is a lack of related networks addressing the second issue. An effective way to handle both dilemmas is to add constraints based on the characteristics of the application. Jiang et al.28 proposed a physics-based framework that models the forward imaging process of FP via TensorFlow and uses back-propagation to optimize the high-resolution sample. This method takes the imaging model of FP as the constraint and solves the above two issues; the neural network is used only to imitate the traditional iterative recovery process. Following this idea, related methods can work with limited data29,30 and remain stable under system aberration.31–34 However, these works are still essentially iterative algorithms, and the automatic differentiation (AD) property of the neural network is not fully utilized. It would be much more desirable to design a new neural network that further suppresses noise in the reconstruction and estimates the optical aberration with higher accuracy.

In this paper, we report a neural network model for solving Fourier ptychographic problems. Zernike aberration recovery and a total variation (TV) constraint are introduced as augmenting modules to ensure aberration-corrected reconstruction and robustness against noise; we therefore name this model the integrated neural network model (INNM). INNM is essentially a TensorFlow-based trainable network mimicking the iterative reconstruction of FP. We first model the forward imaging process of FP via TensorFlow. To estimate the optical aberration properly, the aberration of the employed objective lens is modeled as a pupil function and optimized along with the sample through back-propagation. We then introduce the alternate updating (AU) mechanism to achieve better performance and use the Zernike modes to make a robust and accurate aberration estimation. As such, our method can recover the optical aberration with high accuracy while achieving better reconstruction quality. To further suppress noise, we incorporate TV regularization on both the amplitude and phase of the sample image, which improves the image quality. Our experiments demonstrate that INNM outperforms other methods with higher contrast, especially when validated under severe aberration conditions.

This paper is structured as follows. In Sec. 2, we discuss the fundamental principles and reconstruction procedure of FP. In Sec. 3, we describe the model structure and the introduced mechanisms step by step. In Sec. 4, we validate our method on both simulated and experimental datasets under varying acquisition conditions and analyze the benefits of the introduced mechanisms in detail. Finally, we provide concluding remarks and discuss our ongoing efforts in Sec. 5.

2. Fourier Ptychographic Microscopy

As a classic analytical method, FP is mainly composed of an explicit forward imaging model and a decomposition procedure. Consider the generalized FP setup shown in Fig. 1(a): the sample is successively illuminated by plane waves from the LED matrix at different angles, and the exit waves are captured by the image sensor (CCD) through the objective lens. By sequentially lighting distinct LEDs on the matrix, FP obtains a sequence of low-resolution intensity images from which to recover a high-resolution complex image.

Fig. 1

Fundamental principles of FP: (a) the schematic diagram of the FP experimental setup with the physical comparison; (b) the iterative decomposition procedure with pupil recovery.

The forward imaging procedure is shown in the left column of Fig. 1(a). We denote the thin sample by its transmission function $o(\mathbf{r})$, where $\mathbf{r}=(x,y)$ represents the 2D spatial coordinates with Fourier counterpart $\mathbf{k}=(k_x,k_y)$. When obliquely illuminated by the $n$'th monochromatic LED, the exit wave at the object plane can be denoted as $o(\mathbf{r})\odot\exp(i\mathbf{k}_n\cdot\mathbf{r})$, where $\odot$ denotes element-wise multiplication and $\mathbf{k}_n=(k_x^n,k_y^n)$ denotes the $n$'th wave vector corresponding to the angle of the incident illumination. The final wave captured by the CCD can be expressed as

$$I_l^n(\mathbf{r}) = \left| \left\{ o(\mathbf{r}) \odot \exp(i\mathbf{k}_n\cdot\mathbf{r}) \right\} * \mathcal{F}^{-1}\{C(\mathbf{k})\} \right|^2, \tag{1}$$

where $\mathcal{F}$ denotes the Fourier transform, $*$ denotes convolution, $C(\mathbf{k})$ denotes the coherent transfer function (CTF) of the objective lens, and $I_l^n(\mathbf{r})$ denotes the image captured under the $n$'th LED illumination. Here we note that the subscripts $l$ and $h$ denote low and high resolution, respectively, and the superscript $n$ denotes the sequence number. The standard formulation of the pupil function (CTF) can be expressed as

$$C(\mathbf{k}) = \begin{cases} 1, & k_x^2 + k_y^2 < (\mathrm{NA}\cdot k_0)^2, \\ 0, & \text{otherwise}, \end{cases} \tag{2}$$

where NA characterizes the range of angles over which the system can accept light, $k_0 = 2\pi/\lambda$, and $\lambda$ is the illumination wavelength.
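To make the forward model concrete, here is a minimal Python sketch of Eqs. (1) and (2) that simulates one captured image; the function names and grid conventions are our own illustrative choices, and sensor downsampling is omitted for brevity.

```python
import numpy as np

def make_ctf(n, pixel_size, wavelength, na):
    """Circular CTF of Eq. (2) on an n x n Fourier grid (DC at center)."""
    fx = np.fft.fftshift(np.fft.fftfreq(n, d=pixel_size))   # spatial freq., 1/m
    kx, ky = np.meshgrid(2 * np.pi * fx, 2 * np.pi * fx)    # angular freq., rad/m
    k0 = 2 * np.pi / wavelength
    return ((kx**2 + ky**2) < (na * k0)**2).astype(np.complex64)

def captured_intensity(sample, ctf, shift):
    """Intensity under the n'th LED, Eq. (1).

    `shift` is the (row, col) pixel offset of the wave vector k_n in the
    Fourier plane; shifting the centered spectrum and low-pass filtering it
    with the CTF is equivalent to tilted illumination followed by the
    objective lens.
    """
    spectrum = np.fft.fftshift(np.fft.fft2(sample))
    shifted = np.roll(spectrum, shift, axis=(0, 1))         # O(k + k_n)
    exit_wave = np.fft.ifft2(np.fft.ifftshift(shifted * ctf))
    return np.abs(exit_wave)**2
```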

As for the decomposition procedure shown in Fig. 1(b), FP first initializes a high-resolution sample image. FP then shifts the confined CTF to distinct apertures and applies a phase retrieval method called alternating projection (AP) to obtain a self-consistent complex image. In AP, the pinned aperture in the Fourier domain is updated by keeping its spatial phase unchanged and replacing its spatial amplitude with the square root of the corresponding captured image. In addition, since optical aberration is a common issue in practical applications,35 it is valuable to embed an extra pupil recovery step, called ePIE,4 after AP to compensate for this interference. The whole decomposition process with pupil recovery is shown in Fig. 1(b) (the red flowchart denotes the original FP, and the green one denotes the extended pupil recovery).

3. Method

3.1. Framework of INNM

To solve FP problems via a neural network, we model the forward imaging process via TensorFlow so that the sample and optical aberration can be obtained simultaneously. Figure 2 illustrates the detailed pipeline and the framework of our method. As shown in Fig. 2(a), the upsampled central measurement and the standard CTF without pupil aberration are set as the initial guesses of the sample and CTF, respectively. The optimized targets are then obtained through several training stages, each consisting of a few epochs (for simplicity, only one epoch is shown). As shown in Fig. 2(b), we take the pair of a captured image $I_l^n(\mathbf{r})$ and the corresponding plane wave vector $\mathbf{k}_n$ as a single batch. In each epoch, all batches are fed into the model, and the model parameters are updated through back-propagation. To avoid confusion, we term these batches varying-angle illumination units (VAIUs); a complete pass over all VAIUs constitutes one epoch.

Fig. 2

Illustration of the INNM framework: (a) schematics of pipeline procedure with the AU mechanism; (b) schematics of a training epoch; and (c) the basic framework of INNM with embedded pupil recovery.

The detailed framework of our model with the embedded pupil recovery is shown in Fig. 2(c). This model is transferred from the forward imaging process in the Fourier domain, and the whole process can be formulated as follows:

$$\varphi_l^n(\mathbf{k}) = O(\mathbf{k}+\mathbf{k}_n)\cdot C(\mathbf{k}), \tag{3}$$

$$\varphi_h^n(\mathbf{k}) = \mathcal{F}\left\{ \sqrt{I_l^n(\mathbf{r})} \odot \exp\!\left( i\angle \mathcal{F}^{-1}\{\varphi_l^n(\mathbf{k})\} \right) \right\}, \tag{4}$$

where $O(\mathbf{k})$ denotes the Fourier spectrum of the sample, $\angle$ denotes the phase operator, and $\varphi_l^n$ and $\varphi_h^n$ represent the original and updated apertures in the Fourier domain during the AP.

As reflected in Fig. 2(c), the sample and CTF are naturally treated as two-channel (real and imaginary) learnable feature maps. The sample spectrum $O(\mathbf{k})$ is forward-propagated through the designed framework following Eq. (3): the spectrum is first shifted according to $\mathbf{k}_n$ and multiplied with the CTF $C(\mathbf{k})$. The generated spectrum $\varphi_l^n(\mathbf{k})$ is then transformed via the AP defined in Eq. (4) with the pre-upsampled input $I_l^n(\mathbf{r})$, which yields the updated spectrum $\varphi_h^n(\mathbf{k})$. Based on the phase retrieval mechanism, these two series of exit-wave spectra $\varphi_l^n(\mathbf{k})$ and $\varphi_h^n(\mathbf{k})$ should contain the same frequency information at convergence. Therefore, the whole framework obtains optimized results by minimizing the differences between these spectra. The first loss function used in INNM can be expressed as

$$\mathrm{loss} = \sum_{n=0}^{N} \left\| \varphi_h^n(\mathbf{k}) - \varphi_l^n(\mathbf{k}) \right\|_2^2, \tag{5}$$

where the L2-norm is used to measure the differences and $N$ denotes the number of VAIUs.
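As an illustration of how Eqs. (3)–(5) translate into a trainable TensorFlow graph, the following is a minimal sketch of the INNM core for a single VAIU. The variable layout and the small stabilizing constant are our own assumptions and not the authors' exact implementation (the released notebook linked below is the authoritative version).

```python
import tensorflow as tf

class FPModel(tf.Module):
    """Sample spectrum and CTF as trainable two-channel (real/imag) weights."""
    def __init__(self, init_spectrum, init_ctf):
        super().__init__()
        self.obj_re = tf.Variable(tf.math.real(init_spectrum))
        self.obj_im = tf.Variable(tf.math.imag(init_spectrum))
        self.ctf_re = tf.Variable(tf.math.real(init_ctf))
        self.ctf_im = tf.Variable(tf.math.imag(init_ctf))

    def loss(self, measured, shift):
        """Eqs. (3)-(5) for one VAIU (one LED measurement)."""
        obj = tf.complex(self.obj_re, self.obj_im)
        ctf = tf.complex(self.ctf_re, self.ctf_im)
        phi_l = tf.roll(obj, shift, axis=[0, 1]) * ctf               # Eq. (3)
        field = tf.signal.ifft2d(phi_l)
        # Alternating projection: keep the phase of the field, replace its
        # amplitude with the square root of the measurement, Eq. (4).
        phase = field / tf.cast(tf.abs(field) + 1e-8, tf.complex64)
        phi_h = tf.signal.fft2d(tf.cast(tf.sqrt(measured), tf.complex64) * phase)
        return tf.reduce_sum(tf.abs(phi_h - phi_l)**2)               # Eq. (5)
```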

As for the detailed composition of the network, self-designed fixed Lambda layers and multiplication layers are widely used to perform operations such as the AP. More specifically, the Lambda layer can be flexibly customized to define operations, such as using the roll function in TensorFlow to perform the shifting operation. As such, these layers can be assembled to implement the forward imaging model of FP in TensorFlow. We note that INNM is defined in the complex field, and all layers operate on the real and imaginary parts. For example, Eq. (3) is rewritten in the complex domain as

$$\varphi_l^n(\mathbf{k}) = \left\{ O_r(\mathbf{k}+\mathbf{k}_n) + i\,O_i(\mathbf{k}+\mathbf{k}_n) \right\} \cdot \left\{ C_r(\mathbf{k}) + i\,C_i(\mathbf{k}) \right\} = (O_r C_r - O_i C_i) + i\,(O_r C_i + O_i C_r), \tag{6}$$

where the subscripts $r$ and $i$ denote the real and imaginary parts, respectively. Finally, customized network layers represent the learnable targets, namely the sample and the CTF, so the model is lightweight with a limited number of trainable parameters. After back-propagation, we extract the parameters of the learnable layers shown in the blue boxes of Fig. 2(c) to obtain the optimized results. Our Jupyter notebook source code for this algorithm has been made available; a link is provided in the Data, Materials, and Code Availability section below.
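For reference, Eq. (6) maps directly onto code as a complex multiplication carried out on separate real and imaginary feature maps; this toy helper (names are illustrative) shows the idea.

```python
import tensorflow as tf

def complex_mul(o_re, o_im, c_re, c_im):
    """(O_r + i O_i)(C_r + i C_i), carried out on real-valued channels."""
    out_re = o_re * c_re - o_im * c_im   # real part of Eq. (6)
    out_im = o_re * c_im + o_im * c_re   # imaginary part of Eq. (6)
    return out_re, out_im
```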

3.2. Alternate Updating Mechanism

Based on the model designed in Fig. 2, we can in theory directly obtain the recovered sample and estimated CTF. However, the sample and CTF have different sensitivities to the backward gradients: the model cannot converge to the optimal result if they are updated simultaneously with the same gradient-descent step size. The AU mechanism is therefore devised to control both the optimization target and the step size of gradient descent.

In general networks, the training process is consecutive. Here we divide the training process of INNM into multiple stages, with the updating objective alternating between adjacent stages. How we control the updated parameters is shown in Fig. 2(a). In the first stage, we assume the initial CTF is closer to the practical ground truth, so the CTF parameters C(0) are fixed and only the sample parameters O(0) are updated by back-propagation. In the next stage, we update the CTF and keep the updated sample O(1) unchanged, since O(1) now contains more detailed and realistic information. In short, INNM alternately keeps either the sample or the CTF constant within a stage. INNM can then assign the sample and CTF individual learning rates to control the step size of gradient descent, so the targets are better optimized. As the model parameters are alternately updated, the sample and CTF gradually converge to the optimal point.

For clarity, the details of the sample and CTF in distinct stages are provided in Fig. 2(a): in the first stage, the amplitude and phase images of the sample are improved; in the second stage, the CTF is updated; and we obtain the estimated CTF and the recovered aberration-free object in the last stage. The benefit of applying AU in INNM is that this mechanism helps update the parameters more appropriately by controlling the optimization target.36
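A minimal sketch of how such an AU schedule could be written around the FPModel-style module sketched in Sec. 3.1; the choice of Adam, the stage/epoch counts, and the learning rates are illustrative assumptions rather than the authors' exact settings.

```python
import tensorflow as tf

opt_obj = tf.keras.optimizers.Adam(learning_rate=1e-2)   # sample step size
opt_ctf = tf.keras.optimizers.Adam(learning_rate=1e-3)   # CTF step size

def train(model, batches, num_stages=10, epochs_per_stage=5):
    """Alternate the optimization target between sample and CTF per stage."""
    for stage in range(num_stages):
        if stage % 2 == 0:                       # even stage: update sample
            variables, opt = [model.obj_re, model.obj_im], opt_obj
        else:                                    # odd stage: update CTF
            variables, opt = [model.ctf_re, model.ctf_im], opt_ctf
        for _ in range(epochs_per_stage):
            for measured, shift in batches:      # one VAIU per batch
                with tf.GradientTape() as tape:
                    loss = model.loss(measured, shift)
                grads = tape.gradient(loss, variables)
                opt.apply_gradients(zip(grads, variables))
```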

3.3. Improved Modality of Optical Aberration

In FP applications, if the sampling pattern is a periodic grid in the Fourier domain, periodic artifacts are introduced into the recovered pupil function, and this corrupted pupil function then degrades the high-resolution FP reconstruction.18 To address this problem, we incorporate the Zernike polynomials, owing to their powerful ability to describe wavefront characteristics in optical imaging.37–39

In conventional pupil recovery, the CTF is updated as a whole. If the Zernike modes are instead used to model the CTF, the number of parameters needed to compensate for the aberration is reduced from the square of the image size to the constant number of Zernike modes to be fitted. In other words, the optimization degrees of freedom are orders of magnitude lower than in previous implementations.

As such, the new modality of the CTF's phase is updated as

$$\angle C(\mathbf{k}) = \sum_{l=1}^{L} c_l \cdot Z_l(\mathbf{k}), \tag{7}$$

where $L$ denotes the number of Zernike modes used in the model and $c_l \in \mathbb{R}$ is the coefficient of each polynomial $Z_l(\mathbf{k})$. In general, the first nine modes after the piston $Z_0^0$ can already fit the common aberrations in microscopy. Consequently, we can train only nine parameters to achieve the same performance as updating the phase map in its entirety. The CTF's amplitude is still updated as a whole. As such, $C(\mathbf{k}) = |C(\mathbf{k})|\exp\{i\angle C(\mathbf{k})\}$ is the final form we use in INNM to model the CTF.
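A minimal sketch of this parameterization: the pupil phase is a trainable weighted sum of precomputed Zernike basis maps, so only the L coefficients (plus the amplitude map) are optimized. The `basis` array is assumed to be generated by an external Zernike routine; all names here are illustrative.

```python
import tensorflow as tf

class ZernikePupil(tf.Module):
    """Pupil phase as a trainable sum of L Zernike maps, Eq. (7)."""
    def __init__(self, basis, amp_init):
        super().__init__()
        # basis: precomputed [L, n, n] float32 array of Zernike maps Z_l(k)
        self.basis = tf.constant(basis, dtype=tf.float32)
        self.coeffs = tf.Variable(tf.zeros([self.basis.shape[0]]))  # c_l
        self.amp = tf.Variable(amp_init, dtype=tf.float32)          # |C(k)|

    def ctf(self):
        phase = tf.einsum('l,lxy->xy', self.coeffs, self.basis)     # Eq. (7)
        # C(k) = |C(k)| exp{i angle C(k)}
        return tf.complex(self.amp * tf.cos(phase),
                          self.amp * tf.sin(phase))
```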

3.4. Enhanced Loss Function

One of the strengths of neural networks lies in the auto-differentiation capabilities of optimization toolboxes. Therefore, different cost functions can lead to different results. If the appropriate functions are introduced, the model can achieve better results.

In FP applications, the recovered sample image contains noise due to the limited synthesized NA. To reduce this noise without degrading edges, we use the TV regularizer to achieve better performance. TV regularization is widely used in signal processing and related fields.40,41 By calculating the integral of the absolute gradient, it evaluates the degree to which an image is disturbed by noise.

We expand the loss function of Eq. (5) with TV terms as follows:

$$\mathrm{loss} = \left\| \varphi_h^n(\mathbf{k}) - \varphi_l^n(\mathbf{k}) \right\|_2^2 + \alpha_1 \cdot \mathrm{TV}\{|\Phi_h^n(\mathbf{r})|\} + \alpha_2 \cdot \mathrm{TV}\{\angle \Phi_h^n(\mathbf{r})\}, \tag{8}$$

$$\mathrm{TV}\{o(\mathbf{r})\} = \sum_{x,y} \left( |o_{x+1,y} - o_{x,y}|^2 + |o_{x,y+1} - o_{x,y}|^2 \right)^{\eta/2}, \tag{9}$$

where $|\cdot|$ and $\angle$ represent the amplitude and phase of the complex sample $\Phi_h^n(\mathbf{r})$, which is the spatial-domain transmission function corresponding to the spectrum $\varphi_h^n(\mathbf{k})$. Equation (8) is composed of three parts: $\|\varphi_h^n(\mathbf{k}) - \varphi_l^n(\mathbf{k})\|_2^2$ is the L2-norm term from the AP, and $\mathrm{TV}\{|\Phi_h^n(\mathbf{r})|\}$ and $\mathrm{TV}\{\angle \Phi_h^n(\mathbf{r})\}$ are the TV regularizers of the amplitude and phase of the updated image. The parameters $\alpha_1$ and $\alpha_2$ are the coefficients of the two TV terms; the larger their values, the smoother the reconstructed images. The standard formulation of the TV function is given in Eq. (9), where $o(\mathbf{r})$ is a placeholder image, $\eta$ is the power of the TV term, and $\eta = 1$ is used in our experiments. Since the TV regularization improves the spatial smoothness of the two relevant targets, our method can tune the performance by choosing different combinations of TV coefficients.
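A minimal sketch of the TV terms of Eqs. (8) and (9) with η = 1; the default coefficients and the small constant added for differentiability at zero are illustrative assumptions.

```python
import tensorflow as tf

def tv(img, eps=1e-8):
    """Isotropic TV of a real 2D image, Eq. (9) with eta = 1."""
    dx = img[1:, :-1] - img[:-1, :-1]    # vertical finite differences
    dy = img[:-1, 1:] - img[:-1, :-1]    # horizontal finite differences
    return tf.reduce_sum(tf.sqrt(dx**2 + dy**2 + eps))

def total_loss(phi_h, phi_l, field_h, alpha1=1e-3, alpha2=1e-3):
    """Eq. (8): AP data term plus TV penalties on amplitude and phase."""
    data = tf.reduce_sum(tf.abs(phi_h - phi_l)**2)
    return (data
            + alpha1 * tv(tf.abs(field_h))
            + alpha2 * tv(tf.math.angle(field_h)))
```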

3.5. FP Reconstruction Procedure with INNM

Combining the above mechanisms, our model can effectively reconstruct the sample and CTF with a smooth background and high image contrast.

As shown in Fig. 2(a), the whole procedure can be divided into three parts. In the initialization stage, the upsampled intensity image and the standard CTF without pupil aberration are used as the initial guesses. Next, the sample and CTF are modeled as learnable weights of hidden layers according to Eqs. (3), (4), and (7) and optimized according to Eq. (8); the gradients are calculated by the auto-differentiation of TensorFlow. Note that the captured images should be pre-upsampled to satisfy the size requirements. In the output stage, we obtain the recovered sample and CTF simultaneously by extracting the optimized weights of the hidden layers.

4. Experiments

We validated our INNM on both simulated and experimental datasets. For comparison, we mainly compare our results with the widely used ePIE,4 a robust and effective FP method with pupil recovery. We also compare with other methods on the experimental datasets, namely Jiang's method28 and AS.42

4.1. System Setup

The whole setup with its schematic diagram is shown in Fig. 1(a). In our setup, a programmable 15×15 LED matrix (532-nm central wavelength) is placed 90 mm below the sample plane for illumination, with a 4-mm distance between adjacent LEDs. A 2×, 0.1-NA objective lens with a 200-mm tube lens is used to build the FP microscopy system, and a camera with a pixel size of 3.45 μm captures the low-resolution intensity images. As such, the overlap ratio in Fourier space is about 78%, and the synthetic NA could reach 0.50.
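As a quick sanity check on these numbers, the maximum illumination NA implied by the stated geometry can be computed directly; this back-of-the-envelope sketch reproduces the quoted synthetic NA of about 0.50.

```python
import math

pitch, height, na_obj = 4.0, 90.0, 0.1     # LED pitch (mm), distance (mm), NA
max_offset = 7 * pitch * math.sqrt(2)      # corner LED of a 15 x 15 grid
na_illum = max_offset / math.hypot(max_offset, height)  # sin of steepest angle
na_syn = na_obj + na_illum                 # aperture-synthesis limit
print(f"NA_illum = {na_illum:.2f}, NA_syn = {na_syn:.2f}")  # ~0.40, ~0.50
```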

For the generation of simulated datasets, we chose two images of size 128×128 as the ground truth. Then 225 simulated intensity images of size 32×32 were generated based on the above system setup. To make the simulation more realistic, we additionally assumed the images were captured with a defocus aberration, where the defocus Zernike coefficient $Z_2^0$ is about −1.44, corresponding to a 50-μm defocus.

For the acquisition of experimental datasets, we captured images under two different conditions. First, we obtained two tissue-section datasets with an image size of 64×64, where the samples were well placed on the focal plane and the aberration could be ignored. We then designed an extreme but valuable extension in which we crop a 128×128 tile close to the edge of the entire FOV, where the aberration is severe.

For the experimental datasets, we alternately updated the sample and CTF over 10 stages with 5 epochs per stage. For the simulated datasets, these hyperparameters differ because the image size is smaller.

4.2. Results on Simulated Datasets

The ground truth of the simulated dataset is shown in Fig. 3(f): the cameraman and street map are used as the amplitude and phase images, and the defocus aberration is set as the optical aberration. The goal of the reconstruction methods is to recover the sample and CTF from the 225 intensity images of size 32×32.

Fig. 3

Comparison of recovered results and some decomposed Zernike amplitudes. (a)–(f) Recovered results of the sample and pupil function under different methods (amplitudes are normalized to [0, 1]). (g) Loss decline curve of the INNM. (h) Scatter plot of some decomposed Zernike polynomial coefficients $c_l$ (the piston coefficient $Z_0^0$ is not shown).

4.2.1. Alternate updating process

For clarity, we refer to the basic framework of INNM described in Sec. 3.1 as B-INNM. The recovered results under different settings of INNM are presented in Figs. 3(a)–3(d), where INNM stands for B-INNM with all the proposed attributes. Considering the influence of the AU, we compare the recovered results shown in Figs. 3(a) and 3(b). The former sample images are quite blurred, and the estimated aberration is meaningless, whereas the latter contains fewer artifacts and achieves higher fidelity owing to the introduction of the AU mechanism. In addition, the training curve with AU is shown in Fig. 3(g). Based on the above discussion, AU leads to the expected optimum by controlling the optimization target and compensates for the aberration to some extent. However, the recovered aberration contains unexpected periodic speckles [Fig. 3(b3)]. This problem is caused by the periodic grid sampling pattern in the Fourier domain and would degrade the high-resolution FP reconstruction.18 To solve it, additional principles are incorporated.

4.2.2. Zernike polynomial function

As shown in Fig. 3(c), the greatest contribution of the Zernike mode is the proper correction of the aberration. Compared with the aberration shown in Fig. 3(b3), the new aberration with the Zernike modes completely removes the raster-grid speckle pollution. Furthermore, the decomposed Zernike coefficients under various conditions are illustrated in Fig. 3(h). INNM fits the polynomial coefficients with high accuracy in all order terms, whereas B-INNM with AU cannot restore these parameters; the biased coefficients in the tilt Zernike modes ($Z_1^{-1}$ and $Z_1^{1}$) lead to the raster-grid speckles in Fig. 3(b3). When this issue is resolved, the Zernike mode helps achieve the higher reconstruction quality shown in Fig. 3(c). This phenomenon proves that treating the optical aberration as a power-series expansion is better for optimization than treating it as a whole. In addition, the decomposed coefficient amplitudes obtained from ePIE deviate from the ground truth in the tilt Zernike modes, which hinders high-quality reconstruction, as shown in Fig. 3(e).

4.2.3. Total variation loss

By introducing the above two mechanisms into the B-INNM, our method can predict the optical aberration with high accuracy and eliminate its interference. Here we provide the results of integrating the TV loss into the final objective function. In FP, the recovered amplitude and phase can crosstalk with each other and cause background artifacts; as shown in Fig. 3(c), even with a properly estimated aberration, the recovered sample images still suffer from this noise. To suppress it, we introduce the TV regularization, and the modified result is shown in Fig. 3(d). The background noise is greatly reduced, yielding higher image quality. Finally, we compare our result [Fig. 3(d)] with ePIE [Fig. 3(e)]: owing to the introduced mechanisms, our reconstructed phase completely eliminates the amplitude interference, whereas ePIE cannot.

In summary, by reproducing the imaging system through neural network tools and incorporating advanced mechanisms, our method provides a new perspective for FP reconstruction and can achieve higher image quality against optical aberrations and artifacts.

4.3. Results on Experimental Datasets

In this section, we apply our method to several experimental datasets captured under normal and severe aberration conditions.

4.3.1. Normal condition

We first apply our framework to two general datasets where the samples are well placed on the focal plane. Figure 4 shows the reconstructions of a tissue section stained by immunohistochemistry and a blood smear. In Fig. 4(a), we show the three recovered images of the tissue section illuminated by red, green, and blue monochromatic LEDs. Combining the three images, the recovered color amplitude is shown in Fig. 4(b1), with its phase shown in Fig. 4(b2); the corresponding results from ePIE are shown in Fig. 4(c). The tissue-section dataset was carefully captured so that the aberration could be ignored during reconstruction, and we find that both INNM and ePIE obtain images of high quality.

Fig. 4

Reconstruction of two datasets in a low-aberration condition. (a) Recovered amplitudes at 632, 532, and 470 nm wavelengths from INNM. (b), (c) The combined color intensity and phase images of a tissue section stained by immunohistochemistry methodology from INNM and ePIE. (d)–(f) Recovered results of blood cells.

In addition, we test the two methods on another blood smear dataset, shown in Figs. 4(d)–4(f), and draw the same conclusion. The recovered sample images from ePIE and INNM share highly similar detailed information, such as cellular structures and phase distributions, and we can clearly see the performance improvement of these reconstructions over the raw images shown in Fig. 1(b).

4.3.2. Severe condition

Although the image reconstruction capabilities of INNM and ePIE are similar on generic datasets, the mechanisms introduced by INNM greatly enhance its ability to alleviate optical aberration. We deliberately crop a tile close to the edge of the entire FOV, where the system aberration is sophisticated enough to seriously affect the quality of the captured intensity images. To make our results more credible, we additionally test INNM under different network settings, increase the number of Zernike modes L to 50, and make an extra comparison with Jiang's method28 and AS.42

The reconstructed amplitude images with some magnified areas are shown in Fig. 5 (the ground truth is obtained from the aberration-free dataset). First, compared with the ground truth shown in Fig. 5(a), ePIE cannot recover the amplitude with distinguishable cell structures [Fig. 5(e)], and the same holds for INNM without the Zernike function [Fig. 5(d)]. Focusing on the center of the secondary magnification area, the reconstruction obtained by INNM without TV is rough in detail due to noise [Fig. 5(c)], and the information of the small cellular structure is totally lost. In contrast, the complete INNM distinguishes this small cellular feature from the surrounding tissue structures, and the overall image is of high quality with a smooth background [Fig. 5(b)].

Fig. 5

Recovered amplitude images of a tissue slide in condition of severe aberration: (a) high-resolution ground truth; (b)–(d) recovered amplitudes from INNM under different conditions; (e) recovered amplitude from ePIE; and (f), (g) aberrations restored from INNM with and without TV.

In Fig. 6, we further compare the phase images recovered from INNM, ePIE, Jiang's method, and AS, in which the first two methods take optical aberration into account and the latter two do not (all methods start with the zero-aberration hypothesis). The reconstructions from Jiang's method and AS are highly blurred and cannot provide instructive information due to the influence of aberrations. For the more recognizable details shown in the secondary magnification area, INNM achieves higher contrast than ePIE when compared with the ground truth shown in Fig. 6(a). The cell structure recovered by ePIE is also quite blurred compared with INNM.

Fig. 6

Recovered phase images in condition of severe aberration: (a) ground truth; (b)–(e) recovered phases from INNM, ePIE, Jiang’s method, and AS;42 and (f), (g) aberrations restored from INNM without Zernike mode and ePIE.

Reflected in the recovered optical aberrations, the results of ePIE and INNM without the Zernike mode [Figs. 6(g) and 6(f)] differ considerably from those of INNM with and without TV [Figs. 5(f) and 5(g)], resulting in the poor reconstructions shown in Fig. 5. This phenomenon further proves the superiority of INNM in compensating aberration.

In conclusion, INNM can reconstruct sample images with high quality on both simulated and experimental datasets. In particular, when validated under severe aberration conditions, INNM exhibits higher robustness against sophisticated optical aberrations and achieves better performance than the other methods. This improved performance relies on the augmenting modules, such as Zernike aberration recovery, on the one hand, and on the AD of optimization tools in neural networks on the other. The AD approach allows solving FP problems without finding an analytical derivation of the update function, which can be challenging to obtain. In addition, AD directly benefits from the progress made in the machine-learning community in terms of hardware, software tools, and algorithms. We can expect INNM to be further improved thanks to the fast-paced progress in AD.

5. Conclusion

In this work, we reported a reconstruction method based on a neural network model to solve FP problems and achieve artifact-free performance. By reproducing the imaging process in a neural network and modeling the sample and aberration as learnable weights of multiplication layers, INNM can obtain an aberration-free complex sample. Advanced tools are further introduced to ensure good performance: the AU helps INNM optimize the sample and aberration in an appropriate alternating way, the introduced Zernike modes estimate the sophisticated optical aberration with high accuracy, and the TV terms reduce artifacts by encouraging spatial smoothness. We tested our method on both simulated and experimental datasets, and the results demonstrated that INNM can reconstruct images with smooth backgrounds and detailed information. We believe our recovered aberration can serve as a good estimate of the system's optical transmission, and we hope our method can provide a neural-network perspective for solving iterative-based coherent or incoherent imaging problems. Moreover, we expect such techniques to be further improved thanks to the fast-paced progress in deep-learning toolboxes like TensorFlow and in powerful tools like novel regularizations and optimizers.

There are many directions worth exploring in the future. For example, some layers of INNM could be replaced with advanced architectures so that the model can learn useful features in advance. In addition, although we can foresee the effect of different coefficient combinations in the TV terms, manual adjustment is still required in practice. How to optimize the image evaluation system so that the network model can automatically tune its performance is therefore also a feasible direction of research.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (Nos. 61922048 and 62031023), the National Key R & D Program of China (No. 2020AAA0108303), and NSFC 41876098.

Biographies

Yongbing Zhang received his bachelor’s degree in English and his MS and PhD degrees in computer science from Harbin Institute of Technology, Harbin, China, in 2004, 2006, and 2010, respectively. He joined the Graduate School at Shenzhen Tsinghua University from 2010 to 2020. In January 2021, he joined Harbin Institute of Technology (Shenzhen). His current research interests include signal processing, computational imaging, and machine learning.

Yangzhe Liu received his bachelor's degree from the Department of Automation at Tsinghua University, Beijing, China, in 2018. He is currently working toward his MS degree in automation at Tsinghua University, Beijing, China. His research interests include computational imaging.

Shaowei Jiang received his BE degree in biomedical engineering from Zhejiang University in 2014. He is currently a PhD candidate at Dr. Guoan Zheng’s Laboratory of the University of Connecticut. His research interests include super-resolution imaging, whole slide imaging, and lensless imaging.

Krishna Dixit received his BS degree in biomedical engineering and his minor degree in electronics and systems in 2019 from the University of Connecticut, where he conducted research on the biomechanics of embryonic zebrafish. He is an electronics engineer at Veeder-Root, Simsbury, Connecticut, USA. He currently designs circuits and firmware for fuel sensing applications.

Pengming Song received his MS degree in control engineering from Tsinghua University in 2018. He is currently a PhD student in electrical engineering at the University of Connecticut. His research interests include Fourier ptychography, whole slide imaging, and lensless imaging.

Xinfeng Zhang received his BS degree in computer science from Hebei University of Technology, Tianjin, China, in 2007 and his PhD in computer science from the Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China, in 2014. He is currently an assistant professor at the School of Computer Science and Technology of the University of Chinese Academy of Sciences. His research interests include video compression, image/video quality assessment, and image/video analysis.

Xiangyang Ji received his bachelor’s degree in materials science and his MS degree in computer science from Harbin Institute of Technology, Harbin, China, in 1999 and 2001, respectively, and his PhD in computer science from the Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China. In 2008, he joined Tsinghua University, Beijing, China, where he is currently a professor in the Department of Automation. His current research interests include signal processing and computer vision.

Xiu Li received her PhD in computer integrated manufacturing in 2000. Since then, she has worked at Tsinghua University. Her research interests are in the areas of data mining, deep learning, computer vision, and image processing.

Disclosures

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Contributor Information

Yongbing Zhang, Email: zhang.yongbing@sz.tsinghua.edu.cn.

Yangzhe Liu, Email: prolyz@163.com.

Shaowei Jiang, Email: shaowei.jiang@uconn.edu.

Krishna Dixit, Email: krishna.dixit@uconn.edu.

Pengming Song, Email: pengming.song@uconn.edu.

Xinfeng Zhang, Email: zhangxinf07@gmail.com.

Xiangyang Ji, Email: xyji@tsinghua.edu.cn.

Xiu Li, Email: li.xiu@sz.tsinghua.edu.cn.

Data, Materials, and Code Availability

Our code for simulation is available in the Figshare repository via the link https://figshare.com/articles/PgNN-Code-zip/11906832.

References

1. Faulkner H. M. L., Rodenburg J., "Movable aperture lensless transmission microscopy: a novel phase retrieval algorithm," Phys. Rev. Lett. 93(2), 023903 (2004). 10.1103/PhysRevLett.93.023903
2. Rodenburg J. M., "Ptychography and related diffractive imaging methods," Adv. Imaging Electron Phys. 150, 87–184 (2008). 10.1016/S1076-5670(07)00003-1
3. Zheng G., Horstmeyer R., Yang C., "Wide-field, high-resolution Fourier ptychographic microscopy," Nat. Photonics 7(9), 739–745 (2013). 10.1038/nphoton.2013.187
4. Ou X., Zheng G., Yang C., "Embedded pupil function recovery for Fourier ptychographic microscopy," Opt. Express 22(5), 4960–4972 (2014). 10.1364/OE.22.004960
5. Tian L., et al., "Multiplexed coded illumination for Fourier ptychography with an LED array microscope," Biomed. Opt. Express 5(7), 2376–2389 (2014). 10.1364/BOE.5.002376
6. Zhang Y., et al., "Self-learning based Fourier ptychographic microscopy," Opt. Express 23(14), 18471–18486 (2015). 10.1364/OE.23.018471
7. Zhang Y., et al., "Fourier ptychographic microscopy with sparse representation," Sci. Rep. 7(1), 8664 (2017). 10.1038/s41598-017-09090-8
8. Fienup J. R., "Reconstruction of a complex-valued object from the modulus of its Fourier transform using a support constraint," J. Opt. Soc. Am. A 4(1), 118–123 (1987). 10.1364/JOSAA.4.000118
9. Yeh L. H., et al., "Experimental robustness of Fourier ptychography phase retrieval algorithms," Opt. Express 23(26), 33214–33240 (2015). 10.1364/OE.23.033214
10. Mico V., et al., "Synthetic aperture superresolution with multiple off-axis holograms," J. Opt. Soc. Am. A 23(12), 3162–3170 (2006). 10.1364/JOSAA.23.003162
11. Sinha A., et al., "Lensless computational imaging through deep learning," Optica 4(9), 1117–1125 (2017). 10.1364/OPTICA.4.001117
12. Sun J., et al., "Efficient positional misalignment correction method for Fourier ptychographic microscopy," Biomed. Opt. Express 7(4), 1336–1350 (2016). 10.1364/BOE.7.001336
13. Bian Z., Dong S., Zheng G., "Adaptive system correction for robust Fourier ptychographic imaging," Opt. Express 21(26), 32400–32410 (2013). 10.1364/OE.21.032400
14. Bian L., et al., "Fourier ptychographic reconstruction using Wirtinger flow optimization," Opt. Express 23(4), 4856–4866 (2015). 10.1364/OE.23.004856
15. Hou L., et al., "Adaptive background interference removal for Fourier ptychographic microscopy," Appl. Opt. 57(7), 1575–1580 (2018). 10.1364/AO.57.001575
16. Lee H., Chon B. H., Ahn H. K., "Reflective Fourier ptychographic microscopy using a parabolic mirror," Opt. Express 27(23), 34382–34391 (2019). 10.1364/OE.27.034382
17. Tian L., Waller L., "3D intensity and phase imaging from light field measurements in an LED array microscope," Optica 2(2), 104–111 (2015). 10.1364/OPTICA.2.000104
18. Guo K., et al., "Optimization of sampling pattern and the design of Fourier ptychographic illuminator," Opt. Express 23(5), 6171–6180 (2015). 10.1364/OE.23.006171
19. Guo K., et al., "Microscopy illumination engineering using a low-cost liquid crystal display," Biomed. Opt. Express 6(2), 574–579 (2015). 10.1364/BOE.6.000574
20. Bian L., et al., "Content adaptive illumination for Fourier ptychography," Opt. Lett. 39(23), 6648–6651 (2014). 10.1364/OL.39.006648
21. Jin K. H., et al., "Deep convolutional neural network for inverse problems in imaging," IEEE Trans. Image Process. 26(9), 4509–4522 (2017). 10.1109/TIP.2017.2713099
22. Dong C., et al., "Image super-resolution using deep convolutional networks," IEEE Trans. Pattern Anal. Mach. Intell. 38(2), 295–307 (2016). 10.1109/TPAMI.2015.2439281
23. Rivenson Y., et al., "Phase recovery and holographic image reconstruction using deep learning in neural networks," Light Sci. Appl. 7, 17141 (2018). 10.1038/lsa.2017.141
24. Kappeler A., et al., "Ptychnet: CNN based Fourier ptychography," in IEEE Int. Conf. Image Process., IEEE, pp. 1712–1716 (2017). 10.1109/ICIP.2017.8296574
25. Nguyen T., et al., "Deep learning approach for Fourier ptychography microscopy," Opt. Express 26(20), 26470–26484 (2018). 10.1364/OE.26.026470
26. Shamshad F., Abbas F., Ahmed A., "Deep Ptych: subsampled Fourier ptychography using generative priors," in IEEE Int. Conf. Acoust., Speech and Signal Process., IEEE, pp. 7720–7724 (2019). 10.1109/ICASSP.2019.8682179
27. Zhang J., et al., "Fourier ptychographic microscopy reconstruction with multiscale deep residual network," Opt. Express 27(6), 8612–8625 (2019). 10.1364/OE.27.008612
28. Jiang S., et al., "Solving Fourier ptychographic imaging problems via neural network modeling and TensorFlow," Biomed. Opt. Express 9(7), 3306–3319 (2018). 10.1364/BOE.9.003306
29. Kellman M., et al., "Data-driven design for Fourier ptychographic microscopy," in IEEE Int. Conf. Comput. Photogr., IEEE, pp. 1–8 (2019). 10.1109/ICCPHOT.2019.8747339
30. Kellman M., et al., "Physics-based learned design: optimized coded-illumination for quantitative phase imaging," IEEE Trans. Comput. Imaging 5, 344–353 (2019). 10.1109/TCI.2019.2905434
31. Sun M., Chen X., Zhu Y., et al., "Neural network model combined with pupil recovery for Fourier ptychographic microscopy," Opt. Express 27(17), 24161–24174 (2019). 10.1364/OE.27.024161
32. Bostan E., et al., "Deep phase decoder: self-calibrating phase microscopy with an untrained deep neural network," Optica 7(6), 559–562 (2020). 10.1364/OPTICA.389314
33. Zhang J., et al., "Forward imaging neural network with correction of positional misalignment for Fourier ptychographic microscopy," Opt. Express 28(16), 23164–23175 (2020). 10.1364/OE.398951
34. Zhang J., et al., "Aberration correction method for Fourier ptychographic microscopy based on neural network," Proc. SPIE 11550, 1155005 (2020). 10.1117/12.2573583
35. Liao J., et al., "Rapid focus map surveying for whole slide imaging with continuous sample motion," Opt. Lett. 42(17), 3379–3382 (2017). 10.1364/OL.42.003379
36. Yang Y., et al., "Convolutional neural networks with alternately updated clique," in Proc. IEEE Conf. Comput. Vision and Pattern Recognit., pp. 2413–2422 (2018). 10.1109/CVPR.2018.00256
37. Thibos L. N., et al., "Standards for reporting the optical aberrations of eyes," J. Refractive Surg. 18(5), S652–S660 (2002). 10.3928/1081-597X-20020901-30
38. Chen M., Phillips Z. F., Waller L., "Quantitative differential phase contrast (DPC) microscopy with computational aberration correction," Opt. Express 26(25), 32888–32899 (2018). 10.1364/OE.26.032888
39. Song P., et al., "Full-field Fourier ptychography (FFP): spatially varying pupil modeling and its application for rapid field-dependent aberration metrology," APL Photonics 4(5), 050802 (2019). 10.1063/1.5090552
40. Kappeler A., et al., "Video super-resolution with convolutional neural networks," IEEE Trans. Comput. Imaging 2(2), 109–122 (2016). 10.1109/TCI.2016.2532323
41. Mahendran A., Vedaldi A., "Understanding deep image representations by inverting them," in Proc. IEEE Conf. Comput. Vision and Pattern Recognit., pp. 5188–5196 (2015). 10.1109/CVPR.2015.7299155
42. Zuo C., Sun J., Chen Q., "Adaptive step-size strategy for noise-robust Fourier ptychographic microscopy," Opt. Express 24(18), 20724–20744 (2016). 10.1364/OE.24.020724
