Skip to main content
Journal of Medical Imaging logoLink to Journal of Medical Imaging
. 2020 Aug 27;7(4):044003. doi: 10.1117/1.JMI.7.4.044003

Sphere estimation network: three-dimensional nuclei detection of fluorescence microscopy images

David Joon Ho a,*, Daniel Mas Montserrat b, Chichen Fu b, Paul Salama c, Kenneth W Dunn d, Edward J Delp b
PMCID: PMC7451995  PMID: 32904135

Abstract.

Purpose: Fluorescence microscopy visualizes three-dimensional subcellular structures in tissue with two-photon microscopy achieving deeper penetration into tissue. Nuclei detection, which is essential for analyzing tissue for clinical and research purposes, remains a challenging problem due to the spatial variability of nuclei. Recent advancements in deep learning techniques have enabled the analysis of fluorescence microscopy data to localize and segment nuclei. However, these localization or segmentation techniques would require additional steps to extract characteristics of nuclei. We develop a 3D convolutional neural network, called Sphere Estimation Network (SphEsNet), to extract characteristics of nuclei without any postprocessing steps.

Approach: To simultaneously estimate the center locations of nuclei and their sizes, SphEsNet is composed of two branches to localize nuclei center coordinates and to estimate their radii. Synthetic microscopy volumes automatically generated using a spatially constrained cycle-consistent adversarial network are used for training the network because manually generating 3D real ground truth volumes would be extremely tedious.

Results: Three SphEsNet models based on the size of nuclei were trained and tested on five real fluorescence microscopy data sets from rat kidney and mouse intestine. Our method can successfully detect nuclei in multiple locations with various sizes. In addition, our method was compared with other techniques and outperformed them based on object-level precision, recall, and F1 score. Our model achieved 89.90% for F1 score.

Conclusions: SphEsNet can simultaneously localize nuclei and estimate their size without additional steps. SphEsNet can be potentially used to extract more information from nuclei in fluorescence microscopy images.

Keywords: nuclei detection, fluorescence microscopy, convolutional neural network, synthetic volumes

1. Introduction

Fluorescence microscopy is optical microscopy using visible light and a combination of lenses to image magnified subcellular structures in tissue in 3D.1 Especially, two-photon microscopy, using two photons simultaneously for fluorescence excitation, can image deeper into tissue.2 Fluorescence microscopy enables analyzing intravital studies of organs.3 An essential process of quantifying microscopy images is nuclei detection. In this paper we define “nuclei detection” as a task of finding locations and characteristics of each nucleus, which is important for clinical and research purposes.4 Detecting the size of nuclei is crucial to understand their cell functions.5

An initial step for nuclei detection is to find locations of nuclei accurately in microscopy images. A thresholding technique to separate foreground (nuclei) and background6 can be used to find locations of nuclei. A progressive weighted mean curve of a grayscale histogram was used to select a thresholding value to automatically segment cell membranes and nuclei in cardiospheres presented in fluorescence microscopy volumes.7 Some microscopy data sets may suffer from inhomogeneous intensity in volumes which can be corrected using adaptive histogram equalization as a preprocessing step.8 Detecting locations of nuclei on fluorescence microscopy images is still challenging because some nuclei in a volume may touch or overlap.4 After the thresholding process, watershed-based techniques9 have been used to separate touching nuclei. A method to find watershed seeds based on mathematical morphology in time-lapse microscopy was introduced.10 Watershed-based techniques can be sensitive to irregular biological structures or noise presented in microscopy volumes leading to overdetection.

Active contours, also known as snakes, have been developed to detect structures in irregular shapes to avoid overdetection.11 Active contour can be used on microscopy images because it can capture nuclei in various sizes and shapes.12 A method of coupling multiple active surfaces by penalizing overlaps and constraining volumes was introduced to separate touching nuclei13 using region-based active surfaces,14 showing promising results of detecting objects with poorly defined boundaries. This method was further developed using the watershed technique, a non-PDE-based energy minimization, and the Radon transform.15 A nuclei segmentation technique based on active contours with inhomogeneity correction was described to overcome intensity inhomogeneity in flourescence microscopy volumes.16 Alternatively, a model-based method by identifying nuclei with four primitives representing boundaries and delineating them using region growing was designed.17 A method using the fast radial symmetry transform and a dilation-based nonmaximum suppression to detect nuclei in various sizes and shapes was introduced.18 However, these techniques cannot distinguish nuclei from other subcellular structures causing many false-detection.

Recently, convolutional neural networks (CNNs)19 have become powerful tools to solve computer vision problems, such as classification,20 detection,21 and segmentation.22 Using CNNs, nuclei detection has made a big progress to find the location of nuclei.23 A patch-based method with max-pooling layers was introduced to detect mitosis in breast histology images.24 A window is slid over an image, and a CNN estimates the probability of the center pixel being mitosis. After smoothing the probability map, mitosis are detected by nonmaximal suppression. Another patch-based method to detect tyrosine hydroxylase-containing cells in zebrafish images was described.25 A set of training patches are selected on both cell region and background region using a support vector machine classifier.26 Patch-based methods by sliding a window centered at every pixel are computationally expensive due to repeated convolutional operations when a majority portion of a current patch overlaps with previous patches. To avoid redundant convolutional operations in overlapping patches, fast scanning27 was employed into CNNs by performing convolutional operations on the entire image. With the fast scanning technique, a nucleus localization method in microscopy images, called deep voting, was developed by extracting voting offsets and voting confidences to estimate the voting density map.28 Alternatively, a CNN-based structured regression model trained by a proximity mask assigning higher value to nuclei centers was described.29 A spatially constrained CNN was introduced with a parameter estimation layer and a spatially constrained layer for spatial regression to more accurately locate nuclei in hematoxylin and eosin-stained histopathology images.30 A cell nuclei detection method, known as vector oriented confidence accumulation (VOCA), was developed by accumulating localization vectors pointing to nuclei center locations to accurately detect the center locations.31 These CNN-based methods can successfully detect nuclei locations but they do not provide further characteristics of nuclei such as the size.

To extract size information of objects, a faster R-CNN32 and a mask R-CNN33 were developed to estimate bounding boxes surrounding objects and masks on objects, respectively. A cell proposal network, similarly done to the faster R-CNN,32 was described to detect cells on microscopy images within bounding boxes.34 Cells are generally not in rectangular shapes so detecting cells in bounding boxes may not provide useful information. A mask R-CNN33 was used to detect locations of nuclei within masks in distinct labels in microscopy images.35 Alternatively, a cell detection method by object probability predictions and radian distances in multiple orientations to generate star-convex polygons was described.36 Furthermore, a multi-input multi-output network was designed by utilizing multiple resolutions to segment cells in various sizes in fluorescence microscopy images.37 These methods can only process microscopy data in 2D and cannot use depth information in microscopy volumes.

To utilize the entire information in microscopy 3D volumes, 3D CNNs have been developed. A 3D CNN for nuclei detection and segmentation on microscopy images was described.38 The 3D CNN was composed of a 3D U-Net39 for segmentation followed by a series of convolutional layers for detection. The 3D CNN was trained by a manually labeled training set which may be tedious to produce for various data sets. Data augmentation approaches are widely used to multiply a small set of labeled ground truth volumes.20 Simple transformations, such as translation, reflection, rotation, and color jitter, can increase the number of training volumes. For example, a cell detection and segmentation plugin developed in ImageJ40 using manual labels with data augmentation approaches was presented.41 Our goal is to train 3D CNNs without labeling real ground truth volumes to reduce a burden of the 3D labeling process. We introduced a technique42 to automatically generate synthetic microscopy volumes containing multiple ellipsoids with blurring43 and noise44 operations without using real ground truth volumes to train a 3D segmentation CNN. In addition, we presented a 3D detection-segmentation method to label each nucleus distinctly.45 Specifically, nuclei center coordinates were first detected by a 3D distance transform46 to select center candidates and a 3D classification CNN to remove center candidates on non-nuclei structures. A 3D segmentation CNN trained by synthetic microscopy volumes similarly done as our previous technique42 labeled individual nuclei by processing subvolumes centered at the coordinates from the detection stage. One problem in our previous works42,45 is that the synthetic microscopy volumes do not look realistic. To achieve better performance, generating more realistic synthetic volumes is necessary.

Generative adversarial networks (GANs) are developed to generate realistic synthetic images and volumes to be used during training.47 A cycle-consistent adversarial network, known as a CycleGAN, was introduced where a cycle consistency term was included without requiring any real ground truth volumes.48 More recently, we developed a spatially constrained cycle-consistent adversarial network, known as a SpCycleGAN, including a spatial constraint term to accurately match synthetic nuclei locations to their corresponding nuclei mask locations.49 By training CNNs using synthetic microscopy volumes from the SpCycleGAN49 without real ground truth volumes, we distinctly segmented nuclei.50

To further analyze and understand nuclei in fluorescence microscopy volumes, we present a 3D nuclei detection network called Sphere Estimation Network (SphEsNet) to find both center coordinates and radii of nuclei simultaneously using our single network. Instead of detecting nuclei in bounding boxes,32 our proposed network can detect nuclei in spheres with proper sizes because most cell nuclei are known to be round.5 Manually generating ground truth volumes to train our SphEsNet by annotating center locations of nuclei and especially measuring their radii in 3D can be tedious and time-consuming. We train our model using synthetic microscopy volumes automatically generated from the SpCycleGAN we developed.49 Note no real ground truth volumes are required while generating synthetic volumes. Our work presented in this paper is different from our previous works focusing on nuclei segmentation.45,50 Our previous works require two steps, nuclei center localization followed by segmentation, using multiple CNNs. The biological and clinical goal of the analysis is not segmenting nuclei but extracting characteristics of nuclei. Thus, additional postprocessing steps would be necessary to extract characteristics of nuclei from segmentation masks. For example, an additional least square fitting method51 on segmentation masks would be required to estimate the size of nuclei. The main contribution of this work is to detect the location of nuclei and to estimate radii of nuclei simultaneously using our single CNN without additional postprocessing steps. We process two-photon microscopy data from rat kidney and mouse intestine. Our evaluation using a ground truth volume on real fluorescence microscopy data shows that our SphEsNet can successfully estimate both locations and sizes of nuclei.

2. Proposed Method

Figure 1 is a block diagram of the proposed 3D nuclei detection method. To denote 3D image volumes, we use the notation we defined in our previous works.42,45,49,50 In this paper, we denote I as a 3D image volume of size X×Y×Z voxels and Izp as the p’th 2D focal plane image of the 3D image volume, I, along the z-direction, where p{1,,Z}. For example, Iorig is an original 3D volume and Iz50orig is the 50th focal plane image of the volume. In addition, we denote I(qi:qf,si:sf,pi:pf) as a 3D subvolume of I, whose x coordinate is qixqf, y coordinate is siysf, z coordinate is pizpf, where qi{1,,X}, qf{1,,X}, si{1,,Y}, sf{1,,Y}, pi{1,,Z}, and pf{1,,Z}. It is required to have qiqf, sisf, and pipf. For example, I(193:320,193:320,31:94)det is a subvolume of a detection volume, Idet, whose x coordinate is 193x320, y coordinate is 193y320, and z coordinate is 31z94. Lastly, we denote v as a voxel location.

Fig. 1.

Fig. 1

Block diagram of our proposed method for 3D nuclei detection of fluorescence microscopy images. Our SphEsNet is trained by synthetic microscopy volumes, which are automatically generated using an SpCycleGAN.49 After training is done, an original microscopy volume is processed to detect nuclei in 3D.

The goal of this paper is to simultaneously estimate the location and size of nuclei presented in an original fluorescence microscopy 3D volume, Iorig, using our SphEsNet. To train the SphEsNet, we produce nuclei center volumes, Ictr, to estimate the location of nuclei, nuclei radius volumes, Irad, to estimate the size of nuclei, and nuclei mask volumes, Imask, to generate synthetic microscopy volumes, Isyn, using an SpCycleGAN.49 Note Imask is composed of multiple spherical masks where the size range of the masks matches to the size range of nuclei in the input microscopy volume, Iorig. The location and size of synthetic nuclei in Isyn match to the location and size of spherical masks in Imask. Once our SphEsNet model, M, is trained by Isyn, Ictr, and Irad, it can be used on the real microscopy volume, Iorig, to detect nuclei. After processing Iorig using our SphEsNet, the final detection volume, Idet, contains color-coded estimated spheres representing nuclei in the microscopy volume. Multiple SphEsNet models can be trained to detect nuclei in other real microscopy volumes based on various nuclei size ranges. Our SphEsNet is implemented in PyTorch.52

2.1. Synthetic Volume Generation

The process of manually labeling ground truth for microscopy volumes in 3D is tedious. Thus, a method of automatically generating synthetic volumes to train the SphEsNet is necessary. In this section, we use an SpCycleGAN we developed49 to generate synthetic volumes for training. Note no real ground truth volumes are required in this technique. The synthetic volume generation stage is composed of two steps: the synthetic ground truth volume generation step and the synthetic microscopy volume generation step. The synthetic ground truth volume generation step generates a set of nuclei center volumes, Ictr, a set of nuclei radius volumes, Irad, and a set of nuclei mask volumes, Imask. The synthetic microscopy volume generation step generates a set of synthetic microscopy volume, Isyn, from Imask using the SpCycleGAN.49 Three sets of volumes, Isyn, Ictr, and Irad, are then used to train our SphEsNet. In this stage, 20 synthetic volumes with size of 128×128×128  voxels are generated.

2.1.1. Synthetic ground truth volume generation

During the synthetic ground truth volume generation step, Imask is first generated. Note Imask is used in the synthetic microscopy volume generation step to generate the corresponding Isyn where each mask matches to a synthetic nucleus. To produce Imask with multiple masks, N mask candidate volumes are generated. We denote the i’th mask candidate volume as Imask,i where 1iN. Each mask candidate volume contains one spherical mask which can be potentially included in Imask. Specifically, Imask,i is a binary volume where a mask is centered at a random location ci=(xi,yi,zi) with a random radius, ri:

Imask,i(v)={1,if  vci22ri20,otherwise, (1)

where 1xiX, 1yiY, 1ziZ, and rminrirmax. rmin and rmax are the minimum possible radius of a synthetic nucleus in a volume and the maximum possible radius of a synthetic nucleus in a volume, respectively. N can be selected based on the density of nuclei in Iorig, and rmax and rmin can be selected based on the size of nuclei in Iorig.

After mask candidate volumes are generated, Imask is generated by adding Imask,i in a sequential order. First of all, Imask is initialized to zeros. Note nuclei cannot be overlapped physically in a biological structure. For 1iN, the Imask,i is added to Imask if Imask,i has no overlap with other masks already added in Imask. If Imask,i is overlapped with one of the masks in Imask, Imask,i is not included in Imask. Once this process is done, then Imask contains Nmask number of nuclei where NmaskN. This technique is similar to our previous synthetic binary volume generation technique.42

To generate Ictr and Irad, we first define “central regions.” Note the central regions are defined to more accurately find the location of nuclei centers using our CNN. For the j’th nucleus, where 1jNmask, whose center location is located at cj=(xj,yj,zj) and radius is rj, the j’th central region is defined within a sphere centered at the cj with a radius of arj. a is a parameter determining the range of the central region where 0a1. In this work, we set a=0.5. Within the j’th central region, Ictr,j contains information of the location of the j’th nucleus in Imask and Irad,j contains information of the radius of the j’th nucleus in Imask. Voxel values within the central region of Ictr,j become 1 and of Irad,j become rj whereas other voxel values not in the central region of both Ictr,j and Irad,j become 0:

Ictr,j(v)={1,if  vcj22(arj)20,otherwise, (2)
Irad,j(v)={rj,if  vcj22(arj)20,otherwise. (3)

Then, Ictr and Irad are the sum of Ictr,j and Irad,j for all j, respectively:

Ictr=j=1NmaskIctr,j, (4)
Irad=j=1NmaskIrad,j. (5)

2.1.2. Synthetic microscopy volume generation

In the synthetic microscopy volume generation step, Imask is used to generate the corresponding Isyn using the SpCycleGAN.49 The SpCycleGAN contains generative networks, G1, G2, and H, and discriminative networks, D1 and D2, where the generative networks produce realistic synthetic images, and the discriminative networks determine if the images are real or synthetic. The training loss function of the SpCycleGAN is defined as a sum of four loss terms:49

L(G1,G2,H,D1,D2)=LGAN(G1,D1,Imask,Iorig)+LGAN(G2,D2,Iorig,Imask)+λ1Lcycle(G1,G2,Iorig,Imask)+λ2Lsp(G1,H,Iorig,Imask), (6)

where λ1 and λ2 are weight coefficients. In Eq. (6), G1 is a generative network generating Iorig using Imask, G2 is a generative network generating Imask using Iorig, D1 is a discriminative network distinguishing between Iorig and G1(Imask), D2 is a discriminative network distinguishing between Imask and G2(Iorig), and H is a generative network generating a binary mask, H[G1(Imask)], using G1(Imask). The first loss term in Eq. (6) is the adversarial loss47 between G1 and D1:

LGAN(G1,D1,Imask,Iorig)=EIorig{log[D1(Iorig)]}+EImask(log{1D1[G1(Imask)]}). (7)

The second loss term in Eq. (6) is the adversarial loss47 between G2 and D2:

LGAN(G2,D2,Iorig,Imask)=EImask{log[D2(Imask)]}+EIorig(log{1D2[G2(Iorig)]}). (8)

The third loss term in Eq. (6) is the cycle consistency loss:48

Lcycle(G1,G2,Iorig,Imask)=EImask{G2[G1(Imask)]Imask1}+EIorig{G1[G2(Iorig)]Iorig1}, (9)

where .1 is L1 norm. The last loss term in Eq. (6) is the spatial constraint loss:49

Lsp(G1,H,Iorig,Imask)=EImask{H[G1(Imask)]Imask2}, (10)

where .2 is L2 norm. Including the spatial constraint loss to the training loss function of SpCycleGAN reduces spatial shifts between nuclei masks in Imask and synthetic nuclei in Isyn.49 In this work, we use the same generative network architecture and the same discriminative network architecture from a CycleGAN.48 To train the SpCycleGAN, the Adam optimizer53 is used with a learning rate of 0.001 with the training loss function in Eq. (6) for 200 epochs, and we set λ1=10 and λ2=10. After training the SpCycleGAN, G1 is used to produce synthetic images, Isyn, from labeled images, Imask. Figure 2 shows an example of Iorig, Isyn, Imask, Ictr, and Irad.

Fig. 2.

Fig. 2

A set of examples of Iorig, Isyn, Imask, Ictr, and Irad with size of 128×128  pixels generated by an SpCycleGAN49 (a) a 2D image of Iorig, (b) a 2D image of Isyn. (c) the corresponding 2D image of Imask, (d) the corresponding 2D image of Ictr, and (e) the corresponding 2D image of Irad.

2.2. Sphere Estimation Network

Figure 3 shows the architecture of our SphEsNet. The proposed network is different from our previous works to distinctly segment nuclei.45,50 Our previous works consist of two steps, where the first step is to find center locations of nuclei and the second step is to segment individual nuclei. Using our single network, we can simultaneously find center locations of nuclei and estimate their radii.

Fig. 3.

Fig. 3

Architecture of our SphEsNet. Our SphEsNet is composed of a backbone network, which is a modified 3D U-Net and two branches. CTR branch finds the center locations of nuclei and RAD branch estimates the radii of nuclei.

Our architecture consists of a backbone network and two branches. The first branch estimates center locations of nuclei and the second branch estimates radii of nuclei. In this paper, we denote the first branch as the CTR branch and the second branch as the RAD branch. For the backbone network, we employ a modified 3D U-Net architecture. Our backbone network has the same number of convolutional layers, max-pooling layers, and transposed convolutional layers in the 3D U-Net,39 but the number of channels are reduced. A 3D convolutional layer is a series of a convolutional function using filters with size of 3×3×3 and with padding of 1, a 3D batch normalization,54 and a rectified-linear unit (ReLU) as an activation function. A 3D max-pooling layer using a window with size of 2×2×2 and with a stride of 2 is used as a downsampling operation in the encoder, and a 3D transposed convolutional layer is used as an upsampling operation in the decoder. A concatenation operation between the encoder and the decoder is included to preserve spatial information. The final feature map intensities of the CTR branch are between 0 and 1 to classify each voxel to center location or not, so the Sigmoid function is used as an activation function. The final feature map intensities of the RAD branch are greater than 0 so the ReLU function is used as an activation function. More details on training and inference are provided below.

2.2.1. Training

During training, Isyn, Ictr, and Irad generated from the synthetic volume generation stage are used as an input volume, a ground truth volume in the CTR branch, and a ground truth volume in the RAD branch, respectively. Note that 20 sets of volumes with size of 128×128×128  voxels are generated in the synthetic volume generation stage. We divide each volume to eight volumes with size of 64×64×64  voxels, so 160 sets of volumes are used to train the model, M. Our training loss function, L, is a linear combination of the binary cross entropy (BCE) loss, LBCE, at the CTR branch and the mean squared error (MSE) loss, LMSE, at the RAD branch:

L(Ictr,Irad,Ictr,out,Irad,out)=LBCE(Ictr,Ictr,out)+λLMSE(Irad,Irad,out), (11)

where Ictr is the ground truth volume in the CTR branch, Irad is the ground truth volume in the RAD branch, Ictr,out is the output volume in the CTR branch, and Irad,out is the output volume in the RAD branch, respectively. λ is a weight coefficient that can emphasize one loss term over the other loss term. For example, if λ is small, then the BCE loss at the CTR branch will be more emphasized. Also, if λ is big, then the MSE loss at the RAD branch will be more emphasized. In this work, we set λ=1 unless specified. More specifically, the BCE loss is defined as

LBCE(Ictr,Ictr,out)=1Vv=1V{Ictr(v)logIctr,out(v)+[1Ictr(v)]log[1Ictr,out(v)]}, (12)

and the MSE loss is defined as

LMSE(Irad,Irad,out)=1Vv=1V[Irad(v)Irad,out(v)]2, (13)

where V is the number of voxels in a volume. To train our model, the Adam optimizer53 is used with a learning rate of 0.001.

2.2.2. Inference

Our SphEsNet is trained on volumes with size of 64×64×64  voxels so an input volume to the model should be 64×64×64  voxels. If an input microscopy volume is bigger than 64×64×64  voxels then our model can only process a subvolume of the input with size of 64×64×64  voxels. To process the entire input volume, a 3D window is slid with size of 64×64×64  voxels, similarly done to our previous work.42 First, the input microscopy volume is reflection-padded by 16 voxels. Then, we slide the 3D window to x, y, and z directions by 32. Note that nuclei, which are partially included on the boundary of the 3D window, may generate inaccurate result. Therefore, only the central volume with size of 32×32×32  voxels is used for our final output. We repeat this until the entire input microscopy volume is processed.

Once the process is done, then we have two output volumes, Ictr,out and Irad,out. Note Ictr,out is the output volume from the CTR branch, and Irad,out is the output volume from the RAD branch. First of all, a local maximum of Ictr,out, ck, is selected as the final center coordinate of the k’th nucleus. We avoid selecting more than two center coordinates within a cube with length of rmin because we set the smallest radius of nuclei in a volume is rmin. Sometimes, we observed that multiple local maxima with the same intensity are connected as a 3D region. In this case, a centroid of the 3D region is selected as the final center coordinate. After selecting center coordinates of nuclei, the radius for the k’th nucleus is estimated as Irad,out(ck)+rinf to generate a sphere centered at ck. We observed that the real nuclei in Iorig are not perfectly spherical, and the radius is generally estimated to the shortest distance from the estimated center to the boundary of nuclei. So we increment the estimated radius, Irad,out(ck), by rinf to increase the overlapping region between the estimated sphere and the nucleus. In this work, we set rinf=1. As a final step, any other center locations within a sphere are removed because a center location of a nucleus cannot physically be in another nucleus. The final detection volume, Idet, is generated after color-coding spheres distinctly to clearly visualize 3D nuclei detection results. The computational time for processing a volume with size of 512×512×512  voxels using our model was approximately 3 min using NVIDIA’s GeForce GTX Titan X.

3. Experimental Results

Our method is tested on five various real fluorescence microscopy data sets consisting of grayscale volumes. Data-I, Data-II, Data-III, and Data-IV are imaged from rat kidney where Hoechst 33342 stain is used to label nuclei. Data-I consists of Z=512 images, Data-II of Z=45, Data-III of Z=32, and Data-IV of Z=23, with X=512×Y=512. Data-V is imaged from mouse intestine where DAPI is used to label nuclei, and the size of the volume is X=512×Y=930×Z=157. The resolution in z direction is different from the resolutions in x and y directions for Data-II. To make the shape of nuclei in Data-II spherical, the original volume is interpolated in z direction by a factor of 2. The range of nuclei radii is 4 to 6 voxels, 8 to 10 voxels, 8 to 12 voxels, 3 to 5 voxels, and 8 to 10 voxels in Data-I, Data-II, Data-III, Data-IV, and Data-V, respectively.

In this work, we trained three models, MData-I, MData-II, and MData-III, where parameters of synthetic volumes to train the models are shown in Table 1. Here, rmin, rmax, and N are selected based on the minimum radius of a nucleus, the maximum radius of a nucleus, and the density of nuclei presented in an original fluorescence microscopy volume, respectively. The size of nuclei may vary between real fluorescence microscopy volumes. Thus, we trained three models representing three different size ranges of nuclei. We observed that Data-III contains heavy noise so we selected λ=0.1 to emphasize more on nuclei location to avoid false detection. Data-I was tested on MData-I, Data-II on MData-II, and Data-III on MData-III, respectively. In addition, Data-IV was tested on MData-I and Data-V was tested on MData-II because the size of nuclei in Data-IV is similar to the size of nuclei in Data-I, and the size of nuclei in Data-V is similar to the size of nuclei in Data-II. Figure 4 shows that our method can successfully detect nuclei on various data sets. It is shown that our trained models can detect nuclei on other data sets which were not used during the synthetic volume generation stage.

Table 1.

Parameters of synthetic volumes to train MData-I, MData-II, and MData-III.

Model Iorig rmin rmax N
MData-I Data-I 4 6 1000
MData-II Data-II 8 10 50
MData-III Data-III 8 12 200

Fig. 4.

Fig. 4

Nuclei detection of multiple microscopy data sets. (a) Iz50orig of Data-I, (b) Iz50det of Data-I, (c) Iz16orig of Data-II, (d) Iz16det of Data-II, (e) Iz4orig of Data-III, (f) Iz4det of Data-III, (g) Iz8orig of Data-IV, (h) Iz8det of Data-IV, (i) Iz132orig of Data-V, and (j) Iz132det of Data-V.

We numerically compared our SphEsNet to other nuclei detection methods. An object-based metric is used for our evaluation. First of all, NTP is the number of true positives defined as detected nuclei overlapping at least 50% of the corresponding ground truth nuclei. Otherwise, the detected nuclei are defined as false positives, and we denote the number of false positives as NFP. Lastly, NFN is the number of false negatives defined as ground truth nuclei overlapping less than 50% of the corresponding detected nuclei or without having their corresponding detected nucleus. Precision, recall, and F1 score, denoted as P, R, and F1, respectively, are defined as55,56

P=NTPNTP+NFP, (14)
R=NTPNTP+NFN, (15)
F1=2PRP+R. (16)

We used a subvolume of Data-I with size of 128×128×64  voxels to evaluate multiple nuclei detection methods. A 3D ground truth volume, I(193:320,193:320,31:94)gt, was manually generated using ITK-SNAP57 and our 3D ground truth volume contains 283 nuclei. We removed components whose number of voxels are less than 50  voxels from the ground truth volume and the detection subvolumes to avoid partially included nuclei on the boundary.

We tested multiple training loss functions on our CNN, defined as Eq. (11), by tuning a weight coefficient, λ. Note that the BCE loss, LBCE, was used for center candidate classification and the MSE loss, LMSE, was used for radius regression. Table 2 shows the comparison between various training loss functions. We observed that precision decreased when λ=0.1 and recall decreased when λ=10. We selected λ=1 as our weight coefficient for the training loss function which gives the highest F1 score for Data-I.

Table 2.

Precision, recall, and F1 score for various λ values for training loss functions for Data-I.

  Precision (%) Recall (%) F1 score (%)
λ=0.1 77.51 97.33 86.29
λ=1 84.04 96.63 89.90
λ=10 84.79 95.24 89.81

Note: Bold characters highlight the best result based on F1 scores.

We compared our method with various methods listed as follows. First, we used watershed technique9 which can be used to separate touching nuclei in microscopy volumes. Here, an original volume was binarized by an empirically selected threshold value of 64. Due to inhomogeneous intensity presented in microscopy volumes, we observed that noise in the central region was detected, causing low precision. To correct inhomogeneity, a 3D adaptive histogram equalization8 followed by binarization with an empirically selected threshold value of 192 was used as a preprocessing step before watershed technique. We will denote the method of using a 3D adaptive histogram equalization and watershed technique as Watershed+. The 3D adaptive histogram equalization helped removing noise in the microscopy volume but non-nuclei subcellular structures were still captured. We observed that overdetection was occurred during watershed due to irregular equalized structures and enhanced noise causing a lower precision. A mask R-CNN33 is a 2D segmentation method which can both detect and segment objects in distinct labels. A pretrained mask R-CNN with ResNet-5058 and a feature pyramid network59 was fine-tuned using the same training images used for the SphEsNet. To combine 2D segmentation results into 3D, we used a z-directional combination process as a postprocessing step. The m’th mask on Izp whose centroid located at (xm,p,ym,p) and the n’th mask on Izp+1 whose centroid located at (xn,p+1,yn,p+1) were combined as the same nucleus if the distance between (xm,p,ym,p) and (xn,p+1,yn,p+1) was less than rmax2. We will denote this method as mask R-CNNz. A 2D mask R-CNN cannot utilize depth information, so it performed poor segmentation. In addition, we observed the mask R-CNN was overfitted. We investigated on our previous two-stage method for 3D nuclei detection-segmentation.45 We will denote this method as Det-Seg. The 3D segmentation CNN was trained by a set of synthetic volumes generated by blurring and noise operations. The synthetic volumes used in our previous work45 were not realistic so segmentation masks were not accurate causing a low precision. More recently, we generated an accurate binary segmentation mask using a 3D CNN trained by a set of synthetic volumes generated by a SpCycleGAN.49 To separate touching nuclei in the binary segmentation mask, morphological operations were used as a postprocessing step. We used a 3D erosion, a 3D connected component (for color-coding), and a 3D dilation to separate touching nuclei. A ball with radius of 1 was empirically selected as a structuring element for the 3D erosion and the 3D dilation. We will denote this method as Seg-Morph. We observed that some nuclei were still not separated by morphological operations leading to a low recall. Our method detected the locations of nuclei with their radii accurately so issues caused by overdetection or touching nuclei can be avoided. Note the locations of nuclei and their radii are simultaneously estimated using our SphEsNet without any postprocessing steps. Table 3 shows our method outperforms numerically. The original volume, the ground truth volume, and 3D results are visualized using Voxx60 shown in Fig. 5.

Table 3.

Precision, recall, and F1 score for multiple nuclei detection methods and our proposed method for Data-I.

  Precision (%) Recall (%) F1 score (%)
Watershed9 51.14 92.13 65.78
Watershed+8,9 41.31 97.17 57.97
Mask R-CNNz33 20.85 39.20 27.22
Det-Seg45 68.35 90.22 77.78
Seg-Morph49 91.20 82.01 86.36
Our method 84.04 96.63 89.90

Note: Bold characters highlight the best result based on F1 scores.

Fig. 5.

Fig. 5

Nuclei detection results for Data-I by a 3D visualization tool, Voxx.60 (a) Original microscopy volume, (b) the corresponding ground truth volume, (c) Watershed,9 (d) Watershed+ composed of a 3D adaptive histogram equalization8 and Watershed,9 (e) mask R-CNNz composed of a 2D mask R-CNN33 and a z-directional combination process, (f) Det-Seg composed of a 3D detection and segmentation,45 (g) Seg-Morph composed of a 3D binary segmentation49 and morphological operations, and (h) our proposed method using a SphEsNet.

4. Conclusions

In this paper, we presented a CNN for 3D nuclei detection, called SphEsNet. Training was done using a synthetic training set generated by spatially constrained cycle-consistent adversarial networks without using any real ground truth volumes. Our network can successfully detect locations and sizes of nuclei presented in fluorescence microscopy volumes without any additional postprocessing steps. Our network outperformed other detection/segmentation techniques. We showed that our network can be used on other real fluorescence microscopy volumes when the size of nuclei in an input microscopy volume was similar to the size of nuclei in a training set. One limitation of our work was that the size range of nuclei was an important parameter so that we were not able to train a combined model, which can be used on all testing microscopy volumes. In the future, we plan to extract more characteristics from nuclei in more diverse shapes and sizes.

Acknowledgments

This work was partially supported by a George M. O’Brien Award from the National Institutes of Health under Grant NIH/NIDDK P30 DK079312 and the endowment of the Charles William Harrison Distinguished Professorship at Purdue University. Data-I was provided by Malgorzata Kamocka of Indiana University and was collected at the Indiana Center for Biological Microscopy. Data-IV was provided by Tarek Ashkar of the Indiana University School of Medicine. Data-V was provided by Mike Ferkowicz of the Indiana University School of Medicine. We have followed our institutional policies for the use of animals in our research and the institutional licensing body approved our studies. This work was done while David Joon Ho was with Video and Image Processing Laboratory, School of Electrical and Computer Engineering, Purdue University, West Lafayette, Indiana, USA.

Biographies

David Joon Ho is currently a machine learning scientist at Memorial Sloan Kettering Cancer Center. He received his BS and MS degrees in electrical and computer engineering from the University of Illinois at Urbana-Champaign in 2010 and 2012, respectively, and his PhD in electrical and computer engineering from Purdue University in 2019. His current research interests include machine learning and computer vision on biomedical images. He is a member of IEEE.

Daniel Mas Montserrat graduated from Polytechnic University of Catalonia in 2015. He is currently a PhD candidate at Purdue University under the supervision of Professor Edward J. Delp. He has worked as a research assistant at the Video and Image Processing Laboratory. His main areas of research are deep learning, image processing, and media forensics.

Chichen Fu received his PhD from Purdue University. He is a video algorithm engineer at Zoom Video Communications. His main research interests include image processing, video processing, and computer vision.

Paul Salama received his PhD in electrical engineering from Purdue University in 1999. In 1999, he joined the Department of Electrical and Computer Engineering, in the Purdue School of Engineering and Technology, at Indiana University-Purdue University Indianapolis, Indianapolis, Indiana, where he is currently a professor of electrical and computer engineering and the associate dean for graduate programs within the school. His research interests include image analysis, image and video compression, machine learning, statistical signal processing, and medical imaging. He is a senior member of IEEE and member of SPIE.

Kenneth W. Dunn received his PhD in biology from SUNY Stony Brook in 1986. He then took a position as an NIH postdoctoral fellow at Columbia University. In 1995, he joined the faculty in the Department of Medicine at Indiana University and was promoted to full professor in 2014. His research is broadly focused on the development and application of methods of microscopy, particularly intravital microscopy to the study of cell biology and physiology.

Edward J. Delp is currently the Charles William Harrison distinguished professor of electrical and computer engineering and a professor of biomedical engineering at Purdue University. His research interests include image and video compression, multimedia security, medical imaging, multimedia systems, communication, and information theory. He has published and presented more than 500 papers. He is a fellow of IEEE, a fellow of SPIE, a fellow of the Society for Imaging Science and Technology (IS&T), and a fellow of the American Institute of Medical and Biological Engineering.

Disclosures

No conflicts of interest, financial or otherwise, are declared by the authors.

Contributor Information

David Joon Ho, Email: hod@mskcc.org.

Daniel Mas Montserrat, Email: dmasmont@purdue.edu.

Chichen Fu, Email: fu26@purdue.edu.

Paul Salama, Email: psalama@iupui.edu.

Kenneth W. Dunn, Email: kwdunn@iupui.edu.

Edward J. Delp, Email: ace@ecn.purdue.edu.

References

  • 1.Vonesch C., et al. , “The colored revolution of bioimaging,” IEEE Signal Process. Mag. 23(3), 20–31 (2006). 10.1109/MSP.2006.1628875 [DOI] [Google Scholar]
  • 2.Helmchen F., Denk W., “Deep tissue two-photon microscopy,” Nat. Methods 2(12), 932–940 (2005). 10.1038/nmeth818 [DOI] [PubMed] [Google Scholar]
  • 3.Dunn K. W., et al. , “Functional studies of the kidney of living animals using multicolor two-photon microscopy,” Am. J. Physiol. 283(3), C905–C916 (2002). 10.1152/ajpcell.00159.2002 [DOI] [PubMed] [Google Scholar]
  • 4.Xing F., Yang L., “Robust nucleus/cell detection and segmentation in digital pathology and microscopy images: a comprehensive review,” IEEE Rev. Biomed. Eng. 9, 234–263 (2016). 10.1109/RBME.2016.2515127 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Webster M., Witkin K. L., Cohen-Fix O., “Sizing up the nucleus: nuclear shape, size and nuclear-envelope assembly,” J. Cell Sci. 122(10), 1477–1486 (2009). 10.1242/jcs.037333 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Otsu N., “A threshold selection method from gray-level histograms,” IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979). 10.1109/TSMC.1979.4310076 [DOI] [Google Scholar]
  • 7.Salvi M., et al. , “Automated segmentation of fluorescence microscopy images for 3D cell detection in human-derived cardiospheres,” Sci. Rep. 9, 6644 (2019). 10.1038/s41598-019-43137-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Ho D. J., et al. , “Boundary segmentation for fluorescence microscopy using steerable filters,” Proc. SPIE 10133, 101330E (2017). 10.1117/12.2254627 [DOI] [Google Scholar]
  • 9.Vincent L., Soille P., “Watersheds in digital spaces: an efficient algorithm based on immersion simulations,” IEEE Trans. Pattern Anal. Mach. Intell. 13(6), 583–598 (1991). 10.1109/34.87344 [DOI] [Google Scholar]
  • 10.Yang X., Li H., Zhou X., “Nuclei segmentation using marker-controlled watershed, tracking using mean-shift, and Kalman filter in time-lapse microscopy,” IEEE Trans. Circuits Syst. I 53(11), 2405–2414 (2006). 10.1109/TCSI.2006.884469 [DOI] [Google Scholar]
  • 11.Kass M., Witkin A., Terzopoulos D., “Snakes: active contour models,” Int. J. Comput. Vision 1(4), 321–331 (1988). 10.1007/BF00133570 [DOI] [Google Scholar]
  • 12.Delgado-Gonzalo R., et al. , “Snakes on a plane: a perfect snap for bioimage analysis,” IEEE Signal Process. Mag. 32(1), 41–48 (2015). 10.1109/MSP.2014.2344552 [DOI] [Google Scholar]
  • 13.Dufour A., et al. , “Segmenting and tracking fluorescent cells in dynamic 3-D microscopy with coupled active surfaces,” IEEE Trans. Image Process. 14(9), 1396–1410 (2005). 10.1109/TIP.2005.852790 [DOI] [PubMed] [Google Scholar]
  • 14.Chan T. F., Vese L. A., “Active contours without edges,” IEEE Trans. Image Process. 10(2), 266–277 (2001). 10.1109/83.902291 [DOI] [PubMed] [Google Scholar]
  • 15.Dzyubachyk O., et al. , “Advanced level-set-based cell tracking in time-lapse fluorescence microscopy,” IEEE Trans. Image Process. 29(3), 852–867 (2010). 10.1109/TMI.2009.2038693 [DOI] [PubMed] [Google Scholar]
  • 16.Lee S., et al. , “Segmentation of fluorescence microscopy images using three dimensional active contours with inhomogeneity correction,” in Proc. IEEE Int. Symp. Biomed. Imaging, Melbourne, pp. 709–713 (2017). 10.1109/ISBI.2017.7950618 [DOI] [Google Scholar]
  • 17.Arslan S., et al. , “Attributed relational graphs for cell nucleus segmentation in fluorescence microscopy images,” IEEE Trans. Med. Imaging 32(6), 1121–1131 (2013). 10.1109/TMI.2013.2255309 [DOI] [PubMed] [Google Scholar]
  • 18.Ram S., Rodriguez J. J., “Size-invariant detection of cell nuclei in microscopy images,” IEEE Trans. Med. Imaging 35(7), 1753–1764 (2016). 10.1109/TMI.2016.2527740 [DOI] [PubMed] [Google Scholar]
  • 19.LeCun Y., Bengio Y., Hinton G., “Deep learning,” Nature 521, 436–444 (2015). 10.1038/nature14539 [DOI] [PubMed] [Google Scholar]
  • 20.Krizhevsky A., Sutskever I., Hinton G. E., “ImageNet classification with deep convolutional neural networks,” in Proc. Neural Inf. Process. Syst., Lake Tahoe, Nevada, pp. 1097–1105 (2012). [Google Scholar]
  • 21.Girshick R., et al. , “Rich feature hierarchies for accurate object detection and semantic segmentation,” in Proc. IEEE Conf. Comput. Vision and Pattern Recognit., Columbus, Ohio, pp. 580–587 (2014). 10.1109/CVPR.2014.81 [DOI] [Google Scholar]
  • 22.Long J., Shelhamer E., Darrell T., “Fully convolutional networks for semantic segmentation,” in Proc. IEEE Conf. Comput. Vision and Pattern Recognit., Boston, Massachusetts, pp. 3431–3440 (2015). 10.1109/CVPR.2015.7298965 [DOI] [PubMed] [Google Scholar]
  • 23.Litjens G., et al. , “A survey on deep learning in medical image analysis,” Med. Image Anal. 42, 60–88 (2017). 10.1016/j.media.2017.07.005 [DOI] [PubMed] [Google Scholar]
  • 24.Ciresan D. C., et al. , “Mitosis detection in breast cancer histology images with deep neural networks,” Lect. Notes Comput. Sci. 8150, 411–418 (2013). 10.1007/978-3-642-40763-5_51 [DOI] [PubMed] [Google Scholar]
  • 25.Dong B., et al. , “Deep learning for automatic cell detection in wide-field microscopy zebrafish images,” in Proc. IEEE Int. Symp. Biomed. Imaging, Brooklyn, New York, pp. 772–776 (2015). 10.1109/ISBI.2015.7163986 [DOI] [Google Scholar]
  • 26.Kolesnik M., Fexa A., “Multi-dimensional color histograms for segmentation of wounds in images,” Lect. Notes Comput. Sci. 3656, 1014–1022 (2005). 10.1007/11559573_123 [DOI] [Google Scholar]
  • 27.Giusti A., et al. , “Fast image scanning with deep max-pooling convolutional neural networks,” in Proc. IEEE Int. Conf. Image Process., Melbourne, pp. 4034–4038 (2013). 10.1109/ICIP.2013.6738831 [DOI] [Google Scholar]
  • 28.Xie Y., et al. , “Deep voting: a robust approach toward nucleus localization in microscopy images,” Lect. Notes Comput. Sci. 9351, 374–382 (2015). 10.1007/978-3-319-24574-4_45 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Xie Y., et al. , “Beyond classification: structured regression for robust cell detection using convolutional neural network,” Lect. Notes Comput. Sci. 9351, 358–365 (2015). 10.1007/978-3-319-24574-4_43 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Sirinukunwattana K., et al. , “Locality sensitive deep learning for detection and classification of nuclei in routine colon cancer histology images,” IEEE Trans. Med. Imaging 35(5), 1196–1206 (2016). 10.1109/TMI.2016.2525803 [DOI] [PubMed] [Google Scholar]
  • 31.Xie C., et al. , “VOCA: cell nuclei detection in histopathology images by vector oriented confidence accumulation,” in Proc. Int. Conf. Med. Imaging Deep Learn., London, pp. 527–539 (2019). [Google Scholar]
  • 32.Ren S., et al. , “Faster R-CNN: towards real-time object detection with region proposal networks,” in Proc. Adv. Neural Inf. Process. Syst., Montreal, Canada, pp. 1–9 (2015). [Google Scholar]
  • 33.He K., et al. , “Mask R-CNN,” in Proc. IEEE Int. Conf. Comput. Vision, Venice, Italy, pp. 2980–2988 (2017). 10.1109/ICCV.2017.322 [DOI] [Google Scholar]
  • 34.Akram S. U., et al. , “Cell proposal network for microscopy image analysis,” in Proc. IEEE Int. Conf. Image Process., Phoenix, Arizona, pp. 3199–3203 (2016). 10.1109/ICIP.2016.7532950 [DOI] [Google Scholar]
  • 35.Johnson J. W., “Adapting mask-RCNN for automatic nucleus segmentation,” in Proc. 2019 Comput. Vision Conf., Vol. 2, pp. 1–7 (2018). [Google Scholar]
  • 36.Schmidt U., et al. , “Cell detection with star-convex polygons,” in Int. Conf. Medical Image Comput. Computer-Assisted Intervention, Vol. 11071, pp. 265–273 (2018). [Google Scholar]
  • 37.Raza S. E. A., et al. , “MIMO-Net: a multi-input multi-output convolutional neural network for cell segmentation in fluorescence microscopy images,” in Proc. IEEE Int. Symp. Biomed. Imaging, Melbourne, pp. 337–340 (2017). 10.1109/ISBI.2017.7950532 [DOI] [Google Scholar]
  • 38.Ram S., et al. , “Joint cell nuclei detection and segmentation in microscopy images using 3D convolutional networks,” pp. 1–8, https://arxiv.org/abs/1805.02850v1 (2018).
  • 39.Cicek O., et al. , “3D U-Net: learning dense volumetric segmentation from sparse annotation,” Lect. Notes Comput. Sci. 9901, 424–432 (2016). 10.1007/978-3-319-46723-8_49 [DOI] [Google Scholar]
  • 40.Schneider C. A., Rasband W. S., Eliceiri K. W., “NIH Image to ImageJ: 25 years of image analysis,” Nat. Mathods 9(7), 671–675 (2012). 10.1038/nmeth.2089 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Falk T., et al. , “U-Net: deep learning for cell counting, detection, and morphometry,” Nat. Method 16, 67–70 (2019). 10.1038/s41592-018-0261-2 [DOI] [PubMed] [Google Scholar]
  • 42.Ho D. J., et al. , “Nuclei segmentation of fluorescence microscopy images using three dimensional convolutional neural networks,” in IEEE Conf. Comput. Vision and Pattern Recognit. Workshops, Honolulu, Hawaii, pp. 834–842 (2017). 10.1109/CVPRW.2017.116 [DOI] [Google Scholar]
  • 43.Sarder P., Nehorai A., “Deconvolution methods for 3-D fluorescence microscopy images,” IEEE Signal Process. Mag. 23(3), 32–45 (2006). 10.1109/MSP.2006.1628876 [DOI] [Google Scholar]
  • 44.Luisier F., Blu T., Unser M., “Image denoising in mixed Poisson–Gaussian noise,” IEEE Trans. Image Process. 20(3), 696–708 (2011). 10.1109/TIP.2010.2073477 [DOI] [PubMed] [Google Scholar]
  • 45.Ho D. J., et al. , “Nuclei detection and segmentation of fluorescence microscopy images using three dimensional convolutional neural networks,” in Proc. IEEE Int. Symp. Biomed. Imaging, Washington, DC, pp. 418–422 (2018). 10.1109/ISBI.2018.8363606 [DOI] [Google Scholar]
  • 46.Maurer C. R., Jr., Qi R., Raghavan V., “A linear time algorithm for computing exact Euclidean distance transforms of binary images in arbitrary dimensions,” IEEE Trans. Pattern Anal. Mach. Intell. 25(2), 265–270 (2003). 10.1109/TPAMI.2003.1177156 [DOI] [Google Scholar]
  • 47.Goodfellow I., et al. , “Generative adversarial nets,” in Proc. Adv. Neural Inf. Process. Syst., Montreal, Canada, pp. 2672–2680 (2014). [Google Scholar]
  • 48.Zhu J. Y., et al. , “Unpaired image-to-image translation using cycle-consistent adversarial networks,” in Proc. IEEE Int. Conf. Comput. Vision, Venice, Italy, pp. 2223–2232 (2017). 10.1109/ICCV.2017.244 [DOI] [Google Scholar]
  • 49.Fu C., et al. , “Three dimensional fluorescence microscopy image synthesis and segmentation,” in IEEE/CVF Conf. Comput. Vision and Pattern Recognit. Workshops, Salt Lake City, Utah, pp. 2302–2310 (2018). 10.1109/CVPRW.2018.00298 [DOI] [Google Scholar]
  • 50.Ho D. J., et al. , “Center-extraction-based three dimensional nuclei instance segmentation of fluorescence microscopy images,” in 2019 IEEE EMBS Int. Conf. Biomedical & Health Informatics (BHI), pp. 1–4 (2019). [Google Scholar]
  • 51.Fitzgibbon A., Pilu M., Fisher R. B., “Direct least square fitting of ellipses,” IEEE Trans. Pattern Anal. Mach. Intell. 21(5), 476–480 (1999). 10.1109/34.765658 [DOI] [Google Scholar]
  • 52.Paszke A., et al. , “Automatic differentiation in pytorch,” in Proc. Autodiff Workshop Adv. Neural Inf. Process. Syst., Long Beach, California, pp. 1–4 (2017). [Google Scholar]
  • 53.Kingma D. P., Ba J., “Adam: a method for stochastic optimization,” pp. 1–15, https://arxiv.org/abs/1412.6980 (2014).
  • 54.Ioffe S., Szegedy C., “Batch normalization: accelerating deep network training by reducing internal covariate shift,” pp. 1–11, https://arxiv.org/abs/1502.03167 (2015).
  • 55.Fawcett T., “An introduction to ROC analysis,” Pattern Recognit. Lett. 27(8), 861–874 (2006). 10.1016/j.patrec.2005.10.010 [DOI] [Google Scholar]
  • 56.Powers D. M. W., “Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation,” J. Mach. Learn. Technol. 2(1), 37–63 (2011). [Google Scholar]
  • 57.Yushkevich P. A., et al. , “User-guided 3D active contour segmentation of anatomical structures: significantly improved efficiency and reliability,” NeuroImage 31(3), 1116–1128 (2006). 10.1016/j.neuroimage.2006.01.015 [DOI] [PubMed] [Google Scholar]
  • 58.He K., et al. , “Deep residual learning for image recognition,” in Proc. IEEE Conf. Comput. Vision and Pattern Recognit., Las Vegas, Nevada, pp. 770–778 (2016). 10.1109/CVPR.2016.90 [DOI] [Google Scholar]
  • 59.Lin T.-Y., et al. , “Feature pyramid networks for object detection,” in Proc. IEEE Conf. Comput. Vision and Pattern Recognit., Honolulu, Hawaii, pp. 936–944 (2017). 10.1109/CVPR.2017.106 [DOI] [Google Scholar]
  • 60.Clendenon J. L., et al. , “Voxx: a PC-based, near real-time volume rendering system for biological microscopy,” Am. J. Physiol. 282(1), C213–C218 (2002). 10.1152/ajpcell.2002.282.1.C213 [DOI] [PubMed] [Google Scholar]

Articles from Journal of Medical Imaging are provided here courtesy of Society of Photo-Optical Instrumentation Engineers

RESOURCES