Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2009 Apr 28.
Published in final edited form as: Med Phys. 2008 Aug;35(8):3453–3461. doi: 10.1118/1.2948349

An Automated CT Based Lung Nodule Detection Scheme Using Geometric Analysis of Signed Distance Field

Jiantao Pu 1, Bin Zheng 1, Ken Leader 1, Xiao-Hui Wang 1, David Gur 1
PMCID: PMC2673651  NIHMSID: NIHMS105710  PMID: 18777905

Abstract

We present a new computerized scheme to automatically detect lung nodules depicted on computed tomography (CT) images. The procedure is performed in the signed distance field of the CT images. To obtain an accurate signed distance field, CT images are first interpolated linearly along the axial direction to form an isotropic dataset. Then, a lung segmentation strategy is applied to smooth the lung border aiming to include as many juxtapleural nodules as possible while minimizing over segmentation of the lung regions. Potential nodule regions are then detected by locating local maxima of signed distances in each sub-volume with values and the sizes larger than the smallest nodule of interest in the three-dimensional space. Finally, all detected candidates are scored by computing the similarity distance of their medial axis-like shapes obtained through a progressive clustering strategy combined with a Marching Cube algorithm from a sphere based shape. A free-response receiver operating characteristics (FROC) curve is computed to assess the scheme performance. A performance test on 52 low-dose CT screening examinations that depict 184 verified lung nodules showed that during the initial stage the scheme achieved an asymptotic maximum sensitivity of 95.1% (175/184) with an average of 1200 suspicious voxels per CT examination. The nine missed nodules included two small solid nodules (with a diameter ≤ 3.1mm) and seven non-solid nodules. The final performance level after the similarity scoring stage was an absolute sensitivity level, namely, including the nine missed during the initial stage, of 81.5% (150/184) with 6.5 false-positive identifications per CT examination. This preliminary study demonstrated the feasibility of applying a simple and robust geometric model using the signed distance field to identify suspicious lung nodules. In our dataset the sensitivity of this scheme was not affected by nodule size. In addition to potentially being a stand alone approach, the signed distance field based method can be easily implemented as an initial filtering step in other CAD schemes.

Keywords: Lung nodules, Computer-aided detection, Shape analysis, Signed distance field

I. INTRODUCTION

Multi-detector, thin-section, x-ray computed tomography (CT) has been widely recognized as a sensitive diagnostic imaging modality for the early detection of lung cancer [1-3], the leading cause of cancer-related deaths in USA [4]. However, the thin-section lung CT examinations result in a large number of images (e.g., as many as 600). While higher spatial resolution of thin-section CT images should enable earlier detection of smaller nodules [5-6], the task of viewing and interpreting these images on a routine basis is quite laborious resulting in large inter-observer variability [7, 8]. Hence, the development of computer-aided detection (CAD) schemes for the detection and classification of lung nodules has been the focus of a large number of investigations in recent years.

The different approaches to CAD for nodule detection can be divided into several broad categories. Intensity based schemes [9-12] explore the intensity (voxel value) properties of lung nodules as compared with the surrounding background (tissue structure). Using multiple threshold levels combined with feature analyses (e.g. shape, size), Giger et al. [10] computed the likelihood of a suspected region representing an actual nodule. The primary advantage of this approach is its simplicity but it is frequently difficult to determine robust threshold levels for different CT examinations often resulting in a relatively low sensitivity and a high false positive detection rate. Li et al. [12] used three enhancement filters for different shapes such as blobs, lines and planes based on an eigenvalues analysis of the Hessian matrix, which is relatively sensitive to noise due to the need for computing second derivatives of voxels' intensity values. Image transformation based approaches [13-15] transform CT images into an intuitive form by emphasizing specific features depicted on the original CT images (e.g. intensity gradient [13], normal overlapping [14], and Hessian matrix [15]). Lung nodules can thereby be visualized better on the transformed, “contrast enhanced” images. Template matching-based approaches [16-21] characterize geometric or intensity based features of lung nodules by using statistical models to search for matched object-templates in the image space. For example, Lee et al. [17] used one template matching based algorithm to detect nodules within the interior lung area and a different template matching algorithm near the lung wall areas to detect juxtapleural nodules. This method is highly scalable by simply adding specific features, as needed. However, the variability in nodule properties makes it difficult to define a global set of templates often resulting in low sensitivity levels and/or high false positive rates. In addition, the matching process is frequently computationally extensive. Last, machine learning classifiers [22-25] characterize nodules using features extracted directly from a reference database of known (verified) nodules and these features are inputted into a learning machine for training. This approach generally requires a large dataset for training and testing. For example, Suzuki et al. [24] described a massive training artificial neural network (MTANN) for this purpose.

Despite significant progress in this area, there are no optimal, widely accepted (and used) CAD schemes for nodule detection, to date. The difficulties in achieving this objective arise from the diversity in appearances of lung nodules (e.g. shape, size, intensity) and as important perhaps the varying appearances of normal tissue structures (e.g. vessels). Differences in CT image noise patterns can also affect CAD performance levels and robustness. We developed and tested a new geometric model based algorithm that aims to efficiently search for and detect lung nodules depicted on low dose CT images. Our hypothesis is that the use of an efficient geometric model based on the analysis of the signed distance field could minimize the impact of the large variability in lung nodule appearances and image based feature values on the performance of the scheme. This paper provides a description of the approach taken and reports our preliminary test results.

II. MATERIALS AND METHOD

Our proposed algorithm first segments the lung regions and computes the signed distance field to identify a set of local maxima points. These points are located in a large number of instances close to the center of the objects of interest including nodules and vessels and are regarded as the centers (seeds) of the suspicious regions for representing possible lung nodules. This process aims to minimize the search and analysis space while maintaining high sensitivity levels. Its performance determines the upper level of the CAD sensitivity since true-positive regions (actual nodules) missed during this stage will not be detected by the scheme. The algorithm then generates clustered volumes and analyzes their geometric shapes in order to rate the likelihood of each region being an actual lung nodule. Given a single lung CT image, our algorithm contains five basic steps [Fig. 1]: (1) linear interpolation along the axial direction to obtain an isotropic volume data set; (2) lung segmentation aimed to minimize possible losses of juxtapleural nodules; (3) signed distance field computation of the lung regions; (4) points of interest (POI) identification; and (5) geometric shape analysis of clustered volumes of all suspicious points (POI). Each step of the algorithm plays a unique role in the detection and classification of suspicious regions. A detailed description of the implementation follows.

Fig. 1.

Fig. 1

Schematic diagram of the lung nodule detection algorithm.

A. Isotropic Interpolation

Since our detection scheme relies on the analysis of the signed distance field of CT images, we first convert all voxels into 3D Cartesian-coordinate grids with uniform spatial resolution in the 3D space to reduce possible errors due to anisotropic representation of grids. Several interpolation techniques [26-28] have been developed for this purpose. Among them, the linear interpolation is widely used for isotropic data reconstruction because of its computational simplicity. Considering that the spatial resolution along the axial direction (slice thickness) in CT examinations is generally different from the spatial resolution within each slice, we simply perform a linear interpolation along the axial direction. Regardless of the actual resolution along the axial direction, an interpolated slice is computed in between all paired neighboring slices.

B. Lung Segmentation

Lung segmentation is often used as a preprocessing step in CAD schemes. Although threshold-based region-filling strategies [29-32] can be used to extract the lung parenchyma quite efficiently, these can also incorrectly exclude potentially important regions (i.e., juxtapleural nodules) when these are contiguous to the chest wall and have a similar density. Our proposed algorithm for correcting possible under-segmentation of juxtapleural nodules is illustrated in Fig. 2. This algorithm consists of two primary processes.

Fig. 2.

Fig. 2

Flowchart of the lung segmentation algorithm: (a) an original 2D CT image; (b) initial thresholded results; (c) non-lung region flooding; (d) enlargement of (c); (e) inner lung border tracing; and (f) lung border smoothing.

1) Threshold-based lung region extraction

This stage removes all non-lung regions from the CT images using three steps. (a)Gaussian Smoothing of the 3D CT Images [Fig.2 (a)] is used to remove isolated pockets of air between the patient and the CT bed that could lead to small contours after the lung border tracing. These small contours could be incorrectly regarded as lung borders. In our implementation, a 3D recursive Gaussian filter with a standard deviation of 1.0 mm [33] is used for this purpose because it requires a small number of multiplications and additions and it is independent of kernel size; (b)Gray-level Thresholding [Fig.2 (b)] at -500 HU is used as the threshold to identify initial lung regions [34]; (c)Floodfill Non-lung Region [Fig.2 (c)] is used to remove any remaining non-lung regions after the thresholding operation. A 2D flooding operation is performed slice by slice using the pixels along boundaries of the segmented lung image as flooding seeds. The reason for using a 2D rather than 3D flooding operation is that a 3D operation may lead the vascular tree to extend far into the image resulting in missing parts of the lung regions.

2) Lung border extraction and boundary smoothing

Two geometry based techniques are employed to extract the lung boundary and assure the inclusion of juxtapleural nodules, if any. (a)Lung Boundary Extraction [Fig.2 (e)] is performed using the inner border tracing algorithm on the first points in the left-to-right and right-to-left scan lines as seeds for lung boundary tracing. This method ignores the central regions altogether. The boundaries of the lungs in each slice are traced as a series of pixels, thus forming a collection of “directed closed anti-clockwise” contours; (b)Lung Boundary Smoothing [Fig.2 (f)] is performed using the Jarvis March algorithm [35] that computes locally rather than globally the convex hull where juxtapleural nodules may be located. First, a point with the largest x or y coordinate located on the convex region of the contour is selected as a seed. Then, the point that makes the largest right-hand turn within a local region along the contour is identified using the approach described in [35], where the “turn” rule is used to determined whether a point is located on the left side or right side of a given contour (or a line). A local region is determined by a “starting point” and a “length” representing the size of a possible juxtapleural nodule. The starting point is then replaced by the identified point with the “largest turn” and the previous step is repeated until the contour is traversed. Finally, a new contour is formed by connecting all the rightmost (or leftmost) points identified in this process and this contour is used as a representation of the smoothed lung boundary.

C. Signed Distance Field

Given a region Ω, its signed distance function determines the closest distance between a given point p and the boundary ∂Ω. This function is negative in the interior region, Ω-, zero on the boundary ∂Ω, and positive in the exterior region Ω+. The signed distance function d can be represented as:

d(x)=sgn(x)minxpfor allpΩ (1)

where

sgn(x)={1ifxΩ+0ifxΩ1ifxΩ} (2)

In an n-dimensional space, the signed distance field has a dimension of n+1. There are a number of choices for the distance metric such as chessboard [36] depending on the specific application of interest. We chose the Euclidean distance because it is intuitive and has a meaningful physical representation. A number of methods have been proposed [37] for an efficient computation of the signed distance. We chose the Fast Marching Method (FMM) [38] which is based on the fact that the signed distance field can be described by the Eikonal equation:

u=(ux)2+(uz)2+(uz)2=1subject touΩ=0 (3)

where u denotes the signed distance from a voxel to the boundary that is determined by a selected threshold as discussed below, ∇ denotes the gradient, and ∂Ω represents the boundary. To solve the nonlinear Eikonal equation numerically, Sethian developed an FMM [38] for a Cartesian grid. The use of FMM reduces the computational complexity from O(N 2) in the traditional “brute force” approach to O(N log N), where N is the total number of grid points. A detailed description of the FMM method has been described elsewhere [39].

In our implementation, we first generated binary CT images by setting a specific threshold (i.e., -600HU). The resulting images are used as an input dataset to the FMM for computing the signed distance field. The region above the threshold is considered “exterior” thereby; all nodule regions have positive distance values. The performance of the algorithm may depend on the selection of this threshold because it could influence the boundary locations and the shapes quantified by the distance transform. Therefore, we randomly selected a subset of 15 CT examinations and systematically tested a series of threshold values from 0 to -1000 HU (at -50 HU increments) to examine the relationship between detection sensitivity and the average number of detected voxels representing false-positive identifications in this testing dataset. The goal is to semi-empirically select a threshold that covers most of the true-positive nodules while maintaining a reasonable number of false-positive identifications. We note that this is the only parameter in the scheme that was selected semi-empirically.

D. Points of Interest (POIs)

After applying the FMM to the segmented lungs, we generated a 3D image dataset recording the distance field. The value at each voxel represents the shortest distance from the point to the boundary. Tissue with positive distance values in the lung region generally appears as one of three shapes: a blob (e.g., nodule), a sheet (e.g., fissure), and a tube (e.g. vessel) [40]. The distance values along the axis of a vessel generally reach a maximum at the point where the vessel “enters” the lung because the size of a vessel decreases progressively toward the lung parenchyma. Ideally, for each vessel, there should be only one point with a maximum distance value regardless of its length. A lung nodule typically appears as a local concentration with a “blob-like” shape. The distribution of the distance values within a lung nodule decrease from the center of the nodule to its boundary with the maximum distance value typically located at the center. For a spherical nodule there should be only one point with the maximum value located at the center of the sphere. An ideal sheet-like shape (e.g. fissure), typically has multiple maxima points and these often have similar, if not equal, distance values forming a plane.

Based on these characteristics we limited our focus only on voxels with the maximum local distance values termed here “points of interest” (POIs). To reduce the total number of POIs while minimizing losses of true-positive lung nodules, we only selected for analysis voxels with maximal distance values within specific local volumes. We are interested in nodules larger than 3mm in size hence, we define the volume of interest of a local region as a 3mm×3mm×3mm cube. To minimize possible impact of image noise that may be incorrectly identified as a POI, we set a threshold of 0.5mm on the distance value when selecting POIs.

E. Geometric Analysis

Under an ideal situation where all nodules are in the form of spheres and vessels are in the forms of truncated cones, the strategy described above could yield high performance in detecting lung nodules. Unfortunately, this is not the case due to a number of reasons including but not limited to the discrete representation of an image, image noise, varying CT scanning parameters, and image reconstruction artifacts. As a result a large number of POIs are actually located at non-nodule tissue regions leading to false positive detections. Fig. 3 shows several examples of possible locations where POIs may be located.

Fig. 3.

Fig. 3

Examples of four POI locations

To classify POIs into two categories namely, located within nodules and located within non-nodule tissues, we tested a shape based analysis approach in which a progressive clustering is performed at each POI and a shape similarity distance to a perfect sphere is computed as the likelihood score (probability) of the region being an actual nodule. This analysis includes two stages.

a) Progressive clustering

To estimate the shape of a region identified by a POI, a progressive clustering operation is used. Using the POI as a seed the 26 neighboring voxels in the 3D space are assessed and the voxel with the maximum distance value is clustered with the POI. Thereafter, the new search volume covers all neighboring voxels of the clustered volume from the previous step and the process is repeated. To minimize the computational cost of the clustering operation while capturing the essential shape of an object associated with the initial clustering seed (POI), we set the following “stop operation” rules: (a) the Euclidean distance from the last clustered point to the initiating POI is larger than 15mm; or (b) the signed distance difference between the last clustered voxel and the initiating POI is larger than the resolution of the isotropic (interpolated) 3D image. When a starting POI is located within a vessel, the first “stop operation” rule frequently terminates the clustering process quickly. When the second “stop operation” condition is met, a layer around the POI is formed, namely the POI is fully enclosed by clustered voxels. This is sufficient to compute the local shape of a given POI hence; the second rule is employed primarily to minimize computational cost. This progressive clustering method generates (grows) a 3D volume in a manner that is similar to skeletonization (i.e., medial axis extraction) of an object. Given a blob-like shape such as a nodule, the clustered volume should be similar to a sphere. For a tube-like shape such as a vessel, the clustered volume should be similar to a string; and for a plane-like shape such as a fissure, the clustered volume should be similar to a plane. As a result it is intuitive to use the similarity distance between a given clustered volume and a sphere as a summary measure associated with the probability that a clustered 3D volume actually represents a lung nodule. Using the Marching Cube Algorithm (MCA), we transformed the clustered volume into a surface-based representation that consists of a set of triangles. Fig. 4 shows the modeled result after the MCA transform is applied to the examples shown in Fig. 3.

Fig. 4.

Fig. 4

Surface-based representations of the clustered volumes representing the POIs in Fig. 3.

2) Shape similarity analysis

A large number of approaches have been proposed for computing similarity between shapes [41-46] and these are summarized in a review article [47]. We are interested here in similarity between an arbitrary shape and a sphere (e.g., an “ideal” nodule). We used the statistical based approach described by Osada et al. [46] because it is simple to implement, computationally efficient and is invariant to fundamental geometrical transformations such as scaling, translation, and rotation. In addition, it is largely insensitive to noise, image blur and “cracks” in a three-dimensional model. The method includes the following steps: (a) Shape Function Selection is simply done by measuring the Euclidean distance between two points on the surface of a 3D shape; (b) Shape Distribution Construction is performed by sampling an adequate number of point pairs to build the associated shape distribution. Of concern during this process is that the implemented sampling strategy should result in reasonably evenly distributed sampled points on the surface of an object; (c) Shape Distribution Normalization is performed to compensate for possible size effects on the shape distribution. We used a normalization approach that aligns the two distributions being compared by their mean values [46] because of its simplicity and applicability to our specific application. For example, Fig. 5 shows shape distributions of three typical representations after normalization and Fig. 6 shows the shape distributions of the objects presented in Fig. 4 after normalization; and (d) Shape Distribution Comparison between paired shape distributions is computed as the sums of distance differences of the corresponding vertices used to represent these distributions:

S(H1,H2)=i=0n(H1(i)H2(i))2 (4)

where H1 and H2 are the two histograms representing the distributions, S is their similarity, and n is the number of vertices forming the histograms. In this study, H1 represents the histogram of a sphere and H2 is the computed histogram of a given clustered (3D) volume of interest. We used S(H1, H2) as a likelihood score for a clustered volume representing a true-positive nodule. To assess the performance of the scheme we computed free-response receiver operating characteristic (FROC) curves.

Fig. 5.

Fig. 5

Shape distributions of a sphere, a tube and a plane, each in the form of a surface mesh.

Fig. 6.

Fig. 6

Shape distributions of the objects displayed in Fig. 4.

F. Testing a CT Dataset

CT images from 52 low-dose screening CT examinations were used in this preliminary study. A detailed description of the CT data acquisition (including the x-ray dose and image reconstruction) protocols has been reported elsewhere [8]. In brief, these examinations were acquired using LightSpeed Plus 4-MDCT or LightSpeed Ultra 8-MDCT scanners (GE Healthcare). The low-dose CT acquisition protocol varied slightly depending on patient size. Exposure settings ranged from 120 to 140 kVp, 29.7 ± 10.7 mAs, with pixel reconstruction dimensions ranging from 0.6 to 0.98 mm. Contiguous (non-overlapping) slices (“volume scans”) encompassing the entire lung areas were acquired. Images were reconstructed with 512×512 pixel matrices using the GE Healthcare “lung” reconstruction kernel. Slice thickness was 2.5mm in all examinations. Three board-certified chest radiologists with 3, 21, and 24 years of experience in reading chest CT images, independently read the dataset. A total of 204 lung nodules were identified and classified by agreement of the independent readings and/or a second review leading to consensus in cases of initial disagreement as non-calcified nodules. Among these nodules, 184 were determined to be of interest ranging in size from 3mm to 28.9mm and 20 were determined to be of sizes less than 3.0mm. Forty four of the 184 nodules that were larger than 3.0mm were classified as “part-solid”, 16 were classified as “non-solid”, and the remaining 124 were classified as “solid”. Fifty eight of the nodules were located at the pleural regions. The size, location and density distributions of the 184 nodules of interest are summarized in Table 1. A reference (“truth”) file was generated to record the type (e.g., solid, non-solid, part-solid), size (including estimated larges axis and effective size), and the center coordinate of each individual nodule.

Table 1.

The distribution by size, location and type of the 184 nodules of interest in our dataset as detected by the radiologists

Categories of nodules Diameter (mm)
Location
Density
3-6 ≥6 Juxtapleural non-juxtapleural solid partial solid non-solid
Number of nodules 56 128 58 126 124 44 16

Our computerized scheme was directly applied to all CT examinations in this dataset without any operator intervention. A nodule was considered “detected” if the distance between the center of a CAD identified nodule and the recorded center of the nodule in the reference file was smaller than the recorded nodule “radius” (half of the effective size). Otherwise, the detected voxel represented a growth seed of a false-positive identification.

III. RESULT

Our experimental results in the 15 randomly selected examinations show that using different threshold values affects the performance curves for the signed distance field based algorithm (Fig. 7). As threshold value decreased from 0 to -600 HU, both detection sensitivity and average number of false-positive identifications per examination increased steadily. In this range sensitivity levels increased faster than false-positive identification rates. As threshold levels decreased below -600HU the detection sensitivity reached a plateau while false-positive identification rate increased rapidly. When the threshold was set below -850HU both sensitivity and false-positive identification rates became somewhat unstable. As a result, we set the threshold at -600HU for the performance analysis. Using this semi-empirically selected threshold, the initial four steps of the algorithm that are based on the local maxima strategy in the signed distance field identified a large number of POIs in each of 52 CT examinations, ranging from 663 to 2287 with an average of 1200 POIs per examination. Among these POIs, 175 of the 184 lung nodules of interest were detected, resulting in a maximum sensitivity level for the scheme of 95.1%. The algorithm missed nine nodules in this dataset including two solid nodules with a diameter 3.1mm and 3.0mm, respectively, and seven non-solid nodules ranging in size from 10.0 to 18.4mm. Thus, the maximum achievable sensitivity of the algorithm in this dataset was 98.8% (166 out of 168) for detecting solid and part-solid nodules and 56.3% (9 of 16) for detecting non-solid nodules, respectively. We note that at this stage all juxtapleural nodules depicted in the dataset were detected.

Fig. 7.

Fig. 7

The relationship between the selected threshold for signed distance field and the number of POIs (top) and the relationship between the selected threshold for signed distance field and the sensitivity (bottom).

The results of the last step of our computerized scheme namely, the geometric shape analysis are summarized by four free-response receiver operating characteristic performance (FROC) curves (Fig. 8). Two curves represent performance levels for the detection nodules equal to or larger than 3.0mm and 6.0mm in size, respectively. The other two curves represent the performance of the scheme for the detection of either solid or part-solid nodules and for the detection of non-solid nodules, respectively. As an example, at a false-positive rate of 6.5 identifications per examination, the detection sensitivity levels of our scheme were 81.5% (150/184) and 82.8% (106/128) for non-calcified nodules larger than 3mm and larger than 6mm, respectively. At the same false positive rates, the detection sensitivity levels for the solid and non-solid nodules are 84.5% (142/168) and 50% (8/16), respectively. These results suggest that the detection sensitivity of our scheme was not substantially affected by nodule size. At a false-positive rate of 6.5 identifications per examination, the detection sensitivity for non-solid nodules was 50% (8/16).

Fig. 8.

Fig. 8

Four performance (FROC) curves for nodule detection representing different sizes of interest (≥3mm and ≤6mm) and different types (solid and non-solid).

IV. DISCUSSION

The computerized scheme developed and tested in this study has a number of advantages as compared with other previously developed schemes. First, it is simple and relatively easy to implement. The actual detection process is reduced to an analysis of the signed distance field of the CT images. The number of detected suspicious voxels depends on the initial threshold value that represents “the lowest” voxel value of a nodule boundary. Generally, because the low conspicuity non-solid and small solid nodules result in lower values in their boundary contours, lower threshold value is required to increase detection sensitivity to these subtle nodules. The threshold of -600HU was semi-empirically selected based on limited experimental data as shown in Fig 7. As a result, the algorithm implemented with this threshold detects initially (before pruning) on average approximately 1200 voxels per examination as possible centers of suspected nodules. This number is in the same range of other previously reported schemes. For example, one study reported an initial sensitivity of 94.4% with 906 false-positive identifications per examination when applied to a set of CT scans acquired at a significantly higher dose (e.g., 120/140 kVP and 220/290 mAs) [49]. Second, the geometric shape is the only classification criterion in determining the likelihood of a clustered 3D volume representing an actual nodule. Typically, shape analysis requires the segmentation of the suspicious regions that may often fail in particular in the case of nodules connected to the other lung tissues (e.g., vessels, airways, and arteries). Similar to a medial axis extraction process, we simply cluster the voxels with the largest signed distances within a local 3D search space. This clustering approach avoids the requirement of nodule (or region) segmentation and the resulting volume reflects a local shape. Thus, the probability of being an actual nodule is computed as the similarity distance to a sphere using a shape distribution approach. A unique characteristic of the clustering approach presented here is that our approach is relatively insensitive to nodule size (as shown in Fig 8). Other reference shapes could be implemented as well but this work is beyond the scope of this preliminary study.

The implementation of the signed distance field and shape similarity computation is computationally efficient. For a typical CT dataset used in this study namely, approximately 110 slices with a slice thickness 2.5mm, the procedure including the isotropic interpolation, segmentation, signed distance computation and shape analysis requires approximately 5 minutes using a desktop PC computer (AMD Athlon™ 64×2 Dual 2.11 GHz with 2GB of RAM) with the majority of the computation time spent on the lung area segmentation operation and the signed distance computation.

This method is also potentially applicable to other CAD schemes. This algorithm could be easily adopted as a preprocessing stage to other CAD schemes due to its simplicity, efficiency and the limited number output. Another unique characteristic is that the POI detected by this scheme is located in the majority instances at (or close to) the center of the suspicious candidate region. This may be useful for other nodule segmentation methods that require knowledge of the centers of suspected regions for appropriate analyses. For example, the algorithm proposed in [48] required a manual identification of a nodule location when used for nodule segmentation. For the purpose of user interface design, it is convenient to control the size range of nodules of interest by setting an appropriate threshold for the signed distance value, which conservatively indicates a nodule size.

Despite these advantages we note that the proposed method did miss a small fraction of nodules namely, 2 small solid and 7 non-solid nodules in our dataset. The missed nodules are in two categories namely, small solid nodules with sizes very close to the 3mm cutoff and several larger non-solid nodules with varying sizes. Missing the small solid nodules can be caused due to our use of a conservative threshold value for size (e.g. 0.5mm) and possible errors generated by radiologists in measuring the actual size of these nodules. The missed non-solid nodules were primarily the result of the initial low intensity threshold (i.e., -600HU) we selected and the discrete nature of some of the voxels identified within the POIs thereby our inability to cluster these into an object. A lower threshold would have resulted in the detection of some of these non-solid, low contrast nodules. However, this would have been done at the cost of substantially increasing the number of initially detected POIs (as shown in Fig. 7). We are currently investigating different approaches to optimize the threshold for this purpose including the possible use of other effective features specifically implemented for distinguishing non-solid nodules. We also note that one of the possible disadvantages of this approach is the initial false-positive detection rate that is somewhat higher than those previously reported for other schemes [13, 50]. However, the CT examinations used in this study were acquired at substantially lower doses (29.7 ± 10.7mAs). Hence, image noise levels in our datasets are relatively high even for “low dose scanning”. The false positive identifications are located primarily at two types of positions namely, bifurcation of vessels and the hilum region. At these locations the clustered volumes frequently looked like spheres. Hence, other features may be required to reduce at least some of these false positive identifications and this topic is currently under investigation. We are also exploring the use of other reference shapes including but not limited to various ellipses as well as the average size dependent representative shapes of actual ensembles of nodules in our databases.

Operationally, this method is memory intensive due to the requirements of the FMM algorithm. Given a dataset after isotropic interpolation (400~600 slices), it is easy to exceed 2.0GB if no prior action is taken. To alleviate this memory issue, we divided the dataset into sub-volumes. Last, the first two steps, namely, isotropic interpolation and lung segmentation, are used to preprocess the images sets. We recognized that the isotropic interpolation only approximated the actual tissue structure being imaged. The impact of this approach could be substantially reduced by scanning with smaller (thinner) slice thicknesses (e.g. 1.25mm or 0.625mm). However, this preliminary study suggests that even when using CT images with 2.5mm slice thickness, the interpolation method is quite adequate for our purpose because we primarily focus on approximating the continuity of lung vessels.

V. CONLCUSION

We developed and tested a new and relatively simple computerized scheme for lung nodule detection based on geometric analysis of the signed distance field. This scheme was tested using 52 low-dose screening CT examinations depicting 184 nodules of interest, including 44 part-solid and 16 non-solid nodules. The preliminary test results are encouraging in that the sensitivity of the scheme was not significantly affected by nodule size. In summary, our computerized scheme is efficient and relatively easy to implement and does not require an extensive training process that is required in other CAD schemes that include multi-image feature based machine learning classifiers. With the signed distance attached to each suspicious candidate, this scheme can support a flexible user interface allowing users to change the focus of analysis on specific nodules of interest with specific pre-determined sizes. The algorithms used in this scheme can also be relatively easily adopted as the first detection stage of other CAD schemes.

ACKNOWLEDGEMENT

This work is supported in part by grants P50 HL084948 and R01 HL085096 from National Heart, Lung, and Blood Institute, National Institutes of Health, to the University of Pittsburgh.

REFERENCES

  • [1].Kaneko M, Eguchi K, Ohmatsu H, Naruke T, Suemasu K, Moriyama N. “Peripheral lung cancer: screening and detection with low-dose spiral CT versus radiography,”. Radiology. 1996;Vol. 201:798–802. doi: 10.1148/radiology.201.3.8939234. [DOI] [PubMed] [Google Scholar]
  • [2].Diederich S, Lenzen H, Windmann R, Puskas R,Z, Yelbuz TM, Henneken S, Klaiber T, Eameri M, Roos N, Peters PE. “Pulmonary nodules: experimental and clinical studies at lowdose CT,”. Radiology. 1999;Vol. 213:289–298. doi: 10.1148/radiology.213.1.r99oc29289. [DOI] [PubMed] [Google Scholar]
  • [3].Ko JP, Naidich DP. “Lung nodule detection and characterization with multislice CT,”. Radiologic Clinics of North America. 2003;Vol. 41:575–597. doi: 10.1016/s0033-8389(03)00031-9. [DOI] [PubMed] [Google Scholar]
  • [4].Detailed guide: what are the key statistics about lung cancer. 2007 October 15; http://www.cancer.org/docroot/
  • [5].Fischback F, Knollmann F, Griesshaber V, Freund T, Akkol E, Felix R. “Detection of pulmonary nodules by multislice computed tomography: Improved detection rate with reduced slice thickness,”. European Radiology. 2003;Vol. 13:2378–2383. doi: 10.1007/s00330-003-1915-7. [DOI] [PubMed] [Google Scholar]
  • [6].Rubin GD, Lyo J, Paik DS, Sherbondy A, Chow L, Leung AN, Mindelzun R, Zinck SE, Naidich DP, Napel S. “Pulmonary nodules in MDCT scans: impact of computer-aided detection,”. Radiology. 2005;Vol. 235:274–283. doi: 10.1148/radiol.2341040589. [DOI] [PubMed] [Google Scholar]
  • [7].Ohtsuka T, Nomori H, Horio H, Naruke T, Suemasu K. “Radiological examination for peripheral lung cancers and benign nodules less than 10 mm,”. Lung Cancer. 2003;vol. 42:291–296. doi: 10.1016/s0169-5002(03)00360-x. [DOI] [PubMed] [Google Scholar]
  • [8].Leader JK, Warfel TE, Fuhrman CR, Golla SK, Weissfeld JL, Avila RS, Turner WD, Zheng B. “Pulmonary nodule detection with low-dose computed tomography of the lung: agreement among radiologists,”. American Journal of Roentgenology. 2005;Vol. 185:973–978. doi: 10.2214/AJR.04.1225. [DOI] [PubMed] [Google Scholar]
  • [9].Ko JP, Betke M. “Chest CT: automated nodule detection and assessment of change over time - preliminary experience,”. Radiology. 2001;Vol. 218:267–273. doi: 10.1148/radiology.218.1.r01ja39267. [DOI] [PubMed] [Google Scholar]
  • [10].Giger M, Bae KT, MacMahon H. “Computerized detection of pulmonary nodules in computed tomography images,”. Investigative radiology. 1994;Vol. 29:459–465. doi: 10.1097/00004424-199404000-00013. [DOI] [PubMed] [Google Scholar]
  • [11].Armato SG, III, Giger ML, MacMahon H. “Automated detection of lung nodules in CT scans: preliminary results,”. Medical Physics. 2001;Vol. 28(8):1552–1561. doi: 10.1118/1.1387272. [DOI] [PubMed] [Google Scholar]
  • [12].Li Q, Sone S, Doi K. “Selective enhancement filters for nodules, vessels, and airway walls in two- and three-dimensional CT scans,”. Medical Physics. 2003;Vol. 30:2040–2051. doi: 10.1118/1.1581411. [DOI] [PubMed] [Google Scholar]
  • [13].Roy AS, Armato SG, III, Wilson A, Drukker K. “Automated detection of lung nodules in CT scans: false-positive reduction with the radial-gradient index,”. Medical Physics. 2006;Vol. 33(4):1133–1140. doi: 10.1118/1.2178450. [DOI] [PubMed] [Google Scholar]
  • [14].Paik DS, Beaulieu CF, Rubin GD, Acar B, Jeffrey RB, Yee J, Dey J, Napel S. Surface normal overlap: a computer-aided detection algorithm with application to colonic polyps and lung nodules in helical CT. IEEE Trans. Medical Imaging. 2004;Vol. 23(6):661–75. doi: 10.1109/tmi.2004.826362. [DOI] [PubMed] [Google Scholar]
  • [15].Sahiner B, Hadjiiski LM, Chan HP, Zhou C, Wei J. Computerized lung nodule detection on screening CT Scans: performance on juxta-pleural and internal nodules. In Proceedings of the SPIE (Medical Imaging 2006: Image Processing) 2006;Vol. 6144:1855–1860. [Google Scholar]
  • [16].Okada K, Comaniciu D, Krishnan A. Robust anisotropic Gaussian fitting for volumetric characterization of pulmonary nodules in multislice CT. IEEE Trans. Medical Imaging. 2005;Vol. 24(3):409–423. doi: 10.1109/tmi.2004.843172. [DOI] [PubMed] [Google Scholar]
  • [17].Lee Y. Automated detection of pulmonary nodules in helical CT images based on an improved template-matching technique. IEEE Trans. Medical Imaging. 2001;Vol. 20(7):595–604. doi: 10.1109/42.932744. [DOI] [PubMed] [Google Scholar]
  • [18].Miwa T, Kako J, Yamamoto S, Matsumoto M, Tateno Y, Iinuma T, Matsumoto T. Automatic detection of lung cancers in chest CT images by the variable N-Quoit filter. Systems and Computers in Japan. 2002;Vol. 33(1):53–63. [Google Scholar]
  • [19].Brown MS, McNitt-Gray MF, Goldin JG, Suh RD, Sayre JW, Aberle DR. Patient-specific models for lung nodule detection and surveillance in CT images. IEEE Trans. Medical Imaging. 2001;Vol. 20(12):1242–1250. doi: 10.1109/42.974919. [DOI] [PubMed] [Google Scholar]
  • [20].Osman O, Ozekes S, Ucan ON. Lung nodule diagnosis using 3D template matching. Computer in Biology and Medicine. 2007;Vol. 37(8):1167–1172. doi: 10.1016/j.compbiomed.2006.10.007. [DOI] [PubMed] [Google Scholar]
  • [21].Takizawa H, Yamamoto S. Recognition of lung nodules from X-ray CT images using 3D Markov random field models. In Proceedings of 16th International Conference on Pattern Recognition. 2002;Vol. 1:99–102. [Google Scholar]
  • [22].Zhang X, McLennan G, Hoffman EA, Sonka M. Computerized detection of pulmonary nodules using cellular neural networks in CT images. In Proceedings of the SPIE (Medical Imaging 2004: Image Processing) 2004;Vol. 5370:30–41. [Google Scholar]
  • [23].Takizawa H, Yamamoto S, Shiina T. Accuracy improvement of pulmonary nodule detection based on spatial statistical analysis of thoracic CT scans. IEICE Trans. Information and System. 2007;Vol. E90-D(8):1168–1174. [Google Scholar]
  • [24].Suzuki K, Armato SG, III, Li F, Sone S, Doi K. Massive training artificial neural network (MTANN) for reduction of lung nodules in low-dose computed tomography. Medical Physics. 2003;Vol. 30((7):1602–1617. doi: 10.1118/1.1580485. [DOI] [PubMed] [Google Scholar]
  • [25].Su H, Qian W, Sankar R, Sun X. A new knowledge-based lung nodule detection system. In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. 2004;Vol. 5:445–448. [Google Scholar]
  • [26].Goshtasby A, Turner DA, Ackerman LV. Matching of tomographic slices for interpolation. IEEE Trans. Medical Imaging. 1992;Vol. 11(4):507–516. doi: 10.1109/42.192686. [DOI] [PubMed] [Google Scholar]
  • [27].Lee TY, Wang WH. Morphology-based three-dimensional interpolation. IEEE Trans. Medical Imaging. 2000;Vol. 19(7):711–721. doi: 10.1109/42.875193. [DOI] [PubMed] [Google Scholar]
  • [28].Raya SP, Udupa JK. Shape-based interpolation of multi-dimensional objects. IEEE Trans. Medical Imaging. 1990;Vol. 9(1):32–42. doi: 10.1109/42.52980. [DOI] [PubMed] [Google Scholar]
  • [29].Lee Y, Hara T, Fujita H, Itoh S, Ishigaki T. Automated detection of pulmonary nodules in helical CT images based on an improved template-matching technique. IEEE Trans. Medical Imaging. 2001;Vol. 20(7):595–604. doi: 10.1109/42.932744. [DOI] [PubMed] [Google Scholar]
  • [30].Armato SG, III, Giger MI, Moran CJ, Blackburn JT, Doi K, MacMabon H. Computerized detection of pulmonary nodules on CT scans. Radiographics. 1999;Vol. 19:1303–1311. doi: 10.1148/radiographics.19.5.g99se181303. [DOI] [PubMed] [Google Scholar]
  • [31].Li Q, Katsuragawa S, Doi K. Computer aided diagnostic scheme for lung nodule detection in digital chest radiographs by use of a multiple-template matching technique. Medical Physics. 2001;Vol. 28(10):2070–2076. doi: 10.1118/1.1406517. [DOI] [PubMed] [Google Scholar]
  • [32].Hu S, Hoffman EA, Reinhardt JM. Automatic lung segmentation for accurate quantitation of volumetric X-Ray CT images. IEEE Trans. Medical Imaging. 2001;Vol. 20(6):490–498. doi: 10.1109/42.929615. [DOI] [PubMed] [Google Scholar]
  • [33].Young IT, van Vliet LJ. Recursive implementation of the Gaussian filter. Signal Processing. 1995;Vol. 44:139–-151. [Google Scholar]
  • [34].Wu M, Chang J, Chiang AA, Lu J, Hsu H, Hsu W, Yang C. Use of quantitative CT to predict postoperative lung function in patients with lung cancer. Radiology. 1994;Vol. 191:257–262. doi: 10.1148/radiology.191.1.8134584. [DOI] [PubMed] [Google Scholar]
  • [35].Cormen TH, Leiserson CE, Rivest RL, Stein C. Second Edition MIT Press and McGraw-Hill; 2001. Introduction to algorithms. [Google Scholar]
  • [36].Rosenfeld A, Pfaltz JL. Distance functions on digital pictures. Pattern Recognition. 1968;Vol. 1(1):33–61. [Google Scholar]
  • [37].Jones MW, Barentzen JA, Sramek M. 3D distance fields: a survey of techniques and applications. IEEE Trans. Visualization and Computer Graphics. 2006;Vol. 12(4):581–599. doi: 10.1109/TVCG.2006.56. [DOI] [PubMed] [Google Scholar]
  • [38].Sethian JA. A fast marching level set method for monotonically advancing fronts; Proceedings of the National Academy of Sciences; 1996. pp. 1591–1595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [39].Sethian JA. Cambridge University Press; Cambridge, U.K.: 1999. Level set methods and fast marching methods: evolving interfaces in computational geometry, fluid mechanics, computer vision and materials sciences. [Google Scholar]
  • [40].Danielsson PE, Lin Q. Efficient Detection of Second-Degree Variantion in 2D and 3D images. Journal of Visual Communication and Image Representation. 2001;Vol. 12:255–305. [Google Scholar]
  • [41].Funkhouser T, Min P, Kazhdan M, Chen J, Halderman A, Dobkin D, Jacobs D. A search engine for 3D models. ACM Trans. Graphics. 2003;Vol. 22(1):83–105. [Google Scholar]
  • [42].Kazhdan M, Chazelle B, Dobkin D, Funkhouser T, Rusinkiewicz S. A reflective symmetry descriptor for 3D models. Algorithmica. 2003;Vol. 38(2):201–225. [Google Scholar]
  • [43].Hilaga M, Shinaagagawa Y, Kohmura T, Kunii TL. Topology matching for fully automatic similarity estimation of 3D shapes. Proceedings of SIGGRAPH 2001; Los Angeles, USA: 2001. pp. 203–212. [Google Scholar]
  • [44].Chen DY, Tian XP, Shen YT, Ouhyoung M. On visual similarity based 3D model retrieval. Computer Graphics Forum (Eurographics'2003) 2003;Vol. 22(3):223–232. [Google Scholar]
  • [45].Pu J, Ramani K. On visual similarity based 2D drawing retrieval. Computer aided design. 2006;Vol. 38(3):249–259. [Google Scholar]
  • [46].Osada R, Funkhouser T, Chazelle B, Dobkin D. Shape distribution. ACM Trans Graphics. 2002;Vol. 21(4):807–832. [Google Scholar]
  • [47].Bustos B, Keim D, Saupe D, Schreck T, Vranic D. Feature-based similarity search in 3D object databases. ACM Computing Surveys. 2005;Vol. 37(4):345–387. [Google Scholar]
  • [48].Zhao B, Yankelevitz D, Reeves A, Henschke C. Two-dimensional multi-criterion segmentation of pulmonary nodules on helical CT images. IEEE Trans. Medical Imaging. 2003;Vol. 26(6):889–895. doi: 10.1118/1.598605. [DOI] [PubMed] [Google Scholar]
  • [49].Zhao BS, Gamsu G, Ginsburg MS, Jiang L, Schwartz LH. Automatic detection of small lung nodules on CT utilizing a local density maximum algorithm. Journal of Applied Clinical Medical Physics. 2003;Vol. 4(3):248–260. doi: 10.1120/jacmp.v4i3.2522. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [50].Fukano G, Takizawa H, Shigemoto K, Yamamoto S, Matsumoto T, Tateno Y, Iinuma T. Recognition method of lung nodules using blood vessel extraction techniques and 3-D object models. In Proceedings of the SPIE (Medical Imaging 2003: Image Processing) 2003;Vol. 5032:190–196. [Google Scholar]

RESOURCES