Abstract
Purpose: The purpose of this work was to develop and validate a computer-aided method for the 3D segmentation of lymph nodes in CT images. The proposed method can be utilized to facilitate applications like biopsy planning, image guided radiation treatment, or assessment of response to therapy.
Methods: An optimal surface finding based lymph node segmentation method was developed. Based on the approximate center point of a lymph node of interest, a graph is generated, which represents the local neighborhood around the lymph node at discrete locations (graph nodes). A cost function is calculated based on a weighted edge and region homogeneity term. By means of optimization, a surface-based segmentation of the lymph node is derived. In addition, an interactive segmentation refinement algorithm was developed, which allows the user to quickly correct segmentation errors, if needed. For assessment of segmentation accuracy, 111 lymph nodes of mediastinum, abdomen, head/neck, and axillary regions from 35 volumetric CT scans were utilized. For accuracy analysis, lymph nodes were divided into three test sets based on lymph node size and spatial resolution of the CT scan. The average lymph node size for test set I, II, and III was 1056, 1621, and 501 mm3, respectively. Spatial resolution of test set II was lower than for test sets I and III. To generate an independent reference standard for comparison, all 111 lymph nodes were segmented by an expert with a live wire approach.
Results: All test sets were segmented with the proposed approach. Out of the 111 lymph nodes, 40 cases (36%) required computer-aided refinement of initial segmentation results. The refinement typically required 10 s per lymph node. The mean and standard deviation of the Dice coefficient for final segmentations was 0.847 ± 0.061, 0.836 ± 0.058, and 0.809 ± 0.070 for test sets I, II, and II, respectively. The average signed surface distance error was 0.023 ± 0.171, 0.394 ± 0.189, and 0.001 ± 0.146 mm for test sets I, II, and II, respectively. The time required for locating the approximate center point of a target lymph node in a scan, generating an initial OSF segmentation, and refining the segmentation, if needed, is typically less than one minute.
Conclusions: Segmentation of lymph nodes in volumetric CT images is a challenging task due to partial volume effects, nearby strong edges, neighboring structures with similar intensity profiles and potentially inhomogeneous density of lymph nodes. The presented approach addresses many of these obstacles. In the majority of cases investigated, the initial segmentation method delivered results that did not require further processing. In addition, the computer-aided segmentation refinement framework was found to be effective in dealing with potentially occurring segmentation errors.
Keywords: lymph node segmentation, optimal surface finding
INTRODUCTION
The lymphatic system of the human body is a component of the immune system that plays an important role in dealing with viruses, bacteria, and other illnesses. In medicine, the term lymphadenomegaly is used to describe an abnormal lymph node that is not palpable like in the mediastinum or abdomen. It is well known that critical causes of lymphadenomegaly include cancer, sarcoidosis, or fungal infections such as histoplasmosis.
Multidetector computed tomography (MDCT) has become the primary lymph node imaging modality in clinical routine and offers good spatial resolution for measuring lymph nodes.1 Assessment of lymph nodes is required for monitoring and treatment of diseases that directly or indirectly affect the lymphatic system (e.g., cancer). In current clinical practice, lymph nodes imaged with MDCT are analyzed manually based on measures of long and/or short axis length to assess response to therapy in follow-up examinations. Also, the revised response evaluation criteria in solid tumors (RECIST) 1.1 now explicitly include the assessment of lymph nodes.2 The accuracy and reproducibility of size measurements is critical for determining response in clinical practice and informed research studies.1 However, uni- or bidimensional (length) measurement were found to be subject to inter- and intraobserver variability.3, 4 A study by Fabel et al.5 showed that volumetric analysis of lymph nodes by means of semiautomatic segmentation promises to significantly reduce variability and interobserver bias compared to RECIST. In addition, other applications like biopsy planning or image guided radiation treatment (IGRT) require accurate geometric models (i.e., segmentations) of lymph nodes.
Lymph node segmentation in volumetric CT data is a challenging task due to low contrast to adjacent structures, inhomogeneous density values (e.g., calcification), and high variation in size and shape. Cordes et al.6 utilized a manual segmentation approach for neck lymph node segmentation to aid planning of neck dissections. A fast marching approach for semiautomatic segmentation of lymph nodes in 2D CT data has been reported by Yan et al.7 The authors applied their algorithm on a slice-by-slice basis for the 3D segmentation of lymph nodes in volumetric image data. No quantitative evaluation results were provided. Dornheim et al.8 presented a 3D mass-spring model for the segmentation of neck lymph nodes in CT data. The volumetric segmentation error ranged between 39% and 52%.8 The approach presented by Maleike et al.9 utilized a combination of a deformable 3D surface and statistical shape model for lymph node segmentation. In addition, a tool for manual intervention was provided to help the algorithm in converging to the desired object contours. In 6.9% out of 29 cases, the authors reported problems in getting a “usable segmentation”.9 Lu et al.10 utilized a semiautomatic 2D/3D live-wire-based approach for the segmentation of central chest lymph nodes in MDCT images. Unal et al.11 proposed a semiautomatic method for lymph node segmentation in nanoparticle-enhanced lymphotropic magnetic resonance (MR) images in the context of prostate cancer screening studies. The algorithm is based on coupled ellipse flows and implements a joined segmentation and registration approach. It first propagates in 2D on corresponding MR sequences and extends then to other slices in 3D. In conclusion, the range of existing lymph node segmentation methods for CT images is quite diverse. It is generally accepted that a manual segmentation is too time consuming for routine application. 2D slice-by-slice approaches (e.g., Yan et al.7) tend to introduce inconsistencies between slices. Some of the 3D approaches show a rather large segmentation error (Dornheim et al.8), mainly depend on user interaction (Lu et al.10), or have a manual refinement option that does not allow the user to produce desired results in some cases (Maleike et al.9).
In this paper, we propose a new computer-aided method for 3D lymph node segmentation that consists of a two-stage approach. First, the user identifies the lymph node to be analyzed by specifying the approximate center of the lymph node, and an automated segmentation method is applied to generate a segmentation. This segmentation approach is based on preliminary work published by Wang et al.12 Second, the user inspects the segmentation result, and if needed, can use a method for user-steered and computer-aided refinement of incorrect segmentation results. Both the initial segmentation and refinement method are based on an optimal surface finding (OSF) approach. This has the advantage that segmentation refinement is efficient, because the user directly interacts with a segmentation algorithm during refinement, compared to manually editing the result. In the validation study presented in this paper, the proposed method was applied to 111 lymph nodes of mediastinum, abdomen, head/neck, and axillary regions, and the segmentation error was assessed in addition to the required refinement effort.
METHODS
For lymph node segmentation in volumetric CT data, an OSF approach is utilized. OSF was introduced by Li et al.13, 14 The basic idea behind OSF is to convert a segmentation problem into a graph optimization problem. For this purpose, a graph structure and a cost function need to be defined, which represent the segmentation problem. Based on this information, the OSF algorithm generates a surface that represents the globally optimal solution according to the defined cost function. For more information on OSF-based segmentation, the reader is referred to Li et al.14
The proposed approach avoids common problems in lymph node segmentation by utilizing a cost function that consists of a weighted edge and a region homogeneity term in combination with a surface smoothness constraint (Fig. 1). In addition, the OSF method will also be utilized for refining the initial segmentation result, if needed.
Figure 1.
Comparison of lymph node segmentation methods. (a) Result of an edge-based segmentation approach. Arrows indicate major segmentation errors. (b) Result generated with the proposed method. Major segmentation errors due to nearby strong edges are avoided.
Our method consists of four main processing steps. First, the user is required to provide the approximate center point (voxel) ck of lymph node k to be segmented. Second, a directed spherical graph, whose center is located at ck, is constructed, so that the local region around the lymph node is transformed into a graph-based representation. Then the segmentation problem is formulated as an optimization (graph search) problem.14 We define a cost function that enables us to segment lymph nodes and to avoid common shortcomings of existing approaches. Third, the optimization problem is solved to produce a segmentation. Fourth, the user inspects the segmentation result and can correct the segmentation, if needed. In Secs. 2A, 2B, 2C, 2D, 2E, we describe our segmentation approach in detail.
Preprocessing
Lymph nodes have approximately CT densities in a range between –100 and 150 Hounsfield units (HU). Thus, the CT density-values were truncated to this HU range. In addition, all CT scans were resampled to isotropic voxel size before processing, and a 3 × 3 × 3 voxel median filter was applied to reduce image noise.
Graph construction
A node-weighted directed graph G = (V, E) is generated, with node set V and edge set E, representing a spherical volume of interest VOI(ck) around the approximate center point ck of lymph node k. In order to accomplish this task, a sphere-shaped triangular mesh (Fig. 2) is built around ck with radius r. The radius r is a constant and chosen to be larger than the largest expected radius of lymph nodes. Let nv be the number of vertices of the spherical mesh. For each mesh vertex pi with i ∈ {1, …, nv}, the image volume is sampled along the line between center point ck and vertex pi in an equidistant fashion using linear interpolation. The line between center point ck and vertex pi is denoted as column i.
Figure 2.
2D illustration of the graph generation process required for OSF-based segmentation. For this case, Δ = 1 was used for graph construction.
The discrete sample points define nodes in the graph structure that will be utilized for segmentation. In this context, note that pi represents already a sample point/node, whereas ck is not utilized. The gray-value samples on the line between ck and pi form the elements gi( j) of column i with j ∈ [0, 1, …, (ne − 1)]. The number of elements per column is a constant and denoted as ne, and gi(ne − 1) represents the gray-value density sample at the location of vertex pi. The node vi(j) ∈ V corresponds to gray-value sample point gi( j).
Because the goal of the algorithm is to find the minimum cost surface in the graph, elements that belong to the lymph node boundary should be assigned low cost values. The calculation of ci( j) is described in detail in Sec. 2C. Once the node-weighted directed graph is constructed, it is converted to an edge-weighted directed graph, as described by Li et al.14
Overall, there are ne × nv nodes in the graph. Considerations regarding the selection of sampling parameters nv and ne are as follows. Essentially, both numbers need to be adapted to the resolution of the CT image data. If nv is selected too small relative to image voxel size, the mesh will be too sparse and unable to represent the lymph node's surface accurately. Conversely, if nv is chosen too large, the computation time will increase with no major benefits regarding segmentation accuracy. Similar considerations apply for the selection of ne, the number of sample points along columns.
The neighborhood relation between columns is defined by the mesh structure. If (p, q) is an edge of the triangular mesh, then column p and column q are adjacent. A surface smoothness constraint Δ between any two adjacent columns is utilized to specify the maximum allowable change/difference in surface node locations.14 Figure 2 depicts a 2D example of the graph generation. By constructing the graph G as described above, spherical, elliptical, or slightly kidney-shaped lymph nodes can be segmented. In this context, the selection of an adequate smoothness constraint Δ is of importance for our method. If the smoothness constraint is too small, the surface will not be able to follow the lymph node surface in some cases (e.g., elongated lymph nodes). On the other hand, if the smoothness constrain is too large, the resulting surface can be noisy.
Cost function
For segmentation, the following cost function
| (1) | 
is utilized, which defines costs ci( j) for each node vi(j) ∈ V, where cedgei(j) represents an edge term and cghi(j) a gray-value-based homogeneity cost term. wshape(j) is a global shape weight, which is derived from all the nv columns of the graph structure. It captures the overall edge distribution around the center ck. The constant α is used to adjust the influence of the homogeneity term relative to the edge term. Note that the cost function is formulated such that low costs are assigned to nodes which are assumed to belong to the lymph node surface. In the following paragraphs, we will give a detailed explanation of all the components in the cost function.
A cross-sectional CT image of a lymph node is shown in Fig. 3a. The line in Fig. 3a represents a column of the graph. The corresponding gray-value sample points gi( j) are depicted in Fig. 3b. The gray-value homogeneity term cghi(j) = max a = 0, 1, …, j{gi(a)} − min a = 0, 1, …, j{gi(a)} measures the variation of gray-values along the path from column element gi(0) to gi( j) (Fig. 4).
Figure 3.
Gray-value information utilized for calculating the cost function. (a) Cross-sectional CT image showing a lymph node. The user-selected center ck is indicated by a cross. The line indicates one of the columns utilized to build the graph structure shown in Fig. 2. (b) Gray-value profile corresponding to the column marked in (a).
Figure 4.
Gray-value homogeneity function for the gray-value profile shown in Fig. 3b.
Since the variation of lymph node gray-values is typically in a limited range, the larger the value for cghi(j), the more unlikely it is that the element j of column i belongs to the lymph node.
As mentioned before, lymph nodes and the environment around them can vary considerably, thus an edge term that solely relies on edge magnitude is problematic, as demonstrated by the example shown in Fig. 1a. To avoid this problem, an edge cost function cedgei(j) that identifies potential edge locations, but does not directly utilize edge magnitude information, is generated. First, the derivative of cghi(j) (Fig. 5) is calculated by using a central difference function: Note that values for are only generated for j = 3, 4, …, (ne − 4) to avoid dealing with undefined border values, and a lymph node boundary is unlikely to be located at the beginning or end of a column. Second, all local maxima of are detected and the corresponding locations are stored in the set Λi. Third, the edge term is calculated by using with . The function p( j, a) is used to model uncertainty regarding the exact edge location (Fig. 6).
Figure 5.
Derivative of the gray-value homogeneity function in Fig. 4.
Figure 6.
Edge-based cost function for the gray-value profile in Fig. 3.
The relative importance of possible edge locations is globally estimated by the function , and the corresponding plot is shown in Fig. 7.
Figure 7.
Global shape weight.
The idea behind this approach is as follows. Since the user specifies the approximate center ck of a lymph node, its edges approximately appear in concentric patterns around ck. In contrast, other nearby structures (e.g., vessels) within radius r do not lead to such a consistent pattern. Therefore, it is likely that the weight elements of wshape(j) have larger values in proximity of the real lymph node edge. Thus, wshape(j) is utilized to weight the edge cost function cedgei(j), which helps avoiding problems as shown in Fig. 1a. Finally, Fig. 8 depicts the node costs for the profile/column shown in Fig. 3b.
Figure 8.
Cost function for the column depicted in Fig. 3.
Lymph node segmentation
Once the graph G is generated and all costs ci( j) are calculated, a maximum flow algorithm is used to solve the graph optimization problem,14 which runs in low degree polynomial time. The utilized OSF approach guarantees to produce a globally optimal surface captured by our graph G according to the utilized cost function ci( j). For the representation of the segmentation result, the initial spherical triangle mesh is utilized. The position of vertices along the radial direction are adjusted to the surface position found by graph search. Since no topology changes of the mesh structure are required, a mesh of the segmentation result can be quickly generated. Note that mesh-based representation M of the segmentation results can be converted into a volume-based representation S by using a voxelization method,15 if needed.
Segmentation refinement
As mentioned in Sec. 1, lymph node segmentation is a nontrivial problem and local segmentation errors can occur despite the effort in cost function design. Thus, our approach allows the user to correct errors while keeping the time required for segmentation refinement low. This implies that the user must not be required to delineate all the missed portions of the lymph node boundary in a slice-by-slice fashion to correct a segmentation. Instead, the same OSF segmentation framework introduced above will be utilized to make the error correction process fast and intuitive. Figure 9 illustrates the main idea behind the developed refinement approach. In areas that are incorrectly segmented, the user specifies a point located at the correct lymph node boundary and the algorithm will update/manipulate the segmentation in 3D by identifying the affected surface patch, locally changing the OSF node costs, and rerunning the OSF segmentation. The developed refinement algorithm consists of four major steps, which are described in the following.
Figure 9.
Illustration of the segmentation refinement process. (a) and (c) Initial segmentation results, which are partially incorrect. Surface points provided by the user to refine the segmentation in 3D are indicated by dots. (b) and (d) Segmentation refinement results derived from initial results shown in (a) and (c), respectively.
Step I. Extract gray-value information around the user-selected boundary point.
- 
1.
The user locates a local segmentation error and specifies a point pu on the correct boundary.
 - 
2.
The algorithm searches for the closest node on all columns. The closest node is denoted as , where i* and j* are the column and node of the closest node, respectively. Column i* will be denoted as user selected column.
 - 
3.A gray-value profile around node j* from column i* is extracted, which is represented by a set of gray values:
where nre is an integer which controls the length of the gray-value profile.(2)  
Step II. By utilizing a breadth-first-search on the mesh structure, similar neighboring columns are identified and added to the region set Θ. As we have mentioned before in Sec. 2B, the neighborhood relation between columns is defined by the mesh topology. The search starts at column i* and ends when no more similar columns can be added to Θ or the number of edges on the shortest path from the considered column to the user selected column i* is larger than a constant nmax. The similarity criteria for generating the region set Θ is defined as follows. Let the column i be the column that we want to compare with the user selected column i*. First, a set of gray-value profiles Ω(i, j*) = {S(i, j* − γ), S(i, j* − γ + 1), …, S(i, j* + γ)} for column i is produced, where γ is a variable position offset to take vertical changes of the surface between two columns into account. If γ is selected too large, profiles far away from the user selected lymph node surface point may be found. If selected too small, nearby similar profiles might not be detected due to the local shape changes. Thus, we utilize , which adapts γ based on the dissimilarity between and vi(j*). In this context, η is a constant which is utilized to limit the search range, and d() represents the Euclidean distance. Second, each gray-value profiles S(i, j) ∈ Ω(i, j*) of column i is compared to S(i*, j*) by utilizing the following boolean function:
| (3) | 
where ν is a threshold. If ϕ(S(i*, j*), S(i, j)) is true for one of the gray-value profiles in Ω(i, j*), the column i is considered to be similar to S(i*, j*).
Step III. All the node costs of the user selected column i* and columns in the region set Θ are updated. First, the user selected column i* is updated. The cost of node is set to 0 and all the other nodes of column i* are assigned a large constant cost cmax. Thus, the surface is forced to pass through the user defined point. Second, all the other columns which passed the similarity criteria are updated. For column i passing the similarity criteria, the cost function is calculated by
| (4) | 
where the original global shape weight wshape(j) is replaced by a Gaussian weighting function: , where is determined as follows. If there are local maxima located on column i in the range of [j* − γ, j* + γ], then equals to the corresponding location of the local maximum closest to j*, or else is set to j*.
Step IV. Once the user has specified the point pu, a new segmentation is calculated based on the modified node costs. The new result is displayed immediately after the maximum flow calculation is finished. The refinement process can be repeated, if needed. Note, if the user selected column i*1 in the first refinement process, it is also considered as a similar column in the second refinement process, but the node cost of column i*1 will remain unchanged in Step III of the second refinement iteration. In addition, the developed user interface allows the user to “undo” a refinement step.
EXPERIMENTAL METHODS
In Subsections 3A, 3B, 3C, 3D, the experimental setup as well as the utilized image data, independent reference standard, quantitative indices, and algorithm parameters are described. A physician oversaw image data collection, trained and supervised image analysts, and inspected/approved the independent reference standard. This study was approved by the Institutional Review Board (IRB) at the University of Iowa.
Image data and experimental setup
For validation, two experiments were performed, which are described in Secs. 3A1, 3A2.
Segmentation accuracy analysis
For assessment of segmentation accuracy, 111 lymph nodes of the mediastinum, abdomen, head/neck, and axillary regions were selected from 35 volumetric CT data sets. The majority of the lymph nodes was enlarged. Fourteen of the CT data sets were contrast enhanced. The CT data were divided into three test sets:
Test set 1 consisted of 22 lymph nodes from three CT scans. Scans in test set 1 were from patients with large lung cancer masses. Pixel spacing ranged between 0.62 and 0.67 mm, and the slice thickness was 0.50, 0.60, and 3.00 mm, respectively. The majority of lymph nodes in test set 1 was imaged with approximately isotropic voxel size.
Test set 2 contained 45 lymph nodes from nine CT scans. The scans were taken from cancer patients and were acquired for radiation treatment planning. The spatial resolution for test set 2 is lower compared to test set 1. Pixel spacing was between 0.98 and 1.27 mm, and all scans had a slice thickness of 2.00 mm.
Test set 3 consists of 44 lymph nodes from 23 CT scans from patients with lung nodules. Consequently, the lymph nodes in this set are smaller compared to the other two test sets. The pixel spacing was between 0.54 and 0.81 mm. The slice thickness ranged between 0.63 and 1.00 mm.
Based on the independent reference standard (Sec. 3B), the average lymph node volume in test sets 1–3 was 1056, 1621, and 501 mm3, respectively.
For validation, the user (trained image analyst) was asked to select the center of the lymph node, apply the algorithm, and inspect the segmentation result. If the result had errors, the user was required to refine the segmentation.
Center sensitivity analysis
For the initial segmentation, the center position ck of a lymph node to be analyzed is required. Placing ck at different positions may lead to different segmentation results. Also, the center position selected by different users might vary. Hence, the center sensitivity of our method was analyzed in two different cases (Fig. 10). For the first case (volume 1), an elliptical, slightly kidney shaped lymph node was selected, whose long axis length was 14 mm and short axis length was 10 mm. The voxel size of the data set was 0.6719 × 0.6719 × 0.5 mm. For the second case (volume 2), an approximately spherical lymph node was chosen whose diameter was approximately 10 mm, and the voxel size was 0.6152 × 0.6152 × 0.6 mm.
Figure 10.
Axial images of the image data utilized for center sensitivity analysis. The calculated center point ck is marked by the red cross.
First, the center position ck was calculated from the corresponding independent reference standard (Sec. 3B) for both volumes. Second, the center position was shifted along the x-, y-, and z-axis from the original center ck, respectively, and the proposed segmentation method (without refinement) was applied. Third, the mean unsigned distance error (Sec. 3C) was calculated for each center location.
Independent reference standard
The independent reference was generated by an expert image analyst in a slice-by-slice fashion using a semiautomatic live wire16 segmentation tool. The live-wire method was chosen to reduce the time required by the user for generating the reference segmentations. A potential issue of this approach might be that object boundaries were identified more consistently compared to completely manual segmentation. On the other hand, the user had the option to influence the boundary location by using path-cooling techniques.16 The independent reference was utilized for evaluation of the developed lymph node segmentation method.
Quantitative indices
For segmentation error assessment, surface-based and volume-based measures were used. Let VS and VR be binary volume data sets describing the segmentation result and the reference segmentation, respectively. Corresponding surfaces are denoted as S and R. The distance d(x, A) between a point x and a surface A was defined as . The mean unsigned distance error was determined by evaluating dU(S, R) = [I(S, R) + I(R, S)]/[|S| + |R|] with I(A, B) = ∫x ∈ Ad(x, B)dx. Similarly, the mean signed distance error dS was calculated. Distances are counted positive, if the point of the segmentation result surface is located outside of the boundary of the independent reference standard, negative otherwise. To measure the volume error of segmentations, the Dice coefficient17D(VS, VR) = 2|VS∩VR|/[|VS| + |VR|] was determined.
Parameter selection
For all experiments, the following parameters were utilized for segmentation. For graph construction, r was set to 20 mm to be able to segment larger lymph nodes. The spherical mesh consisted of nv = 642 vertices. The number of elements per column was ne = 60. The selection of nv and ne is based on the expected voxel size of CT images to be segmented as well as considerations presented in Sec. 2B. Based on the selected values for nv and ne, Δ was set to four, which is sufficient to segment ellipsoidal-shaped lymph nodes with a long to short axis ratio of approximately 1.5. Another aspect in selecting Δ is that abnormal/diseased lymph nodes are frequently spherically shaped, whereas normal lymph nodes are usually elongated (i.e., bean-shaped). For cost calculation, α = 0.001 and σ = 1.5 were used. The following parameters were used for the refinement method: nre = 5, nmax = 6, η = 3, ν = 30HU, cmax = 1000, and . All parameters were determined on a separate CT image set.
RESULTS
Segmentation accuracy analysis
Results on the three test data sets for the segmentation without refinement are summarized in Table 1(a). For test set 1 and test set 3, the signed mean border positioning error is close to 0, indicating a low border positioning bias. For test set 2, the signed mean border positioning error is 0.421 mm. The spatial resolution for images in test set 2 is low, compared to test set 1. For volumetric CT data with lower resolution, lymph node boundaries are getting blurred [Fig. 15e] and the segmentation performance is impacted by partial volume effects. In such cases, even the manual segmentation of lymph nodes becomes a difficult task. The unsigned mean border positioning error for all three test sets is less than a voxel. Once the approximate center of a lymph node of interest is known, the initial OSF segmentation can be calculated in less than 1 s on a 3 GHz workstation.
Table 1.
Average and standard deviation of validation results on all 111 test sets.
| Test set 1 | Test set 2 | Test set 3 | |
|---|---|---|---|
| (a) Before refinement | |||
| dS (mm) | 0.027 ± 0.178 | 0.421 ± 0.216 | −0.088 ± 0.217 | 
| dU (mm) | 0.524 ± 0.149 | 0.853 ± 0.176 | 0.572 ± 0.235 | 
| D (–) | 0.845 ± 0.059 | 0.828 ± 0.064 | 0.775 ± 0.082 | 
| (b) After refinement | |||
| dS (mm) | 0.023 ± 0.171 | 0.394 ± 0.189 | 0.001 ± 0.146 | 
| dU (mm) | 0.518 ± 0.146 | 0.824 ± 0.146 | 0.501 ± 0.157 | 
| D (–) | 0.847 ± 0.061 | 0.836 ± 0.058 | 0.809 ± 0.070 | 
Figure 15.
Comparison of segmentation results produced by three experts. (a) Axial CT cross-section showing an enlarged lymph node. (b)–(d) Manual segmentation results generated by experts one to three, respectively. (e) Explanation of image features; arrow A points to a region with edge blurring due to partial volume effects and arrow B points to a vessel in close proximity to the lymph node, which is difficult to discern in this particular image slice. (f)–(h) Segmentations generated by experts one to three, respectively, produced with the proposed segmentation approach. In all cases, only the approximate center was specified and no refinement was performed/required.
The number of lymph nodes that required refinement and the average number of refinement iterations (“clicks”) for each test set is summarized in Table 2. The average user interaction time required for refinement was 10 s per lymph node (range: 4–23 s). Table 1(b) summarizes the validation results after refinement for all 111 lymph node segmentations investigated. For lymph nodes that required refinement, Table 3 shows error indices before and after refinement for all three test sets. Examples of segmentation results are depicted in Figs. 1112. Figure 13 shows examples of segmentations before and after refinement.
Table 2.
Segmentation refinement statistics per test set.
| Number of | Average number | Minimum/maximum | |
|---|---|---|---|
| lymph nodes | of points (“clicks”) | of points | |
| refined | required | required | |
| Test set 1 | 4 | 1.25 | 1/2 | 
| Test set 2 | 16 | 1.62 | 1/2 | 
| Test set 3 | 20 | 1.55 | 1/2 | 
Table 3.
Comparison of validation results before and after refinement for cases in test sets 1 to 3 which required refinement.
| Statistical significance | |||
| Before | After | of improvement | |
| refinement | refinement | after refinement | |
| (a) Test set 1 | |||
| dS (mm) | 0.071 ± 0.209 | 0.044 ± 0.170 | p = 0.7025 | 
| dU (mm) | 0.607 ± 0.050 | 0.571 ± 0.060 | p = 0.2786 | 
| D (–) | 0.883 ± 0.022 | 0.894 ± 0.020 | p = 0.1017 | 
| (b) Test set 2 | |||
| dS (mm) | 0.560 ± 0.233 | 0.485 ± 0.201 | p = 0.0833 | 
| dU (mm) | 1.001 ± 0.166 | 0.920 ± 0.141 | p = 0.0038 | 
| D (–) | 0.796 ± 0.079 | 0.820 ± 0.074 | p = 0.0134 | 
| (c) Test set 3 | |||
| dS (mm) | −0.184 ± 0.257 | 0.011 ± 0.161 | p = 0.0051 | 
| dU (mm) | 0.723 ± 0.245 | 0.566 ± 0.164 | p = 0.0002 | 
| D (–) | 0.736 ± 0.080 | 0.812 ± 0.072 | p = 0.0001 | 
Figure 11.
Comparison between segmentation results generated with the proposed method (without refinement) and independent reference standard.
Figure 12.
Examples of lymph node segmentations generated with the proposed method depicting the variation in size, shape, and texture of lymph nodes. None of the results needed refinement by the user.
Figure 13.
Examples of initial segmentation ((a) and (c)) and refinement results ((b) and (d)).
For the subset of lymph node segmentations that were refined, a Student's t-test at a significance level of 0.05 was performed to determine weather the mean values of dS, dU, and D after refinement were significantly different (improved) than before refinement. For test set 1 (n = 4), the error indices were not found to be significantly different [Table 3(a)]. In the case of test set 2 (n = 16), only the mean signed surface distance error dS was not found to be significantly different, all other indices were significantly different [Table 3(b)]. All indices of test set 3 (n = 20) were found to be significantly different [Table 3(c)].
Center sensitivity analysis
Figure 14 depicts the segmentation error for volume 1 and volume 2 when the center position is shifted along the x-, y-, and z-axis between the lymph node boundaries given by the reference segmentation. The location 0 corresponds to the center of a lymph node derived from the independent standard. Figure 14 shows that the error stays approximately in the same low range close to 0.5 mm around the lymph node center. The segmentation error increased when ck was approaching the lymph node boundary.
Figure 14.
Unsigned mean distance error for volume 1 (a), (c), and (e) and volume 2 (b), (d), and (f) in dependence of center location along the x-, y-, and z-axis, respectively. All distances along the horizontal axes are in mm. The minimum and maximum values on the horizontal axis correspond to boundary locations in the reference segmentation.
DISCUSSION
The comparison between segmentation results (without refinement) and independent standard in Fig. 11 shows mostly local differences despite the variation in shape, size, and imaging conditions. The method successfully deals with the variations in lymph node shape depicted in Fig. 12—none of these examples required segmentation refinement. Traditional edge based segmentation methods fail when there are strong edges around a lymph node [Fig. 1a]. In our method, the influence of neighboring strong edges is reduced [Fig. 1b].
A comparison of results on test sets 1 and 2 (Table 1) shows that for the test set with low resolution CT scans (test set 2), segmentation errors, independent of the used indices, are larger before and after refinement. Differences in segmentation performance between test sets 1 and 3 can mainly be observed for the Dice coefficient, which is impacted by the size of the analyzed lymph nodes (Sec. 3A1).
In our cost function design, we assumed that lymph nodes have a spherical or slightly ellipsoid shape, as is the case for most abnormal lymph nodes. However, as can be seen in Figs. 13a, 13c, lymph nodes can deviate from this assumption, which can cause segmentation errors.
If the initial segmentation contains errors, the user can utilize the developed segmentation refinement approach to correct the segmentation result, which takes only a few seconds of user interaction. Examples for initially incorrect lymph node segmentations and corresponding results after refinement are presented in Figs. 913. In our evaluation on 111 lymph nodes, the expert decided to refine 40 cases (36%) of initial results. The segmentation refinement method is efficient; all segmentation performance indices improved (Table 3), and a maximum of two refinement iterations (i.e., mouse clicks) were required (average: 1.55). In addition, the improvement was statistically significant for the majority of calculated indices. The time required for locating a target lymph node in a scan, specifying the center, generating an initial OSF segmentation, and refining the segmentation, if needed, was typically less than one minute. In comparison, the process of generating an independent reference segmentation with the live wire tool in a slice-by-slice fashion required approximately 10 min per lymph node, which is one order of magnitude more compared to our approach.
The developed segmentation refinement method is intuitive to use and allows the user to produce segmentations that can be utilized for assessment of lymph nodes. In addition, the approach has the potential to decrease interobserver variability. For example, Fig. 15 depicts the outcome of an experiment where three expert image analysts were asked to segment a lymph node manually slice-by-slice [Figs. 15b, 15c, 15d] and with the proposed method [Figs. 15f, 15g, 15h]. As can be seen in Fig. 15, manual segmentation results show more variation when compared across experts and to results of the proposed semiautomatic approach. This can be explained by factors like subjective interpretation of image content, choice of gray-value window width and level, blurred object borders due to partial volume effects [arrow A in Fig. 15e], etc., which influence operator performance. Due to a lack of ground truth, it is unclear which of the six segmentation results is closer to “truth”. However, as can be seen in Fig. 15, the proposed method reduces interobserver variability, because it only requires the user to specify the approximate lymph node center point. In addition, because the method works in 3D, the vessel in close proximity to the lymph node [arrow B in Fig. 15e] is correctly excluded in all three semiautomatic segmentations.
Analyzing individual refinement cases leads to the following conclusions. First, letting the user refine a segmentation can introduce operator variability. For example, low resolution scans of lymph nodes will likely introduce a high inter- and intraobserver variability, because it can be difficult for the user to locate the correct lymph node boundary. However, as the experiment depicted in Fig. 15 suggests, variation due to subjective interpretation in manual segmentation will also be higher in such cases. Second, for almost all cases, the average Dice coefficient increases with refinement and has values larger than 0.8. However, for small lymph nodes, the Dice coefficient can remain low, even after refinement, because even a ‘small' segmentation inaccuracy will impact the Dice coefficient more than for larger lymph nodes.
CONCLUSION
The assessment of lymph nodes plays an important role in diagnosis, monitoring, and treatment of diseases, like cancer, sarcoidosis, or histoplasmosis. In this paper, we presented a graph-based approach for the computer-aided 3D segmentation of lymph nodes. Segmentation of lymph nodes in volumetric CT images is a challenging task due to partial volume effects, neighboring structures with similar intensity profiles and potentially inhomogeneous appearance. The presented approach is able to deal with common segmentation problems by utilizing a weighted edge and a region homogeneity term in the developed cost function. In the majority of cases investigated, the initial segmentation method delivered results that did not require further processing. For lymph nodes with irregular shape or large variability in density, the developed refinement method can be utilized to adjust the segmentation result, which takes a few seconds of user interaction. Compared to manual or semiautomated slice-by-slice segmentation approaches, the developed method is fast and offers similar flexibility when dealing with lymph nodes that are difficult to segment. Future work will focus on developing a fully automated lymph node detection method that can be utilized in combination with the proposed segmentation framework.
ACKNOWLEDGMENTS
The authors are grateful to Dr. Geoffrey McLennan for the support of this work. He was involved in data collection, trained and supervised image analysts, and inspected the independent reference standard. This work was supported in part by NIH grant R01-EB004640.
References
- Schwartz L., Bogaerts J., Ford R., Shankar L., Therasse P., Gwyther S., and Eisenhauer E., “Evaluation of lymph nodes with RECIST 1.1,” Eur. J. Cancer 45(2), 261–267 (2009). 10.1016/j.ejca.2008.10.028 [DOI] [PubMed] [Google Scholar]
 - Eisenhauer E., Therasse P., Bogaerts J., Schwartz L., Sargent D., Ford R., Dancey J., Arbuck S., Gwyther S., Mooney M., Rubinstein L., Shankar L., Dodd L., Kaplan R., Lacombe D., and Verwei J., “New response evaluation criteria in solid tumours: Revised RECIST guideline (version 1.1),” Eur. J. Cancer 45(2), 228–247 (2009). 10.1016/j.ejca.2008.10.026 [DOI] [PubMed] [Google Scholar]
 - Schwartz L. H., Mazumdar M., Brown W., Smith A., and Panicek D. M., “Variability in response assessment in solid tumors: Effect of number of lesions chosen for measurement,” Clin. Cancer Res. 9(12), 4318–4323 (2003). [PubMed] [Google Scholar]
 - Erasmus J. J., Gladish G. W., Broemeling L., Sabloff B. S., Truong M. T., Herbst R. S., and Munden R. F., “Interobserver and intraobserver variability in measurement of non-small-cell carcinoma lung lesions: Implications for assessment of tumor response,” J. Clin. Oncol. 21(13), 2574–2582 (2003). 10.1200/JCO.2003.01.144 [DOI] [PubMed] [Google Scholar]
 - Fabel M., von Tengg-Kobligk H., Giesel F. L., Bornemann L., Dicken V., Kopp-Schneider A., Moser C., Delorme S., and Kauczor H.-U., “Semi-automated volumetric analysis of lymph node metastases in patients with malignant melanoma stage III/IV-A feasibility study,” Eur. Radiol. 18(6), 1114–1122 (2008). 10.1007/s00330-008-0866-4 [DOI] [PubMed] [Google Scholar]
 - Cordes J., Dornheim J., Preim B., Hertel I., and Strauss G., “Pre-operative segmentation of neck CT datasets for the planning of neck dissections,” Proc. SPIE 6144(2), 6144–6163 (2006). 10.1117/12.654157 [DOI] [Google Scholar]
 - Yan J., Zhuang T., Zhao B., and Schwartz L. H., “Lymph node segmentation from CT images using fast marching method,” Comput. Med. Imaging Graph. 28, 33–38 (2004). 10.1016/j.compmedimag.2003.09.003 [DOI] [PubMed] [Google Scholar]
 - Dornheim J., Seim H., Preim B., Hertel I., and Strauss G., “Segmentation of neck lymph nodes in CT datasets with stable 3D mass-spring models segmentation of neck lymph nodes,” Acad. Radiol. 14(11), 1389–1399 (2007). 10.1016/j.acra.2007.09.001 [DOI] [PubMed] [Google Scholar]
 - Maleike D., Fabel M., Tetzlaff R., von Tengg-Kobligk H., Heimann T., Meinzer H., and Wolf I., “Lymph node segmentation on CT images by a shape model guided deformable surface method,” Proc. SPIE 6914, 69141S-1–69141S-8 (2008). 10.1117/12.770352 [DOI] [Google Scholar]
 - Lu K., Merritt S. A., and Higgins W. E., “Extraction and visualization of the central chest lymph-node stations,” Proc. SPIE 6915, 69151B-1–69151B-15 (2008). 10.1117/12.768691 [DOI] [Google Scholar]
 - Unal G., Slabaugh G., Ess A., Yezzi A., Fang T., Tyan J., Krieg M. R. R., Seethamraju R., Harisinghani M., and Weissleder R., “Semi-automatic lymph node segmentation in LN-MRI,” in IEEE International Conference on Image Processing (ICIP) (IEEE, 2006), pp. 77–80. 10.1109/ICIP.2006.312366 [DOI]
 - Wang Y. and Beichel R., “Graph-based segmentation of lymph nodes in CT data,” in Proceedings of International Symposium on Visual Computing 2010, Lecture Notes in Computer Science Vol. 6454 (2010), pp. 312–321.
 - Li K., Wu X., Chen D., and Sonka M., “Efficient optimal surface detection: Theory, implementation and experimental validation,” Proc. SPIE 5370, 620–627 (2004). 10.1117/12.537048 [DOI] [Google Scholar]
 - Li K., Wu X., Chen D., and Sonka M., “Optimal surface segmentation in volumetric images—A graph-theoretic approach,” IEEE Trans. Pattern Anal. Mach. Intell. 28, 119–134 (2006). 10.1109/TPAMI.2006.19 [DOI] [PMC free article] [PubMed] [Google Scholar]
 - Reitinger B., Bornik A., and Beichel R., “Efficient volume measurement using voxelization,” in Proceedings of the 19th Spring Conference on Computer Graphics (ACM, New York, NY, 2003), pp. 47–54 (ACM).
 - Barrett W. and Mortensen E., “Interactive live-wire boundary extraction,” Med. Image Anal. 1(4), 331–341 (1997). 10.1016/S1361-8415(97)85005-0 [DOI] [PubMed] [Google Scholar]
 - Dice L., “Measures of the amount of ecologic association between species,” Ecology 26(3), 297–302 (1945). 10.2307/1932409 [DOI] [Google Scholar]
 















