Skip to main content
Medical Physics logoLink to Medical Physics
. 2009 Dec 28;37(1):391–401. doi: 10.1118/1.3273062

Dynamic multiple thresholding breast boundary detection algorithm for mammograms

Yi-Ta Wu 1,a), Chuan Zhou 1, Heang-Ping Chan 1, Chintana Paramagul 1, Lubomir M Hadjiiski 1, Caroline Plowden Daly 1, Julie A Douglas 2, Yiheng Zhang 3, Berkman Sahiner 3, Jiazheng Shi 3, Jun Wei 3
PMCID: PMC2809702  PMID: 20175501

Abstract

Purpose: Automated detection of breast boundary is one of the fundamental steps for computer-aided analysis of mammograms. In this study, the authors developed a new dynamic multiple thresholding based breast boundary (MTBB) detection method for digitized mammograms.

Methods: A large data set of 716 screen-film mammograms (442 CC view and 274 MLO view) obtained from consecutive cases of an Institutional Review Board approved project were used. An experienced breast radiologist manually traced the breast boundary on each digitized image using a graphical interface to provide a reference standard. The initial breast boundary (MTBB-Initial) was obtained by dynamically adapting the threshold to the gray level range in local regions of the breast periphery. The initial breast boundary was then refined by using gradient information from horizontal and vertical Sobel filtering to obtain the final breast boundary (MTBB-Final). The accuracy of the breast boundary detection algorithm was evaluated by comparison with the reference standard using three performance metrics: The Hausdorff distance (HDist), the average minimum Euclidean distance (AMinDist), and the area overlap measure (AOM).

Results: In comparison with the authors’ previously developed gradient-based breast boundary (GBB) algorithm, it was found that 68%, 85%, and 94% of images had HDist errors less than 6 pixels (4.8 mm) for GBB, MTBB-Initial, and MTBB-Final, respectively. 89%, 90%, and 96% of images had AMinDist errors less than 1.5 pixels (1.2 mm) for GBB, MTBB-Initial, and MTBB-Final, respectively. 96%, 98%, and 99% of images had AOM values larger than 0.9 for GBB, MTBB-Initial, and MTBB-Final, respectively. The improvement by the MTBB-Final method was statistically significant for all the evaluation measures by the Wilcoxon signed rank test (p<0.0001).

Conclusions: The MTBB approach that combined dynamic multiple thresholding and gradient information provided better performance than the breast boundary detection algorithm that mainly used gradient information.

Keywords: breast boundary detection, computer-aided detection, mammogram, multiple thresholding

INTRODUCTION

It has been shown that computer-aided detection (CAD) for mammography can increase breast cancer detection sensitivity by radiologists both in the laboratory and in clinical practice.1, 2, 3, 4, 5, 6 Automated detection of breast boundary is one of the fundamental steps for computer-aided analysis of mammograms, including detection of breast lesions,7, 8 estimation of breast density,9 prediction and correlation of mass location in multiview mammograms,10, 11, 12 and other image analysis applications.13, 14 Breast boundary determination therefore plays an important role in CAD systems.

The breast region of digitized mammograms generally has lower x-ray exposure, and thus higher brightness, than the background outside the breast. However, the breast region cannot be correctly separated from the background by a single threshold value because of the variation in x-ray exposure along the boundary. In addition, the markers and labels in the image background may be connected to the breast if they are placed too close to the breast or if the breast is large. Most breast boundary detection algorithms9, 15, 16, 17, 18, 19, 20 share a common approach, in which an initial breast area is obtained by analyzing the gray level histogram and the final breast boundary is determined by a subsequent refinement procedure. These algorithms may fail if the initial breast boundary (MTBB-Initial) deviates too far from the true boundary such that the refinement procedure is misled to the wrong direction.

Bick et al.15 classified the image pixels into potential breast pixels and other background pixels by a modified histogram analysis, and the final breast boundary was derived by region growing and morphological filtering. Ojala et al.17 obtained the initial breast boundary by adaptive histogram thresholding and morphological filtering. The inner stroma edge was extracted by the thresholding method of Otsu,21 and the next boundary point was predicted by analyzing the Euclidean distance between the inner stroma edge and the outer initial confirmed portion. Three different smoothing procedures, Fourier transform, snake, and B-splines, were used to obtain the final breast boundary. Wirth and Stapinski16 utilized dual thresholding22 to obtain the initial control points. After performing edge enhancement and noise removal, those control points were input to the snake to obtain the final breast boundary. Ferrari et al.18 determined the initial breast boundary based on the following three steps. First, the image contrast was enhanced using the logarithmic operator. Second, the breast region was thresholded based on the Lloyd–Max quantizer. Third, the binary morphological opening operator was adopted to reduce the noises along the initial breast boundary. In the refinement stage, the final breast boundary was derived by the adaptive active deformable contour model. Sun et al.19 obtained the initial breast boundary by a combination of adaptive thresholding and connected-component analysis, and then determined the initial confirmed portion of the breast boundary by a greedy range selection procedure. They derived the final breast boundary based on the assumption of Euclidean distance constraint between the initial confirmed portion and the stroma edge computed via bimodal histogram analysis. Raba et al.20 determined the initial breast boundary by an adaptive histogram analysis, and then obtained the final boundary by a region growing approach. A fair comparison would require testing the different algorithms with a common data set. Due to the lack of details for some of these published methods, it would be very difficult to implement the methods correctly and compare the accuracy of our and their algorithms in our data set.

We have previously developed a breast boundary detection method by using a gradient-based breast boundary (GBB) technique to search for the breast boundary.9 The image background was estimated initially by searching for the largest peak in the gray level histogram. After excluding the background from the breast region, an initial edge was derived by a line-by-line gradient analysis from the top to the bottom of the image. The tracking of the breast boundary started from approximately the middle of the breast image and moved upward and downward along the initial boundary. The tracked edges were smoothed to remove noisy fluctuations. In most of these methods, the refinement of the breast boundary may fail if the initial breast boundary was too far from the true boundary.

The initial breast boundary therefore plays an important role in a breast boundary detection system. In this study, we developed a new system for automated breast boundary detection which estimated the initial breast boundary based on a dynamic multiple thresholding method instead of histogram analysis. The final breast boundary was tracked along the initial breast boundary with refinement of the edge location by analysis of the gradient information obtained from horizontal and vertical Sobel filtering. The accuracy of the breast boundary detection was evaluated by comparison with an experienced breast radiologist’s manual segmentation. The performance of the new method was also compared to that of our previous method.

MATERIALS AND METHODS

Data sets

A data set of 716 screen-film mammograms (442 CC view and 274 MLO view) were obtained from 288 consecutive cases of an ongoing NIH-supported and Institutional Review Board approved genetic study of breast density in women from the Old Order Amish population of Lancaster County, PA.23 The mammograms were digitized with a Lumiscan laser scanner at a pixel size of 50×50 μm2 and 12 bits∕pixel. The mammograms were first smoothed with a 16×16 pixel box filter and subsampled by a factor of 16, resulting in a pixel size of 800×800 μm2 and an approximate image size of 225×300 pixels. For each mammogram, the breast boundary was manually traced by an experienced Mammography Quality Standards Act (MQSA) radiologist. The radiologist used the windowing function to enhance the breast boundary and outlined the breast boundary with the cursor on a graphical user interface. The radiologist’s segmented boundaries were used as reference standard for performance evaluation of our method.

Methods

In order to improve the performance of the breast boundary detection system, we developed a new dynamic thresholding based method, referred to as the multiple thresholding breast boundary (MTBB) detection method. Breast boundary detection is performed in two stages: Initial breast boundary determination and breast boundary refinement. The detailed description for each stage is presented below.

Initial breast boundary determination

As mentioned earlier, the quality of the initial breast boundary is critically important for breast boundary detection. Figure 1 shows an example of obtaining several initial breast boundaries by the thresholding approach using different threshold levels. It is obvious that a single threshold level cannot properly determine the initial breast boundary. For example, in Fig. 1b, the anterior portion of the initial boundary is closed to the real boundary. However, the boundary encloses a large number of background pixels in the top and bottom areas which will mislead the boundary tracking in the refinement procedure. The relatively high x-ray intensity in these regions is caused by scattered radiation from the chest wall of the patient. Although the background pixels can be removed by selecting a higher threshold level as shown in Figs. 1c, 1d, 1e, 1f, 1g, the anterior portion of the initial boundary will then move too far inside the breast as shown in Figs. 1f, 1g. In order to obtain the initial breast boundary as close as possible to the true boundary, we have designed a dynamic multiple thresholding method that contains two steps, breast boundary candidate search, and initial breast boundary extraction, as discussed below.

Figure 1.

Figure 1

An example of obtaining several initial breast boundaries by the thresholding approach. (a) The original image with gray levels in range [0,4095]; (b)–(g) the initial breast boundaries derived in threshold levels 300, 500, 700, 900, 1100, and 1300, respectively.

Breast boundary candidate search

Figure 2 shows an example of performing the initial breast boundary candidate (IBBC) search procedure when the breast is on the right side of the mammogram. To initiate the IBBC search, a threshold value h on the image histogram is obtained by the method of Otsu.21 Let LMX_h be the leftmost x-coordinate of the IBBC at the threshold level h within the middle section of the image (from about 1∕4 to 3∕4 along the y dimension of the image), as shown in Fig. 2a. A threshold level l will start from a lower value than h, e.g., 0.5∗h, as shown in Fig. 2b and is gradually increased to remove background pixels, guided by a search criterion described below. At a given threshold l, the leftmost x-coordinate LMX_l is determined and a rectangular area having a width of 2∗(LMX_h−LMX_l) and centered at LMX_h is defined in the anterior portion of the breast candidate as shown in Fig. 2c. Note that the width of the rectangular area is decreasing when the threshold l increases, and it will become 0 if the threshold l reaches the high threshold level h obtained by the method of Otsu. Within the rectangle, each column of pixels is searched to determine its intersection with the anterior of the breast region that is above the threshold l. If the pixel column intersects the breast region at more than one contiguous sections, it indicates that the region boundary is not smooth and unlikely to be the breast boundary. The threshold l will then continue to be increased. Figure 2d shows that the threshold l search procedure is completed since only a single continuous breast region intersects with each pixel column at the anterior of IBBC. This final threshold l is considered to be the starting threshold level in the boundary tracking procedure below.

Figure 2.

Figure 2

An example of performing the initial breast boundary candidate search procedure. (a) LMX_h at threshold level h obtained by the Otsu’s method. Although the IBBC seems to be close to the true boundary in the anterior portion, it contains many background pixels in the top and bottom chest wall areas. (b) The search of a proper threshold level l for boundary tracking will start from a lower value than h and is gradually increased to remove background pixels. (c) and (d) illustrate the procedure to search for l. The white rectangles in (c) and (d) are centered at the fixed location LMX_h shown in (b), and have a width of 2∗(LMX_h−LMX_l) for a given threshold l. The threshold level l is increased after (c) since some columns within the rectangular area contain more than one contiguous section, but stops increasing after (d).

Initial breast boundary extraction

Once the IBBC is obtained in the previous step [Fig. 2d], the initial breast boundary can be tracked by evaluating the differences between every two thresholded images at consecutive threshold levels as shown in Fig. 3. Figures 3a, 3b show two thresholded images obtained at two consecutive threshold levels. Figure 3c shows the differences (white pixels) between the two thresholded images. The boundary tracking procedure is performed in two parts, the top and bottom portion of the breast boundary that were separated by the white dashed line shown in Fig. 3d. The location of the line is determined by the vertical coordinate of the leftmost pixel of the IBBC, which may, but not necessarily, be the nipple. For each portion, a new boundary pixel is found in each iteration at the intersection between the central column of the white region and the gray breast region by comparing the two thresholded images. The boundary pixels found in one of the iterations are marked as gray circles in the example shown in Fig. 3d (pointed by the white arrow). Figure 4 shows the iterative search of obtaining the MTBB-Initial.

Figure 3.

Figure 3

An example illustrating the breast boundary tracking procedure. (a) and (b) The thresholded images in two consecutive threshold levels. (c) Comparison of the two thresholded images by overlapping. (d) The new breast boundary points are derived by analyzing the differences of the two thresholded images.

Figure 4.

Figure 4

An example of tracking the initial breast boundary by dynamic multiple thresholding. (a)–(f) show the iterative tracking of the breast boundary in which the white pixels mark the differences between the two consecutive thresholded images, and the light gray circles (pointed out by the arrows) are the boundary points obtained in each iteration of increasing the threshold. (g) The initial breast boundary, MTBB-Initial, is obtained by collecting all the boundary points.

Breast boundary refinement

The refinement procedure of our MTBB detection system tracks the boundary based on the MTBB-Initial and the gradient information. The tracking procedure starts from the same approximately middle point described above [see example in Fig. 3d] and moves upward and downward along the initial boundary. The use of the gradient information in the two directions is schematically shown in Fig. 5. The gradient information was obtained by Sobel filtering in the horizontal and vertical directions. The horizontal gradient information (vertical Sobel) is utilized to track the edges in the ranges between A and B and between A and D. The vertical gradient information (horizontal Sobel) is utilized to track the edges in the ranges between B and C and between D and E. The selection of either the vertical or horizontal gradient is determined based on the slope: If the absolute slope of the tangent to the current boundary position is greater than 1, the horizontal gradient is selected; otherwise, the vertical gradient is selected. An example of a mammogram in Fig. 6a after vertical and horizontal Sobel filtering is shown in Figs. 6b, 6c, respectively.

Figure 5.

Figure 5

Vertical and horizontal Sobel filtering used in the breast boundary refinement.

Figure 6.

Figure 6

An example showing (a) an original mammogram, (b) horizontal, and (c) vertical Sobel gradient information.

The gradient information is one of the references in the refinement procedure; therefore, the tracking procedure will be affected if the gradient information of the breast boundary is distorted by that of artifacts along the boundary area. In order to alleviate this problem, in each step of boundary tracking, three predicted points (Ai, i=2,3,4) based on the gradient information and four derived points (Bj, j=1,2,3,4) based on the initial breast boundary are estimated as candidates for the next boundary point. Figure 7 shows an example of obtaining the seven points for searching the next boundary point after a boundary point p(x,y) is tracked. The two-step strategy for determining the seven points depends on the region where the breast boundary is being tracked. In the regions using the horizontal gradient information (AB and AD in Fig. 5), the y-coordinates for the seven pixels are first calculated by setting a constant pixel spacing d between each pair of the predicted boundary points and between each pair of the four derived points along the initial breast boundary, i.e., the distances between the y-coordinates of the point pairs (p,A2), (p,A3), and (p,A4) are set to be d, 2d, and 3d, respectively, and those between (B1,B2), (B1,B3), and (B1,B4) to be d, 2d, 3d, respectively. The y-coordinate of B1 is the same as that of p(x,y). The points are spaced upward for the upper portion of the breast boundary (AB in Fig. 5) and spaced downward for the lower portion (AD in Fig. 5) from the last tracked point. The x-coordinates of the three predicted boundary points are determined by using the properties of the local horizontal gradient information that the gradient at the boundary is usually high and will gradually decrease as the distance from the boundary increases. The x-coordinates of the four derived points are taken to be the x-coordinates of the initial boundary points at the corresponding y-coordinates. As the boundary tracking proceeds to the regions using the vertical gradient information (BC and DE in Fig. 5), the roles of the x-coordinates and the y-coordinates described above are reversed.

Figure 7.

Figure 7

An example of obtaining the seven subsequent points following the determination of a given tracked boundary point p(x,y). (a) Original mammogram; (b) vertical Sobel filtered image showing the three predicted boundary points (dark gray squares), which fall on the final breast boundary in most cases; (c) original mammogram superimposed with the initial breast boundary and the four derived points (white circles); and (d) demonstration of the seven points along the initial (white) and final (gray) breast boundaries.

Once the seven subsequent pixels are derived after a tracked boundary point p(x,y), two types of slopes will be determined to predict the next boundary point as follows.

Type I Slope: Obtained from the tracked boundary point p(x,y) and three predicted points, i.e., the slopes (angles) between lines pA2¯, pA3¯, and pA4¯ and the horizontal line in Fig. 7b. For example, slope=1 indicates 45° and slope=0 indicates 0°.

Type II Slope: Obtained from the four pixels on the initial breast boundary, i.e., the slopes (angles) between lines B1B2¯, B1B3¯, and B1B4¯ and the horizontal line in Fig. 7c.

The slope between the current tracked boundary point p(x,y) and its previous tracked points, referred to as the “previous slope,” is used as a reference to determine the next boundary point. First, the Type I slopes using the three predicted points Ai (i=2,3,4) are compared to the previous slope. Since the three predicted points are determined based on the gradient information, it is possible that unreasonable predicted points such as markers near the breast boundary are chosen. Let ThS be the slope change threshold. If at least one of the three differences between the Type I slopes and the previous slope is smaller than or equal to ThS, the suitable slope will be determined from Type I slopes according to the minimum difference. Otherwise, if the differences between all three Type I slopes and the previous slope are greater than ThS the abrupt change in slope may indicate the presence of artifacts and we will search for the next boundary point using the Type II slopes. The suitable slope will be considered to be one of the Type II slopes that has the minimum difference from the previous slope. Finally, the coordinate of the next breast boundary point is determined by choosing a constant distance from the current tracked boundary point in the y-coordinate and to calculate the x-coordinate using the “suitable slope” for the regions AB and AD, or correspondingly choosing a constant distance from the x-coordinate followed by calculating the y-coordinate for the regions BC and DE. The constant distances are chosen to be d, 2d, and 3d when selecting the slopes derived from B1B2¯, B1B3¯, and B1B4¯, respectively, which are the same distances as those for setting up the predicted or derived points. If the minimum difference between the Type II slopes and the previous slope is greater than ThS, a slope change in ThS is used to determine the next boundary point to avoid sharp direction change in the breast boundary.

The constant pixel spacing d and the slope change threshold ThS were determined experimentally. The ranges of these parameters were initially estimated by taking into consideration some a priori knowledge of breast boundaries. First, since the slope is changing gradually along the breast boundary, the value of the constant pixel spacing should be large enough to avoid an abrupt slope change. For example, if the pixel spacing is too small, e.g., 1 pixel, the tracking procedure will be sensitive to noise. The slopes will have large fluctuations and the determination of the next boundary point by comparing the slope differences will be unreliable. Second, since the breast boundary is smooth, the value of the constant pixel spacing should be small enough to provide adequate sampling of the boundary curve. For example, if the pixel spacing is set to be too large, e.g., 30 pixels, two problems may arise: One is that the slope along the breast boundary may change by a large amount and the use of the previous slope as a guide may not be meaningful, and the other is that the final breast boundary points will be piecewise linear. Our experiments indicated that a spacing of 3–5 pixels is the best range to perform the refinement procedure when images with 800 μm pixel size are used. In this study, we chose d=3 pixels. Similarly, for the slope change threshold ThS, if it is too small, e.g., 1°, the tracking procedure prefers to find a nearly straight line along the breast boundary, but the true boundary is curved gradually. On the other hand, if the slope threshold is set to be too large, e.g., 45°, the tracking procedure will allow an abrupt angle change and thus may not be able to avoid a marker or label that overlaps with the breast boundary. Our experiments indicate that 10°–20° is the best range, and we chose ThS=0.3 (around 17°) in this study.

Evaluation methods

Two distance measures and one similarity measure are used to compare the automatically detected breast boundary T={t1,t2,⋯,tp}, which contains p points with the experienced radiologist’s manually traced boundaryR={r1,r2,⋯,rq}, which contains q points. More detailed descriptions regarding those measures can be found in our previous study.24 The minimum Euclidean distance MinDist(α,R) between a point α and a curveR={r1,r2,⋯,rq} is defined as

MinDist(α,R)=mini{1,2,,q}αri. (1)

The Hausdorff distance (HDist) (Ref. 25) HDist(T,R) is used to evaluate the maximum distance between the two curves, and it is defined in terms of the directed Hausdorff distance as

HDist(T,R)=max{hDist(T,R),hDist(R,T)} (2)

in which the directed Hausdorff distance hDist(T,R) identifies the maximum among the distances from the points along curve T to curve R,

hDist(T,R)=maxi{1,2,,p}{MinDist(ti,R)}. (3)

Another distance measure, referred to as the average minimum distance AMinDist(T,R) is used to evaluate how much curves T and R are dissimilar on average. AMinDist(T,R) is defined by averaging the MinDist of T to R and the MinDist of R to T

AMinDist(T,R)=i=1pMinDist(ti,R)2p+i=1qMinDist(ri,T)2q. (4)

The distance measures were calculated in units of pixels (1 pixel=0.8 mm) in this study.

The similarity measure used in this study is the area overlap measure (AOM) between two curves enclosing the breast region, which are formed by T or R at the breast boundary, the chest wall, and the image boundary at the top of the pectoral muscle for MLO view, defined as

AOM(T,R)=Area{STSR}Area{STSR}, (5)

where ST and SR are the interior regions enclosed by the two closed curves, respectively.

RESULTS

Breast boundary detection examples

Figures 8910 show examples of the initial and the final breast boundary detection results using our MTBB system. Figure 8 shows an example in which the initial breast boundary is very close to the final boundary. Figure 9 shows an example that has a high intensity marker near the top-left breast area. This is a difficult case since the edge gradient will lead the refinement procedure to the wrong direction. However, by using the initial breast boundary as a guide, the final breast boundary tracked by the MTBB algorithm is close to the true boundary. Figure 10 shows another example containing a low intensity label covering a small section of the anterior breast boundary. Again, the final breast boundary is closer to the true boundary than the initial boundary.

Figure 8.

Figure 8

(a) Original mammogram. (b) The windowed image of (a) to emphasize the breast boundary. (c) The initial (inner gray line) and the final (outer white line) breast boundaries.

Figure 9.

Figure 9

(a) Original mammogram containing a high intensity marker (the LMLO label) near the top-left boundary. (b) The windowed image of (a) to show the breast boundary. (c) The initial (inner gray line) and the final (outer white line) breast boundaries.

Figure 10.

Figure 10

(a) Original mammogram containing a low intensity object (the shadow of a label) covering a small section of the left breast boundary. (b) The windowed image of (a) to show the breast boundary. (c) The initial (inner gray line) and the final (outer white line) breast boundaries.

Performance evaluation

We evaluated the performance of our dynamic MTBB detection method in comparison with the radiologist’s manual segmentation. We also compared the performance of MTBB to our previously developed GBB detection approach.9 Table 1 and Fig. 11 show the MTBB method achieved smaller distance errors and slightly larger AOMs than the GBB method. The P-values of two-tailed Wilcoxon signed rank test on the following three comparisons (GBB vs MTBB-Initial), [GBB vs final breast boundary (MTBB-Final)], and (MTBB-Initial vs MTBB-Final) using the three evaluation measures are shown in Table 2. All comparisons show that the improvement achieved with our newly developed method (MTBB-Final) is statistically significant.

Table 1.

Comparison of automated boundary detection with an experienced radiologist’s manual segmentation for 716 mammograms.

  GBB (%) MTBB-Initial (%) MTBB-Final (%)
Percentage of images having HDist error less than 6 pixels (4.8 mm) 68 85 94
Percentage of images having AMinDist error less than 1.5 pixels (1.2 mm) 89 90 96
Percentage of images having AOM values larger than 0.9 96 98 99

Figure 11.

Figure 11

Cumulative counts of the number of images having the performance measure; (a) Hausdorff distance and (b) average minimum Euclidean distance, less than a given value, and (c) area overlap greater than a given value by comparing the automated breast boundary detection to an MQSA radiologist’s manual segmentation.

Table 2.

The two-tailed P-values of the improvement in the breast boundary accuracy estimated from Wilcoxon signed rank test between pairs of the three methods.

  HDist AMinDist AOM
GBB vs MTBB-Initial <0.0001 <0.0001 0.50
GBB vs MTBB-Final <0.0001 <0.0001 0.0006
MTBB-Initial vs MTBB-Final <0.0001 0.0002 <0.0001

DISCUSSION

The initial breast boundary plays an important role in the breast boundary detection system since it can lead the tracking or refinement procedure to avoid artifacts along the breast boundary. Figure 12 shows an example in which a label is in contact with the nipple region. Figure 12a shows that the GBB was diverted to the edge of the label (gray line), while the new MTBB method successfully tracked the boundary (white line). In this example, the dynamic multiple thresholding method correctly identified the initial breast boundary but the GBB method initially found the edge of the label because of the simple line-by-line search. The subsequent refinement could not correct for this major error because it deviated too far from the true boundary. For the cases without markers or labels along the boundary, the initial breast boundary still serves as an important guide for the tracking procedure, especially in regions where the boundary is closed to the nipple or chest wall. Figure 13 shows an example comparing the GBB method (gray line) and MTBB-Final (white line) results. In order to extract the nipple, the GBB method used a lax criterion to search for the edge while tracking the boundary. However, the lax criterion is sensitive to noise and thus GBB found an outer edge, probably due to scattered radiation, around the nipple region. The strong scattered radiation near the chest wall caused a similar problem. In these situations, the combined information from the Sobel gradients and the initial breast boundary used in the refinement stage of the MTBB method guided the tracking to the correct boundary. These examples demonstrated that the MTBB method can improve the nipple location and shape along the breast boundary. This will likely improve the accuracy of automated nipple detection for multiple view analysis.

Figure 12.

Figure 12

An example, in which a label is connected to the nipple region, showing (a) the GBB (gray line) and MTBB-Final (white line) results, (b) vertical, and (c) horizontal gradient information by Sobel filtering. In comparison to manual segmentation, the AOM, HDist, and AMinDist measures for the GBB boundary were 0.929 and 20.22 pixels, and 1.52 pixels, respectively, and those for the MTBB-Final boundary were 0.976 and 2.45 pixels, and 0.32 pixels, respectively.

Figure 13.

Figure 13

An example showing (a) the GBB (gray line) and MTBB-Final (white line) results, (b) vertical, and (c) horizontal gradient information by Sobel filtering. In comparison to manual segmentation, the AOM, HDist, and AMinDist measures for the GBB boundary were 0.893 and 8.54 pixels, and 1.34 pixels, respectively, and those for the MTBB-Final boundary were 0.969 and 2.24 pixels, and 0.43 pixels, respectively.

Our experiences in breast boundary detection indicate that the initial breast boundary has a strong impact on the overall accuracy of breast boundary detection. The complexity of the refinement procedure will likely depend on the difference between the initial breast boundary and the true boundary, i.e., the farther the initial breast boundary is from the true one, the more complicated techniques and rules may be needed during the refinement process to find the optimal boundary. Many breast boundary detection systems obtain the initial breast boundary by thresholding. However, our study indicated that it is difficult to obtain a reasonable initial boundary by using a simple thresholding approach. In this study, we developed a dynamic multiple thresholding approach to obtain the initial breast boundary. The key steps of our method include (1) the initial accurate determination of the anterior region of the breast region; (2) the gradual increase in the threshold from low to high levels starting from the breast anterior region; (3) limiting the search for initial boundary points only within a small breast peripheral region at each level; and (4) analyzing the differences between two thresholded images at consecutive threshold levels to prevent large change in the boundary direction. The adaptation of the thresholding to the local breast boundary improves the chance that the initial breast boundary is close to the true boundary despite the variation in the x-ray intensity in the breast boundary region. Furthermore, in the refinement process, the search for the next boundary point is guided by the previously tracked breast boundary direction and the comparison of multiple boundary point candidates. Both techniques reduce the chance of tracking into artifacts at the breast boundary. Because only a small fraction of the mammograms will have the problem of artifacts or noise at the breast boundary, the overall improvement in the performance measures over the entire data set by our new breast boundary tracking methods only changed by small fractions. However, this small fraction of problematic cases, for which commonly used boundary detection methods would fail, makes it difficult to fully automate the image analysis process in many CAD systems such as nipple detection, multiview correlation of lesion detection or characterization.12, 26, 27, 28, 29, 30 The improvement achieved by the new method is a step toward full automation of these processes.

The AOM is a commonly used metric for comparison of the agreement between two segmented objects. However, it is well known that AOM does not clearly reveal spurious errors that do not cause a large difference in the segmented object area. For the breast boundary detection problem, one major source of error is the artifacts that cause local deviations in the boundary as demonstrated in the examples in Figs. 101213. We therefore chose the Hausdorff distance to measure this type of errors. The average minimum Euclidian distance (AMinDist) is a commonly used measure of the average distance between two curves that can more specifically than the AOM show the deviation between the boundaries. In addition, the AMinDist provides the average rather than the maximum of the differences and is thus complementary to the Hausdorff distance.

To estimate the intraobserver and interobserver variabilities in the manual segmentation, a subset of 50 CC view and 50 MLO view mammograms was randomly selected from the data set of 716 mammograms. The same radiologist (R1) who provided the reference standard for the entire data set and a second MQSA radiologist (R2) were asked to outline the breast boundaries of the subset more than a year after the first segmentation. The radiologists performed the new segmentation independently without knowledge of the previous segmentations. The similarity measure, AOM, and distance measures AMinDist and HDist, between pairs of the segmentations, denoted by R1(1), R1(2), and R2, were calculated The intraobserver variability was estimated by the three measures between R1(1) and R1(2), and the interobserver variability was estimated by the three measures between R1(1) and R2 or between R1(2) and R2, as shown in Fig. 14. The intraobserver variation was slightly smaller than the interobserver variations; however, the differences in either the interobserver or intraobserver variations did not achieve statistical significance for these two experienced radiologists, as estimated by the two-tailed P-values of the Wilcoxon signed rank test (Table 3). In comparison to the results in Fig. 11, the differences between the current (MTBB) and previous (GBB) methods are much greater than the interobserver and intraobserver variations, especially for the Hausdorff distance that measures the sporadic large deviations from the reference boundary, which was substantially reduced by the MTBB method.

Figure 14.

Figure 14

Cumulative counts of the number of images having the performance measures; (a) Hausdorff distance and (b) average minimum Euclidean distance, less than a given value, and (c) area overlap greater than a given value by comparing the two segmentations R1(1) and R1(2) by the first radiologist, and one segmentation R2 by the second radiologist.

Table 3.

The two-tailed P-values estimated from the Wilcoxon signed rank test on the differences in the breast boundaries between pairs of the R1 and R2 segmentations obtained by the three performance measures. R1(1)=first outline by R1, R1(2)=second outline by R1, R2=outline by R2. The performance measure was calculated relative to each of the radiologist’s segmentations as “reference” shown in column 2.

  Reference HDist AMinDist AOM
R1(2) and R1(1) vs R2 and R1(1) R1(1) 0.09 0.49 0.19
R1(1) and R1(2) vs R2 and R1(2) R1(2) 0.10 0.14 0.052
R1(1) and R2 vs R1(2) and R2 R2 0.97 0.36 0.61

Our algorithms involve several parameters. We determined these parameters empirically by experimenting with small subsets of the available data set. To evaluate the consistency of the algorithm performance in the large data set, we randomly grouped the 716 images by case into two subsets, each with 358 images. We calculated the three performance measures AOM, AMinDist, and HDist, and estimated the significance in their differences between the two subsets using the nonparametric Mann–Whitney test for unpaired data. The two-tailed P-values for the three measures were 0.19, 0.54, and 0.26, respectively. We repeated the same experiment with two other different groupings. The two-tailed P-values were 0.54, 0.07, and 0.15, respectively, for the second grouping and 0.50, 0.42, and 0.61, respectively, for the third grouping. These experiments indicate that there were no significant differences in the performance measures between the random subsets of images. Therefore, although our chosen parameters may not be optimal and the performance was not evaluated in an independent test set, we expect that the parameters used would be reasonably robust because the validation sample size of over 700 used in this study was relatively large. Even if the entire set was used for training, the training performance would approach that of test performance when the training sample size is sufficiently large.31, 32

The breast boundary detection algorithm developed in this study is mainly useful for digitized mammograms. For digital mammograms, the breast laterality and view label is usually shown only on the viewing workstation so that they will not contribute artifacts to breast boundary detection using the digital files. The breast boundaries are also easier to detect, especially in the “for presentation” images. However, screen-film mammography is still commonly used in breast imaging clinics to date and may continue to be a competitive modality in years to come since digital mammography systems are much more expensive and have not been found to be superior to screen-film mammography in all types of breasts.33 Improvement of the CAD methods for screen-film mammography will continue to be an important area of research. Improving the accuracy of breast boundary detection will be the fundamental step in implementing many advanced techniques that can enhance CAD performance.

Although our MTBB algorithm can circumvent the problem of markers or other artifacts overlapping with the breast boundary in most of the cases as demonstrated in Figs. 910, there are still two cases in which our algorithm failed to exclude the object when the high intensity object happened to be the leftmost point in the initial breast boundary, as shown in an example in Fig. 15. In such cases, the refinement procedure would take the object as the starting point and then moved upward and downward along the initial boundary. The marker would therefore not be excluded by the proposed method. We believe this problem may be alleviated either by designing more intelligent criteria on the selection of the starting point, or by performing a postprocessing procedure to identify unusual shape regions along the breast boundary and refine the boundary locally. Further work is underway to reduce these errors.

Figure 15.

Figure 15

(a) Original mammogram containing a high intensity marker. (b) The vertical Sobel filtered image of (a) to show the horizontal gradient information. (c) The initial (inner gray line) and the final (outer white line) breast boundaries. The marker was considered as the starting point for the refinement procedure so that the final result contained the marker.

CONCLUSIONS

Many breast boundary detection systems obtain the breast boundary by first determining an initial boundary which is used to guide the tracking of a final boundary. However, those systems may fail due to errors in the initial boundary. Our MTBB system determines the initial boundary based on dynamic adaptation of the threshold to local regions of the breast periphery. The final boundary is then tracked based on the initial boundary and the gradient information. In comparison to a gradient-based method, the new method reduces the chances that the detected breast boundary would be misled by artifacts and noise along the breast boundary. The new method improved the agreement between the automated detected breast boundary and that manually outlined by an experienced breast radiologist, as estimated by three performance measures, the HDist errors, the AMinDist errors, and the area overlap. Our results demonstrate that the analysis of thresholded images based on dynamic adaptation of threshold levels is a robust approach to the detection of breast boundary.

ACKNOWLEDGMENTS

This work is supported by USPHS Grant No. RO1 CA95153. The genetic study of breast density in the Amish population is supported by USPHS Grant No. R01 CA122844 (P.I. Julie Douglas, Ph.D.). The content of this article does not necessarily reflect the position of the funding agencies and no official endorsement of any equipment and product of any companies mentioned should be inferred.

References

  1. Chan H. P., Doi K., Vyborny C. J., Schmidt R. A., Metz C. E., Lam K. L., Ogura T., Wu Y., and MacMahon H., “Improvement in radiologists’ detection of clustered microcalcifications on mammograms. The potential of computer-aided diagnosis,” Invest. Radiol. 25, 1102–1110 (1990). 10.1097/00004424-199010000-00006 [DOI] [PubMed] [Google Scholar]
  2. Warren Burhenne L. J., Wood S. A., D’Orsi C. J., Feig S. A., Kopans D. B., O’Shaughnessy K. F., Sickles E. A., Tabar L., Vyborny C. J., and Castellino R. A., “Potential contribution of computer-aided detection to the sensitivity of screening mammography,” Radiology 215, 554–562 (2000). [DOI] [PubMed] [Google Scholar]
  3. Freer T. W. and Ulissey M. J., “Screening mammography with computer-aided detection: Prospective study of 12,860 patients in a community breast center,” Radiology 220, 781–786 (2001). 10.1148/radiol.2203001282 [DOI] [PubMed] [Google Scholar]
  4. Brem R. F., Baum J. K., Lechner M., Kaplan S., Souders S., Naul L. G., and Hoffmeister J., “Improvement in sensitivity of screening mammography with computer-aided detection: A multi-institutional trial,” AJR, Am. J. Roentgenol. 181, 687–693 (2003). [DOI] [PubMed] [Google Scholar]
  5. Destounis S. V., DiNitto P., Logan-Young W., Bonaccio E., Zuley M. L., and Willison K. M., “Can computer-aided detection with double reading of screening mammograms help decrease the false-negative rate? Initial experience,” Radiology 232, 578–584 (2004). 10.1148/radiol.2322030034 [DOI] [PubMed] [Google Scholar]
  6. Helvie M. A. et al. , “Sensitivity of noncommercial computer-aided detection system for mammographic breast cancer detection—A pilot clinical trial,” Radiology 231, 208–214 (2004). 10.1148/radiol.2311030429 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Petrick N., Chan H. P., Sahiner B., and Helvie M. A., “Combined adaptive enhancement and region-growing segmentation of breast masses on digitized mammograms,” Med. Phys. 26, 1642–1654 (1999). 10.1118/1.598658 [DOI] [PubMed] [Google Scholar]
  8. Wei J., Sahiner B., Chan H. P., Petrick N., Hadjiiski L. M., and Helvie M. A., “Computer aided diagnosis system for mass detection: Comparison of performance on full-field digital mammograms and digitized film mammograms,” Radiological Society of North America, Program Book, p. 387 (2003).
  9. Zhou C., Chan H. P., Petrick N., Helvie M. A., Goodsitt M. M., Sahiner B., and Hadjiiski L. M., “Computerized image analysis: Estimation of breast density on mammograms,” Med. Phys. 28, 1056–1069 (2001). 10.1118/1.1376640 [DOI] [PubMed] [Google Scholar]
  10. Hadjiiski L. M., Sahiner B., Caoili E. M., Cohan R. H., and Chan H. P., “Automated detection of ureter abnormalities on multidetector row CT urography,” Proc. SPIE 6144, 1W1–1W7 (2006). [Google Scholar]
  11. Sahiner B., Petrick N., Chan H. P., Paquerault S., Helvie M. A., and Hadjiiski L. M., “Recognition of lesion correspondence on two mammographic views—A new method of false-positive reduction for computerized mass detection,” Proc. SPIE 4322, 649–655 (2001). 10.1117/12.431139 [DOI] [Google Scholar]
  12. Paquerault S., Petrick N., Chan H. P., Sahiner B., and Helvie M. A., “Improvement of computerized mass detection on mammograms: Fusion of two-view information,” Med. Phys. 29, 238–247 (2002). 10.1118/1.1446098 [DOI] [PubMed] [Google Scholar]
  13. Zhou C., Chan H. -P., Paramagul C., Roubidoux M. A., Sahiner B., Hadjiiski L. M., and Petrick N., “Computerized nipple identification for multiple image analysis in computer-aided diagnosis,” Med. Phys. 31, 2871–2882 (2004). 10.1118/1.1800713 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Zhang Y., Chan H. -P., Sahiner B., Wu Y. -T., Zhou C., Ge J., Wei J., and Hadjiiski L. M., “Application of boundary detection information in breast tomosynthesis reconstruction,” Med. Phys. 34, 3603–3613 (2007). 10.1118/1.2761968 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Bick U., Giger M. L., Schmidt R. A., Nishikawa R. M., Wolverton D. E., and Doi K., “Automated segmentation of digitized mammograms,” Acad. Radiol. 2, 1–9 (1995). 10.1016/S1076-6332(05)80239-9 [DOI] [PubMed] [Google Scholar]
  16. Wirth M. A. and Stapinski A., “Segmentation of the breast region in mammograms using snakes,” in Proceedings of the First Canadian Conference on Computer and Robot Vision, 2004, pp. 385–392 (unpublished).
  17. Ojala T., Nappi J., and Nevalainen O., “Accurate segmentation of the breast region from digitized mammograms,” Comput. Med. Imaging Graph. 25, 47–59 (2001). 10.1016/S0895-6111(00)00036-7 [DOI] [PubMed] [Google Scholar]
  18. Ferrari R. J., Rangayyan R. M., Desautels J. E. L., Borges R. A., and Frere A. F., “Identification of the breast boundary in mammograms using active contour models,” Med. Biol. Eng. Comput. 42, 201–208 (2004). 10.1007/BF02344632 [DOI] [PubMed] [Google Scholar]
  19. Sun Y., Suri J. S., Desautels J. E. L., and Rangayyan R. M., “A new approach for breast skin-line estimation in mammograms,” Pattern Anal. Appl. 9, 34–47 (2006). 10.1007/s10044-006-0023-0 [DOI] [Google Scholar]
  20. Raba D., Oliver A., Marti J., Peracaula M., and Espunya J., “Breast segmentation with pectoral muscle suppression on digital mammograms,” in Pattern Recognition and Image Analysis, Lecture Notes in Computer Science Vol. 3523 (Springer, Berlin∕Heidelberg, 2005), pp. 471–478. [Google Scholar]
  21. Otsu N., “A threshold selection method from gray-level histograms,” IEEE Trans. Syst. Man Cybern. 9, 62–66 (1979). 10.1109/TSMC.1979.4310076 [DOI] [Google Scholar]
  22. Rosin P. L., “Unimodal thresholding,” Pattern Recogn. 34, 2083–2096 (2001). 10.1016/S0031-3203(00)00136-9 [DOI] [Google Scholar]
  23. Douglas J. A., Roy-Gagnon M. -H., Zhou C., Mitchell B. D., Shuldiner A. R., Chan H. -P., and Helvie M. A., “Mammographic breast density—Evidence for genetic correlations with established breast cancer risk factors,” Cancer Epidemiol. Biomarkers Prev. 17, 3509–3516 (2008). 10.1158/1055-9965.EPI-08-0480 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Sahiner B., Petrick N., Chan H. P., Hadjiiski L. M., Paramagul C., Helvie M. A., and Gurcan M. N., “Computer-aided characterization of mammographic masses: Accuracy of mass segmentation and its effects on characterization,” IEEE Trans. Med. Imaging 20, 1275–1284 (2001). 10.1109/42.974922 [DOI] [PubMed] [Google Scholar]
  25. Huttenlocher D. P., Klanderman G. A., and Rucklidge W. J., “Comparing images using the Hausdorff distance,” IEEE Trans. Pattern Anal. Mach. Intell. 15, 850–863 (1993). 10.1109/34.232073 [DOI] [Google Scholar]
  26. Hadjiiski L., Chan H. P., Sahiner B., Petrick N., and Helvie M. A., “Automated registration of breast lesions in temporal pairs of mammograms for interval change analysis—Local affine transformation for improved localization,” Med. Phys. 28, 1070–1079 (2001). 10.1118/1.1376134 [DOI] [PubMed] [Google Scholar]
  27. Hadjiiski L., Sahiner B., Chan H. P., Petrick N., Helvie M. A., and Gurcan M. N., “Analysis of temporal change of mammographic features: Computer-aided classification of malignant and benign breast masses,” Med. Phys. 28, 2309–2317 (2001). 10.1118/1.1412242 [DOI] [PubMed] [Google Scholar]
  28. Wei J., Sahiner B., Hadjiiski L. M., Chan H. -P., Helvie M. A., Roubidoux M. A., Zhou C., Ge J., and Zhang Y., “Two-view information fusion for improvement of computer-aided detection (CAD) of breast masses on mammograms,” Proc. SPIE 6144, 241–247 (2006). [Google Scholar]
  29. Wu Y. -T., Wei J., Hadjiiski L. M., Sahiner B., Zhou C., Ge J., Shi J., Zhang Y., and Chan H. P., “Bilateral analysis based false positive reduction for computer-aided mass detection,” Med. Phys. 34, 3334–3344 (2007). 10.1118/1.2756612 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Zheng B., Leader J. K., Abrams G. S., Lu A. H., Wallace L. P., Maitz G. S., and Gur D., “Multiview-based computer-aided detection scheme for breast masses,” Med. Phys. 33, 3135–3143 (2006). 10.1118/1.2237476 [DOI] [PubMed] [Google Scholar]
  31. Chan H. P., Sahiner B., Wagner R. F., and Petrick N., “Classifier design for computer-aided diagnosis: Effects of finite sample size on the mean performance of classical and neural network classifiers,” Med. Phys. 26, 2654–2668 (1999). 10.1118/1.598805 [DOI] [PubMed] [Google Scholar]
  32. Sahiner B., Chan H. P., and Hadjiiski L., “Classifier performance prediction for computer-aided diagnosis using a limited data set,” Med. Phys. 35, 1559–1570 (2008). 10.1118/1.2868757 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Pisano E. D., Gatsonis C., Hendrick E., and Yaffe M., “Diagnostic performance of digital versus film mammography for breast-cancer screening,” N. Engl. J. Med. 353, 1773–1783 (2005). 10.1056/NEJMoa052911 [DOI] [PubMed] [Google Scholar]

Articles from Medical Physics are provided here courtesy of American Association of Physicists in Medicine

RESOURCES