Abstract
Forty post-mortem breasts were imaged with a flat-panel based cone-beam x-ray CT system at 50 kVp. The feasibility of breast density quantification has been investigated using standard histogram thresholding and an automatic segmentation method based on the fuzzy c-means algorithm (FCM). The breasts were chemically decomposed into water, lipid, and protein immediately after image acquisition was completed. The percent fibroglandular volume (%FGV) from chemical analysis was used as the gold standard for breast density comparison. Both image-based segmentation techniques showed good precision in breast density quantification with high linear coefficients between the right and left breast of each pair. When comparing with the gold standard using %FGV from chemical analysis, Pearson’s r-values were estimated to be 0.983 and 0.968 for the FCM clustering and the histogram thresholding techniques, respectively. The standard error of the estimate (SEE) was also reduced from 3.92% to 2.45% by applying the automatic clustering technique. The results of the postmortem study suggested that breast tissue can be characterized in terms of water, lipid and protein contents with high accuracy by using chemical analysis, which offers a gold standard for breast density studies comparing different techniques. In the investigated image segmentation techniques, the FCM algorithm had high precision and accuracy in breast density quantification. In comparison to conventional histogram thresholding, it was more efficient and reduced inter-observer variation.
Keywords: breast density, fuzzy c-means, segmentation, cone-beam computed tomography
I. Introduction
Breast density is a compositional measure of the relative amount of parenchymal tissue in the breast. In the basic model, the breast consists of two tissue compartments: glandular and adipose. Breast density is defined as the volumetric percentage of the glandular tissue. There is also a fibrous component in breast tissue that has a similar attenuation to glandular tissue and is therefore included in the glandular compartment of the model. Research has shown that breast density is a moderate to strong independent risk factor for the development of breast cancer. On average, women with high breast density have a four to six-fold increased risk for breast cancer, compared to women who have low breast density.(Boyd et al., 2002; Boyd et al., 2007; Harvey and Bovbjerg, 2004; McCormack and dos Santos Silva, 2006; Phipps et al., 2012; Wolfe, 1976; Yaghjyan et al., 2011) Despite the volume of evidence supporting the importance of breast density in risk analysis, breast density has only been included in a few emerging risk models, and improvements in risk discrimination are already being seen.(Vachon et al., 2007) Quantitative breast density measurements allow for stronger risk predictions on the individual level leading to personalized screening plans that will help to continue reducing breast cancer mortality rates. However, in a 2008 literature review by Jacobi et al.(Jacobi et al., 2008), none of the seven most prevalent risk models included breast density determinations of lifetime risk. This absence is caused by the lack of a generally accepted quantitative measure of breast density.
Mammography is the current standard for breast cancer screening; however, due to its projective nature, quantitative measures of breast density are difficult to implement. The most widely used measure of density is a lexical standard specified in the Breast Imaging Reporting and Data System (BI-RADS) published by the American College of Radiology. Each mammogram is given a rating from 1 to 4 depending on the amount of structural tissue. This category indicates how difficult it is to read the mammogram. Even so, the BI-RADS categories have been used to exhibit a correlation between mammographic density and relative cancer risk.(Vacek and Geller, 2004) Although the BI-RADS specification is useful for identifying patients who would benefit from further screening, its subjective, qualitative nature limits its use in quantitative predictive models. Planimetry, which requires an experienced radiographer to draw tissue boundaries by hand or with a computer, has a much stronger association with risk than categorical measures.(McCormack and dos Santos Silva, 2006) However, quantitative breast density measurements from mammography must overcome a fundamental difficulty. The aforementioned methods calculate a three-dimensional volumetric breast density from two-dimensional images. This requires an accurate model of compressed breast thickness and the knowledge of imaging parameters such as the compression ratio, paddle tilt, and beam energy to a high degree of precision that is difficult to achieve on clinical systems.(Kopans, 2008) The measurement is also dependent on the angle of the projection. Without a gold standard for comparison, it is challenging to prove the accuracy of these complex methods, though there is ongoing research in this area. (Alonzo-Proulx et al., 2003; Pawluczyk et al., 2003) It is more straightforward to use a three-dimensional imaging modality, such as magnetic resonance imaging (MRI) or computed tomography (CT), to quantify volumetric breast density.
Dedicated cone-beam breast CT is a developing modality that has a promising future in breast imaging.(Glick, 2007) Current breast CT systems generally employ a flat panel detector and image the breast in pendant geometry without the need for compression, offering a pain free experience for the patient. The radiation dose of breast CT is now within the FDA approved limits of conventional two-view mammography, and imaging times are on the order of 10–20 seconds.(Boone et al., 2001; Boone et al., 2004b) Phantom and simulation studies have compared the diagnostic capabilities of breast CT to conventional mammography with promising results.(Chen and Ning, 2002b; Gong et al., 2004; Lai et al., 2007; O'Connell et al., 2010; Yang et al., 2007) And an initial clinical study suggests that breast CT is significantly better than mammography for the visualization of soft tissue lesions.(Lindfors et al., 2008) The benefit of breast CT is that overlapping tissue structures do not obscure the image due to the three-dimensional nature of the modality. This allows for more accurate detection of tumors, improved characterization of lesions, and better segmentation of tissues.(Chen and Ning, 2003)
Anatomical noise induced by overlapping tissues can be removed in CT so that volumetric breast density can be quantified with standard segmentation using a single threshold that separates the glandular and adipose tissues. However, the presence of non-linear artifacts, such as beam hardening and scatter, may significantly decrease image quality. In many cases, the glandular and adipose peaks cannot be fully resolved in the histograms of CT images. This leads to uncertainties in threshold selection while using the standard segmentation method. In addition, such uncertainty results in variations between different observers and even the same observer at different times. The inter- and intra-observer differences associated with histogram thresholding reduce the precision of breast density measurements. Another practical aspect is the processing time for breast density quantification. Unlike planar mammography, breast CT generates many images to depict the 3D anatomy of the breast. The reader must view each layer to choose the best threshold values for image segmentation; this process can be time-consuming. It is desirable to have an automatic segmentation method so that breast density can be quantified objectively and efficiently.
In general, clustering algorithms are computerized methods that take a group of objects and assign them to a number of classes based on a specified similarity criterion. For the case of analyzing grayscale images, the algorithm classifies pixels with like gray values into distinct clusters allowing for the separation of different materials by their attenuation properties. The simplest segmentation algorithm is k-means clustering. Mathematically, for a given set of N pixels p = (p1,p2, …, pN), k-means clustering separates the data into k sets S = {S1,.S2, …, Sk} with cluster centers c = (c1,c2,…,ck) in a way that minimizes deviations from the cluster centers. An improved clustering algorithm, fuzzy c-means (FCM) clustering, allows each pixel to be assigned to each of the k classes to varying extents. Since image pixels have finite size, it is likely that multiple tissues are present within a voxel so that the gray value depends on the amount of each tissue; this is termed the partial volume effect. FCM clustering has been widely used in brain MRI image to segment gray matter from white matter.(Ramathilagam et al., 2011; Kannan et al., 2011; Ahmed et al., 2002; Phillips et a.l, 1995) More recently, it was introduced to assess areal breast density in full-field digital mammography.(Keller et al., 2012) It has been suggested that there is a strong association between FCM algorithm-estimated and radiologist-provided areal breast densities for both raw and processed digital mammograms.(Keller et al., 2012) However, the pixel values in a projection mammogram represent a line integral of the attenuation characteristics of the tissue. Thus, it is difficult to give a unique assignment of either glandular or adipose to each pixel. This problem is overcome when FCM clustering is used on breast CT images. Hence, it is interesting to investigate the precision and accuracy of this simple algorithm in breast density quantification.
One fundamental problem with breast density comparison studies is that there is no objective ground truth for comparison.(Kopans, 2008) Past studies have found good correlation coefficients with trained radiologists' visual assessments or with results from another modality, but these results merely show that current measures of breast density are consistent. As an example, consider comparisons between mammography and breast CT. The volumetric density measurements of breast CT are inherently different from areal measurements of mammography. Because of the superposition of tissue structures, areal density measures tend to be higher than volumetric measures on the BI-RADS scale, indicating that the two methods aren't necessarily measuring the same characteristic of the tissue.(Han et al., 2009) In order to evaluate the quantitative accuracy of any measurement of breast density, there needs to be an objective ground truth value that is independent of the imaging process.
In this study, we used the definitive chemical composition of the breasts, in terms of water, lipid and protein contents, as the objective ground truth. We define percent fibroglandular volume (%FGV) as the sum of the volumetric fractions of water and protein, and use it as the gold standard for breast density comparison.(Shepherd et al., 2011; Ducote et al., 2011b; Ding et al., 2012) The relation to breast density can be seen by noting that removing lipid from the %FGV calculation is similar to removing adipose tissue from the two component model. Hence, %FGV is related to the volumetric fraction of non-fatty tissue i.e., breast density. This study aims to evaluate the accuracy of quantitative measures of breast density using both histogram thresholding and FCM clustering applied to breast CT images of ex-vivo post-mortem breasts by comparing the results of image segmentation to the gold standard %FGV determined from chemical decomposition. The use of a definitive measure of breast tissue composition offers an objective comparison for the different segmentation methods.
II. Methods
II.A. Breast computed tomography
An experimental cone-beam CT system was used in this study. The system consisted of a tungsten target x-ray tube (Dynamax 78E) coupled to a Phillips Optimus M200 x-ray generator and a CsI flat-panel detector (PaxScan 4030CB, Varian Medical Inc., Palo Alto, CA) mounted on an optical bench. A high precision motor (Kollmorgen Goldline DDR D062M Danaher Motion, Wood Dale, IL) provided the rotational mechanism and served as the scanning stage. A TTL-logic signal from the flat panel controlled the timing of the x-ray pulse. The synchronizing of this signal with the x-ray pulse was implemented using a field programmable gate array. The detector had an intrinsic pixel size of 0.194 mm, but 2×2 hardware binning effectively increased the pixel size to 0.384 mm. The flat-panel detector dimensions were 40×30 cm2, corresponding to an image size of 1024 by 768 pixels. The source-to-detector distance and the source-to-object distance were 1.5 m and 1 m, respectively. The flat-panel detector was operated at 7.5 frames per second while the motor was rotated at 1.0 rpm, acquiring 462 frames over 62 seconds for each scan in angular increments of 0.78°. Images were acquired at a beam energy of 50 kVp and a tube current of 2.5 mAs. A pre-filtration of 2 mm Al was used to shape the x-ray spectrum according to the published information from other pre-clinical dedicated breast CT systems.(Ning et al., 2010) The half-value layer of the beam after filtration was 2.2 mm of aluminum. The exposure at 50 kVp was measured with an ion chamber (Model 2026C, Radcal Corporation, Monrovia, CA)calibrated in the energy range between 50 and 140 kVp. The ion chamber was placed at the isocenter in the open beam. The exposure was measured as 4.4 mGy. Results from a previous Monte Carlo simulation were used to estimate the mean glandular dose (MGD) from the measured exposure.(Boone et al., 2004a). For a breast with 30% glandularity, the current imaging technique led to an estimated MGD of 3 mGy.
II.B. Postmortem breast preparation
The dataset for this study consisted of twenty pairs (left and right) of post-mortem breasts obtained from the Willed Body Program in the School of Medicine at the University of California, Irvine. The breasts were surgically removed from the cadavers to the pectoralis major muscle. The breasts ranged in mass from 136 to 2330 g and volume from 135 to 2466 cm3. The breast densities, as estimated by histogram thresholding of the CT images, ranged from 2% to 65%. This dataset was used for imaging studies over a period of three. The breasts were preserved by freezing, but were removed from the freezer and kept at approximately 4°C one day before the CT scan. The breasts were kept at room temperature for twenty minutes to allow for tissue relaxation before being weighed and packed for imaging. To prevent any water loss and to preserve the shape of the breast during the CT scan, all breasts were wrapped in a thin, very low attenuation, plastic film (GLAD cling wrap, The Glad Production Co., Oakland, CA) and placed in a cylindrical Styrofoam mold of approximately 10 cm in diameter. The skin of the breasts was included during imaging and chemical decomposition.
II.C. Image processing and reconstruction
Image acquisition was performed using proprietary software (Varian’s VIVA). The raw data underwent preprocessing: background normalization, veiling glare correction, and a correction for beam hardening.(Seibert and Nalcioglu, 1984, 1985; Ding et al., 2013) All preprocessing procedures were implemented in ImageJ, which is an open source image processing software package.(Abràmoff et al., 2004) X-ray scatter in the detector and its housing causes a glare artifact in the cone beam CT images. We performed glare corrections for the raw images in the projection domain following the methodology reported by Seibert et al.(Seibert and Nalcioglu, 1984, 1985) Two millimeter thick lead disks of various diameters were placed directly on the flat panel detector surface for imaging. The signals behind the lead disks were used to calculate the scatter radial extent and scatter fraction of the glare kernel that were then used to correct for glare. Beam hardening artifacts were corrected with a method based on the previously reported Signal-to-equivalent Thickness Calibration (STC),(Vavrik and Jakubek, 2009) using a set of BR12 slab phantoms (CIRS, Norfolk, VA, USA) which closely simulate the x-ray attenuation properties of breast tissue. A non-linear rational fitting function was applied directly to the projection images to correct the beam hardening artifact. Further details of the beam hardening and glare correction techniques have been reported in our previous study.(Ding et al., 2013)
After pre-processing the raw images, the data was reconstructed using a Feldkamp-based filtered back-projection algorithm from Exxim's COBRA suite (Exxim Computing Corporation, Pleasanton, CA USA). The CT images were reconstructed with a voxel size of 0.3 mm × 0.3mm × 0.3 mm. The voxels in the final CT images were presented as effective x-ray attenuation coefficients.
II.D. Image-based breast density quantification
The 50 kVp breast CT reconstructions were analyzed to quantify the volumetric breast density. Quantification was achieved by separating glandular and adipose tissues based on the gray values of the reconstructed image. The corrections described in section II.C benefitted this procedure by removing artifacts and increasing the contrast between the two tissues. Two algorithms were chosen for comparison: histogram thresholding and FCM clustering.
II.D.1. Histogram thresholding
Histogram thresholding is a standard technique used to segment medical images. By visual inspection of the reconstructed volume and the corresponding histogram, three thresholds were chosen: one to separate the breast from the background, one to separate adipose and glandular tissue, and one to separate glandular tissue from calcium deposits. The numbers of voxels of each tissue type were counted, and volumetric breast density was calculated as the quotient of glandular volume over the total volume of the breast. The forty images were arranged in a random order for a blind comparison study to prevent bias in the investigation of the right and left breast density correlation. Two medical physicists performed this measurement independently so that inter-observer variability could be assessed. The training session for the observers included five pairs of breast images, which had densities spanning a wide range from approximately 10% to 70%. The histogram thresholding process was also completed using ImageJ. (Abràmoff et al., 2004)
II.D.2. Fuzzy C-Means segmentation
The FCM algorithm is a generalization of the “hard” k-means algorithm to fuzzy logic. Instead of partitioning into distinct sets, each pixel is assigned a vector of membership values giving its level of belonging to each cluster. A membership matrix U is generated such that , i.e. the sum of the cluster membership values for any pixel is one. Each value in the membership matrix can be interpreted as giving the amount of tissue from cluster j present in the ith voxel. The pixel was then assigned to the cluster to which it had the highest membership value. The objective function to be minimized in the fuzzy clustering case is(Pal and Bezdek, 1995)
Eq. 1 |
The parameter q determines the “fuzziness” of the segmentation. In the limit of q = 1, the memberships approach an all or nothing, “hard”, partitioning. Following common practice, this parameter was set to two for this experiment.(Pal and Bezdek, 1995) Eq. 1 was minimized through an iterative process where the best guess of the membership function and cluster centers at each step are given by the following:
Eq. 2 |
Eq. 3 |
Most of the images in this study were clustered into five classes although the two largest breasts, with the lowest breast density, required nine clusters. These values were chosen to be the minimum number of clusters needed for successful segmentation as determined by visual assessment. The clusters covered every voxel in the CT volume, including the background, skin, and glandular and adipose tissues. In general, skin was classified into the same cluster as glandular tissue, since the x-ray attenuation coefficients were very similar. After running the FCM algorithm, each cluster was assigned to glandular, adipose, or background to complete the segmentation. From an initial experimental observation, we always assigned only the cluster with highest gray values, i.e. x-ray attenuation coefficients, as glandular tissue for each image. Although the presence of calcifications in some voxels may result in higher gray values than normal glandular tissue, the number of these voxels was too few to form a distinct cluster in the current FCM clustering scheme. The segmentation of the entire breast from the background was accomplished by visually analyzing the separation between cluster centers, since the attenuation of the breast and the packing material were very different. The number of voxels of each tissue were recorded and used to calculate the volumetric breast density.
II.E. Chemical decomposition
After imaging, each post-mortem breast was decomposed into water, lipid, and protein components using chemical procedures to measure the definitive composition of the breast. This data was the gold standard for the breast density comparison study. The skin was included in the chemical decomposition and the image analysis. The process of chemical decomposition was based on a USDA standardized procedure to determine the mass of water, lipid, and protein in organic tissue.(Agriculture, 2009) A flowchart of the decomposition procedure is given in Fig. 1. Each breast was weighed before and after CT imaging. The mass difference was assigned as water loss and was added to the final water mass after chemical decomposition (water correction in Fig. 1). For the decomposition, each postmortem breast was cut into 5 mm × 5 mm × 5 mm sized pieces and baked in a vacuum oven at approximately 95° C in order to evaporate the water. After at least 48 hours in the vacuum oven, the breasts were weighed at 1 hr intervals until the mass no longer changed. The total change in mass here was assumed to be water. The remaining dry material was mixed with pure petroleum ether and then agitated at 30° C for at least an hour to dissolve the lipid content into the ether solvent. After cooling at room temperature for at least 24 hours, the mixture was vacuum filtered through a Buchner funnel and washed with 1 L of additional ether to remove any remaining lipids. The lipid content was then measured by evaporating the petroleum ether under vacuum distillation and weighing the amount of material left behind.
The remaining solid in the filter contained mostly protein with small amounts of mineral and lipid. Further analysis was performed to measure the mass percentages of protein, mineral and lipid in the solid. The material was separated into two portions having approximately the same mass. One of them was analyzed, using procedures adapted from the Handbook of Food Analysis(Nollet, 2004), to determine its protein and mineral contents. The sample was placed in a crucible and baked in a furnace at 550 °C for 18 hours to oxidize all carbon-based compounds. The weight lost during this process was used to calculate the mass percentage of protein in the analyzed solid (protein correction in Fig. 1). The final protein mass of each breast was obtained by applying this percentage to the total mass of the remaining solid after vacuum filtration (protein in Fig. 1), assuming that the protein concentration is homogenous. The Ca content of the remaining solid was determined by removing water soluble minerals.
The other portion of the remaining solid was analyzed to determine the amount of residual fat and membrane bound lipids using a method reported by Bligh and Dyer.(Bligh and Dyer, 1959) The sample was mixed with chloroform, methanol, and water, and placed into a centrifuge for 10 min to separate the protein and lipid sediments. The mass percentage of lipid was calculated from the small amount of lipid remaining after removing the chloroform by heating (lipid correction in Fig. 1). The amount of lipid in the solid was calculated by applying this correction to the total mass remaining after filtration (protein in Fig. 1), and was added into the primary lipid mass from the distillation process. The volumetric fractions of water, lipid, and protein for each breast were then determined from the mass, using densities of 1, 0.924, and 1.35 g/cm3 respectively. The %FGV was calculated as the sum of the volumetric fractions of water and protein. Note that the mineral contents have been accurately determined during this analysis. Therefore, they can be safely excluded from the volumetric percentages of water, lipid, and protein.
II.F. Theoretical relation between percent fibroglandular volume and breast density
In this section, the theoretical relationship between percent fibroglandular volume (%FGV = (W + P) / (W + L + P)) and breast density (BD = G / (G + A)) is discussed. In the two-compartment model, the total breast volume is classified into glandular (G) and adipose (A) fractions, while in the three-compartment compositional model, it is categorized into water (W), lipid (L) and protein (P) fractions. Note that these values are fractions of the whole breast tissue, and their sums are normalized, i.e.
Eq. 4 |
Both glandular and adipose tissue contain water, lipid and protein, but in different relative amounts. Assuming that the volumetric fractions of water, lipid, and protein in pure glandular and pure adipose tissue are constants represented by WG, LG, PG, WA, LA, and PA, the two models can be related by the following three equations:
Eq. 5 |
Due to the normalization conditions shown in Eq. 4, the volumetric breast density can be expressed as a function of %FGV as:
Eq. 6 |
Defining %FGVG = WG + PG and %FGVA = WA + PA which correspond to the %FGV of glandular and adipose tissue respectively, the equation of interest becomes
Eq. 7 |
The limits of this equation are as expected: breast density is equal to unity when the breast is completely glandular (%FGV = %FGVG) and is zero when it is completely adipose (%FGV = %FGVA). Hence, there is a direct linear relationship between the %FGV and breast density. Since all fraction values are positive and glandular tissue generally contains more water and protein than adipose tissue, it can be expected from Eq. 7 that the linear correlation between breast density and %FGV has a positive slope and a negative intercept. Such correlation has been validated in our previous study with ex-vivo bovine tissue samples.(Ducote et al., 2011a) The initial assumption that glandular and adipose tissues have the same composition in all breasts is not exact, as indicated by an early work from tissue chemical analysis.(Woodard, 1986) The uncertainty of the chemical compositions among either glandular or adipose tissue could result in variations in the slope and intercept values in Eq. 7. However, the linear association between the two quantities should remain the same.
II.G. Statistical analysis
To determine the precision of the image segmentation techniques, the volumetric breast density measured from the right breast was plotted with respect to the left breast for each of the techniques. Linear regression and SEE analysis were performed on the data. The accuracy of each algorithm was evaluated through the correlation between the measured volumetric breast density and the %FGV. Finally, the inter-observer variation for the histogram thresholding method has been investigated using two observers and a Bland-Altman plot.
III. Results
III.A Breast tissue chemical decomposition
In this study, we used the composition of the post-mortem breasts obtained from chemical analysis as the gold standard for comparing quantitative breast density measurements. Hence, it is critical to evaluate the accuracy of the chemical decomposition. The experimental error was investigated by comparing the pre-imaging mass of each breast with the total mass of water, lipid, and protein after chemical decomposition. The association of the total mass before imaging and after chemical analysis is shown in figure 2. The best-fit line has a slope of 0.995 (±0.003), and an intercept of 0.003. The Pearson’s linear correlation coefficient, r, is larger than 0.999. A paired t-test was performed with the two sets of masses to confirm the null hypothesis. The p-value was calculated to be 0.988, indicating that there are no systematic differences between the two sets of masses. Furthermore, assuming the initial mass as the ground truth, the %RMS error was calculated for each breast. The averaged %RMS was determined to be 1.27%, which sets the limit for the error level of the chemical decomposition.
In the current compositional model, breast tissue was assumed to have three components: water, lipid, and protein. However, human tissue also contains very small amounts of minerals, such as Ca, Na, and K. In Due to the presence of microcalcifications and arterial calcifications, there might be traceable amounts of calcium in breast tissue. In addition, some types of lipid have low solubility in organic solvents and were still present in the remaining solid material after the filtration procedure. For the above reasons, we analyzed the remaining solid further, as described in the methods section. The results suggested that, on average, the remaining solid contained 94.8% protein, 3.2% minerals, and 1.9% lipid by volume. The results of this analysis were used to adjust the final values for the volumetric fractions of water, lipid and protein and the %FGV. It should be noted that the volumetric fraction of protein in the breast is small. The minerals and lipids remaining in the solid material after the initial separation contributed little to the total volumetric fractions of water, lipid and protein.
The measured volumetric fractions of water, lipid and protein contents for all 40 postmortem breasts are summarized in Table 1. The averaged percentages for water, lipid, and protein components were 34.2%, 59.9%, and 5.8%, respectively. The average %FGV was approximately 40.0%, with a wide range from 14.3% to 73.9%. Statistically, due to the small amount of protein, the mean value of %FGV is close to that of the water fraction, and also coincides with the averaged breast density from recent clinical reports.(Boone et al., 2005; Yaffe et al., 2009)
Table 1.
Components | Median | Mean | Standard Deviation |
Minimum | Maximum |
---|---|---|---|---|---|
Water | 30.4% | 34.2% | 15.1% | 12.3% | 63.1% |
Lipid | 65.4% | 59.9% | 17.8% | 26.0% | 85.7% |
Protein | 4.7% | 5.8% | 2.7% | 1.9% | 11.4% |
%FGV | 34.5% | 40.0% | 17.8% | 14.3% | 73.9% |
BD_Thresh | 23.0% | 26.9% | 15.8% | 1.9% | 63.9% |
BD_FCM | 20.8% | 24.6% | 13.7% | 4.4% | 51.5% |
The whole breast was decomposed into water, lipid, and protein, and, since the chemical compositions of glandular and adipose tissues are very different,(Hammerstein et al., 1979; Woodard and White, 1986) the composition of the whole breast varied largely due to the differences in breast densities. To compare our results with existing values from the literature, one can assume an average breast with a breast density of 25%, and calculate the percentages of the water, lipid, and protein contents from reported values of pure glandular and pure adipose tissues.(Hammerstein et al., 1979; Woodard and White, 1986) Using the data from Woodard and White,(Woodard and White, 1986) the an average breast composition calculated this way was 29% (26%~30%), 63% (60%~67%), and 7% (6~9%) for water, lipid, and protein volume fractions, respectively. The expected ranges for each component are shown in the corresponding parentheses. These results are in good agreement with previously reported results on breast composition.
There is a strong correlation among the fractions of water, lipid and protein since breast tissue is composed of mainly these three materials. Examples of these correlations are presented in figure 3 (a) and (b) for the volumetric fractions of lipid and protein as a function of the water fraction. In both cases, the Pearson’s r is close to unity, indicating the strong linear correlation between the three components. As one would expect, the fraction of lipid decreases as the water fraction increases with a slope of −1.180 (±0.010). The protein fraction has a positive correlation with the water fraction with a slope of 0.173 (±0.009), since they both come primarily from glandular tissue.
III.B Image-based breast density quantification
Examples of the image segmentations of a CT image for the histogram thresholding and the FCM clustering methods are depicted in figure 4. The gray values are correlated with the attenuation coefficient of the material; higher pixel values indicate greater attenuation. The gray values of the clustered images are given by the center of the cluster to which that pixel was assigned. The segmented images are visually similar for the two techniques; however, there is a significant difference between the histograms of the raw and clustered images. There are two main peaks in the raw histogram corresponding to the voxels that are predominantly glandular and adipose. Although the centers of the two peaks are well separated, their tails overlap, which leads to the uncertainty in threshold selection. Conversely, FCM was able to cluster the voxels into distinct classes. The two primary classes shown in the histogram represent glandular and adipose tissues. In this case, the segmentation becomes almost definite. The statistics of the image-based breast density quantifications are summarized in Table 1. The average breast density among all forty breasts measured with histogram thresholding and FCM clustering techniques were estimated to be 26.9% and 24.6%, respectively.
To evaluate precision, we calculated the breast density correlation between the right and left breasts under the assumption that the breast densities are equal for breasts from the same donor. This hypothesis was first tested with the compositional results from chemical analysis. The right and left correlation for the %FGV is plotted in figure 5 for the twenty post-mortem breast pairs. This analysis suggested that there is an excellent correlation between the compositions of the right and left breasts, evidenced by the linear fitting with a slope of 0.949 (±0.044), a Pearson’s r of 0.981, and a SEE of 3.53%.
With that in consideration, we plotted the CT breast density results from the right and left breasts of each pair against each other. The results of these correlations obtained from histogram thresholding and FCM clustering techniques are shown in figure 6 (a) and (b), respectively. The slopes were 0.888 (±0.046) and 0.866 (±0.052) with Pearson’s A coefficients of 0.976 and 0.968 for the histogram thresholding and the FCM clustering techniques, respectively. The SEE were calculated to be 3.39% and 3.35%, respectively. These results indicate that the algorithms are reasonably precise in their measurements of breast density.
Two trained readers were asked to independently segment the images using the standard histogram thresholding technique. To examine the variability of the threshold selection procedure, a Bland-Altman plot was constructed using their breast density results and is presented in figure 7. For each point, the abscissa is the mean of the two readings and the ordinate gives the difference between them. The mean difference between the readings was 1.5% in breast density and the standard deviation about the mean was 5.7%. In general, the difference between the readings increases with breast density. As the fraction of glandular tissue increased, the peaks in the histogram became more convoluted, increasing the difficulty in choosing thresholds.
III.C Correlations between breast density and %FGV
In order to evaluate the accuracy of the image based breast density measurements, we compared the total breast volumes measured in the image domain by each algorithm to those measured from chemical decomposition. The volume correlations are shown in figure 8 (a) and (b) yielding slopes of 0.989 (±0.007) and 1.005 (±0.008) for the histogram thresholding and the FCM clustering techniques, respectively. For both methods, the Pearson’s r was greater than 0.99. Therefore, breast volume measurements from both image-based methods are considered accurate.
From the discussion presented in II.F, the relationship between volumetric breast density and %FGV is expected to be linear. The results of this study support this expectation and the linear regressions for each of the applied methods are depicted in figure 9 (a) and (b). The slopes of the linear fittings were 0.861 (±0.036) and 0.757 (±0.023) for the histogram thresholding and the FCM clustering techniques, respectively. Negative intercepts in the best-fit lines were observed in both cases in agreement with the model. FCM clustering had a stronger correlation with %FGV as evidenced by a Pearson’s A coefficient of 0.983 as opposed to 0.968 from the histogram thresholding method. The SEE for the thresholding and the FCM clustering techniques were calculated to be 3.92% and 2.45%, respectively. In figure 9(c) and (d) the breast densities from both segmentation methods are compared to the water percentage measured during the chemical decomposition. The correlation between breast density and water fraction in the breast was similar to the correlation between breast density and %FGV since glandular tissue is mostly water.
IV. Discussions
One of the main purposes of this study was to evaluate breast density measurements from image-based segmentation algorithms by comparing results to an objective ground-truth. Currently, there is no generally accepted procedure for measuring breast density. Most breast density studies employ a comparison across different imaging modalities, including mammography, breast CT, and breast MRI.(Wei et al., 2004) Image-based visual estimations of breast density from experienced radiologists have also been used in some clinical studies. However, none of these techniques can serve as the gold standard for breast density quantification.(Keller et al., 2012) In this paper, we performed a definitive chemical decomposition of post-mortem breasts, in terms of water, lipid, and protein contents, and used the quantitative %FGV as the gold standard for evaluating breast density. This compositional measurement is entirely independent of the imaging process; hence, it offers an objective standard for the comparison of image-based segmentation techniques. The decomposition can be performed with high accuracy and precision on post-mortem specimens using simple analytical chemistry procedures. Our chemical decomposition study on the 40 post-mortem breasts had an error level of approximately 1%, as discussed in section III.A.
Although chemical decomposition is not possible in clinical studies, this technique is a powerful tool for system calibration and breast density comparison studies, since it offers a gold standard. Current comparison studies across imaging modalities suffer from the bias introduced by the differences in the fundamental physical principles of each modality. In general, each imaging modality has its own scaling equation for breast density quantification. The absolute values of breast density recorded with different modalities may vary significantly and, sometimes, are not directly comparable. If a good calibration between image-based breast density and the tissue compositional analysis can be performed for each modality, it might be possible to convert breast densities measured with different imaging techniques into a standardized tissue composition metric (for example water fraction). This way, a unique parameter can be used in breast cancer risk models.
Alternatively, obtaining a gold standard for breast density quantification for a single modality could serve as the starting point for choosing robust algorithms for each modality. Assuming that an accurate algorithm can be found for breast CT, ongoing studies based on breast CT and mammography can then use this accepted method as a gold standard to evaluate mammographic methods. In this way, quantitative measures of breast density amongst different imaging modalities could be compared to evaluate their effectiveness.
In this study, two image-base segmentation techniques were investigated, histogram thresholding and FCM clustering. Thresholding has been an accepted technique for breast density measurement by manually segmenting the glandular and adipose tissues. Our result suggested that, in term of breast density quantification, the FCM–based clustering algorithm performed as well as, if not better than, the standard thresholding method in terms of accuracy and precision. An ideal segmentation algorithm would be precise, fast, and automatic. From this point of view, FCM clustering technique may offer great advantage for breast density quantification in future clinical applications.
The speed of the algorithm influences its applicability in both the clinic and research. In the clinic, it is necessary to receive results quickly in order to increase patient throughput. For research, future studies on breast density using three-dimensional imaging modalities may involve large numbers of patient in order to evaluate the relation between a quantitative measure of breast density and cancer risk. Such large cohort studies would benefit greatly from a fast, automatic algorithm. For the methods studied herein, analyses using histogram thresholding took much longer to complete, requiring training and visual inspection of the complete 3D image set of each breast. On the other hand, the FCM algorithm was implemented to run on the histogram of the 16-bit integer image volumes which considerably decreased the computation time. The clustering and was completed in a few seconds for each image. The speed of the FCM algorithm makes it a very good candidate for use in future studies.
Automation of the segmentation is another important consideration. For the algorithms evaluated in this experiment, FCM clustering was semi-automatic, but histogram thresholding was completely manual requiring selection of thresholds to separate the tissues for each image. Analysis of the variance between two readers for thresholding revealed that there is generally good agreement, but some cases had significant differences of over 10% in breast density. The FCM algorithm changed a continuous selection on the part of the reader into a discrete one. Comparing the histograms in figure 4, it is much simpler to select clusters for FCM than for thresholding. Therefore, the variation induced by observers can be eliminated with FCM. The human interaction in the FCM method also included determining the number of clusters to use for segmentation. The use of five clusters was chosen as being the minimum required for successful segmentation. However, in the case of the two largest breasts, with the lowest breast density, nine clusters were required to segment out the glandular tissue. With fewer clusters, the glandular tissue would not be assigned to its own cluster. Instead the “glandular” cluster became non-localized and included regions of glandular tissue throughout the adipose regions of the breast. The use of additional clusters would further segment the edge pixels of the breast since this is where the gray values varied the most. These additional clusters contained a small number of pixels and hence did not impact the overall breast density significantly with their inclusion or exclusion. Hence, inter-observer variation only has a minimal impact on the FCM segmentation.
The clustering algorithm used in this study may be further improved by making the selection of the number of clusters and image segmentation completely automatic. A fully automatic FCM-based image segmentation technique has been recently reported by Keller et. al. for mammography.(Keller et al., 2012) A fully automated algorithm is more ideal by removing human judgment from the calculations completely. This makes large cohort studies easier to analyze.
In the current study, the post-mortem breasts were imaged at 50 kVp. This tube voltage is much lower than what has been used in conventional clinical CT systems. However, due to the size and the tissue composition of a breast, low tube potential is generally preferable to maximize the contrast between glandular and adipose tissues. The tube voltage was thus selected based on previously reported spectral optimization studies for a dedicated breast CT system.(Chen and Ning, 2002a; Glick et al., 2007; McKinley et al., 2004; Kalender et al., 2012) One limitation is that x-ray scatter within the breast was not corrected in this study. Although there was a large air gap (~ 0.5 m) during imaging, the presence of x-ray scatter may reduce the contrast between the glandular and the adipose tissues, especially for larger breasts. This leads to greater uncertainty in breast density quantification, and contributes partially to the SEE for both segmentation techniques. An algorithm to correct for the body scatter is currently under investigation, so that the accuracy of the breast density quantification with cone-beam breast CT can be further improved.
V. Conclusions
Twenty pairs of post-mortem breasts were imaged with a bench-top cone beam CT flat panel based system at 50 kVp. Volumetric breast densities were quantified with two image-based segmentation methods based on the histogram thresholding and FCM clustering techniques. The breasts were chemically decomposed into water, lipid and protein components soon after imaging. The percent fibroglandular volume determined from the decomposition was used as the gold standard for breast density comparison. Both segmentation methods showed high precision in breast density quantification, evidenced by the good correlation between the right and left breast measurements. There is also a strong linear correlation for the volumetric breast densities obtained from both methods with respect to the tissue compositional metrics, %FGV and water fraction, determined from chemical analysis. The FCM-based image segmentation technique has shown equal or better precision and accuracy in breast density quantification as compared to the histogram thresholding method. The FCM algorithm was also much faster and eliminated the need for a trained reader. A fully automated FCM algorithm has great potential in large cohort breast density studies in breast CT, as it can offer a precise, time-efficient, and objective segmentation. Once an accepted quantitative measure has been found, breast density could be incorporated into assessments of cancer risk to improve their predictive power.
Acknowledgements
This research was supported in part by NIH grant number R01CA13687.
References
- Abràmoff MD, et al. Image processing with ImageJ. Biophontics International. 2004 [Google Scholar]
- Agriculture U S D o. CLG-FAT.03. 2009:1–8. [Google Scholar]
- Ahmed MN, et al. A modified fuzzy C-means algorithm for bias field estimation and segmentation of MRI data. IEEE transactions on medical imaging. 2002;21:193–199. doi: 10.1109/42.996338. [DOI] [PubMed] [Google Scholar]
- Alonzo-Proulx O, et al. Validation of a method for measuring the volumetric breast density from digital mammograms. Phys Med Biol. 2003;55:3027–3044. doi: 10.1088/0031-9155/55/11/003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bligh EG, Dyer WJ. A Rapid Method of Total Lipid Extraction and Purification. Can J Biochem Phys. 1959;37:911–917. doi: 10.1139/o59-099. [DOI] [PubMed] [Google Scholar]
- Boone JM, et al. Technique factors and their relationship to radiation dose in pendant geometry breast CT. Med. Phys. 2005;32:3767–3776. doi: 10.1118/1.2128126. [DOI] [PubMed] [Google Scholar]
- Boone JM, et al. Dedicated Breast CT: Radiation Dose and Image Quality Evaluation. Radiology. 2001;221:657–667. doi: 10.1148/radiol.2213010334. [DOI] [PubMed] [Google Scholar]
- Boone JM, et al. A comprehensive analysis of DgN(CT) coefficients for pendant-geometry cone-beam breast computed tomography. Med. Phys. 2004a;31:226–235. doi: 10.1118/1.1636571. [DOI] [PubMed] [Google Scholar]
- Boone JM, et al. A comprehensive analysis of DgN[sub CT] coefficients for pendant-geometry cone-beam breast computed tomography. Medical Physics. 2004b;31:226. doi: 10.1118/1.1636571. [DOI] [PubMed] [Google Scholar]
- Boyd NF, et al. Heritability of mammographic density, a risk factor for breast cancer. New England Journal of Medicine. 2002;347:886–894. doi: 10.1056/NEJMoa013390. [DOI] [PubMed] [Google Scholar]
- Boyd NF, et al. Mammographic density and the risk and detection of breast cancer. The New England journal of medicine. 2007;356:227–236. doi: 10.1056/NEJMoa062790. [DOI] [PubMed] [Google Scholar]
- Chen B, Ning R. Cone-beam volume CT breast imaging: Feasibility study. Med. Phys. 2002a;29:755–770. doi: 10.1118/1.1461843. [DOI] [PubMed] [Google Scholar]
- Chen B, Ning R. Cone-beam volume CT breast imaging: Feasibility study. Medical Physics. 2002b;29:755. doi: 10.1118/1.1461843. [DOI] [PubMed] [Google Scholar]
- Chen Z, Ning R. Why should breast tumour detection go three dimensional? Physics in medicine and biology. 2003;48:2217–2228. doi: 10.1088/0031-9155/48/14/312. [DOI] [PubMed] [Google Scholar]
- Ding H, et al. Breast composition measurement with a cadmium-zinc-telluride based spectral computed tomography system. Med. Phys. 2012;39:1289–1297. doi: 10.1118/1.3681273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ding H, et al. Measurement of breast tissue composition with dual energy cone-beam computed tomography: A postmortem study. Med. Phys. 2013;40:061902. doi: 10.1118/1.4802734. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ducote JL, et al. Volumetric lean percentage measurement using dual energy mammography. Med. Phys. 2011a;38:4498–4504. doi: 10.1118/1.3605632. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ducote JL, et al. Volumetric lean percentage measurement using dual energy mammography. Medical Physics. 2011b;38:4498–4504. doi: 10.1118/1.3605632. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Glick SJ, Breast CT. . Annual review of biomedical engineering. 2007;9:501–526. doi: 10.1146/annurev.bioeng.9.060906.151924. [DOI] [PubMed] [Google Scholar]
- Glick SJ, et al. Evaluating the impact of X-ray spectral shape on image quality in flat-panel CT breast imaging. Med Phys. 2007;34:5–24. doi: 10.1118/1.2388574. [DOI] [PubMed] [Google Scholar]
- Gong X, et al. Microcalcification detection using cone-beam CT mammography with a flat-panel imager. Physics in Medicine and Biology. 2004;49:2183–2195. doi: 10.1088/0031-9155/49/11/005. [DOI] [PubMed] [Google Scholar]
- Hammerstein GR, et al. ABSORBED RADIATION-DOSE IN MAMMOGRAPHY. Radiology. 1979;130:485–491. doi: 10.1148/130.2.485. [DOI] [PubMed] [Google Scholar]
- Han T, et al. Breast density measurement: 3D cone beam computed tomography (CBCT) images versus 2D digital mammograms. Proceedings of SPIE. 2009;7258 72580L–L-8. [Google Scholar]
- Harvey J, Bovbjerg V. Quantitative Assessment of Mammographic Breast Density: Relationship with Breast Cancer Risk. Radiology. 2004:29–41. doi: 10.1148/radiol.2301020870. [DOI] [PubMed] [Google Scholar]
- Jacobi CE, et al. Differences and similarities in breast cancer risk assessment models in clinical practice: which model to choose? Breast Cancer Research and Treatment. 2008;115:8381–8390. doi: 10.1007/s10549-008-0070-x. [DOI] [PubMed] [Google Scholar]
- Kalender WA, et al. High-resolution spiral CT of the breast at very low dose: concept and feasibility considerations. Eur Radiol. 2012;22:1–8. doi: 10.1007/s00330-011-2169-4. [DOI] [PubMed] [Google Scholar]
- Kannan SR, et al. Effective fuzzy clustering techniques for segmentation of breast MRI. Soft Comput. 2011;15:483–91. [Google Scholar]
- Keller BM, et al. Estimation of breast percent density in raw and processed full field digital mammography images via adaptive fuzzy c-means clustering and support vector machine segmentation. Medical Physics. 2012;39:4903–4917. doi: 10.1118/1.4736530. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kopans D. Basic Physics and Doubts about Relationship between Mammographically Determined Tissue Density and Breast Cancer Risk. Radiology. 2008;246:348–353. doi: 10.1148/radiol.2461070309. [DOI] [PubMed] [Google Scholar]
- Lai C-J, et al. Visibility of microcalcification in cone beam breast CT: Effects of x-ray tube voltage and radiation dose. Medical Physics. 2007;34:2995. doi: 10.1118/1.2745921. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lindfors K, et al. Dedicated Breast CT: Initial Clinical Experience. Radiology. 2008;246:725–733. doi: 10.1148/radiol.2463070410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McCormack Va, dos Santos Silva I. Breast density and parenchymal patterns as markers of breast cancer risk: a meta-analysis. Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology. 2006;15:1159–1169. doi: 10.1158/1055-9965.EPI-06-0034. [DOI] [PubMed] [Google Scholar]
- McKinley RL, et al. Simulation study of a quasi-monochromatic beam for x-ray computed mammotomography. Med Phys. 2004;31:800–813. doi: 10.1118/1.1668371. [DOI] [PubMed] [Google Scholar]
- Ning R, et al. Koning Cone Beam Breast CT for Breast Cancer Detection, Diagnosis and Treatment. American Journal of Clinical Oncology-Cancer Clinical Trials. 2010;33:526–527. [Google Scholar]
- Nollet LML. Handbook of Food Analysis. 2004 [Google Scholar]
- O'Connell A, et al. Cone-beam CT for breast imaging: Radiation dose, breast coverage, and image quality. AJR. American journal of roentgenology. 2010;195:496–509. doi: 10.2214/AJR.08.1017. [DOI] [PubMed] [Google Scholar]
- Pal NR, Bezdek JC. On Cluster Validity for the Fuzzy c-Means Model. IEEE Transactions on Fuzzy Systems. 1995;3 doi: 10.1109/3477.678624. [DOI] [PubMed] [Google Scholar]
- Pawluczyk O, et al. A volumetric method for estimation of breast density on digitized screen-film mammograms. Med Phys. 2003;30:352–364. doi: 10.1118/1.1539038. [DOI] [PubMed] [Google Scholar]
- Phillips WE, et al. Application of Fuzzy C-Means Segmentation Technique for Tissue Differentiation in Mr-Images of a Hemorrhagic Glioblastoma-Multiforme. Magn Reson Imaging. 1995;13:277–290. doi: 10.1016/0730-725x(94)00093-i. [DOI] [PubMed] [Google Scholar]
- Phipps AI, et al. Breast Density, Body Mass Index, and Risk of Tumor Marker-Defined Subtypes of Breast Cancer. Annals of epidemiology. 2012 doi: 10.1016/j.annepidem.2012.02.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ramathilagam S, et al. Modified fuzzy c-means algorithm for segmentation of T1-T2-weighted brain MRI. J Comput Appl Math. 2011;235:1578–1586. [Google Scholar]
- Seibert J, Nalcioglu O. Characterization of the veiling glare PSF in px-ray image intensified fluoroscopy. Medical physics. 1984 doi: 10.1118/1.595494. [DOI] [PubMed] [Google Scholar]
- Seibert J, Nalcioglu O. Removal of image intensifier veiling glare by mathematical deconvolution techniques. Medical physics. 1985 doi: 10.1118/1.595720. [DOI] [PubMed] [Google Scholar]
- Shepherd Ja, et al. Volume of mammographic density and risk of breast cancer. Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology. 2011;20:1473–1482. doi: 10.1158/1055-9965.EPI-10-1150. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vacek PM, Geller BM. A Prospective Study of Breast Cancer Risk Using Routine Mammographic Breast Density Measurements. Cancer Epidemiology, Biomarkers & Prevention. 2004;13:715–722. [PubMed] [Google Scholar]
- Vachon CM, et al. Mammographic density, breast cancer risk and risk prediction. Breast cancer research BCR. 2007;9:217. doi: 10.1186/bcr1829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vavrik D, Jakubek J. Radiogram enhancement and linearization using the beam hardening correction methodNuclear Instruments & Methods in Physics Research Section a-Accelerators Spectrometers Detectors and Associated. Nuclear Instruments & Methods in Physics Research Section a-Accelerators Spectrometers Detectors and Associated Equipment. 2009;607:212–214. [Google Scholar]
- Wei J, et al. Correlation between mammographic density and volumetric fibroglandular tissue estimated on breast MR images. Medical Physics. 2004;31:933. doi: 10.1118/1.1668512. [DOI] [PubMed] [Google Scholar]
- Wolfe J. Breast patterns as an index of risk for developing breast cancer. American Journal of Roentgenology. 1976;126:1130–1137. doi: 10.2214/ajr.126.6.1130. [DOI] [PubMed] [Google Scholar]
- Woodard H. The composition of body tissues. British journal of radiology. 1986:1209–1218. doi: 10.1259/0007-1285-59-708-1209. [DOI] [PubMed] [Google Scholar]
- Woodard HQ, White DR. The Composition of Body-Tissues. British Journal of Radiology. 1986;59:1209–1219. doi: 10.1259/0007-1285-59-708-1209. [DOI] [PubMed] [Google Scholar]
- Yaffe MJ, et al. The myth of the 50–50 breast. Med. Phys. 2009;36:5437–5443. doi: 10.1118/1.3250863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yaghjyan L, et al. Mammographic breast density and subsequent risk of breast cancer in postmenopausal women according to tumor characteristics. Journal of the National Cancer Institute. 2011;103:1179–1189. doi: 10.1093/jnci/djr225. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang WT, et al. Dedicated cone-beam breast CT: feasibility study with surgical mastectomy specimens. AJR. American journal of roentgenology. 2007;189:1312–1315. doi: 10.2214/AJR.07.2403. [DOI] [PMC free article] [PubMed] [Google Scholar]