Abstract
Ultrasound elastography is increasingly being used alongside mammography for breast cancer diagnosis, particularly in women with radiographically dense breasts. The elastogram-to-B-Mode ratio (E/B), which compares lesion sizes in B-Mode and stiffness images, has been shown to differentiate malignant (E/B >1) from benign (E/B <1) masses. However, the diagnostic utility of E/B has not yet been evaluated using viscosity images, despite viscosity being an emerging biomarker for malignancy. Furthermore, neglecting viscosity can confound elasticity estimation in viscoelastic media like breast tissue. To address this limitation, we used Viscoelastic Response (VisR) ultrasound to assess the diagnostic value of E/B derived from both relative stiffness (RE) and relative viscosity (RV) images. In a blinded reader study, E/B values from viscosity images exhibited similar trends (>1 in malignant lesions and <1 in benign lesions) to those observed in stiffness images. Additionally, E/B values calculated from RE alone, RV alone, or a combination of RE and RV achieved 4–16% higher AUCs for discriminating malignant lesions compared to E/B derived solely from Acoustic Radiation Force Impulse (ARFI) peak displacement. These results suggest that incorporating VisR-derived viscosity and elasticity metrics into E/B calculations could significantly improve diagnostic accuracy for breast cancer detection.
INDEX TERMS: Viscoelastic response, VisR, acoustic radiation force, ARFI, breast cancer diagnosis, E/B ratio, elastography, non-invasive imaging, malignant, benign
I. INTRODUCTION
Breast cancer continues to impose a significant burden on global healthcare, with around 2.3 million new cases diagnosed and 685,000 deaths every year. The number of breast cancer cases is expected to increase by over 40% in the next two decades due to population increase, demographic trends, and increased screening [1]. Fortunately, early detection greatly increases patient survival [2]. Because of this, screening guidelines are frequently revised to balance detecting malignant masses as early as possible while decreasing false positives. Although false positives are less deadly than false negatives, they are still potentially costly and stressful to patients when they result in unnecessary imaging and biopsy. Recent research has shown that by age 40, regular screenings are beneficial and improve the mortality rate compared to the previous guidelines of 45–50 [3]. It is imperative to consider this growing demographic of younger women in the breast cancer screening population when designing imaging protocols.
A related emerging challenge in breast cancer screening and diagnosis is radiographically dense breasts, which are more prevalent in younger women but overall affect 47% of the screening population. Dense, fibrous breast tissue is attenuating to x-rays and results in decreased sensitivity and specificity in detecting lesions through mammography. In these patients, other imaging modalities such as ultrasound and MRI can supplement mammography and improve the accuracy of diagnosing malignant lesions [4]. Ultrasound increases the rate of correct cancer diagnosis in dense breasts compared to mammography alone or mammography with AI detection tools [5]. Ultrasound elastography, which can measure tissue stiffness, further improves the accuracy of diagnosis of malignant lesions in dense breasts. Ultrasound elastography-supplemented mammographic screening had higher specificity, accuracy, and AUC than MRI-supplemented screening in a multi-center trial [6]. In addition, US is widely available, does not require venous access, and is less expensive than MRI.
Ultrasound-derived stiffness is a promising biomarker for breast cancer [7], [8]. However, the most common ultra-sound elastography techniques have drawbacks. Shear wave elastography (SWE) provides quantitative stiffness measurements but requires lateral spatial averaging over millimeters, which can obfuscate small heterogeneous structures. This is a problem for breast cancer diagnosis, as heterogeneities within the tumor or changes at the tumor margin are predictive of malignancy [9]. Strain elastography (SE) qualitatively predicts tissue stiffness based on its displacement after a mechanical stress and does not require the spatial averaging of SWE. It requires either physiological motion or manually applied compression, which can introduce operator dependence. In addition, SE and many SWE techniques ignore tissue viscosity, which is also a biomarker for tumor malignancy [10], [11]. Furthermore, high viscosity can confound stiffness estimates when it is not accounted for in the material model [12].
Our group has developed viscoelastic response (VisR) ultrasound, which separately estimates stiffness (“relative elasticity” or “RE”) and viscosity (“relative viscosity” or “RV”) based on a mass-spring-damper model of tissue displacement [13]. Compared to techniques using manually or physiologically induced tissue displacement, VisR reduces operator dependence by using automated acoustic radiation force impulse (ARFI) pushes. Other advantages of VisR include on-axis estimation of stiffness and viscosity and force delivery directly to the targeted location, which both reduce spatial averaging. The RE and RV parameters are relative to the acoustic radiation force applied to the tissue, which means that RE and RV values should not be compared across images unless forcing conditions are the same. Thankfully, other diagnostically relevant information can be obtained from RE and RV images without the need for equal forcing conditions.
One of the most common elastography-based diagnostic metrics for breast cancer is the elastogram-to-B-Mode ratio (E/B), the ratio of lesion diameter in a stiffness image to lesion diameter in a B-Mode image. In malignant lesions, E/B tends to be >1, whereas in benign lesions it tends to be <1 [14]. Studies have reported high sensitivity and specificity in using E/B to detect malignancy, but results in clinical practice are variable due to differences in technique, interpretation, and ultrasound systems used [15]. Some methods measure lesion diameters in the largest dimension or in a particular orientation like horizontal or vertical [14], [16], [17], [18], [19]. Other methods require that the lesion be measured in the same position in B-Mode and elastogram, but the position is not important [14], [20]. Several studies do not explicitly describe their measurement methods [7], [21], [22]. Clinical implementation of E/B would benefit from clear and objective criteria for lesion measurement as well as more blinded studies to assess reader variability.
The reasons behind E/B separating benign and malignant tumors are not fully understood, but it may be due to changes in mechanical properties in the tumor margin. Malignant lesion boundaries become stiffer due to the desmoplastic response and more viscous due to vascularization and edema [9]. Therefore, it may also be beneficial to evaluate E/B with viscosity images in addition to stiffness images. A preliminary and unblinded analysis from our group found that E/B derived from both VisR RE and RV were significantly different between malignant and benign breast lesions [23].
In this work, we assess the diagnostic benefit of using VisR stiffness and viscosity images to calculate E/B, incorporating two important biomarkers for breast cancer into a quantitative metric. We designed a novel blinded reader study to assess the effectiveness of VisR-derived E/B in comparison to E/B derived from a displacement-based stiffness estimate which does not account for viscosity, ARFI peak displacement (PD). The study design improved upon our previous work by using multiple readers, blinding the readers to information about malignancy, and using objective criteria for lesion measurement. Furthermore, we compared the readers’ individual judgements of lesion measurement criteria with an automated, algorithmic approach to better understand the impact of reader variability. We hypothesize that, in a blinded reader study, E/B derived from VisR RE and RV performs better in a binary classifier model for malignant/benign breast lesion differentiation than E/B derived from ARFI PD.
II. METHODOLOGY
A. CLINICAL IMAGING
Thirty-four women with suspicious breast masses scored as BIRADS 4 or 5 by a radiologist were imaged at our institution under IRB approval (IRB #17–0949). Informed consent was obtained for all participants. A double-push VisR imaging sequence was implemented using a Siemens S3000 scanner and 9L4 probe (Siemens Healthineers, Ultrasound Division, Issaquah, WA, USA). The VisR sequence consisted of two reference pulses, two ARFI excitations separated by 8 tracking pulses, and 43 additional tracking pulses. ARFI excitations were 300 cycles at 4.21 MHz, and tracking pulses were 2 cycles at 6.15 MHz with a pulse repetition frequency of 11.5 KHz. The ARFI excitations used F/1.5 focal configuration. Reference and tracking pulses used F/1.5 focal configuration on transmit and F/0.75 on receive. For each data acquisition, the VisR sequence was repeated in 40 evenly spaced lateral positions across a 2 cm field of view to acquire a 2D image.
Imaging focal depth was adjusted to match the location of each patient’s suspicious mass. B-Mode and VisR acquisitions of each lesion were taken at four 30° increments, with the transducer rotated about the direction normal to the skin surface. The transducer was positioned manually by the sonographer with real-time positioning guidance from a gyroscope. A total of 136 in vivo lesion images were obtained; 88 were benign and 48 were malignant. Imaging took place prior to the clinically indicated biopsy. Needle core biopsy pathology was used to confirm malignancy or benignity.
B. DATA PROCESSING
Normalized cross-correlation was used to estimate tissue axial displacement profiles from the RF data. Maximum value of the displacement profiles over time was taken to obtain peak displacement (PD). The displacement data was fit to the 1-D mass-spring-damper VisR model to obtain the parameters relative elasticity (RE) and relative viscosity (RV) [13]. Depth and elasticity corrections were also applied to RE and RV as in previous work [24]. Finally, the B-Mode images were shown to an experienced radiologist, blinded to information about malignancy, who delineated the lesion boundaries using a custom MATLAB GUI, The MathWorks, Inc., Natick, MA, USA.
C. READER STUDY
Figure 1 shows the custom MATLAB app constructed for the reader study. For each lesion view, the radiologist-annotated B-Mode and unannotated PD, RE, and RV images were displayed side-by-side using the same image resolution (2049 axial × 40 lateral data points), size, and gray colormap. This display scheme was chosen to minimize confounding effects from differing image dimensions, resolution, or colormaps in comparing lesion boundaries between the four image types. The 128 lesion examples were shown to each reader in randomized order and without any malignant/benign labeling.
FIGURE 1.

A custom MATLAB app was used for the blinded reader study. Readers viewed the radiologist annotated B-mode, ARFI PD, and VisR RE and RV images and measured the diameters using a line region of interest (ROI).
Six research engineer readers participated in the study. The readers were ultrasound engineers who were familiar with B-Mode, PD, and VisR RE and RV images, but were not radiologists or breast imaging specialists. One reader was a trained sonographer. Readers received ~20 minutes of training on how to identify lesion boundaries in PD, RE and RV images, use the app, and measure lesion diameters. None of the images used for training were included in the study dataset. Two methods for lesion diameter measurement were tested by each reader using two slightly different versions of the app. First, they went through the entire dataset using the “manual” measurement method, and then again using the “auto” method. For both methods, readers were instructed to measure the diameter at the largest extent of the lesion, as the most contemporary guidelines suggest [15]. Additionally, readers were instructed to mark any PD, RE, or RV image in which the lesion boundaries could not be visually recognized as “unsure” using the app, and these images were excluded from E/B analysis. This exclusion was intended to both estimate the proportion of low-quality images in the dataset and reduce the impact of low-quality images on the results.
In “manual” mode, readers were asked to determine the largest extent of each lesion based on their visual perception of the radiologist-annotated B-Mode image. They drew the diameter at this location using a line region of interest (ROI) tool, and the line was automatically copied to the corresponding location on the PD, RE, and RV images. The line ROI after being copied was fixed such that it could not be translated or rotated, but the endpoints could be manually adjusted on each image to match the perceived lesion boundaries. Since the B-Mode, PD, RE, and RV data were derived from the same raw data, the corresponding locations in the images represent the same underlying anatomy. In “auto” mode, the radiologist-annotated lesion boundary was used to automatically determine the largest dimension of the lesion on B-Mode and place the diameter line ROI there. The line ROI was copied to the PD, RE, and RV images, and endpoints were adjusted by the reader to match lesion boundaries, as in the manual mode.
After the reader study was completed, E/B ratios were calculated from the data as the ratio of PD, RE, or RV diameter length to B-Mode diameter length. The E/B data was used to construct a binary classifier logistic regression model between malignant and benign. Models were made from each reader’s data using PD-, RE-, and RV-derived E/B individually and in combinations. Lesion characteristics (depth, area, circularity, and PD, RE, and RV degree of anisotropy (DoA)) were also derived from images and correlated with E/B and frequency of being marked unsure. Depth, area, and circularity were calculated from the radiologist B-Mode outline. PD, RE, and RV DoA were calculated from the VisR images as in previous work [25].
III. RESULTS
Figure 2 displays B-Mode, PD, RE, and RV images of sample malignant and benign lesions from the dataset. In the benign lesions, E/B is generally <1 and in the malignant lesions, generally >1, as expected. Additionally, the lesion margins of both have contrasting RE and RV compared to surrounding tissue.
FIGURE 2.

Example images of a benign (a-d) and a malignant (e-h) breast lesions, with diameters manually measured by one of the six readers in B-mode (a,e), PD (b,f), RE (c,g), and RV (d,h). In these examples, all E/B ratios are calculated from PD, RE, and RV images were <1 for the benign lesion and>1 for the malignant lesion.
Linear regression model results for classifying malignancy using PD-, RE-, and RV-derived E/B ratio are shown in Figures 3 and 4. Figure 3 shows individual reader AUC for each model using the manual diameter measurement method, and Figure 4 shows equivalent results for the automated measurement method. Model AUC varies substantially between readers but is not significantly different between the manual and auto measurement methods (p>0.05, Wilcoxon Rank Sum). The AUC for RE and RV-derived models is higher than for PD-derived models. Models using E/B from both RE and RV perform the best, with average AUC=0.73 between all readers. Adding PD E/B to the RE and RV model does not further improve AUC. Average number of E/B samples per model across all readers are shown in Tables 1 and 2. Model size is not significantly different (p>0.05, Wilcoxon Rank Sum) between manual and automated methods. Overall, readers had good agreement in the placement of their diameter measurements in both the manual and automated methods, but the length of diameter measured varied between readers for a given image. The best performing readers had significantly different (p<0.05, Wilcoxon Rank Sum) measured E/B between benign and malignant cases in all three image types for both methods.
FIGURE 3.

AUC from logistic regression classifier model based on E/B measured from PD, RE, and RV are shown for the manual diameter measurement method. Individual readers are designated by marker shape.
FIGURE 4.

AUC from logistic regression classifier model based on E/B measured from PD, RE, and RV are shown for the automated diameter measurement method. Individual readers are designated by marker shape.
TABLE 1.
Logistic regression model Avg. # of E/B samples (Manual Meas).
| E/B Parameter | Average | Std. dev. |
|---|---|---|
| Peak Displacement | N=64 | 20 |
| Relative Elasticity | N=65 | 25 |
| Relative Viscosity | N=56 | 22 |
| PD and RE | N=49 | 24 |
| PD and RV | N=51 | 25 |
| RE and RV | N=61 | 26 |
| PD, RE, and RV | N=47 | 26 |
Average number of E/B samples per model and standard deviation from the manual diameter measurement method are shown.
TABLE 2.
Logistic regression model Avg. # of E/B Samples (Auto Meas).
| E/B Parameter | Average | Std. dev. |
|---|---|---|
| Peak Displacement | N=74 | 26 |
| Relative Elasticity | N=68 | 27 |
| Relative Viscosity | N=67 | 27 |
| PD and RE | N=64 | 29 |
| PD and RV | N=62 | 28 |
| RE and RV | N=64 | 26 |
| PD, RE, and RV | N=59 | 26 |
Average number of E/B samples per model and standard deviation from the automated diameter measurement method are shown.
Correlation between E/B and lesion characteristics is shown in Tables 3 and 4. Reader measured E/B generally had very weak correlation (R<0.19) with lesion characteristics, other than area and RE DoA, which had weak (R = 0.20–0.39) negative correlations.
TABLE 3.
Avg. Corr. Coeff. between reader E/B and lesion properties (Manual Meas).
| Lesion Property | PD E/B | RE E/B | RV E/B |
|---|---|---|---|
| Focal Depth | −0.03 | −0.05 | 0.01 |
| Area | −0.24 | −0.23 | −0.21 |
| Circularity | 0.09 | 0.15 | 0.11 |
| PD DoA | −0.05 | 0.01 | 0.10 |
| RE DoA | 0.11 | 0.22 | 0.07 |
| RV DoA | −0.17 | −0.14 | −0.03 |
Pearson correlation coefficient was calculated between E/B and lesion properties and averaged across readers. This table shows correlation coefficients for the manual measurement method.
TABLE 4.
Avg. Corr. Coeff. between reader E/B and lesion properties (Auto Meas).
| Lesion Property | PD E/B | RE E/B | RV E/B |
|---|---|---|---|
| Focal Depth | 0.04 | 0.06 | 0.23 |
| Area | −0.24 | −0.29 | −0.25 |
| Circularity | 0.05 | 0.14 | 0.13 |
| PD DoA | −0.04 | 0.00 | −0.01 |
| RE DoA | 0.22 | 0.29 | 0.16 |
| RV DoA | −0.17 | −0.20 | −0.03 |
Pearson correlation coefficient was calculated between E/B and lesion properties and averaged across readers. This table shows correlation coefficients for the automated measurement method.
The frequency that a given image was marked unsure by readers can be used as a quality metric. Figure 5 shows the PD, RE, and RV images for the lesion that was marked unsure the most frequently by readers, as well as the corresponding B-Mode. The images notably lack visual contrast between the lesion and background. In comparison, a lesion that was marked unsure zero times by the readers has visually identifiable boundaries in PD, RE, and RV, as well as the corresponding B-Mode.
FIGURE 5.

The top row shows PD, RE, and RV images from the lesion most frequently marked unsure by the readers, with corresponding B-Mode. The bottom row shows PD, RE, and RV images with corresponding B-Mode from a lesion that was not marked unsure by any of the readers.
Percentage of lesions marked unsure between the manual and auto measurement methods are shown in Figure 6A. Differences between manual and automated, as well as between PD, RE, and RV, are insignificant (p>0.05, ranksum). On average, 15–25% of the images were marked unsure by each reader, but standard deviation between readers is large (~30%). Individual readers were consistent in number marked unsure between PD, RE, and RV and between manual and auto measurement techniques. Correlation between amount of PD, RE, and RV images marked unsure and AUC of PD-, RE-, and RV-derived E/B models is shown in Figures 6B and 6C. Models derived from PD (0.6–0.7) and RV (0.6–0.9) have moderate correlation with number excluded, while models derived from RE (0.8–0.9) have strong correlation.
FIGURE 6.

In (a), percentage of images marked unsure by the readers in manual and automated measurement trials are shown. Spearman correlation coefficients between percentage of PD, RE, and RV images marked unsure and E/B model AUC are shown in (b) and (c). Individual readers are designated by marker shape.
Proportion of malignant and benign cases compared to frequency of being marked unsure by readers is shown in Figure 7 (a–c). The overall dataset is ~65% benign and ~35% malignant, and this balance is largely maintained even in lesions that were marked unsure by multiple readers. Most lesions were marked unsure fewer than 6 times across all readers and between manual and auto methods. Several were never marked unsure (22% of PD, 15% of RE, and 10% of RV). In RE ~1% of images and in RV ~2% of images were marked unsure every time. Overall, PD images were marked unsure least frequently. Lesion characteristics are also correlated with frequency of being marked unsure in Table 5. Area, circularity, and DoA have negligible (R<0.3) correlation with being marked unsure, but focal depth has mild correlation.
FIGURE 7.

Frequency distributions for number of times each type of image was marked unsure by readers are shown in (a), (b), and (c), with benign images in yellow and malignant in purple.
TABLE 5.
Corr. Coeff. between times marked unsure by all readers and lesion properties.
| Lesion Property | PD Images | RE Images | RV Images |
|---|---|---|---|
| Focal Depth | 0.32 | 0.42 | 0.26 |
| Area | −0.15 | 0.03 | 0.12 |
| Circularity | 0.20 | 0.19 | 0.10 |
| PD DoA | −0.08 | −0.22 | −0.11 |
| RE DoA | 0.37 | 0.21 | 0.27 |
| RV DoA | 0.21 | 0.09 | 0.18 |
Spearman correlation coefficients between lesion characteristics and number of times the image was marked unsure by readers are shown.
Based on the correlation we observed between number of images marked unsure and improved model performance, we assumed that images that were never marked unsure by any of the readers (in either the manual or automated diameter studies) were the highest quality. To investigate the performance of VisR-based E/B in an “optimal” diagnostic scenario of best-quality data, we created another set of models combining all reader data using the never marked unsure images. The manual measurement data was used, as there was no significant improvement in performance from using the automated measurement method. ROC curves for the “optimal” models are shown in Figure 8. As shown in Table 6, E/B based on PD, RE, and RV individually had improved AUC over the across-reader median of the initial reader studies. Further, the combined model of PD, RE, and RV achieved the best AUC of 0.93 as well as highest specificity of 1.00. The RE and RV combined model had the best sensitivity of 0.94.
FIGURE 8.

Reciever operating characteristic (ROC) curves are shown for “optimal” models using only images approved (not marked unsure) by all readers.
TABLE 6.
“Optimal” images model performance.
| E/B Parameter | AUC | Sens. | Spec. | Model Size |
|---|---|---|---|---|
| PD | 0.70 | 0.28 | 0.94 | N=114 |
| RE | 0.74 | 0.46 | 0.85 | N=78 |
| RV | 0.83 | 0.88 | 0.80 | N=54 |
| PD and RE | 0.78 | 0.39 | 0.93 | N=72 |
| PD and RV | 0.87 | 0.89 | 0.80 | N=48 |
| RE and RV | 0.86 | 0.94 | 0.79 | N=42 |
| PD, RE, RV | 0.93 | 0.58 | 1.0 | N=36 |
The performance of logistic regression models using only images approved (not marked unsure) by all readers is shown by AUC, sensitivity, and specificity. Models were tested for PD-, RE-, and RV-derived E/B, and combinations of the three. The number of samples in each model is shown in the bottom row.
IV. DISCUSSION
Our expectation that E/B models derived from VisR RE and RV would outperform those from ARFI PD was validated through higher average AUC. In the initial reader study, when all of the reader-marked images were considered, models with RE- and RV-derived E/B combined performed the best and were not improved by adding PD- derived E/B. When only the “optimal” images that were read by all readers were considered, AUC for PD-, RE-, and RV-derived E/B was improved, and the combination of the three parameters had the best AUC. In malignant lesion margins stiffness, viscosity, or both may be increased compared to surrounding tissue. This is supported by RE- and RV-derived E/B being higher on average in malignant lesions. Both increased viscosity and increased stiffness result in lower peak displacement. Because of this, high viscosity with low stiffness and low viscosity with high stiffness could result in similar PD values. This confounding effect could have caused underestimation of lesion size in some malignant PD images, potentially explaining why PD-derived E/B had lower sensitivity and AUC for detecting malignancy. In contrast, RE and RV may provide greater sensitivity by independently detecting stiffness and viscosity. The RE and RV combined model had the best sensitivity, supporting this idea. However, it is unknown why adding PD to this model improved specificity.
Lesion characteristics like depth, circularity, and mechanical anisotropy had negligible correlation with PD-, RE-, or RV-derived E/B. This is encouraging for clinical use as it indicates that any of the evaluated E/B measurement approaches are robust to different imaging focal depths and lesion shapes. It is also important for E/B to be robust to different levels of tissue anisotropy, as breast tissue can vary between negligibly and significantly anisotropic [25], [26]. Lesion area is potentially impactful on the E/B metric, and this effect should be studied further. A potential cause could be the differing surface area to volume ratio in larger lesions affecting the width of the tumor boundary area where mechanical properties are altered with malignancy.
Reader training is an area for improvement, as variation in model AUC and number of images marked unsure between readers was high. Readers were internally consistent in both model AUC and images marked unsure, indicating differing interpretation of the lesion measurement criteria. The automated method for selecting diameter measurement location may not be relevant to improving model performance, as it did not improve average model AUC. Additionally, diameter measurement location on the lesions was consistent between readers and between the manual and automated methods, indicating that readers were capable of judging the largest dimension of a lesion without assistance. The readers differed substantially in their judgement of the endpoints of lesion diameter lines, which means clarification on how to interpret lesion boundaries would be helpful. For example, creating criteria like including or excluding the outermost edges of the lesion margins in the diameter measurement could improve results. The impact of more specific measurement criteria on reader agreement and model accuracy could be explored in future work. In clinical practice, using multiple reviewers for evaluating E/B from images may help mitigate variability. If an image is highly uncertain (multiple readers are unable to discern the lesion boundaries), reporting guidelines could involve excluding it from E/B calculations or flagging it as low confidence.
Since the number of images marked unsure was moderately to strongly correlated with E/B model AUC in the initial reader study, E/B likely performs better as a diagnostic metric when higher quality lesion images are used. This idea is supported by the improved performance of the “optimal” models. However, this result could be affected by model overfitting on smaller datasets, so a larger study is needed to confirm. Still, the best performing initial reader models were not significantly (p>0.05, Wilcoxon Rank Sum) smaller than the worst performing models. The correlation between model size and AUC was strongest in RE, which could mean that RE image quality is the most impactful in building an accurate diagnostic model with E/B. Overall, PD images were marked unsure the least frequently, so they were likely the highest quality, but also had the least impact on model performance. Average image quality being higher in PD than RE and RV may be due to cases where tissue displacement did not fit the VisR mass-spring-damper model well. This can occur from noisy displacement tracking due to signal attenuation and abberation, motion during data acquisition, or from the tissue exhibiting nonlinear viscoelastic behavior. Lesion area, circularity, and anisotropy were not correlated with being marked unsure by readers, but focal depth was. This could indicate that deeper lesions are more poorly depicted, perhaps due to ARFI excitation and tracking beam attenuation that reduce CNR for PD, RE, and RV images.
The average AUC and sensitivity of the readers’ models were lower than in the unblinded preliminary study [23]. This may be due to lack of reader experience with interpreting B-Mode, PD, and VisR breast lesion images. The lateral sampling of the PD, RE, and RV images was limited to keep image acquisition time low, which may have resulted in many low-quality images. Further studies with VisR could test a higher lateral sampling rate, but this must be balanced to stay within feasibility for clinical imaging. The factors impacting VisR image quality and the effect of image quality on diagnostic value of E/B should be a focus of future work. Finally, a larger dataset would strengthen the conclusions of this work, as the present study was limited by clinical data collection at a single medical center. Class balance between malignant and benign was adequate, but multi-parametric logistic regression models are most statistically robust with >100 samples.
V. CONCLUSION
We assessed the diagnostic benefit of calculating the E/B metric for breast lesions using three on-axis, ARFI-based viscoelastic parameters: peak displacement, relative elasticity, and relative viscosity. In a blinded study of six readers, in vivo breast lesion diameters were measured using both a manual and partially automated method. In logistic regression binary classifier models for malignancy, E/B derived from RE and RV had higher AUC than E/B derived from PD. Stiffness and viscosity are biomarkers for breast cancer separately measured by RE and RV, whereas PD is a stiffness estimate that is confounded by viscosity. Combining E/B from PD, RE, and RV and using the highest-quality VisR images further enhanced accuracy and achieved an AUC of 0.93 for discriminating malignant lesions. Diagnostic accuracy was similar for the manual and automated E/B measurement methods, supporting the established methodology for calculating E/B by measuring in the reader-determined largest lesion dimension. We found low correlation between E/B and lesion characteristics like shape, depth, and tissue anisotropy, supporting the use of VisR-based E/B as a robust clinical metric. Finally, we found a strong correlation between low-quality images being excluded and improved model AUC, but further analysis with a larger dataset would strengthen this conclusion. Future work exploring the use of VisR for breast cancer diagnosis should focus on improving image quality, image quantity, and reader training. Overall, this study shows increased accuracy in breast cancer diagnosis when VisR viscosity and stiffness images are both used to calculate E/B.
ACKNOWLEDGMENT
The authors would like to thank the UNC Hospital Breast Imaging Division and Radiology Department for their assistance with clinical data collection and Siemens Healthineers, Issaquah, WA, USA, for the in-kind support.
This work was supported in part by the National Institutes of Health under Award R01CA281150, Award R01HL092944, and Award R01DK107740; in part by the University of North Carolina at Chapel Hill Lineberger Comprehensive Cancer Center Pilot Development; and in part by the Provost’s Doctoral Fellowship at North Carolina State University (NCSU).
This work involved human subjects in its research. Approval of all ethical and experimental procedures and protocols was granted by the University of North Carolina at Chapel Hill Office of Human Research Ethics (OHRE), Institutional Review Board (IRB), and by the Lineberger Comprehensive Cancer Center Oncology Protocol Review Committee (PRC).
Biographies

ANNA V. PHILLIPS (Student Member, IEEE) received the B.S. degree in biomedical engineering from The Ohio State University in 2020. She is currently pursuing the joint Ph.D. degree with the Gallippi Laboratory, Lampe Joint Department of Biomedical Engineering, University of North Carolina at Chapel Hill and North Carolina State University. Her research interests include ultrasound elastography, biomedical signal and image analysis, breast cancer imaging, and soft tissue biomechanics.

CHERIE M. KUZMIAK is a Professor of Radiology at the School of Medicine, University of North Carolina (UNC). She is the Breast Section Chief for the American College of Radiology Institute for Radiologic Pathology (ACR-AIRP). In addition, she is a committee member for both the ACR Appropriateness Criteria and the RSNA Annual Meeting Program Planning Committee. Her research focus is on the development of novel breast imaging tools, breast cancer genomics, and the application of artificial intelligence/machine learning.

GABRIELA TORRES (Member, IEEE) received the B.S. degree in electrical engineering from Pontificia Universidad Catolica del Peru, Lima, Peru, in 2015, and the Ph.D. degree in biomedical engineering from the University of North Carolina at Chapel Hill, Chapel Hill, NC, USA, and the North Carolina State University, Raleigh, NC, USA, in 2021. She is a Ph.D. Staff Ultrasound Engineer at Siemens Healthineers. Her current research interests include acoustic radiation force imaging methods and cardiovascular elasticity imaging.

CATERINA M. GALLIPPI (Senior Member, IEEE) received the B.S.E. degree in electrical engineering from Princeton University and the Ph.D. degree in biomedical engineering from Duke University in 2003, with a focus on ultrasonic imaging. She is currently a Professor with the Lampe Joint Department of Biomedical Engineering, University of North Carolina at Chapel Hill and North Carolina State University, NC, USA. She is also the Director of the Unified Medical Ultrasound Technology Development (UNMUTED) NIH T32 Predoctoral Training Program. Her research interests include radiation force imaging, adaptive signal filtering, multidimensional motion tracking, and magneto-motive ultrasound. Her group works to apply these novel technologies to cardiovascular disease, cancer, kidney disease, and musculoskeletal disorders.
REFERENCES
- [1].Arnold M et al. , “Current and future burden of breast cancer: Global statistics for 2020 and 2040,” Breast, vol. 66, pp. 15–23, Dec. 2022. [Online]. Available: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9465273/ [DOI] [PMC free article] [PubMed] [Google Scholar]
- [2].Myers ER et al. , “Benefits and harms of breast cancer screening: A systematic review,” JAMA, vol. 314, no. 15, p. 1615, Oct. 2015. [Online]. Available: http://jama.jamanetwork.com/article.aspx?doi=10.1001/jama.2015.13183 [DOI] [PubMed] [Google Scholar]
- [3].Chetlen A, Mack J, and Chan T, “Breast cancer screening controversies: Who, when, why, and how?” Clin. Imag, vol. 40, no. 2, pp. 279–282, Mar. 2016. [Online]. Available: https://linkinghub.elsevier.com/retrieve/pii/S0899707115001473 [DOI] [PubMed] [Google Scholar]
- [4].Devolli-Disha E, Manxhuka-Kërliu S, Ymeri H, and Kutllovci A, “Comparative accuracy of mammography and ultrasound in women with breast symptoms according to age and breast density,” Bosnian J. Basic Med. Sci, vol. 9, no. 2, pp. 131–136, May 2009. [Online]. Available: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5638217/ [DOI] [PMC free article] [PubMed] [Google Scholar]
- [5].Lee SE, Yoon JH, Son N-H, Han K, and Moon HJ, “Screening in patients with dense breasts: Comparison of mammography, artificial intelligence, and supplementary ultrasound,” Amer. J. Roentgenol, vol. 222, no. 1, Jan. 2024, Art. no. 2329655. [Online]. Available: https://ajronline.org/doi/full/10.2214/AJR.23.29655 [DOI] [PubMed] [Google Scholar]
- [6].Gao L-Y et al. , “Can combined screening of ultrasound and elastography improve breast cancer identification compared with MRI in women with dense breasts—A multicenter prospective study,” J. Cancer, vol. 11, no. 13, pp. 3903–3909, 2020. [Online]. Available: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7171498/ [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Barr R, Destounis S, Lackey L, Svensson W, and Smith C, “Evaluation of breast lesions using sonographic elasticity imaging a multicenter trial,” J. Ultrasound Med., Off. J. Amer. Inst. Ultrasound Med, vol. 31, pp. 7–281, Feb. 2012. [DOI] [PubMed] [Google Scholar]
- [8].Grajo JR and Barr RG, “Strain elastography for prediction of breast cancer tumor grades,” J. Ultrasound Med, vol. 33, no. 1, pp. 129–134, Jan. 2014. [Online]. Available: https://onlinelibrary.wiley.com/doi/pdf/10.7863/ultra.33.1.129. [DOI] [PubMed] [Google Scholar]
- [9].Imtiaz S, “Breast elastography: A new paradigm in diagnostic breast imaging,” Applied Radiol., vol. 47, no. 3, pp. 14–19, Mar. 2018. [Online]. Available: https://www.crossref.org/ [Google Scholar]
- [10].Nabavizadeh A et al. , “Viscoelastic biomarker for differentiation of benign and malignant breast lesion in ultra- low frequency range,” Sci. Rep, vol. 9, no. 1, p. 5737, Apr. 2019. [Online]. Available: https://www.nature.com/articles/s41598-019-41885-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Zhang H et al. , “Fluidity and elasticity form a concise set of viscoelastic biomarkers for breast cancer diagnosis based on Kelvin–Voigt fractional derivative modeling,” Biomechanics Model. Mechanobiol, vol. 19, no. 6, pp. 2163–2177, Dec. 2020, doi: 10.1007/s10237-020-01330-7. [DOI] [PubMed] [Google Scholar]
- [12].Selzo MR and Gallippi CM, “Viscoelastic response (VisR) imaging for assessment of viscoelasticity in voigt materials,” IEEE Trans. Ultrason., Ferroelectr., Freq. Control, vol. 60, no. 12, pp. 2488–2500, Dec. 2013. [Online]. Available: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4164206/ [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].Selzo MR, Moore CJ, Hossain MM, Palmeri ML, and Gallippi CM, “On the quantitative potential of viscoelastic response (VISR) ultrasound using the one-dimensional mass-spring-damper model,” IEEE Trans. Ultrason., Ferroelectr., Freq. Control, vol. 63, no. 9, pp. 1276–1287, Sep. 2016. [Online]. Available: http://ieeexplore.ieee.org/document/7428963/ [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14].Barr RG, “Future of breast elastography,” Ultrasonography, vol. 38, no. 2, pp. 93–105, Apr. 2019. [Online]. Available: http://e-ultrasonography.org/journal/view.php?doi=10.14366/usg.18053 [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].Barr RG, “Breast elastography: How to perform and integrate into a ‘best-practice’ patient treatment algorithm,” Journal Ultrasound Med., vol. 39, no. 1, pp. 7–17, Jan. 2020. [Online]. Available: https://onlinelibrary.wiley.com/doi/10.1002/jum.15137 [DOI] [PubMed] [Google Scholar]
- [16].Barr RG, “Real-time ultrasound elasticity of the breast: Initial clinical results,” Ultrasound Quart., vol. 26, no. 2, pp. 61–66, Jun. 2010. [Online]. Available: https://journals.lww.com/ultrasound-quarterly/fulltext/2010/06000/realtimeultrasoundelasticityofthebreast.2.aspx [DOI] [PubMed] [Google Scholar]
- [17].Alhabshi SMI et al. , “Semi-quantitative and qualitative assessment of breast ultrasound elastography in differentiating between malignant and benign lesions,” Ultrasound Med. Biol, vol. 39, no. 4, pp. 568–578, Apr. 2013. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S0301562912006655 [DOI] [PubMed] [Google Scholar]
- [18].Destounis S et al. , “Clinical experience with elasticity imaging in a community-based breast center,” J. Ultrasound Med, vol. 32, no. 2, pp. 297–302, Feb. 2013. [Online]. Available: https://onlinelibrary.wiley.com/doi/pdf/10.7863/jum.2013.32.2.297. [DOI] [PubMed] [Google Scholar]
- [19].Nakashima K et al. , “JSUM ultrasound elastography practice guidelines: Breast,” J. Med. Ultrason, vol. 40, no. 4, pp. 359–391, Oct. 2013. [Online]. Available: http://link.springer.com/10.1007/s10396-013-0457-0 [DOI] [PubMed] [Google Scholar]
- [20].Barr RG et al. , “WFUMB guidelines and recommendations for clinical use of ultrasound elastography: Part 2: Breast,” Ultrasound Med. Biol, vol. 41, no. 5, pp. 1148–1160, May 2015. [Online]. Available: https://linkinghub.elsevier.com/retrieve/pii/S0301562915002215 [DOI] [PubMed] [Google Scholar]
- [21].Berg WA et al. , “Shear-wave elastography improves the specificity of breast U.S.: The BE1 multinational study of 939 masses,” Radiology, vol. 262, no. 2, pp. 435–449, Feb. 2012. [Online]. Available: https://pubs.rsna.org/doi/epdf/10.1148/radiol.11110640?src=getftr [DOI] [PubMed] [Google Scholar]
- [22].Ranjkesh M, Hajibonabi F, Seifar F, Tarzamni MK, Moradi B, and Khamnian Z, “Diagnostic value of elastography, strain ratio, and elasticity to B-mode ratio and color Doppler ultrasonography in breast lesions,” Int. J. Gen. Med, vol. 13, pp. 215–224, May 2020. [Online]. Available: https://www.dovepress.com/diagnostic-value-of-elastography-strain-ratio-and-elasticity-to-b-mode-peer-reviewed-article-IJGM [DOI] [PMC free article] [PubMed] [Google Scholar]
- [23].Phillips AV et al. , “Breast lesion diameter in VisR imaging differs between malignant and benign masses in women,” in Proc. IEEE Int. Ultrason. Symp. (IUS), Venice, Italy, Oct. 2022, pp. 1–3. [Online]. Available: https://ieeexplore.ieee.org/document/9958080/ [Google Scholar]
- [24].Hossain MM and Gallippi CM, “Viscoelastic response ultrasound derived relative elasticity and relative viscosity reflect true elasticity and viscosity: In silico and experimental demonstration,” IEEE Trans. Ultrason., Ferroelectr., Freq. Control, vol. 67, no. 6, pp. 1102–1117, Jun. 2020. [Online]. Available: https://ieeexplore.ieee.org/document/8945347/ [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].Torres G et al. , “Viscoelastic response (VisR)-derived mechanical anisotropy for differentiating malignant from benign breast lesions in women, in vivo,” in Proc. IEEE Int. Ultrason. Symp. (IUS), Glasgow, U.K., Oct. 2019, pp. 1372–1374. [Online]. Available: https://ieeexplore.ieee.org/document/8925814/ [Google Scholar]
- [26].Ramião NG, Martins PS, Rynkevic R, Fernandes AA, Barroso M, and Santos DC, “Biomechanical properties of breast tissue, a state-of-the-art review,” Biomechanics Model. Mechanobiol, vol. 15, no. 5, pp. 1307–1323, Oct. 2016. [DOI] [PubMed] [Google Scholar]
