Key Points
Question
Can in silico imaging trials play a role in the evaluation of new medical imaging systems?
Findings
This diagnostic study used computer-simulated imaging of 2986 synthetic image–based virtual patients to compare digital mammography and digital breast tomosynthesis and found an improved lesion detection performance favoring tomosynthesis for all breast sizes and lesion types. The increased performance for tomosynthesis was consistent with results from a comparative trial using human patients and radiologists.
Meaning
The study’s findings suggest that in silico imaging trials and imaging system computer simulation tools can in some cases be considered viable sources of evidence for the regulatory evaluation of imaging devices.
This diagnostic study uses computer-simulated imaging to evaluate digital breast tomosynthesis vs digital mammography for the detection of lesions.
Abstract
Importance
Expensive and lengthy clinical trials can delay regulatory evaluation of innovative technologies, affecting patient access to high-quality medical products. Simulation is increasingly being used in product development but rarely in regulatory applications.
Objectives
To conduct a computer-simulated imaging trial evaluating digital breast tomosynthesis (DBT) as a replacement for digital mammography (DM) and to compare the results with a comparative clinical trial.
Design, Setting, and Participants
The simulated Virtual Imaging Clinical Trial for Regulatory Evaluation (VICTRE) trial was designed to replicate a clinical trial that used human patients and radiologists. Images obtained with in silico versions of DM and DBT systems via fast Monte Carlo x-ray transport were interpreted by a computational reader detecting the presence of lesions. A total of 2986 synthetic image–based virtual patients with breast sizes and radiographic densities representative of a screening population and compressed thicknesses from 3.5 to 6 cm were generated using an analytic approach in which anatomical structures are randomly created within a predefined breast volume and compressed in the craniocaudal orientation. A positive cohort contained a digitally inserted microcalcification cluster or spiculated mass.
Main Outcomes and Measures
The trial end point was the difference in area under the receiver operating characteristic curve between modalities for lesion detection. The trial was sized for an SE of 0.01 in the change in area under the curve (AUC), half the uncertainty in the comparative clinical trial.
Results
In this trial, computational readers analyzed 31 055 DM and 27 960 DBT cases from 2986 virtual patients with the following Breast Imaging Reporting and Data System densities: 286 (9.6%) extremely dense, 1200 (40.2%) heterogeneously dense, 1200 (40.2%) scattered fibroglandular densities, and 300 (10.0%) almost entirely fat. The mean (SE) change in AUC was 0.0587 (0.0062) (P < .001) in favor of DBT. The change in AUC was larger for masses (mean [SE], 0.0903 [0.008]) than for calcifications (mean [SE], 0.0268 [0.004]), which was consistent with the findings of the comparative trial (mean [SE], 0.065 [0.017] for masses and −0.047 [0.032] for calcifications).
Conclusions and Relevance
The results of the simulated VICTRE trial are consistent with the performance seen in the comparative trial. While further research is needed to assess the generalizability of these findings, in silico imaging trials represent a viable source of regulatory evidence for imaging devices.
Introduction
Expensive and lengthy clinical trials for imaging products often hinder regulatory evaluation, are burdensome, and delay patient access to novel, high-quality devices. The evaluation of novel imaging technologies for screening typically requires a substantial clinical trial to demonstrate benefits compared with the standard of care.1 For example, regulatory approvals of digital breast tomosynthesis (DBT) are supported by clinical trials involving significant resources. A recent submission to the US Food and Drug Administration (FDA) of a DBT system as a replacement for digital mammography (DM), for example,2 relied on a clinical trial involving 400 women in 7 clinical sites receiving double exposure to ionizing radiation over many years, with 31 participating radiologists. We refer to this trial as the comparative trial. Leveraging recent progress in computer modeling and simulation as an alternative source of regulatory evidence with growing widespread support,3,4,5 we report on an in silico replication of the comparative trial to demonstrate the potential of this emerging approach.
While computational models are used to some extent in the regulatory evaluation of medical devices,6 their use in imaging has been rare. Models of imaging systems have significantly improved since the late 1980s,7,8,9 with sophisticated simulation tools increasingly used in research and development. In the last decade, powerful, efficient, and open-source radiation imaging system simulation tools have become freely available, allowing for a greater understanding of design choices. Recent efforts have established that computational methods can model many of the characteristics of breast anatomy and pathology10,11 as well as the physics of imaging.12,13 In addition, image interpretation algorithms have been shown to track the performance of human readers for specific visual tasks.14 While further research to advance in silico methods is needed, all elements required to perform an in silico imaging trial are rapidly approaching, or have already achieved, mature development stages warranting investigations into their use in replacing traditional clinical studies.
We report an all–in silico replication of a previously conducted imaging clinical trial used in support of the regulatory evaluation of DBT as a replacement for DM in breast cancer screening. Although studies have described models of DBT and DM systems,15,16 we know of no report of a computer-simulated imaging trial that replicates in size and nature a trial performed in support of a regulatory evaluation for which all codes and data sets are freely available in open-source format. We first describe the in silico replica (the Virtual Imaging Clinical Trial for Regulatory Evaluation, or VICTRE trial), including the trial population, the physics of the imaging systems, and the image reconstruction and interpretation algorithms. We then compare the results of VICTRE with those obtained in the human comparative trial and examine the limitations, benefits, and cost savings of the in silico approach.
Methods
Imaging Protocol
The study followed the Standards for Reporting of Diagnostic Accuracy (STARD) reporting guideline. Because the VICTRE trial was entirely simulated and made use of no human subject data, review by the FDA’s institutional review board was not applicable per the agency’s internal standard operating procedures.
Synthetic images of virtual patients were obtained using an in silico version of the Siemens Mammomat Inspiration17 DM and DBT system using a customized version of the MC-GPU Monte Carlo transport code18 (eAppendix 1 in the Supplement). The imaging parameters were selected based on publicly available device specifications and measurements19 for each compressed breast thickness. The Monte Carlo algorithm uses the known x-ray interaction physics and a stream of random numbers to sample a large number of x-ray tracks through the patient. Realistic models of the x-ray source and detector were created based on the technical specifications of the device being replicated. The simulation is physics based (no parameter was fitted to artificially force agreement on any performance metric). In the source model, the focal spot was modeled with a truncated Gaussian distribution with full width at half maximum equal to the nominal spot size of 0.3 mm. Radiographic spectra corresponded to 28 kilovolt (peak) (kV[p]) (for fatty and scattered breasts) and 30 kV(p) (for dense and heterogeneously dense breasts) from a tungsten anode with 50-μm rhodium and 1-mm beryllium filters.20 An analytical antiscatter grid21 was included in DM acquisitions (5:1 ratio, 31 line pairs/mm). The detector consisted of 2816 × 3584, 85-μm pixels with a 200-μm amorphous selenium transducer thickness. Depth of interaction and fluorescence emission in the selenium layer were explicitly calculated (eFigure 1 in the Supplement). Electronic noise was described using a Gaussian distribution with a variance of 5200 electrons.22 Pixel gain was set at 50 eV per detected charge with 0.99 Swank noise. The DBT system used 25 projections within a span of 50°. During DBT acquisition, x-ray tube motion was modeled by uniformly extending the focal spot along a 0.18° arc.23 Device simulations were accelerated using graphics processing unit computing24 and a delta-scattering transport algorithm. Variance reduction techniques were not used to preserve the realism of the quantum noise in the simulated images. Memory requirements were reduced using a binary tree voxel geometry.24 We reconstructed DBT volumes using a filtered back-projection algorithm.25,26,27 Because Siemens’ reconstruction algorithm is proprietary, we instead used a smoothing filter for a visually reasonable balance between sharpness and noise with negligible effect on reader performance.16,28 A flow diagram of the trial showing virtual patients’ progress through the study is presented in Figure 1.
Figure 1. Virtual Patients’ Progress Through the Study .
Virtual patients underwent imaging with digital mammography (DM) and digital breast tomosynthesis (DBT). Multiple cases were obtained from each virtual patient image and used for image interpretation. A indicates the area under the receiver operating characteristic curve; 3-D, 3-dimensional.
Trial Population
The trial cohort consisted of virtual female patients whose breasts were generated using a procedural analytic model in which major anatomical structures (fat and glandular tissues, ductal tree, vasculature, and ligaments) are randomly generated within a predefined breast volume bounded by skin and chest wall29 at a voxel resolution of 50 μm (eAppendix 2, eFigure 2, and eFigure 3 in the Supplement). The model allows for control of patient characteristics such as breast volume, compressed thickness, and density, which are known to affect breast cancer lesion detection. Physical breast compression in craniocaudal orientation was performed using finite-element solid mechanics software.30 The breasts in this population fell into 4 density categories: extremely dense (0.548 glandular volume fraction [GVF]), heterogeneously dense (0.339 GVF), scattered fibroglandular densities (0.143 GVF), and almost entirely fat (0.071 GVF), with corresponding compressed thicknesses of 3.5, 4.5, 5.5, and 6 cm (Table 1). The positive cohort contained 2 types of lesions: a spiculated mass11 with a 5-mm nominal diameter and mass density 2% higher than normal glandular tissue and a microcalcification cluster consisting of 5 calcified lesions positioned within a 5-mm3 volume (eAppendix 2 in the Supplement). The calcifications were modeled as 195, 179, and 171 μm of solid calcium oxalate with a mass density scaled by 0.84 (1.78 g/cm3).31 The lesions were digitally inserted in a subset of the compressed breasts to create a positive cohort within each density category. To reduce computing time, up to 8 lesions (4 masses and 4 clustered microcalcifications) were inserted in approximately half of the virtual patients. The location of the inserted pathology was chosen randomly from candidate locations determined by the position of the terminal duct lobular units, a common site for carcinogenesis. Pathologies were nonoverlapping and did not extend into the chest wall or skin layer. Sample images of breast and lesions are presented in Figure 2. The lesion characteristics were adjusted during several prepilot stages to achieve a DM performance comparable to reported values.1
Table 1. Cohort Characteristics of the Virtual Imaging Clinical Trial for Regulatory Evaluation Population With Cases Corresponding to Regions of Interest in DM and Volumes of Interest in DBT by Breast Density Classesa.
Virtual Patients and Cases | No. (%) | ||||
---|---|---|---|---|---|
Total | Extremely Dense | Heterogeneously Dense | Scattered Fibroglandular Densities | Almost Entirely Fat | |
All virtual patients | 2986 (100) | 286 (9.6) | 1200 (40.2) | 1200 (40.2) | 300 (10.0) |
Virtual patients with lesion | 1944 (100) | 189 (9.7) | 780 (40.1) | 780 (40.1) | 195 (10.0) |
DM Cases | |||||
Normal | 15 527 (100) | 1499 (9.7) | 6237 (40.2) | 6232 (40.1) | 1559 (10.0) |
With lesion | 15 528 (100) | 1499 (9.7) | 6237 (40.2) | 6232 (40.1) | 1560 (10.0) |
With spiculated mass | 7756 (100) | 747 (9.6) | 3117 (40.2) | 3112 (40.1) | 780 (10.1) |
With microcalcification cluster | 7772 (100) | 752 (9.8) | 3120 (40.1) | 3120 (40.1) | 780 (10.0) |
DBT Cases | |||||
Normal | 12 443 (100) | 1244 (10) | 4968 (39.9) | 4996 (40.2) | 1235 (9.9) |
With lesion | 15 527 (100) | 1499 (9.7) | 6237 (40.2) | 6232 (40.1) | 1559 (10.0) |
With spiculated mass | 7756 (100) | 747 (9.6) | 3117 (40.2) | 3112 (40.1) | 780 (10.0) |
With microcalcification cluster | 7772 (100) | 752 (9.6) | 3120 (40.1) | 3120 (40.1) | 780 (10.0) |
Glandular volume fraction, mean (SD)b | 0.256 (0.001) | 0.548 (0.001) | 0.339 (0.001) | 0.143 (0.001) | 0.071 (0.001) |
Volume, mean (SD), cm3b | 342.3 (0.001) | 111.5 (0.001) | 218.0 (0.001) | 441.2 (0.001) | 685.6 (0.001) |
Compressed thickness, mean (SD), cmb | 4.94 (0.001) | 3.49 (0.001) | 4.49 (0.001) | 5.50 (0.001) | 5.99 (0.001) |
Abbreviations: DBT, digital breast tomosynthesis; DM, digital mammography.
According to Breast Imaging Reporting and Data System.
Based on a sample of 50 breasts per breast density class.
Figure 2. Example Images From the Virtual Imaging Clinical Trial for Regulatory Evaluation .
A and B, Digital mammography (DM) (A) and selected digital breast tomosynthesis (DBT) slice (B) of a case corresponding to a breast with scattered areas of fibroglandular density containing a microcalcification cluster (inserts). C and D, Digital mammography (C) and selected DBT slice (D) of a case corresponding to a breast with scattered areas of fibroglandular density containing a spiculated mass lesion (arrowheads). Lesions have been made more conspicuous for display purposes by artificially increasing their radiography attenuation during image acquisition.
Study Design
Approximately two-thirds of the trial virtual patients had a cancerous lesion corresponding to a microcalcification cluster or a spiculated mass. In silico cases corresponded to regions of interest extracted from DM images and DBT volumes. Analogously to the enrichment of cancer cases in the comparative trial population with patients found to have abnormal findings under a DM examination, the VICTRE trial population was designed to maximize the study’s statistical power. The target uncertainty was achieved by selecting an appropriate combination of number of diseased and normal cases, number of computational readers, and case difficulty. The design choices were to use the same number of readers as in the comparative trial (30) and vary the proportion of diseased cases to obtain sufficient cases for adequate number of training and testing sets.
Virtual patients were excluded if a computational error occurred during generation, compression, insertion of pathology, or acquisition and reconstruction of images. Exclusion criteria were enforced by monitoring software messages and by visually inspecting all images. Virtual patients whose compression failed to converge to target thicknesses (frequency of 0.03%) were replaced. In total, 14 of the 3000 virtual patients in the initial cohort were excluded.
The VICTRE trial was designed by the authors in consultation with a technical committee of industry, academia, and FDA representatives. The trial methods evolved in 2017 during several prepilot stages for model development, verification of input and output formats, and imaging parameters. All models used in the VICTRE pivotal trial were fixed prior to the pilot stage, which took place in the last half of 2017. The sponsor of the comparative trial had no role in the design and execution of the VICTRE trial.
Image Interpretation
Images were interpreted by a computational reader under a location-known-exactly detection paradigm based on a channelized Hotelling observer16,32,33 using 5 Laguerre-Gauss channels with widths commensurate to lesion size.34 The Laguerre-Gauss channelized Hotelling observer is an efficient model observer and has been shown capable of trending human performance in detecting approximately round targets in backgrounds without strong directional texture.28,35 The computational reader for DBT used 3-dimensional channels by stacking the 2-dimensional channels for each slice following the volumetric approach of Platiša et al.36 For calcifications, spatial frequency filtering was used to adapt to irregular morphological features37 (eAppendix 3 and eFigure 4 in the Supplement). Thirty computational readers interpreted images (DM) and volumes (DBT) with (positive case) or without (negative case) a lesion. For each density group, 30 computational readers were trained with different sets of 100 pairs of positive and negative cases randomly sampled from a larger set of training pairs (260 for dense and fatty and 1000 for heterogeneously dense and scattered density breasts). For dense and fatty breasts, performance was tested on 360 negative and 500 positive cases. For heterogeneously dense and scattered density breasts, performance was tested on 1500 negative and 2000 positive cases. Using a fully crossed interpretation paradigm, all 30 virtual readers in the VICTRE trial interpreted all cases in the test data sets (eFigure 5 in the Supplement).
Comparative Human Trial
The human trial used for the design of the VICTRE trial was conducted between 2012 and 2016, and submitted to the FDA in support of a premarket application for the approval of the DBT mode in Mammomat Inspiration17 as a replacement for DM (T. Mertelmeier, PhD, unpublished data, December 2016). Written informed consent was obtained from all participants in the study and institutional review board approvals were obtained from all collecting sites. The patient cohort in the comparative trial consisted of 326 asymptomatic women enriched based on DM, and double-exposed to radiation under both modalities. It included 21 patients with extremely dense breasts (5 positives), 156 with heterogeneously dense breasts (46 positives), 130 with scattered fibroglandular densities (41 positives), and 19 with almost entirely fatty breasts (10 positives). Follow-up examinations on 141 patients and biopsies on 83 patients were used to verify that the patients did not have malignant breast cancer, with biopsy-verified malignant cancer in 104 patients. The trial involved case selection from 7 clinical sites and took approximately 4 years to complete. Thirty-one certified radiologists reported 134 cancerous lesions, including 85 masses, 29 calcified lesions, 12 architectural distortions, and 8 asymmetric densities, with a total of 108 invasive cancers and 26 ductal carcinomas in situ. The trial reported 29 lesions of size less than 10 mm (21.6%), 40 lesions from 10 to 19 mm (29.9%), 33 lesions from 20 to 29 mm (24.6%), and 27 lesions larger than 30 mm (20.1%). The target, per-view, average (mean) glandular dose (AGD) for the comparative trial was approximately 1.0 and 1.5 mGy for DM and DBT, respectively. The comparative trial differential performance favored DBT by 0.043 (0.017), with area under the curve (AUC) measurements of 0.818 (0.019) and 0.861 (0.019) for 2-view DM and DBT, respectively (see Table 2 for subgroup outcomes).
Table 2. Trial End Points (AUC and Change in AUC) for DM and DBT per Lesion Type and per Radiographic Density With Radiation Dose Estimates for Each Subgroup and Breast-Level Nonparametric AUC for the Comparative Trial.
Subgroup | AUC (SE) | Change in AUC (SE)a | |
---|---|---|---|
DM | DBT | ||
VICTRE Trial | |||
By size and radiographic density | |||
Total | 0.9009 (0.0058) | 0.9596 (0.0035) | 0.0587 (0.0062) |
Extremely dense | 0.8358 (0.0127) | 0.9020 (0.0106) | 0.0657 (0.0148) |
Heterogeneously dense | 0.8643 (0.0067) | 0.9372 (0.0042) | 0.0724 (0.0073) |
Scattered fibroglandular densities | 0.9416 (0.0038) | 0.9865 (0.0014) | 0.0449 (0.0038) |
Almost entirely fat | 0.9475 (0.0061) | 0.9975 (0.0014) | 0.0500 (0.0061) |
By lesion type | |||
Total for spiculated mass | 0.8303 (0.0072) | 0.9207 (0.0050) | 0.0903 (0.008) |
Extremely dense with spiculated mass | 0.679 (0.018) | 0.803 (0.015) | 0.124 (0.021) |
Heterogeneously dense with spiculated mass | 0.760 (0.008) | 0.876 (0.006) | 0.116 (0.009) |
Scattered fibroglandular with spiculated mass | 0.902 (0.005) | 0.975 (0.002) | 0.073 (0.005) |
Almost entirely fat with spiculated mass | 0.972 (0.005) | 0.996 (0.002) | 0.024 (0.005) |
Total for microcalcification cluster | 0.971 (0.004) | 0.9983 (0.0003) | 0.0268 (0.004) |
Extremely dense with microcalcification cluster | 0.991 (0.002) | 1.0000 (0.0001) | 0.008 (0.002) |
Heterogeneously dense with microcalcification cluster | 0.968 (0.005) | 0.9980 (0.0003) | 0.029 (0.005) |
Scattered fibroglandular with microcalcification cluster | 0.981 (0.002) | 0.9980 (0.0003) | 0.017 (0.002) |
Almost entirely fat with microcalcification cluster | 0.923 (0.007) | 0.9990 (0.0003) | 0.076 (0.007) |
Comparative Trial | |||
By size and radiographic density | |||
Total | 0.818 (0.019) | 0.861 (0.019) | 0.043 (0.017) |
Dense | 0.802 (0.027) | 0.844 (0.026) | 0.043 (0.026) |
Nondense | 0.826 (0.026) | 0.873 (0.026) | 0.047 (0.021) |
By lesion type | |||
Masses | 0.858 (0.018) | 0.923 (0.018) | 0.065 (0.017) |
Microcalcifications | 0.796 (0.042) | 0.749 (0.041) | −0.047 (0.032) |
Abbreviations: AUC, area under the curve; DBT, digital breast tomosynthesis; DM, digital mammography; VICTRE, Virtual Imaging Clinical Trial for Regulatory Evaluation.
Change in AUC = AUC of DBT − AUC of DM.
Trial End Points
The primary end point of the trial was the difference in AUC between DBT and DM corresponding to the entire patient population. In addition, we report subgroup analyses corresponding to the change in AUC for the 4 different breast classes and 2 lesion types.
Statistical Analysis
The standard error of the change in AUC was estimated using a fully crossed (all virtual patients were imaged in both modalities), multiple-reader, multiple-case analysis using the iMRMC software (available at https://github.com/DIDSR/iMRMC).38 The trial was sized during a pilot study for an SE of 0.01 in change in AUC, lower than the uncertainty seen in the comparative human trial. To prevent bias, DBT performance was not analyzed as models were developed during prepilot stages. We considered 2-sided 95% confidence intervals (level of significance, P < .05).
Results
In this simulated trial, computational readers analyzed 31 055 DM and 27 960 DBT cases from 2986 virtual patients with the following Breast Imaging Reporting and Data System densities: 286 (9.6%) extremely dense, 1200 (40.2%) heterogeneously dense, 1200 (40.2%) scattered fibroglandular densities, and 300 (10.0%) almost entirely fat.
A total of 2986 images of virtual patients were obtained for both modalities. The demographic characteristics of the cohort were designed to mirror the comparative trial in breast size, compressed thickness, and radiographic density (Table 1). The mean (SE) AGD for the trial population was 0.94 (0.04) mGy and 1.38 (0.06) mGy for DM and DBT, respectively. The AGD increased from dense (0.74 mGy for DM and 1.09 mGy for DBT) to fatty (1.14 mGy for DM and 1.66 mGy for DBT). Although relating in silico and comparative AGD estimates is not straightforward (eAppendix 4 in the Supplement), in silico AGD levels derived from automatic exposure control settings for virtual patients39 were within 15% of the AGD values in the comparative trial (eFigure 6 in the Supplement). Anatomical image textures were characterized by calculated power-law exponents40 (β [SE]) of 3.88 (0.20) and 2.45 (0.35) for DM and DBT, respectively, decreasing from dense to fatty breasts as expected. This reduction in β for DBT (approximately 0.7) is consistent with observations made on patient DM and DBT images41 (eAppendix 5 and eFigure 7 in the Supplement).
The VICTRE trial, including the generation of virtual patients, was performed between January and May 2018 and conducted in mixed-platform computer clusters containing a variety of central processing unit and graphics processing unit processors. On average, the simulation of each virtual patient took approximately 8 central processing unit–hours and 0.5 graphics processing unit–hours. The pivotal trial took approximately 2 weeks of computations. All code, parameters, and data sets are available at https://github.com/DIDSR/VICTRE (eAppendix 6 in the Supplement).
We observed a mean (SE) AUC of 0.9005 (0.0058) for DM and 0.9596 (0.0035) for DBT with a mean (SE) change in AUC of 0.0587 (0.0062), which was statistically significant (P < .001) in favor of DBT. The differential performance favored DBT in all subgroups. The mean (SE) change in AUC was larger for masses (0.0903 [0.008]) than for calcifications (0.0268 [0.004]), which was consistent with the findings of the comparative trial (mean [SE], 0.065 [0.017] for masses and −0.047 [0.032] for calcifications). In addition, the change in AUC was larger for masses than for calcifications for all breast sizes and subgroups (Table 2 and Figure 3). The differential performance observed is consistent not only with the aggregated results of the comparative trial, but also in terms of a larger differential performance for detecting masses vs microcalcifications.
Figure 3. Trial Primary and Secondary Outcomes.
AUC indicates area under the receiver operating characteristic curve; DBT, digital breast tomosynthesis; DM, digital mammography; VICTRE, Virtual Imaging Clinical Trial for Regulatory Evaluation; and error bars, standard errors.
Discussion
On one hand, traditional trials use devices in actual clinical environments, incorporate the complexity of patient anatomy and disease variability, and use end points that are directly associated with clinical use. On the other hand, in silico trials can encompass larger virtual patient cohorts and control sources of variability to efficiently highlight technological differences without augmenting patient risk from radiation exposure. The VICTRE in silico trial found that DBT outperformed DM for all lesions and types of breasts included in the study with changes in AUCs between 0.027 and 0.090. The results of the VICTRE trial are consistent with the overall differential performance observed in the comparative trial.
An observation not clearly seen in the comparative trial data is that DBT undoubtedly outperforms DM in detecting microcalcifications. Several factors might contribute to this observation. First, in the comparative trial, calcified lesions were defined as any lesion with a calcified finding. Second, the computational models in the VICTRE trial do not include patient motion; thus, small microcalcifications might appear sharper than in the comparative trial images. In addition, future developments in in silico imaging trials might have the potential to elucidate device comparisons that are challenging or impractical to investigate with human participants (for instance, due to very low prevalence), thus generating useful evidence beyond traditional trials.42 An additional difference worth noting is that the comparative trial used 2-view DBT and DM per breast, whereas VICTRE results are based on 1-view imaging.
Validating predictions of computational models is always challenging. This work builds on research that has led to increased confidence in the in silico tools. More generally, the FDA has recently issued guidance on reporting criteria for including computational modeling in device submissions.43 However, VICTRE and other in silico clinical trials have considerable advantages over traditional trials, including larger statistical power, due in part to the availability of larger trial population samples, the ability to study rare cases challenging to document in patients, and the study of prototype devices not yet available.
An additional and notable advantage of in silico clinical trials is the substantial savings in resources. A precise estimate of savings will only emerge once additional in silico trials are reported. While resource savings strongly depend on the specifics of the trial (eg, device characteristics, disease prevalence, and availability of target population), estimates indicate that VICTRE required one-third of the resources required for the comparative trial. The VICTRE trial’s expenditure in scientist-hours was comparable to that in the comparative trial (approximately 3 full-time staff). The comparative trial took approximately 4 years to complete, while VICTRE took 1.75 years. Operational costs of the comparative trial, including additional patient recruitment needed to maintain statistical power given participant nonadherence and dropout, were equal to, if not larger than, the computational costs associated with the VICTRE trial. In addition, the estimate disregards other savings, including the risk from double-exposing hundreds of trial participants to ionizing radiation and cost associated with institutional review board approvals, clinical site fees, patient recruitment, and follow-up expenses. In addition to being conservative, our savings estimate is likely to increase over time as computing resources become increasingly inexpensive and more widely available.
The all–in silico method is not intended to replace, but rather complement and minimize, traditional clinical trials. Incrementally incorporating computational results as prior knowledge in Bayesian trial designs decreases sample size and trial length in the evaluation of medical implants.44 In some cases, patient and medical practitioner involvement will likely remain essential.
Limitations
The simulated VICTRE trial considered only one realization of a spiculated mass and a calcification cluster, neglecting lesion variability and 15% of the lesions in the comparative trial (9% architectural distortions and 6% asymmetries). A 3-dimensional imaging system that outperforms DM in detecting masses and calcifications might also perform well in detecting architectural distortions and asymmetries. A recent retrospective study of cases recommended for biopsy reported that architectural distortion was more commonly detected in DBT than in DM.45 In addition, although VICTRE considered a range of breast thicknesses and radiographic densities, the patient variability of its trial population was in many ways different than the variability seen in clinical trial populations. These and other limitations of the VICTRE trial should be further investigated, including the lack of visual search in the image interpretation46 and a trial outcome defined only as detection of lesions and not as probability of malignancy.
Conclusions
The findings of the simulated VICTRE trial suggest that the regulatory assessment of the specific DM and DBT imaging devices based on in silico data would have been similar to the actual regulatory decision made based on the comparative trial. It is useful to note that the findings of the VICTRE trial are not directly generalizable to other implementations of DM and DBT imaging systems and that additional studies will require appropriate model adaptations. The VICTRE trial, performed exclusively with open-source computational methods, suggests that increased use of computational modeling tools in the regulatory assessment of imaging systems could significantly decrease the burden of bringing new and improved imaging technologies to market. The work reported in this article provides evidence that state-of-the-art computational methods, coupled with predictive methods and laboratory testing, can lead to less burdensome regulatory evaluation approaches. Further investigations will help provide necessary validation of the described approach when applied to the evaluation of various other medical products.
eAppendix 1. VICTRE Trial Imaging Physics
eFigure 1. Comparison of Simulated and Experimental Modulation Transfer Function (MTF) of the VICTRE Detector in Different Configurations
eAppendix 2. Virtual Patients in the VICTRE Trial
eFigure 2. Anatomical Components of Virtual Breast Model
eFigure 3. Some Breast Model Elements
eAppendix 3. Implementation of the VICTRE Trial Computer Reader Algorithms
eFigure 4. Flowchart of the Image Interpretation Process by the Computer Readers in VICTRE
eFigure 5. The Typical Reader Templates Used in VICTRE for Detecting Lesions in DM and DBT
eAppendix 4. Radiation Dose Estimates in the VICTRE Trial
eFigure 6. Radiation Dose Distributions in the VICTRE Trial Population
eAppendix 5. Power-Law Beta Estimates of VICTRE Trial Images
eFigure 7. A Scatterplot of β_recon Versus β_mammo for the Four Breast Glandular Densities
eAppendix 6. The VICTRE Trial Imaging Pipeline
References
- 1.Pisano ED, Yaffe MJ. Breast cancer screening: should tomosynthesis replace digital mammography? JAMA. 2014;311(24):-. doi: 10.1001/jama.2014.6421 [DOI] [PubMed] [Google Scholar]
- 2.US Food and Drug Administration Mammomat inspiration with tomosynthesis option premarket approval: P140011/S003. https://www.accessdata.fda.gov/scripts/cdrh/cfdocs/cfpma/pma.cfm?ID=P140011. Updated October 22, 2018. Accessed October 22, 2018.
- 3.Faris O, Shuren J. An FDA viewpoint on unique considerations for medical-device clinical trials. N Engl J Med. 2017;376(14):1350-1357. doi: 10.1056/NEJMra1512592 [DOI] [PubMed] [Google Scholar]
- 4.Gottlieb S. How FDA plans to help consumers capitalize on advances in science. https://www.fda.gov/NewsEvents/Newsroom/FDAVoices/ucm612016.htm. Published July 7, 2017. Accessed June 6, 2018.
- 5.House report 115-232: agriculture, rural development, food and drug administration, and related agencies appropriations bill. https://www.congress.gov/congressional-report/115th-congress/house-report/232/1?q=%7B%22search%22%3A%5B%22in+silico+trial%22%5D%7D&r=1. July 7, 2017. Accessed June 6, 2018.
- 6.Morrison TM, Dreher ML, Nagaraja S, Angelone LM, Kainz W. The role of computational modeling and simulation in the total product life cycle of peripheral vascular devices. J Med Device. 2017;11(2):024503. doi: 10.1115/1.4035866 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Van Metter R, Dillon PLP, Huff KE, Rabbani M Computer simulation of radiographic screen-film images. Paper presented at: Application of Optical Instrumentation in Medicine XIV and Picture Archiving and Communication Systems (PACS IV) for Medical Applications; 1986; Newport Beach, CA. doi: 10.1117/12.975380 [DOI] [Google Scholar]
- 8.Burgess AE, Wagner RF, Jennings RJ, Barlow HB. Efficiency of human visual signal discrimination. Science. 1981;214(4516):93-94. doi: 10.1126/science.7280685 [DOI] [PubMed] [Google Scholar]
- 9.Jafroudi H, Muntz EP, Bernstein H, Jennings RJ Multiparameter optimization of mammography: an update. In: Fullerton GD, Mulvaney JA, Haus AG, Properzio WS, eds. SPIE Proceedings Vol 0347: Application of Optical Instrumentation in Medicine X Bellingham, WA: SPIE; 1982. doi: 10.1117/12.933811 [DOI] [Google Scholar]
- 10.Sturgeon GM, Park S, Segars WP, Lo JY. Synthetic breast phantoms from patient based eigenbreasts. Med Phys. 2017;44(12):6270-6279. doi: 10.1002/mp.12579 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.de Sisternes L, Brankov JG, Zysk AM, Schmidt RA, Nishikawa RM, Wernick MN. A computational model to generate simulated three-dimensional breast masses. Med Phys. 2015;42(2):1098-1118. doi: 10.1118/1.4905232 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Fang Y, Badal A, Allec N, Karim KS, Badano A. Spatiotemporal Monte Carlo transport methods in x-ray semiconductor detectors: application to pulse-height spectroscopy in a-Se. Med Phys. 2012;39(1):308-319. doi: 10.1118/1.3669486 [DOI] [PubMed] [Google Scholar]
- 13.Young S, Bakic PR, Myers KJ, Jennings RJ, Park S. A virtual trial framework for quantifying the detectability of masses in breast tomosynthesis projection data. Med Phys. 2013;40(5):051914. doi: 10.1118/1.4800501 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Abbey CK, Barrett HH. Human- and model-observer performance in ramp-spectrum noise: effects of regularization and object variability. J Opt Soc Am A Opt Image Sci Vis. 2001;18(3):473-488. doi: 10.1364/JOSAA.18.000473 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Gong X, Glick SJ, Liu B, Vedula AA, Thacker S. A computer simulation study comparing lesion detection accuracy with digital mammography, breast tomosynthesis, and cone-beam CT breast imaging. Med Phys. 2006;33(4):1041-1052. doi: 10.1118/1.2174127 [DOI] [PubMed] [Google Scholar]
- 16.Zeng R, Park S, Bakic P, Myers KJ. Evaluating the sensitivity of the optimization of acquisition geometry to the choice of reconstruction algorithm in digital breast tomosynthesis through a simulation study. Phys Med Biol. 2015;60(3):1259-1288. doi: 10.1088/0031-9155/60/3/1259 [DOI] [PubMed] [Google Scholar]
- 17.Siemens Healthcare Mammomat Inspiration with PRIME technology: the reference in low-dose mammography. https://www.healthcare.siemens.com/mammography/digital-mammography/mammomat-inspiration-prime/features. Accessed October 22, 2018.
- 18.Badal A, Badano A. Accelerating Monte Carlo simulations of photon transport in a voxelized geometry using a massively parallel graphics processing unit. Med Phys. 2009;36(11):4878-4880. doi: 10.1118/1.3231824 [DOI] [PubMed] [Google Scholar]
- 19.Strudley CJ, Warren LM, Young KC Technical evaluation of the Siemens Mammomat Inspiration digital breast tomosynthesis system. https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/505808/nhsbsp-equipment-report-1306v2_uploaded_070316.pdf. Accessed October 24, 2018.
- 20.Siemens. Online tool for the simulation of x-ray spectra. https://www.oem-xray-components.siemens.com/. Accessed October 24, 2018.
- 21.Day GJ, Dance DR. X-ray transmission formula for antiscatter grids. Phys Med Biol. 1983;28(12):1429-1433. doi: 10.1088/0031-9155/28/12/008 [DOI] [PubMed] [Google Scholar]
- 22.Zhao B, Zhou J, Hu YH, Mertelmeier T, Ludwig J, Zhao W. Experimental validation of a three-dimensional linear system model for breast tomosynthesis. Med Phys. 2009;36(1):240-251. doi: 10.1118/1.3040178 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Mackenzie A, Marshall NW, Hadjipanteli A, Dance DR, Bosmans H, Young KC. Characterisation of noise and sharpness of images from four digital breast tomosynthesis systems for simulation of images for virtual clinical trials. Phys Med Biol. 2017;62(6):2376-2397. doi: 10.1088/1361-6560/aa5dd9 [DOI] [PubMed] [Google Scholar]
- 24.Badal A, Badano A A GPU-optimized binary space partition structure to accelerate the Monte Carlo simulation of CT projections of voxelized patient models with metal implants. Paper presented at: 2012 9th IEEE International Symposium on Biomedical Imaging (ISBI); May 2-5, 2012; Barcelona, Spain. doi: 10.1109/ISBI.2012.6235628 [DOI] [Google Scholar]
- 25.Siemens. 3D Breast tomosynthesis—intelligent technology for clear clinical benefits. http://www.tomosintesi.org/mail/41_Whitepaper_3D_Breast_21.03_102983620_1.pdf. Published 2012. Accessed October 24, 2018.
- 26.Sengupta A, Zeng R, Sharma D, Badano AG The first freely available, open source software package for performing 3D image reconstruction for digital breast tomosynthesis. Paper presented at: SPIE Medical Imaging; March 9, 2018; Houston, TX. doi: 10.1117/12.2293146 [DOI] [Google Scholar]
- 27.Fessler J. Michigan Image Reconstruction Toolbox. http://web.eecs.umich.edu/~fessler/code/index.html. Accessed October 24, 2018.
- 28.Zeng R, Badano A, Myers KJ. Optimization of digital breast tomosynthesis (DBT) acquisition parameters for human observers: effect of reconstruction algorithms. Phys Med Biol. 2017;62(7):2598-2611. doi: 10.1088/1361-6560/aa5ddc [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Graff CG. A new, open-source, multi-modality digital breast phantom. Paper presented at: SPIE Medical Imaging; March 22, 2016; San Diego, CA. doi: 10.1117/12.2216312 [DOI] [Google Scholar]
- 30.Maas SA, Ellis BJ, Ateshian GA, Weiss JA. FEBio: finite elements for biomechanics. J Biomech Eng. 2012;134(1):011005. doi: 10.1115/1.4005694 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Hadjipanteli A, Elangovan P, Mackenzie A, et al. The effect of system geometry and dose on the threshold detectable calcification diameter in 2D-mammography and digital breast tomosynthesis. Phys Med Biol. 2017;62(3):858-877. doi: 10.1088/1361-6560/aa4f6e [DOI] [PubMed] [Google Scholar]
- 32.Myers KJ, Barrett HH. Addition of a channel mechanism to the ideal-observer model. J Opt Soc Am A. 1987;4(12):2447-2457. doi: 10.1364/JOSAA.4.002447 [DOI] [PubMed] [Google Scholar]
- 33.Barrett HH, Myers KJ. Foundations of Image Science. Hoboken, NJ: Wiley Interscience; 2014. [Google Scholar]
- 34.Gallas BD, Barrett HH. Validating the use of channels to estimate the ideal linear observer. J Opt Soc Am A Opt Image Sci Vis. 2003;20(9):1725-1738. doi: 10.1364/JOSAA.20.001725 [DOI] [PubMed] [Google Scholar]
- 35.Petrov D, Marshall N, Young K, Bosmans H Model and human observer reproducibility for detecting microcalcifications in digital breast tomosynthesis images. Paper presented at: SPIE Medical Imaging: Image Perception, Observer Performance, and Technology Assessment; March 7, 2018; Austin, TX. doi: 10.1117/12.2293311 [DOI] [Google Scholar]
- 36.Platiša L, Goossens B, Vansteenkiste E, et al. Channelized Hotelling observers for the assessment of volumetric imaging data sets. J Opt Soc Am A Opt Image Sci Vis. 2011;28(6):1145-1163. doi: 10.1364/JOSAA.28.001145 [DOI] [PubMed] [Google Scholar]
- 37.Diaz I, Abbey CK, Timberg PAS, et al. Derivation of an observer model adapted to irregular signals based on convolution channels. IEEE Trans Med Imaging. 2015;34(7):1428-1435. doi: 10.1109/TMI.2015.2395433 [DOI] [PubMed] [Google Scholar]
- 38.Gallas BD. One-shot estimate of MRMC variance: AUC. Acad Radiol. 2006;13(3):353-362. doi: 10.1016/j.acra.2005.11.030 [DOI] [PubMed] [Google Scholar]
- 39.Svahn TM, Houssami N, Sechopoulos I, Mattsson S. Review of radiation dose estimates in digital breast tomosynthesis relative to those in two-view full-field digital mammography. Breast. 2015;24(2):93-99. doi: 10.1016/j.breast.2014.12.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Burgess AE. Mammographic structure: data preparation and spatial statistics analysis. Paper presented at: SPIE Medical Imaging 1999: Image Processing; May 21, 1999; San Diego, CA. [Google Scholar]
- 41.Cockmartin L, Bosmans H, Marshall NW. Comparative power law analysis of structured breast phantom and patient images in digital mammography and breast tomosynthesis. Med Phys. 2013;40(8):081920. doi: 10.1118/1.4816309 [DOI] [PubMed] [Google Scholar]
- 42.Badano A. In silico imaging: definition, possibilities and challenges. Nucl Instrum Methods Phys Res Sect A. 2011;648(suppl 1):S276-S280. doi: 10.1016/j.nima.2010.11.054 [DOI] [Google Scholar]
- 43.US Food and Drug Administration Reporting of computational modeling studies in medical device submissions—guidance for industry and Food and Drug Administration staff. https://www.fda.gov/downloads/MedicalDevices/DeviceRegulationandGuidance/GuidanceDocuments/UCM381813.pdf. Published 2016. Accessed October 24, 2018.
- 44.Haddad T, Himes A, Thompson L, Irony T, Nair R; MDIC Computer Modeling and Simulation Working Group Participants . Incorporation of stochastic engineering models as prior information in Bayesian medical device trials. J Biopharm Stat. 2017;27(6):1089-1103. doi: 10.1080/10543406.2017.1300907 [DOI] [PubMed] [Google Scholar]
- 45.Bahl M, Lamb LR, Lehman CD. Pathologic outcomes of architectural distortion on digital 2D versus tomosynthesis mammography. AJR Am J Roentgenol. 2017;209(5):1162-1167. doi: 10.2214/AJR.17.17979 [DOI] [PubMed] [Google Scholar]
- 46.Gifford HC, Liang Z, Das M. Visual-search observers for assessing tomographic x-ray image quality. Med Phys. 2016;43(3):1563-1575. doi: 10.1118/1.4942485 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
eAppendix 1. VICTRE Trial Imaging Physics
eFigure 1. Comparison of Simulated and Experimental Modulation Transfer Function (MTF) of the VICTRE Detector in Different Configurations
eAppendix 2. Virtual Patients in the VICTRE Trial
eFigure 2. Anatomical Components of Virtual Breast Model
eFigure 3. Some Breast Model Elements
eAppendix 3. Implementation of the VICTRE Trial Computer Reader Algorithms
eFigure 4. Flowchart of the Image Interpretation Process by the Computer Readers in VICTRE
eFigure 5. The Typical Reader Templates Used in VICTRE for Detecting Lesions in DM and DBT
eAppendix 4. Radiation Dose Estimates in the VICTRE Trial
eFigure 6. Radiation Dose Distributions in the VICTRE Trial Population
eAppendix 5. Power-Law Beta Estimates of VICTRE Trial Images
eFigure 7. A Scatterplot of β_recon Versus β_mammo for the Four Breast Glandular Densities
eAppendix 6. The VICTRE Trial Imaging Pipeline