Abstract
Background
To investigate the reproducibility of automated volumetric bone mineral density (vBMD) measurements from routine thoracoabdominal computed tomography (CT) assessed with segmentations by a convolutional neural network and automated correction of contrast phases, on diverse scanners, with scanner-specific asynchronous or scanner-agnostic calibrations.
Methods
We obtained 679 observations from 278 CT scans in 121 patients (77 males, 63.6%) studied from 04/2019 to 06/2020. Observations consisted of two vBMD measurements from Δdifferent reconstruction kernels (n = 169), Δcontrast phases (n = 133), scan Δsessions (n = 123), Δscanners (n = 63), or Δall of the aforementioned (n = 20), and observations lacking scanner-specific calibration (n = 171). Precision was assessed using root-mean-square error (RMSE) and root-mean-square coefficient of variation (RMSCV). Cross-measurement agreement was assessed using Bland-Altman plots; outliers within 95% confidence interval of the limits of agreement were reviewed.
Results
Repeated measurements from Δdifferent reconstruction kernels were highly precise (RMSE 3.0 mg/cm3; RMSCV 1.3%), even for consecutive scans with different Δcontrast phases (RMSCV 2.9%). Measurements from different Δscan sessions or Δscanners showed decreased precision (RMSCV 4.7% and 4.9%, respectively). Plot-review identified 12 outliers from different scan Δsessions, with signs of hydropic decompensation. Observations with Δall differences showed decreased precision compared to those lacking scanner-specific calibration (RMSCV 5.9 and 3.7, respectively).
Conclusion
Automatic vBMD assessment from routine CT is precise across varying setups, when calibrated appropriately. Low precision was found in patients with signs of new or worsening hydropic decompensation, what should be considered an exclusion criterion for both opportunistic and dedicated quantitative CT.
Relevance statement
Automated CT-based vBMD measurements are precise in various scenarios, including cross-session and cross-scanner settings, and may therefore facilitate opportunistic screening for osteoporosis and surveillance of BMD in patients undergoing routine clinical CT scans.
Key Points
Artificial intelligence-based tools facilitate BMD measurements in routine clinical CT datasets.
Automated BMD measurements are highly reproducible in various settings.
Reliable, automated opportunistic osteoporosis diagnostics allow for large-scale application.
Graphical Abstract
Keywords: Artificial intelligence, Bone density, Densitometry, Osteoporosis, Tomography (x-ray computed)
Background
Osteoporosis is a systemic disease, that destabilizes the bone by demineralization of the osseous tissue and deterioration of the trabecular microstructure [1, 2]. The diagnosis is frequently delayed, because patients remain symptom free, until fragility fractures occur. Those occur in the absence of adequate trauma, but inherit significant morbidity, mortality and enormous socioeconomic consequences [3]. Demographic change aggravates this issue. In the USA, approximately 54 million people over the age of 50 were estimated to suffer from low bone mass or osteoporosis by 2010, and the number is projected to reach 71 million by 2030 [4].
Dual x-ray absorptiometry and quantitative computed tomography (CT) suited to screen for and diagnose osteoporosis are available [5, 6]. However, osteoporosis remains vastly underdiagnosed [2, 6, 7]. Part of the problem is that both methods rely on specialized tools (e.g., calibration phantoms), and, importantly, necessitate a dedicated exam, which inherits a substantial organizational effort.
Opportunistic approaches aim to overcome the limitations by determining bone mineral density (BMD) from exams performed for other indications [8]. The abundance of CT data underscores the possible impact of CT-based opportunistic BMD screening approaches: annual scans in the USA surpassed 278 per 1,000 inhabitants by 2019 [9]. However, the need for exact manual segmentations to determine trabecular volumetric BMD (vBMD) from routine clinical multidetector CT scans limited the viability of this approach so far.
Deep learning-based convolutional neural network frameworks have recently been developed to master this challenge [10]. Such neural networks automatically perform all steps of vertebral body segmentation. The conversion of asynchronous CT-based density values measured in HU into vBMD and corrections for the intravenous contrast media phase can be performed automatically [11–13]. However, to render this approach suitable for mass application, reproducibility has yet to be determined.
Thus, this study aimed to provide comprehensive information on the reproducibility of vBMD measurements performed by a fully automated convolutional neural network framework, utilizing routine clinical thoracoabdominal CT, obtained in different contrast media phases on a diverse set of scanner systems, with varying settings, asynchronously calibrated as well as using a manufacturer-generic, kVp-based calibration.
Methods
Study population
Patients who received at least two consecutive routine thoracoabdominal CT scans with an interscan interval of up to one month and matching regions of interest between April 2019 and June 2020 were identified from the local picture archiving and communication system. The maximum interscan interval of one month was selected to maximize the number of patients eligible for inclusion, while simultaneously aiming to rule out longitudinal changes in vBMD, which have been reported to reach 2% per year in a healthy cohort [14]. To cover the widest possible range of CT scanning systems in this study, scans performed at other institutions but that were imported into our system were also included. All scans were manually checked for lumbar spine coverage by a neuroradiology resident (P.P., 3 years of experience in spine imaging). Scans not covering the lumbar spine, as well as scans with severe beam hardening artifacts or high noise level at the lumbar spine (e.g., due to implants or other foreign material), were excluded. Further exclusion parameters were the presence of inflammatory and neoplastic lesions at the lumbar spine.
Dataset acquisition, scanner calibration, and automated vertebral body segmentation and vBMD extraction
In-house contrast-enhanced scans were performed with a bodyweight-adjusted dosage (≤ 80 kg, 80 mL; 80–100 kg, 90 mL; > 100 kg, 100 mL) of iodined contrast media (Imeron 300, Bracco Imaging Deutschland GmbH, Konstanz, Germany). Tube voltage was 120 kVp (n = 176), or 100 kVp (n = 3) for in-house scanners, with an average tube load of 200 mAs.
In-house scans were obtained on a set of four scanners (Philips Brilliance iCT 256, Philips IQon Spectral CT, and Philips Ingenuity, Philips Medical Systems, Hamburg, Germany; Siemens Somatom Definition AS + , Siemens Healthineers, Erlangen, Germany). External scans were performed on one of eight scanner types (Canon Aquilion, and Canon Aquilion PRIME, Canon Medical Systems, Amstelveen, Netherlands; Siemens Biograph, Siemens Somatom Emotion 16, Siemens Somatom Definition AS, Siemens Somatom Force and Siemens Somatom Emotion 6, Siemens Healthineers, Erlangen, Germany; and Philips Ingenuity Core 128, Philips Medical Systems, Hamburg, Germany).
All reformations with a spatial resolution of ≤ 3 mm craniocaudally and of 5 mm left-right or anterior-posterior were included. In-house scanners were asynchronously calibrated using a commercially available anthropomorphic spine phantom (QRM QSA-717 Phantom; Quality Assurance in Radiology and Medicine GmbH, Möhrendorf, Germany).
In all available reconstructions, automated spine detection, vertebral labelling, and trabecular compartment segmentation steps were performed automatically (SpineQ, version 1.0, Bonescreen GmbH, Munich, Germany, Fig. 1). HU-to-BMD calibration was performed automatically using linear conversion factors based on kVp and scanner type. Automated correction for the contrast media phase was performed using a two-dimensional DenseNET model [12]. Mean trabecular vBMD was calculated across the L1-4 lumbar vertebra (vBMDL1-4) for each observation. Vertebrae considered as unmeasurable according to the ACR criteria [15] were excluded from this mean on a per-patient basis. Specifically, vertebrae with fractures or Modic type 3 changes affecting > 25% of the vertebrae were excluded. For quality assurance, a neuroradiologist trained in spine imaging (P.P., 3 years of experience) reviewed vertebral body segmentation, automatic contrast media phase detection, and supervised in-/exclusion of vertebrae, documenting any manual corrections, if necessary.
Group definitions
Acquisition parameters such as kVp, reconstruction kernel, and slice thickness, as well as scanner model, scan positioning, intravenous contrast media phase, and calibration status are well-known confounders of vBMD measurements [12, 16–19]. To quantify each factor’s impact on reproducibility, we defined a set of six groups with an expected increase in variance. Observations were assigned to groups based on the following criteria:
-
I.
Δrecon: both measurements derived from a single acquisition but using different reconstruction kernels and / or slice thicknesses;
-
II.
Δcontrast: measurements obtained from different contrast media phases, obtained in consecutive acquisitions during a single scan session;
-
III.
Δsession: measurements derived from different scanning sessions at a single calibrated scanner, but both scans had the same contrast media phase;
-
IV.
Δscanner: measurements assessed in the same contrast media phase, but at two different calibrated scanners;
-
V.
Δall: measurements obtained from two different scanners, in different contrast media phases;
-
VI.
scanner-agnostic: measurements obtained from datasets from two different scanners, at least one of which was not asynchronously calibrated, and only a kVp-specific, but scanner-independent, calibration was applied.
Groups I-V consisted of observations from asynchronously calibrated scanners only. Reconstruction kernels, slice thicknesses, and reconstruction planes were not controlled for and varied randomly as per acquisition protocol.
Statistical analysis
All statistical analyses were performed using STATA software version 13.1 (StataCorp LLC, College Station, Texas, USA).
Absolute interscan differences in vBMDL1-4 were calculated by subtracting the vBMDL1-4 measurement 2 from the vBMDL1-4 measurement 1, in each observation, respectively. Relative interscan differences were calculated as percent gain or loss in vBMDL1-4 between measurement 1 and measurement 2, in all observations. Mean absolute vBMD differences and relative vBMD differences and respective standard deviations were calculated groupwise (I–IV).
Root mean square error (RMSE) and root mean square coefficients of variation (RMSCV) were calculated as measure of variance, for each group. To further investigate agreement of both measurements on single observation level, Bland Altman plots were created including 95% confidence intervals (95% CIs) of limits of agreement (mean difference × 1.96 standard deviation of the difference). The statistical significance of group differences in coefficients of variation was assessed using unifactorial ANOVA and Tukey post-hoc test.
Observations with poor agreement, defined as absolute difference exceeding the inner boundaries of the 95% CI limits of agreement, were retrospectively reviewed by an experienced neuroradiologist, specialized on spine imaging (J.S.K., 22 years of experience), and possible factors influencing vBMD measurements were recorded manually.
Results
Group statistics
Following the criteria for inclusion, 970 observations were derived from 146 patients (61 females, 41.8%), aged 63.4 ± 15.0 years (mean ± standard deviation), for a total of 292 patient scans total (146 × 2) (Fig. 2). After excluding observations without evaluable lumbar vertebrae, 679 observations were assigned to groups (Table 1). Across all observations, mean vBMDL1-4 was 173.4 mg calcium hydroxylapatite (CaHA) / cm3 (range 55.4–302.3) at baseline and 173.4 mg CaHA/cm3 (45.3–313.4) at follow-up. Slightly lower values were noted at measurement 2 compared to measurement 1: -0.0 ± 8.8 mg CaHA/cm3; -0.1%, p = 0.962. Observation numbers were greatest in the Δrecon group and decreased through groups II–VI. Notably, the 133 observations assigned to Δcontrast derived from 43 patients only, while 123 observations in the Δsession group derived from 55 patients.
Table 1.
Parameter | Unit | Δrecon | Δcontrast | Δsession | Δscanner | Δall | Scanner-agnostic | |
---|---|---|---|---|---|---|---|---|
Patients | Number | 72 | 43 | 55 | 31 | 10 | 23 | |
Observations | Number | 169 | 133 | 123 | 63 | 20 | 171 | |
Sexa | Male | Number (%) | 102 (60%) | 90 (68%) | 78 (63%) | 37 (59%) | 13 (65%) | 110 (64%) |
Female | Number (%) | 67 (40%) | 43 (32%) | 45 (37%) | 26 (41%) | 7 (35%) | 61 (36%) | |
Agea (years) | Mean ± standard deviation | 60.0 ± 14.4 | 64.0 ± 13.0 | 59.2 ± 14.8 | 57.7 ± 15.3 | 65.7 ± 8.4 | 61.8 ± 13.5 | |
Location | in-house / in-house | Number (%) | 153 (91%) | 116 (87%) | 123 (100%) | 63 (100%) | 20 (100%) | 22 (13%) |
out-of-house / in-house | Number (%) | 0 | 0 | 0 | 0 | 0 | 149 (87%) | |
out-of-house / out-of-house | Number (%) | 16 (9%) | 17 (13%) | 0 | 0 | 0 | 0 |
a Values based on a number of observations
Reproducibility measurements
As expected, RMSCV values increased along the groups from a minimum of 1.3% in measurements obtained from identical scans (Δrecon) to a maximum of 5.9% in the Δall cohort (Table 2). The RMSCV increased significantly between Δrecon and Δcontrast (p < 0.001) and Δcontrast and Δsession (p < 0.001), while the increases between Δsession and Δscanner as well as Δscanner and Δall were not statistically significant (p ≥ 0.770, respectively). The RMSCV difference between Δall and scanner-agnostic did barely not reach statistical significance (p = 0.054). As reflected in the RMSCV, vBMDL1-4 values for both measurements in the Δrecon group (n = 169) were extremely similar, with an average absolute difference of 0.1 ± 3.1 mg CaHA/cm3 and a relative difference of 0.1 ± 2.3%. Consequently, the group showed an overall low absolute error with an RMSE of 3.1 mg CaHA/cm3.
Table 2.
Group name | Group size | Measurement 1 vBMDL1-4 [mg CaHA/cm3] | Measurement 2 vBMDL1-4 [mg CaHA/cm3] | Absolute difference [mg CaHA/cm3] | Relative difference [%] | RMSE [mg CaHA/cm3] | RMSCV [%] |
---|---|---|---|---|---|---|---|
Number | Mean ± standard deviation | Mean ± standard deviation | Mean ± standard deviation | Mean ± standard deviation | |||
Δrecona | 169 | 175.3 ± 54.8 | 175.4 ± 54.7 | 0.1 ± 3.0 | 0.2 ± 2.1 | 3.0 | 1.3 |
Δcontrasta | 133 | 171.2 ± 53.7 | 170.1 ± 55.3 | -1.0 ± 6.4 | -0.9 ± 4.5 | 6.1 | 2.9 |
Δsessiona | 123 | 160.1 ± 54.0 | 162.8 ± 58.9 | 2.7 ± 12.5 | 1.3 ± 7.4 | 11.0 | 4.7 |
Δsession_no-outliersa | 111 | 156.4 ± 51.4 | 157.0 ± 53.6 | 0.6 ± 9.5 | 0.3 ± 5.9 | 9.0 | 4.1 |
Δscannera | 63 | 164.1 ± 56.6 | 162.0 ± 58.3 | -2.1 ± 8.9 | -2.0 ± 7.3 | 8.7 | 4.9 |
Δalla | 20 | 139.2 ± 43.7 | 131.5 ± 45.9 | -7.7 ± 8.0 | -6.3 ± 6.7 | 7.6 | 5.9 |
Scanner-agnostic | 171 | 190.8 ± 51.7 | 191.1 ± 49.3 | 0.4 ± 10.1 | 0.9 ± 5.9 | 10.1 | 3.7 |
a Group contains asynchronously calibrated measurements only. vBMD Volumetric bone mineral density, CaHA Calcium hydroxylapatite, RMSE Root mean square error, RMSCV Root mean square coefficient of variation. Δsession_no-outliers comprises the Δsession group following exclusion of outliers, which exceeded the inner boundaries of the 95% confidence interval limit of agreement of the Bland Altman plot in Fig. 4
Observations containing measurements from different contrast media phases showed slightly greater dispersion, but overall, the absolute and relative errors remained low (Δcontrast RMSE = 6.1 mg CaHA/cm3; RMSCV = 3.1%) (Fig. 3a). While measurements obtained from different scanning sessions (but the same scanner) had a greater absolute error of RMSE = 11.0 mg CaHA/cm3, the RMSCV remained below 5%. Measurements from different scanners showed greater differences (Δscanner absolute difference = -3.8 ± 10.0 mg CaHA/cm3; relative difference = -3.1 ± 7.8%), accompanied by increased errors (RMSE = 9.4 mg CaHA/cm3; RMSCV = 5.7%) (Fig. 3b). The relative error increased further in Δall-observations, to 5.9%. Notably, measurements in scanner-agnostic scans showed acceptable absolute (2.0 ± 10.0 mg CaHA/cm3) and relative (2.0 ± 5.9%) differences, reflected in the mean absolute (RMSE = 9.6 mg CaHA/cm3) and relative error (RMSCV = 4.2%).
Bland Altman plot analysis showed good agreement for all groups, except in Δsession. In this group, 12 observations exceeded the lower 95% CI of the upper limit of agreement (25.5 mg CaHA/cm3) or the upper 95% CI of the lower limit of agreement (-20.15 mg CaHA/cm3) (Fig. 4). Manual review showed that all those patients were scanned twice within a short period due to severe illness with signs of hydropic decompensation, resulting in new or increasing pleural effusion (n = 11), anasarca (n = 10), mesenterial fluid injection or ascites (n = 5) and pulmonary septal thickening (n = 3). Two of the patients were intubated between baseline and follow-up and had received new abdominal drainages (Fig. 5). Exclusion of the identified outliers resulted in lower value dispersion and in improvement of absolute and relative precision errors (Δsession_no-outliers n = 111; absolute difference = 0.6 ± 9.5; relative difference 0.3 ± 5.9; RMSE = 9.0; RMSCV = 4.1).
Discussion
This study investigated the reproducibility of vBMD assessments from routine clinical CT scans using a fully automated convolutional neural network framework in various settings, including cross-scanner and cross-center settings. Reproducibility was excellent for all comparisons derived from a single scanning session. This demonstrates that neither the input convolution kernel nor slice orientation or thickness diminish reproducibility of this approach, and that the contrast media phase can be effectively corrected for. Precision errors for measurements derived from two different scanning sessions or different scanners were higher, but acceptable. While measurement errors may be due in part to changes in patient positioning, we also found evidence that measurement errors may also be driven up by short-term changes in tissue water content of severely ill patients and recommend introducing hydropic decompensation as a general exclusion criterion for quantitative CT measurements.
The Δrecon group showed excellent reproducibility of vBMDL1-4 measurements with precision errors of approximately 1.5% or 3 mg CaHA/cm3. Similar precision has been shown for dual X-ray absorptiometry in repeated measurements without and with repositioning [20–22]. Since the two measurements in this group were derived from reformations created using different reformations of a single scan, the precision error is most likely attributable to slight differences in the segmentation process caused by the different reconstruction kernels, slice orientations and thicknesses, resulting in slightly different mean HU values or volume-of-interest placements by the convolutional Bonescreen neural network framework [10, 23]. Overall, the findings underline the previously reported robustness of the automated segmentation algorithm and HU-to-vBMD conversion following asynchronous calibration [10, 21, 24].
The influence of the contrast media phase on vBMD measurements is well known, especially in setups with external calibration, as the contrast medium augments the attenuation of vascularized body parts but does not influence the reference values obtained in the external phantom [8, 25, 26]. Therefore, a set of studies investigated correction methods for the contrast phase, and Rühling et al recently developed an automated model for contrast media phase detection and correction in a single-scanner setting [12, 25, 27, 28]. The study reported precision errors of 9.5 CaHA/cm3 in arterial and 4.0 mg CaHA/cm3 in portal-venous phase [12]. Contrast phase correction using phase-dependent correction factors performed similarly well in the current dataset, derived from various scanners, with absolute and relative precision errors of 6.1 mg CaHA/cm3 and 2.9% in the Δcontrast group. Moreover, precision errors increased only slightly in the Δall group, compared to the Δsession and Δscanner groups. This indicates that the implemented contrast media phase correction may work similarly well in cross-scanner comparisons. Modern advancements in CT, like spectral imaging, can further minimize the impact of intravenous contrast or other foreign materials on vBMD measurements and has been shown to yield promising results for vBMD measurements and in fracture prediction with good reproducibility [29–31]. While there is barely any use case for spectral CT in mass opportunistic osteoporosis screening due to the limited scanner availability, it may prove to be a pivotal advancement in osteoporosis diagnostics in the future.
Patient positioning and table height are known to severely impact attenuation values in multidetector CT due to x-ray field inhomogeneities and beam hardening effects being highly dependent on the scanner’s isocenter, and thereby also affect vBMD measurements [16–19]. This may in part explain the higher RMSCV and RMSE values in the Δsession, Δscanner, and Δall groups, as all observation pairs in these groups were obtained in different scanning sessions and to a certain extent, differences in patient positioning and table height can be expected between sessions. To encounter this problem, internal calibration has been proposed as an alternative calibration method. Internal calibration uses tissue-specific HU values, e.g., fat and muscle, to calculate a scan-specific conversion factor from HU to BMD [32]. While the method showed promising results in the past, studies have found that it is not superior to asynchronous calibration [27].
Across groups with patient repositioning, retrospective examination of cases with the greatest vBMD-variability revealed that the measurements were partially derived from severely ill patients from the intensive care units of our hospital, who had fluctuating levels of intra-abdominal as well as interstitial fluid and pleural effusion between scans. This bias is particularly difficult to correct for since hydration status in intensive care unit patients may substantially fluctuate daily. We regard this finding particularly important, as hydration status is not a reported confounder for quantitative CT measurements, which thus ought to be critically revised in severely ill patients. However, since our study was not specifically designed to investigate this topic, it warrants further investigation.
Data on cross-session reproducibility for asynchronous vBMD measurements is scarce, even more so for cross-scanner settings. Previous reports of reproducibility measures for asynchronous quantitative CT in single-scanner settings showed precision errors of 3−4 mg CaHA/cm3 or 2.2−3.7% [33, 34]. Sollmann et al compared opportunistically assessed vBMD from a set of six different scanners with QCT and documented different degrees of variation per scanner; an approach that seems somewhat comparable to cross-scanner results [11]. However, the authors did not measure cross-scanner reproducibility directly. With respect to the possible bias of the hydration status, precision errors across Δsession, Δscanner and Δall may be regarded as acceptable, with a maximum relative error of 5.9% in the Δall group and a maximum absolute error of 11 mg CaHA/cm3 in Δsession. Scanner-agnostic observation pairs showed similar reproducibility to asynchronously calibrated measurements from different sessions, and scanner-agnostic yielded better results than Δall, and did only barely not reach statistical significance. In fact, both the RMSE of 10.1 mg CaHA/cm3 and the RMSCV of 3.7% were slightly lower in scanner-agnostic compared to the Δsession group and the RMSCV was markedly lower in scanner-agnostic compared to Δall. Both results suggest that scanner-specific phantom measurements may not necessarily be needed for asynchronous calibration if kVp-specific calibration factors are available for the scanner type.
We acknowledge some limitations of our study. First, we extended the generalizability of our results by including several scanners from outside of our institution. However, despite our efforts, we did not achieve an equal distribution across all main vendors. This may have an adverse impact on the external validity of the determined reproducibility. However, from our point of view, this issue can only be solved in multicentric studies, preferably with scanner-specific asynchronous calibration. Nonetheless, we demonstrated, that even scanner-agnostic kVp-based calibration yields acceptable reproducibility, regardless of the scanner combination. Second, cross-session reproducibility was limited, as we included many severely ill patients. It remains unclear, whether the observed increases in precision errors were attributable to patient positioning or caused by pathophysiological changes to the body composition like changes in the hydration status. Since this issue has not been reported on in the literature and this study was not designed to further investigate this finding, it necessitates further investigation. However, this problem appears to be difficult, as it seems to be inherent in the typical design for this type of study, because healthy individuals would rarely receive two thoracoabdominal CT scans within a single month.
To summarize, the automatic vBMD measurements by a convolutional neural network-based tool with asynchronous calibration and automated correction for the contrast media phase investigated in this study showed good reproducibility. The slightly lower precision in cross-session and cross-scanner settings may be related to patient positioning, but also short-term changes to the patients’ body compositions, necessitating further investigations. Notably, precision was similar in cross-session settings and the group without scanner-dedicated, asynchronous phantom-based calibration. Patient positioning and body composition may thus be of interest as major determinants of reproducibility for further studies.
Acknowledgements
No large language models were used for this manuscript.
Abbreviations
- BMD
Bone mineral density
- CaHA
Calcium hydroxylapatite
- CI
Confidence interval
- RMSCV
Root mean square coefficient of variation
- RMSE
Root mean square error
- vBMD
volumetric BMD
Author contributions
JB, PP, and JSK contributed to study conception and design. JB and PP drafted the manuscript. JB, PP, and JSK acquired the dataset analyzed in this study. JB and JSK performed statistical analyses and created the figures. AS, MEH, and JSK created the software for vBMD calculation. All authors substantively revised the manuscript, approved the final version of the manuscript, and agreed to be personally accountable for the author’s own contributions and to ensure that questions related to the accuracy or integrity of any part of the work, even ones in which the author was not personally involved, were appropriately investigated, resolved, and the resolution documented in the literature.
Funding
The study was funded by the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG, Grant Number 432290010; awarded to JSK and TB). Open Access funding enabled and organized by Projekt DEAL.
Data availability
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Declarations
Ethics approval and consent to participate
This retrospective study was approved by the local institutional review board (Ethikkommission der TU München, 27/19 S-SR, 04/22/2024) and written informed consent was waived.
Competing interests
JSK, AS, MEH, and SR are co-founders of Bonescreen, Munich, Germany (www.bonescreen.de). The remaining authors declare no relationships with any companies, whose products or services may be related to the subject matter of the article.
Consent for publication
Not applicable.
Footnotes
Publisher’s Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Jannis Bodden and Philipp Prucker contributed equally to this work.
References
- 1.NIH Consensus Development Panel on Osteoporosis Prevention, Diagnosis, and Therapy (2001) Osteoporosis prevention, diagnosis, and therapy. JAMA 285:785–795. 10.1001/jama.285.6.785 10.1001/jama.285.6.785 [DOI] [PubMed] [Google Scholar]
- 2.Office of the Surgeon General (US) (2004) Bone health and osteoporosis: A report of the surgeon general. US Department of Health and Human Services, Office of the Surgeon General, Rockville, MD [PubMed]
- 3.Hernlund E, Svedbom A, Ivergård M et al (2013) Osteoporosis in the european union: Medical management, epidemiology and economic burden. A report prepared in collaboration with the international osteoporosis foundation (iof) and the european federation of pharmaceutical industry associations (efpia). Arch Osteoporos 8:136. 10.1007/s11657-013-0136-1 10.1007/s11657-013-0136-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.National Osteoporosis Foundation (2002) America’s bone health: The state of osteoporosis and low bone mass in our nation. National Osteoporosis Foundation, Washington (DC)
- 5.Link TM (2012) Osteoporosis imaging: state of the art and advanced imaging. Radiology 263:3–17. 10.1148/radiol.2631201201 10.1148/radiol.2631201201 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Ensrud KE, Crandall CJ (2018) Osteoporosis. Ann Intern Med 168:306–307. 10.7326/L17-0587 10.7326/L17-0587 [DOI] [PubMed] [Google Scholar]
- 7.Chesnut III CH (2001) Osteoporosis, an underdiagnosed disease. JAMA 286:2865–2866. 10.1001/jama.286.22.2865 10.1001/jama.286.22.2865 [DOI] [PubMed] [Google Scholar]
- 8.Engelke K, Chaudry O, Bartenschlager S (2023) Opportunistic screening techniques for analysis of ct scans. Curr Osteoporos Rep 21:65–76. 10.1007/s11914-022-00764-5 10.1007/s11914-022-00764-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.OECD (2023) Computed tomography (ct) exams (indicator). Available via https://www.oecd-ilibrary.org/social-issues-migration-health/computed-tomography-ct-exams/indicator/english_3c994537-en. Accessed April 19 2023. 10.1787/3c994537-en
- 10.Sekuboyina A, Husseini ME, Bayat A et al (2021) Verse: A vertebrae labelling and segmentation benchmark for multi-detector ct images. Med Image Anal 73:102166. 10.1016/j.media.2021.102166 10.1016/j.media.2021.102166 [DOI] [PubMed] [Google Scholar]
- 11.Sollmann N, Löffler MT, El Husseini M et al (2022) Automated opportunistic osteoporosis screening in routine computed tomography of the spine: Comparison with dedicated quantitative ct. J Bone Miner Res 37:1287–1296. 10.1002/jbmr.4575 10.1002/jbmr.4575 [DOI] [PubMed] [Google Scholar]
- 12.Rühling S, Navarro F, Sekuboyina A et al (2022) Automated detection of the contrast phase in mdct by an artificial neural network improves the accuracy of opportunistic bone mineral density measurements. Eur Radiol 32:1465–1474. 10.1007/s00330-021-08284-z 10.1007/s00330-021-08284-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Löffler MT, Jacob A, Scharr A et al (2021) Automatic opportunistic osteoporosis screening in routine ct: Improved prediction of patients with prevalent vertebral fractures compared to dxa. Eur Radiol 31:6069–6077. 10.1007/s00330-020-07655-2 10.1007/s00330-020-07655-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Block JE, Smith R, Glueer CC, Steiger P, Ettinger B, Genant HK (1989) Models of spinal trabecular bone loss as determined by quantitative computed tomography. J Bone Miner Res 4:249–257. 10.1002/jbmr.5650040218 10.1002/jbmr.5650040218 [DOI] [PubMed] [Google Scholar]
- 15.Expert Panel on Musculoskeletal I, Ward RJ, Roberts CC et al (2017) Acr appropriateness criteria((r)) osteoporosis and bone mineral density. J Am Coll Radiol 14:S189–S202. 10.1016/j.jacr.2017.02.018 10.1016/j.jacr.2017.02.018 [DOI] [PubMed] [Google Scholar]
- 16.Bligh M, Bidaut L, White RA, Murphy WA, Stevens DM, Cody DD (2009) Helical multidetector row quantitative computed tomography (qct) precision. Acad Radiol 16:150–159. 10.1016/j.acra.2008.08.007 10.1016/j.acra.2008.08.007 [DOI] [PubMed] [Google Scholar]
- 17.Brett AD, Brown JK (2015) Quantitative computed tomography and opportunistic bone density screening by dual use of computed tomography scans. J Orthop Translat 3:178–184. 10.1016/j.jot.2015.08.006 10.1016/j.jot.2015.08.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Hofmann P, Lenhart B, Sedlmair M, Schmidt B, Flort T, Mahnken A (2016) Assessment of the effects of scanning and reconstruction parameters on bone densitometry results for single energy quantitative ct (qct) scans in a phantom study. Paper presented at the European Congress of Radiology 2016, Vienna, Austria. 10.1594/ecr2016/C-2260
- 19.Suetens P (2017) Fundamentals of medical imaging. Cambridge university press, Cambridge Medicine. 10.1017/9781316671849
- 20.Tothill P, Hannan WJ (2007) Precision and accuracy of measuring changes in bone mineral density by dual-energy x-ray absorptiometry. Osteoporos Int 18:1515–1523. 10.1007/s00198-007-0382-4 10.1007/s00198-007-0382-4 [DOI] [PubMed] [Google Scholar]
- 21.Link TM, Kazakia G (2020) Update on imaging-based measurement of bone mineral density and quality. Curr Rheumatol Rep 22:13. 10.1007/s11926-020-00892-w 10.1007/s11926-020-00892-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Engelke K, Gluer CC, Genant HK (1995) Factors influencing short-term precision of dual x-ray bone absorptiometry (dxa) of spine and femur. Calcif Tissue Int 56:19–25. 10.1007/BF00298739 10.1007/BF00298739 [DOI] [PubMed] [Google Scholar]
- 23.Bonescreen. Available via www.bonescreen.de. Accessed 05 2024.
- 24.Löffler MT, Sollmann N, Mei K et al (2020) X-ray-based quantitative osteoporosis imaging at the spine. Osteoporos Int 31:233–250. 10.1007/s00198-019-05212-2 10.1007/s00198-019-05212-2 [DOI] [PubMed] [Google Scholar]
- 25.Bauer JS, Henning TD, Müeller D, Lu Y, Majumdar S, Link TM (2007) Volumetric quantitative ct of the spine and hip derived from contrast-enhanced mdct: conversion factors. AJR Am J Roentgenol 188:1294–1301. 10.2214/ajr.06.1006 10.2214/ajr.06.1006 [DOI] [PubMed] [Google Scholar]
- 26.Pickhardt PJ, Lauder T, Pooler BD et al (2016) Effect of iv contrast on lumbar trabecular attenuation at routine abdominal ct: correlation with dxa and implications for opportunistic osteoporosis screening. Osteoporos Int 27:147–152. 10.1007/s00198-015-3224-9 10.1007/s00198-015-3224-9 [DOI] [PubMed] [Google Scholar]
- 27.Kaesmacher J, Liebl H, Baum T, Kirschke JS (2017) Bone mineral density estimations from routine multidetector computed tomography: A comparative study of contrast and calibration effects. J Comput Assist Tomogr 41:217–223. 10.1097/rct.0000000000000518 10.1097/rct.0000000000000518 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Baum T, Muller D, Dobritz M et al (2012) Converted lumbar bmd values derived from sagittal reformations of contrast-enhanced mdct predict incidental osteoporotic vertebral fractures. Calcif Tissue Int 90:481–487. 10.1007/s00223-012-9596-3 10.1007/s00223-012-9596-3 [DOI] [PubMed] [Google Scholar]
- 29.Nickoloff EL, Feldman F, Atherton JV (1988) Bone mineral assessment: New dual-energy ct approach. Radiology 168:223–228. 10.1148/radiology.168.1.3380964 10.1148/radiology.168.1.3380964 [DOI] [PubMed] [Google Scholar]
- 30.Wesarg S, Kirschner M, Becker M, Erdt M, Kafchitsas K, Khan MF (2012) Dual-energy ct-based assessment of the trabecular bone in vertebrae. Methods Inf Med 51:398–405 10.3414/ME11-02-0034 [DOI] [PubMed] [Google Scholar]
- 31.Gruenewald LD, Koch V, Martin SS et al (2022) Diagnostic accuracy of quantitative dual-energy ct-based volumetric bone mineral density assessment for the prediction of osteoporosis-associated fractures. Eur Radiol 32:3076–3084. 10.1007/s00330-021-08323-9 [DOI] [PMC free article] [PubMed]
- 32.Boden SD, Goodenough DJ, Stockham CD, Jacobs E, Dina T, Allman RM (1989) Precise measurement of vertebral bone density using computed tomography without the use of an external reference phantom. J Digit Imaging 2:31–38. 10.1007/bf03168013 10.1007/bf03168013 [DOI] [PubMed] [Google Scholar]
- 33.Wang L, Su Y, Wang Q et al (2017) Validation of asynchronous quantitative bone densitometry of the spine: Accuracy, short-term reproducibility, and a comparison with conventional quantitative computed tomography. Sci Rep 7:6284. 10.1038/s41598-017-06608-y 10.1038/s41598-017-06608-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Brown JK, Timm W, Bodeen G et al (2017) Asynchronously calibrated quantitative bone densitometry. J Clin Densitom 20:216–225. 10.1016/j.jocd.2015.11.001 10.1016/j.jocd.2015.11.001 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.