Skip to main content
Springer logoLink to Springer
. 2025 Nov 24;36(5):3454–3464. doi: 10.1007/s00330-025-12131-w

Crowd-sourcing optimized abdomen CT protocols from 908,000 examinations in a large radiation dose registry

Rebecca Smith-Bindman 1,2,3,, Taewoon Kang 1, Carly Stewart 1, Philip W Chu 1, Yifei Wang 1, Timothy P Szczykutowicz 4
PMCID: PMC13086641  PMID: 41284032

Abstract

Objectives

Identify routine diagnostic abdomen CT protocols in adults that achieve lower radiation doses, crowd-sourced from a large CT dose registry.

Materials and methods

We retrospectively captured acquisition parameters, patient diameter, and patient size-adjusted and unadjusted radiation dose (CTDIvol and DLP). A protocol was defined as a unique combination of facility, scanner make/model, CT indication, and protocol name. We used k-means clustering to classify protocols into clusters based on similarity of average acquisition parameters (mAs, pitch, kV, collimation, scan length, phase count), each representing a distinct pattern of protocol design choices. For each cluster, we summarized the mean technical parameters and dose metrics of its constituent protocols.

Results

Analyses included 907,992 exams from 1767 protocols at 132 facilities, grouped in 9 clusters. The number of protocols and exams within each cluster ranged from 62 to 508 and 15,317 to 381,457, respectively. Mean size-adjusted DLP varied threefold across clusters, ranging from 486 to 1382 mGy-cm, with no difference in patient diameter. Lowest-dosed clusters 1 and 2 minimized radiation primarily through low kV (around 100). Cluster 8 had the highest acquisition techniques (mean mAs = 338 and kV = 125), and cluster 9 had the highest dose due to phase (mean = 3.3, versus 1.1–1.6 for the remaining clusters).

Conclusion

The clustering approach offers a new framework for identifying optimized protocols, while highlighting variation in practice and worst-in-class technique, e.g., using three phases for routine abdomen. These findings may guide clinicians in protocol design, by providing best-practice protocols from a large registry, as well as a method for analyzing data within their own health system.

Key Points

Question Can we identify best-practice imaging protocols from a large CT registry, to offer clinicians a blueprint for optimizing radiation dose in routine abdomen CT?

Findings Protocols were grouped into clusters using average acquisition parameters. Across clusters, patient size-adjusted radiation dose varied 486–1382 mGy-cm. Lowest-dosed protocols had one phase and low kVp.

Clinical relevance We used crowd-sourcing and k-means clustering to identify routine abdomen CT protocols that achieve lowest radiation dose, offering a playbook for protocol design from real clinical practice. Non-indicated phases were the greatest driver of excessive radiation dose.

Graphical Abstract

graphic file with name 330_2025_12131_Figa_HTML.jpg

Keywords: Computed tomography, Abdomen, Protocol design, Protocol optimization, Radiation dose

Introduction

There is large variation in the radiation doses used for abdomen computed tomography (CT), which, as the second most common type of CT examination in Europe, contributes a higher effective dose than most other common diagnostic imaging procedures [1, 2]. A multi-national analysis of 748,846 abdomen CT examinations in adults performed for routine indications between 2015 and 2020 describes a six-fold range in the average radiation dose across 1033 unique protocols, after accounting for patient size [3]. This variation exposes patients to excess cancer risk [4, 5] and is inconsistent with a guiding principle in radiology to minimize radiation exposure while achieving appropriate image quality [6, 7].

Previous work has shown that while radiation dose varies by patient size [8], patient factors or machine characteristics only modestly drive dose variation across imaging facilities [1]. Instead, variation comes from local decisions made by clinicians as reflected in the protocols, including the selection of phases and other acquisition parameters. Well-known dose reduction approaches and technologies, such as automated kV modulation and limiting unindicated phases, are not fully utilized [1, 3, 9, 10].

While there is agreement that greater standardization of radiation doses is a clinical imperative [6, 7], there are myriad ways of selecting technical parameters to achieve sufficient image quality while minimizing radiation exposure. These choices represent trade-offs between radiation dose, image quality, scan time, and anatomical coverage, and it is not straightforward to identify the “best” protocol. Yet without a consensus-based playbook, the task of developing protocols not only consumes a large amount of time and resources for radiologists, medical physicists, and technologists, but also drives dose variation. When surveyed, individuals who oversee radiology protocols have reported feeling unable to keep up with revising protocols, and it has been estimated that maintaining updated protocols for a single indication consumes the effort of a full-time employee [1115].

In this work, we used a crowd-sourcing approach to identify optimized protocols within a large, international radiation dose registry, and we performed cluster analyses to identify protocols that result in low radiation doses for routine abdomen CT.

Materials and methods

Consecutive diagnostic routine abdomen CT scans performed between January 1, 2015, to March 11, 2021, in patients aged 18–99 years of age and included in the University of California San Francisco (UCSF) International CT Dose Registry (the “registry”) were included. The sample includes examinations from 132 hospitals or outpatient imaging facilities associated with 26 healthcare organizations in 7 countries. Data acquisition details have been described previously [1]. Participating sites submitted DICOM data to the registry via Radimetrics© dose management software (Bayer AG). The UCSF Institutional Review Board approved this research with a waiver of informed consent. Collaborating institutions obtained local approval or relied on UCSF approval to contribute.

These analyses focus on routine abdomen CT, including all scans of the abdomen and/or pelvis, as determined using a validated algorithm based on information in the DICOM headers, including the protocol name and study description [16]. Abdomen exams with angiography or for trauma indications were excluded, as were exams obtained for low-dose indications (e.g., suspected renal stones) or high-dose indications (e.g., liver cancer detection) [16]. A full list of indications for abdomen imaging, and the validation of the framework for assigning CT scans to categories, including routine abdomen, has been published [16].

Data elements

Date of exam, technical parameters, and radiation dose metrics included those directly reported by the scanner (volume computed tomography dose index (CTDIvol), dose length product (DLP), kilovoltage (kV), tube current (milliamperage per second (mAs)), collimation, pitch, phase, and scan length), and those calculated in the registry (size-adjusted CTDIvol, size-adjusted DLP). Patient effective diameter is calculated and reported by Radimetrics. Scans with missing values for any of the parameters or dose metrics were eliminated (9.9% of exams). Additionally, to exclude likely erroneous outliers, we dropped all scans with values less than the 0.1th percentile or greater than the 99.9th percentile values of the technical parameters or radiation dose, and we excluded impossibly low mAs (< 5.0), impossibly low or high pitch (< 0.5 or > 3.3), and phase count > 4 (6.2% of exams). To remove scans obtained from rarely used machine models where we would have insufficient data, this study was limited to machine models with at least 5 individual scanners in the registry (10.1% of exams were excluded).

We define a protocol as a unique combination of the facility, scanner make/model, individual scanner, CT indication, and protocol name. To ensure protocols were in routine use (which we took to mean that the radiologist found image quality to be sufficient), we required a sample of at least 20 examinations per protocol for inclusion, thereby eliminating protocols that were used infrequently (potentially due to insufficient image quality for diagnosis). For each protocol, we averaged the year of study, radiation dose, and technical parameters used across the individual exams performed with that protocol because, while the protocol name might be the same, the technical parameters can vary patient to patient [17].

Size-adjusted radiation dose

To minimize the impact of patient size on dose variation, we calculated size-adjusted CTDIvol and DLP by normalizing these metrics using the log-linear mixed regression between them and Radimetrics-reported effective diameter, with facility included as a random effect, using published methods [3]. This differs from the size-specific dose estimate (SSDE), which normalizes to obtain a consistent dose per unit of tissue as reflected in a phantom, but does not adjust dose for patient case mix across hospitals because patients with higher BMI have significantly higher SSDE than smaller patients [18, 19]. Our size normalization allows comparison of facilities by eliminating the effect of patient size on radiation dose, using the relationship between size and dose observed in our registry [3].

Analysis

To study how variation in technical parameters drives differences in radiation dose, protocols were grouped using k-means clustering, a form of latent cluster analysis that categorizes observations into the cluster (i.e., group) with the nearest mean technical parameter value, reflected by its center point. In layman’s terms, k-means clustering groups data points that share similar features with a goal of minimizing variation within each cluster. Thus, observations within a cluster are more similar to each other than to those in other clusters. Having first summarized the protocols by average technical parameters and radiation dose metrics, we identified clusters using observed technical parameter values and sorted each protocol into the cluster closest to its own mean technical parameters [20]. Each cluster represents one pattern of behavior broadly employed in protocol design.

We performed k-means clustering analysis using six technical parameters as inputs: mAs, pitch, kV, collimation, scan length, and number of phases. These parameters were selected because they are major determinants of radiation dose and can be directly modified in protocol design. We expected these six parameters to predict the DLP and size-adjusted DLP of each cluster, reflecting the total imparted dose per examination. Since the ranges and variations of these technical parameters are quite different, their impacts on the distance to each cluster’s center point are not equally weighted. We standardized them to have a mean of 0 and a standard deviation of 1 when classifying protocols into clusters, though we show the results using their original scales.

We next determined the optimal number of clusters, ensuring it was neither too few as to obscure meaningful practice variation, nor too numerous as to introduce undue noise. We repeatedly performed k-means clustering for different numbers of clusters from 2 to 30 and calculated their total within-cluster sum of squares and cubic clustering criterion. We then plotted the sum of squares between each cluster as a function of cluster number, and identified the cluster number after which the sum of squares metric stops decreasing at an appreciable rate, identifying the elbow so that adding another cluster did not produce better modeling [2022]. From this elbow method, we selected the optimal number, which was also a local peak of the cubic clustering criterion [23, 24].

The clusters are then numbered and sorted in tabular form by size-adjusted DLP, where cluster 1 uses the lowest radiation dose and cluster 9 the highest. To summarize the cluster, we report the mean of the technical parameters and dose metrics for its constituent protocols. While neither DLP nor CTDIvol is used in the analyses to create the clusters, they are included in the summaries to demonstrate the result of parameter choices on dose.

To graphically visualize the size, shape, and overlap of clusters as well as how combinations of parameters drive radiation dose, we defined two domains. On the X-axis, we have the acquisition technique domain, which encompasses mAs, pitch, and kV and is calculated as [mAs / pitch × (kV/120)2.5]. By this formula, a pitch > 1 or a kV < 120 will markedly reduce the X value, pushing it further to the left in the figure. On the Y-axis, we have the practice culture domain, which reflects phase and scan length and is calculated as [number of phases × scan length/450 mm], with 450 mm reflecting the average scan length for a single-phase routine abdomen CT. A lower number of phases or a scan length < 450 will reduce the Y value, pushing the point lower on the vertical axis. We plot each cluster along these axes as an ellipse, with a circle at the center point that is scaled in size by that cluster’s average size-adjusted DLP. The colors of the ellipses reflect their ranking in size-adjusted DLP from blue to red, reflecting the lowest to highest dose cluster, respectively. The clusters reflect the 90% prediction ellipse, illustrating a region that would contain 90% of protocols belonging to the cluster [25, 26].

We accounted for the potential for patient size to influence (confound) the variation of dose between clusters by using size-adjusted dose metrics. To assess whether the size adjustment successfully removes the influence of size, we grouped CT scans by decile based on effective diameter and then assessed the size-adjusted DLP by decile. Similarly, to explore whether the year of exam could influence (confound) the variation of dose between clusters, we assessed the proportion of exams belonging to each year, within each cluster. We additionally computed the mean size-adjusted DLP within each year, for each cluster, and assessed whether the ranking of size-adjusted DLP among clusters differed from year to year. All analyses were conducted using SAS version 9.4 and R.

Results

A total of 907,992 routine abdomen CT examinations generated from 1767 protocols are included, with the majority (92.4%) from the United States. The average number of examinations per protocol was 514 and ranged from 20 to 21,487. Most examinations (76.9%) were obtained on a Siemens Healthineers or GE HealthCare machine. Demographic variables and descriptive statistics for the technical parameters and radiation dose metrics overall are in Tables 1 and 2.

Table 1.

Descriptive characteristics of the routine abdomen CT examinations included in this report

Number of CT examinations Number of protocols
N % N %
Total 907,992 100.0 1767 100.0
Sex
Men 386,963 42.6
Women 520,302 57.3
Other or unknown 727 0.1
Age
18–19 11,104 1.2
20–29 82,975 9.1
30–39 109,554 12.1
40–49 134,006 14.8
50–59 169,726 18.7
60–69 173,364 19.1
70–79 134,905 14.9
80–89 74,489 8.2
90–99 17,869 2.0
Manufacturer
Canon 67,135 7.4 76 4.3
GE 405,388 44.6 890 50.4
Philips 142,433 15.7 304 17.2
Siemens 293,036 32.3 497 28.1
Country
Germany 6402 0.7 35 2.0
Israel 17,276 1.9 15 0.8
Japan 9469 1.0 25 1.4
Netherlands 18,178 2.0 48 2.7
Switzerland 1554 0.2 2 0.1
United Kingdom 16,273 1.8 47 2.7
United States 838,840 92.4 1595 90.3
Number of organizations 26
Number of unique facilities 132

Table 2.

Average technical parameters and radiation dose metrics across the included CT examinations

Examinations Protocols
Included units 907,992 1767
Mean Standard deviation Mean Standard deviation
Patient diameter (mm) 308 45 309 23
Technical parameters
 mAs 178 87 178 63
 Pitch 1.0 0.3 1.0 0.3
 kV 118 8 118 7
 Scan length (mm) 483 85 450 82
 Number of phases 1.3 0.6 1.4 0.6
 Collimation 37 17 35 17
Radiation dose metrics
 Volumetric CT dose index (CTDIvol), mGy 13 7 13 5
 Size-adjusted CTDIvol, mGy 11 5 12 5
 Dose length product (DLP), mGy-cm 756 496 775 418
 Size-adjusted DLP (mGy-cm) 657 352 669 361

The six technical parameters listed here were used as inputs in the k-means clustering analysis. DLP dose length product

The k-means clustering identified 9 clusters as the optimum number to characterize protocols into unique, representative groups. Summary statistics of the clusters are provided in Table 3. Each cluster encompasses between 62 to 508 unique protocols and between 15,317 to 381,457 examinations performed on those protocols. Size-adjusted DLP varies threefold across the clusters, from 486 mGy-cm (95% CI 461, 512) in cluster 1 to 1382 mGy-cm (95% CI 1260, 1504) in cluster 9.

Table 3.

Average technical parameters for each of the 9 clusters created through k-means clustering

Variables used as model inputs to characterize clusters Dose Metrics
Cluster no. No. of protocols Mean year of exams No. of exams Effective diameter mm (95% CI) mAs Pitch kV Scan length No. of phases Coll. (mm) CTDIvol (mGy) Size-Adj.
CTDIvol
(mGy)
DLP (mGy-cm) Size-adjusted
DLP
(mGy-cm)
N (%) Mean (95% CI) Mean Mean mean Mean Mean Mean Mean Mean Mean (95%CI) Mean (95%CI)
1 186 2018 82,751 (9.1) 305 (295, 317) 159 0.75 105 437 1.2 36 11 10 551 (522, 580) 486 (461, 512)
2 62 2018 36,197 (4.0) 282 (262, 309) 185 1.37 102 465 1.4 35 8 8 491 (422, 561) 507 (452, 562)
3 264 2017 169,794 (18.7) 314 (299, 327) 206 1.35 120 516 1.1 33 13 10 735 (700, 771) 598 (570, 626)
4 224 2018 56,164 (6.2) 304 (284, 325) 158 0.84 120 336 1.2 27 16 15 672 (622, 722) 615 (569, 662)
5 202 2017 3562 (7.0) 309 (280, 332) 194 1.37 120 384 1.6 27 12 11 722 (676, 768) 626 (591, 661)
6 508 2018 381,457 (42.0) 310 (291, 328) 147 0.89 120 490 1.2 31 12 11 741 (713, 769) 643 (616, 670)
7 142 2018 84,950 (9.4) 306 (282, 324) 168 0.9 120 499 1.3 77 13 11 774 (722, 885) 699 (645, 753)
8 91 2017 17,800 (2.0) 338 (294, 292) 338 1.03 125 495 1.2 43 24 18 1416 (1297, 1535) 989 (916, 1062)
9 88 2017 15,317 (1.7) 303 (268, 318) 166 0.96 117 361 3.3 32 13 12 1483 (1368, 1597) 1382 (1260, 1504)
Total 1767 907,992 (100)

The clusters are sorted by size-adjusted dose length product (DLP)

Coll collimation, CTDIvol Volumetric CT dose index

The relationship between the 9 identified clusters is shown in Fig. 1, and the interpretation of these configurations is aided by Table 3. Clusters 1 and 2 achieve the lowest mean size-adjusted DLP (486 and 507 mGy-cm, respectively), while clusters 8 and 9 have the highest (989 and 1382 mGy-cm, respectively) (Table 3). On the figure, these four protocols occupy different locations reflecting differences in technical parameters. Clusters 1 and 2 differ from the remaining clusters primarily by using a lower mean kV of 102–105, compared with a kV of around 120 or greater for the remaining clusters (Table 3). Cluster 1 has a slightly higher size-adjusted CTDIvol than cluster 2 (10 versus 8 mGy), but 6% lower scan length and 12% lower phase. Cluster 8 has a higher acquisition technique (resulting in a higher value on the X-axis), including the highest mean mAs (338) and kV (125) values amongst all clusters. Cluster 9 has about average technique in terms of mAs, kV, and pitch, but a mean phase count more than twice as high as any other cluster (3.3 phases), resulting in its position high on the Y-axis. Notably, the fewest number of patients are imaged with these two higher dose protocols: 2.0% of patients in cluster 8 and 1.7% in cluster 9. By comparison, 13% of examinations used protocols in clusters 1 and 2.

Fig. 1.

Fig. 1

Relationship between 9 protocol-groups (clusters) identified through k-means clustering analyses using 6 technical parameters as inputs. The 90% prediction ellipse for each cluster is shown. For each of the clusters, the center bubble is proportional in size to the average size-adjusted DLP, and the clusters are colored on a gradient from blue to red, reflecting the lowest to highest size-adjusted dose length product. DLP dose length product

Protocols that fall within clusters 3–7 have average size-adjusted DLP in the range of 598–699 mGy-cm with similar technical parameter values. Cluster 6 is the largest, encompassing 508 protocols and 381,457 examinations, or 42% of all examinations. This cluster, reflecting the most common practice, results in size-adjusted DLP of 643 mGy-cm and size-adjusted CTDIvol of 11 mGy. Cluster 7 has notably high collimation (mean = 77 mm) due to a concentration of GE Revolution CT and Philips ICT 256 machines in that cluster, which perform helical scanning using 80 mm beam collimation. The use of 80 mm helical scanning is ideal for “wide axial” CT scanners for torso imaging, where high scan speeds are preferred and/or it is advantageous to increase tube output via the use of a wider collimation to minimize tube heating issues.

The concepts of acquisition technique and practice culture were meant to distinguish different approaches for dose optimization. Acquisition technique refers to practices geared at improving image quality (by decreasing visual noise), while practice culture refers to the number and length of scans. The summarized protocols in Table 3 illustrate these choices and trade-offs. Observe, for example, the differences between clusters 1 and 2. Cluster 1 has considerably lower pitch (45% less than cluster 2) but balances it with low mAs and low kV. Aside from pitch, clusters 1 and 2 have very close mAs and kV, but cluster 2 has the higher size-adjusted DLP, driven by a slightly longer scan length and 14% higher phase. Cluster 3 has interestingly both the lowest mean phase count (1.1) and the highest scan length (516 mm) of all the clusters, yet its size-adjusted dose is 18% greater than cluster 2, driven largely by a kV of 120. Cluster 4 is lowest on the Y-axis in Fig. 1; in Table 3, we see that this is a result of it having the shortest scan length and a low (but not the lowest) number of phases.

Ellipse size is also important. Clusters 8 and 9 have comparatively larger ellipses, reflecting greater variation within their protocols.

When compared with the mean size-adjusted DLP of the most common cluster (cluster 6), examinations in cluster 1 have a 24% reduction in dose, whereas examinations in cluster 9 have approximately 214% of the dose.

We observed no relationship between patient size and size-adjusted dose: e.g., the effective diameter in cluster 1 was 305 mm versus 303 mm in cluster 9 (Table 3). When patients were grouped by decile in effective diameter, the unadjusted dose varied considerably, as expected, while size-adjusted dose did not change with effective diameter (Table 4). For example, the size-adjusted DLP was 651 mGy-cm in the smallest decile and 649 mGy-cm in the largest. However, when patients were grouped by effective diameter and cluster, size-adjusted DLP varied markedly within each decile across clusters (Table 5). For example, patients in the first decile of size had a mean size-adjusted DLP of 473 mGy-cm in cluster 1 versus 1297 mGy-cm in cluster 9.

Table 4.

Patient size-adjusted and unadjusted dose length product (DLP) and CT dose index volume (CTDIvol), by decile of patients’ effective diameter

Decile in effective diameter N Effective diameter
(mm)
DLP
(mGy-cm)
CTDIvol (mGy)
Unadjusted Adjusted Unadjusted Adjusted
Mean Median Mean Median Mean Median Mean Median Mean Median
1st 90,803 238 241 391 329 651 545 7 6 11 10
2nd 90,799 262 262 454 390 619 533 8 8 11 10
3rd 90,803 276 276 512 441 622 536 9 8 11 10
4th 90,792 288 288 577 497 634 547 10 9 11 10
5th 90,811 300 300 649 564 651 566 11 11 11 11
6th 90,791 311 311 728 640 667 586 12 12 11 11
7th 90,800 323 323 823 731 683 607 14 13 12 11
8th 90,798 337 337 940 855 696 633 16 15 12 11
9th 90,799 355 355 1097 1022 697 649 18 18 12 12
10th 90,796 393 388 1393 1282 649 601 23 22 11 11

Table 5.

Patient size-adjusted dose length product (DLP, in mGy-cm) by decile in patients’ effective diameter and by cluster

Decile in effective diameter Cluster number
1 2 3 4 5 6 7 8 9 Total
N Mean Adj. DLP N Mean Adj. DLP N Mean Adj. DLP N Mean Adj. DLP N Mean Adj. DLP N Mean Adj. DLP N Mean Adj. DLP N Mean Adj. DLP N Mean Adj. DLP N
1st 8002 473 7411 543 15,301 544 5616 671 3492 753 38,824 676 9560 774 1088 892 1509 1297 90,803
2nd 8637 466 6230 494 16,352 548 5654 634 3665 789 38,825 626 8570 738 1262 794 1604 1295 90,799
3rd 9070 471 5097 460 16,621 557 5998 664 4470 804 38,374 617 8259 747 1286 818 1628 1291 90,803
4th 8925 482 3970 453 16,924 577 6005 690 5569 819 38,451 619 8022 744 1352 841 1574 1343 90,792
5th 9064 491 3126 450 16,848 603 5814 717 6205 812 38,696 631 8001 752 1398 870 1659 1332 90,811
6th 9047 497 2587 469 17,098 633 5996 744 7056 813 37,973 642 7987 750 1419 896 1628 1337 90,791
7th 8534 506 2172 516 17,478 664 5842 765 7783 801 37,771 654 8133 738 1521 917 1566 1381 90,800
8th 8010 521 1905 598 17,192 702 5548 776 8251 777 37,901 663 8546 715 1871 935 1574 1355 90,798
9th 7440 542 1930 715 17,429 718 5282 768 8322 726 37,898 665 8720 694 2315 902 1463 1315 90,799
10th 6022 571 1769 749 18,551 648 4409 734 8749 592 36,744 621 9152 671 4288 808 1112 1151 90,796
Total 82,751 36,197 169,794 56,164 63,562 381,457 84,950 17,800 15,317 907,992

Unadjusted and size-adjusted radiation dose metrics decreased approximately 20% over the study years (Supplementary Table 1). Nevertheless, within all years, exams were assigned to all clusters, demonstrating a large dose gradient across clusters within each year (Fig. 2 and Supplementary Table 2). The exceptions were cluster 2, which had more recent exams (2018–2020), and cluster 8, which had older exams (2015–2017) (Supplementary Table 2).

Fig. 2.

Fig. 2

Relationship between year of exam and the 9 clusters, and mean size-adjusted dose length product (DLP, in mGy-cm) by cluster and year

Discussion

There exists a large array of approaches for performing routine abdomen CT, and these choices result in very different doses to patients. The clustering approach allowed us to identify optimally dosed protocols used across multiple sites and a large number of patients. It also demonstrated a large variation in practice, with some patients receiving excessive doses. In this sample of 1767 protocols used in nearly 1 million examinations, we observed a nearly 3-fold difference in average size-adjusted DLP between the lowest and highest protocol clusters, independent of patient size.

While patient size influences average parameter selections and was associated with a higher unadjusted dose, the lack of a relationship between patient size and size-adjusted dose indicates that patient size did not confound the dose-cluster relationship. In other words, the observed dose differences between clusters were not attributable to patient size. Furthermore, we documented that significantly lower radiation doses are feasible in larger patients, as evidenced by clusters 3 and 6. Regarding year, the distributions of exam year in clusters 2 and 8 deviated from the norm, suggesting that year may have been a confounding factor in the assignment to these clusters. However, the distribution of exam years for the other 7 clusters was relatively stable, allowing us to conclude that the primary and substantial differences in dose between clusters were not attributable to exam year. Figure 2 illustrates significant variation between clusters within each year. Even if clusters 2 and 8 were removed from the graph, the overall conclusion would remain unchanged. Cluster 2’s dose was similar to that of cluster 1, where the distribution of exam year was equal (meaning cluster 1 contained exams from all years more or less equally), while cluster 8’s dose fell midway between clusters 7 and 9, which also have equal exam year distribution.

While numerous published radiation dose benchmarks have offered median (“achievable”) and 75th percentile (“dose reference level”) doses for common indications, none of these prior works instruct how to attain the target doses [8, 27, 28]. Crowd-sourcing these clusters from a large radiation dose registry enables us to identify best practices in real, routine use and to share these with the radiology community as a playbook for radiation dose optimization. Clusters 1 through 6 in this analysis represent various viable and frequently used strategies to image routine abdomen indications in the range of published achievable doses (generally under 650 size-adjusted DLP) [8, 27]. The use of k-means clustering and graphing the resultant clusters highlights common features of the lowest dose protocols: low mAs, low kV, shorter scan length, and single phase. The features of high-dose protocols are also apparent: unnecessary phases, longer scan length, and higher than needed mAs. The takeaway message is not that every imaging provider should use the absolute lowest dose, as underlying indications and patient size may necessitate variation. Rather, providers should look to clusters 1–6 as offering a variety of avenues to image at target doses.

We did not evaluate image quality resulting from these protocols, which is a limitation of our study. However, all protocols were used in a minimum of 20 examinations (25th percentile = 38 examinations; median = 94; 75th = 350), which we presume to indicate they were acceptable to the radiologists reading the exams. Furthermore, the size-adjusted DLP for the lower-dosed clusters (particularly 3 through 6) are in the range of published achievable dose benchmarks for routine abdomen CT, suggesting these are not low-dose protocols. For example, Kanal et al give an achievable dose of 620 mGy-cm (the weighted average of abdomen and pelvis with contrast and without contrast) for patients with a diameter of 29–33 cm, approximately the median in the UCSF registry (30 cm), against which our data were size-adjusted [8]. The underlying assumption of this work is that protocols must achieve sufficient image quality for diagnosis, and that additional image quality (which comes at the cost of higher radiation dose) is generally unnecessary. Thus, assuming we have excluded truly inferior protocols, the goal is to identify those that have achieved diagnostic image quality using the lowest dose.

In addition to image quality, phase is a major driver of radiation dose. A protocol such as those in cluster 9 with relatively low acquisition technique and low CTDIvol can multiply the dose of a single-phase protocol through additional phases, with no difference in image quality. In this study, 710,803 (78.3%) of exams use 1 phase, which is appropriate for routine abdomen CT indications [10, 29].

Previous work has found kV to be the parameter most impactful on radiation dose, although it is infrequently modified from 120. Wang et al found that a 10% increase in kV led to a 29% increase in DLP, compared to an 8% or 9% DLP increase from a 10% increase in phase or mAs, respectively [9]. In clusters 1 and 2 in this analysis, a lower average kVp resulted in the lowest size-adjusted dose.

The strength of this study is its large sample of protocols and examinations from numerous, diverse institutions. There are several limitations. First, as noted, we presumed that all protocols generated sufficient image quality based on routine use of the protocols. Second, this analysis assumes all examinations were performed for routine abdomen indications, yet some may have been done for indications requiring higher or lower radiation dose, or different scan lengths. Indeed, scan length and phase count are strongly dependent on indication, and furthermore, the distribution of indications may vary by institution. However, the approach to categorizing CT examinations was previously validated and found to be 90% accurate compared to a detailed chart review, and protocols were further reviewed manually to eliminate non-routine dose indications such as those for urography, bladder cystography, any cancer assessment (including masses, lesions, nodules), acute bleeding, transplant, and others [16]. Small differences in dose and scan length between routine abdomen indications did not drive the 3-fold difference in average size-adjusted dose across the clusters. Third, as body weight is unavailable in the registry, we did not include it in our model, though body weight can affect radiation dose significantly in contrast-enhanced abdominopelvic CT. We also did not specifically address whether automatic tube current modulation or tube voltage selection techniques were used, though we presume the former was used frequently and the latter was infrequently used, as kVp rarely changed. Furthermore, it is possible that these findings are not generalizable as all contributing institutions used dose management software, perhaps reflecting heightened dose awareness. However, previous comparisons of the doses in the UCSF registry to others, such as the larger American College of Radiology Dose Index Registry, found our doses remarkably similar [8]. Finally—less a limitation than a caveat—the combination of technical parameters within the clusters in Table 3 are summaries: they are averages of the average parameters within each constituent protocol. They are not offered as recipes to follow precisely, but as guidelines for possible approaches from real practice at peer institutions.

Identifying patterns through this crowd-sourcing and clustering approach provides a useful framework for quality improvement, since dose reduction can be achieved by changes to either practice culture or acquisition technique. The outcomes highlight how variable current practice is, and how many options exist for “getting it right.” It is our hope that imaging clinicians can determine their own optimal protocols by adopting some of the practices of these lower-dose clusters: most notably, limiting phase to 1 and using kV < 120 when possible.

Supplementary information

Abbreviations

CT

Computed tomography

CTDIvol

Volume computed tomography dose index

DLP

Dose length product

mAs

Milliamperage per second

SSDE

Size-specific dose estimate

Funding

This research was supported by awards from the National Cancer Institute (1R01CA181191-01A1) and the Patient-Centered Outcomes Research Institute (CD-1304-7043 and DI-2018C1-11375). PCORI and NCI had no role in study design, collection, analysis, interpretation, and reporting of data, or decision to publish. The views in this article are solely the responsibility of the authors and do not necessarily represent the views of PCORI or NCI.

Compliance with ethical standards

Guarantor

The scientific guarantor of this publication is Dr. Rebecca Smith-Bindman.

Conflict of interest

Dr. Smith-Bindman is a co-founder and Dr. Timothy Szczykutowicz is a consultant of Alara Imaging, Inc., a new company focused on improving the clinical and operational aspects of health systems, including collecting and reporting CT radiation dose and image quality as part of payer-led quality programs. Alara Imaging had no role in the study, and Dr. Smith-Bindman has not received funding from Alara Imaging. The other authors have no conflicts to disclose.

Statistics and biometry

Dr. Taewoon Kang performed all statistical analyses for this study, with biostatistical consultation from Mr. Philip Chu and Dr. Yifei Wang.

Informed consent

The UCSF International CT Dose Registry (from which this work derived) was approved by the UCSF Institutional Review Board with a waiver of informed consent for all subjects. The waiver was approved for three reasons: the attempt to contact subjects poses a greater risk than this study; the large number of records required makes it impracticable to contact all potential subjects; and the researchers do not know the identity of the study subjects and therefore cannot contact them.

Ethical approval

Institutional Review Board approval was obtained.

Study subjects or cohorts overlap

As these data come from a CT radiation dose registry, some of the CT exams included in this study may be represented in previous publications, including the following: (1) Smith-Bindman et al [3]; (2) Wang et al [9]; (3) Smith-Bindman et al [16]; (4) Smith-Bindman et al [1].

Methodology

  • Retrospective

  • Observational

  • Multicenter study

Footnotes

Publisher’s Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

The online version contains supplementary material available at 10.1007/s00330-025-12131-w.

References

  • 1.Smith-Bindman R, Wang Y, Chu P et al (2019) International variation in radiation dose for computed tomography examinations: prospective cohort study. BMJ 364:k4931 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Masjedi H, Zare MH, Keshavarz Siahpoush N, Razavi-Ratki SK, Alavi F, Shabani M (2020) European trends in radiology: investigating factors affecting the number of examinations and the effective dose. Radiol Med 125:296–305 [DOI] [PubMed] [Google Scholar]
  • 3.Smith-Bindman R, Kang T, Chu PW et al (2024) Large variation in radiation dose for routine abdomen CT: reasons for excess and easy tips for reduction. Eur Radiol 34:2394–2404 [DOI] [PMC free article] [PubMed]
  • 4.Board of Radiation Effects Research Division on Earth and Life Sciences National Research Council of the National Academies (2006) Health risks from exposure to low levels of ionizing radiation: BEIR VII phase 2. The National Academies Press, 500 Fifth Street, NW, Washington, DC
  • 5.Brenner DJ, Sachs RK (2006) Estimating radiation-induced cancer risks at very low doses: rationale for using a linear no-threshold approach. Radiat Environ Biophys 44:253–256 [DOI] [PubMed] [Google Scholar]
  • 6.Hricak H, Brenner DJ, Adelstein SJ et al (2011) Managing radiation use in medical imaging: a multifaceted challenge. Radiology 258:889–905 [DOI] [PubMed]
  • 7.U.S. Food and Drug Administration (2019) FDA white paper: initiative to reduce unnecessary radiation exposure from medical imaging. Available via http://www.fda.gov/Radiation-EmittingProducts/RadiationSafety/RadiationDoseReduction/ucm199994.htm. Accessed 7 November 2025
  • 8.Kanal KM, Butler PF, Sengupta D, Bhargavan-Chatfield M, Coombs LP, Morin RL (2017) U.S. diagnostic reference levels and achievable doses for 10 adult CT examinations. Radiology 284:120–133 [DOI] [PubMed] [Google Scholar]
  • 9.Wang Y, Chu P, Szczykutowicz TP, Stewart C, Smith-Bindman R (2024) CT acquisition parameter selection in the real world: impacts on radiation dose and variation amongst 155 institutions. Eur Radiol 34:1605–1613 [DOI] [PMC free article] [PubMed]
  • 10.Guite KM, Hinshaw JL, Ranallo FN, Lindstrom MJ, Lee FT Jr (2011) Ionizing radiation in abdominal CT: unindicated multiphase scans are an important source of medically unnecessary exposure. J Am Coll Radiol 8:756–761 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Solberg LI, Wang Y, Whitebird R, Lopez-Solano N, Smith-Bindman R (2020) Organizational factors and quality improvement strategies associated with lower radiation dose from CT examinations. J Am Coll Radiol 17:951–959 [DOI] [PMC free article] [PubMed]
  • 12.Whitebird RR, Solberg LI, Bergdall AR, Lopez-Solano N, Smith-Bindman R (2021) Barriers to CT dose optimization: the challenge of organizational change. Acad Radiol 28:387–392 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Whitebird RR, Solberg LI, Chu PW, Smith-Bindman R (2022) Strategies for dose optimization: views from health care systems. J Am Coll Radiol 19:534–541 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Szczykutowicz TP, Bour RK, Pozniak M, Ranallo FN (2015) Compliance with AAPM practice guideline 1.a: CT protocol management and review—from the perspective of a university hospital. J Appl Clin Med Phys 16:5023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Siegelman JR, Gress DA (2013) Radiology stewardship and quality improvement: the process and costs of implementing a CT radiation dose optimization committee in a medium-sized community hospital system. J Am Coll Radiol 10:416–422 [DOI] [PubMed] [Google Scholar]
  • 16.Smith-Bindman R, Yu S, Wang Y et al (2022) An image quality-informed framework for CT characterization. Radiology 302:380–389 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Lee CH, Goo JM, Ye HJ et al (2008) Radiation dose modulation techniques in the multidetector CT era: from basics to practice. Radiographics 28:1451–1459 [DOI] [PubMed] [Google Scholar]
  • 18.American Association of Physicists in Medicine (2011) American Association of Physicists in Medicine Report No. 204. Size-specific dose estimates (SSDE) in pediatric and adult body CT examinations. American Association of Physicists in Medicine, College Park, Maryland.
  • 19.McCollough C, Bakalyar DM, Bostani M et al (2014) Use of water equivalent diameter for calculating patient size and size-specific dose estimates (SSDE) in CT: the report of AAPM Task Group 220. AAPM Rep 2014:6–23 [PMC free article] [PubMed] [Google Scholar]
  • 20.Marutho D, Hendra Handaka S, Wijaya E, Muljono (2018) The determination of cluster number at k-mean using elbow method and purity evaluation on headline news. In: Proceedings of the 2018 international seminar on application for technology of information and communication. IEEE, Semarang
  • 21.Thorndike RL (1953) Who belongs in the family? Psychometrika 18:267–276 [Google Scholar]
  • 22.Syakur M, Khotimah B, Rochman E, Satoto B (2018) Integration K-means clustering method and elbow method for identification of the best customer profile cluster. IOP Conf Ser Mater Sci Eng 336:012017
  • 23.SAS Institute Inc. (2008) SAS/STAT® 9.2 user’s guide. SAS Institute Inc., Cary, North Carolina
  • 24.Ketchen D, Shook C (1996) The application of cluster analysis in strategic management research: an analysis and critique. Strategic Manag J 17:441–458 [Google Scholar]
  • 25.Wicklin R (2014) SAS Blogs: SAS. Add a prediction ellipse to a scatter plot in SAS. Available via https://blogs.sas.com/content/iml/2014/07/21/add-prediction-ellipse.html. Accessed 31 May 2024
  • 26.Wicklin R (2014) SAS Blogs: SAS. Computing prediction ellipses from a covariance matrix. Available via https://blogs.sas.com/content/iml/2014/07/23/prediction-ellipses-from-covariance.html#:~:text=To%20get%20a%20prediction%20ellipse,on%20the%20sample%20size%20n. Accessed 31 May 2024
  • 27.Bos D, Yu S, Luong J et al (2022) Diagnostic reference levels and median doses for common clinical indications of CT: findings from an international registry. Eur Radiol 32:1971–1982 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Tsapaki V, Damilakis J, Paulo G et al (2021) CT diagnostic reference levels based on clinical indications: results of a large-scale European survey. Eur Radiol 31:4459–4469 [DOI] [PubMed] [Google Scholar]
  • 29.American College of Radiology (2021) ACR–SABI–SAR–SPR practice parameter for the performance of computed tomography (CT) of the abdomen and computed tomography (CT) of the pelvis. American College of Radiology. Accessed 7 November 2025

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from European Radiology are provided here courtesy of Springer

RESOURCES