Skip to main content
Springer logoLink to Springer
. 2021 Oct 25;32(4):2301–2312. doi: 10.1007/s00330-021-08308-8

Digital breast tomosynthesis (DBT) plus synthesised two-dimensional mammography (s2D) in breast cancer screening is associated with higher cancer detection and lower recalls compared to digital mammography (DM) alone: results of a systematic review and meta-analysis

Sylvia H Heywang-Köbrunner 1,2,, Alexander Jänsch 1,2, Astrid Hacker 1,2, Sina Weinand 3, Tobias Vogelmann 3
PMCID: PMC8921114  PMID: 34694451

Abstract

Objectives

Digital breast tomosynthesis (DBT) plus digital mammography (DM) in screening is problematic due to increased radiation by the double exposure. Synthesised two-dimensional mammography (s2D) calculated from DBT datasets at no additional dose appears a sensible alternative compared to adding DM. This systematic review and meta-analysis focuses on screening performance outcomes in women screened with DBT plus s2D compared to DM alone.

Methods

PubMed was searched from January 1, 2010, to September 2, 2020. Studies comparing DBT plus s2D to DM alone in breast cancer screening were included. Pooled risk ratios (RR) were estimated for cancer detection rates (CDR), recall rates, interval cancer rates (ICR), biopsy rates, and positive predictive values for recalls (PPV-1), for biopsies recommended (PPV-2), and for biopsies performed (PPV-3). Sensitivity analyses were performed using the leave-one-out approach. Risk of bias (RoB) was assessed using the Quality Assessment of Diagnostic Accuracy Studies (QUADAS)-2 tool.

Results

Twelve papers covering 414,281 women were included from 766 records identified. CDR is increased ([RR, 95% CI] 1.35, 1.20–1.52), recall rates are decreased (0.79, 0.64–0.98), and PPV-1 is increased (1.69, 1.45–1.96) when using DBT plus s2D compared to DM alone. ICR and biopsy rates did not differ, but PPV-2 respectively PPV-3 increased with DBT plus s2D (1.57, 1.08–2.28 respectively 1.36, 1.17–1.58). Overall RoB of studies was assessed to be low.

Conclusion

Results show improved diagnostic outcomes with DBT plus s2D compared to DM alone and underline the value of DBT in combination with s2D in breast cancer screening.

Key Points

DBT plus s2D is associated with higher CDR, lower recall rates, and a higher PPV-1 compared to DM alone in breast cancer screening.

No differences in biopsy rates were found between screening modalities, but PPV-2 and PPV-3 were higher in women screened with DBT plus s2D compared to DM alone.

We identified inconsistent results of ICR in two studies comparing DBT plus s2D to DM alone—resulting in no differences when pooling ICR in meta-analysis.

Supplementary Information

The online version contains supplementary material available at 10.1007/s00330-021-08308-8.

Keywords: Systematic review, Meta-analysis, Mammography, Breast neoplasms, Early detection of cancer

Introduction

Female breast cancer is the most common cancer worldwide and surpassed lung cancer in 2020 [1]. Population-based breast cancer screening programmes have been implemented to reduce mortality by early cancer detection. DM represents the current standard in most screening programmes. In the last decade, technological advances in image acquisition resulted in the development of DBT. DBT allows for the calculation of (mostly) 1 mm mammographic slices of the 3D volume imaged in a desired projection. This avoids superimposition, enhances cancer detection, and improves the sensitivity compared to DM. Based on the technology available at that time, initial studies compared the combination of DBT plus DM to DM alone. While the combination proved to be more sensitive [24] and more specific [4] than DM alone, it is associated with a ~ 2–2.4-fold dosage compared to DM alone [58]. The addition of DM to DBT compared to DM alone may allow for easier assessment of breast symmetry and facilitate the comparison with prior mammograms. Both may yield diagnostically important additional information. s2D from DBT images has been developed as a substitute for the additional DM [6, 9]. Synthetic mammographic images are calculated from the DBT dataset, requiring no double exposure. Thus, the combination of DBT plus s2D promised to maintain the diagnostic advantages of the new method at a radiation dose which is comparable to or only slightly higher than that of DM.

While the diagnostic superiority of DBT plus DM compared to DM alone has already been shown in several meta-analyses [24, 1012], we identified two meta-analyses [13, 14] evaluating DBT plus s2D compared to DM alone. Giampietro et al. [14] estimated an overall higher CDR when using DBT plus s2D compared to DM, whereas no statistically significant differences were observed for recalls. According to Alabousi et al. [13], DBT plus s2D is associated with higher CDR, lower recall rates, and higher PPV-1 compared to DM alone [13]. The differing results of these meta-analyses may be explained by methodological differences. Since differences in recall rates are inconsistent between studies [14], sensitivity analyses are needed to investigate the effect of each study included. Hitherto, no meta-analysis has considered the outcomes of biopsy rate, PPV-2, and PPV-3.

We aimed to perform a systematic review and meta-analysis of CDR and recall rates, focusing on available information on biopsy rates, PPV-1, -2, and -3, and of ICR, limited to studies comparing DBT plus s2D to DM alone. Furthermore, we planned to systematically perform sensitivity analyses to address heterogeneity among studies.

Materials and methods

Literature search

A systematic literature search was performed to identify relevant studies published on PubMed between January 1, 2010, and September 2, 2020. Search terms are available in ESM, S1. The search was limited to studies published in English language and with available abstracts. This study is reported in accordance to the Preferred Reporting Items for Systematic reviews and Meta-Analysis (PRISMA) guidelines [15, 16]. Our protocol was registered on INPLASY under registration number INPLASY202140073.

Eligibility criteria

Prospective and retrospective studies with a comparative design reporting original data were evaluated eligible if, first, asymptomatic women with an average risk of breast cancer presenting for screening were considered, including women with symptoms at a regular screening. No restriction regarding age, gender, or country was set (Patients). Second, studies were included if DBT plus s2D (Index test) was compared to DM alone (Comparator). Third, at least data of CDR, recall rates, ICR, biopsy rates, or PPV 1–3 (Outcomes) had to be reported. Studies without human subjects and studies evaluating solely women with symptoms, findings suspicious of breast cancer, or screened for diagnostic work-up only were excluded.

Study selection and data extraction

Records identified through database search were first screened for eligibility based on information provided in title and abstract and second, using the full texts of articles. In case of any disagreement among reviewers, a third reviewer assessed the record and consensus was reached by discussion. Data of studies assessed to be eligible after full-text screening were extracted into a pretested spreadsheet by two independent reviewers. Reviewers were not blinded to the authors and institutions of studies undergoing review. The data extraction spreadsheet was designed according to the checklist of the data extraction for complex meta-analysis (DECiMAL) guide [17] (ESM, S1 for detailed information).

Quality assessment

The RoB and applicability were evaluated by two reviewers independently using QUADAS-2 [18]. In case of any disagreement, a third reviewer was asked for assessment and consensus was reached by discussion. Studies were assessed for RoB regarding the dimensions (I) patient selection, (II) index test, (III) reference standard, and (IV) flow and timing. Applicability was evaluated by dimensions I to III.

Statistical analysis

Since all outcomes were dichotomous, RR with a 95% CI were used as summary statistics (inverse variance) to express the outcome in women screened with DBT plus s2D in relation to women screened with DM alone. p values less than 0.05 were defined as an indicator of statistical significance. Heterogeneity among the studies for each outcome was ascertained visually by forest plots and statistically using the Higgins I2 for quantification inconsistency. Random effects models (REM) were used to pool an effect size when I2 > 50%—indicating a moderate to high probability of heterogeneity, otherwise (I2 ≤ 50%) fixed effects models (FEM) were used. Studies reporting outcomes based on the same population were included once, whereas the study with the larger sample size was included. Analysis was conducted using Microsoft Excel and Review Manager 5.4. To address heterogeneity among studies [19] and to test the robustness of results, sensitivity analyses were done using the leave-one-out approach if at least three studies were included in analysis. Each study was excluded once from meta-analysis and results were recorded to verify whether findings are depending on any study. The results are presented in summary tables in ESM, S2.

Results

Study selection

A total of 766 records were identified in PubMed. Five hundred eight records were excluded after title and abstract screening and 258 records were assessed in full text for eligibility. A total of 246 studies were excluded after full-text screening for eligibility. Twelve studies [5, 2030], representing the results of 10 populations, were included in at least one meta-analysis of CDR [5, 2028], recall rate [20, 21, 2327, 29], ICR [21, 30], biopsy rate [20, 2325, 27], PPV-1 [20, 21, 2327, 29], PPV-2 [20, 23], or PPV-3 [20, 2325, 27]. The selection process is shown in Fig. 1.

Fig. 1.

Fig. 1

PRISMA flowchart of studies describing the process of selecting studies included in meta-analysis

Study characteristics

Table 1 summarises characteristics of studies comparing DBT plus s2D versus DM alone. Twelve studies [5, 2030] represent results of 10 unique study populations with 414,281 women. Two studies were conducted in the USA (79,209 women) [20, 23], one in Australia (10,146 women) [26], and 9 in Europe, representing results of 7 populations (324,926 women) [5, 21, 22, 24, 25, 2730]. Two of the European studies reported outcomes from Trento, Italy (Bernardi et al. 2020 [21] and Bernardi et al. 2016 (STORM-2) [5]). Women who were previously enrolled in STORM/STORM-2 were excluded from analysis in Bernardi et al. (2020).

Table 1.

Summary of study characteristics of included studies comparing DBT plus s2D versus DM alone

Inclusion criteria Number of analysis units
Study Year Country Study design Reading Period Age£ Analysis unit DBT + s2D DM Total
Studies from the United States (US) 25,698 53,511 79,209
Aujero et al. [20] 2017 USA R Unpaired Single 10, 2011–06, 2016 (DM); 08, 2015–06, 2016 (DBT plus s2D) NA Women« 16,173 32,076 48,249
Freer et al. [23] 2017 USA R Unpaired NA 10, 2013–12, 2015 (DM); 01, 2015–12, 2015 (DBT plus s2D) NA Women 9,525 21,435 30,960
Studies from Europe (EU) 182,025 192,953 324,932
Bernardi et al. [21] 2020 Italy P Unpaired Double 01, 2013–10, 2014 (DM); 10, 2014–10, 2016 (DBT plus s2D) 50 Women« 46,343 37,436 83,779
Caumo et al., A (Verona-SC) [22] 2018 Italy P Unpaired Double 04, 2013–03, 2015 (DM); 04, 2015–03, 2017 (DBT plus s2D) 50 Women 34,071 29,360 63,431
Caumo et al., B (Verona-SC) [29] 2018 Italy P Unpaired Double 04, 2014–03, 2015 (DM); 04, 2015–03, 2016 (DBT plus s2D) 50 Women 16,666 14,423 31,089
Bernardi et al. (STORM-2) [5] 2016 Italy P Paired Double 05, 2013–05, 2015 49 Screensϕ 9,677 9,677 9,677
Romero Martin et al. [27] 2018 Spain P Paired Single 01, 2015–12, 2016 50 Screens¤ 16,068 16,068 16,068
Hofvind et al. (OVVV) [24] 2018 Norway P Unpaired Double 02, 2014–01, 2016 50 Women 37,185 61,742 98,927
Hovda et al. (OVVV) [30] 2020 Norway P Unpaired Double 02, 2014–12, 2015 (Round 1); 02, 2016–12, 2017 (Round 2) 50 Women 34,641 57,763 92,404
Hofvind et al. (To-Be) [25] 2019 Norway RCT Unpaired Double 01, 2016–12, 2017 50 Women 14,380 14,369 28,749
Skaane et al. (OTST) [28] 2019 Norway P Paired Single 11, 2010–12, 2012 50 Women 24,301 24,301 24,301
Studies from Australia (AUS) 5,018 5,166 10,184
Houssami et al. [26] 2019 Australia P Unpaired Double 08, 2017 – 11, 2018 40 Screens¦ 5,018 5,166 10,184
Total number of analysis units 212,741 251,630 414,325
Total number of women 212,710 251,611 414,281

The bold numbers represent the subtotals (number of analysis units) for studies from the United States (US), Europe (EU), and Australia (AUS)

STORM-2, screening with tomosynthesis or standard mammography-2; Verona-SC, Verona screening; OVVV, Oslo, Vestfold, and Vestre Viken; OTST, Oslo Tomosynthesis Screening Trial; To-Be, tomosynthesis trial in Bergen; R, retrospective; P, prospective

£Minimum age of included women

Number of analysis units from Caumo et al. B (2018) and Hovda et al. (2020) were not included

Single-reading of DBT plus s2D and double-reading of DM

«Number of women is equal to the number of screens

ϕ9,672 women, since 5 women had bilateral breast cance

¤16,067 women, since one women had bilateral cancer

¦10,146 women (DBT plus s2D: 4,993; DM: 5,153), since 38 (DBT plus s2D: 25; DM: 13) women were screened annually had two screening episodes each during the trial

Quality assessment

Figure 2 shows the RoB and applicability assessment. All studies were evaluated having a high RoB in ‘flow of timing’, as not every woman received the same reference test after screening. Low RoB would require that all women, including women with inconspicuous findings in screening, subsequently undergo histopathological assessment for verification. Since this is ethically not acceptable, studies assessed with a high RoB in the domain ‘flow and timing’ only were assessed with an overall low RoB (ESM, S3).

Fig. 2.

Fig. 2

Quality assessment of included studies using QUADAS-2

Synthesis of results

CDR, recall rate, and PPV-1

The CDR reports the number of cancers detected among 1,000 women screened/examinations. Ten studies [5, 2028] were included in the meta-analysis of CDR. Two studies [29, 30] were not included to avoid double counting of women. The CDR was estimated to be significantly higher when using DBT plus s2D compared to DM alone (RR: 1.35, 95% CI: 1.20–1.52, p < 0.01, I2: 58%) using REM (Fig. 3). Sensitivity analyses demonstrated robustness regarding statistical significance (ESM, S2).

Fig. 3.

Fig. 3

Forest plots for (a) cancer detection rates (CDR) and (b) recall rates in women screened with DBT plus s2D compared to women screened with DM alone. Squares with horizontal lines represent individual study estimates and 95% confidence interval (CI). Diamond represents the pooled estimate and 95% CI. REM random effects model, Can cancers detected, Rec recalls, % weight

The number of women recalled per 100 women/examinations is represented by the recall rate. Eight studies [20, 21, 2327, 29] were included in the meta-analysis of recall rates (Fig. 3). The study of Bernardi et al. 2016 [5] was excluded since only false-positive recalls were reported. Data of recalls in Skaane (2019) (Oslo Tomosynthesis Screening Trial (OTST)) [28] were not available for the study group DBT plus s2D (Arm D); recalls were reported in total for women screened with DBT plus DM/s2D (Arm C + D). Previous published OTST studies were excluded from analysis, since they did not report recall rates in women screened with DBT plus s2D compared to DM alone [3133]. Recall rates by REM were significantly lower when using DBT plus s2D compared to DM alone (RR: 0.79, 95% CI: 0.64–0.98, p: 0.03, I2: 97%). Results were not robust with regard to statistical significance if single studies were left out (ESM, S2).

The relation of cancers detected by women recalled is represented by the PPV-1. Eight studies [20, 21, 2327, 29] were included in the meta-analysis of PPV-1 (Fig. 4). Two studies [21, 26] did not separately report the PPV-1; therefore, we calculated PPV-1. There was a statistically significant higher cancer detection when being recalled in screening with DBT plus s2D compared to DM alone (RR: 1.69, 95% CI: 1.45–1.96, p < 0.01, I2: 73%), using REM. Sensitivity analyses demonstrated statistically significant robust results (ESM, S2).

Fig. 4.

Fig. 4

Forest plot for positive predictive value for recalls (PPV-1) in women screened with DBT plus s2D compared to women screened with DM alone. Squares with horizontal lines represent individual study estimates and 95% confidence interval (CI). Diamond represents the pooled estimate and 95% CI. REM random effects model, Can cancers detected, Rec recalls, % weight

Data of a screening programme in Australia are included in the performed meta-analyses of CDR, recall rate, and PPV-1. Houssami et al. [26] performed sensitivity analyses in which screens of women who reported symptoms at screening were excluded. Symptomatic women are also likely to participate in breast cancer screening programmes; however, the results of the other screening programmes were not stratified for asymptomatic and symptomatic women at screening. Therefore, we included data of all women to enhance comparability between studies, but performed sensitivity analyses, in which the data of asymptomatic women only were included. Results of sensitivity analyses showed no differences of risk ratios for CDR, recall rates, and PPV-1 (ESM, S2).

Biopsy rate, PPV-2, and PPV-3

Biopsy rates indicate how many biopsies were performed per 1,000 women/examinations. Five studies [20, 2325, 27] were included in the meta-analysis of biopsy rates. In two studies [24, 27], biopsy rates were calculated using the percentage of PPV-3 or CDR. No statistically significant differences in biopsies in women screened with DBT plus s2D compared to DM alone were observed. The RR calculated using REM (RR: 0.87, 95% CI: 0.70–1.09, p: 0.22, I2: 91%) demonstrates a potentially lower number of biopsies when using DBT plus s2D in screening compared to DM alone (Fig. 5). Sensitivity analyses demonstrated robustness of results with regard to statistical significance (ESM, S2).

Fig. 5.

Fig. 5

Forest plots for (a) biopsy rates, (b) positive predictive value for biopsies recommended (PPV-2), and (c) positive predictive value for biopsies performed (PPV-3) in women screened with DBT plus s2D compared to women screened with DM alone. Squares with horizontal lines represent individual study estimates and 95% confidence interval (CI). Diamond represents the pooled estimate and 95% CI. REM random effects model, Biop biopsies performed, BR biopsy rate, Biop R biopsies recommended, Can cancers detected, % weight

PPV-2, or PPV-3 respectively, indicates the number of cancers detected among 100 biopsies recommended, or performed, respectively. Since some studies report PPV-2 or PPV-3 only, we analysed both. Five studies were included in the meta-analysis of PPV-3 [20, 2325, 27]. Cancer detection in women being biopsied after screening with DBT plus s2D is statistically significantly higher (RR: 1.36, 95% CI: 1.17–1.58, p < 0.01) compared to women screened with DM alone (Fig. 5), using REM (I2: 67%). Sensitivity analyses demonstrated robustness of results regarding statistical significance (ESM, S2). PPV-2 (Fig. 5) was reported in two studies [20, 23]. CDR is higher in women with recommended biopsy after screening with DBT plus s2D compared to DM alone using REM (RR: 1.57, 95% CI: 1.08–2.28, p: 0.02, I2: 82%).

ICR

The ICR indicates the number of interval cancers per 1,000 women screened/examinations. Two European studies [21, 30] were identified reporting interval cancers in women screened with DBT plus s2D compared to women screened with DM alone (Fig. 6). No statistically significant difference in ICR was observed for both the pooled estimate using REM (RR: 1.03, 95% CI: 0.66–1.63, p: 0.88, I2: 70%) and single ICR reported in the studies.

Fig. 6.

Fig. 6

Forest plots for interval cancer rates (ICR) in women screened with DBT plus s2D compared to women screened with DM alone. Squares with horizontal lines represent individual study estimates and 95% confidence interval (CI). Diamond represents the pooled estimate and 95% CI. REM random effects model, IC interval cancers, % weight

Discussion

Summary of results

Twelve studies comparing DBT plus s2D to DM alone in screening were included in our meta-analyses. We found that screening with DBT plus s2D compared to DM alone is associated with a higher CDR ([RR, 95% CI] 1.35, 1.20–1.52), decreased recalls (0.79, 0.64–0.98), and a higher cancer detection among recalls (1.69, 1.45–1.96). Cancer detection after recommended and performed biopsies was higher with DBT plus s2D compared to DM alone (PPV-2: 1.57, 1.08–2.28; PPV-3: 1.36, 1.17–1.58). We did not identify any differences in biopsy rates and ICR.

Results interpretation and comparison with literature

Our results regarding CDR, recall rates, and PPV-1 were in line with Alabousi et al. [13]. In comparison to Giampietro et al. [14], we found a statistically significant difference in recalls with fewer recalls for DBT plus s2D than for DM alone. However, the inclusion criteria of the latter study differ. The better results in our and Alabousi’s study may indicate lower recall rates for DBT plus s2D than for DBT plus DM. They may also reflect the learning curve from prior to more recent studies. Statistical significance was lost in our sensitivity analyses of recalls, if, e.g. the US studies Aujero et al. [20] or Freer et al. [23] were excluded. Since screening characteristics like reading procedure or screening intervals differ in the USA, these could be potential factors impacting heterogeneity. In contrast to the other studies, Houssami et al. [26] reported a statistically significantly higher risk of being recalled for one pilot screening trial when using DBT plus s2D compared to DM alone. In this population, women screened with DBT plus s2D were younger, reported symptoms more often, and participated more often in the prevalent screening round compared to women screened with DM alone [26]. Even if recall rates are contrary between studies, the number of cancers detected per 100 recalled women is consistently higher in women screened with DBT plus s2D. DBT plus s2D is associated with higher CDR and concurrently with fewer recalls. Furthermore, higher cancer detection per 100 women with recommended or performed biopsy underlines that DBT plus s2D is more precise in identifying cancers than DM alone. A 9%-point higher sensitivity (83%, 95% CI: 78–87%) for DBT plus s2D compared to DM alone (74%, 95% CI: 65–81%) was also reported by Abdullah et al. [34].

Since high CDR may be related with overdiagnoses, ICR is the more clinically relevant outcome parameter as it reflects potentially important delays in diagnosis and treatment. We identified two European studies [21, 30] reporting ICR comparing DBT plus s2D to DM alone. While Bernardi et al. [21] defined interval cancers as ‘cancers identified over two-year follow-up’ [21], Hovda et al. [30] defined interval cancers as ‘cancers diagnosed 0–24 months after negative screening findings or 6–24 months after false positive baseline screening findings’ [30]. An inconsistent trend of ICR per screening modality and small sample sizes resulted in no statistically significant difference. The same results are shown in a recently published meta-analysis by Houssami et al.[35]. They assessed ICR in women screened with DBT compared to DM. Sensitivity analyses had shown no statistically significant differences in ICR comparing DBT plus s2D to DM alone [35].

Published data on interval cancers following DBT plus s2D are limited and inconsistent. In principle, high CDR and unchanged ICR may be associated with a smaller than expected improvement of mortality reduction and with the risk of increased overdiagnosis. However, the effect of over-detection on mortality reduction and the risk of overdiagnosis can only be estimated after results of follow-up rounds, cancer stages, and biology become available from appropriately designed studies. For improved interpretation of the increased CDR, cancer biology and results for different breast densities may play a role. While Winter et al. [36] reported a lower rate of node positive interval cancers after DBT screening, Bahl et al. [37] reported comparable biology of interval cancer after DBT versus FFDM. Both of these study designs, however, cannot exclude bias. One very recent study, the only study using wide-angle DBT (without s2D), presented reduced ICR after screening with DBT compared to DM [38]. While differences might be associated with the different technologies, possible bias in the control group must also be discussed. Considering the differing results of the limited data on interval cancers, a possible correlation between the amount of recall reduction, additional detection, and effect on ICR might also be worth discussing. Also, most of the included studies were originally not designed and powered to show differences in ICR. Meta-analyses with pooled estimates based on data from underpowered studies with small sample sizes are also likely to be underpowered [39]. Finally, results concerning additional detection and effect on ICR may vary for different ranges of breast densities. To date, these data are not yet available from large studies. Given the fact that the European Commission recommends mammography screening for women aged 45–49 years old [40], DBT could be a more effective alternative, since younger women tend to have more dense breasts and the accuracy of mammography may be poorer. Furthermore, overdiagnoses may be lower in younger women with a longer remaining life-time, as small cancers have a higher risk, or longer time, respectively, for negative development.

Biases and limitations

This study has several limitations. First, search was carried out in only PubMed and studies that did not have available abstracts and English full text were excluded. Second, in 9 of the ten underlying trials women were not assigned randomly to screening modalities (concerning ~ 385,532 women among a total of 414,281). Also, 4 of the ten underlying trial study groups differ in time periods (concerning ~ 226,419 women among a total of 414,281). Unpaired and non-randomised study designs, for example in which participant characteristics (e.g. breast density, family history, or availability of screening modalities) may differ, lead to potential bias by confounding variables. Since a systematic assessment of potential confounding variables was beyond the scope of our work, a comparison of screening performance of modalities in women with dense breast tissue only seems to be useful in further subgroup analyses. Third, heterogeneity among studies was observed. We used REM and strived to interpret results only considering potential factors impacting heterogeneity. However, our study did not address other influencing or limiting factors. Valuable further data from randomised designs will become available in the near future, for example from a large RCT for which the recruitment of a prospectively acquired study population of 80,000 women [41] was recently completed.

Conclusion

To our best knowledge, this is the first meta-analysis of biopsy rates, PPV-2, and PPV-3 in women screened with DBT plus s2D compared to DM alone. Statistically significant differences in favour of DBT plus s2D compared to DM alone were found for CDR, recall rates, PPV-1, PPV-2, and PPV-3. Biopsy rates and ICR did not differ between screening modalities. Further research regarding ICR stratified by age and breast density is needed.

Supplementary Information

Below is the link to the electronic supplementary material.

Abbreviations

CDR

Cancer detection rate

CI

Confidence interval

DBT

Digital breast tomosynthesis

DM

Digital mammography

FEM

Fixed effects model

ICR

Interval cancer rate

INPLASY

International Platform of Registered Systematic Review and Meta-analysis Protocols

OTST

Oslo Tomosynthesis Screening Trial

PPV-1

Positive predictive value for recalls

PPV-2

Positive predictive value for biopsies recommended

PPV-3

Positive predictive value for biopsies performed

PRISMA

Preferred Reporting Items for Systematic reviews and Meta-Analysis

QUADAS

Quality Assessment of Diagnostic Accuracy Studies

REM

Random effects model

RoB

Risk of bias

RR

Risk ratio or relative risk

s2D

Synthesised two-dimensional mammography

Funding

This study has received funding from Hologic.

Declarations

Guarantor

The scientific guarantor of this publication is Prof. Dr. Sylvia H. Heywang-Köbrunner.

Conflict of interest

The authors of this manuscript declare relationships with the following companies: Tobias Vogelmann is owner and employee and Sina Weinand is an employee of LinkCare GmbH; both received consulting honorariums from Hologic. The authors of this manuscript declare having no other relationships with any companies whose products or services may be related to the subject matter of this article.

Statistics and biometry

Authors Tobias Vogelmann and Sina Weinand have significant statistical expertise.

Informed consent

Not applicable.

Ethical approval

Institutional Review Board approval was not required because all data included was available in the public domain.

Methodology

• Systematic review

• Meta-analysis

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Sung H, Ferlay J, Siegel RL, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2021 doi: 10.3322/caac.21660. [DOI] [PubMed] [Google Scholar]
  • 2.Phi X-A, Tagliafico A, Houssami N, Greuter MJW, de Bock GH. Digital breast tomosynthesis for breast cancer screening and diagnosis in women with dense breasts - a systematic review and meta-analysis. BMC Cancer. 2018 doi: 10.1186/s12885-018-4263-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Alabousi M, Zha N, Salameh J-P, et al. Digital breast tomosynthesis for breast cancer detection: a diagnostic test accuracy systematic review and meta-analysis. Eur Radiol. 2020;30:2058–2071. doi: 10.1007/s00330-019-06549-2. [DOI] [PubMed] [Google Scholar]
  • 4.Lei J, Yang P, Zhang L, Wang Y, Yang K. Diagnostic accuracy of digital breast tomosynthesis versus digital mammography for benign and malignant lesions in breasts: a meta-analysis. Eur Radiol. 2014;24:595–602. doi: 10.1007/s00330-013-3012-x. [DOI] [PubMed] [Google Scholar]
  • 5.Bernardi D, Macaskill P, Pellegrini M, et al. Breast cancer screening with tomosynthesis (3D mammography) with acquired or synthetic 2D mammography compared with 2D mammography alone (STORM-2): a population-based prospective study. Lancet Oncol. 2016;17:1105–1113. doi: 10.1016/S1470-2045(16)30101-2. [DOI] [PubMed] [Google Scholar]
  • 6.Svahn TM, Houssami N, Sechopoulos I, Mattsson S. Review of radiation dose estimates in digital breast tomosynthesis relative to those in two-view full-field digital mammography. Breast. 2015;24:93–99. doi: 10.1016/j.breast.2014.12.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Yaffe MJ. Reducing radiation doses for breast tomosynthesis? Lancet Oncol. 2016;17:1027–1029. doi: 10.1016/S1470-2045(16)30155-3. [DOI] [PubMed] [Google Scholar]
  • 8.Pattacini P, Nitrosi A, Giorgi Rossi P, et al. Digital mammography versus digital mammography plus tomosynthesis for breast cancer screening: the Reggio Emilia Tomosynthesis randomized trial. Radiology. 2018;288:375–385. doi: 10.1148/radiol.2018172119. [DOI] [PubMed] [Google Scholar]
  • 9.Gur D, Zuley ML, Anello MI, et al. Dose reduction in digital breast tomosynthesis (DBT) screening using synthetically reconstructed projection images: an observer performance study. Acad Radiol. 2012;19:166–171. doi: 10.1016/j.acra.2011.10.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Hodgson R, Heywang-Köbrunner SH, Harvey SC, et al. Systematic review of 3D mammography for breast cancer screening. Breast. 2016;27:52–61. doi: 10.1016/j.breast.2016.01.002. [DOI] [PubMed] [Google Scholar]
  • 11.Yun SJ, Ryu C-W, Rhee SJ, Ryu JK, Oh JY. Benefit of adding digital breast tomosynthesis to digital mammography for breast cancer screening focused on cancer characteristics: a meta-analysis. Breast Cancer Res Treat. 2017;164:557–569. doi: 10.1007/s10549-017-4298-1. [DOI] [PubMed] [Google Scholar]
  • 12.Marinovich ML, Hunter KE, Macaskill P, Houssami N. Breast cancer screening using tomosynthesis or mammography: a meta-analysis of cancer detection and recall. J Natl Cancer Inst. 2018;110:942–949. doi: 10.1093/jnci/djy121. [DOI] [PubMed] [Google Scholar]
  • 13.Alabousi M, Wadera A, Kashif Al-Ghita M, et al. Performance of digital breast tomosynthesis, synthetic mammography and digital mammography in breast cancer screening: a systematic review and meta-analysis. J Natl Cancer Inst. 2021;113:680–690. doi: 10.1093/jnci/djaa205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Giampietro RR, Cabral MVG, Lima SAM, Weber SATW, Nunes-Nogueira VDS. Accuracy and effectiveness of mammography versus mammography and tomosynthesis for population-based breast cancer screening: a systematic review and meta-analysis. Sci Rep. 2020 doi: 10.1038/s41598-020-64802-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.McInnes MDF, Moher D, Thombs BD, et al. Preferred Reporting Items for a Systematic Review and Meta-analysis of diagnostic test accuracy studies: the PRISMA-DTA statement. JAMA. 2018;319:388–396. doi: 10.1001/jama.2017.19163. [DOI] [PubMed] [Google Scholar]
  • 16.Moher D, Liberati A, Tetzlaff J, Altman D. Preferred Reporting Items for Systematic Reviews and Meta-analyses: the PRISMA statement. PLoS Med. 2009;6:e1000097. doi: 10.1371/journal.pmed.1000097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Pedder H, Sarri G, Keeney E, Nunes V, Dias S. Data extraction for complex meta-analysis (DECiMAL) guide. Syst Rev. 2016 doi: 10.1186/s13643-016-0368-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Whiting PF, Rutjes AWS, Westwood ME, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155:529–536. doi: 10.7326/0003-4819-155-8-201110180-00009. [DOI] [PubMed] [Google Scholar]
  • 19.Higgins JP, Green S (2011) Cochrane Handbook for Systematic Reviews of Interventions (Version 5.1.0) - Part 2: General methods for Cochrane reviews - Chapter 9.5.3 Strategies for addressing heterogeneity. The Cochrane Collaboration. https://training.cochrane.org/handbook. Accessed 15 Mar 2021
  • 20.Aujero MP, Gavenonis SC, Benjamin R, Zhang Z, Holt JS. Clinical performance of synthesized two-dimensional mammography combined with tomosynthesis in a large screening population. Radiology. 2017;283:70–76. doi: 10.1148/radiol.2017162674. [DOI] [PubMed] [Google Scholar]
  • 21.Bernardi D, Gentilini MA, De Nisi M, et al. Effect of implementing digital breast tomosynthesis (DBT) instead of mammography on population screening outcomes including interval cancer rates: Results of the Trento DBT pilot evaluation. Breast. 2020;50:135–140. doi: 10.1016/j.breast.2019.09.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Caumo F, Romanucci G, Hunter K, et al. Comparison of breast cancers detected in the Verona screening program following transition to digital breast tomosynthesis screening with cancers detected at digital mammography screening. Breast Cancer Res Treat. 2018;170:391–397. doi: 10.1007/s10549-018-4756-4. [DOI] [PubMed] [Google Scholar]
  • 23.Freer PE, Riegert J, Eisenmenger L, et al. Clinical implementation of synthesized mammography with digital breast tomosynthesis in a routine clinical practice. Breast Cancer Res Treat. 2017;166:501–509. doi: 10.1007/s10549-017-4431-1. [DOI] [PubMed] [Google Scholar]
  • 24.Hofvind S, Hovda T, Holen ÅS, et al. Digital breast tomosynthesis and synthetic 2D mammography versus digital mammography: evaluation in a population-based screening program. Radiology. 2018;287:787–794. doi: 10.1148/radiol.2018171361. [DOI] [PubMed] [Google Scholar]
  • 25.Hofvind S, Holen ÅS, Aase HS, et al. Two-view digital breast tomosynthesis versus digital mammography in a population-based breast cancer screening programme (To-Be): a randomised, controlled trial. Lancet Oncol. 2019;20:795–805. doi: 10.1016/S1470-2045(19)30161-5. [DOI] [PubMed] [Google Scholar]
  • 26.Houssami N, Lockie D, Clemson M, et al. Pilot trial of digital breast tomosynthesis (3D mammography) for population-based screening in BreastScreen Victoria. Med J Aust. 2019;211:357–362. doi: 10.5694/mja2.50320. [DOI] [PubMed] [Google Scholar]
  • 27.Romero Martín S, Raya Povedano JL, Cara García M, Romero ALS, Garriguet MP, Álvarez Benito M. Prospective study aiming to compare 2D mammography and tomosynthesis + synthesized mammography in terms of cancer detection and recall. From double reading of 2D mammography to single reading of tomosynthesis. Eur Radiol. 2018;28:2484–2491. doi: 10.1007/s00330-017-5219-8. [DOI] [PubMed] [Google Scholar]
  • 28.Skaane P, Bandos AI, Niklason LT, et al. Digital mammography versus digital mammography plus tomosynthesis in breast cancer screening: the Oslo Tomosynthesis Screening Trial. Radiology. 2019;291:23–30. doi: 10.1148/radiol.2019182394. [DOI] [PubMed] [Google Scholar]
  • 29.Caumo F, Zorzi M, Brunelli S, et al. Digital breast tomosynthesis with synthesized two-dimensional images versus full-field digital mammography for population screening: outcomes from the Verona Screening Program. Radiology. 2018;287:37–46. doi: 10.1148/radiol.2017170745. [DOI] [PubMed] [Google Scholar]
  • 30.Hovda T, Holen ÅS, Lång K, et al. Interval and consecutive round breast cancer after digital breast tomosynthesis and synthetic 2D mammography versus standard 2D digital mammography in BreastScreen Norway. Radiology. 2020;294:256–264. doi: 10.1148/radiol.2019191337. [DOI] [PubMed] [Google Scholar]
  • 31.Skaane P, Bandos AI, Eben EB, et al. Two-view digital breast tomosynthesis screening with synthetically reconstructed projection images: comparison with digital breast tomosynthesis with full-field digital mammographic images. Radiology. 2014;271:655–663. doi: 10.1148/radiol.13131391. [DOI] [PubMed] [Google Scholar]
  • 32.Skaane P, Bandos AI, Gullien R, et al. Comparison of digital mammography alone and digital mammography plus tomosynthesis in a population-based screening program. Radiology. 2013;267:47–56. doi: 10.1148/radiol.12121373. [DOI] [PubMed] [Google Scholar]
  • 33.Skaane P, Sebuødegård S, Bandos AI, et al. Performance of breast cancer screening using digital breast tomosynthesis: results from the prospective population-based Oslo Tomosynthesis Screening Trial. Breast Cancer Res Treat. 2018;169:489–496. doi: 10.1007/s10549-018-4705-2. [DOI] [PubMed] [Google Scholar]
  • 34.Abdullah P, Alabousi M, Ramadan S, et al. Synthetic 2D mammography versus standard 2D digital mammography: a diagnostic test accuracy systematic review and meta-analysis. AJR Am J Roentgenol. 2021;217:314–325. doi: 10.2214/AJR.20.24204. [DOI] [PubMed] [Google Scholar]
  • 35.Houssami N, Zackrisson S, Blazek K, et al. Meta-analysis of prospective studies evaluating breast cancer detection and interval cancer rates for digital breast tomosynthesis versus mammography population screening. Eur J Cancer. 2021;148:14–23. doi: 10.1016/j.ejca.2021.01.035. [DOI] [PubMed] [Google Scholar]
  • 36.Winter AM, Kazmi S, Hardy AK, Bennett DL. Comparison of interval breast cancers with 2D digital mammography versus 3D digital breast tomosynthesis in a large community-based practice. Breast J. 2020;26:1953–1959. doi: 10.1111/tbj.14047. [DOI] [PubMed] [Google Scholar]
  • 37.Bahl M, Gaffney S, McCarthy AM, Lowry KP, Dang PA, Lehman CD. Breast cancer characteristics associated with 2D digital mammography versus digital breast tomosynthesis for screening-detected and interval cancers. Radiology. 2018;287:49–57. doi: 10.1148/radiol.2017171148. [DOI] [PubMed] [Google Scholar]
  • 38.Johnson K, Lång K, Ikeda DM, Åkesson A, Andersson I, Zackrisson S. Interval breast cancer rates and tumor characteristics in the prospective population-based Malmö Breast Tomosynthesis Screening Trial. Radiology. 2021;299:559–567. doi: 10.1148/radiol.2021204106. [DOI] [PubMed] [Google Scholar]
  • 39.Durand MA, Friedewald SM, Plecha DM, et al. False-negative rates of breast cancer screening with and without digital breast tomosynthesis. Radiology. 2021;298:296–305. doi: 10.1148/radiol.2020202858. [DOI] [PubMed] [Google Scholar]
  • 40.European Commission (2019) European Commission Initiative on Breast Cancer - European breast cancer guidelines: screening ages and frequencies. The European Commission Initiative on Breast Cancer (ECIBC). https://healthcare-quality.jrc.ec.europa.eu/european-breast-cancer-guidelines/screening-ages-and-frequencies. Accessed 15 Mar 2021
  • 41.Weigel S, Gerss J, Hense H-W, et al. Digital breast tomosynthesis plus synthesised images versus standard full-field digital mammography in population-based screening (TOSYMA): protocol of a randomised controlled trial. BMJ Open. 2018 doi: 10.1136/bmjopen-2017-020475. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from European Radiology are provided here courtesy of Springer

RESOURCES