Deep learning supported mitoses counting on whole slide images: A pilot study for validating breast cancer grading in the clinical workflow

Stijn A van Bergeijk; Nikolas Stathonikos; Natalie D ter Hoeve; Maxime W Lafarge; Tri Q Nguyen; Paul J van Diest; Mitko Veta

2023 May 4;14:100316. doi: 10.1016/j.jpi.2023.100316

PMCID: PMC10238836 PMID: 37273455

Abstract

Introduction

Breast cancer (BC) prognosis is largely influenced by histopathological grade, assessed according to the Nottingham modification of Bloom-Richardson (BR). Mitotic count (MC) is a component of histopathological grading but is prone to subjectivity. This study investigated whether mitoses counting in BC using digital whole slide images (WSI) compares better to light microscopy (LM) when assisted by artificial intelligence (AI), and to what extent differences in digital MC (AI assisted or not) result in BR grade variations.

Methods

Fifty BC patients with paired core biopsies and resections were randomly selected. Component scores for BR grade were extracted from pathology reports. MC was assessed using LM, WSI, and AI. Different modalities (LM-MC, WSI-MC, and AI-MC) were analyzed for correlation with scatterplots and linear regression, and for agreement in final BR with Cohen’s κ.

Results

MC modalities strongly correlated in both biopsies and resections: LM-MC and WSI-MC (R² 0.85 and 0.83, respectively), LM-MC and AI-MC (R² 0.85 and 0.95), and WSI-MC and AI-MC (R² 0.77 and 0.83). Agreement in BR between modalities was high in both biopsies and resections: LM-MC and WSI-MC (κ 0.93 and 0.83, respectively), LM-MC and AI-MC (κ 0.89 and 0.83), and WSI-MC and AI-MC (κ 0.96 and 0.73).

Conclusion

This first validation study shows that WSI-MC may compare better to LM-MC when AI is used. Agreement between BR grades based on the different mitoses counting modalities was high. These results suggest that mitoses counting on WSI is feasible, and they validate the presented AI algorithm for pathologist-supervised use in daily practice. Further research is required to advance our knowledge of AI-MC, but it appears at least non-inferior to LM-MC.

Keywords: Breast cancer, Artificial intelligence, Digital pathology, Light microscopy, Bloom & Richardson grade, Mitotic index

Highlights

• Breast cancer grading using mitotic count suffers from poor reproducibility.
• Mitotic count on whole slide images correlates well with AI-assisted mitotic count.
• AI-assisted mitotic counting can be implemented in daily practice as a viable alternative to traditional microscope counting.

Introduction

The yearly worldwide breast cancer (BC) incidence is over 2 million, making it the most commonly diagnosed cancer. Female BC currently ranks fifth in cancer mortality worldwide, and incidence keeps rising.¹ However, when diagnosed at an early stage, the prognosis of BC can be good.¹,² One of the strongest determinants of BC prognosis is histological grade, usually assessed according to the Nottingham modification of Bloom-Richardson (BR) grade.³,⁴ BR requires the pathologist to score 3 features: tubule formation, nuclear pleomorphism, and mitotic count (MC). Each feature gets a score from 1 to 3, and the summed score defines the grade: 3–5 is grade 1, 6–7 is grade 2, and 8–9 is grade 3. Grade 1 cancers have a significantly better survival than grade 2 or 3 cancers.³,⁵,⁶ Studies have shown histological grading, tumor size, and lymph node status to be of equal importance for the prognosis of BC.⁵,⁶ Furthermore, histological grade proved to be decisive in up to a third of treatment decisions.⁷

MC is, as a marker of tumor proliferation, the strongest constituent of BR grade, and a high MC is associated with poor prognosis.⁸⁻¹⁰ Several studies have shown moderate to good reproducibility for BR.¹¹⁻¹³ When focusing solely on MC, reproducibility also ranges from moderate to high.¹⁴,¹⁵ However, concerns about reproducibility remain, as one recent study again found substantial inter- and intra-laboratory variation in BR in more than 33 000 patients.⁷ Because of this variation and the importance of MC for the prognosis of BC, higher reproducibility is required.

With the development of digital whole slide imaging (WSI), breast cancer diagnostics have increasingly been performed digitally, as WSI have been validated for diagnostic purposes.¹⁶,¹⁷ It has been argued that standard WSI has limitations for reliable histologic grading: image quality may not be high enough to assess the MC properly in all cases because WSI lack a z-axis (i.e., the ability to fine-tune the focus), which pathologists often use when assessing MC microscopically. Limited pathologist familiarity with WSI in the clinical workflow might also be a limiting factor. In addition, using a computer mouse instead of a microscope requires a change in ergonomics, which might further influence pathologists' opinion of WSI. Two studies have shown that MCs on WSI and traditional light microscopy (LM) give comparable results.¹⁸,¹⁹ However, other studies suggest that although inter-observer agreement on WSI is similar to that on LM, MC tends to be systematically lower on WSI.¹⁶,¹⁷,²⁰,²¹

The increased usage of WSI has stimulated the rise of artificial intelligence (AI) algorithms in pathology. Several of these have been developed to assist the pathologist in performing MC, with the expectation of improving its reproducibility, and have often been tested in validation cohorts.⁵,¹⁹,²²⁻²⁷ The next step is to test AI algorithms in a clinical setting. The present study validates an in-house developed AI algorithm for mitoses counting in BC on digital WSI by comparing AI-supported MC to light microscopic MC and by evaluating the influence of putative differences between these MC modalities on BR grade.

Methods

Study design and population

Fifty BC patients with paired core biopsies and resections were randomly selected from the workflow of the Department of Pathology at the UMC Utrecht between December 2018 and February 2020. For each patient, tubular differentiation (scored 1, 2, or 3) and nuclear pleomorphism scores (1, 2, or 3) according to Elston and Ellis³ were taken from the original pathology report (grade distribution: 14 grade 1, 28 grade 2, and 8 grade 3). Approval from our Institutional Review Board was granted under application number TCBio-20-777.

An experienced Pathologist Assistant (PA) trained in breast microscopy first determined the most cellular and proliferative area of the tumor using LM without prior knowledge of the BR grade and MC. The MC was reassessed using LM (LM-MC) in 2 mm² of adjacent fields.¹⁴ The exact count was then converted to a score of 1, 2, or 3 points for ≤7, 8–12, and ≥13 mitoses, respectively. After a washout period of at least 2 months, MC was assessed digitally using WSI (WSI-MC), and after another 2-month washout period, MC was assessed with support of the AI algorithm (AI-MC).
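The conversion from an exact count to a score can be written as a small function. This is a minimal sketch of the thresholds stated above for a 2 mm² area; the function name `mitotic_score` is a hypothetical helper, not part of the study's software.

```python
def mitotic_score(count: int) -> int:
    """Map a mitotic count per 2 mm² to the 1-3 MC score.

    Thresholds follow the text: <=7 -> 1, 8-12 -> 2, >=13 -> 3.
    """
    if count < 0:
        raise ValueError("mitotic count cannot be negative")
    if count <= 7:
        return 1
    if count <= 12:
        return 2
    return 3


# Counts on either side of each threshold.
for count in (0, 7, 8, 12, 13, 40):
    print(count, mitotic_score(count))
```

Because the score changes only at 7/8 and 12/13, count differences between modalities matter for grading only when they cross one of these two boundaries.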

Before the AI algorithm was used, a standard operating procedure (SOP) document was written for the AI tool, and the PA was trained in its use in the test PACS environment.

Digital pathology and AI

Slides had routinely been scanned within the workflow of the UMC Utrecht at 40× magnification (resolution of 0.22 μm per pixel) with a Nanozoomer 2.0-XR (Hamamatsu, Japan). All WSI were viewed using standard high-resolution 4k computer screens in the Sectra PACS (Linköping, Sweden).

The automated mitosis detection system was developed internally based on the methodology introduced by Cireşan et al²⁸ and the improvements upon this work by Lafarge et al.²⁹ The model was trained using TensorFlow 1.12 on Python 2.7 and is based on rotation-invariant group convolutional neural networks. We trained the network on the TUPAC16 and AMIDA13 grand challenge (GC) datasets together with a smaller annotated dataset containing mostly hard negatives and ink artifacts to improve robustness. Most GC datasets include examples from within the tumor and rarely from the periphery of the slide, where most ink artifacts and other mimics are found, which can degrade performance when inference is run on whole slides.

The model is a 6-layer group CNN whose architecture is described in detail in Lafarge et al.²⁹ In short, we used a patch size of 68×68 pixels and a batch size of 64, and training was performed on NVIDIA K80 and NVIDIA V100 hardware. We evaluated the model on the test sets of the GC datasets and used the F2-score threshold for the clinical implementation. In contrast to the F1-score, which weights precision and recall equally, the F2-score gives more weight to recall. This threshold lets the pathologist review more candidate objects without being overwhelmed by too many of them.
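To make the F1 versus F2 trade-off concrete, the sketch below computes the general F-beta score and sweeps candidate thresholds over a toy list of detections. It assumes that the "F2-score threshold" means the detection threshold that maximizes F2 on a test set, which the text does not spell out. The names `f_beta` and `best_threshold` and all scores in the example are hypothetical.

```python
def f_beta(tp: int, fp: int, fn: int, beta: float = 1.0) -> float:
    """F-beta score; beta > 1 weights recall more heavily than precision."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


def best_threshold(scored, n_true: int, beta: float = 2.0) -> float:
    """Return the probability threshold with the highest F-beta.

    scored: list of (probability, is_true_mitosis) for detected candidates.
    n_true: total number of annotated mitoses (detected or not).
    """
    best_score, best_t = -1.0, None
    for t in sorted({p for p, _ in scored}):
        tp = sum(1 for p, is_true in scored if p >= t and is_true)
        fp = sum(1 for p, is_true in scored if p >= t and not is_true)
        score = f_beta(tp, fp, n_true - tp, beta)
        if score > best_score:
            best_score, best_t = score, t
    return best_t


# Toy detections (made-up values, not study data).
candidates = [(0.9, True), (0.8, True), (0.7, True), (0.6, False),
              (0.5, True), (0.4, False), (0.3, False), (0.2, True)]
print(best_threshold(candidates, n_true=5, beta=1.0))  # 0.5
print(best_threshold(candidates, n_true=5, beta=2.0))  # 0.2
```

In this toy example the F2-optimal threshold is lower than the F1-optimal one: it accepts three extra false positives to recover the last true mitosis, which matches the stated intent of showing the reviewer more candidates.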

The model takes a large image patch at 40× magnification and generates a probability map for that patch. Local-maxima extraction on this map then yields the positions of candidate mitoses. The MC AI algorithm (both the model and its integration with the PACS) was developed in-house. In the Sectra PACS, an area of interest of the appropriate size of 2 mm² (as described for LM-MC) is interactively drawn, after which the algorithm automatically identifies candidate mitoses and mitosis-like objects and displays them in 2 galleries. Objects are interactively reviewed and dragged to the correct gallery, resulting in a final AI-MC per 2 mm² (Fig. 1).
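The local-maxima step can be illustrated on a small grid. The sketch below is a generic pure-Python version, not the in-house implementation: it keeps a pixel when its probability reaches a threshold and is strictly higher than every other pixel in a square window. The function name, the threshold of 0.5, the window radius, and the toy map are all assumptions for illustration.

```python
def local_maxima(prob_map, threshold=0.5, radius=1):
    """Return (row, col) positions of thresholded local maxima.

    A pixel is kept when its value is >= threshold and strictly greater
    than all other values within `radius` pixels (square window).
    Flat plateaus are therefore not reported.
    """
    n_rows = len(prob_map)
    n_cols = len(prob_map[0]) if n_rows else 0
    peaks = []
    for r in range(n_rows):
        for c in range(n_cols):
            value = prob_map[r][c]
            if value < threshold:
                continue
            is_peak = all(
                value > prob_map[i][j]
                for i in range(max(0, r - radius), min(n_rows, r + radius + 1))
                for j in range(max(0, c - radius), min(n_cols, c + radius + 1))
                if (i, j) != (r, c)
            )
            if is_peak:
                peaks.append((r, c))
    return peaks


# Toy probability map with two blobs (made-up values).
toy_map = [
    [0.1, 0.2, 0.1, 0.0, 0.0],
    [0.2, 0.9, 0.3, 0.0, 0.0],
    [0.1, 0.3, 0.2, 0.4, 0.6],
    [0.0, 0.0, 0.1, 0.5, 0.8],
]
print(local_maxima(toy_map))  # [(1, 1), (3, 4)]
```

Each blob yields one position even though several of its pixels exceed the threshold, so the number of returned positions is the number of candidate objects offered for review.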

Fig. 1 — Screenshot of the Sectra PACS where an area of interest has interactively been drawn on the right-hand side, after which an AI algorithm has found candidate mitoses and mitosis-like objects, which are displayed in the galleries in the upper left-hand side of the screen. By clicking on a thumbnail in either of the galleries, the PACS displays the candidate object in the center on the right for review, and false positives can be dragged to the negative gallery and vice versa, after which a final AI supported MC is established.

Data analysis

Using the MC from the 3 modalities, 3 BR grades were composed for each biopsy and resection in the usual way, by summing the scores for tubular differentiation, nuclear pleomorphism, and MC: a total score of 3–5 defines grade 1, 6–7 grade 2, and 8–9 grade 3. Data for biopsies and resections were analyzed separately. MC data were displayed pairwise in logarithmic scatterplots with reference lines between the different MC modalities, and R² was calculated to detect systematic differences. To assess the concordance in BR resulting from the different MC modalities, crosstabs were created and Cohen’s κ was used to assess BR agreement between the modalities.³⁰ Scores of 0 meant no agreement, 0.01–0.20 none to slight, 0.21–0.40 fair, 0.41–0.60 moderate, 0.61–0.80 substantial, and 0.81–1.00 almost perfect agreement.³² All statistics were done using Python version 3.8.5 with the scikit-learn 1.0.2 and pingouin 0.5.2 packages.
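The grade composition and the κ interpretation bands described above can be sketched as two small functions. The names `br_grade` and `kappa_label` are hypothetical helpers; treating κ at or below 0 as "no agreement" is an assumption, since the text only defines that label for a score of 0.

```python
def br_grade(tubules: int, pleomorphism: int, mitotic: int) -> int:
    """Combine three component scores (each 1-3) into BR grade 1-3."""
    for score in (tubules, pleomorphism, mitotic):
        if score not in (1, 2, 3):
            raise ValueError("each component score must be 1, 2, or 3")
    total = tubules + pleomorphism + mitotic
    if total <= 5:
        return 1
    if total <= 7:
        return 2
    return 3


def kappa_label(kappa: float) -> str:
    """Translate Cohen's kappa into the agreement bands used in the text."""
    if kappa <= 0:
        return "no agreement"
    if kappa <= 0.20:
        return "none to slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


# Same tubule and pleomorphism scores, MC score differing by one point.
print(br_grade(2, 3, 2))  # 2
print(br_grade(2, 3, 3))  # 3
print(kappa_label(0.73))  # substantial
```

The first two calls show how a one-point difference in MC score between modalities changes the grade only when the summed score sits at a grade boundary (here 7 versus 8).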

Results

Biopsies

Scatterplots for pairwise comparison between the 3 MC modalities are shown in Figs. 2–4. All MC modalities were strongly correlated: R² was 0.85 between LM-MC and WSI-MC, 0.85 between LM-MC and AI-MC, and 0.77 between WSI-MC and AI-MC.

Fig. 4 — Scatterplot showing a high concordance between artificial intelligence-based mitotic count (AI-MC) and whole slide image-based digital MC (WSI-MC) in 50 breast cancer biopsies.
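For readers who want to compute such R² values from their own paired counts, a minimal sketch follows. It rests on assumptions: the text does not state whether the regression was fitted on raw or log-transformed counts, so both are shown, with log(count + 1) used to tolerate zero counts. The counts are made-up illustration values, not study data, and `r_squared` and `log_counts` are hypothetical helper names.

```python
import math


def r_squared(x, y):
    """R² of a simple least-squares line y ~ a + b*x.

    For one predictor this equals the squared Pearson correlation.
    """
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two equally long sequences of length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    if sxx == 0 or syy == 0:
        raise ValueError("R² is undefined when one variable is constant")
    return sxy * sxy / (sxx * syy)


def log_counts(counts):
    """Log-transform counts; the +1 offset is an assumption to allow zeros."""
    return [math.log(c + 1) for c in counts]


# Hypothetical paired counts per 2 mm² for two modalities.
lm_counts = [0, 2, 5, 9, 14, 30]
ai_counts = [0, 1, 5, 8, 12, 27]
print(round(r_squared(lm_counts, ai_counts), 3))
print(round(r_squared(log_counts(lm_counts), log_counts(ai_counts)), 3))
```

A high R² indicates a tight linear relation but does not by itself rule out a systematic offset between modalities, which is why the scatterplots include reference lines.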

The crosstabs for the BR grades resulting from the different MC modalities are shown in Tables 1–3, all showing high κ values: 0.93 for LM-MC versus WSI-MC-based BR, 0.89 for LM-MC versus AI-MC-based BR, and 0.96 for WSI-MC versus AI-MC-based BR.

Table 1.

Crosstab between Bloom & Richardson (BR) grade based on light microscopic mitotic count (LM-MC) and artificial intelligence supported MC (AI-MC) in 50 breast cancer biopsies (κ=0.894, 95% CI 0.78–1.01).

	AI-MC-based grade
LM-MC-based grade	1	2	3	Total
1	17	0	0	17
2	1	26	1	28
3	0	1	4	5
Total	18	27	5	50
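The κ reported for Table 1 can be recomputed directly from the crosstab. The sketch below implements unweighted Cohen's κ in plain Python; the choice of the unweighted form is an assumption, although it does reproduce the reported value. The function name `cohens_kappa` is a hypothetical helper, not the scikit-learn or pingouin call used in the study.

```python
def cohens_kappa(table):
    """Unweighted Cohen's kappa from a square crosstab (list of rows)."""
    k = len(table)
    n = sum(sum(row) for row in table)
    # Observed agreement: share of cases on the diagonal.
    observed = sum(table[i][i] for i in range(k)) / n
    # Chance agreement from the row and column totals.
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(k)) for j in range(k)]
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / (n * n)
    return (observed - expected) / (1 - expected)


# Table 1: rows are LM-MC-based grades, columns AI-MC-based grades.
table_1 = [
    [17, 0, 0],
    [1, 26, 1],
    [0, 1, 4],
]
print(round(cohens_kappa(table_1), 3))  # 0.894
```

Here 47 of 50 cases lie on the diagonal (observed agreement 0.94) against a chance agreement of 0.4348, giving κ = 0.894.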

Table 2.

Crosstab between Bloom & Richardson (BR) grade based on light microscopic mitotic count (LM-MC) and whole slide image-based digital MC (WSI-MC) in 50 breast cancer biopsies (κ=0.928, 95% CI 0.83–1.01).

	WSI-MC-based grade
LM-MC-based grade	1	2	3	Total
1	17	0	0	17
2	1	27	0	28
3	0	1	4	5
Total	18	28	4	50

Table 3.

Crosstab between Bloom & Richardson (BR) grade based on whole slide image-based digital mitotic count (WSI-MC) and artificial intelligence supported MC (AI-MC) in 50 breast cancer biopsies (κ=0.964, 95% CI 0.90–1.03).

	AI-MC-based grade
WSI-MC-based grade	1	2	3	Total
1	18	0	0	18
2	0	27	1	28
3	0	0	4	4
Total	18	27	5	50

Resections

Scatterplots for pairwise comparison between the 3 MC modalities are shown in Figs. 5–7. All MC modalities were strongly correlated: R² was 0.83 between LM-MC and WSI-MC, 0.95 between LM-MC and AI-MC, and 0.83 between WSI-MC and AI-MC.

Fig. 7 — Scatterplot showing a high concordance between artificial intelligence-based mitotic count (AI-MC) and whole slide image-based digital MC (WSI-MC) in 50 breast cancer resections.

The crosstabs for the BR grades resulting from the different MC modalities are shown in Tables 4–6, all showing high κ values: 0.83 for LM-MC versus WSI-MC-based BR, 0.83 for LM-MC versus AI-MC-based BR, and 0.73 for WSI-MC versus AI-MC-based BR.

Table 4.

Crosstab between Bloom & Richardson (BR) grade based on light microscopic mitotic count (LM-MC) and whole slide image-based digital MC (WSI-MC) in 50 breast cancer resections (κ=0.834, 95% CI 0.70–0.97).

	WSI-MC-based grade
LM-MC-based grade	1	2	3	Total
1	13	1	0	14
2	2	24	2	28
3	0	0	8	8
Total	15	25	10	50

Table 5.

Crosstab between Bloom & Richardson (BR) grade based on light microscopic mitotic count (LM-MC) and artificial intelligence supported MC (AI-MC) in 50 breast cancer resections (κ=0.825, 95% CI 0.68–0.97).

	AI-MC-based grade
LM-MC-based grade	1	2	3	Total
1	13	1	0	14
2	2	26	0	28
3	0	2	6	8
Total	15	29	6	50

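Table 6.

Crosstab between Bloom & Richardson (BR) grade based on whole slide image-based digital mitotic count (WSI-MC) and artificial intelligence supported MC (AI-MC) in 50 breast cancer resections (κ=0.73).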

	AI-MC-based grade
WSI-MC-based grade	1	2	3	Total
1	13	2	0	15
2	2	23	0	25
3	0	4	6	10
Total	15	29	6	50

Discussion

In this study, we investigated whether mitoses counting in BC using digital WSI compares better to LM-MC when assisted by AI, and to what extent differences in digital MC (AI assisted or not) result in BR grade variations.

For biopsies, R² between LM-MC and AI-MC equaled that between LM-MC and WSI-MC (both 0.85), even though the latter pair already correlated well. For resections, R² between LM-MC and AI-MC (0.95) even surpassed that between LM-MC and WSI-MC (0.83). These data suggest that AI-MC correlates with LM-MC at least as well as WSI-MC does, and may compare better to LM.

AI-MC resulted in systematically slightly lower MC values than LM-MC and WSI-MC. This indicates that the AI algorithm may miss some mitoses and needs further improvement. Alternatively, the observer who checked the results may not have been critical enough when reviewing true mitoses that the AI had classified as mitosis-like objects. Either effect could lower AI-MC compared to LM-MC and WSI-MC, which underlines the importance of careful human supervision of algorithm output when AI is used in daily practice.

Several other studies showed similar results regarding the comparability between LM-MC and WSI-MC.¹⁶,²⁰,³¹,³² Noted differences between LM-MC and WSI-MC were perceived to be within the range of inter-observer differences in LM-MC. Also, studies which used 40× magnification for scanning and high-resolution displays noted that differences between WSI and LM tended to get smaller, suggesting that a certain standard of technology is required for proper mitoses counting on WSI. As to AI, a recent study applying AI to select a mitoses hotspot in which to count showed improved inter-observer agreement in interactive mitoses counting on WSI, with similar inter-observer κ values for LM-MC and AI-MC.¹⁹ However, one study demonstrated higher inter-observer agreement for AI-MC compared to LM-MC, and a substantial saving in time.³³ So, different studies seem to point at least to non-inferiority of AI-MC compared to LM-MC in BC. The potential to save time is another reason to further explore the possibilities of AI.

Agreement in BR between modalities ranged from substantial to almost perfect in both biopsies and resections, with the lowest κ (0.73) for WSI-MC versus AI-MC-based BR in the resection group. This indicates that differences in MC between modalities have only a limited influence on BR grade.

One study compared BR based on LM and WSI in over 1600 cases, showing a strong association (Cramér’s V: 0.58) between both modalities.¹⁶ Another study, focusing on inter-observer differences in BR when using WSI, showed the concordance to be similar to inter-observer differences in BR using LM.²¹ These studies substantiate our results. To the best of our knowledge, no previous study has compared agreement of BR using LM-MC or WSI-MC and AI-MC. The high agreement in BR in this study is probably related to 2 factors. Firstly, WSI-MC and AI-MC were performed on the exact same slide as LM-MC, whereas larger tumors may be heterogeneous across different tissue blocks. Secondly, grading in the different modalities was assessed by the same observer, so the criteria for mitotic figures were interpreted consistently and the chance of selecting the same hotspot increased.

This study has some limitations. First, the gold standard is LM-MC assessed by a single observer. Because of significant inter-observer differences for LM-MC, a study with multiple observers may provide a more realistic view of the added value of AI. Another option would be to use phosphohistone H3 immunohistochemistry, which enhances recognition of mitotic figures and may make LM-MC (and perhaps even AI-MC) more reproducible.³⁴ Second, this study has a relatively small number of cases.

In daily pathology practice, digital WSI is increasingly used worldwide. This study, in combination with previous studies in this field, shows WSI-MC to be suitable for grading BC. Especially pathology laboratories which have a digital workflow could thereby incorporate WSI-MC in their daily practice of grading BC.

In general, AI algorithms show great promise in improving pathology practice. This study demonstrates that mitoses counting in BC can be performed with the support of an AI algorithm, and that AI-MC might compare better to LM-MC than unassisted WSI-MC does. We expect next-generation algorithms to improve even further.³⁵ These algorithms may also save valuable interaction time for the pathologist, especially when they run in the background on WSI and present the pathologist with mitotic hotspots.

In conclusion, this first validation study shows that WSI-MC might compare better to LM-MC when AI is used. Agreement in BR between the different modalities was high. WSI-MC appears to be a viable alternative to LM-MC. Further research is required to advance our knowledge of AI-MC, but it appears at least non-inferior to LM-MC and has the potential to save time.

Funding sources

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

1. Sung H., Ferlay J., Siegel R.L., et al. Global Cancer Statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2021;71:209–249. doi: 10.3322/caac.21660.
2. Ahmad A. In: Breast Cancer Metastasis and Drug Resistance: Challenges and Progress. Advances in Experimental Medicine and Biology. Ahmad A., editor. Springer International Publishing; 2019. Breast cancer statistics: recent trends; pp. 1–7.
3. Elston C.W., Ellis I.O. Pathological prognostic factors in breast cancer. I. The value of histological grade in breast cancer: experience from a large study with long-term follow-up. Histopathology. 1991;19:403–410. doi: 10.1111/j.1365-2559.1991.tb00229.x.
4. Genestie C., Zafrani B., Asselain B., et al. Comparison of the prognostic value of Scarff-Bloom-Richardson and Nottingham histological grades in a series of 825 cases of breast cancer: major importance of the mitotic count as a component of both grading systems. Anticancer Res. 1998;18:571–576. doi: 10.1038/modpathol.3800161.
5. van Dooijeweert C., van Diest P.J., Ellis I.O. Grading of invasive breast carcinoma: the way forward. Virchows Archiv. 2021;1:1–11. doi: 10.1007/s00428-021-03141-2.
6. Rakha E.A., Reis-Filho J.S., Baehner F., et al. Breast cancer prognostic classification in the molecular era: the role of histological grade. Breast Cancer Res. 2010;12(4) doi: 10.1186/bcr2607.
7. van Dooijeweert C., van Diest P.J., Willems S.M., et al. Significant inter- and intra-laboratory variation in grading of invasive breast cancer: A nationwide study of 33,043 patients in the Netherlands. Int. J. Cancer. 2020;146:769–780. doi: 10.1002/ijc.32330.
8. van Diest P.J., van der Wall E., Baak J.P.A. Prognostic value of proliferation in invasive breast cancer: a review. J Clin Pathol. 2004;57:675. doi: 10.1136/jcp.2003.010777.
9. Baak J.P.A., van Diest P.J., Voorhorst F.J., et al. Prospective multicenter validation of the independent prognostic value of the mitotic activity index in lymph node-negative breast cancer patients younger than 55 years. J. Clin. Oncol. 2005;23:5993–6001. doi: 10.1200/JCO.2005.05.511.
10. Klintman M., Strand C., Ahlin C., et al. The prognostic value of mitotic activity index (MAI), phosphohistone H3 (PPH3), cyclin B1, cyclin A, and Ki67, alone and in combinations, in node-negative premenopausal breast cancer. PLoS One. 2013;8 doi: 10.1371/journal.pone.0081902.
11. Meyer J.S., Alvarez C., Milikowski C., et al. Breast carcinoma malignancy grading by Bloom-Richardson system vs proliferation index: reproducibility of grade and advantages of proliferation index. Modern Pathol. 2005;18:1067–1078. doi: 10.1038/modpathol.3800388.
12. Robbins P., Pinder S., de Klerk N., et al. Histological grading of breast carcinomas: a study of interobserver agreement. Human Pathol. 1995;26:873–879. doi: 10.1016/0046-8177(95)90010-1.
13. Theissig F., Kunze K.D., Haroske G., et al. Histological grading of breast cancer: interobserver, reproducibility and prognostic significance. Pathol Res Pract. 1990;186:732–736. doi: 10.1016/S0344-0338(11)80263-3.
14. van Diest P.J., Baak J.P.A., Matze-Cok P., et al. Reproducibility of mitosis counting in 2,469 breast cancer specimens: Results from the Multicenter Morphometric Mammary Carcinoma Project. Human Pathol. 1992;23:603–607. doi: 10.1016/0046-8177(92)90313-r.
15. Boiesen P., Bendahl P.O., Anagnostaki L., et al. Histologic grading in breast cancer–reproducibility between seven pathologic departments. South Sweden Breast Cancer Group. Acta Oncol. 2000;39(1):41–45. doi: 10.1080/028418600430950.
16. Rakha E.A., Aleskandarani M., Toss M.S., et al. Breast cancer histologic grading using digital microscopy: concordance and outcome association. J Clin Pathol. 2018;71:680–686. doi: 10.1136/jclinpath-2017-204979.
17. Williams B., Hanby A., Millican-Slater R., et al. Digital pathology for primary diagnosis of screen-detected breast lesions - experimental data, validation and experience from four centres. Histopathology. 2020;76:968–975. doi: 10.1111/his.14079.
18. Al-Janabi S., van Slooten H.J., Visser M., et al. Evaluation of mitotic activity index in breast cancer using whole slide digital images. PLoS One. 2013;8(12) doi: 10.1371/journal.pone.0082576.
19. Balkenhol M.C.A., Tellez D., Vreuls W., et al. Deep learning assisted mitotic counting for breast cancer. Lab Investig. 2019;99:1596–1606. doi: 10.1038/s41374-019-0275-0.
20. Lashen A., Ibrahim A., Katayama A., et al. Visual assessment of mitotic figures in breast cancer: a comparative study between light microscopy and whole slide images. Histopathology. 2021;79:913–925. doi: 10.1111/his.14543.
21. Ginter P.S., Idress R., D’Alfonso T.M., et al. Histologic grading of breast carcinoma: a multi-institution study of interobserver variation using virtual microscopy. Modern Pathol. 2021;34:701–709. doi: 10.1038/s41379-020-00698-2.
22. Malon C., Brachtel E., Cosatto E., et al. Mitotic figure recognition: agreement among pathologists and computerized detector. Anal Cell Pathol. (Amsterdam) 2012;35(2):97. doi: 10.3233/ACP-2011-0029.
23. Veta M., van Diest P.J., Willems S.M., et al. Assessment of algorithms for mitosis detection in breast cancer histopathology images. Med Image Anal. 2015;20(1):237–248. doi: 10.1016/j.media.2014.11.010.
24. Roux L., Racoceanu D., Loménie N., et al. Mitosis detection in breast cancer histological images An ICPR 2012 contest. J Pathol Inform. 2013;4(1):8. doi: 10.4103/2153-3539.112693.
25. Nateghi R., Danyali H., Helfroush M.S. A deep learning approach for mitosis detection: application in tumor proliferation prediction from whole slide images. Artif Intel Med. 2021;114 doi: 10.1016/j.artmed.2021.102048.
26. Li C., Wang X., Liu W., et al. Weakly supervised mitosis detection in breast histopathology images using concentric loss. Med Image Anal. 2019;53:165–178. doi: 10.1016/j.media.2019.01.013. https://pubmed.ncbi.nlm.nih.gov/30798116/
27. Bertram C.A., Aubreville M., Donovan T.A., et al. Computer-assisted mitotic count using a deep learning-based algorithm improves interobserver reproducibility and accuracy. Vet Pathol. 2022;59:211–226. doi: 10.1177/03009858211067478.
28. Cireşan D.C., Giusti A., Gambardella L.M., Schmidhuber J. International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer; Berlin, Heidelberg: 2013, September. Mitosis detection in breast cancer histology images with deep neural networks; pp. 411–418.
29. Lafarge M.W., Bekkers E.J., Pluim J.P., et al. Roto-translation equivariant convolutional networks: application to histopathology image analysis. Med Image Anal. 2021;68 doi: 10.1016/j.media.2020.101849.
30. McHugh M.L. Interrater reliability: the kappa statistic. Biochem Med. 2012;22:276–282. doi: 10.11613/BM.2012.031.
31. Wei B.R., Halsey C.H., Hoover S.B., et al. Agreement in histological assessment of mitotic activity between microscopy and digital whole slide images informs conversion for clinical diagnosis. Acad Pathol. 2019;6 doi: 10.1177/2374289519859841. 2374289519859841.
32. Shaw E.C., Hanby A.M., Wheeler K., et al. Observer agreement comparing the use of virtual slides with glass slides in the pathology review component of the POSH breast cancer cohort study. J Clin Pathol. 2012;65:403–408. doi: 10.1136/jclinpath-2011-200369.
33. Pantanowitz L., Hartman D., Qi Y., et al. Accuracy and efficiency of an artificial intelligence tool when counting breast mitoses. Diagn Pathol. 2020;15:80. doi: 10.1186/s13000-020-00995-z.
34. van Steenhoven J.E.C., Kuijer A., Kornegoor R., et al. Assessment of tumour proliferation by use of the mitotic activity index, and Ki67 and phosphohistone H3 expression, in early-stage luminal breast cancer. Histopathology. 2020;77:579–587. doi: 10.1111/his.14185.
35. Aubreville M., Stathonikos N., Bertram C.A., et al. Mitosis domain generalization in histopathology images - the MIDOG challenge. arXiv:2204.03742 [eess.IV]. https://arxiv.org/pdf/2204.03742.pdf. Preprint.

[bb0095] 19.Balkenhol M.C.A., Tellez D., Vreuls W., et al. Deep learning assisted mitotic counting for breast cancer. Lab Investig. 2019;99:1596–1606. doi: 10.1038/s41374-019-0275-0. [DOI] [PubMed] [Google Scholar]

[bb0100] 20.Lashen A., Ibrahim A., Katayama A., et al. Visual assessment of mitotic figures in breast cancer: a comparative study between light microscopy and whole slide images. Histopathology. 2021;79:913–925. doi: 10.1111/his.14543. [DOI] [PubMed] [Google Scholar]

[bb0105] 21.Ginter P.S., Idress R., D’Alfonso T.M., et al. Histologic grading of breast carcinoma: a multi-institution study of interobserver variation using virtual microscopy. Modern Pathol. 2021;34:701–709. doi: 10.1038/s41379-020-00698-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0110] 22.Malon C., Brachtel E., Cosatto E., et al. Mitotic figure recognition: agreement among pathologists and computerized detector. Anal Cell Pathol. (Amsterdam) 2012;35(2):97. doi: 10.3233/ACP-2011-0029. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0115] 23.Veta M., van Diest P.J., Willems S.M., et al. Assessment of algorithms for mitosis detection in breast cancer histopathology images. Med Image Anal. 2015;20(1):237–248. doi: 10.1016/j.media.2014.11.010. [DOI] [PubMed] [Google Scholar]

[bb0120] 24.Roux L., Racoceanu D., Loménie N., et al. Mitosis detection in breast cancer histological images An ICPR 2012 contest. J Pathol Inform. 2013;4(1):8. doi: 10.4103/2153-3539.112693. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0125] 25.Nateghi R., Danyali H., Helfroush M.S. A deep learning approach for mitosis detection: application in tumor proliferation prediction from whole slide images. Artif Intel Med. 2021;114 doi: 10.1016/j.artmed.2021.102048. [DOI] [PubMed] [Google Scholar]

[bb0130] 26.Li C., Wang X., Liu W., et al. Weakly supervised mitosis detection in breast histopathology images using concentric loss. Med Image Anal. 2019;53:165–178. doi: 10.1016/j.media.2019.01.013. https://pubmed.ncbi.nlm.nih.gov/30798116/ [DOI] [PubMed] [Google Scholar]

[bb0135] 27.Bertram C.A., Aubreville M., Donovan T.A., et al. Computer-assisted mitotic count using a deep learning-based algorithm improves interobserver reproducibility and accuracy. Vet Pathol. 2022;59:211–226. doi: 10.1177/03009858211067478. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0140] 28.Cireşan D.C., Giusti A., Gambardella L.M., Schmidhuber J. International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer; Berlin, Heidelberg: 2013, September. Mitosis detection in breast cancer histology images with deep neural networks; pp. 411–418. [DOI] [PubMed] [Google Scholar]

[bb0145] 29.Lafarge M.W., Bekkers E.J., Pluim J.P., et al. Roto-translation equivariant convolutional networks: application to histopathology image analysis. Med Image Anal. 2021;68 doi: 10.1016/j.media.2020.101849. [DOI] [PubMed] [Google Scholar]

[bb0150] 30.McHugh M.L. Interrater reliability: the kappa statistic. Biochem Med. 2012;22:276–282. doi: 10.11613/BM.2012.031. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0155] 31.Wei B.R., Halsey C.H., Hoover S.B., et al. Agreement in histological assessment of mitotic activity between microscopy and digital whole slide images informs conversion for clinical diagnosis. Acad Pathol. 2019;6 doi: 10.1177/2374289519859841. 2374289519859841. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0160] 32.Shaw E.C., Hanby A.M., Wheeler K., et al. Observer agreement comparing the use of virtual slides with glass slides in the pathology review component of the POSH breast cancer cohort study. J Clin Pathol. 2012;65:403–408. doi: 10.1136/jclinpath-2011-200369. [DOI] [PubMed] [Google Scholar]

[bb0165] 33.Pantanowitz L., Hartman D., Qi Y., et al. Accuracy and efficiency of an artificial intelligence tool when counting breast mitoses. Diagn Pathol. 2020;15:80. doi: 10.1186/s13000-020-00995-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0170] 34.van Steenhoven J.E.C., Kuijer A., Kornegoor R., et al. Assessment of tumour proliferation by use of the mitotic activity index, and Ki67 and phosphohistone H3 expression, in early-stage luminal breast cancer. Histopathology. 2020;77:579–587. doi: 10.1111/his.14185. [DOI] [PMC free article] [PubMed] [Google Scholar]

35. Aubreville M., Stathonikos N., Bertram C.A., et al. Mitosis domain generalization in histopathology images - the MIDOG challenge. arXiv:2204.03742 [eess.IV]. https://arxiv.org/pdf/2204.03742.pdf. Preprint.
