Abstract
Immunohistochemical data (IHC) plays an important role in clinical practice, and is typically gathered in a semi-quantitative fashion that relies on some degree of visual scoring. However, visual scoring by a pathologist is inherently subjective and manifests both intra-observer and inter-observer variability. In this study, we introduce a novel computer-aided quantification methodology for immunohistochemical scoring that uses the algebraic concept of persistent homology. Using 8 bit grayscale image data derived from 90 specimens of invasive ductal carcinoma of the breast, stained for the replicative marker Ki-67, we computed homology classes. These were then compared to nuclear grades and the Ki-67 labeling indices obtained by visual scoring. Three metrics for IHC staining were newly defined: Persistent Homology Index (PHI), center coordinates of positive and negative groups, and the sum of squares within groups (WSS). This study demonstrates that PHI, a novel index for immunohistochemical labeling using persistent homology, can produce highly similar data to that generated by a pathologist using visual evaluation. The potential benefits associated with our novel technology include both improved quantification and reproducibility. Since our method reflects cellularity and nuclear atypia, it carries a greater quantity of biologic data compared to conventional evaluation using Ki-67.
Introduction
Breast cancer is the most commonly diagnosed cancer in Japanese women1, for which invasive ductal carcinoma is the most frequent type of invasive carcinoma. Its histological or nuclear grading provides useful data for predicting prognoses2,3. Ki-67, a nuclear protein and cellular marker of proliferation, is used as a predictive and prognostic marker in many cancers. Multiple studies have shown that a higher proliferation rate (i.e. elevated staining of Ki-67) is associated with a poor prognosis4–6. Further, the St. Gallen Consensus (2009) states that the Ki-67 labeling index is a useful parameter with which to select patients with hormone receptor-positive breast cancers who may benefit from chemotherapy as well as endocrine therapy. Breast cancers are classified as low, intermediate, or highly proliferative according to their Ki-67 labeling index, with these categories linked to labeling indices of under 15%, 16–30%, or over 30%, respectively7. On the other hand, gene expression profiling enabled us to classify breast tumors under five intrinsic subtypes, i.e., luminal A, luminal B, HER2 over-expression, basal and normal-like tumors8,9. This classification is approximated by information of ER, PgR, HER2, and Ki-6710,11. However, since the evaluation of Ki-67 labeling is itself not standardized, and visual scoring is used in many facilities, index values vary considerably.
In this study, we propose a novel quantitative evaluation method for immunohistochemical labeling based on persistent homology. Persistent homology is a relatively new algebraic tool developed from a practical standpoint (details are provided in the section of Methods). In terms of shape recognition in image analysis, the classical tools for capturing the characteristics of shape from black and white images are sensitive to noise in data sets. Coping with this difficulty, persistent homology enables us to measure stable topological features in a meaningful manner. The presumption is that stable topological features (and by inference the most valuable) are revealed by their persistence. In this work, we investigated the correlation between the newly defined persistent homology-derived index, nuclear grade, and Ki-67 labeling index, as obtained by traditional visual scoring.
Methods
Patients
Ethical approval for this study was obtained from the institutional review board (IRB) of the Hokkaido Cancer Center and Asahikawa Medical University, which waived the need for informed consent. All experiments and methods were performed in accordance with relevant guidelines and regulations. We selected 90 cases of invasive ductal carcinoma of the breast that were surgically resected at Hokkaido Cancer Center, Sapporo, Japan, from January 2015 through September 2015. The median patient age was 60 (range 25–87). 44 patients were diagnosed with scirrhous carcinoma, 41 with papillotubular carcinoma, and 5 with solid-tubular carcinoma. These 90 patients comprised 56 cases of tumors assigned the Japanese nuclear grade of 1, 19 cases of grade 2, and 15 cases of grade 3.
The Japanese nuclear grade is defined as the sum of two parameters, namely nuclear atypia (l for low-degree atypia; 2 for intermediate-degree atypia; 3 for high-degree atypia), and mitotic count/10 high power fields (x40 objective lens). A score of l denotes 0–4 mitoses. Two indicates 5–9 mitoses, and 3, ≥10 mitoses. To derive a nuclear grade we then sum the scores (g) for the two parameters. If g equals 2 or 3, then the nuclear grade is designated 1. For a g score of 4, the nuclear grade is 2. Grade 3 is reserved for g scores of 5 or 63. An initial assessment of Ki-67 staining was made at × 40 magnification, with the most intense fields selected for assessment using a Nikon ECLIPSE 80i type microscope (for which a × 40 field size corresponds to 0.0625mm2). Sections were scanned using a Scorpion IEEE-1394 camera (Point Grey Research® Inc.) with images saved in TIFF format (1280 × 960 pixels). The Ki-67 labeling index was recorded from the original pathology report. If this number was given as a range (e.g. 30–40%), then we used the mean value (35% in the example given) for statistical analyses. Scanned images (magnification: × 40, total images: 90) were processed using Apple MacPro (3.0 GHz 8Core).
Immunohistochemistry
Sections were cut at 3 to 4μm intervals and immuno-stained using the monoclonal antibody Ki-67 (clone 30-9, pre-diluted, Ventana-Roche, Arizona, USA). Immunohistochemical stains were subsequently performed using Bench Mark GX with the iView DAB Detection kit (Ventana-Roche, Arizona, USA).
Persistent Homology
Persistent homology is a fairly recent mathematical concept (reviewed12–17). Let M be a grayscale image of size X × Y with gray intensities 2n (in our case, n = 8 (8 bit grayscale)). M is considered to be an integer-valued function of two variables f(x, y) on the intervals [0, 2n − 1]. Let K i denote the sublevel set f −1([0, i]) of the function f. For each i, j , we have a sequence K:
1 |
which is called a sublevel set filtration. For each , we can calculate the p-th homology group and p-th Betti number 18. An element of is called a homology class. Intuitively, counts the number of connected components, and counts the number of “holes” or “voids”. For , the image of the induced homomorphism is called a persistent homology group, and its rank is called a persistent Betti number of f. We can track the birth (i) and death (j) of each homology class (feature) during the evolution of this sequence, which can be represented using the interval , which is called the persistence interval.
Persistent Betti numbers are visualized by plotting , i.e., [0,255]2 with multiplicity given by at each point in two dimensions. This plot, according to the birth and death of two dimensional coordinates is called a p-th persistence diagram (PDp), in which all points lie above the diagonal, i.e, i < j, and the vertical distance to the diagonal indicates the persistence of p-dimensional holes. Those points close to the diagonal (i.e. short vertical bars): { in PDp | |j − i| < δ} are classified as topological noise artifacts; long distances from the diagonal (i.e. persistent) represent more robust features. In this study, we assigned a value of 40 for δ. We computed persistence diagrams (PDp) from 8 bit grayscale image data using the open source software, Persistent Homology Algorithm Toolbox (PHAT, v1.4.1)19, and used MATLAB R2015b to draw persistence diagrams. To improve the visibility of the persistence diagrams, we divided them into 322 grids and demonstrated the multiplicity in each grid by using color gradients.
Simple Example of a Persistence Diagram
Figure 1 shows schematics that demonstrate a p-th persistence diagram PDp, with p = 0 for connected components, and p = 1 for holes, respectively. For simplicity we assume that there are only 3 levels of grayscale, namely, black (low, L), gray (medium, M), and white (high, H), corresponding to positive cells, negative cells, and background, respectively (Fig. 1(a1) and (b1)). We generated the filtration sequence for the pixel images by increasing the grayscale threshold from low to high values (Fig. 1(a2) and (b2)). In Fig. 1(a3) and (b3), the persistent Betti numbers are plotted as the colored squares in two-dimensional space with respect to the (birth, death)-coordinates. The white number on each square shows the multiplicities at a point (i, j) on the persistence diagram. Fig. 1(a4) shows the schematic image at the medium filtration level, in which each color of blocks are associated with the color of squares in (a3). Both of green- and magenta-colored blocks appear in the lower filtration level. Each 2 green-colored blocks are connected to the magenta-colored blocks through the orange-colored blocks, and then, they merge into the connected domains. We suppose that the green-colored blocks are adsorbed into the connected domain with the corresponding magenta-colored blocks. That is, the 2 green-colored blocks die at the medium filtration level and the 3 magenta-colored blocks survive to the higher level. Hence we put a green square at the point (L, M) with a multiplicity number 2, which means that the persistence intervals for the lower cells are [L, M]. Moreover we put a magenta square at the point (L, H), i.e., the persistence intervals for the upper cells are [L, H] with multiplicity number 3. On the other hand, the cyan square at the point (M, H) in (a3) indicates the 5 cyan-colored blocks which newly appear at the medium level in Fig. 1(a4). The persistence interval is [M, H] and the multiplicity number is 5. For the case of PD0 (Fig. 1(a3)), the zeroth persistent Betti numbers associated with positive and negative cells appear in the lower and higher birth value regions in the diagram, indicated by the magenta- and cyan-colored squares, respectively. For the case of PD1 (Fig. 1(b3)), the first persistent Betti numbers indicate the robustness of each hole. The white numbers depicted on colored squares indicate the multiplicities at each point (i, j) on the (birth, death)-coordinates. Fig. 1(b4) shows the schematic image colored at the medium filtration level. The magenta-colored blocks surround 2 holes at the lower level, and then these 2 magenta-colored holes disappear at the medium level by being filled with the orange-colored blocks, i.e., the persistence interval of the 2 magenta-colored holes is [L, M]. Therefore, we put a magenta square at the point (L, M) with the multiplicity number 2 in (b3). Moreover, the 2 cyan-colored holes newly appear and survive to the higher level. We put a cyan square at the point (M, H) corresponding to the persistence interval [M, H] with the multiplicity number 2. The purple-colored blocks in (b4) indicate the overlapped part of the magenta-colored and cyan-colored blocks.
Persistent Homology Index
The nuclei of the Ki-67 positive cells have smaller birth parameters and larger death parameters in the persistence diagram PD0; negative cells exhibit larger birth and death parameters. Therefore the homology classes of positive cells are distributed at the upper left hand region of the persistence diagram, with those of negative cells at the upper right hand region. The homology classes on the persistence diagrams are classified into two clusters using the k-means clustering algorithm20. In Fig. 2(d1–d3), we plot the persistent Betti number in the original 2562 grids using red and black colored points. The persistent Betti number associated with positive (resp. red) and (resp. black) negative cells appear in the upper-left and upper-right regions of PD0. Let C 1 and C 2 be cluster centers of two clusters, such that the birth-parameter of C 1 is smaller than that of C 2 (noting that the positively stained cells have smaller birth parameters). The homology classes that induce C 1 are called positive, and those that induce C 2 are called negative. Centerx1 and Centery1 denote the coordinates of C 1, and Centerx2 and Centery2 denote the coordinates of C 2. The persistent homology index (PHI) is defined by the quotient of the total number of positive homology classes divided by the total number of positive and negative homology classes.
2 |
Although this index resembles the Ki-67 labeling index it is actually extracted from a more sophisticated dataset insofar as the results are informed by cellularity and nuclear atypia, as well as by Ki-67 labeling.
The sum of squares between groups (BSS) represents the sum of the squares of the distances between the overall center, and the centers of all groups. The sum of squares within groups (WSS) is given by the sum of squares between the overall center within each group, and all points within the group. WSS1 denotes the WSS of a positive group, with WSS2 denoting a WSS of negative groups. The ratio of the sum of squares (RSS) is defined by . The larger the RSS becomes, the clearer the division between groups.
Statistical Analyses
Persistent homology indices and other data were compared with nuclear grade and the standardized Ki-67 labeling index (mean percentage) as extrapolated from pathology reports. Spearman correlation and Kruskal-Wallis tests were performed using statistical software R version 3.2.021.
Data Availability
The datasets generated and analyzed during the current study are available from the corresponding author on reasonable request.
Results
Persistence Diagrams
Figure 2 shows representative persistence diagrams of Ki-67 staining for the three different nuclear grades (Grade 1–Grade 3). Fig. 2(b1–b3 and c1–c3) show the zeroth and first persistence diagrams (PD0 and PD1) in which the space of [0,255]2 is divided into 322 grids, with the multiplicity of persistent Betti numbers at each grid indicated using the color gradient [0, 40]. The dark colored region indicates a high multiplicity of persistent Betti numbers. In Fig. 2(d1–d3), persistent Betti numbers of positive and negative cell groups are plotted using red and black colored points, respectively. The zeroth persistence diagrams PD0 of (d1-d3) is shown in two-dimensional space of [0,255]2. The Betti numbers corresponding to Ki-67 positive and negative cells appear in the upper left-hand side of the PD0 diagram, and upper right-hand side, respectively. With advancing nuclear grade, the number of positive cells increases.
Persistent Homology Index and the other indices obtained from the persistence diagrams PD0 and PD1 versus visual scoring of Ki-67 labeling index (LI) and nuclear Grades
Figure 3(a) and (b) show a box plot of the persistent homology index (PHI) and RSS, using the persistence diagram (vertical axis), versus a pathologist-derived visual score of nuclear grade (horizontal axis). We found that PHI was positively associated with nuclear grade (p < 0.0001: Kruskal-Wallis rank sum test). Mean values ± 95% CIs are provided in Table 1.
Table 1.
Grade | Mean | 95% CI | |
---|---|---|---|
PHI | G1 | 0.324 | 0.292–0.356 |
G2 | 0.435 | 0.362–0.508 | |
G3 | 0.616 | 0.570–0.662 | |
RSS | G1 | 0.663 | 0.642–0.683 |
G2 | 0.684 | 0.663–0.704 | |
G3 | 0.629 | 0.580–0.678 | |
WSS1 | G1 | 42.00 | 39.84–44.17 |
G2 | 42.97 | 39.00–46.94 | |
G3 | 41.57 | 37.91–45.23 | |
WSS2 | G1 | 26.57 | 25.23–27.90 |
G2 | 29.00 | 26.57–31.42 | |
G3 | 33.96 | 31.61–36.32 | |
G1 | 52.13 | 47.83–56.42 | |
G2 | 43.76 | 39.15–48.37 | |
G3 | 34.49 | 31.56–37.42 | |
G1 | 175.1 | 169.0–181.3 | |
G2 | 170.7 | 155.3–186.0 | |
G3 | 137.4 | 121.1–153.6 | |
G1 | 147.3 | 142.6–152.0 | |
G2 | 146.3 | 137.0–155.6 | |
G3 | 129.6 | 110.7–148.5 | |
G1 | 206.6 | 203.8–209.5 | |
G2 | 205.9 | 199.8–212.0 | |
G3 | 193.9 | 182.4–205.4 |
Figure 3(c) shows a scatter diagram for Persistent Homology index (PHI) versus the pathologist-derived visual scoring of the Ki-67 labeling index (LI). These data revealed a high correlation between PHI and visual scoring of Ki-67 LI (Spearman correlation 0.71, p < 0.0001). In the diagram, the red-colored points of G1 (resp. blue colored-points of G3) appear in the left-bottom (resp. right-upper) regions, respectively. We estimate the accuracy rate for binary classification with respect to the Ki-67 LI and PHI, respectively. We categorize total data points for 90 patients into two types of grades, i.e., 56 points of G1 (type A) and 34 points of G2 and G3 (type B). This binary categorization is often taken account in determining the chemotherapy treatments with the classification to luminal A and B types. Applying the binary logistic regression with respect to Ki-67 LI, we obtain the threshold value for the binary classification as 0.378 (indicated by the vertical line in the figure). Testing the threshold value for 56 data points of type A (resp. 34 points of type B), the 48 points (resp. 15 points) are correctly classified into type A (resp. type B). Hence the accuracy rate for the Ki-67 LI is 0.700 ( = 63/90). In the same manner, we obtain the threshold value with respect to PHI as 0.467 (indicated by the horizontal line in the figure). For 56 data points of type A (resp. 34 points of type B), the 50 points (resp. 25 points) are correctly classified into type A (resp. type B). The accuracy rate for the PHI is 0.833 ( = 75/90). It is remarkable that the sensitivity for the binary classification is clearly improved from 15/34 to 25/34 by replacing Ki-67 LI into PHI.
Figure 4(a–c) illustrate the characteristics of a group of positive cells, and shows the box plot of , , and WSS1 from the persistence diagram (vertical axis) versus a visual score of nuclear grade (again, on the horizontal axis). and were found to be negatively associated with nuclear grade (p < 0.0001: Kruskal-Wallis rank sum test). There is no significant difference in WSS1 (p = 0.7003: Kruskal-Wallis rank sum test). Mean values ± 95% CIs are shown in Table 1.
Figure 4(d–f) illustrate a similar treatment for the group of negative cells, with a box plot of , , and WSS2 from the persistence diagram (vertical axis), versus the pathologist-derived visual scores of nuclear grade (horizontal axis). and were negatively associated with nuclear grade (p = 0.00064: Kruskal-Wallis rank sum test; p = 0.01442: Kruskal-Wallis rank sum test, respectively), with WSS2 positively associated with nuclear grade (p < 0.0001:Kruskal-Wallis rank sum test). Mean values ± 95% CIs are shown in Table 1.
Figure 5 and Table 2 illustrate the characteristics for PD1 using a box plot of , , and WSS using the persistence diagram PD1 (vertical axis), versus the pathologist-derived visual scoring of nuclear grade (horizontal axis). and are both negatively associated with nuclear grade (p = 0.00019:Kruskal-Wallis rank sum test; p = 0.00413:Kruskal-Wallis rank sum test, respectively), whereas WSS is positively associated with nuclear grade (p = 0.00023:Kruskal-Wallis rank sum test). Mean values ± 95% CIs are shown in Table 2.
Table 2.
Grade | Mean | 95% CI | |
---|---|---|---|
G1 | 222.8 | 220.2–225.4 | |
G2 | 221.3 | 214.7–227.8 | |
G3 | 207.4 | 197.5–217.4 | |
G1 | 229.2 | 226.9–231.5 | |
G2 | 227.6 | 221.7–233.5 | |
G3 | 216.0 | 207.3–224.7 | |
WSS | G1 | 28.32 | 25.93–30.71 |
G2 | 27.59 | 22.45–32.73 | |
G3 | 41.36 | 33.70- 49.02 |
Discussion
Computational topology is a new discipline that includes methods for calculating topological invariants such as homology groups and Betti number from digital images. Various applications for this discipline may be found in areas of materials science. For example, Hiraoka et al. recently discovered hierarchical structures of amorphous solids using persistent homology22,23. The homology concept can be applied to the geometrical characterization of 2-dimensional fracture surfaces and 3-dimensional morphologies in polymer mixtures24,25. In medical sciences, Adcock et al. used persistent homology to classify liver lesions in CT images26. Nakane and Takiyama also applied homology methods for the identification of regions of interest in colonic pathological images27. Ferri et al. generated a feasibility study for a persistent homology based k-Nearest Neighbor search algorithm in melanoma detection28.
In this study, we have defined a persistent homology index (PHI). Comparisons of PHI versus the Ki-67 labeling index derived by a pathologist using visual scoring demonstrated a high Spearman correlation of 0.71 (p < 0.0001). Positive correlations were observed between PHI, WSS2, WSS, and nuclear grade, as assessed by visual scoring; negative correlations were observed for center coordinates. There were significant differences among the three groups of metric corresponding to each nuclear grade in PHI, center coordinates (except for ), and WSS2 (p < 0.001) for the zeroth persistence diagram PD0, and center coordinates and WSS for the first persistence diagram PD1 (p < 0.01). The PHI, our novel index for immunohistochemical scoring using persistent homology, generated highly similar data that produced by visual scoring.
Positive correlations between WSS2, WSS and nuclear grade, and negative correlations between center coordinates and nuclear grade, indicate that persistence diagrams reflect not only labeling index, but also cellularity and nuclear atypia. The denser cellularity becomes, the earlier homology classes contact each other, such that the center of positive homology classes moves to a lower position. Nuclear atypia, such as differences in the size of nuclei and their irregular distribution, provoke this shift in the persistence diagram. These changes appear in and data. Since earlier contacts of homology classes bring about earlier births in the PD1 diagram, these changes also appear in , , and WSS calculated from PD1. Histological grading or nuclear grading of invasive ductal carcinoma is a useful prognostic factor2,3,29,30. Higher nuclear grades are linked to poorer prognoses, with many studies indicating a high correlation between Ki-67 labeling index and prognosis4. However, histological grading or nuclear grading by visual scoring is inherently subjective and manifests intra-observer and inter-observer variabilities31, with these exacerbated by the lack of a standardized Ki-67 labeling index32. Our new persistent homology index is objective and therefore provides more reliable data. Our method may provide a means to improve IHC data quality, with persistence diagrams carrying more information than the Ki-67 labeling index alone. Furthermore, our quantitative analyses can be applied to other immunohistochemical nuclear stains such as estrogen receptor (ER) and progesterone receptor staining (PgR). The quantification of immunohistochemical membrane staining, such as human epidermal growth factor receptor 2 (HER2), represents another possible application as this provokes as increase in 1-dimensional Betti number.
Our study has some limitations. For example, we did not exclude stroma from our analyses. The use of pattern-recognition software to identify and dial-out stromal data may therefore improve precision.
A recent study by Chabot-Richards et al. revealed that there is no substantial benefit to using quantitative image analysis for Ki-67 because of the good correlation between pathologist estimate and quantitative image analysis33. Their data suggested that pathologist visual scoring may be more closely associated with survival outcome than that of computerized image analysis. They explained that this may be caused by selecting appropriate areas of the slides by pathologists, however, the possible influence of unconscious recognition of nuclear atypia and cellularity by pathologists cannot be excluded. Since our new method reflects nuclear atypia and cellularity, and it has a good correlation with nuclear grade assessed by pathologist that associates prognosis, our new method may be more useful prognostic factor than the traditional image analysis for Ki-67.
In conclusion, this study demonstrates that PHI, a novel index for immunohistochemical labeling using persistent homology, can produce highly similar data to that generated by a pathologist using visual evaluation. The potential benefits associated with our novel technology include both improved quantification and reproducibility. Since our method reflects cellularity and nuclear atypia, it carries a greater quantity of biologic data compared to conventional evaluation using Ki-67.
Acknowledgements
This work was partially supported by a KAKENHI Grant-in-Aid for Scientific Research 15KT0100.
Author Contributions
A.T. and T.T. performed the experiments, analyzed the results and wrote the manuscript. A.T., H.S. and K.Y. performed data collection and annotation. S.T. supervised the work and was involved in setting up the experimental design. All authors reviewed the manuscript and agree with its contents.
Competing Interests
The authors declare that they have no competing interests.
Footnotes
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Akihiro Takiyama, Email: takiyama@med.hokudai.ac.jp.
Takashi Teramoto, Email: teramoto@asahikawa-med.ac.jp.
References
- 1.Taira N, et al. The Japanese Breast Cancer Society clinical practice guidelines for epidemiology and prevention of breast cancer, 2015 edition. Breast Cancer. 2016;23:343–356. doi: 10.1007/s12282-016-0673-8. [DOI] [PubMed] [Google Scholar]
- 2.Elston CW, Ellis IO. Pathological prognostic factors in breast cancer. I. The value of histological grade in breast cancer: experience from a large study with long-term follow-up. Histopathology. 1991;19:403–410. doi: 10.1111/j.1365-2559.1991.tb00229.x. [DOI] [PubMed] [Google Scholar]
- 3.Tsuda H, et al. Establishment of histological criteria for high-risk node-negative breast carcinoma for a multi-institutional randomized clinical trial of adjuvant therapy. Japan National Surgical Adjuvant Study of Breast Cancer (NSAS-BC) Pathology Section. Jpn. J. Clin. Oncol. 1998;28:486–491. doi: 10.1093/jjco/28.8.486. [DOI] [PubMed] [Google Scholar]
- 4.Synnestvedt M, et al. Combined analysis of vascular invasion, grade, HER2 and Ki67 expression identifies early breast cancer patients with questionable benefit of systemic adjuvant therapy. Acta Oncol. 2013;52:91–101. doi: 10.3109/0284186X.2012.713508. [DOI] [PubMed] [Google Scholar]
- 5.Bjerre C, et al. Association of tissue inhibitor of metalloproteinases—1 and Ki67 in estrogen receptor positive breast cancer. Acta Oncol. 2013;52:82–90. doi: 10.3109/0284186X.2012.734922. [DOI] [PubMed] [Google Scholar]
- 6.Stuart-Harris R, Caldas C, Pinder SE, Pharoah P. Proliferation markers and survival in early breast cancer: a systematic review and meta-analysis of 85 studies in 32,825 patients. Breast. 2008;17:323–334. doi: 10.1016/j.breast.2008.02.002. [DOI] [PubMed] [Google Scholar]
- 7.Goldhirsch A, et al. Thresholds for therapies: highlights of the St Gallen International Expert Consensus on the primary therapy of early breast cancer 2009. Ann. Oncol. 2009;20:1319–1329. doi: 10.1093/annonc/mdp322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Perou CM, et al. Molecular portraits of human breast tumours. Nature. 2000;406:747–752. doi: 10.1038/35021093. [DOI] [PubMed] [Google Scholar]
- 9.Sørlie T, et al. Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc Natl Acad Sci USA. 2001;98:10869–10874. doi: 10.1073/pnas.191367098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Goldhirsch A, et al. Strategies for subtypes—dealing with the diversity of breast cancer: highlights of the St. Gallen International Expert Consensus on the Primary Therapy of Early Breast Cancer 2011. Ann Oncol. 2011;22:1736–1747. doi: 10.1093/annonc/mdr304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Goldhirsch A, et al. Personalizing the treatment of women with early breast cancer: highlights of the St Gallen International Expert Consensus on the Primary Therapy of Early Breast Cancer 2013. Ann Oncol. 2013;24:2206–2223. doi: 10.1093/annonc/mdt303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Edelsbrunner H, Harer J. Persistent homology—a survey. Contemporary mathematics. 2008;453:257–282. doi: 10.1090/conm/453/08802. [DOI] [Google Scholar]
- 13.Edelsbrunner, H. & Harer, J. Computational Topology: An Introduction (American Mathematical Society, 2010).
- 14.Carlsson G. Topology and data. B. Am. Math. Soc. 2009;46:255–308. doi: 10.1090/S0273-0979-09-01249-X. [DOI] [Google Scholar]
- 15.Carlsson G, Zomorodian A. The Theory of Multidimensional Persistence. Discrete Comput. Geom. 2009;42:71–93. doi: 10.1007/s00454-009-9176-0. [DOI] [Google Scholar]
- 16.Carlsson G, Singh G, Zomorodian A. Computing Multidimensional Persistence. J. Comput. Geom. 2010;1:72–100. [Google Scholar]
- 17.Ghrist, R. Elementary Applied Topology (Createspace Independent Publishing Platform, 2014).
- 18.Kaczynski, T., Mischaikow, K. & Mrozek, M. Computational Homology (Springer, 2004).
- 19.Bauer, U., Kerber, M., Reininghaus, J. & Wagner, H. PHAT (Persistent Homology Algorithm Toolbox), v1.4. 1. Available from: https://bitbucket.org/phat-code/phat (updated 2016 May 13).
- 20.MacQueen JB. Some Methods for classification and Analysis of Multivariate Observations. Proceedings of 5-th Berkeley Symposium on Mathematical Statistics and Probability. 1967;1:281–297. [Google Scholar]
- 21.R Development Core Team. R: A Language and Environment for Statistical Computing Available from: https://www.r-project.org/ (2016).
- 22.Gameiro M, et al. A topological measurement of protein compressibility. Japan. J. Indust. Appl. Math. 2015;32:1–17. doi: 10.1007/s13160-014-0153-5. [DOI] [Google Scholar]
- 23.Hiraoka Y, et al. Hierarchical structures of amorphous solids characterized by persistent homology. Proc. Natl. Acad. Sci. USA. 2016;113:7035–7040. doi: 10.1073/pnas.1520877113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Takiyama A, Nakane K, Kida K. An image analyzing method by a homology concept for fracture surfaces. Adv. Mat. Res. 2015;1102:135–138. [Google Scholar]
- 25.Teramoto T, Nishiura Y. Morphological characterization of the diblock copolymer problem with topological computation. Japan. J. Indust. Appl. Math. 2010;27:175–190. doi: 10.1007/s13160-010-0014-9. [DOI] [Google Scholar]
- 26.Adcock A, Rubin D, Carlsson G. Classification of Hepatic Lesions using the Matching Metric. Comput. Vis. Image Und. 2014;121:36–42. doi: 10.1016/j.cviu.2013.10.014. [DOI] [Google Scholar]
- 27.Nakane K, Takiyama A, Mori S, Matsuura N. Homology-based method for detecting regions of interest in colonic digital images. Diagn. Pathol. 2015;10:36. doi: 10.1186/s13000-015-0244-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Ferri, M., Tomba, I., Visotti, A. & Stanganelli, I. A feasibility study for a persistent homology based k-Nearest Neighbor search algorithm in melanoma detection. Available from: https://arxiv.org/abs/1605.09781 (cited April 4th 2016).
- 29.Bult P, et al. In primary breast cancer the mitotic activity yields similar prognostic information as the histological grade: a study with long-term follow-up. Breast Cancer Res. Treat. 2010;122:77–86. doi: 10.1007/s10549-009-0526-7. [DOI] [PubMed] [Google Scholar]
- 30.Ono M, et al. Prognostic impact of Ki-67 labeling indices with 3 different cutoff values, histological grade, and nuclear grade in hormone-receptor-positive, HER2-negative, node-negative invasive breast cancers. Breast Cancer. 2015;22:141–152. doi: 10.1007/s12282-013-0464-4. [DOI] [PubMed] [Google Scholar]
- 31.Dalton LW, Page DL, Dupont WD. Histologic grading of breast carcinoma. A reproducibility study. Cancer. 1994;73:2765–2770. doi: 10.1002/1097-0142(19940601)73:11<2765::AID-CNCR2820731119>3.0.CO;2-K. [DOI] [PubMed] [Google Scholar]
- 32.Dowsett M, et al. Assessment of Ki67 in breast cancer: recommendations from the International Ki67 in Breast Cancer working group. J. Natl. Cancer Inst. 2011;103:1656–1664. doi: 10.1093/jnci/djr393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Chabot-Richards DS, Martin DR, Myers OB, Czuchlewski DR, Hunt KE. Quantitative image analysis in the assessment of diffuse large B-cell lymphoma. Mod. Pathol. 2011;24:1598–1605. doi: 10.1038/modpathol.2011.123. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The datasets generated and analyzed during the current study are available from the corresponding author on reasonable request.