Abstract
PURPOSE
Tumor-infiltrating lymphocytes (TILs) and their spatial characterizations on whole-slide images (WSIs) of histopathology sections have become crucial in diagnosis, prognosis, and treatment response prediction for different cancers. However, fully automatic assessment of TILs on WSIs currently remains a great challenge because of the heterogeneity and large size of WSIs. We present an automatic pipeline based on a cascade-training U-net to generate high-resolution TIL maps on WSIs.
METHODS
We present global cell-level TIL maps and 43 quantitative TIL spatial image features for 1,000 WSIs of The Cancer Genome Atlas patients with breast cancer. For more specific analysis, all the patients were divided into three subtypes, namely, estrogen receptor (ER)–positive, ER-negative, and triple-negative groups. The associations between TIL scores and gene expression and somatic mutation were examined separately in three breast cancer subtypes. Both univariate and multivariate survival analyses were performed on 43 TIL image features to examine the prognostic value of TIL spatial patterns in different breast cancer subtypes.
RESULTS
The TIL score was in strong association with immune response pathway and genes (eg, programmed death-1 and CLTA4). Different breast cancer subtypes showed TIL score in association with mutations from different genes suggesting that different genetic alterations may lead to similar phenotypes. Spatial TIL features that represent density and distribution of TIL clusters were important indicators of the patient outcomes.
CONCLUSION
Our pipeline can facilitate computational pathology-based discovery in cancer immunology and research on immunotherapy. Our analysis results are available for the research community to generate new hypotheses and insights on breast cancer immunology and development.
INTRODUCTION
The interaction between the tumor and its microenvironment (TME) plays a critical role in cancer development and progression. TME consists of various cells, including fibroblasts and a wide spectrum of immune cells.1 The host immune system is crucial in regulating tumor growth by continuous immunosurveillance and initiation of inflammatory reactions.2,3 Many observations indicate that tumor-infiltrating lymphocytes (TILs) and their spatial characteristics have significant diagnostic and prognostic values in multiple types of cancers.4-9 For example, recent studies suggest that high TIL densities correlate with favorable clinical outcomes in colorectal cancer, non–small-cell lung cancer, and head and neck cancers.10,11 Currently, the quantification and scoring of TILs on whole-slide images (WSIs) has been mainly performed and interpreted by domain experts, which can be subjective and influenced by human bias. As a result, automated image analysis methods are desired to reduce labor costs and provide consistent and accurate TIL evaluations. A fully automatic approach for TIL quantification and analysis should involve three technical issues: (1) selection of field of views (FOVs) on WSIs, (2) lymphocyte detection in FOVs, and (3) quantification of TILs for clinical assessment.
CONTEXT
Key Objective
To develop a fully automatic pipeline that enables accurate quantification and thorough exploration of tumor-infiltrating lymphocytes (TILs) using histopathology image data and multiomics data.
Knowledge Generated
Correlations between TIL spatial features derived from our deep-learning–based pipeline and gene expression data indicate that there are different cellular processes associated with the patient’s immune response in triple-negative and other breast cancer subtypes. Both the genomic correlations and survival analysis results imply that the clustering dispersion pattern of TILs is an important factor for evaluating immune response.
Relevance
Our image-processing pipeline can be easily used for TIL quantification on histopathology images, and help to reduce labor costs and human bias. The difference between genes in correlation with TIL features in triple-negative and other breast cancer subtypes will bring new insights into future immunologic research for breast cancer treatment.
Many approaches have been proposed for lymphocyte detection.12-17 Lymphocytes typically have small (7-10 μm), round, and dark nuclei with little cytoplasm, which is distinctive from malignant (epithelial) cells or stromal cells. Based on these characteristics, some studies first perform nuclei detection algorithms to distinguish all nucleus from cytoplasm in the hematoxylin and eosin (H&E)–stained images, then use a support vector machine algorithm to classify cellular components into different categories based on nuclear morphology.18,19 These methods are indirect and may sacrifice the accuracy of lymphocyte detection for overall classification performance. Recently, deep learning has become popular in computer vision and image-processing tasks because of its outstanding performance, and some studies have applied deep-learning methods to detect lymphocytes.20-22 However, because these methods require manual preselection of representative tumor regions for each slide, directly applying such models for overall TIL analysis on WSIs remains a practical challenge. Furthermore, this work did not deeply explore the relationships between TILs and omics data.
For instance, Amgad et al23 has developed an effective deep-learning–based method for joint region-level and nucleus-level segmentation of TILs, even though their work was limited by the lack of validation on large-scale image datasets and additional analysis between spatial TIL features and biologic data. Saltz et al24 have presented global mappings, as well as the spatial organization and molecular correlation of TILs for over 5,000 H&E diagnostic WSIs from The Cancer Genome Atlas (TCGA) dataset, which represented a benchmark for TIL analysis. However, because of the convolutional neural network model in Saltz et al,24 with only a calculated lymphocytic probability for each patch, it could not provide more specific cellular information, such as TIL counts and distribution that were related to clinical outcomes. Klauschen et al25 summarized different automated TIL scoring approaches in computational pathology and pointed out that it would increasingly contribute to the applications in immune research, but also emphasized that additional omics analysis, including mutational profiling, gene expression, and machine learning, were needed to enhance precision medicine.
Our study extended the limitations of previous work from both the image and biologic analysis perspectives. We first present a fully automatic image-processing pipeline following the clinical steps to quantitatively characterize TILs on histopathologic slide images. Our framework consists of three parts: (1) automatic identification of FOV; (2) a cascade-trained U-net model for lymphocyte detection; and (3) global quantification of TILs on WSIs. We applied our method on 1,000 unannotated TCGA breast cancer diagnostic WSIs and generated TIL maps containing rich cellular information of lymphocytes. We also extracted a set of TIL spatial features based on our TIL maps and explored the relationship between these TIL features and different omics data for different breast cancer subtypes. The associations between TIL features and genetic mutations, as well as gene expression, indicate that there may be different cell activation processes regulating the immune responses in different breast cancer subtypes. A set of features related to the spatial dispersion of TIL clusters was also found to be associated with survival in different breast cancer subtypes. All these analysis results are available for the research community to generate new hypotheses and insights on breast cancer immunology and development. The main contribution of this article is that our work is a complete pipeline using multiple sources, which not only enables fully automatic TIL evaluation on images but also incorporates omics analysis from different biologic data. Our pipeline can be easily extended to histologic images of other cancers and can facilitate the computational pathology-based discovery in immunology and research on immunotherapy.
METHODS
Data Source and Selection
Two breast cancer datasets were used in this study, namely, the TCGA-BRCA dataset and the lymphocyte detection dataset released by Janowczyk and Madabhushi.20 The TCGA dataset includes matched H&E-stained diagnostic images, transcriptome, somatic mutation, and clinical information for patients with breast cancer. Patients with missing molecular data or images with too-severe cryoartifacts or low-level lymphocyte infiltration were excluded, leaving a set of 1,000 samples. The dataset by Janowczyk and Madabhushi,20 denoted as , includes 200 small images of 200 × 200 pixels at 40× magnification with lymphocyte centers annotated by human experts, which contain a total of 3,064 lymphocytes. The demographic and clinical information of the patients is summarized in Table 1.
TABLE 1.
Demographic and Clinical Characteristics of the Patients With Breast Cancer
Histopathologic Image-Processing Pipeline
Figure 1A outlines the identification of FOV in our pipeline. Because the H&E stains basic cellular structures either red or pink in an image, we can distinguish different tissues from the background area by color classification. Given a WSI , we first downsampled into by a factor of 16:1. was then converted from the RGB color space to CIELAB color space. Next, we performed a K-means clustering algorithm to separate the pixels in CIELAB space into three groups. Considering that corners of pathology images are often unstained, pixels in the same cluster as the upper-left corner pixel in were considered background, whereas the other pixels were considered either tumor or stromal tissues. Denoting the smallest rectangle region containing the largest continuous tissue area in as , we mapped the coordinates of onto and obtained the FOV in the original WSI, Finally, was cropped from for later processing.
FIG 1.
The pipeline for identification of tumor-infiltrating lymphocyte (TIL) maps on whole-slide histopathologic images. (A) Identification of field of view (FOV) for whole-slide image. (B) Cascade training of the U-net model for lymphocyte detection. (C) Generating the global TIL map. TCGA, The Cancer Genome Atlas.
Next, we constructed a U-net–based neural network to identify lymphocytic regions on H&E-stained images. Considering that our annotated lymphocyte dataset might not be large enough to train a robust model from scratch, we adopted the first five blocks of the Resnet18 model26 as the encoder in our model. This implementation was inspired by the success of using Resnet18 for objection detection in Ren et al27 and Redmon et al,28 and we perceived it would help to improve the efficiency and performance of our model. The encoder was then followed by a decoder containing five repeated upsampling blocks, each of which consisted of a concatenation layer, a 2 × 2 deconvolution layer, and two 3 × 3 convolution layers. Parameters in the encoder were initialized with the pretrained weights from He et al,26 and the decoder layers were randomly initialized using the Xavier method.29
We proposed a cascade-training scheme that involved feedback from domain experts to obtain a robust model for lymphocyte detection. Our framework is illustrated in Figure 1B. In the first stage, was used for end-to-end training and evaluation. Note that only provided annotations for lymphocyte centers, which did not perfectly match the design of our U-net–based network. Considering that lymphocytes are usually round cells with diameters approximately 8 μm (32 pixels at 40× magnification), we constructed binary lymphocytic masks by dilating each annotated center into a circular area with a diameter of 32 pixels to use the U-net–based model. Eighty percent of (160 images of 200 × 200 pixels) were randomly picked out for training, and the remaining 20% were used for testing. Random mirror and random crop were performed to augment the training set. Both the training and evaluation loss converge on after training for 200 epochs.
Denoting the U-net model well trained on as , whereas can identify lymphocytes on , it was not robust enough to detect lymphocytes in other cohorts. To further improve the robustness, we performed the second-stage training involving feedback from pathologists on . In this stage, we used an iterative cycle of review and refinement for training. In the initial iteration, we manually cropped a group of 1,000 patches of 200 × 200 pixels from TCGA WSIs, denoted as . Fifty percent of was collected from immune hotspots with densely clustered lymphocytes, whereas the remaining 50% were randomly collected from tissue regions scattered by lymphocytes. Each patch in was fed into , and the predicted lymphocyte mask was evaluated and refined by two domain experts. Then, we used and its refined masks to train until convergence. We repeated this iterative process with new groups of patches until the pathologists considered the prediction accuracy to be 0.9. Two iterations were performed on TCGA groups in our experiment. The model after final iteration, denoted as , was then used to generate TIL maps for all TCGA breast cancer WSIs.
Figure 1C outlines the steps for generating global TIL maps in our pipeline. For each WSI , its FOV, was split into nonoverlapping patches of 200 × 200 pixels. Patches with more than 80% background were discarded. The remaining valid patches were then fed into , and all the patch-level predictions were combined to generate a global TIL map for . We estimate an overall TIL score for by computing the percentage of TIL areas as:
(1) |
where and represent the region of lymphocytes and the number of tissue pixels in the ith valid patch in , respectively, and represents the total number of valid patches in .
Spatial Features of TIL
We estimated 42 TIL spatial features based on the TIL maps derived from our pipeline. For each TIL map, we first picked out TIL patches as independent data points. Then, we used the APCluster R package30 to obtain local TIL cluster patterns by applying the affinity propagation algorithm31 on the data points. Next, we used the clusterCrit R package to extract statistical TIL spatial features from the TIL clusters, as listed in Table S1 of the Data Supplement.
Analysis of the Relationship Between TIL Score and Multiomics Data, Including Gene Transcription and Somatic Mutations
We divided all patients in the TCGA-BRCA cohort into three subtypes, estrogen receptor (ER) positive, ER negative, and triple negative, based on patients’ status of markers, including ER, progesterone receptor, and human epidermal growth factor receptor 2. We first computed and sorted the Spearman correlation coefficients between gene transcription levels and the TIL score, as well as all the 42 TIL features derived from imaging data for each subtype. Then we select the genes whose transcription levels were significantly correlated with TIL scores (PSpearman > .3) for each subtype. For the selected gene symbols, we performed function and pathway enrichment analysis using Ingenuity Pathway Analysis (IPA).
In addition, for each gene with somatic mutation information available, we performed the Wilcoxon rank sum test between patients with and without nonsynonymous mutations for the TIL scores in each subtype. Specifically, we selected genes with nonsynonymous mutations in at least 10 patients. In total, we had 33 genes for the ER-positive patients, 61 genes for the ER-negative patients, and 25 genes for the triple-negative group.
Machine-Learning Methods for Prognostic Prediction
We performed both univariate and multivariate survival analysis using the Cox proportional hazard model for patients with breast cancer of different subtypes, based on the TIL spatial statistical features. For the univariate method, we followed the approach in Uhlen et al.32 Specifically, for each feature, we selected different cutoffs (from the 20 percentile to the 80 percentile, with 1 percentile increments) to stratify the patients into two groups using the approach in Uhlen et al.32 Different cutoffs led to different choices of patient groups. According to the group information, we then calculated the P values of the log-rank test for comparing survival times between the two groups.
For the multivariate method, we compared the prognostic power of different approaches by stratifying patients with cancer into two subgroups (ie, the high- and low-survival risk groups) with different predicted outcomes. Specifically, for all the TIL spatial features of different patients, the K-means clustering algorithm was adopted to aggregate the patients into different subgroups. Then, we tested whether these groups had significantly different survival outcomes using the log-rank test. Finally, based on the divided groups of different patients, the Lasso-based feature selection model was applied to identify important biomarkers that can distinguish different patient groups.
RESULTS
Detection and Quantification of TIL Spatial Patterns in Histopathologic Images
To evaluate the effectiveness of the proposed U-net for lymphocyte detection, we tested and compared our model with three widely used open-source biomedical image analysis software programs: CellProfiler,33 QuPath,34 and Fiji,35 on . Comparison of lymphocyte detection results with CellProfiler, QuPath, Fiji, and U-net are provided in Figure 2C and Fig 2D. From Figures 2C and 2D, we observe that Fiji achieves the highest average recall (0.8439) and F1-score (0.7471) among the three existing software programs. However, the proposed U-net model had substantially better performance (recall, 0.9536; precision, 0.901; F1-score, 0.9266). This makes sense because the existing software programs perform unsupervised algorithms for nuclei detection, whereas our U-net model was specifically trained for lymphocyte detection.
FIG 2.
Evaluation and visualization of results for tumor-infiltrating lymphocyte (TIL) detection. (A) TIL maps for three breast cancer whole-slide images (WSIs) from The Cancer Genome Atlas in different TIL patterns; (left) hematoxylin and eosin (H&E) whole-slide diagnostic image; (middle) predicted TIL maps; (right) absolute error between our map and the map from Saltz et al.24 Values in the color bar represent different TIL levels: 0 (blue) corresponds to patches without TIL, and 250 (red) corresponds to patches filled with TILs. (B) Spearman correlation for comparison of TIL proportion from WSI TIL maps and molecular estimates of TIL content from genomics assays.36 (C) Cell-level results of lymphocyte detection predicted by our model and the three software programs. Green, yellow, and magenta dots in the prediction represent true positive, false positive, and false negative, respectively. (D) Performance comparison between our model and the three software programs for lymphocyte detection.
We applied our pipeline for identification of TIL maps to 1,000 TCGA-BRCA WSIs. Because there was no available TIL annotation for TCGA WSIs, we compared our results with the only referable TIL maps in Saltz et al.24 Figure 2A shows the visualization of TIL maps for three TCGA WSIs in different TIL patterns: Brisk Diffuse pattern, showing a strong immune infiltration within the tumor; Brisk Band-like pattern, showing immune infiltration forming boundaries bordering the tumor; and Nonbrisk Multifocal pattern, showing a weak immune response with loosely scattered TILs. Figure 2A demonstrates that our framework can efficiently identify different types of immune spots.
We also performed a more specific evaluation for the cell-level performance of our framework compared with the TIL maps in Salz et al.24 We compared the TIL scores derived from our method with the molecular estimates of TIL content from genomics assays.36 The Spearman correlation coefficients between TIL proportion from imaging and molecular estimates are shown in Figure 2B. As can be seen, the TIL maps generated by our method are more consistent with the molecular estimates (PSpearman, .3602; P = 6 × 10–25) than TIL maps in Salz et al24 (PSpearman, .3409; P = 3.622 × 10–22). This improvement indicates that our cell-level TIL maps are not only in line with the associated molecular data, but also provide more accurate and detailed immune estimation than patch-level TIL maps.
Correlate TIL Spatial Features With Gene Expression
To investigate which genes contribute to the development of immune infiltrating in different breast cancer subtypes, we performed enrichment analysis for the genes that were in high correlation with TIL scores. The enrichment results are shown in Figure 3, with the gene lists in Table S2 of the Data Supplement. When we applied a threshold value of 0.3 to the Spearman correlation coefficients, 54, 307, and 263, genes were selected for ER-positive, ER-negative, and triple-negative breast cancer subtypes, respectively. Among them, many were immune response–related genes. For instance, important immune therapy–related genes, such as PDCD1 (also known as programmed death-1 [PD-1] and CD279) and CTLA4 were both observed in all three subtypes. As can be seen in Figure 3, enrichment analysis on canonical pathways and functions from IPA confirmed the strong and consistent enrichment in processes related to immune and inflammatory responses in all three subtypes. An example of the strong involvement of the cancer immunotherapy pathway is also shown with genes observed for the ER-positive patient, marked in gray with magenta boundaries in Figure 3 (ER positive: Gene Network).
FIG 3.
Enrichment of different canonical pathways and functions in three breast cancer subtypes for genes whose transcription levels showed high correlations with tumor-infiltrating lymphocyte (TIL) score. PD-1, programmed death-1; PD-L1, programmed death-ligand 1.
We also performed Spearman correlation between the TIL spatial features and the gene expression data. We found that in both the ER-positive and ER-negative subtypes, the “banfeld_raftery” feature, which represents the TIL cluster extent, was highly correlated with immune therapy genes, such as CD38 and CXCL9. These genes are also correlated with the TIL score. But for the triple-negative subtype, we found that there were high correlations between the “g_plus” feature and genes (PCDHGA2, PCDHGA3, and PCDHB12) that have unusual immunoglobulin-like organization similar to that of B-cell and T-cell receptor gene clusters. These genes have not been observed to be associated with TIL score. Because the “g_plus” feature represents the cluster dispersion, these observations may imply that the TIL area and TIL distribution in the triple-negative subtype are regulated by different sets of genes leading to different immune response.
Compare TIL Score With Somatic Mutation Status of Genes
We also investigated the relationship between somatic mutations and TIL scores. Specifically, for each gene, the patients with and without somatic mutations were separated into two groups, and the rank sum test was applied to test whether there was any difference between the TIL scores. Figure 4 demonstrates two representative examples of the results for each breast cancer subtype. Table S3 in the Data Supplement lists the P values for genes whose somatic mutation status is significantly associated with the TIL score (P < .05). It is interesting to observe that the genes whose mutation status was strongly associated with the TIL scores were different between the ER-positive and ER-negative groups, whereas there were some similarities between the ER-negative and triple-negative breast cancer groups.
FIG 4.
Examples of the associations between tumor-infiltrating lymphocyte (TIL) score and somatic mutations. ER, estrogen receptor.
TIL Spatial Pattern Predicts Survival in Different Breast Cancer Subtypes
We also examined whether the spatial patterns of TIL might affect or be associated with patient survival in different breast cancer subtypes using both univariate and multivariate methods. We calculated 43 spatial statistical features for the detected TILs for each patient. Results of survival-associated TIL features are shown in Figure 5. For univariate analysis, we adopted the approach in Uhlen et al,32 where multiple partitions of the patient cohort based on different thresholds were used for log-rank tests, and the threshold leading to the most significant difference was selected for the specific spatial feature. For multivariate analysis, a Lasso-Cox regression model was used to select the features, with large weights for influencing patient survival times. It can be observed that different subtypes tend to be separated by different TIL spatial features in both univariate and multivariate analysis. In addition, the univariate results are better than the multivariate results. This unintuitive result is due to the fact that the multivariate uses the Lasso-Cox model with a single cutoff for separating the patients, whereas the univariate used multiple cutoff values to identify the best separation.
FIG 5.
Associations of tumor-infiltrating lymphocyte (TIL) spatial features with survival outcomes in different breast cancer subtypes. (A) Univariate analysis of the prognostic value for single TIL spatial features in different subtypes. (B) Multivariate analysis of the prognostic value for all the TIL spatial features in different subtypes; (top) represents the Kaplan-Meier curve by multiple TIL features; (bottom) represents the main contributing TIL features and their coefficients for survival prediction.
DISCUSSION
In this article, we proposed an automatic pipeline based on a cascade-training U-net model to generate high-resolution TIL maps on WSIs. Both qualitative and quantitative results demonstrate that our framework can provide reliable quantification of TILs from H&E-stained histopathology images. The global TIL maps we present can also be used for in-depth biologic analysis.37-39 For example, TILs in different tissue regions can be examined separately for survival prediction.40,41
The high-resolution TIL maps then enable comprehensive integrative analysis with multiomics data and clinical outcomes. The correlation analysis between the TIL score and gene transcription levels clearly confirmed the validity of the quantification of TILs using our method based on the strong enrichment of immune response genes and pathways. In particular, the strong association with immunotherapy pathway and genes (eg, PD-1 and CLTA4) suggests that the WSI-based analysis can be potentially used in the future for assessing or predicting immunotherapy potential.
The comparative analysis for genes with somatic mutations suggests that multiple genes affect the TIL score. However, because the somatic mutations are genetic alterations in the tumor (epithelial) cells, their effects on the TIL score are indirect through the interaction within the TME. The mechanisms for the association warrant additional investigation. In addition, the observation that different subtypes of breast cancers show TIL associations with mutations from different genes suggests that different genetic alterations may lead to similar phenotypes. For the triple-negative subtype, observations between TIL spatial features and gene expression data show that there may be different cell activation processes governing the patient’s immune response.
Last but not least, we also demonstrated that different spatial statistical features for the TIL distributions can be potentially used to predict patient outcomes (ie, survival times). Most of these spatial features (eg, GDI12, PBM, Tau, Gamma) were associated with dispersion and extent of the clustering, suggesting the clumping patterns of the TILs are important indicators of patient outcomes. Thus, except from the overall TIL infiltration, clustering of dispersion of TILs on histopathologic images should receive more attention in clinical settings and will be of great interest as a future research direction.
Overall, in this study, we developed an effective deep-learning–based pipeline for detecting TILs at the cellular level from WSIs of breast cancers. This pipeline was validated on multiple datasets, and correlation analysis with molecular data confirmed its strong associations with immune responses. Our results provide a resource for breast cancer biologists and informaticians for deep investigations between TIL patterns and molecular and clinical data, which will lead to new hypotheses and insights for both cancer biology and translational biomarker discovery. This pipeline can also be applied to study other types of cancers.
SUPPORT
Supported by the Indiana University Precision Health Initiative (K.H. and J.Z.) and the National Natural Science Foundation of China-Guangdong United Foundation of China (No. U1501256; Q.F.).
AUTHOR CONTRIBUTIONS
Conception and design: Zixiao Lu, Jie Zhang, Qianjin Feng, Kun Huang
Financial support: Kun Huang
Provision of study materials or patients: Kun Huang
Collection and assembly of data: Zixiao Lu, Zhi Han
Data analysis and interpretation: Zixiao Lu, Siwen Xu, Wei Shao, Yi Wu, Zhi Han
Manuscript writing: All authors
Final approval of manuscript: All authors
Accountable for all aspects of the work: All authors
AUTHORS' DISCLOSURES OF POTENTIAL CONFLICTS OF INTEREST
The following represents disclosure information provided by authors of this manuscript. All relationships are considered compensated unless otherwise noted. Relationships are self-held unless noted. I = Immediate Family Member, Inst = My Institution. Relationships may not relate to the subject matter of this manuscript. For more information about ASCO's conflict of interest policy, please refer to www.asco.org/rwc or ascopubs.org/cci/author-center.
Open Payments is a public database containing information reported by companies about payments made to US-licensed physicians (Open Payments).
No potential conflicts of interest were reported.
REFERENCES
- 1.Langley RR, Fidler IJ. The seed and soil hypothesis revisited--The role of tumor-stroma interactions in metastasis to different organs. Int J Cancer. 2011;128:2527–2535. doi: 10.1002/ijc.26031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Mittal D, Gubin MM, Schreiber RD, et al. New insights into cancer immunoediting and its three component phases--Elimination, equilibrium and escape. Curr Opin Immunol. 2014;27:16–25. doi: 10.1016/j.coi.2014.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Couzin-Frankel J. Cancer immunotherapy. Baby’s leukemia recedes after novel cell therapy. Science. 2015;350:731. doi: 10.1126/science.350.6262.731. [DOI] [PubMed] [Google Scholar]
- 4.Denkert C, Loibl S, Noske A, et al. Tumor-associated lymphocytes as an independent predictor of response to neoadjuvant chemotherapy in breast cancer. J Clin Oncol. 2010;28:105–113. doi: 10.1200/JCO.2009.23.7370. [DOI] [PubMed] [Google Scholar]
- 5.Denkert C, von Minckwitz G, Brase JC, et al. Tumor-infiltrating lymphocytes and response to neoadjuvant chemotherapy with or without carboplatin in human epidermal growth factor receptor 2-positive and triple-negative primary breast cancers. J Clin Oncol. 2015;33:983–991. doi: 10.1200/JCO.2014.58.1967. [DOI] [PubMed] [Google Scholar]
- 6.Lee HJ, Lee J-J, Song IH, et al. Prognostic and predictive value of NanoString-based immune-related gene signatures in a neoadjuvant setting of triple-negative breast cancer: Relationship to tumor-infiltrating lymphocytes. Breast Cancer Res Treat. 2015;151:619–627. doi: 10.1007/s10549-015-3438-8. [DOI] [PubMed] [Google Scholar]
- 7.Dieci MV, Radosevic-Robin N, Fineberg S, et al. Update on tumor-infiltrating lymphocytes (TILs) in breast cancer, including recommendations to assess TILs in residual disease after neoadjuvant therapy and in carcinoma in situ: A report of the International Immuno-Oncology Biomarker Working Group on Breast Cancer. Semin Cancer Biol. 2018;52:16–25. doi: 10.1016/j.semcancer.2017.10.003. [DOI] [PubMed] [Google Scholar]
- 8.Salgado R, Denkert C, Demaria S, et al. The evaluation of tumor-infiltrating lymphocytes (TILs) in breast cancer: Recommendations by an International TILs Working Group 2014. Ann Oncol. 2015;26:259–271. doi: 10.1093/annonc/mdu450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Mlecnik B, Tosolini M, Kirilovsky A, et al. Histopathologic-based prognostic factors of colorectal cancers are associated with the state of the local immune reaction. J Clin Oncol. 2011;29:610–618. doi: 10.1200/JCO.2010.30.5425. [DOI] [PubMed] [Google Scholar]
- 10.Angell H, Galon J. From the immune contexture to the Immunoscore: The role of prognostic and predictive immune markers in cancer. Curr Opin Immunol. 2013;25:261–267. doi: 10.1016/j.coi.2013.03.004. [DOI] [PubMed] [Google Scholar]
- 11.Fridman WH, Pagès F, Sautès-Fridman C, et al. The immune contexture in human tumours: Impact on clinical outcome. Nat Rev Cancer. 2012;12:298–306. doi: 10.1038/nrc3245. [DOI] [PubMed] [Google Scholar]
- 12.Turkki R, Linder N, Kovanen PE, et al. Antibody-supervised deep learning for quantification of tumor-infiltrating immune cells in hematoxylin and eosin stained breast cancer samples. J Pathol Inform. 2016;7:38. doi: 10.4103/2153-3539.189703. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ehteshami Bejnordi B, Veta M, Johannes van Diest P, et al. Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. JAMA. 2017;318:2199–2210. doi: 10.1001/jama.2017.14585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Toss MS, Miligy I, Al-Kawaz A, et al. Prognostic significance of tumor-infiltrating lymphocytes in ductal carcinoma in situ of the breast. Mod Pathol. 2018;31:1226–1236. doi: 10.1038/s41379-018-0040-8. [DOI] [PubMed] [Google Scholar]
- 15.Rakaee M, Kilvaer TK, Dalen SM, et al. Evaluation of tumor-infiltrating lymphocytes using routine H&E slides predicts patient survival in resected non-small cell lung cancer. Hum Pathol. 2018;79:188–198. doi: 10.1016/j.humpath.2018.05.017. [DOI] [PubMed] [Google Scholar]
- 16.Eriksen AC, Sørensen FB, Lindebjerg J, et al. The prognostic value of tumor-infiltrating lymphocytes in stage II colon cancer. A nationwide population-based study. Transl Oncol. 2018;11:979–987. doi: 10.1016/j.tranon.2018.03.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.McIntire PJ, Irshaid L, Liu Y, et al. Hot spot and whole-tumor enumeration of CD8+ tumor-infiltrating lymphocytes utilizing digital image analysis is prognostic in triple-negative breast cancer. Clin Breast Cancer. 2018;18:451–458.e1. doi: 10.1016/j.clbc.2018.04.019. [DOI] [PubMed] [Google Scholar]
- 18.Yuan Y, Failmezger H, Rueda OM, et al. Quantitative image analysis of cellular heterogeneity in breast tumors complements genomic profiling. Sci Transl Med. 2012;4:157ra143. doi: 10.1126/scitranslmed.3004330. [DOI] [PubMed] [Google Scholar]
- 19.Corredor G, Wang X, Lu C, et al. A watershed and feature-based approach for automated detection of lymphocytes on lung cancer images. Med Imaging. 2018;10581:105810R. [Google Scholar]
- 20.Janowczyk A, Madabhushi A. Deep learning for digital pathology image analysis: A comprehensive tutorial with selected use cases. J Pathol Inform. 2016;7:29. doi: 10.4103/2153-3539.186902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Amgad M, Elfandy H, Hussein H, et al. Structured crowdsourcing enables convolutional segmentation of histology images. Bioinformatics. 2019;35:3461–3467. doi: 10.1093/bioinformatics/btz083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Linder N, Taylor JC, Colling R, et al. Deep learning for detecting tumour-infiltrating lymphocytes in testicular germ cell tumours. J Clin Pathol. 2019;72:157–164. doi: 10.1136/jclinpath-2018-205328. [DOI] [PubMed] [Google Scholar]
- 23.Amgad M, Sarkar A, Srinivas C, et al. Joint region and nucleus segmentation for characterization of tumor infiltrating lymphocytes in breast cancer. . Proc SPIE Int Soc Opt Eng. 2019;10956:109560M. doi: 10.1117/12.2512892. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Saltz J, Gupta R, Hou L, et al. Spatial organization and molecular correlation of tumor-infiltrating lymphocytes using deep learning on pathology images. Cell Rep. 2018;23:181–193.e7. doi: 10.1016/j.celrep.2018.03.086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Klauschen F, Müller K-R, Binder A, et al. Scoring of tumor-infiltrating lymphocytes: From visual estimation to machine learning. Semin Cancer Biol. 2018;52:151–157. doi: 10.1016/j.semcancer.2018.07.001. [DOI] [PubMed] [Google Scholar]
- 26. He K, Zhang X, Ren S, et al: Deep residual learning for image recognition. Presented at the 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, June 26-July 1, 2016. [Google Scholar]
- 27. Ren S, He K, Girshick R, et al: Faster R-CNN: Towards real-time object detection with region proposal networks. Adv Neural Inf Process Syst 28:91-99, 2015. [DOI] [PubMed] [Google Scholar]
- 28. Redmon J, Divvala SK, Girshick RB, Farhadi A. You only look once: Unified, real-time object detection. http://openaccess.thecvf.com/content_cvpr_2016/papers/Redmon_You_Only_Look_CVPR_2016_paper.pdf.
- 29. Glorot X, Bengio Y: Understanding the difficulty of training deep feedforward neural networks. Proc Thirteen Int Conf Artif Intell Stat 249-256, 2010. [Google Scholar]
- 30.Bodenhofer U, Kothmeier A, Hochreiter S. APCluster: An R package for affinity propagation clustering. Bioinformatics. 2011;27:2463–2464. doi: 10.1093/bioinformatics/btr406. [DOI] [PubMed] [Google Scholar]
- 31.Frey BJ, Dueck D. Clustering by passing messages between data points. Science. 2007;315:972–976. doi: 10.1126/science.1136800. [DOI] [PubMed] [Google Scholar]
- 32.Uhlen M, Zhang C, Lee S, et al. A pathology atlas of the human cancer transcriptome. Science. 2017;357:eaan2507. doi: 10.1126/science.aan2507. [DOI] [PubMed] [Google Scholar]
- 33.Carpenter AE, Jones TR, Lamprecht MR, et al. CellProfiler: Image analysis software for identifying and quantifying cell phenotypes. Genome Biol. 2006;7:R100. doi: 10.1186/gb-2006-7-10-r100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bankhead P, Loughrey MB, Fernández JA, et al. QuPath: Open source software for digital pathology image analysis. Sci Rep. 2017;7:16878. doi: 10.1038/s41598-017-17204-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Schindelin J, Arganda-Carreras I, Frise E, et al. Fiji: An open-source platform for biological-image analysis. Nat Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Thorsson V, Gibbs DL, Brown SD, et al: Cancer Genome Atlas Research Network: The immune landscape of cancer. Immunity 48:812-830.e14, 2018 [Erratum: Immunity 51:411-412, 2019] [Google Scholar]
- 37.de Melo Gagliato D, Cortes J, Curigliano G, et al. Tumor-infiltrating lymphocytes in breast cancer and implications for clinical practice. Biochim Biophys Acta Rev Cancer. 2017;1868:527–537. doi: 10.1016/j.bbcan.2017.10.003. [DOI] [PubMed] [Google Scholar]
- 38.Natrajan R, Sailem H, Mardakheh FK, et al. Microenvironmental heterogeneity parallels breast cancer progression: A histology–genomic integration analysis. PLoS Med. 2016;13:e1001961. doi: 10.1371/journal.pmed.1001961. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Heindl A, Sestak I, Naidoo K, et al. Relevance of spatial heterogeneity of immune infiltration for predicting risk of recurrence after endocrine therapy of ER+ breast cancer. J Natl Cancer Inst. 2018;110:166–175. doi: 10.1093/jnci/djx137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Loi S, Michiels S, Salgado R, et al. Tumor infiltrating lymphocytes are prognostic in triple negative breast cancer and predictive for trastuzumab benefit in early breast cancer: Results from the FinHER trial. Ann Oncol. 2014;25:1544–1550. doi: 10.1093/annonc/mdu112. [DOI] [PubMed] [Google Scholar]
- 41.Agahozo MC, Hammerl D, Debets R, et al. Tumor-infiltrating lymphocytes and ductal carcinoma in situ of the breast: Friends or foes? Mod Pathol. 2018;31:1012–1025. doi: 10.1038/s41379-018-0030-x. [DOI] [PubMed] [Google Scholar]