Skip to main content
NPJ Breast Cancer logoLink to NPJ Breast Cancer
. 2020 May 12;6:15. doi: 10.1038/s41523-020-0155-1

Application of a risk-management framework for integration of stromal tumor-infiltrating lymphocytes in clinical trials

Jan Hudeček 1, Leonie Voorwerk 2, Maartje van Seijen 3, Iris Nederlof 2, Michiel de Maaker 3, Jose van den Berg 4, Koen K van de Vijver 5, Karolina Sikorska 6, Sylvia Adams 7, Sandra Demaria 8,9, Giuseppe Viale 10,11, Torsten O Nielsen 12, Sunil S Badve 13, Stefan Michiels 14,15, William Fraser Symmans 16, Christos Sotiriou 17, David L Rimm 18,19, Stephen M Hewitt 20, Carsten Denkert 21,22, Sibylle Loibl 23, Sherene Loi 24, John M S Bartlett 25,26,27, Giancarlo Pruneri 28,29, Deborah A Dillon 30, Maggie C U Cheang 31, Andrew Tutt 32, Jacqueline A Hall 33, Zuzana Kos 34, Roberto Salgado 24,35,, Marleen Kok 2,36, Hugo M Horlings 3,4; International Immuno-Oncology Biomarker Working Group
PMCID: PMC7217941  PMID: 32436923

Abstract

Stromal tumor-infiltrating lymphocytes (sTILs) are a potential predictive biomarker for immunotherapy response in metastatic triple-negative breast cancer (TNBC). To incorporate sTILs into clinical trials and diagnostics, reliable assessment is essential. In this review, we propose a new concept, namely the implementation of a risk-management framework that enables the use of sTILs as a stratification factor in clinical trials. We present the design of a biomarker risk-mitigation workflow that can be applied to any biomarker incorporation in clinical trials. We demonstrate the implementation of this concept using sTILs as an integral biomarker in a single-center phase II immunotherapy trial for metastatic TNBC (TONIC trial, NCT02499367), using this workflow to mitigate risks of suboptimal inclusion of sTILs in this specific trial. In this review, we demonstrate that a web-based scoring platform can mitigate potential risk factors when including sTILs in clinical trials, and we argue that this framework can be applied for any future biomarker-driven clinical trial setting.

Subject terms: Breast cancer, Biomarkers, Tumour biomarkers, Tumour immunology

Introduction

Clinical trials in cancer research are increasingly incorporating biomarkers, for example, as an inclusion criterion or for stratification of patients to control for confounding factors. Practical challenges, such as interobserver variation in the assessment of biomarkers during the execution of the trial, are often overlooked. If not handled appropriately, these challenges can limit the effectiveness and ability to complete the biomarker and drug development process. According to Hall et al.1, the risks inherent to biomarker integration can be divided into risks to patients, operational risks, and direct risks to biomarker development. A practical risk-management framework developed by a National Cancer Institute (NCI), National Cancer Research Institute (NCRI), and European Organization for Research and Treatment of Cancer (EORTC) Working Group1 was proposed to manage the risks inherent to biomarker integration into clinical trials.

Stromal tumor-infiltrating lymphocytes (sTILs) have been strongly associated with prognosis in early-stage triple-negative breast cancer (TNBC) and HER2-positive breast cancer. In addition, sTILs are predictive for neo-adjuvant chemotherapy response in early breast cancer2,3. Furthermore, sTILs correlate with outcome after immune checkpoint blockade in metastatic TNBC46. The readout of sTILs, however, can be challenging impeding its effective use as a biomarker and its usage in the clinic7. The International Immuno-Oncology Biomarker Working Group (hereafter called the TIL Working Group) has provided guidelines for the scoring of sTILs in breast cancer8, and the St. Gallen Breast Cancer Conference of 2019 endorsed sTILs being routinely characterized in TNBC and reported according to these guidelines8.

Risks associated with integration of biomarkers in clinical trials

In contemporary clinical research there is an increasing trend toward the use of biomarker results obtained in daily practice to select patients for inclusion in clinical trials. Although biomarker research is more and more prominent in clinical trials, most biomarkers will not make into the clinic9. Therefore, continuous monitoring of the predefined risks and the solutions can improve the quality of the biomarker, which can be applied in a clinical trial setting, as well as in daily practice. The recommendations of the TIL Working Group8,10 for appropriate scoring, and the risk-management framework of the NCI, NCRI, and EORTC Working Groups1 will help to effectively and efficiently improve the incorporation of biomarkers in clinical trials in first instance.

Several risks are associated with biomarker development and integration of biomarkers in clinical trials. Roughly, risks can be divided into three categories: risks to patient safety, operational risks, and risks to biomarker development. Not all risks are applicable to all clinical trials and upon designing a biomarker-incorporating clinical trial, risks should be defined and mitigation approaches formulated. It is highly recommended that during a clinical trial, risks are not only pre-identified but are also continuously monitored to prevent stagnation in biomarker development1. For example, incorporating biomarkers in a large multi-center international clinical trial involves different risks than a small single-center trial. In the first case, there might be different legislation regarding data confidentiality, and inter-laboratory variability can be an issue. When incorporating a biomarker as inclusion criterion or stratification factor in clinical trials, rapid turnaround times are needed and the highest level of quality is necessary for correct interpretation of the results. In the next steps of biomarker development, high-quality results are needed to ensure implementation in daily clinical practice.

Use of digital pathology in clinical trials and development of a novel web application

In larger trials, usually phase II–III, central pathology review (CPR) plays an important role in the reliable assessment of biomarker scoring. However, logistical issues, such as the sending of tumor blocks or slides, can be time consuming, costly for the pathology laboratory, and error prone with significant consequences for patient inclusion if the wrong material is sent to the central lab. Digital sharing of histology slides and patient data simplifies logistics for CPR11. Besides digital sharing and scoring of slides, digital image analysis and machine learning approaches are emerging in clinical research12,13. The use of digital pathology or digital evaluation of histology slides most prominently mitigates risks associated with operational processes. It can reduce the number of missing samples, since the sharing of material is simplified; it enables rapid turnaround times; reduces manual errors; and can streamline local versus central assessment of biomarker.

For clinicians and researchers to use digital pathology, applications and websites should be user-friendly and intuitive. As an example, a web-based tool called Slide Score (www.slidescore.com) was developed as a cross-platform web application to facilitate the scoring of whole slide images and tissue microarray (TMA) cores. Application programming interface (API) was implemented that allowed programmatic administration of studies, uploading slides, fetching results, and retrieving pixel data for regions of images. This API enabled automating creation of new studies from internal database system for managing biobanking workflows. Additionally, a plugin was developed for QuPath14—open-source image analysis software—which uses this API to run image analysis algorithms on slides stored on the Slide Score platform avoiding the need to download the slides. This web-based platform was used in high-impact projects6,15, for example, for the digital scoring of biomarkers in the first stage of the TONIC trial6, and the estimation of the immune infiltrate of tumors of melanoma patients used for single-cell sequencing15. Furthermore, the web-based platform is currently used for several other types of research, such as interrater variability studies, retrospective TMA, and whole slide scoring and prospective biomarker scoring.

Design of a workflow to mitigate risks associated with biomarker development: an example

We identified seven distinct risks with the risk-management framework published by Hall et al.1 as possibly interfering with the quality and integration of prospective sTILs scores in a clinical trial, and designed our workflow accordingly (Table 1). These risks are specific for this trial, but some of them are applicable also to other trials. They span all three categories mentioned above1 and included (1) poor-quality biopsies, (2) possible loss of data confidentiality, (3) interrater variability, (4) poor sample quality, (5) poor scoring quality, (6) delay in patient registration, and (7) manual errors (Table 1). We then defined solutions to mitigate these risks and integrated these solutions in a workflow that can be applied across clinical trials and across biomarkers (Fig. 1). The workflow can be modified according to local guidelines, research questions, and clinical trial designs. We used the following workflow to obtain timely and reliable sTILs scores (summary in Supplementary Fig. 1).

Table 1.

Risks with possible high impact identified in a phase II immunotherapy trial6 based on the perspectives of Hall et al.1 with our approach to mitigation of that risk.

Type of risk Risk Description of risk Mitigation approach
1. Risks to patients No stroma or tumor cells in biopsy Discomfort and risks associated with a sampling intervention Take multiple biopsies from one lesion at the same time (a minimum of three biopsies per lesion9) and check amount of tumor cells before analysis of sTILs and inclusion in the trial
2. Risks to patients Loss of data confidentiality Patient samples sent to multiple institutions and reviewers Pseudonymization should be applied, hide slide labels, implement strict access control, ensure no metadata is linked to a slide
3. Risks to biomarker development Inter-laboratory variability and interobserver variability Different methodologies used to score slides, interrater variability Use of international guidelines for scoring and training, use consensus score of four expert pathologists from three institutes
4. Operational risks Failure of sample collection, processing, and quality Missing or poor-quality samples resulting in poor consensus scoring Standardized tissue processing, workflow management
5. Operational risks Inadequate image quality or no ability to access image Missing, poor, or inaccurate scoring Track the scores of all pathologists and notify when scores are inconsistent
6. Operational risks/risks to patients Long turnaround time Delay in patient randomization and treatment Timeline tracking incorporated in workflow
7. Operational risks Data management failure Errors in collecting manual scores, typos, data conversion issues Structured digital scores from pathologist to the analyst

Fig. 1. Organization of a workflow for reliable and timely biomarker scoring in a general single-center or multi-center trial.

Fig. 1

Personnel at individual centers scan the slides after processing by the local pathology department. Digital slides are uploaded to a central web-based repository, such as Slide Score. A study-specific identifier is assigned to each sample. The central manager is notified by the system when new slides are available and requests pathologists to review it. When a consensus score is obtained, the trial office is notified for randomization of the patient.

After obtaining informed consent of a patient, three biopsies of one metastatic lesion (lymph node, skin, liver, or other) were obtained in this trial. Previous research has shown that three 14 G core needle biopsies should be sufficient for accurate breast cancer diagnosis16. A hematoxylin and eosin (H&E)-stained slide of one biopsy was then evaluated, to ensure that the biopsy contained enough tumor cells (more than 100 cells) for further analysis (risk 1). Next, a high-resolution digital scan was obtained and automatically pseudonymized with study-specific identifiers (risk 2) before uploading to Slide Score. Display of the original labels was masked to ensure confidentiality of all data within Slide Score (Supplementary Fig. 2b). Pathologists and administrators had to login with their username and password to access the slides and were able to add a two-factor authentication application. Four well-trained breast pathologists, based in three different institutes and in two different countries, were notified via email to score each slide using existing sTIL scoring guidelines of the TIL Working Group8,10 to reduce interrater variability (risk 3). sTILs are scored as the percentage of lymphocytes in the total stromal area (in close proximity of the tumor cells). Interrater variability can lead to bias in the results, when assessment of a biomarker is skewed towards either the lower or higher ranges. When there was a disagreement (using a 5% cut-off) a concordance-score was agreed upon (Supplementary Fig. 1). Low-quality, inaccurate collection or processing of samples can result in low sample availability and introduce batch effects or bias in the results (risk 4) and lead to non-consistent scores (risk 5). High quality of samples was ensured by standardization of our workflow in which all steps were performed in the same manner for every biopsy (Supplementary Fig. 1). Oversight of the entire workflow by one person, referred to as the central manager, is essential for timely identification of technical errors. The central manager tracked the timing of the biopsies, notified the pathologists immediately after the scan was uploaded and sent reminders if necessary, kept track of the scores and timing, and noted the score in the patient record for trial office notification. We predefined acceptable timeframes for obtaining the scores of the reviewers and tracked these during the study progress (risk 6; Supplementary Fig. 1). Pathologists were notified via email the next working day when the slide was not scored yet to minimize the waiting period to start treatment (risk 6). Finally, using Slide Score, we reduced the risks of typos and other manual errors by collecting all slides within one online study group (collection of slides) and a customized scoring form was built to standardize scores and obtain structured data (risk 7).

Implementation of workflow in the TONIC trial

The TONIC trial (NCT02499367)6 is a phase II, non-comparative randomized multi-cohort single-center trial (full title: Adaptive phase II randomized non-comparative Trial Of Nivolumab after Induction treatment in TNBC patients), designed to assess the efficacy of induction of an anti-cancer immune response by low-dose chemotherapy or irradiation to increase response to anti-PD-1 in patients with metastatic TNBC. In the first part of the trial6, patients with metastatic TNBC were randomized to nivolumab (1) without induction or two-week low-dose induction, with (2) irradiation (3 × 8 Gy), (3) cyclophosphamide, (4) cisplatin, or (5) doxorubicin, all followed by nivolumab (anti-PD-1; 3 mg/kg). Based on a Simon’s two-stage design17 and prespecified pick-the-winner criteria, only the doxorubicin cohort was allowed to continue in the second part of the trial6. In the second part of the TONIC trial, patients were randomized between anti-PD-1 monotherapy (control group) and two cycles of low-dose doxorubicin (15 mg flat dose, weekly), followed by anti-PD-1 (Supplementary Fig. 2a). Randomization was stratified for sTILs. Stratification is done by dividing patients in two categories, namely sTILhigh (equal or exceeding 5%) and sTILlow (lower than 5%). The cut-off was determined based on data obtained in the first part of the TONIC trial, in which we observed that sTILs were predictive of response to anti-PD-1, both continuous and when a cut-off of 5% was used6. These data confirmed the predictive value of sTILs of at least 5% in another trial, which tested the efficacy of anti-PD-1 in patients with metastatic TNBC4. The full protocol, including four amendments, and the informed consent form were approved by the medical-ethical committee of The Netherlands Cancer Institute. All patients provided written informed consent before enrollment. The trial was registered on 17 August 2015. The 47 patients of the second part of the trial were randomized between March 2018 and July 2019. Full eligibility criteria and trial procedures have been described previously6.

In the second part of the TONIC trial, we could implement our workflow with a focus on accurate and reproducible sTIL scores within a reasonable timeframe after a biopsy was taken (72 h). For all 47 patients included in the trial, reliable sTIL scores were obtained with 45 biopsies scored within the 72-h timeframe (Supplementary Fig. 3). During the course of the study, the server of Slide Score was available 99.9% of the time. Five biopsies had to be re-evaluated due to a discrepancy in the categorical scores, when not all pathologists agreed on the appropriate category of the sTIL score (lower than 5% versus higher or equal to 5%). In three of these cases the score of one pathologist was higher (5 or 10%) than the score of the other two or three pathologists (0–3%). The average sTIL score was obtained and the pathologist causing the disagreement was notified. In the fourth and fifth case, two pathologists scored 5 and 10%, whereas the other pathologists scored 1%. All four pathologists were notified of the disagreement and a consensus score of 5% was obtained. We observed an intraclass correlation coefficient of 0.94 (95% confidence interval (CI): 0.91–0.97) for sTILs as a continuous variable. Interrater agreement for the categorical variable used in the stratification (sTILs <5% or ≥5%) was 0.86 (multirater Fleiss’ κ18; 95% CI: 0.73–1; Supplementary Fig. 2c). In the anti-PD-1 monotherapy cohort, we observed that 13 out of 23 patients (56.5 %) had sTILs below 5%, as compared to 15 out of 24 patients in the doxorubicin cohort (62.5 %; Fisher’s exact test p value 0.77). The distribution of the sTIL scores is depicted in Supplementary Fig. 2d. These data indicate effective stratification based on the cut-off of 5%, but a slightly uneven distribution in the higher ranges of sTIL scores (10% or higher) inherent to the use of our cut-off. We observed a median time from biopsy until the scanning of the H&E slide of 30 h (range 24–98 h) and a median time from the biopsy until at least three scores were obtained of 43 h (range 27–106 h). In total, the median time from biopsy until registration in the patient records was 49 h (range 41–106 h; Supplementary Fig. 2e), with 96% of biopsies scored within 72 h. Two biopsies were not scored within the 72-h time limit, due to additional processing of one sample and one delay in registration time due to the absence of the central manager (Supplementary Figs 1 and 3).

Advantages and limitations of a web-based risk-mitigation workflow

Our proposed solutions involved standardization of our workflow, obtaining digital images and the use of a web-based tool such as Slide Score for the managing and scoring of digital images. Anticipating the incorporation of digital images in routine diagnostics, our workflow shows that it is feasible for a pathologist to score digital images with high reliability. Moreover, a web-based tool can facilitate the process of coordinated uploading of digital images, pseudonymizing slides, and regulate access to studies and proper data management. Web-based platforms are therefore of high interest in biomarker research and can help with automation that can be transferred to clinical practice in the future.

In this study, we obtained sTIL scores within 72 h after a biopsy was taken, which is a reasonable timeframe for clinicians to start randomization of patients to treatment arms in a clinical trial. We observed an excellent interrater agreement score between our panel of four expert pathologists. In an accompanying paper7 we demonstrate using data from three RING studies of the TIL Working Group that the concordance achieved using a risk-management approach as detailed in this study is substantially higher than observed outside this risk-management perspective as observed in the three RING studies and in other published studies19,20. However, our sample size is small and the four pathologists in the current study were trained and experienced in the scoring of sTILs in breast cancer. Also, the biopsies used in this study were checked for containing sufficient tumor cells (≥100 cells) before the slide was scored for sTILs, which could have further improved our results. In the future, it is to be expected that computational workflows will further improve the scoring of sTILs13. Although we obtained reliable and timely results in 96% of cases, the presence of a central manager is crucial. In one case there was a delay in registration time due to the absence of the central manager. The manual intervention of quality checks, processing of the slides, and data cannot be circumvented in our workflow.

Stratification in this study was performed using sTILs as a binary variable (lower than 5% versus higher or equal to 5%). Consequently, we observed an uneven distribution in continuous sTILs scores between the cohorts (Supplementary Fig. 2d). This was mainly due to more patients with sTILs scores above 10% in the anti-PD-1 monotherapy cohort. Inherent to the use of a binary cut-off for stratification, the median of the continuous measurement might still differ between cohorts. Alternatively, multiple categories for the same variable can be used in stratification. However, this approach generates more strata, with lower number of patients in each stratum, possibly leading to an imbalance in distribution21,22. Moreover, at the time of writing of this paper no cut-offs for sTILs are established and/or properly validated for predictive purposes.

During the trial, we continuously monitored whether our strategy was still feasible within the set timeframe by means of regular evaluation by the pathologists and the study coordinators. This led to rapid adjustment of the workflow if needed, ensuring the quality of the sTIL scores. For example, pathologists could easily login remotely and score a digital H&E outside the hospital ensuring that sTILs were still scored within 72 h after biopsy. Ongoing evaluation during the clinical trial is of critical importance for risk mitigation in biomarker research1.

Future applications of the workflow

Our strategy can serve as a template for risk management and mitigation of all identified risks in future clinical trials incorporating biomarkers for inclusion, enrichment, or stratification. By no means will risks identified in this study be similar for all clinical trials. Each trial will have its own risks that need to be mitigated, although there will be similarities between the risks across clinical trials. Defining the risks that come with biomarker development will help tested biomarkers eventually make their way to the clinic. However, one may even argue that a similar risk-management strategy can be applied in daily practice. In the BELLINI trial (NCT03815890), two cycles of neo-adjuvant anti-PD-1 are administered in patients with early-stage TNBC or luminal B breast cancer. All patients are required to have at least 5% sTILs in the pretreatment biopsy and patients are thereafter stratified in three sTIL categories. Our workflow will be used to ensure timely and reliable sTIL scores for the right patient selection. By using our workflow, scoring of sTILs is highly standardized, allowing also smaller centers with less extensive experience in sTILs scoring to participate in a clinical trial.

Conclusions

In contemporary clinical research there is an increasing trend toward the use of biomarker results obtained in daily practice to select patients for inclusion in clinical trials. Therefore, continuous monitoring of the predefined risks and the solutions can improve the quality of the biomarker, as can be applied in a clinical trial setting, as well as in daily practice. The recommendations of the TIL Working Group8,10 for appropriate scoring, the risk-management framework of the NCI, NCRI, and EORTC Working Groups1, as well as our proposed strategies to reduce risks will help to effectively and efficiently improve the incorporation of biomarkers in clinical trials in first instance, herewith illustrated using sTILs as a paradigm of this development.

Supplementary information

Supplementary Material (651.3KB, pdf)

Acknowledgements

The Department of Pathology of the Netherlands Cancer Institute is thanked for the support of this study and ensuring the rapid turnaround times. The Breast Cancer Research Foundation and Bristol-Myers-Squibb (BMS) are thanked for financial support. We also thank the BMS-International Immuno-Oncology Network (BMS/II-ON) and the Dutch Cancer Society (NKI2015-7710) for funding the clinical trial costs and this feasibility study (NKI2016-10510). S.L., R.S., and M.K. are supported by a grant from the Breast Cancer Research Foundation (BCRF, NY, US). The following is a list of current members of the International Immuno-Oncology Working Group (TILs Working Group). A member is defined as a person willing to be involved, informed and be part of the activities of the TILs Working Group. The authors alone are responsible for the views expressed in the work of the TIL Working Group and they do not necessarily represent the decisions, policy, or views of their employer.

Author contributions

J.H. developed Slide Score and wrote the manuscript with L.V., R.S., M.K. and H.M.H. L.V. coordinated study procedures and performed data-analyses. M.v.S., I.N., S.A., S.D., G.V., T.O.N., S.S.B., S.M., W.F.S., C.S., D.L.R., S.H., C.D., S.L., Sh.L., J.M.S.B., G.P., D.A.D., M.C.U.C., A.T., J.A.H. and Z.K. gave critical input. M.d.M. provided logistical support with the sample processing. J.v.d.B., K.K.v.d.V., R.S. and H.M.H. scored the slides. K.S. performed the statistical analysis. M.K. is the principal investigator of the TONIC trial. J.H., L.V., R.S., M.K. and H.M.H. designed this feasibility study. All authors edited and approved the manuscript.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Competing interests

J.H. is the owner of Slide Score. B.V. L.V., M.v.S., I.N., M.d.M., J.v.d.B., K.K.v.d.V., K.S., S.A., S.D., G.V., S.S.B., S.M., W.F.S., C.S., S.M.H., C.D., S.L., G.P., M.C.U.C., Z.K., and H.M.H. have nothing to disclose. T.O.N. has consulted for Nanostring and received compensation and has intellectual property rights/ownership interests from Bioclassifier LLC, not related to the subject material under consideration and received funding support from the Canadian Cancer Society. D.L.R. reports research funding from AstraZeneca, Cepheid, Navigate BioPharma, NextCure, Lilly, and Ultivue; instrument support from Ventana, Akoya/PerkinElmer, and NanoString; advisory board of Amgen, AstraZeneca, Cell Signaling Technology, Cepheid, Daiichi Sankyo, GSK, Konica/Minolta, Merck, NanoString, PerkinElmer, Ventana, and Ultivue; consultancy for Biocept; honorarium and travel support from BMS; royalties from Rarecyte and is a founder and equity holder of PixelGear. Sh.L. receives research funding to her institution from Novartis, Bristol Meyers Squibb, Merck, Roche-Genentech, Puma Biotechnology, Pfizer, and Eli Lilly, acted as consultant (not compensated) to Seattle Genetics, Pfizer, Novartis, BMS, Merck, AstraZeneca, and Roche-Genentech and acted as consultant (paid to her institution) to Aduro Biotech. J.M.S.B. reports research funding from ThermoFisher, Genoptix, Agendia, NanoString Technologies, Stratifyer GmbH, and Biotheranostics and advisory roles for Insight Genetics, BioNTech AG, Biotheranostics, Pfizer, RNA Diagnostics, and OncoXchange. J.M.S.B. reports the following patents: Methods and Devices for Predicting Anthracycline Treatment Efficacy, US utility (January 2017; 15/325,472; EPO – 15822898.1; Canada – not yet assigned), Systems, Devices and Methods for Constructing and Using a Biomarker, US utility (January 2017; 15/328,108; EPO – 15824751.0; Canada – not yet assigned), Histone gene module predicts anthracycline benefit (October 2016; PCT/CA2016/000247), 95‐Gene Signature of Residual Risk Following Endocrine Treatment (December 2016; PCT/CA2016/000304), Immune Gene Signature Predicts Anthracycline Benefit (December 2016; PCT/CA2016/000305). D.A.D. is on the advisory board and consults for Oncology Analytics Inc., and has consulted for and received travel funds from Novartis for work unrelated to the current manuscript. A.T. reports benefits from ICR’s Inventors Scheme associated with patents for one of PARP inhibitors in BRCA1/2-associated cancers. A.T. also reports Honoraria from Pfizer, Vertex, Prime Oncology, and Artios, honoraria and stock in InBioMotion, honoraria and financial support for research from AstraZeneca, Medivation, Myriad Genetics, and Merck Serono. J.A.H. is the director and owner of Vivactiv Ltd. R.S. reports research funding from Roche, Puma, and Merck; advisory board and consultancy for BMS; travel funding from Roche, Merck, and AstraZeneca, outside the scope of this work. M.K. reports funding to the institute from BMS, Roche and an advisory role for BMS, outside the submitted work.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

A list of authors and their affiliations appears at the end of the paper.

Contributor Information

Roberto Salgado, Email: roberto@salgado.be.

International Immuno-Oncology Biomarker Working Group:

Aini Hyytiäinen, Akira I. Hida, Alastair Thompson, Alex Lefevre, Alexander J. Lazar, Allen Gown, Amy Lo, Anna Sapino, Anant Madabhushi, Andre Moreira, Andrea Richardson, Andrea Vingiani, Andrew H. Beck, Andrew M. Bellizzi, Angel Guerrero, Anita Grigoriadis, Anna Ehinger, Ana Garrido-Castro, Anne Vincent-Salomon, Anne-Vibeke Laenkholm, Ashish Sharma, Ashley Cimino-Mathews, Ashok Srinivasan, Balazs Acs, Baljit Singh, Benjamin Calhoun, Benjamin Haibe-Kans, Benjamin Solomon, Bibhusal Thapa, Brad H. Nelson, Brandon D. Gallas, Carlos Castaneda, Carmen Ballesteros-Merino, Carmen Criscitiello, Carolien Boeckx, Cecile Colpaert, Cecily Quinn, Chakra S. Chennubhotla, Charles Swanton, Cinzia Solinas, Crispin Hiley, Damien Drubay, Daniel Bethmann, David A. Moore, Denis Larsimont, Dhanusha Sabanathan, Dieter Peeters, Dimitrios Zardavas, Doris Höflmayer, Douglas B. Johnson, E. Aubrey Thompson, Edi Brogi, Edith Perez, Ehab A. ElGabry, Elisabeth Specht Stovgaard, Elizabeth F. Blackley, Elvire Roblin, Emily Reisenbichler, Enrique Bellolio, Eva Balslev, Ewa Chmielik, Fabien Gaire, Fabrice Andre, Fang-I Lu, Farid Azmoudeh-Ardalan, Federico Rojo, Tina Gruosso, Francesco Ciompi, Franklin Peale, Fred R. Hirsch, Frederick Klauschen, Frédérique Penault-Llorca, Gabriela Acosta Haab, Gelareh Farshid, Gert van den Eynden, Giuseppe Curigliano, Giuseppe Floris, Glenn Broeckx, Gonzalez-Ericsson, Harmut Koeppen, Harry R. Haynes, Heather McArthur, Heikki Joensuu, Helena Olofsson, Huang-Chun Lien, I-Chun Chen, Ian Cree, Isabel Frahm, Iva Brcic, Jack Chan, James Ziai, Jane Brock, Jelle Wesseling, Jennifer Giltnane, Jennifer K. Kerner, Jeppe Thagaard, Jeremy P. Braybrooke, Jeroen A. W. M. van der Laak, Jerome Lemonnier, Jiping Zha, Joana Ribeiro, Jochen K. Lennerz, Jodi M. Carter, Joel Saltz, Johan Hartman, Johannes Hainfellner, John Le Quesne, Jonathon W. Juco, Jorge Reis-Filho, Joselyn Sanchez, Joseph Sparano, Joël Cucherousset, Juan Carlos Araya, Julien Adam, Justin M. Balko, Kai Saeger, Kalliopi Siziopikou, Karen Willard-Gallo, Karsten Weber, Katherine L. Pogue-Geile, Keith E. Steele, Kenneth Emancipator, Khalid AbdulJabbar, Khalid El Bairi, Kim R. M. Blenman, Kimberly H. Allison, Konstanty Korski, Lajos Pusztai, Laura Comerma, Laurence Buisseret, Lee A. D. Cooper, Leming Shi, Loes F. S. Kooreman, Luciana Molinero, M. Valeria Estrada, Magali Lacroix-Triki, Maise Al Bakir, Manu M. Sebastian, Marc van de Vijver, Marcelo Luiz Balancin, Maria Vittoria Dieci, Marie-Christine Mathieu, Marlon C. Rebelatto, Martine Piccart, Matthew G. Hanna, Matthew P. Goetz, Matthias Preusser, Mehrnoush Khojasteh, Melinda E. Sanders, Meredith M. Regan, Michael Barnes, Michael Christie, Michael Misialek, Michail Ignatiadis, Mieke van Bockstal, Miluska Castillo, Mohamed Amgad, Nadia Harbeck, Nadine Tung, Nele Laudus, Nicolas Sirtaine, Nicole Burchardi, Nils Ternes, Nina Radosevic-Robin, Oleg Gluz, Oliver Grimm, Paolo Nuciforo, Paul Jank, Paula Gonzalez-Ericsson, Pawan Kirtani, Petar Jelinic, Peter H. Watson, Peter Savas, Prudence A. Francis, Prudence A. Russell, Rajendra Singh, Rim S. Kim, Robert H. Pierce, Robert Hills, Roberto Leon-Ferre, Roland de Wind, Ruohong Shui, Sabine De Clercq, Sam Leung, Sami Tabbarah, Sandra C. Souza, Sandra O’Toole, Sandra Swain, Sarah Dudgeon, Scooter Willis, Scott Ely, Seong-Rim Kim, Shahinaz Bedri, Sheeba Irshad, Shi-Wei Liu, Shom Goel, Shona Hendry, Simonetta Bianchi, Sofia Bragança, Soonmyung Paik, Stephan Wienert, Stephen B. Fox, Stephen J. Luen, Stephen Naber, Stuart J. Schnitt, Luz F. Sua, Sunil R. Lakhani, Susan Fineberg, Teresa Soler, Thomas Gevaert, Timothy D’Alfonso, Tom John, Tomohagu Sugie, Uday Kurkure, Veerle Bossuyt, Venkata Manem, Vincente Peg Cámara, Weida Tong, Weijie Chen, Wentao Yang, William T. Tran, Yihong Wang, Yinyin Yuan, Yves Allory, Zaheed Husain, and Zsuzsanna Bago-Horvath

Supplementary information

Supplementary information is available for this paper at 10.1038/s41523-020-0155-1.

References

  • 1.Hall JA, Salgado R, Lively T, Sweep F, Schuh A. A risk-management approach for effective integration of biomarkers in clinical trials: perspectives of an NCI, NCRI, and EORTC working group. Lancet Oncol. 2014;15:e184–193. doi: 10.1016/S1470-2045(13)70607-7. [DOI] [PubMed] [Google Scholar]
  • 2.Loi S, et al. Tumor-infiltrating lymphocytes and prognosis: a pooled individual patient analysis of early-stage triple-negative breast cancers. J. Clin. Oncol. 2019;37:559–569. doi: 10.1200/JCO.18.01010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Denkert C, et al. Tumour-infiltrating lymphocytes and prognosis in different subtypes of breast cancer: a pooled analysis of 3771 patients treated with neoadjuvant therapy. Lancet Oncol. 2018;19:40–50. doi: 10.1016/S1470-2045(17)30904-X. [DOI] [PubMed] [Google Scholar]
  • 4.Loi S, et al. LBA13—relationship between tumor infiltrating lymphocyte (TIL) levels and response to pembrolizumab (pembro) in metastatic triple-negative breast cancer (mTNBC): results from KEYNOTE-086. Ann. Oncol. 2017;28(Suppl_5):v605–49. doi: 10.1093/annonc/mdx440. [DOI] [Google Scholar]
  • 5.Emens LA, et al. Long-term clinical outcomes and biomarker analyses of atezolizumab therapy for patients with metastatic triple-negative breast cancer: a phase 1 study. JAMA Oncol. 2018 doi: 10.1001/jamaoncol.2018.4224. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Voorwerk L, et al. Immune induction strategies in metastatic triple-negative breast cancer to enhance the sensitivity to PD-1 blockade: the TONIC trial. Nat. Med. 2019 doi: 10.1038/s41591-019-0432-4. [DOI] [PubMed] [Google Scholar]
  • 7.Kos, Z. et al. Pitfalls in assessing stromal tumor infiltrating lymphocytes (sTILs) in breast cancer. npj Breast Cancer. 10.1038/s41523-020-0156-0 (2020). [DOI] [PMC free article] [PubMed]
  • 8.Salgado R, et al. The evaluation of tumor-infiltrating lymphocytes (TILs) in breast cancer: recommendations by an International TILs Working Group 2014. Ann. Oncol. 2015;26:259–271. doi: 10.1093/annonc/mdu450. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Hayes DF, et al. Breaking a vicious cycle. Sci. Transl. Med. 2013;5:196cm196. doi: 10.1126/scitranslmed.3005950. [DOI] [PubMed] [Google Scholar]
  • 10.Hendry S, et al. Assessing tumor-infiltrating lymphocytes in solid tumors: a practical review for pathologists and proposal for a standardized method From the International Immunooncology Biomarkers Working Group: Part 1: assessing the host immune response, TILs in invasive breast carcinoma and ductal carcinoma in situ, metastatic tumor deposits and areas for further research. Adv. Anat. Pathol. 2017;24:235–251. doi: 10.1097/PAP.0000000000000162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Mroz P, Parwani AV, Kulesza P. Central pathology review for phase III clinical trials: the enabling effect of virtual microscopy. Arch. Pathol. Lab. Med. 2013;137:492–495. doi: 10.5858/arpa.2012-0093-RA. [DOI] [PubMed] [Google Scholar]
  • 12.Pell R, et al. The use of digital pathology and image analysis in clinical trials. J. Pathol. 2019;5:81–90. doi: 10.1002/cjp2.127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Amgad, M. et al. Report on computational assessment of tumor infiltrating lymphocytes from the international immuno-oncology biomarker working group. npj Breast Cancer. 10.1038/s41523-020-0154-2 (2020). [DOI] [PMC free article] [PubMed]
  • 14.Bankhead P, et al. QuPath: Open source software for digital pathology image analysis. Sci. Rep. 2017;7:16878. doi: 10.1038/s41598-017-17204-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Li H, et al. Dysfunctional CD8 T cells form a proliferative, dynamically regulated compartment within human melanoma. Cell. 2019;176:775–789.e718. doi: 10.1016/j.cell.2018.11.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Sauer G, et al. Ultrasound-guided large-core needle biopsies of breast lesions: analysis of 962 cases to determine the number of samples for reliable tumour classification. Br. J. Cancer. 2005;92:231–235. doi: 10.1038/sj.bjc.6602303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Simon R. Optimal two-stage designs for phase II clinical trials. Control. Clin. Trials. 1989;10:1–10. doi: 10.1016/0197-2456(89)90015-9. [DOI] [PubMed] [Google Scholar]
  • 18.Fleiss, J. L. Statistical Methods for Rates and Proportions 2nd edn (1981).
  • 19.Tramm T, et al. Standardized assessment of tumor-infiltrating lymphocytes in breast cancer: an evaluation of inter-observer agreement between pathologists. Acta Oncol. 2018;57:90–94. doi: 10.1080/0284186X.2017.1403040. [DOI] [PubMed] [Google Scholar]
  • 20.Dieci V, et al. Association of tumor-infiltrating lymphocytes with distant disease-free survival in the ShortHER randomized adjuvant trial for patients with early HER2+ breast cancer. Ann. Oncol. 2019;30:418–423. doi: 10.1093/annonc/mdz007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Therneau TM. How many stratification factors are “too many” to use in a randomization plan? Control. Clin. Trials. 1993;14:98–108. doi: 10.1016/0197-2456(93)90013-4. [DOI] [PubMed] [Google Scholar]
  • 22.Silcocks P. How many strata in an RCT? A flexible approach. Br. J. Cancer. 2012;106:1259–1261. doi: 10.1038/bjc.2012.84. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material (651.3KB, pdf)

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.


Articles from NPJ Breast Cancer are provided here courtesy of Nature Publishing Group

RESOURCES