Abstract
A meeting among expert pathologists was held in 2019 in Rome to verify the results of the previous harmonization efforts on the PD-L1 immunohistochemical testing by scoring a representative series of non-small cell lung cancer (NSCLC) digital slides. The current paper shows the results of this digital experimental meeting and the expertise achieved by the community of Italian pathologists. PD-L1 protein expression was determined using tumor proportion score (TPS), i.e., the percentage of viable tumor cells showing partial or complete membrane staining at any intensity. The gold standard was defined as the final PD-L1 score formulated by a panel of seven lung committed pathologists. PD-L1 status was clustered in three categories, namely negative (TPS < 1), low (TPS 1–49%), and high (TPS ≥ 50%). In 23 cases (71.9%) PD-L1 staining was performed using the companion diagnostic 22C3 pharmDx kit on Dako Autostainer, while in nine (28.1%) cases it was performed using the SP263 Ventana kit on BenchMark platform. A complete PD-L1 scoring agreement between the panel of experts and the participants was reached in 57.1% of cases, whereas a minor disagreement in 16.1% of cases was recorded. Italian pathologists performed best in strong positive cases (i.e., tumor proportion score TPS > 50%), whereas only 10.8% of disagreement with the gold standard was observed, and 55.6% regarded a single challenging case. The worst performance was achieved in the negative cases, with 32.0% disagreement. A significant difference resulted from the analysis of the data separated by the different clones used: 22.3% and 38.1% disagreement (p = 0.01) was found in the group of cases analyzed by 22C3 and SP263 antibody clones, respectively. In conclusion, this workshop record proposed the application of a digital pathology platform to share controversial cases in educational meetings as an alternative possibility for improving the interpretation and reporting of specific histological tools. Due to the crucial role of PD-L1 TPS for the selection of patients for immunotherapy, the identification of unconventional approaches as virtual slides to focus experiences and give more detailed practical verifications of the standard quality reached may be a considerable option.
Keywords: PD-L1, immunotherapy, NSCLC
1. Introduction
During the last two decades, a significant revolution in pathology laboratories has been represented by the implementation of whole slide imaging and digital scanners [1]. Digital pathology may play a significant role as a didactic, diagnostic and research tool. In fact, digital slides are more efficient to adequately replicate the microscope experience than static images [2]. Another advantage in this setting is represented by the possibility to highlight and analyze interesting areas during didactic sessions [2]. Despite a more realistic experience can be obtained by adopting multi-headed microscopes, a limitation in this approach is related by the low number of viewing head extensions (5 to 14) [2] while digital slides may allow, by using dedicated monitor, an adequate experience to a higher number of attendants [3].
The evaluation of the diagnostic performances of a single tissue biomarker like programmed death-ligand 1 (PD-L1) seems to be a good opportunity to test in parallel the application of digital pathology as a routine tool in educational programs.
PD-L1 immunohistochemical (IHC) testing plays an essential role to select advanced stage non-small cell lung cancer (NSCLC) patients for treatment with immune system checkpoint inhibitors [4,5,6]. In this setting, standardization of both pre-analytical and analytical procedures is a crucial, albeit not trivial, task. Several antibodies have been adopted for PD-L1 staining and usually developed on different immunostainers. The most reproducible results were obtained with the 22C3 (Dako, Carpinteria, CA, USA), 28-8 (Dako), and SP263 clones (Ventana, Tucson, AZ, USA) [7,8]. So far, a constellation of many different research papers has been published to harmonize and standardize PD-L1 IHC assays [9,10,11]. However, interpretation remains challenging, and specific expertise is required to correctly assess PD-L1 status [12,13,14,15,16]. Various clinical interests converge towards the continuous education of pathologists in improving the reproducibility of PD-L1 analysis.
The evaluation of the diagnostic performance of a single tissue biomarker such as PD-L1 in NSCLC is a paradigmatic opportunity to implement digital pathology in highly specialized educational programs. For this purpose, a working group of Italian pathologists met in 2019 in Rome to assess the reproducibility of PD-L1 scoring using e-learning solutions. In this study, we share the results of this initiative, focusing on the reliability and affordability of this “2.0 approach” in pathology clinical practice.
2. Materials and Methods
2.1. Cases Selection
A panel of seven pathologists with broad experience in lung cancer pathology (MB, CD, GF, FF, PG, AM, and GT) selected from the archives of their respective institutions representative NSCLC histologic blocks, as an e-learning material. All cases were anonymized, and the study was fully compliant with the local ethical guidelines and the declaration of Helsinki. The study group was composed of 32 NSCLC cases, including 27 (84.4%) surgical resections and 5 (15.6%) biopsies. The latter were obtained from either primary (n = 3; 60%) or metastatic (n = 2; 40%) sites. The molecular status of epidermal growth factor receptor (EGFR), ROS proto-oncogene 1 receptor tyrosine kinase (ROS1) and anaplastic lymphoma kinase (ALK) was retrieved from the clinical databases. The inclusion criteria were the following: adult patients (>18 years old); no previous neoadjuvant chemoradiotherapy administered; PD-L1 testing available.
2.2. PD-L1 Status Assessment and Tissue Slides Digitalization
For each case, representative hematoxylin and eosin (H&E) and PD-L1 stained slides were digitized using a ScanScope CS digital scanner (Aperio, Park Center Dr, Vista, CA, USA) at a 40x magnification. The original PD-L1 protein expression was determined using tumor proportion score (TPS), i.e., the percentage of viable tumor cells showing partial or complete membrane staining at any intensity. Then, PD-L1 status was clustered into three categories, namely, negative (TPS < 1%), low (TPS 1–49%), and high (TPS ≥ 50%). In 23 (71.9%) cases PD-L1 staining was performed using companion diagnostic 22C3 pharmDx kit on Dako Autostainer, while in 9 (28.1%) cases using the SP263 Ventana kit and OptiView DAB IHC detection kit on BenchMark platform, as recommended by the manufacturer. For all cases, the PD-L1 TPS was re-assessed by the panel of pathologists by using the same classification to obtain the final gold standard. To define tumor cells as positive for PD-L1 staining two different approaches were adopted. Briefly, for the 22C3 clone, a complete circumferential or partial linear membranous staining of tumor cells at any intensity was necessary to define as positive the tumor cells, whereas any membranous and/or cytoplasmic immunoreactivity in tumor cells was considered positive for the SP263 clone. Discordant cases were discussed by the panel of pathologists in order to obtain a consensus.
2.3. E-Learning-Based PD-L1 Analysis Harmonization among General Pathologists
All cases were shared with 37 general pathologists who participated at the meeting using e-learning platforms. The study participants were divided into seven groups (i.e., orange, red, violet, green, yellow, pink, blue); each group was supervised by one pathologist from the expert panel as a tutor. Within the working groups, each digital case was blindly scored by each pathologist individually (TESI SPA, Milan, Italy). The automatic informatics system allowed each pathologist to express his TPS for each case on a digital table within 10 min, subsequently closing the session and saving the relative data. Finally, the informatics system stopped the timing and saved the data. A final consensus on TPS was reached by collegial discussion and the second round of scores for the digital cases.
2.4. Statistical Analysis
All study data were digitally recorded and collected. Subsequently, inferential statistical analysis was performed using Microsoft Excel embedded tools (EGR, FB, MJ). The inter- and intra-observer reproducibility was assessed by using positive and negative percent agreement, sensitivity (or true positive rate), specificity (or true negative rate), positive and negative predictive values. Ninety-five percent of confidence intervals were computed for all measurements.
3. Results
In Table 1 the clinical-pathological features of the case series are summarized. From a histological point of view, and according to the 2015 WHO classification of lung cancer, our series included 22 adenocarcinomas and 10 squamous cell carcinomas. Patients’ age ranged from 41 to 86 years old. A higher number of male patients was reported (22 vs. 10).
Table 1.
CASE Codes | SEX | AGE | DIAGNOSIS | ALK | ROS1 | EGFR |
---|---|---|---|---|---|---|
1 | M | 76 | adenocarcinoma | neg | neg | neg |
2 | M | 76 | adenocarcinoma | neg | neg | neg |
3 | M | 78 | squamous cell carcinoma | n.a. | n.a. | n.a. |
4 | F | 76 | adenocarcinoma | neg | neg | neg |
5 | M | 78 | adenocarcinoma | neg | neg | neg |
6 | F | 54 | adenocarcinoma | neg | neg | neg |
7 | M | 65 | adenocarcinoma | neg | neg | neg |
8 | M | 67 | squamous cell carcinoma | n.a. | n.a. | n.a. |
9 | M | 56 | adenocarcinoma | neg | neg | neg |
10 | M | 68 | adenocarcinoma | neg | neg | neg |
11 | M | 61 | adenocarcinoma | neg | neg | neg |
12 | M | 74 | adenocarcinoma | neg | neg | neg |
14 | F | 62 | adenocarcinoma (liver mts) | neg | neg | neg |
15 | M | 70 | adenocarcinoma (lymph node mts) | neg | neg | neg |
16 | M | 80 | squamous cell carcinoma (biopsy) | n.a. | n.a. | n.a. |
17 | M | 64 | squamous cell carcinoma (biopsy) | n.a. | n.a. | neg |
20 | M | 86 | squamous cell carcinoma | n.a. | n.a. | n.a. |
22 | M | 75 | adenocarcinoma | neg | neg | neg |
30 | M | 68 | adenocarcinoma | neg | n.a. | neg |
33 | M | 71 | squamous cell carcinoma | n.a. | n.a. | n.a. |
34 | F | 76 | adenocarcinoma | n.a. | n.a. | neg |
35 | F | 63 | adenocarcinoma | neg | n.a. | neg |
36 | M | 55 | adenocarcinoma | neg | neg | neg |
37 | F | 74 | squamous cell carcinoma | n.a. | n.a. | n.a. |
38 | F | 71 | squamous cell carcinoma | n.a. | n.a. | n.a. |
41 | F | 41 | adenocarcinoma | neg | neg | neg |
45 | M | 71 | squamous cell carcinoma | neg | neg | neg |
50 | F | 62 | adenocarcinoma | neg | neg | neg |
51 | M | 59 | squamous cell carcinoma | n.a. | n.a. | n.a. |
52 | M | 78 | adenocarcinoma | neg | neg | neg |
54 | F | 67 | adenocarcinoma | n.a. | neg | neg |
55 | M | 72 | adenocarcinoma | neg | neg | neg |
Abbreviations: ALK: Anaplastic Lymphoma Kinase; EGFR: Epidermal Growth Factor Receptor; F: female; M: male; n.a.: not assessed; mts: metastasis; neg: negative; ROS1: ROS Proto-Oncogene 1, Receptor Tyrosine Kinase.
In 24 out of 32 selected cases (75.0%), the PD-L1 score was concordant between the original report and the expert panel of pathologists (Supplementary Table S1). In the remaining 8 cases, an under (n = 4) or over-estimation (n = 4) of positive tumor cells was reported. Seven cases were negative for PD-L1 expression, 16 were considered to be intermediate expressors, and 9 were classified as strong positive by the expert panel (Supplementary Table S1). The 37 learner pathologists, divided into 7 groups, evaluated all the digital slides on a laptop (Supplementary Table S2). The complete agreement between the expert panel (gold standard) and the participants was reached in a range of 37.5% (group red) to 78.1% of cases (group yellow); the average value was 57.1% (Table 2). A minor disagreement (1 participant per group) was recorded in 16.1% of cases (average value). In the red, orange, and green groups, two cases had no agreement with the referee scores. In Table 3, TPS disagreements were stratified according to the clinical threshold of PD-L1 scoring. The higher agreement rate among pathologists was obtained in PD-L1 high cases, with only 10.8% (36/333 instances) of disagreements with the gold standard, mainly regarding a single challenging case (55.6%, 20/36 instances; n = 37). Conversely, the worst performance was achieved in PD-L1 negative cases, where 32.0% (83/259 instances) of disagreements was demonstrated. Figure 1 highlights some paradigmatic challenging pitfalls due to the staining of macrophages in PD-L1 negative tumors (cases n. 10 and n. 22). In the category of intermediate expressors, 21.3% (126/259 instances) of disagreements was detected. In Figure 2, a spectrum of the possible reactions in cases near to the threshold between the intermediate and strong expressors is shown (cases n. 33 and n. 51). The specific evaluation of the participants’ performances is shown in Supplementary Table S2. The best performance of correct interpretation was 93.8% (pathologist G2). No statistically significant difference in percentage of disagreement (≥2 pathologists) with the gold standard was observed between adenocarcinomas and squamous cell carcinomas (27.3% vs. 25.7%, Supplementary Table S3). A significant difference emerged when considering the different clones adopted (Supplementary Table S4). A lower number of discrepancies (≥2 pathologists) was highlighted in the 22C3 clone group (22.3%) than in the SP263 one (38.1%, p = 0.01)
Table 2.
R (n = 5) | Y (n = 5) | O (n = 5) | V (n = 5) | B (n = 6) | P (n = 6) | G (n = 5) | |
---|---|---|---|---|---|---|---|
1 | |||||||
2 | |||||||
3 | |||||||
4 | |||||||
5 | |||||||
6 | |||||||
7 | |||||||
8 | |||||||
9 | |||||||
10 | |||||||
11 | |||||||
12 | |||||||
45 | |||||||
14 | |||||||
15 | |||||||
16 | |||||||
17 | |||||||
41 | |||||||
20 | |||||||
22 | |||||||
54 | |||||||
52 | |||||||
50 | |||||||
37 | |||||||
55 | |||||||
30 | |||||||
33 | |||||||
34 | |||||||
35 | |||||||
36 | |||||||
38 | |||||||
51 |
Table 3.
(A) PD-L1 NEGATIVE CASES (n = 7) | |||||||
Cases | R (n = 5) | Y (n = 5) | O (n = 5) | V (n = 5) | B (n = 6) | P (n = 6) | G (n = 5) |
3 | |||||||
6 | |||||||
10 | |||||||
22 | |||||||
30 | |||||||
35 | |||||||
36 | |||||||
(B) PD-L1 [1–49] expressorS (n = 16) | |||||||
R (n = 5) | Y (n = 5) | O (n = 5) | V (n = 5) | B (n = 6) | P (n = 6) | G (n = 5) | |
1 | > | < | < | < | |||
4 | < | <(3) | >(1); <(1) | < | |||
9 | <(3) | > | < | <(3) | |||
11 | <(1); >(1) | <(4) | <(2) | < | <(4) | <(2) | <(5) |
12 | > | >(2) | |||||
16 | > | > | > | ||||
17 | < | <(3) | <(4) | <(2) | |||
33 | > | ||||||
38 | < | <(2) | |||||
41 | <(5) | ||||||
45 | |||||||
50 | >(3) | >(2) | >(2); <(1) | >(4) | >(4) | >(2) | |
51 | >(5) | >(3) | >(3); <(1) | >(5) | >(2) | > | |
52 | >(4) | >(5) | >(3) | >(2) | >3 | ||
54 | >(4) | >(1); <(1) | |||||
55 | > | ||||||
(C) PD-L1 STRONG POSITIVE (n = 9) | |||||||
R (n = 5) | Y (n = 5) | O (n = 5) | V (n = 5) | B (n = 6) | P (n = 6) | G (n = 5) | |
2 | |||||||
5 | |||||||
7 | |||||||
8 | |||||||
14 | |||||||
15 | |||||||
20 | |||||||
34 | |||||||
37 |
4. Discussion
Digital pathology technologies for educational, diagnostic, and research purposes are rapidly replacing the use of static images in this era of next-generation pathology. Digitalization of histologic or cytologic slides ensures the possibility to cover the entire slide surface, enabling pathologists to analyze the whole section as with the “traditional” microscope [2]. The application of e-learning during an educational pathology course may reduce the subjectivity and variability of semiquantitative analysis (by using dedicated software) and makes it possible to share interesting or challenging cases for a second opinion [17]. However, despite these possibilities several issues remain unsolved. In particular, efforts have to be devoted to validating the use of digital slides, as reported by The College of American Pathologists (CAP) and the Digital Pathology Association (DPA) guidelines [17,18]. In addition, a not negligible percentage of pathologists feel a lower compliance with digital diagnosis respect to classical microscope evaluation [17]. The gradual introduction of similar tools during informal meetings and courses may help in decreasing the reluctance of pathologists in accepting the digital slides as an option and not an enemy. The development of user-friendly platforms that contribute to the transformation of the traditional vertical didactic exposition of paradigmatic cases into more dynamic and quiz-like experiences is currently available especially to approach single-item issues like the evaluation of a tissue biomarker.
In the era of personalized medicine, immunotherapy with anti-PD-1/PD-L1 monoclonal antibodies represents a relevant clinical option for patients with advanced stage NSCLC [19]. In particular, taking into account the results that have been reported in KEYNOTE 024 and 042 clinical trials, the adoption of pembrolizumab should be considered as the first line or as a valid second line treatment in advanced stage NSCLC patients harboring a PD-L1 expression level ≥50% or between 1–49%, respectively [20,21]. Despite the improvements in molecular techniques, IHC still plays a key role in the evaluation of PD-L1 expression, in order to administer immunotherapy in this setting of patients. However, careful attention should be paid due to extreme intra- and inter-observer variability in the staining interpretation, in particular due to disparities in the training of pathologists [22]. In this setting, digital scanning of selected challenging cases and collegial discussion with a panel of highly trained pathologists may have an educational aim, helping pathologists to better define the expression level of PD-L1 and to ensure the best treatment choice for advanced stage NSCLC patients.
After 3 years of routine testing in the laboratories, this paper focused on the challenges in PD-L1 staining score evaluation among different pathologists by adopting digital slides. Despite a good technical confidence generally being achieved, as demonstrated in previous reports [5], several interpretative issues are remaining unsolved. To overcome these issues, a panel of seven expert pathologists and a large group of pathologists joined a meeting that raised the main diagnostic issues in specific challenging situations. All cases were re-evaluated by the panel of experts, and, after a collegial discussion, in a high percentage of cases (75.0%) an agreement with the score referred in the original report was reached.
Several pre-analytical factors may influence PD-L1 staining score evaluation. Different IHC protocols (i.e., type and duration of antigen retrieval, primary antibody dilution or incubation time, number of steps of amplification) are adopted by diverse laboratories, even when the same primary antibody is used on the same platform with the same detection kit [23,24]. Since recent literature data reported an acceptable overlapping between 22C3 PharmDx clone on Dako Autostainer or SP263 clone on BenchMark platform, the panel enrolled NSCLC cases indifferently stained by using these clones [5]. The SP263 clone demonstrates higher sensitivity than all other assays, but its specificity was lower [19]. A comprehensive meta-analysis showed the sensitivity of PD-L1 IHC 22C3 pharmDx superior than PD-L1 IHC 28-8 pharmDx and then Ventana PD-L1 (SP263) [24]. Other studies have evaluated the interchangeability of PD-L1 assays [10]. However, no data are available in prospective clinical trials. From the patient’s perspective, lower specificity may be acceptable if the progression free survival, overall survival, and adverse effects in patients with PD-L1-negative tumors treated by immunotherapy are comparable to that of conventional chemotherapy [25]. The employment of Ventana platform usually produces more intense immunoreactivity, although the morphological detail of the background tissue is not always well preserved, representing a possible issue in the distinction among tumor and the immune cells (usually alveolar macrophages). On the other hand, the Dako platform provides fairly soft immunoreactions with a better definition of the morphological features of the different cell types [26]. Although the selection of the cases enrolled in the study reflected the normal routine cases, the choice aimed to highlight the intra- and inter-tumoral heterogeneity of PD-L1 expression, which may affect the reproducibility of this analysis (Figure 2). For this reason, challenging borderline cases, characterized by a PD-L1 expression close to the proposed threshold, were included.
To better represent the routine diagnostic cases, the panel selected a mixture of small specimens (bronchoscopy and core needle biopsies), and surgical specimens of NSCLC with different histotypes and respective molecular findings (Table 1). Results were compared with the TPS assigned by the panel of experts (gold standards). The project highlighted interpretative inaccuracies potentially leading to false negative results like the suboptimal application of the magnification rule, which recommends the use of a high-power field for the scoring of mild/focal immunoreactivity [27]. On the contrary, a false positive background was a frequent source of pitfalls due to macrophages staining (Figure 1). Unspecific peri-membranous reactivity in mucous-rich adenocarcinomas may be an additional source of errors for pathologists.
An important issue was related to the evaluation of viable tumor cells. Analytical interpretation of PD-L1 staining score resulted in a false negative result if the pathologists did not establish the specimen adequacy of the tumor sample defined by at least 100 viable neoplastic cells. Another relevant aspect is the staining under-estimation when incomplete tumor membrane staining was evaluated. In previous reports, Cohen’s κ coefficient was 0.68 (95% CI, 0.65–0.71) for the 1% cut point sample set, indicating improvable inter-observer agreement in this subgroup of patients [28]. Conversely, in our series in strong positive patients a large agreement was recorded with only 10.8% of disagreements (Table 3). When considering a positive tumor cell staining around 50%, only a careful and extensive quantitative evaluation may avoid an under-estimation. Moreover, the intensity of PD-L1 staining in heterogeneous cases produced questionable interpretations (Figure 2). Literature data of a similar project showed a slightly lower 0.58 Cohen’s κ coefficient (95% CI, 0.55–0.62) for the 50% cut point sample set, indicating moderate inter-observer agreement [28]. The lower reproducibility at the 50% cut point suggested that training and external quality assessment programs should improve reproducibility of the samples with ≥50% of stained cells, particularly as this cut point is used to select NSCLC patients for anti-PD1 therapy [28]. The misclassification of patients may lead to two orders of problems: patients who do not receive the first line PD-1 blockade therapy, and patients who receive a treatment that is not beneficial [29]. All interpretation issues were a potential concern for pathologists, regardless the assay type [30]. In our study, a complete agreement between the expert panel evaluation (gold standard) and the participants was reached in 57.1% of cases (average value, Table 2). A minor additional disagreement was recorded in 16.1% of cases. The best performer featured 93.8% of correct interpretation (Supplementary Table S2).
The present paper highlights once again the potential of digital pathology. The possibility of sharing slides for educational purpose, to use WSI for agreement and quality control studies and the implementation of a digital pathology workflow for primary histological diagnosis have been widely reported [1,2,3,29]. We are approaching the third revolution in pathology with the implementation of the artificial intelligence (AI) tool even in the everyday practice with the possibility to combine the use of digital pathology and AI tool to support pathologists in the diagnostic setting [30,31]. The possibility of predicting IHC findings directly from H&E slides has also been reported [32], including the PD-L1 status by the developing of a deep learning model [33]. The obtained results were robust to interpathologist variability, opening new possibilities for PD-L1 assessment, especially when there is insufficient tissue for all the needed IHC and molecular tests.
5. Conclusions
In the educational setting, despite an initial reluctance, Italian pathologists’ welcome digital slides as an opportunity to share opinions and challenging cases and evaluate specific biomarkers’ performances like PD-L1. The educational digital pathology meeting planned in Rome in 2019 stressed the diagnostic challenges of the PD-L1 IHC testing, collecting NSCLC samples from the Italian national laboratories network. Despite the small number of cases (n = 32) and the limited inter-observer reproducibility due to the impossibility to compare the results before and after training, the projected addresses some interpretative components underlying possible sources of disagreement related to the PD-L1 testing. Due to the crucial role of PD-L1 TPS for the selection of patients for immunotherapy, the identification of educational unconventional approaches as virtual slides to focus experiences and give more detailed practical verifications of the standard quality reached may be a considerable option.
Acknowledgments
Greta | Alì | Pisa |
Camilla | Comin | Florence |
Cristiana | Bellan | Siena |
Luisa | Bercich | Brescia |
Francesca | Bono | Monza |
Germana | Bortuzzo | Legnago |
Simonetta | Buglioni | Rome |
Severo | Campione | Naples |
Anna Maria | Chiaravalli | Varese |
Patrizia | Cocco | Padua |
Filippo | Crivelli | Busto Arsizio |
Antonella | D’Angelo | Ascoli Piceno |
Rossella | De Cecio | Naples |
Vladimiro | Dell’Anna | Lecce |
Arianna | Di Napoli | Rome |
Alessandra | Fabbri | Milan |
Amedeo | Ferlosio | Rome |
Nicola | Fusco | Milan |
Maria Grazia | Galasso | Catania |
Alfio | Iearaci | Pescara |
Giorgia | Leone | Catania |
Narciso | Mariani | Alessandria |
Maurizio | Martini | Rome |
Andrea | Menin | Vicenza |
Pietro | Micheli | Naples |
Marco | Mora | Genoa |
Enrico | Orvieto | Rovigo |
Angelina | Pernazza | Rom |
Giuseppe | Perrone | Rome |
Francesca Romana | Piro | Rome |
Michele | Pirrelli | Taranto |
Agnese | Proietti | Pisa |
Daniele | Remotti | Rome |
Giosuè | Scognamiglio | Naples |
Angelo | Sidoni | Perugia |
Silvia | Taccogna | Albano Laziale |
Valentina | Terrinazzi | Empoli |
Xenia | Trabucco | Bari |
Paolo | Visca | Rome |
Giulia | Vita | Rionero in Vulture |
Supplementary Materials
The following are available online at https://www.mdpi.com/2072-6694/12/7/1800/s1, Table S1: Original PD-L1 evaluation, expert board consensus and prevalent scoring for every group; discrepancies between the original evaluation and the panel are highlighted in orange; participants errors in yellow. Table S2: Participant performances. In the columns, the cases of the study were grouped from negative (green, TPS< 1%) to strong positive (red, TPS ≥50%). In the first row, the gold standard results recorded by the board of experts were listed. In the following rows, for every pathologist who attended the meeting a progressive number and a colour of identification corresponding to the table were assigned. (ie.e R1: red table, participant number 1). Table S3: Performances by histotype. Table S4: Performances by clone.
Author Contributions
Conceptualization, F.P., U.M., C.D., G.F., F.F., P.G., A.M., M.B., G.T.; methodology, E.G.R., P.P., E.V.V., F.B., M.J., N.F.; software, E.G.R., F.B., M.J.; validation, F.P., U.M., C.D., G.F., F.F., P.G., A.M., M.B., G.T.; formal analysis, E.G.R., F.B., M.J.; investigation, E.G.R., P.P., E.V.V., F.B., M.J., N.F.; resources, F.P., U.M., C.D., G.F., F.F., P.G., A.M., M.B., G.T.; data curation, F.P., U.M., P.P., M.J.; writing—original draft preparation, F.P., U.M.; writing—review and editing, all authors; visualization, all authors; supervision, F.P., U.M., C.D., G.F., F.F., P.G., A.M., M.B., G.T.; project administration, F.P., U.M., C.D., G.F., F.F., P.G., A.M., M.B., G.T. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Conflicts of Interest
The authors declare no conflict of interest.
References
- 1.Pantanowitz L., Sharma A., Carter A.B., Kurc T., Sussman A., Saltz J. Twenty Years of Digital Pathology: An Overview of the Road Travelled, What is on the Horizon, and the Emergence of Vendor-Neutral Archives. J. Pathol. Inform. 2018;9:40. doi: 10.4103/jpi.jpi_69_18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Romer D.J., Suster S. Use of virtual microscopy for didactic live-audience presentation in anatomic pathology. Ann. Diagn. Pathol. 2003;7:67–72. doi: 10.1053/adpa.2003.50021. [DOI] [PubMed] [Google Scholar]
- 3.Williams B.J., Bottoms D., Treanor D. Future-proofing pathology: The case for clinical adoption of digital pathology. J. Clin. Pathol. 2017;70:1010–1018. doi: 10.1136/jclinpath-2017-204644. [DOI] [PubMed] [Google Scholar]
- 4.Tsao M.S., Kerr K.M., Kockx M., Beasley M.B., Borczuk A.C., Botling J., Bubendorf L., Chirieac L., Chen G., Chou T.Y., et al. PD-L1 Immunohistochemistry Comparability Study in Real-Life Clinical Samples: Results of Blueprint Phase 2 Project. J. Thorac. Oncol. 2018;13:1302–1311. doi: 10.1016/j.jtho.2018.05.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Marchetti A., Barberis M., Franco R., De Luca G., Pace M.V., Staibano S., Volante M., Buttitta F., Guerini-Rocco E., Righi L., et al. Multicenter Comparison of 22C3 PharmDx (Agilent) and SP263 (Ventana) Assays to Test PD-L1 Expression for NSCLC Patients to Be Treated with Immune Checkpoint Inhibitors. J. Thorac. Oncol. 2017;12:1654–1663. doi: 10.1016/j.jtho.2017.07.031. [DOI] [PubMed] [Google Scholar]
- 6.Gong J., Chehrazi-Raffle A., Reddi S., Salgia R. Development of PD-1 and PD-L1 inhibitors as a form of cancer immunotherapy: A comprehensive review of registration trials and future considerations. J. Immunother. Cancer. 2018;6:8. doi: 10.1186/s40425-018-0316-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Ratcliffe M.J., Sharpe A., Midha A., Barker C., Scott M., Scorer P., Al-Masri H., Rebelatto M.C., Walker J. Agreement between Programmed Cell Death Ligand-1 Diagnostic Assays across Multiple Protein Expression Cutoffs in Non-Small Cell Lung Cancer. Clin. Cancer Res. 2017;23:3585–3591. doi: 10.1158/1078-0432.CCR-16-2375. [DOI] [PubMed] [Google Scholar]
- 8.McLaughlin J., Han G., Schalper K.A., Carvajal-Hausdorf D., Pelekanou V., Rehman J., Velcheti V., Herbst R., LoRusso P., Rimm D.L. Quantitative Assessment of the Heterogeneity of PD-L1 Expression in Non-Small-Cell Lung Cancer. JAMA Oncol. 2016;2:46–54. doi: 10.1001/jamaoncol.2015.3638. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Rimm D.L., Han G., Taube J.M., Yi E.S., Bridge J.A., Flieder D.B., Homer R., West W.W., Wu H., Roden A.C., et al. A Prospective, Multi-institutional, Pathologist-Based Assessment of 4 Immunohistochemistry Assays for PD-L1 Expression in Non-Small Cell Lung Cancer. JAMA Oncol. 2017;3:1051–1058. doi: 10.1001/jamaoncol.2017.0013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Hendry S., Byrne D.J., Wright G.M., Young R.J., Sturrock S., Cooper W.A., Fox S.B. Comparison of Four PD-L1 Immunohistochemical Assays in Lung Cancer. J. Thorac. Oncol. 2018;13:367–376. doi: 10.1016/j.jtho.2017.11.112. [DOI] [PubMed] [Google Scholar]
- 11.Adam J., Le Stang N., Rouquette I., Cazes A., Badoual C., Pinot-Roussel H., Tixier L., Danel C., Damiola F., Damotte D., et al. Multicenter harmonization study for PD-L1 IHC testing in non-small-cell lung cancer. Ann. Oncol. 2018;29:953–958. doi: 10.1093/annonc/mdy014. [DOI] [PubMed] [Google Scholar]
- 12.Büttner R., Gosney J.R., Skov B.G., Adam J., Motoi N., Bloom K.J., Dietel M., Longshore J.W., López-Ríos F., Penault-Llorca F., et al. Programmed Death-Ligand 1 Immunohistochemistry Testing: A Review of Analytical Assays and Clinical Implementation in Non-Small-Cell Lung Cancer. J. Clin. Oncol. 2017;35:3867–3876. doi: 10.1200/JCO.2017.74.7642. [DOI] [PubMed] [Google Scholar]
- 13.Yu H., Boyle T.A., Zhou C., Rimm D.L., Hirsch F.R. PD-L1 Expression in Lung Cancer. J. Thorac. Oncol. 2016;11:964–975. doi: 10.1016/j.jtho.2016.04.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Scheel A.H., Dietel M., Heukamp L.C., Jöhrens K., Kirchner T., Reu S., Rüschoff J., Schildhaus H.U., Schirmacher P., Tiemann M., et al. Harmonized PD-L1 immunohistochemistry for pulmonary squamous-cell and adenocarcinomas. Mod. Pathol. 2016;29:1165–1172. doi: 10.1038/modpathol.2016.117. [DOI] [PubMed] [Google Scholar]
- 15.Hirsch F.R., McElhinny A., Stanforth D., Ranger-Moore. J., Jansson M., Kulangara K., Richardson W., Towne P., Hanks D., Vennapusa B., et al. PD-L1 Immunohistochemistry Assays for Lung Cancer: Results from Phase 1 of the Blueprint PD-L1 IHC Assay Comparison Project. J. Thorac. Oncol. 2017;12:208–222. doi: 10.1016/j.jtho.2016.11.2228. [DOI] [PubMed] [Google Scholar]
- 16.Vigliar E., Malapelle U., Bono F., Fusco N., Cortinovis D., Valtorta E., Spyridon A., Bimbatti M., Zocchi M., Piva C., et al. The Reproducibility of the Immunohistochemical PD-L1 Testing in Non-Small-Cell Lung Cancer: A Multicentric Italian Experience. BioMed Res. Int. 2019;2019:6832909. doi: 10.1155/2019/6832909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Griffin J., Treanor D. Digital pathology in clinical use: Where are we now and what is holding us back? Histopathology. 2017;70:134–145. doi: 10.1111/his.12993. [DOI] [PubMed] [Google Scholar]
- 18.Pantanowitz L., Sinard J.H., Henricks W.H., Fatheree L.A., Carter A.B., Contis L., Beckwith B.A., Evans A.J., Lal A., Parwani. A.V., et al. Validating whole slide imaging for diagnostic purposes in pathology: Guideline from the College of American Pathologists Pathology and Laboratory Quality Center. Arch. Pathol. Lab. Med. 2013;137:1710–1722. doi: 10.5858/arpa.2013-0093-CP. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Kim H., Chung J.H. PD-L1 Testing in Non-small Cell Lung Cancer: Past, Present, and Future. J. Pathol. Transl. Med. 2019;53:199–206. doi: 10.4132/jptm.2019.04.24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Torlakovic E., Lim H.J., Adam J., Barnes P., Bigras G., Chan A.W.H., Cheung C.C., Chung J.H., Couture C., Fiset P.O., et al. “Interchangeability” of PD-L1 immunohistochemistry assays: A meta-analysis of diagnostic accuracy. Mod. Pathol. 2020;33:4–17. doi: 10.1038/s41379-019-0327-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Munari E., Rossi G., Zamboni G., Lunardi G., Marconi M., Sommaggio M., Netto G.J., Hoque M.O., Brunelli M., Martignoni G., et al. PD-L1 Assays 22C3 and SP263 are Not Interchangeable in Non-Small Cell Lung Cancer When Considering Clinically Relevant Cutoffs: An Interclone Evaluation by Differently Trained Pathologists. Am. J. Surg. Pathol. 2018;42:1384–1389. doi: 10.1097/PAS.0000000000001105. [DOI] [PubMed] [Google Scholar]
- 22.Pagni F., Guerini-Rocco E., Schultheis A.M., Grazia G., Rijavec E., Ghidini M., Lopez G., Venetis K., Croci G.A., Malapelle U., et al. Targeting Immune-Related Biological Processes in Solid Tumors: We do Need Biomarkers. Int. J. Mol. Sci. 2019;20:5452. doi: 10.3390/ijms20215452. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Giunchi F., Gevaert T., Scarpelli M., Fiorentino M. Status of Programmed Death Ligand 1 (PD-L1) by immunohistochemistry and scoring algorithms. Curr. Drug Targets. 2020 doi: 10.2174/1389450121666200123124642. Epub Ahead of Print. [DOI] [PubMed] [Google Scholar]
- 24.Cooper W.A., Russell P.A., Cherian M., Duhig E.E., Godbolt D., Jessup P.J., Khoo C., Leslie C., Mahar A., Moffat D.F., et al. Intra- and Interobserver Reproducibility Assessment of PD-L1 Biomarker in Non-Small Cell Lung Cancer. Clin. Cancer Res. 2017;23:4569–4577. doi: 10.1158/1078-0432.CCR-17-0151. [DOI] [PubMed] [Google Scholar]
- 25.Mazieres J., Kowalski D., Luft A., Vicente D., Tafreshi A., Gümüş M., Laktionov K., Hermes B., Cicin I., Rodríguez-Cid J., et al. Health-Related Quality of Life With Carboplatin-Paclitaxel or nab-Paclitaxel With or Without Pembrolizumab in Patients With Metastatic Squamous Non-Small-Cell Lung Cancer. J. Clin. Oncol. 2020;38:271–280. doi: 10.1200/JCO.19.01348. [DOI] [PubMed] [Google Scholar]
- 26.Fujimoto D., Sato Y., Uehara K., Ishida K., Fukuoka J., Morimoto T., Kawachi H., Mori R., Ito M., Teraoka S., et al. Predictive Performance of Four Programmed Cell Death Ligand 1 Assay Systems on Nivolumab Response in Previously Treated Patients with Non-Small Cell Lung Cancer. J. Thorac. Oncol. 2018;13:377–386. doi: 10.1016/j.jtho.2017.11.123. [DOI] [PubMed] [Google Scholar]
- 27.Tsao M.S., Kerr K.M., Dacic S.A., Yatabe Y.A., Hirsch F.R. IASLC atlas of PD-L1 immunohistochemistry testing in lung cancer. 1st ed. Editorial Rx Press; Aurora, CO, USA: 2017. [Google Scholar]
- 28.Kim H., Kwon H.J., Park S.Y., Park E., Chung J.H. PD-L1 immunohistochemical assays for assessment of therapeutic strategies involving immune checkpoint inhibitors in non-small cell lung cancer: A comparative study. Oncotarget. 2017;8:98524–98532. doi: 10.18632/oncotarget.21567. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Fraggetta F., Garozzo S., Zannoni G.F., Pantanowitz L. Rossi ED Routine Digital Pathology Workflow: The Catania Experience. J. Pathol. Inform. 2017;8:51. doi: 10.4103/jpi.jpi_58_17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Salto-Tellez M., Maxwell P., Hamilton P. Artificial Intelligence-The Third Revolution in Pathology. Histopathology. 2019;74:372–376. doi: 10.1111/his.13760. [DOI] [PubMed] [Google Scholar]
- 31.Campanella G., Hanna M.G., Geneslaw L., Miraflor A., Silva V.W.K., Busam K.J., Brogi E., Reuter V.E., Klimstra D.S., Fuchs T.J. Clinical-grade Computational Pathology Using Weakly Supervised Deep Learning on Whole Slide Images. Nat. Med. 2019;25:1301–1309. doi: 10.1038/s41591-019-0508-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Martino F., Varricchio S., Russo D., Merolla F., Ilardi G., Mascolo M., dell’Aversana G.O., Califano L., Toscano G., Pietro G., et al. A Machine-learning Approach for the Assessment of the Proliferative Compartment of Solid Tumors on Hematoxylin-Eosin-Stained Sections. Cancers. 2020;12:1344. doi: 10.3390/cancers12051344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Sha L., Osinski B.L., Ho I.Y., Tan T.L., Willis C., Weiss H., Beaubier N., Mahon B.M., Taxter T.J., Yip S.S.F. Multi-Field-of-View Deep Learning Model Predicts Nonsmall Cell Lung Cancer Programmed Death-Ligand 1 Status From Whole-Slide Hematoxylin and Eosin Images. J. Pathol. Inform. 2019;10:24. doi: 10.4103/jpi.jpi_24_19. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.