Prognostic value and reproducibility of different microscopic characteristics in the WHO grading systems for pTa and pT1 urinary bladder urothelial carcinomas

Vebjørn Kvikstad; Ok Målfrid Mangrud; Einar Gudlaugsson; Ingvild Dalen; Hans Espeland; Jan P A Baak; Emiel A M Janssen

doi:10.1186/s13000-019-0868-3

. 2019 Aug 14;14:90. doi: 10.1186/s13000-019-0868-3

Prognostic value and reproducibility of different microscopic characteristics in the WHO grading systems for pTa and pT1 urinary bladder urothelial carcinomas

Vebjørn Kvikstad ^1,^2,^✉,^#, Ok Målfrid Mangrud ^3,^#, Einar Gudlaugsson ¹, Ingvild Dalen ⁴, Hans Espeland ⁵, Jan P A Baak ^1,^6,^7,^#, Emiel A M Janssen ^1,^2,^#

PMCID: PMC6694469 PMID: 31412916

Abstract

Background

European treatment guidelines for pTa and pT1 urinary bladder urothelial carcinoma depend highly on stage and WHO-grade. Both the WHO73 and the WHO04 grading systems show some intra- and interobserver variability. The current pilot study investigates which histopathological features are especially sensitive for this undesired lack of reproducibility and the influence on prognostic value.

Methods

Thirty-eight cases of primary non-muscle invasive urothelial carcinomas, including thirteen cases with stage progression, were reviewed by three pathologists. Thirteen microscopic features were extracted from pathology textbooks and evaluated separately. Reproducibility was measured using Gwet’s agreement coefficients. Prognostic ability regarding progression was estimated by the area under curve (AUC) of the receiver operating characteristics (ROC) function.

Results

The best reproducible features (Gwet’s agreement coefficient above 0.60) were papillary architecture, nuclear polarity, cellular maturation, nuclear enlargement and giant nuclei. Nucleoli was the strongest prognostic feature, and the only feature with an AUC above 0.70 for both grading systems, but reproducibility was not among the strongest. Nuclear polarity also had prognostic value with an AUC of 0.70 and 0.67 for the WHO73 and WHO04, respectively. The other features did not have significant prognostic value.

Conclusions

The reproducibility of the histopathological features of the different WHO grading systems varied considerably. Of all the features evaluated, only nuclear polarity was both prognostic and significantly reproducible. Further validation studies are needed on these features to improve grading of urothelial carcinomas.

Keywords: Papillary urothelial carcinoma, Grading, Reproducibility, Prognosis

Background

Bladder cancer is the ninth most frequently diagnosed cancer worldwide. The incidence is highest in developed countries, and is the fourth most common cancer among men in Norway [1, 2]. Urothelial carcinoma accounts for about 90% of bladder cancers in industrialized countries [3], and 70–80% of these are non-muscle-invasive bladder cancers (NMIBC), pTa, pT1 or pTis, on first diagnosis. Among these 50–70% will recur, while only 15–25% will progress to a higher stage [4]. The follow-up of these patients is labor-intensive [5, 6], causing massive costs for the health care systems [7].

Papillary urothelial carcinomas are the most frequent in western countries and are graded based on the degree of anaplasia. In 1973 the World Health Organization (WHO) introduced a classification system, in which papillary carcinomas were divided into three groups; grades 1, 2 and 3 (WHO73). A new classification system was introduced in the 2004 WHO Classification of tumours of the urinary system (“blue book”), following an International Society of Urological Pathology (ISUP) consensus conference in 1998 (WHO04). This grading system is maintained in the 4th.edition, 2016, of the WHO blue book. Currently, both systems are being used in routine diagnostics at pathology departments around the world [8]. The WHO04 classification system divided the papillary urothelial tumours into papillary urothelial neoplasm of low malignant potential (PUNLMP), low and high grade carcinomas. The histologic features are described in detail, aiming to improve reproducibility. However, several studies have shown considerable interobserver variability for both classification systems [9–11]. In a recent review Soukup et al. [12] conclude, on behalf of the European Association of Urology (EAU), that the “Current grading classifications in NMIBC are suboptimal”, both with regards to reproducibility (poor to fair) and with regards to prognostication.

Grading of papillary urothelial carcinomas according to the WHO73 and the WHO04 classification systems is based on a variety of histopathological features. However, these are not necessarily consciously and systematically analysed one-by-one in a routine diagnostic setting by diagnostic pathologists. Rather than a time consuming analytical approach, many pathologists make a first-glance low-magnification diagnosis, and zoom in on special areas or features to get their diagnosis confirmed. This is a quick, time-effective method but a drawback is lack of reproducibility, with classification shifts from one to other grades and hence prognostic variation as well.

The aim of this pilot study was to systematically analyse the reproducibility and prognostic value of each of the microscopic features. As far as we know, this has not been done before; although previous work on mitotic activity in urothelial carcinoma has found mitosis to be a prognostic factor [13, 14].

Methods

The study was approved by the Norwegian Regional Ethics Committee (#106/09). All patients with a primary non-muscle-invasive papillary urothelial carcinoma, at Stavanger University Hospital (SUH) from January 2002 to January 2007 were investigated (N = 228). All patients with urothelial carcinoma outside the urinary bladder (except for those with tumour in the pericollicular area in the urethra) were excluded. Thirty-five cases were excluded because of inadequate sample quality (necrotic tumour, fragmentation, thermal damage and insufficient material), leaving a total of 185 patients. Of these, 13 patients had stage progression; 12 within 5 years, and one after 5 years and 1 month.

In this pilot study we selected a group of 38 patients, including the 13 with progression and 25 without progression. Among the 13 patients with progression 10 were high grade and 3 were low grade according to WHO04. Patients without progression were randomly selected from the remaining 172 patients. There were no statistical significant differences between the grade, age, sex, recurrence or follow-up time of the selected 25 and the other 147 patients without progression.

Tumour tissue was obtained by transurethral resection or biopsy. Tissue was fixed in 4% buffered formaldehyde, dehydrated and embedded in paraffin. For microscopic evaluation four μm thick sections stained with haematoxylin-eosin-saffron (HES) were used.

The patients were treated according to the national guidelines at the time of diagnoses. The treatment consisted of transurethral resection (TUR), followed by a single instillation of a cytotoxic agent (epirubicin hydrochloride). Most patients defined as high risk patients were offered regular instillations with Bacillus Calmette Guérin (BCG), but some were offered alternative treatment with regular instillations containing a combination of epirubicin hydrochloride and interferon alpha. High risk patients included stage T1, grade 3 (WHO73), concurrent or later carcinoma in situ (pTis), three or more separate tumours diagnosed within 18 months or recurrences at multiple sites at first or second follow-up. Provided that the first follow-up cystoscopy was negative, patients with Ta grade 1 tumours would undergo control cystoscopies 3 months after initial diagnosis, 9 months later, and then annually for 5 years. All other patients would have cystoscopies every 3 months for the first 2 years, every 4 months for the 3rd year, every 6 month the 4th and 5th years, followed by annual cystoscopies thereafter.

Follow-up data were retrieved from the medical- and laboratory records at SUH. We defined progression as any advance in TNM stage, including both from pTa to pT1 or to pT2, and from pT1 to pT2. Progression to muscle invasive disease is clinically most relevant due to major differences in therapy. We also included cases with progression from pTa to pT1 as these tumours have gained the capability to infiltrate the stroma, a basic trait for progression.

The histopathological features constituting the grading systems were derived from urological pathology textbooks [15–17]. A list of the microscopic features and their interpretation, both for WHO73 and WHO04, is shown in Table 1. We extracted 13 features: papillae architecture, superficial layer, papillary fusion, nuclear polarity, cell maturation, cohesion, mitoses, nuclear enlargement, nuclear shape, nuclear hyperchromasia, chromatin pattern, nucleoli and giant nuclei.

Table 1.

The microscopic features with descriptions for each grade (WHO73/ 04)

	WHO73			WHO04
	Grade 1	Grade 2	Grade 3	Low grade	High grade
Architecture
Papillae	Delicate	Varies	Broad, varies	Slender	Broad
Superficial layer (umbrella cell layer)	Usually present	Usually present	Partially or completely lost	Usually present	Partially or completely lost
Papillary fusion	Some	Varies	Common	Some	Varies
Nuclear arrangement
Polarity	Preserved	Moderate loss	Lost	Preserved, moderate loss	Lost
Maturation	Normal	Some	Lost	Preserved, moderate loss	Lost
Cohesion	Normal	Some	Lost	Some	Lost
Proliferation
Mitotic figures	Rare, basal	Lower half	Common, atypical	Rare	Common
Nuclear atypia
Nuclear enlargement	Mild	Mild	Varies	Mild	Varies
Nuclear shape	Uniform	Moderate variation	Pleomorphic	Moderate variation	Pleomorphic
Nuclear hyperchromasia	Mild	Moderate	Varies	Mild to moderate	Varies
Chromatin pattern	Finely granular	Granular	Coarse	Fine	Coarse
Nucleoli	Occasional	Occasional	Common	Occasional	Common
Giant nuclei	No	No	Yes	No	Yes

Open in a new tab

All specimens were evaluated by three pathologists, focusing on grading criteria of the individual features, one at a time, for both WHO73 and WHO04. In tumours with morphological heterogeneity the “worst” area was graded. The evaluations were done without any knowledge about the original diagnosis or the other pathologists’ results. At a later stage, all three pathologists contributed to a consensus assessment for all the variables. Concerning the WHO04, only low grade and high grade were used as only three cases were classified as PUNLMP in our original cohort. In a previous study we found that recurrence and stage progression in the PUNLMPs and the low grade tumours by univariate survival analysis on our material were no different [18]. A later publication by Kim et al. [19] also showed no difference in progression between PUNLMP and low grade carcinomas.

Statistics

Reproducibility was measured using Gwet’s AC₁ agreement coefficient [20] for features with two categories, and using Gwet’s AC₂ agreement coefficient with quadratic weights for features with > 2 categories [21]. Fleiss’ generalized kappa [22] is also reported for reference; however, due to its vulnerability to skewed marginal distributions [23], the focus in this paper is on Gwet’s agreement coefficients. A coefficient of < 0.2 is defined as poor agreement, 0.2–0.4 fair agreement, 0.4–0.6 moderate agreement, 0.6–0.8 good agreement and > 0.8 as very good agreement [24]. Confidence intervals (CIs) for the reliability measures were based on the normal approximation [21].

Prognostic ability with regard to progression for the consensus classification of each feature was estimated by the area under curve (AUC) of the receiver operating characteristics (ROC) function, which is reported with a normal based confidence interval [25]. Statistical analysis was performed in R version 3.4.0 with syntax provided at http://www.agreestat.com/r_functions.html (downloaded 24.05.2018) and with package pROC [25].

Results

The median age at diagnosis was 72 years (range 56–87). Thirty patients were male (79%) and eight female (21%) (M:F ratio = 3.8). Median follow-up time was 73 months (range 5–168). Not all samples were regarded adequate for assessing all the microscopic features by all three pathologists. These cases were not included in the calculation of reliability for that particular feature (Table 2). At the consensus meeting, there was agreement that two cases could not be used to assess the feature “papillary fusion”. There were also two cases in which “maturation” could not be reliably assessed, and in one case “superficial layer” could not be assessed. This left between 36 to 38 total cases for each of the different features.

Table 2.

Reproducibility and prognostic value for each of the microscopic characteristics

Feature	n*	AC₁/AC₂ (95% CI)	Fleiss’ κ (95% CI)	n**	Consensus grade (prob. of progression)	AUC_ROC (95% CI) ***
Papillae73	36	0.62 (0.42 to 0.82)	0.63 (0.45 to 0.82)	38	Delicate (1/10) Varies (4/11) Broad, varies (8/17)	0.67 (0.51 to 0.83)
Papilae04	36	0.61 (0.39 to 0.82)	0.59 (0.37 to 0.81)	38	Slender (3/16) Broad (10/22)	0.64 (0.49 to 0.80)
Superficial layer73/04	36	0.51 (0.30 to 0.73)	0.50 (0.29 to 0.72)	37	Usually present (4/12) Partially lost (8/25)	0.49 (0.33 to 0.66)
Papillary fusion73	34	0.64 (0.44 to 0.84)	0.67 (0.48 to 0.86)	36	Some (2/13) Varies (4/9) Common (7/14)	0.67 (0.50 to 0.84)
Papillary fusion04	34	0.53 (0.32 to 0.75)	0.53 (0.31 to 0.75)	36	Some (4/19) Varies (8/17)	0.67 (0.51 to 0.84)
Polarity73	38	0.68 (0.53 to 0.82)	0.70 (0.55 to 0.84)	38	Preserved (1/9) Moderate (4/14) Lost (8/15)	0.70 (0.54 to 0.86)
Polarity04	38	0.66 (0.47 to 0.86)	0.63 (0.43 to 0.84)	38	Preserved (5/23) Lost (8/15)	0.67 (0.50 to 0.83)
Maturation73	36	0.60 (0.43 to 0.78)	0.59 (0.42 to 0.76)	36	Normal (1/9) Some (5/14) Lost (6/13)	0.66 (0.49 to 0.83)
Maturation04	36	0.62 (0.42 to 0.82)	0.60 (0.40 to 0.81)	36	Some (6/23) Lost (6/13)	0.60 (0.43 to 0.78)
Cohesion73	37	0.57 (0.42 to 0.71)	0.47 (0.28 to 0.65)	38	Normal (1/12) Some (9/21) Lost (3/5)	0.71 (0.56 to 0.85)
Cohesion04	37	0.54 (0.30 to 0.77)	0.23 (−0.02 to 0.47)	38	Some (10/33) Lost (3/5)	0.58 (0.44 to 0.71)
Mitosis73	38	0.47 (0.23 to 0.71)	0.41 (0.18 to 0.64)	38	Rare, basal (8/31) Lower half (1/1) Common, atypical (4/6)	0.65 (0.50 to 0.80)
Mitosis04	38	0.64 (0.43 to 0.85)	0.49 (0.25 to 0.72)	38	Rare (9/32) Common (4/6)	0.61 (0.47 to 0.76)
Nuclear enlargement73/04	38	0.65 (0.45 to 0.85)	0.65 (0.45 to 0.84)	38	Mild (4/19) Varies (9/19)	0.65 (0.48 to 0.81)
Nuclear shape73	38	0.58 (0.41 to 0.74)	0.51 (0.32 to 0.69)	38	Uniform (3/10) Moderate (8/23) Pleomorphic (2/5)	0.53 (0.36 to 0.71)
Nuclear shape04	38	0.58 (0.34 to 0.81)	0.41 (0.21 to 0.61)	38	Moderate (11/33) Pleomorphic (2/5)	0.52 (0.40 to 0.64)
Nuclear hyperchromasia73	38	0.51 (0.38 to 0.65)	0.51 (0.35 to 0.68)	38	Mild (3/11) Moderate (6/17) Varies (4/10)	0.56 (0.37 to 0.74)
Nuclear hyperchromasia04	38	0.51 (0.28 to 0.74)	0.43 (0.21 to 0.65)	38	Mild to moderate (9/28) Varies (4/10)	0.53 (0.38 to 0.69)
Chromatin pattern73	38	0.51 (0.29 to 0.73)	0.46 (0.26 to 0.67)	38	Finely granular (7/25) Granular (4/10) Coarse (2/3)	0.60 (0.43 to 0.78)
Chromatin pattern04	38	0.66 (0.47 to 0.86)	0.55 (0.31 to 0.79)	38	Fine (9/31) Coarse (4/7)	0.59 (0.45 to 0.74)
Nucleoli73/04	38	0.54 (0.33 to 0.76)	0.54 (0.33 to 0.75)	38	Occasional (2/16) Common (11/22)	0.70 (0.56 to 0.85)
Giant nuclei	38	0.85 (0.72 to 0.98)	0.78 (0.59 to 0.98)	38	No (8/28) Yes (5/10)	0.59 (0.43 to 0.75)

Open in a new tab

AC₁/ AC₂ Gwet’s AC₁/ AC₂ coefficient, CI Confidence interval, AUC_ROC Area under Receiver Operating Characteristics Curve.

* Number of cases evaluated by all three pathologists

** Number of cases for which consensus was reached

The reproducibility varies among the different microscopic features according to the calculated Gwet’s AC_1/2 agreement coefficient (Table 2). The values range from 0.47 for mitosis in the WHO73 system to 0.85 for giant nuclei. This corresponds to moderate to very good reproducibility. The other features yielded evenly distributed values, with papillae architecture, nuclear polarity, cell maturation, nuclear enlargement and giant nuclei as the most reproducible, all with Gwet’s AC_1/2 agreement coefficient above 0.60 (=good agreement) for both grading systems. Several of the values have very wide confidence intervals, making them less robust. For instance, for mitosis73 the confidence interval ranges from 0.23 to 0.71.

Prognostic ability for the different features, estimated by AUC, ranged from 0.49 for superficial layer, to 0.71 for cohesion in WHO73. To qualify as reliable, we wanted the features to be convincing (> 0.7) for both WHO73 and WHO04. For instance, cohesion generated an AUC of 0.58 for WHO04, and should therefore not be relied on in our material. Only nucleoli achieved an AUC above 0.7 for both WHO73 and WHO04, which is seen as an acceptable discrimination for progression or not. Polarity tends to show some prognostic information for both grading systems with AUC 0.70/ 0.67 for WHO73 and WHO04 respectively. These two features and papillary fusion gave estimated confidence intervals ≥0.5 for both grading systems. The other ten features showed no statistical significant prognostic value.

Nuclear polarity was the only feature with both reasonable reproducibility and prognostic value in this pilot study.

Discussion

Grade is seen as one of the most important prognostic factors in bladder cancer, with impact on treatment and patient follow-up. As reproducibility of both WHO73 and WHO04 is suboptimal, we systematically analysed the reproducibility and prognostic value of each of the microscopic features described as being part of grading. Each of the 13 features, which theoretically should be used to reach the final grade, carries its own uncertainty in terms of reproducibility and prognostic value.

In the absence of a formal prognostic decision tree of microscopic features in urinary bladder cancers, and lack of a descriptive atlas with typical pictures, pathologists will emphasize each feature differently while grading a urinary bladder tumour. The assessment of grade is therefore more or less based on intuition, as the features are not evaluated in a systematic manner, and only rarely truly quantitatively. This partially explains the considerable difficulty with reproducibility. Furthermore, the thresholds for the different subclasses of each of the included features are very subjective (example: the described thresholds for cohesion are: normal, some or lost). Such descriptive and subjective criteria lead to diagnostic confusion. In the process of grading, pathologists will also be challenged by laboratory variables like section thickness which might blur nuclear hyperchromasia or the introduction of artefacts that might mimic dyscohesiveness. The individual prognostic values of these features has never been analysed separately in urinary bladder tumours.

Before our analyses we expected mitoses to be a useful feature, as reported in a previous study on bladder cancer [13]. In the current analyses, mitosis was one of the least reproducible and prognostic features. However, mitotic activity in the current study was assessed in a semi-quantitative manner. Contrary, previous studies which reported mitoses as a strong prognostic factor, counted mitoses in a defined area by using the protocol for Mitotic Activity Index (MAI) as it is used and developed for breast cancer, and the final number of mitoses was used to categorize the tumours. When grading according to either of the WHO-systems, a rough mitotic impression, rather than a formalized mitotic count is used. This may explain the differences in prognostic value and reproducibility. Such a prognostic difference between mitotic activity as the MAI (truly quantitative) and mitotic impression (a rough estimate) has previously been shown in breast cancer [26], and may be true for urothelial carcinoma as well.

To be clinically useful, a grading system should be well reproducible to assure the intended sensitivity and specificity. As known the final grade is the sum of an evaluation of different microscopic features, therefore if one of these features is not truly quantitative, it inevitably will lack reproducibility and this will affect the final grade as well. Individual features may have a prognostic potential, which might be hidden by low overall reproducibility. It is crucial to minimalize the interobserver variability, making these features more reliable before extracting and emphasizing the features giving the best prognostic information. These features might be evaluated separately in a new grading system.

One way to improve reproducibility could be to provide pathologists with an image atlas with examples of the various features, facilitating comparison with the tumour to be graded. In prostate adenocarcinoma, the Gleason score has been well documented, tested and tried since its introduction in 1966 [27]. It has been claimed that the success of the system may in part be attributed to the ease of application and the simplicity of the original drawings [15]. Although the Gleason score has issues regarding reproducibility as well, especially when differentiating between Gleason grade group 2 and 3 [28, 29], the system as a whole has proven to be an important predictor of prognosis [30, 31]. A similar system with simplified, stylized illustrations may improve grading reproducibility in bladder cancer as well.

In this study nuclear polarity stands out as the most valuable histopathological feature in grading. This supports the current view that architectural and cytological order versus disorder decides whether a lesion should be regarded as low or high grade in the WHO04 grading system. Strict definitions will be necessary to further improve reproducibility of this feature as well. One approach could be to grade nuclear polarity according to how much the axis of the nuclei tends to deviate from a line perpendicular to the basement membrane (Fig. 1).

Fig. 1 — The images 1–3 show decreasing nuclear polarity at 40 x magnification. The red line is for comparison with the axis of the nuclei

The introduction of digital pathology introduces a multitude of possibilities for measurement of structures like nuclei, nucleoli and papillae. This can be exploited in grading, in an attempt to achieve standardization. Digital images can be further analysed by computer based algorithms, thereby analysing features not easily measured directly, like polarity, nuclear shape and mitotic Figures. A first attempt, using a local binary pattern (LBP) and local variance (VAR) operators followed by a RUSboost classifier, on a small test set of 42 patients with NMIBC resulted in an accuracy of 70%, a sensitivity of 84% and a specificity of 45% for prediction of recurrences [32]. Although only performed using a small dataset these results show the potential of these methods. Further studies using bigger datasets are necessary to further investigate these new measurements.

The value of the data in this pilot study is limited by the small sample size, not allowing any final conclusions. Although, our data suggest a substantial variety among the different histopathological features when it comes to reproducibility. Also, the prognostic value is disappointing for most of the features. Our data calls for further validation studies to highlight the most reproducible and most prognostic microscopic features making up the current grading system. We hope this article will contribute to developing a new approach.

when it comes to grading of papillary urothelial carcinomas.

Conclusion

WHO grading is based on the use of 13 histopathological features, which in our material vary considerably in reproducibility and prognostic value. Of all the features evaluated in this small study, only nuclear polarity was both reasonably prognostic and reproducible. Further validation studies on the individual histopathological features are needed to improve the assessment of grade of urothelial carcinomas. A new grading system should be based upon more clear-cut definitions and features with true prognostic value.

Acknowledgements

We would like to thank Bianca van Diermen Hidle, Melinda Lillesand, Eliza Peixoto Albernaz and Anne Elin Varhaugvik for technical assistance. We also want to thank the Department of Pathology at the Stavanger University Hospital for the opportunity to work on this project.

Abbreviations

AUC: Area under curve
BCG: Bacillus Calmette Guérin
CI: Confidence interval
EAU: European association of urology
HES: Haematoxylin-eosin-saffron
ISUP: International Society of Urological Pathology
LBP: Local binary pattern
MAI: Mitotic Activity Index
NMIBC: Non-muscle-invasive bladder cancer
PUNLMP: Papillary urothelial neoplasia of low malignant potential
ROC: Receiver operating characteristics
SUH: Stavanger University Hospital
TUR: Transurethral resection
VAR: Local variance
WHO: World Health Organization
WHO04: The World Health Organization grading system from 2004
WHO73: The World Health Organization grading system from 1973

Authors’ contributions

VK updated the data and wrote the article. OMM performed all histopathological evaluation and contributed in writing the article. EG performed histopathological evaluation. ID performed all the statistical analyses. HE provided clinical data. JPAB also performed histopathological evaluations and was involved in designing and supervising the study. Emiel A. M. Janssen designed and supervised the study and contributed to writing the paper. All authors critically evaluated the manuscript. All authors read and approved the final manuscript.

Funding

The authors have no support or funding to report.

Availability of data and materials

The datasets used and analysed during the current study are available from the corresponding author on reasonable request.

Ethics approval and consent to participate

The study was approved by the Norwegian Regional Ethics Committee (#106/09). With approval from REK Vest, informed consent was not obtained as the tissue samples had already been removed for diagnostic and treatment purposes.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Footnotes

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Jan P. A. Baak and Emiel A.M. Janssen are Both senior authors contributed equally.

Contributor Information

Vebjørn Kvikstad, Email: vebjorn.kvikstad@sus.no.

Ok Målfrid Mangrud, Email: ok.malfrid.mangrud@sykehuset-innlandet.no.

Einar Gudlaugsson, Email: einar.gudbjorn.gudlaugsson@sus.no.

Ingvild Dalen, Email: ingvild.dalen@sus.no.

Hans Espeland, Email: hans.espeland@sus.no.

Jan P. A. Baak, Email: jpabaak47@yahoo.com

Emiel A. M. Janssen, Email: emilius.adrianus.maria.janssen@sus.no

References

1.Antoni S, Ferlay J, Soerjomataram I, Znaor A, Jemal A, Bray F. Bladder Cancer incidence and mortality: a global overview and recent trends. Eur Urol. 2017;71(1):96–108. doi: 10.1016/j.eururo.2016.06.010. [DOI] [PubMed] [Google Scholar]
2.Norway Cro . Cancer in Norway 2016 - Cancer incidence, mortality, survival and prevalence in Norway. 2017. [Google Scholar]
3.Pasin E, Josephson DY, Mitra AP, Cote RJ, Stein JP. Superficial bladder cancer: an update on etiology, molecular development, classification, and natural history. Rev Urol. 2008;10(1):31–43. [PMC free article] [PubMed] [Google Scholar]
4.Moch H, Humphrey PA, Ulbright TM, Reuter VE. World Health Organization Classification of tumours. 2016. pp. 77–135. [Google Scholar]
5.Holmang S, Hedelin H, Anderstrom C, Johansson SL. The relationship among multiple recurrences, progression and prognosis of patients with stages ta and T1 transitional cell cancer of the bladder followed for at least 20 years. J Urol. 1995;153(6):1823–1826. [PubMed] [Google Scholar]
6.Larsson P, Wijkstrom H, Thorstenson A, Adolfsson J, Norming U, Wiklund P, et al. A population-based study of 538 patients with newly detected urinary bladder neoplasms followed during 5 years. Scand J Urol Nephrol. 2003;37(3):195–201. doi: 10.1080/00365590310008037. [DOI] [PubMed] [Google Scholar]
7.Sievert KD, Amend B, Nagele U, Schilling D, Bedke J, Horstmann M, et al. Economic aspects of bladder cancer: what are the benefits and costs? World J Urol. 2009;27(3):295–300. doi: 10.1007/s00345-009-0395-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Babjuk M, Bohle A, Burger M, Capoun O, Cohen D, Comperat EM, et al. EAU guidelines on non-muscle-invasive urothelial carcinoma of the bladder: update 2016. Eur Urol. 2017;71(3):447–461. doi: 10.1016/j.eururo.2016.05.041. [DOI] [PubMed] [Google Scholar]
9.Bol MG, Baak JP, Buhr-Wildhagen S, Kruse AJ, Kjellevold KH, Janssen EA, et al. Reproducibility and prognostic variability of grade and lamina propria invasion in stages ta, T1 urothelial carcinoma of the bladder. J Urol. 2003;169(4):1291–1294. doi: 10.1097/01.ju.0000055471.78783.ae. [DOI] [PubMed] [Google Scholar]
10.Mangrud OM, Waalen R, Gudlaugsson E, Dalen I, Tasdemir I, Janssen EA, et al. Reproducibility and prognostic value of WHO1973 and WHO2004 grading systems in TaT1 urothelial carcinoma of the urinary bladder. PLoS One. 2014;9(1):e83192. doi: 10.1371/journal.pone.0083192. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Yorukoglu K, Tuna B, Dikicioglu E, Duzcan E, Isisag A, Sen S, et al. Reproducibility of the 1998 World Health Organization/International Society of Urologic Pathology classification of papillary urothelial neoplasms of the urinary bladder. Virchows Arch. 2003;443(6):734–740. doi: 10.1007/s00428-003-0905-0. [DOI] [PubMed] [Google Scholar]
12.Soukup V, Capoun O, Cohen D, Hernandez V, Babjuk M, Burger M, et al. Prognostic Performance and Reproducibility of the 1973 and 2004/2016 World Health Organization grading classification Systems in non-muscle-invasive Bladder Cancer: a European Association of Urology non-muscle invasive bladder Cancer guidelines panel systematic review. Eur Urol. 2017;72(5):801–813. doi: 10.1016/j.eururo.2017.04.015. [DOI] [PubMed] [Google Scholar]
13.Bol MG, Baak JP, Rep S, Marx WL, Kruse AJ, Bos SD, et al. Prognostic value of proliferative activity and nuclear morphometry for progression in TaT1 urothelial cell carcinomas of the urinary bladder. Urology. 2002;60(6):1124–1130. doi: 10.1016/s0090-4295(02)01906-4. [DOI] [PubMed] [Google Scholar]
14.Liukkonen T, Rajala P, Raitanen M, Rintala E, Kaasinen E, Lipponen P. Prognostic value of MIB-1 score, p53, EGFr, mitotic index and papillary status in primary superficial (stage pTa/T1) bladder cancer: a prospective comparative study. The Finnbladder Group. Eur Urol. 1999;36(5):393–400. doi: 10.1159/000020039. [DOI] [PubMed] [Google Scholar]
15.Cheng L, Lopez-Beltran A, MacLennan GT, Montironi R, Bostwick DG. Neoplasms of the urinary bladder. In: Bostwick DG, Cheng L, editors. Urological surgical pathology. 3. Philadelphia: Elsevier Saunders; 2014. pp. 230–317. [Google Scholar]
16.Reuter VE, Algaba F, Amin MB, Cao D, Cheng L, Comperat E. Non-invasive urothelial lesions. In: Moch H, Humphrey P, Ulbright T, Reuter V, editors. WHO classification of tumours of the urinary system and male genital organs. 4. Lyon: International agency for research on cancer; 2016. pp. 99–108. [Google Scholar]
17.Ordóñez Nelson G., Rosai Juan. Rosai and Ackerman's Surgical Pathology. 2011. Urinary tract; pp. 1101–1286. [Google Scholar]
18.Mangrud OM, Gudlaugsson E, Skaland I, Tasdemir I, Dalen I, van Diermen B, et al. Prognostic comparison of proliferation markers and World Health Organization 1973/2004 grades in urothelial carcinomas of the urinary bladder. Hum Pathol. 2014;45(7):1496–1503. doi: 10.1016/j.humpath.2014.03.001. [DOI] [PubMed] [Google Scholar]
19.Kim JK, Moon KC, Jeong CW, Kwak C, Kim HH, Ku JH. Papillary urothelial neoplasm of low malignant potential (PUNLMP) after initial TUR-BT: comparative analyses with noninvasive low-grade papillary urothelial carcinoma (LGPUC) J Cancer. 2017;8(15):2885–2891. doi: 10.7150/jca.20003. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Gwet KL. Computing inter-rater reliability and its variance in the presence of high agreement. Br J Math Stat Psychol. 2008;61(1):29–48. doi: 10.1348/000711006X126600. [DOI] [PubMed] [Google Scholar]
21.Gwet KL. Handbook of inter-rater reliability: the definitive guide to measuring the extent of agreement among raters: advanced analytics. 2014. [Google Scholar]
22.Fleiss JL. Measuring nominal scale agreement among many raters. Psychol Bull. 1971;76(5):378. [Google Scholar]
23.Quarfoot D, Levine RA. How robust are multirater interrater reliability indices to changes in frequency distribution? Am Stat. 2016;70(4):373–384. [Google Scholar]
24.Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159–174. [PubMed] [Google Scholar]
25.Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez J-C, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12(1):77. doi: 10.1186/1471-2105-12-77. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Skaland I, van Diest PJ, Janssen EA, Gudlaugsson E, Baak JP. Prognostic differences of World Health Organization-assessed mitotic activity index and mitotic impression by quick scanning in invasive ductal breast cancer patients younger than 55 years. Hum Pathol. 2008;39(4):584–590. doi: 10.1016/j.humpath.2007.08.016. [DOI] [PubMed] [Google Scholar]
27.Gleason DF. Classification of prostatic carcinomas. Cancer Chemother Rep. 1966;50(3):125–128. [PubMed] [Google Scholar]
28.Egevad L, Ahmad AS, Algaba F, Berney DM, Boccon-Gibod L, Comperat E, et al. Standardization of Gleason grading among 337 European pathologists. Histopathology. 2013;62(2):247–256. doi: 10.1111/his.12008. [DOI] [PubMed] [Google Scholar]
29.Melia J, Moseley R, Ball RY, Griffiths DF, Grigor K, Harnden P, et al. A UK-based investigation of inter- and intra-observer reproducibility of Gleason grading of prostatic biopsies. Histopathology. 2006;48(6):644–654. doi: 10.1111/j.1365-2559.2006.02393.x. [DOI] [PubMed] [Google Scholar]
30.Chan TY, Partin AW, Walsh PC, Epstein JI. Prognostic significance of Gleason score 3+4 versus Gleason score 4+3 tumor at radical prostatectomy. Urology. 2000;56(5):823–827. doi: 10.1016/s0090-4295(00)00753-6. [DOI] [PubMed] [Google Scholar]
31.Epstein JI, Amin M, Boccon-Gibod L, Egevad L, Humphrey PA, Mikuz G, et al. Prognostic factors and reporting of prostate carcinoma in radical prostatectomy and pelvic lymphadenectomy specimens. Scand J Urol Nephrol Suppl. 2005;216:34–63. doi: 10.1080/03008880510030932. [DOI] [PubMed] [Google Scholar]
32.Urdal J, Engan K, Janssen EA. Prognostic prediction of histopathological images by local binary patterns and RUSBoost. In: Signal Processing Conference (EUSIPCO), 2017 25th European. Kos, Greece: IEEE; 2017.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The datasets used and analysed during the current study are available from the corresponding author on reasonable request.

[CR1] 1.Antoni S, Ferlay J, Soerjomataram I, Znaor A, Jemal A, Bray F. Bladder Cancer incidence and mortality: a global overview and recent trends. Eur Urol. 2017;71(1):96–108. doi: 10.1016/j.eururo.2016.06.010. [DOI] [PubMed] [Google Scholar]

[CR2] 2.Norway Cro . Cancer in Norway 2016 - Cancer incidence, mortality, survival and prevalence in Norway. 2017. [Google Scholar]

[CR3] 3.Pasin E, Josephson DY, Mitra AP, Cote RJ, Stein JP. Superficial bladder cancer: an update on etiology, molecular development, classification, and natural history. Rev Urol. 2008;10(1):31–43. [PMC free article] [PubMed] [Google Scholar]

[CR4] 4.Moch H, Humphrey PA, Ulbright TM, Reuter VE. World Health Organization Classification of tumours. 2016. pp. 77–135. [Google Scholar]

[CR5] 5.Holmang S, Hedelin H, Anderstrom C, Johansson SL. The relationship among multiple recurrences, progression and prognosis of patients with stages ta and T1 transitional cell cancer of the bladder followed for at least 20 years. J Urol. 1995;153(6):1823–1826. [PubMed] [Google Scholar]

[CR6] 6.Larsson P, Wijkstrom H, Thorstenson A, Adolfsson J, Norming U, Wiklund P, et al. A population-based study of 538 patients with newly detected urinary bladder neoplasms followed during 5 years. Scand J Urol Nephrol. 2003;37(3):195–201. doi: 10.1080/00365590310008037. [DOI] [PubMed] [Google Scholar]

[CR7] 7.Sievert KD, Amend B, Nagele U, Schilling D, Bedke J, Horstmann M, et al. Economic aspects of bladder cancer: what are the benefits and costs? World J Urol. 2009;27(3):295–300. doi: 10.1007/s00345-009-0395-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.Babjuk M, Bohle A, Burger M, Capoun O, Cohen D, Comperat EM, et al. EAU guidelines on non-muscle-invasive urothelial carcinoma of the bladder: update 2016. Eur Urol. 2017;71(3):447–461. doi: 10.1016/j.eururo.2016.05.041. [DOI] [PubMed] [Google Scholar]

[CR9] 9.Bol MG, Baak JP, Buhr-Wildhagen S, Kruse AJ, Kjellevold KH, Janssen EA, et al. Reproducibility and prognostic variability of grade and lamina propria invasion in stages ta, T1 urothelial carcinoma of the bladder. J Urol. 2003;169(4):1291–1294. doi: 10.1097/01.ju.0000055471.78783.ae. [DOI] [PubMed] [Google Scholar]

[CR10] 10.Mangrud OM, Waalen R, Gudlaugsson E, Dalen I, Tasdemir I, Janssen EA, et al. Reproducibility and prognostic value of WHO1973 and WHO2004 grading systems in TaT1 urothelial carcinoma of the urinary bladder. PLoS One. 2014;9(1):e83192. doi: 10.1371/journal.pone.0083192. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR11] 11.Yorukoglu K, Tuna B, Dikicioglu E, Duzcan E, Isisag A, Sen S, et al. Reproducibility of the 1998 World Health Organization/International Society of Urologic Pathology classification of papillary urothelial neoplasms of the urinary bladder. Virchows Arch. 2003;443(6):734–740. doi: 10.1007/s00428-003-0905-0. [DOI] [PubMed] [Google Scholar]

[CR12] 12.Soukup V, Capoun O, Cohen D, Hernandez V, Babjuk M, Burger M, et al. Prognostic Performance and Reproducibility of the 1973 and 2004/2016 World Health Organization grading classification Systems in non-muscle-invasive Bladder Cancer: a European Association of Urology non-muscle invasive bladder Cancer guidelines panel systematic review. Eur Urol. 2017;72(5):801–813. doi: 10.1016/j.eururo.2017.04.015. [DOI] [PubMed] [Google Scholar]

[CR13] 13.Bol MG, Baak JP, Rep S, Marx WL, Kruse AJ, Bos SD, et al. Prognostic value of proliferative activity and nuclear morphometry for progression in TaT1 urothelial cell carcinomas of the urinary bladder. Urology. 2002;60(6):1124–1130. doi: 10.1016/s0090-4295(02)01906-4. [DOI] [PubMed] [Google Scholar]

[CR14] 14.Liukkonen T, Rajala P, Raitanen M, Rintala E, Kaasinen E, Lipponen P. Prognostic value of MIB-1 score, p53, EGFr, mitotic index and papillary status in primary superficial (stage pTa/T1) bladder cancer: a prospective comparative study. The Finnbladder Group. Eur Urol. 1999;36(5):393–400. doi: 10.1159/000020039. [DOI] [PubMed] [Google Scholar]

[CR15] 15.Cheng L, Lopez-Beltran A, MacLennan GT, Montironi R, Bostwick DG. Neoplasms of the urinary bladder. In: Bostwick DG, Cheng L, editors. Urological surgical pathology. 3. Philadelphia: Elsevier Saunders; 2014. pp. 230–317. [Google Scholar]

[CR16] 16.Reuter VE, Algaba F, Amin MB, Cao D, Cheng L, Comperat E. Non-invasive urothelial lesions. In: Moch H, Humphrey P, Ulbright T, Reuter V, editors. WHO classification of tumours of the urinary system and male genital organs. 4. Lyon: International agency for research on cancer; 2016. pp. 99–108. [Google Scholar]

[CR17] 17.Ordóñez Nelson G., Rosai Juan. Rosai and Ackerman's Surgical Pathology. 2011. Urinary tract; pp. 1101–1286. [Google Scholar]

[CR18] 18.Mangrud OM, Gudlaugsson E, Skaland I, Tasdemir I, Dalen I, van Diermen B, et al. Prognostic comparison of proliferation markers and World Health Organization 1973/2004 grades in urothelial carcinomas of the urinary bladder. Hum Pathol. 2014;45(7):1496–1503. doi: 10.1016/j.humpath.2014.03.001. [DOI] [PubMed] [Google Scholar]

[CR19] 19.Kim JK, Moon KC, Jeong CW, Kwak C, Kim HH, Ku JH. Papillary urothelial neoplasm of low malignant potential (PUNLMP) after initial TUR-BT: comparative analyses with noninvasive low-grade papillary urothelial carcinoma (LGPUC) J Cancer. 2017;8(15):2885–2891. doi: 10.7150/jca.20003. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Gwet KL. Computing inter-rater reliability and its variance in the presence of high agreement. Br J Math Stat Psychol. 2008;61(1):29–48. doi: 10.1348/000711006X126600. [DOI] [PubMed] [Google Scholar]

[CR21] 21.Gwet KL. Handbook of inter-rater reliability: the definitive guide to measuring the extent of agreement among raters: advanced analytics. 2014. [Google Scholar]

[CR22] 22.Fleiss JL. Measuring nominal scale agreement among many raters. Psychol Bull. 1971;76(5):378. [Google Scholar]

[CR23] 23.Quarfoot D, Levine RA. How robust are multirater interrater reliability indices to changes in frequency distribution? Am Stat. 2016;70(4):373–384. [Google Scholar]

[CR24] 24.Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159–174. [PubMed] [Google Scholar]

[CR25] 25.Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez J-C, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12(1):77. doi: 10.1186/1471-2105-12-77. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Skaland I, van Diest PJ, Janssen EA, Gudlaugsson E, Baak JP. Prognostic differences of World Health Organization-assessed mitotic activity index and mitotic impression by quick scanning in invasive ductal breast cancer patients younger than 55 years. Hum Pathol. 2008;39(4):584–590. doi: 10.1016/j.humpath.2007.08.016. [DOI] [PubMed] [Google Scholar]

[CR27] 27.Gleason DF. Classification of prostatic carcinomas. Cancer Chemother Rep. 1966;50(3):125–128. [PubMed] [Google Scholar]

[CR28] 28.Egevad L, Ahmad AS, Algaba F, Berney DM, Boccon-Gibod L, Comperat E, et al. Standardization of Gleason grading among 337 European pathologists. Histopathology. 2013;62(2):247–256. doi: 10.1111/his.12008. [DOI] [PubMed] [Google Scholar]

[CR29] 29.Melia J, Moseley R, Ball RY, Griffiths DF, Grigor K, Harnden P, et al. A UK-based investigation of inter- and intra-observer reproducibility of Gleason grading of prostatic biopsies. Histopathology. 2006;48(6):644–654. doi: 10.1111/j.1365-2559.2006.02393.x. [DOI] [PubMed] [Google Scholar]

[CR30] 30.Chan TY, Partin AW, Walsh PC, Epstein JI. Prognostic significance of Gleason score 3+4 versus Gleason score 4+3 tumor at radical prostatectomy. Urology. 2000;56(5):823–827. doi: 10.1016/s0090-4295(00)00753-6. [DOI] [PubMed] [Google Scholar]

[CR31] 31.Epstein JI, Amin M, Boccon-Gibod L, Egevad L, Humphrey PA, Mikuz G, et al. Prognostic factors and reporting of prostate carcinoma in radical prostatectomy and pelvic lymphadenectomy specimens. Scand J Urol Nephrol Suppl. 2005;216:34–63. doi: 10.1080/03008880510030932. [DOI] [PubMed] [Google Scholar]

[CR32] 32.Urdal J, Engan K, Janssen EA. Prognostic prediction of histopathological images by local binary patterns and RUSBoost. In: Signal Processing Conference (EUSIPCO), 2017 25th European. Kos, Greece: IEEE; 2017.

PERMALINK

Prognostic value and reproducibility of different microscopic characteristics in the WHO grading systems for pTa and pT1 urinary bladder urothelial carcinomas

Vebjørn Kvikstad

Ok Målfrid Mangrud

Einar Gudlaugsson

Ingvild Dalen

Hans Espeland

Jan P A Baak

Emiel A M Janssen

Abstract

Background

Methods

Results

Conclusions

Background

Methods

Table 1.

Statistics

Results

Table 2.

Discussion

Fig. 1.

Conclusion

Acknowledgements

Abbreviations

Authors’ contributions

Funding

Availability of data and materials

Ethics approval and consent to participate

Consent for publication

Competing interests

Footnotes

Contributor Information

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases