Skip to main content
Canadian Urological Association Journal logoLink to Canadian Urological Association Journal
. 2018 Sep 27;13(4):115–119. doi: 10.5489/cuaj.5196

Classification tree for the prediction of malignant disease and the prediction of non-diagnostic biopsies in patients with small renal masses

Michael Organ 1, Landan P MacDonald 2,, Michael AS Jewett 3, Henry Ajzenberg 4, Ashraf Almatar 5, Mohamed Abdolell 6, Matthew R Acker 7, Ricardo Rendon 8
PMCID: PMC6456341  PMID: 30059288

Abstract

Introduction

Preoperative prediction of benign vs. malignant small renal masses (SRMs) remains a challenge. This study: 1) validates our previously published classification tree (CT) with an external cohort; 2) creates a new CT with the combined cohort; and 3) evaluates the RENAL and PADUA scoring systems for prediction of malignancy.

Methods

This study includes a total of 818 patients with renal masses; 395 underwent surgical resection and 423 underwent biopsy. A CT to predict benign disease was developed using patient and tumour characteristics from the 709 eligible participants. Our CT is based on four parameters: tumour volume, symptoms, gender, and symptomatology. CART modelling was also used to determine if RENAL and PADUA scoring could predict malignancy.

Results

When externally validated with the surgical cohort, the predictive accuracy of the old CT dropped. However, by combining the cohorts and creating a new CT, the predictive accuracy increased from 74% to 87% (95% confidence interval 0.84–0.89). RENAL and PADUA score alone were not predictive of malignancy. One limitation was the lack of available histological data from the biopsy series.

Conclusions

The validated old CT and new combined-cohort CT have a predictive value greater than currently published nomograms and single-biopsy cohorts. Overall, RENAL and PADUA scores were not able to predict malignancy.

Introduction

Renal cell carcinoma (RCC) is the third most common urological malignancy.1 Its incidence has been increasing due to increased detection of small renal masses (SRMs, defined as <4 cm).2 The majority of SRMs are asymptomatic at diagnosis and have a non-aggressive behavior;3,4 however some (up to 6.0%) can present with metastasis.5,6 Curative therapy, including partial and radical nephrectomy, are associated with considerable morbidity.79

SRMs are found to be histologically benign in 20–46% of cases.1013 This prompted investigation into methods to predict benign vs. malignant disease. We have previously developed a classification tree (CT) using patient and tumour characteristics (tumour volume, location, and symptoms) to predict benign vs. malignant disease, with an overall accuracy of 89%14 (Fig. 1). Two nomograms have been published that can be used as tools to predict surgical histology.15,16 These are complex and have not been widely adopted into clinical practice. It has been suggested that the RENAL scoring, which was designed to predict complications after partial nephrectomy,17,18 can predict benign from malignant disease, with more complex lesions having a greater malignant potential.16,19,20

Fig. 1.

Fig. 1

Original Rendon et al classification tree predicting benign vs. malignant disease.

The aims of this study were to externally validate our previously published CT14 using patient and tumour characteristics, and create a new CT with a combined cohort from two different institutions to predict histology using tumour size, location, symptoms, and patient gender. Furthermore, the RENAL and PADUA scoring systems were evaluated to determine if they could predict malignancy or non-diagnostic biopsies.

Methods

Local institutional review boards approved this study, which includes 818 renal masses. A total of 423 patients were treated at the Princess Margaret Cancer Centre (PM), Toronto, Canada, and underwent a renal mass biopsy; 395 consecutive patients from the Queen Elizabeth II Health Sciences Centre (QEII) in Halifax, Nova Scotia, Canada, underwent surgical treatment for renal mass.

The PM cohort was obtained from a prospectively maintained database of 423 patients with SRMs undergoing treatment or surveillance. All patients undergoing percutaneous SRM biopsy between January 2000 and December 2009 were eligible for inclusion in this study. Biopsies were performed for planning management (surveillance or intervention), at the time of thermal ablation, or in the post-ablation period for suspicion of recurrence.

The QEII cohort included 395 patients who had open or laparoscopic partial or radical nephrectomy for renal masses ≤5 cm between July 1, 2001 and June 30, 2010. Patients were identified from an institutionally maintained prospective database of patients with renal masses and from physician records. All patients were >18 years of age and had a renal mass with imaging characteristics consistent with RCC. Patients with renal angiomyolipoma were excluded.

We used the combined PM+QEII cohort to determine the validity of our previously published CT.14 We then used binary recursive partitioning analysis (RPA) to create the new, combined CT model for the prediction of tumour histology (benign vs. malignant), and to evaluate RENAL and PADUA scoring for the same. Potential patient prognostic factors used to develop the new CT included age, sex, and symptoms at diagnosis. Each renal mass was classified broadly as either benign or malignant. Pre-treatment images were reviewed for each renal mass. Potential radiographical predictive factors included tumour volume (three dimensions: V = 4/3 π · xyz/8; two dimensions V = 4/3 π · xy/8 · (x + y)/2; and one dimension: V = 4/3 π · ×3/8); tumour location (central or peripheral); degree of endophytic component (1–100%); and tumour axis location. Tumour location was defined as central if the tumour was in direct contact with or invading the collecting system and/or renal sinus. All other renal masses were defined as peripheral. Degree of endophytic component was recorded as the percentage of the tumour that was within the normal contour of the kidney. Tumour axis location was designated according to three renal axes: 1) upper pole, interpolar, or lower pole; 2) medial or lateral; and 3) anterior or posterior.

Binary RPA implemented via classification and regression tree (CART) methodology is a semi-parametric modelling algorithm that generates simply understood binary decision tree models that stratify cases into various risk categories according to several prognostic factors. From the perspective of knowledge translation, a CT is appealing as a clinical decision tool, as it is simply represented and implemented as a set of binary decision rules. The CT models predict malignant or benign outcome as a function of a set of explanatory variables. The CT models were developed automatically using the rpart package in the R language for statistical computing.21 Sensitivity, specificity, accuracy, and positive and negative predictive values were computed for the CT models.

For the new CT, a random forest was used. Random forest is an ensemble method that constructs a collection of CTs with strategically injected variation by combining the concepts of bagging and the random selection of features. Random forests are effective at generating internal unbiased estimates of the generalization error. While random forests are difficult to interpret because they combine many models together, they typically outperform any single model generated by CART analysis. The random forest model was developed using the randomForest package in the R language for statistical computing.22 Sensitivity, specificity, accuracy, and positive and negative predictive values were obtained from the confusion matrix generated by the random forest model.

Results

The total combined cohort sample was 818 patients. Renal biopsy was initially performed in 423 PM cohort patients, 278 (66%) of which were male; 288 (68%) masses were detected incidentally. The median tumour volume was 8.7 cm3, while the median endophytic component was 50%. Only 357 (84% of the total 423 PM cohort masses) who underwent a diagnostic biopsy were included in the cohort and 281 (79%) of these revealed malignancy, the majority of which were clear-cell RCC. Forty-three participants with unknown symptomatology from both PM and QEII cohorts were excluded from the combined cohort, further reducing the participants to a final number of 709 (see Table 1 for further demographic data). RENAL and PADUA scores were also calculated for all the masses in the biopsy series.

Table 1.

Patient, tumour characteristics and histopathological findings

All PM patients Diagnostic PM + QEII patients
Patients 423 752
 Diagnostic biopsies 357
 Non-diagnostic, n (%) 66 (16) 0
 Male, n (%) 278 (66) 447 (59)
 Age at surgery or biopsy, median (range) 65 (32–91) 63 (24–91)
 Incidental, n (%) 288 (68) 563 (75)
 Symptomatic, n (%) 84 (20) 146 (19)
 Symptoms not available, n (%) 51 (12) 43 (6)
Tumour characteristics
 Imaging tumour volume in cm3, median (range) 8.7 (0–2252) 13 (0.8–2252)
 Endophytic %, median (range) 50 (5–100) 50 (1–100)
 Tumour histology
 Malignant, n (%) 281 (79) 631 (84)
  Clear-cell RCC, n (%) 167 (47) 431 (57)
  Papillary RCC, n (%) 45 (13) 113 (15)
  Chromophobe RCC, n (%) 14 (4) 31 (4)
  Malignant other (%) 55 (15) 56 (7)
 Benign, n (%) 76 (21) 121 (16)
  Oncocytoma, n (%) 47 (13) 67 (9)
  Angiomyolipoma, n (%) 16 (5) 27 (4)
  Benign cystic, n (%) 1 (0) 7 (1)
  Metanephric adenoma, n (%) 1 (0) 6 (1)
  Leiomyoma, n (%) 0 1 (0)
  Cystic nephroma, n (%) 0 1(0)
  Benign other, n (%) 11 (3) 12 (2)
 TMN stage
  T1a 355 527
  T1b 58 141
  T2a 17 13
  T2b 13 13
  T3a 0 13

PM: Princess Margaret; QE: Queen Elizabeth; RCC: renal cell carcinoma.

As previously reported, the old CT demonstrated an accuracy of 89%.14 When externally validated with the biopsy cohort (PM), the accuracy decreased to 74% (95% confidence interval [CI] 0.69–0.78). The QEII and PM patients were then combined and the new CT was developed, which demonstrated an accuracy of 87% (95% CI 0.84–0.89). As this new CT was not externally validated with a separate cohort, a random forest technique was used to create a more robust CT (Fig. 2).

Fig. 2.

Fig. 2

The new, combined Organ et al classification tree using Queen Elizabeth II + Princess Margaret patients. For each branch the absolute number of benign (B) and malignant (M) participants is provided, along with the percentage of benign participants.

Using the CART methodology, the RENAL nephrometry score alone was not able to predict whether the SRM was benign or malignant. When looking individually at the components of the RENAL score, “nearness to the collecting system” was associated with malignancy, with those within 7 mm of the collecting system (N scores 2 and 3) having a higher rate of malignancy (30%) than those further away (15%). Alone, size of the tumour was not predictive of malignancy, but when used with “nearness to the collecting system” in the form of a CT, larger masses (R scores 2 and 3) were more likely to be malignant. Neither the overall PADUA score nor the individual components were predictive of malignancy (Fig. 3).

Fig. 3.

Fig. 3

Classification tree showing predictive value of the N-component of RENAL (nearness to the collecting system) in predicting benign vs. malignant disease. For each branch the absolute number of benign (B) and malignant (M) participants is provided, along with the percentage of benign participants.

Discussion

Currently SRMs are being over-treated, as not all are malignant. Preoperative prediction of malignancy in SRMs is a significant challenge, as there is currently no validated, noninvasive way to predict benign vs. malignant renal disease.

Few groups have developed predictive tools to determine the risk of benign vs. malignant disease for SRMs.15,16 Lane et al used age, gender, radiological size at diagnosis, symptoms at presentation, and smoking history to develop a nomogram.15 Their nomogram predicted benign SRM with an area under the curve (AUC) of 64.4%. Kutikov et al developed nomograms using the RENAL nephrometry scoring system with an AUC of 76% in predicting malignancy and 74% in predicting low- vs. high-grade.16 Although both of the aforementioned studies are nomograms predicting benign masses, when comparing accuracy, our CT performs better. The RENAL nomogram predicting grade has been externally validated23 and has been shown to be unable to predict malignancy.24,25 This group demonstrated the only significant individual variables of the RENAL score were tumour size and proximity to the hilum. Despite this finding, all aspects of the RENAL score were included in the nomogram, making it less predictive and complex and difficult to use clinically. Neither of these two nomograms has been adopted clinically to predict benign vs. malignant disease. Mullins et al used RENAL nephrometry scoring to predict malignancy by subdividing the score in low- (4–6), intermediate- (7–9), and high-risk (10–12), showing that in intermediate and highly complex masses, there was an increased risk of malignancy.20

The old CT had been previously created and reported using patient and tumour characteristics to predict benign vs. malignant disease to help clinicians better treat SRMs.14 The current study validates this using an external biopsy cohort. The decrease in overall predictive accuracy is an expected finding when the old CT was externally validated with a different mix of patients. The new CT, created using both cohorts with a random forest algorithm, had an accuracy of 87%. It is likely that each group of renal masses is quite different, as the QEII is a surgical group while the PM group comes from a biopsy cohort (Table 2).

Table 2.

Descriptive statistics of original and combined classification tree before and after validation

Cohort External validation
 QEII
  Accuracy (95% CI) 0.89 (0.85, 0.92) 74 (0.69, 0.79)
  Sensitivity 0.97 0.93
  Specificity 0.22 0.05
  Positive predictive value 0.91 0.78
  Negative predictive value 0.53 0.17
 Combined QEII+PM
  Accuracy (95% CI) 0.87 (0.84, 0.89)
  Sensitivity 0.94
  Specificity 0.46
  Positive predictive value 0.90
  Negative predictive value 0.61

CI: confidence interval; PM: Princess Margaret; QE: Queen Elizabeth.

We have demonstrated that this tool is better than the currently published nomograms and non-invasive tools that try and predict benign vs. malignant disease. The advantage of a CT is its ease of use, as it mimics the clinicians’ thought process and is easily memorized, making it more likely to be used clinically. Additionally, the diagnostic accuracy of these CTs is as high as that reported in most renal mass biopsy series.

When comparing this tool to a “biopsy all” approach, we feel it may provide a personalized tactic to each individual patient. Large centres have high accuracy rates after biopsy (upwards of 80%), but it is likely that lower-volume centres do not share these success rates. Also, a CT does not incur any morbidity to the patient or costs to the system.

Our new CT implies that small, symptomatic tumours are more likely to be benign. DeRoche et al found that symptomatology was not associated with SRM malignancy;26 however, the consensus shows the opposite.27 It is possible this counterintuitive result is explained by statistical error. The small number of participants included in this category presented with unrelated symptoms and were incidentally found to have an SRM <2.7 cm3 (≈ 1.7 cm diameter, assuming a sphere) and were included in this category.

We evaluated the relationship of RENAL score and malignancy in the PM cohort. The biopsy cohort did not demonstrate a relationship with overall RENAL score and risk of malignancy. There was, however, the novel finding that nearness to the collecting system was positively correlated with malignancy.

We did not find a relationship with the overall PADUA score and malignancy. Each component was also independently assessed for a relationship with the prediction of benign or malignant masses and no association was found. These data are not surprising, as RENAL and PADUA were created to predict complications after partial nephrectomy and not malignancy. Other studies have shown that the RENAL nomogram is not effective at predicting malignant potential in SRMs.24,25

There are limitations to this study. The technique of CART modelling can leave out predictive factors. In each level of the CT, the statistical model chooses the greatest predictor, potentially leaving out useful predictors of malignancy. The evaluation of radiographical films to determine size and location of the masses is subjective and it has been demonstrated that tumour characteristics, including size, do not have a perfect correlation, but this represents actual clinical practice. The two cohorts are different, one being a surgical series while the other was a biopsy series, which may lead to lower accuracy rates when validated. The QEII cohort included tumours ≤5 cm, i.e., tumours were included that do not meet the criteria of SRM for the creation of the new CT. The majority of tumours were ≤4 cm, however, there is still additional statistical error because of this.

Our new CT is not validated. The random forest technique helped the CT become more robust, but it is no substitute for external validation. We recommend that our new CT be externally validated, especially considering the counterintuitive result of symptomatic SRMs with volume <2.7cm3 being more likely to be benign.

Conclusion

In patients with SRMs, benign vs. malignant disease can be predicted using our new CT based on patient and tumour characteristics, with an accuracy that is greater than currently available nomograms and most biopsy series. RENAL and PADUA scores were not associated with the prediction of malignancy.

We propose a practical approach to evaluate SRMs using a prognostic tool, such as our CT, to determine which patients should undergo renal mass biopsy, treatment, or active surveillance.

Footnotes

Competing interests: Dr. Jewett has participated in advisory board meetings for Pfizer and Theralase Technologies; has been a consultant for Olympus, Pfizer, and Theralase Technologies; and holds investments in Theralase Technologies. Dr. Acker has received speaker honoraria from Abbvie, Astellas, and Sanofi. Dr. Rendon has participated in advisory board meetings, been a speaker for, and received honoraria from Abbvie, Amgen, Astellas, AstraZeneca, Bayer, Ferring, Jansen, and Sanofi. The remaining authors report no competing personal or financial interest related to this work.

This paper has been peer-reviewed.

References

  • 1.Siegel RL, Miller KD, Jemal A. Cancer Statistics, 2017. CA Cancer J Clin. 2017;67:7–30. doi: 10.3322/caac.21387. [DOI] [PubMed] [Google Scholar]
  • 2.Kane CJ, Mallin K, Ritchey J, et al. Renal cell cancer stage migration: Analysis of the National Cancer Data Base. Cancer. 2008;113:78–83. doi: 10.1002/cncr.23518. [DOI] [PubMed] [Google Scholar]
  • 3.Hollingsworth JM, Miller DC, Daignault S, et al. Rising incidence of small renal masses: A need to reassess treatment effect. J Natl Cancer Inst. 2006;98:1331–4. doi: 10.1093/jnci/djj362. [DOI] [PubMed] [Google Scholar]
  • 4.Campbell SC, Novick AC, Belldegrun A, et al. Guideline for management of the clinical T1 renal mass. J Urol. 2009;182:1271–9. doi: 10.1016/j.juro.2009.07.004. [DOI] [PubMed] [Google Scholar]
  • 5.Lee H, Lee JK, Kim K, et al. Risk of metastasis for T1a renal cell carcinoma. World J Urol. 2016;34:553–9. doi: 10.1007/s00345-015-1659-4. [DOI] [PubMed] [Google Scholar]
  • 6.Daugherty M, Sedaghatpour D, Shapiro O, et al. The metastatic potential of renal tumours: Influence of histologic subtypes on definition of small renal masses, risk stratification, and future active surveillance protocols. Urol Oncol. 2017;35:153.e15–e20. doi: 10.1016/j.urolonc.2016.11.009. [DOI] [PubMed] [Google Scholar]
  • 7.Huang WC, Levey AS, Serio AM, et al. Chronic kidney disease after nephrectomy in patients with renal cortical tumours: A retrospective cohort study. Lancet Oncol. 2006;7:735–40. doi: 10.1016/S1470-2045(06)70803-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Thompson RH, Boorjian SA, Lohse CM, et al. Radical nephrectomy for pT1a renal masses may be associated with decreased overall survival compared with partial nephrectomy. J Urol. 2008;179:468–71. doi: 10.1016/j.juro.2007.09.077. discussion 472–73. [DOI] [PubMed] [Google Scholar]
  • 9.Berg WT, Tomaszewski JJ, Yang H, et al. Complications of renal surgery. Urol Clin North Am. 2017;44:275–88. doi: 10.1016/j.ucl.2016.12.013. [DOI] [PubMed] [Google Scholar]
  • 10.Leveridge MJ, Finelli A, Kachura JR, et al. Outcomes of small renal mass needle core biopsy, non-diagnostic percutaneous biopsy, and the role of repeat biopsy. Eur Urol. 2011;60:578–84. doi: 10.1016/j.eururo.2011.06.021. [DOI] [PubMed] [Google Scholar]
  • 11.Frank I, Blute ML, Cheville JC, et al. Solid renal tumours: An analysis of pathological features related to tumour size. J Urol. 2003;170:2217–20. doi: 10.1097/01.ju.0000095475.12515.5e. [DOI] [PubMed] [Google Scholar]
  • 12.Corcoran AT, Russo P, Lowrance WT, et al. A review of contemporary data on surgically resected renal masses — benign or malignant? Urology. 2013;81:707–13. doi: 10.1016/j.urology.2013.01.009. [DOI] [PubMed] [Google Scholar]
  • 13.Chopra S, Liu J, Alemozaffar M, et al. Improving needle biopsy accuracy in small renal mass using tumour-specific DNA methylation markers. Oncotarget. 2017;8:5439–48. doi: 10.18632/oncotarget.12276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Rendon RA, Mason RJ, Kirkland S, et al. A classification tree for the prediction of benign vs. malignant disease in patients with small renal masses. Can J Urol. 2014;21:7379–84. [PubMed] [Google Scholar]
  • 15.Lane BR, Babineau D, Kattan MW, et al. A preoperative prognostic nomogram for solid enhancing renal tumours 7 cm or less amenable to partial nephrectomy. J Urol. 2007;178:429–34. doi: 10.1016/j.juro.2007.03.106. [DOI] [PubMed] [Google Scholar]
  • 16.Kutikov A, Smaldone MC, Egleston BL, et al. Anatomic features of enhancing renal masses predict malignant and high-grade pathology: A preoperative nomogram using the RENAL nephrometry score. Eur Urol. 2011;60:241–8. doi: 10.1016/j.eururo.2011.03.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Ficarra V, Novara G, Secco S, et al. Preoperative aspects and dimensions used for an anatomical (PADUA) classification of renal tumours in patients who are candidates for nephron-sparing surgery. Eur Urol. 2009;56:786–93. doi: 10.1016/j.eururo.2009.07.040. [DOI] [PubMed] [Google Scholar]
  • 18.Kutikov A, Uzzo RG. The RENAL nephrometry score: A comprehensive standardized system for quantitating renal tumour size, location, and depth. J Urol. 2009;182:844–53. doi: 10.1016/j.juro.2009.05.035. [DOI] [PubMed] [Google Scholar]
  • 19.Satasivam P, Sengupta S, Rajarubendra N, et al. Renal lesions with low RENAL nephrometry score are associated with more indolent renal cell carcinomas (RCCs) or benign histology: Findings in an Australian cohort. BJU Int. 2012;109( Suppl):44–7. doi: 10.1111/j.1464-410X.2012.11046.x. [DOI] [PubMed] [Google Scholar]
  • 20.Mullins JK, Kaouk JH, Bhayani S, et al. Tumour complexity predicts malignant disease for small renal masses. J Urol. 2012;188:20726. doi: 10.1016/j.juro.2012.08.027. [DOI] [PubMed] [Google Scholar]
  • 21.Abdolell M, LeBlanc M, Stephens D, et al. Binary partitioning for continuous longitudinal data: Categorizing a prognostic variable. Stat Med. 2002;21:3395–3409. doi: 10.1002/sim.1266. [DOI] [PubMed] [Google Scholar]
  • 22.Liaw A, Wiener M. Classification and regression by random Forest. R News. 2002;2:18–22. Available at: http://cran.r-project.org/doc/Rnews/ [Google Scholar]
  • 23.Wang H-K, Zhu Y, Yao X-D, et al. External validation of a nomogram using RENAL nephrometry score to predict high grade renal cell carcinoma. J Urol. 2012;187:1555–60. doi: 10.1016/j.juro.2011.12.099. [DOI] [PubMed] [Google Scholar]
  • 24.Koo KC, Yoo H, Shin TY, et al. External validation of the RENAL nephrometry score nomogram for predicting high-grade renal cell carcinoma in solid, enhancing, and small renal masses. World J Urol. 2014;32:249–55. doi: 10.1007/s00345-013-1159-3. [DOI] [PubMed] [Google Scholar]
  • 25.Antonelli A, Furlan M, Sandri M, et al. The RENAL nephrometric nomogram cannot accurately predict malignancy or aggressiveness of small renal masses amenable to partial nephrectomy. Clin Genitourin Cancer. 2014;12:366–72. doi: 10.1016/j.clgc.2014.02.003. [DOI] [PubMed] [Google Scholar]
  • 26.DeRoche T, Walker E, Magi-Galluzzi C, et al. Pathologic characteristics of solitary small renal masses: Can they be predicted by preoperative clinical parameters? Am J Clin Pathol. 2008;130:560–4. doi: 10.1309/YR7P42XUVQHPHDWL. [DOI] [PubMed] [Google Scholar]
  • 27.Crestani A, Rossanese M, Calandriello M, et al. Introduction to small renal tumours and prognostic indicators. Int J Surg. 2016;36:495–503. doi: 10.1016/j.ijsu.2016.03.038. [DOI] [PubMed] [Google Scholar]

Articles from Canadian Urological Association Journal are provided here courtesy of Canadian Urological Association

RESOURCES