The impact of stage-related features in melanoma recurrence prediction: A machine learning approach

Guihong Wan; Bonnie Leung; Nga Nguyen; Mia S DeSimone; Feng Liu; Min Seok Choi; Diane Ho; Valerie Laucks; Stacey Duey; Ryan J Sullivan; Genevieve M Boland; Nicole R LeBoeuf; David Liu; Alexander Gusev; Shawn G Kwatra; Peter K Sorger; Kun-Hsing Yu; Yevgeniy R Semenov

doi:10.1016/j.jdin.2022.08.014

. 2022 Aug 30;10:28–30. doi: 10.1016/j.jdin.2022.08.014

The impact of stage-related features in melanoma recurrence prediction: A machine learning approach

Guihong Wan ^a,^b,^c, Bonnie Leung ^a, Nga Nguyen ^a, Mia S DeSimone ^d, Feng Liu ^e, Min Seok Choi ^a, Diane Ho ^a, Valerie Laucks ^a, Stacey Duey ^a, Ryan J Sullivan ^f, Genevieve M Boland ^g, Nicole R LeBoeuf ^h, David Liu ⁱ, Alexander Gusev ⁱ, Shawn G Kwatra ^j, Peter K Sorger ^c, Kun-Hsing Yu ^b,^e, Yevgeniy R Semenov ^a,^c,^∗

PMCID: PMC9678753 PMID: 36425793

Graphical abstract

To the Editor: The American Joint Committee on Cancer 8th Edition (AJCC8) stages localized melanomas by tumor thickness and ulceration.¹ However, the specific role of tumor thickness and ulceration in early-stage melanoma recurrence prediction using machine learning remains understudied. We leveraged a multi-institutional cohort of early-stage melanomas to evaluate the impact of stage-related features (thickness, ulceration, anatomic level, and clinical stage) and other clinicopathologic features in recurrence prediction.

We identified a retrospective cohort of 1166 (229 recurrences vs 937 nonrecurrences) stage I/II primary melanomas diagnosed between 2000 and 2020 at Mass General Brigham (original cohort). We extracted 11 clinicopathologic features: sex, race, age at diagnosis, AJCC8 clinical stage, tumor thickness, ulceration, anatomic level, histologic type, tumor anatomic location, mitotic rate, and total surgical margins. The following histologic subtypes were included: superficial spreading, lentigo maligna, nodular, and malignant melanoma, not otherwise specified. Based on clinical guidelines,² melanomas where sentinel lymph node biopsy was indicated but not performed were excluded from this study. Melanomas with any unknown stage-related features were also excluded. A manual chart review by 2 independent reviewers was conducted to ascertain the first recurrence. A minimum of 5-year follow-up was used to ensure sufficient time to observe a recurrence. Melanomas that were stage IV at the time of recurrence, based on AJCC8, were labeled as “distant,” and other recurrent melanomas were labeled as “regional.” We also conducted analyses where nonrecurrences were randomly selected to match the number of recurrences in each machine learning experiment (matched cohort).

We experimented with 2 classification tasks: (1) recurrence versus nonrecurrence, and (2) distant versus regional recurrence by applying 3 well-known machine learning algorithms: random forest, gradient boosting, and logistic regression.³ Model parameters were optimized by cross-validated grid search and evaluated by 50 repeated 5-fold cross-validations using 3 criteria: balanced accuracy (BACC),³ area under the receiver operating characteristic curve (AUC), and positive predictive value (PPV).

In the first task, the achieved AUC (0.805) was consistent with previous studies,⁴ but the BACC (0.657) and PPV (0.662) were limited (Table I). The BACC and PPV on the matched cohort were better (BACC: 0.733; PPV: 0.737; P < .001) than those on the original cohort. In the second task, significant performance deterioration was observed in both cohorts (P < .001). We ranked features by conducting permutation importance³ with 50 repeats and AUC for scoring (Fig 1). Mitotic rate, which is no longer a criterion used in the AJCC8, appeared more important than ulceration in all 3 models.

Table I.

Best performance of machine learning models in melanoma recurrence prediction with the original cohort and the matched cohort

	Original cohort			Matched cohort
	BACC	AUC	PPV	BACC	AUC	PPV
	Recurrence versus nonrecurrence (229 vs 937)			Recurrence versus nonrecurrence (229 vs 229)
Stage-related features^∗ (mean and 95% CI)	0.657 0.653-0.661	0.805 0.801-0.809	0.662 0.652-0.672	0.733 0.728-0.739	0.804 0.798-0.810	0.737 0.731-0.744
All extracted features (mean and 95% CI)	0.687 0.685-0.688	0.831 0.826-0.836	0.694 0.684-0.703	0.756 0.749-0.763	0.820 0.814-0.825	0.765 0.758-0.772
P value^†	<.001	<.001	<.001	.024	<.001	<.001

	Distant versus regional (117 vs 112)			Distant versus regional (112 vs 112)
Stage-related features (mean and 95% CI)	0.539 0.531-0.54	0.540 0.531-0.548	0.528 0.519-0.536	0.525 0.518-0.533	0.540 0.531-0.549	0.516 0.509-0.524
All extracted features (mean and 95% CI)	0.585 0.578-0.593	0.612 0.604-0.619	0.585 0.577-0.592	0.590 0.583-0.597	0.617 0.610-0.625	0.589 0.581-0.597
P value^†	<.001	<.001	<.001	<.001	<.001	<.001

Open in a new tab

AUC, Area under the receiver operating characteristic curve; BACC, balanced accuracy; CI, confidence interval; PPV, positive predictive value.

^∗

Stage-related features: thickness, ulceration, anatomic level, and clinical stage.

^†

P value: t test for comparing the results when only using the stage-related features and when using all extracted features.

Fig 1 — The ranked average feature importance in the recurrence versus nonrecurrence prediction by the 3 machine learning models. The experiments were conducted on the original cohort. Categorical features were converted by one-hot encoding. Features with zero importance were ignored. All extracted features were presented for the random forest model. *AUC*, Area under the receiver operating characteristic curve; CI, confidence interval.

In summary, we collected clinicopathologic features currently used to inform clinical decisions in surveillance and therapeutic planning for early-stage melanomas and compared their roles in predicting melanoma recurrence. Despite near-universal dependence of clinical management on the stage-related features, our results demonstrate that the prediction performance of these features has limitations. Although stage-related features play an important role in recurrence risk stratification, entirely relying on these features for therapeutic planning will lead to many missed recurrent cases and delayed treatments. Further studies with potentially predictive data, such as genomics and digital histopathology,⁵ are needed to improve recurrence risk stratification of early-stage melanomas.

Conflicts of interest

Y.R.S. is an advisory board member/consultant and has received honoraria from Incyte Corporation, Castle Biosciences, Galderma, and Sanofi outside of the submitted work.

Footnotes

Drs Wan and Leung are co-first authors.

Drs Yu and Semenov are co-senior authors.

Funding sources: K-H.Y. is supported in part by R35GM142879 from the National Institute of General Medical Sciences, NIH. Y.R.S. is supported in part by the Department of Defense under award number W81XWH2110819 and by the Dermatology Foundation under the Medical Dermatology Career Development Award. The other authors received no funding for this research.

IRB approval status: Reviewed and approved by Mass General Brigham Institutional Review Boards (Protocol # 2020P002179).

Key words: early-stage melanoma; machine learning; mitotic rate; recurrence prediction; stage; tumor thickness; ulceration.

References

1.Gershenwald J.E., Scolyer R.A. Melanoma staging: American Joint Committee on Cancer (AJCC) and beyond. Ann Surg Oncol. 2018;25(8):2105–2110. doi: 10.1245/s10434-018-6513-7. [DOI] [PubMed] [Google Scholar]
2.Wong S.L., Faries M.B., Kennedy E.B., et al. Sentinel lymph node biopsy and management of regional lymph nodes in melanoma: American Society of Clinical Oncology and Society of Surgical Oncology clinical practice guideline update. Ann Surg Oncol. 2018;25(2):356–377. doi: 10.1245/s10434-017-6267-7. [DOI] [PubMed] [Google Scholar]
3.Pedregosa F., Varoquaux G., Gramfort A., et al. Scikit-learn: machine learning in Python. J machine Learn Res. 2011;12:2825–2830. [Google Scholar]
4.El Sharouni M.A., Ahmed T., Varey A.H., et al. Development and validation of nomograms to predict local, regional, and distant recurrence in patients with thin (T1) melanomas. J Clin Oncol. 2021;39(11):1243–1252. doi: 10.1200/JCO.20.02446. [DOI] [PubMed] [Google Scholar]
5.Wan G., DeSimone M., Liu F., et al. 649 CNN-based histopathology image analysis for early-stage melanoma recurrence. J Invest Dermatol. 2022;142(8):S112. [Google Scholar]

[bib1] 1.Gershenwald J.E., Scolyer R.A. Melanoma staging: American Joint Committee on Cancer (AJCC) and beyond. Ann Surg Oncol. 2018;25(8):2105–2110. doi: 10.1245/s10434-018-6513-7. [DOI] [PubMed] [Google Scholar]

[bib2] 2.Wong S.L., Faries M.B., Kennedy E.B., et al. Sentinel lymph node biopsy and management of regional lymph nodes in melanoma: American Society of Clinical Oncology and Society of Surgical Oncology clinical practice guideline update. Ann Surg Oncol. 2018;25(2):356–377. doi: 10.1245/s10434-017-6267-7. [DOI] [PubMed] [Google Scholar]

[bib3] 3.Pedregosa F., Varoquaux G., Gramfort A., et al. Scikit-learn: machine learning in Python. J machine Learn Res. 2011;12:2825–2830. [Google Scholar]

[bib4] 4.El Sharouni M.A., Ahmed T., Varey A.H., et al. Development and validation of nomograms to predict local, regional, and distant recurrence in patients with thin (T1) melanomas. J Clin Oncol. 2021;39(11):1243–1252. doi: 10.1200/JCO.20.02446. [DOI] [PubMed] [Google Scholar]

[bib5] 5.Wan G., DeSimone M., Liu F., et al. 649 CNN-based histopathology image analysis for early-stage melanoma recurrence. J Invest Dermatol. 2022;142(8):S112. [Google Scholar]

PERMALINK

The impact of stage-related features in melanoma recurrence prediction: A machine learning approach

Guihong Wan, PhD

Bonnie Leung, BS

Nga Nguyen, MD, MPH

Mia S DeSimone, MD, MPH

Feng Liu, PhD

Min Seok Choi, MS

Diane Ho, RHIA, CTR

Valerie Laucks, BS

Stacey Duey, MCPH

Ryan J Sullivan, MD

Genevieve M Boland, MD, PhD

Nicole R LeBoeuf, MD, MPH

David Liu, MD, MPH

Alexander Gusev, PhD

Shawn G Kwatra, MD

Peter K Sorger, PhD

Kun-Hsing Yu, MD, PhD

Yevgeniy R Semenov, MD, MA

Graphical abstract

Table I.

Fig 1.

Conflicts of interest

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

The impact of stage-related features in melanoma recurrence prediction: A machine learning approach

Guihong Wan, PhD

Bonnie Leung, BS

Nga Nguyen, MD, MPH

Mia S DeSimone, MD, MPH

Feng Liu, PhD

Min Seok Choi, MS

Diane Ho, RHIA, CTR

Valerie Laucks, BS

Stacey Duey, MCPH

Ryan J Sullivan, MD

Genevieve M Boland, MD, PhD

Nicole R LeBoeuf, MD, MPH

David Liu, MD, MPH

Alexander Gusev, PhD

Shawn G Kwatra, MD

Peter K Sorger, PhD

Kun-Hsing Yu, MD, PhD

Yevgeniy R Semenov, MD, MA

Graphical abstract

Table I.

Fig 1.

Conflicts of interest

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases