Skip to main content
Journal of Clinical Pathology logoLink to Journal of Clinical Pathology
. 2005 Nov;58(11):1194–1198. doi: 10.1136/jcp.2005.026765

Interobserver reproducibility of histological features in cutaneous malignant melanoma

C Urso 1, F Rongioletti 2, D Innocenzi 3, C Saieva 4, D Batolo 5, S Chimenti 6, R Filotico 7, R Gianotti 8, M Lentini 5, C Tomasini 9, A Rebora 2, M Pippione 9
PMCID: PMC1770759  PMID: 16254111

Abstract

Aims: To assess the interobserver reproducibility of certain histological features proposed for the diagnosis of melanoma.

Methods: In a series of melanomas, 13 histological parameters were analysed: dimension > 6 mm, asymmetry, poor circumscription, irregular confluent nests, single melanocytes predominating, absence of maturation, suprabasal melanocytes, asymmetrical melanin, melanin in deep cells, cytological atypia, mitoses, dermal lymphocytic infiltrate, and necrosis.

Results: The agreement (reproducibility) between the nine observers was excellent (κ > 0.75) for 10 of the 13 examined features (dimension > 6 mm, poor circumscription, irregular confluent nests, single melanocytes predominating, absence of maturation, suprabasal melanocytes, asymmetrical melanin, melanin in deep cells, mitoses, and necrosis). The agreement for asymmetry was very close to excellence (κ  =  0.74), and that for cytological atypia (κ  =  0.65) and dermal lymphocytic infiltrate (κ  =  0.47) was slightly lower, but in the fair to good agreement range. The κ values obtained by comparison with the majority diagnosis were generally high (⩾ 0.85); the mean value of κ was lower (0.70) for only one parameter (dermal lymphocytic infiltrate).

Conclusions: The parameters investigated showed an overall good reproducibility.

Keywords: melanoma, reproducibility, diagnosis, skin, histopathology


Several morphological features are frequently used in the histopathological diagnosis of cutaneous melanoma, including the following: pagetoid infiltration, melanocytic atypia, mitotic figures, a band-like dermal inflammatory infiltrate containing melanophages, lack of maturation of abnormal melanocytes with the progressive descent into the dermis, pronounced variation in size and in shape of the melanocytic nests, poor circumscription of the intraepidermal melanocytes, confluence of the melanocytic nests, necrosis of melanocytes, involvement of epithelial adnexal structures by atypical melanocytes, asymmetry of the lesion, dimension greater than 6 mm, abundant melanin, mitoses near the base of the neoplasm, uneven distribution of melanin, melanocytes as solitary units predominating over nests, and uneven base of the lesion.1,2,3,4,5,6,7,8,9,10,11 However, despite the relatively high number of diagnostic criteria proposed, melanoma diagnosis remains problematic in a large proportion of cases, with possible diagnostic discord, even among experts.12–14 This could be the result of several factors, one of which might be the interobserver reproducibility of the histological criteria used.

“Despite the relatively high number of diagnostic criteria proposed, melanoma diagnosis remains problematic in a large proportion of cases, with possible diagnostic discord, even among experts”

The aim of our present study was to analyse a series of conventional cutaneous malignant melanomas (melanomas not referable to a specific subtype, such as spitzoid melanoma, desmoplastic melanoma, neurotropic melanoma, myxoid melanoma, etc.) to evaluate the interobserver reproducibility of certain histological features used for the diagnosis of melanoma.

MATERIALS AND METHODS

Dermatopathologist panel

Our study was undertaken by nine dermatopathologists, affiliated to the melanocytic lesion group of the Italian Association of Dermatopathology (AIDEPAT), from eight Italian institutions, namely: dermatopathology section, S M Annunziata Hospital, Health Unit 10 of Florence (CU); dermatopathology centre – Di. S E M, University of Genoa (FR); institute of dermatology, University La Sapienza, Rome (DI); department of human pathology, University of Messina (DB, ML); institute of dermatology, University Tor Vergata, Rome (SC); institute of dermatology, University of Bari (RF); institute of dermatology, University of Milan (RG); and institute of dermatology, University of Turin (CT).

Cases studied

Eighty melanocytic lesions, originally diagnosed as melanomas, were retrieved from the files of eight Italian institutions (10 consecutive cases for each institution, one slide for each case). The tissue fragments, containing the entire lesion, had been fixed in buffered formalin, processed routinely, and stained with haematoxylin and eosin. All identifying marks were removed from the slides, and they were relabelled with a standard label for the study, randomised, and renumbered from 1 to 80 by a person who did not take part in the histological evaluation.

Definition of histological features

Thirteen histological features commonly used in the diagnosis of malignant melanoma were considered. (1) Dimension > 6 mm,8,11 lesions, cut along their major axes, were measured on the histological slides. (2) Asymmetry, defined as present when a central line divided the lesion into two parts that looked different in shape, in thickness, or in number and position of dermal cells,8,10,11 and as absent only when the lesion appeared perfectly symmetrical. (3) Poor circumscription of the lesion, defined as present when the epidermal melanocytic proliferation, extending beyond the dermal component of the lesion, ended with single cells, rather than with a well defined nest.9–11,15 (4) Irregular and confluent nests, defined as epidermal melanocytic nests, variable in size and in shape and tending to confluence8,9,10,11; the position of melanocytic nests in the epidermal layers was not considered. (5) Single melanocytes predominating, defined as epidermal melanocytes disposed as solitary units predominating over melanocytic nests in some high power fields10,11; the position of melanocytes in the epidermal layers was not considered. (6) Absence of maturation, defined as failure of melanocyte nuclei to become smaller with progressive descent into the dermis.8,9,10,11,15 (7) Suprabasal melanocytes, defined as melanocytes above the dermo–epidermal junction8,10,11; the number of suprabasal cells, their location in the centre or in the periphery of the lesion, and whether cells appeared single or in nests were not considered. (8) Asymmetrical melanin, defined as present when a central line divided the lesion into two parts that showed different amounts of dermal pigment.10,11 (9) Melanin in deep cells, defined as the presence of melanin in melanocytes near the base of the neoplasm10,11; the feature was not graded: it was considered as present even if only small numbers of melanin granules were seen in the cytoplasm of deep melanocytes. (10) Cytological atypia, defined as melanocytic nuclei enlarged (more than keratinocytic ones), variable in size and in shape, hyperchromatic, with eosinophilic or amphophilic nucleoli8,9,10,11,15; atypia was not graded to avoid subjective evaluations: it was considered as present when it was slight/moderate and when it was severe. (11) Mitoses,8,9,10,11 defined as present when at least one was seen in the dermal component of the lesion. (12) Necrosis, defined as present when dermal necrotic melanocytes were seen.8,9,10,11 (13) Dermal lymphocytic infiltrate, defined as present when a dermal lymphocytic infiltrate was evident underlying and/or in the context of the melanocytic proliferation15; the feature was not graded: it was considered as present when it was sparse and pronounced, forming a continuous band in the dermis. Parameters, proposed by the coordinator of the study (CU), were individually discussed in detail, and finally approved by all the members of the panel.

Review procedure

Cases were examined at the multiheaded microscope. Each case was analysed for all the given features, which were evaluated on the basis of a yes/no decision, as present or absent. The dimension of the lesion was measured by one of us (CU) and results were checked by all the other members of the panel. When a feature could not be evaluated in a given case (for example, absence of maturation in an epidermal melanocytic lesion or melanin in deep cells in a superficially compound one), it was considered as non-applicable. For each lesion, participants individually recorded their own evaluations and their final diagnosis on a special form. Each observer evaluated the cases independently in a blinded fashion; that is, without knowledge of which institution had provided the slide and of the other investigators’ evaluations. No clinical data were provided. Subsequently, data were collected and elaborated. In each lesion, each feature was considered as present if at least five of the nine observers had agreed on its presence.

Statistical analysis

For the statistical study, data were computerised and analysed by the statistical package SAS (SAS Institute Inc Cary, North Carolina, USA; The logistic procedure, SAS/STAT user’s guide, Release 6.03. Cary, NC: SAS Institute, 1988). For the measurement of concordance between the nine pathologists, the evaluations of each pathologist were compared with those of each of the other members of the panel. Agreement among the nine pathologists was assessed using the κ statistic, which is a widely used index for measuring chance corrected agreement on a nominal or ordinal scale.16 Landis and Koch have characterised different ranges of values for κ with respect to the degree of agreement they suggest.17 For most purposes, values greater than 0.75 represent excellent agreement beyond chance, values below 0.40 represent poor agreement beyond chance, and values between 0.40 and 0.75 represent fair to good agreement beyond chance. A κ value close to 1 means near perfect agreement, whereas a κ value close to 0 does not necessarily means that agreement is poor, but only that it is not greater than that expected by chance alone. Negative values denote less than chance agreement. For each characteristic considered, a 2 × 2 diagnostic table was composed, using dichotomous categories, and specific κ values calculated. An overall value of κ as an average of the individual κ values was also calculated. Finally, the performance of each pathologist for each parameter considered was assessed by comparing his/her ratings with the majority diagnosis. The κ values obtained by comparison with the majority diagnosis are an estimate of the agreement between each pathologist and the “true” diagnosis.

RESULTS

Sixteen of the 80 lesions studied were excluded. Two cases showing melanoma cells associated with naevus remnants were excluded because naevus cells could bias the evaluation of one of the parameters studied (absence of maturation); one case was excluded because it displayed spitzoid features—large spindle and epithelioid cells; two cases were excluded because of the poor quality of the histological technique. Eleven cases were excluded because the panel members could not come to a unanimous diagnostic agreement. Sixty four cases, in which the diagnosis of melanoma was unanimously confirmed by the panel, were studied. Table 1 shows the histological characteristics (histotype, Clark level, and thickness). Two of the nine melanomas in situ were of the lentigo maligna type and seven of the superficial spreading type. Fifty three of the 55 invasive melanomas were superficial spreading melanomas, one a nodular melanoma, and one a lentigo maligna melanoma. Tables 2 and 3 show data on the prevalence of the studied parameters. Cytological atypia (fig 1A), dermal lymphocytic infiltrate (fig 1B), and irregular and confluent nests (fig 1D) were present in all nine cases of melanoma in situ. Single melanocytes predominating (fig 1C) were found in eight, and suprabasal melanocytes (fig 1C) and asymmetry in seven of the nine cases. Poor circumscription of the lesion (fig 1E) was seen in six, dimension > 6 mm in four, and asymmetrical melanin in three of the nine cases (table 2). Cytological atypia was present in all 55 cases of invasive melanoma, dermal lymphocytic infiltrate in 54 cases, suprabasal melanocytes in 53 cases, and asymmetry in 52 cases. Irregular and confluent nests, single melanocytes predominating, dimension > 6 mm, poor circumscription of the lesion, asymmetrical melanin, melanin in deep cells (fig 1G), and absence of maturation (fig 1F) were seen in most lesions. Mitoses (fig 1A) and necrosis (fig 1H) were found less frequently (table 3).

Table 1.

 Histological characteristics of the 64 malignant melanomas

Characteristic N %
Histotype
Superficial spreading 60 93.8
Nodular 1 1.5
Lentigo maligna 3 4.7
Clark level
    I 9 14.1
    II 37 57.8
    III 14 21.9
    IV 4 6.2
pT
pTis 9 14.1
pT1 (⩽1 mm) 41 64.1
pT2 (1.01–2.0 mm) 9 14.1
pT3 (2.01–4.0 mm) 5 7.7
Total 64 100.0

Table 2.

 Prevalence of 13 histological parameters in 9 malignant melanomas in situ

Feature Prevalence %
Cytological atypia 9 100
Dermal lymphocytic infiltrate 9 100
Irregular and confluent nests 9 100
Single melanocytes predominating 8 88.9
Suprabasal melanocytes 7 77.8
Asymmetry 7 77.8
Poor circumscription 6 66.7
Dimension >6 mm 4 44.5
Asymmetrical melanin 3 33.4
Melanin in deep cells NA
Absence of maturation NA
Mitoses NA
Necrosis NA

NA, not applicable.

Table 3.

 Prevalence of 13 histological parameters in 55 invasive malignant melanomas

Feature Prevalence %
Cytological atypia 55 100
Dermal lymphocytic infiltrate 54 98.2
Suprabasal melanocytes 53 96.4
Asymmetry 52 94.5
Irregular and confluent nests 48 87.3
Single melanocytes predominating 43 78.2
Dimension >6 mm 37 67.3
Poor circumscription 35 63.6
Asymmetrical melanin 32 58.2
Melanin in deep cells 29 52.7
Absence of maturation 29 52.7
Mitoses 16 29.1
Necrosis 1 1.8

Figure 1.

Figure 1

 (A) Melanocytic atypia and mitoses (haematoxylin and eosin (H&E) stain; original magnification, ×400). (B) Dermal lymphocytic infiltrate (H&E stain; original magnification, ×100). (C) Melanocytes as solitary units, often located in the suprabasal layers, predominate over melanocytic nests (H&E stain; original magnification, ×200). (D) Irregular nests tend to confluence (H&E stain; original magnification, ×100). (E) Poor circumscription of the lesion: the epidermal melanocytic proliferation ends with single cells (H&E stain; original magnification, ×100). (F) Absence of maturation: deep melanocytic nuclei (bottom) are not smaller than superficial ones (top) (H&E stain; original magnification, ×200). (G) Melanin granules in deep melanocytes (H&E stain; original magnification, ×100). (H) Necrosis of atypical melanocytes (H&E stain; original magnification, ×200).

Table 4 summarises the results of the agreement for each histological parameter considered (κ values). Agreement (reproducibility) between the nine observers was excellent (κ > 0.75) for 10 of the 13 examined features. The results for asymmetry (κ  =  0.74), cytological atypia (κ  =  0.65), and dermal lymphocytic infiltrate (κ  =  0.47) were in the fair to good agreement range (table 4). The κ values obtained by comparison with the majority diagnosis were > 0.75; the mean value of κ was less than 0.75 for one parameter only (dermal lymphocytic infiltrate; table 5). The κ values of absence of maturation, melanin in deep cells, mitoses, and necrosis were calculated in the 55 cases of invasive melanoma with a dermal melanocytic component (tables 4, 5).

Table 4.

 Kappa values for the 13 parameters evaluated in 64 malignant melanomas, comparing the performance of each pathologist for each parameter with each of the other members of the panel

Feature Mean SD Min 25th Median 75th Max
Dimension >6 mm 1.00
Asymmetry 0.74 0.30 −0.05 0.64 0.78 1.00 1.00
Poor circumscription 0.84 0.15 0.49 0.80 0.91 0.93 1.00
Irregular and confluent nests 0.91 0.17 0.60 1.00 1.00 1.00 1.00
Single melanocytes predominating 0.83 0.15 0.50 0.79 0.89 0.90 1.00
Absence of maturation* 0.87 0.11 0.63 0.78 0.92 0.93 1.00
Suprabasal melanocytes 0.76 0.32 0.20 0.65 1.00 1.00 1.00
Asymmetrical melanin 0.76 0.18 0.32 0.69 0.81 0.88 1.00
Melanin in deep cells* 0.86 0.09 0.65 0.83 0.83 0.92 1.00
Cytological atypia 0.65 0.48 −0.15 0.05 1.00 1.00 1.00
Mitoses* 0.90 0.08 0.69 0.83 0.91 1.00 1.00
Necrosis* 1.00
Dermal lymphocytic infiltrate 0.47 0.37 −0.06 0.06 0.47 0.65 1.00

Min, minimum value; 25th/75th, 25th/75th centile; Max, maximum value.

*Calculated in 55 invasive lesions.

Table 5.

 Kappa values for the 13 parameters evaluated in 64 malignant melanomas, comparing the performance of each pathologist for each parameter with the majority diagnosis

Feature Mean
Dimension >6 mm 1.00
Asymmetry 0.95
Poor circumscription 0.92
Irregular and confluent nests 0.96
Single melanocytes predominating 0.91
Absence of maturation* 0.93
Suprabasal melanocytes 0.87
Asymmetrical melanin 0.87
Melanin in deep cells* 0.93
Cytological atypia 0.85
Mitoses* 0.94
Necrosis* 1.00
Dermal lymphocytic infiltrate 0.70

*Calculated in 55 invasive lesions.

DISCUSSION

The histopathological diagnosis of cutaneous melanoma is a multifactorial process, based on the recognition of several architectural, cytological, and dermal histological criteria, identified over a long period of time.1,2,3,4,5,6,7,8,9,10,11 A melanocytic lesion is diagnosed as melanoma in the presence of a combination of histological criteria, none of which by itself is diagnostic of melanoma.12 In a subset of cases, however, the diagnosis may be particularly difficult for several reasons, which have not yet been adequately clarified. Among these, is the inconsistent presence of diagnostic histological criteria in a given lesion. Although possible in theory, in practice, a single melanoma will not show all the diagnostic parameters commonly used, but will generally exhibit only a certain number of them, and can sometimes show conflicting characteristics.14 Another factor is related to the fact the almost all criteria used are frequently found in benign melanocytic lesions, including special naevi, such as Spitz or Reed naevi, and common naevi.18–20 Finally, a further factor may be the poor reproducibility of the parameters currently used.

Several studies have investigated the interobserver reproducibility of diagnostic and prognostic criteria in melanocytic lesions.12,21,22,23,24 In our present study, we analysed 13 histological features used in the diagnosis of melanoma in a series of malignant melanomas. The statistical study of the agreement between the nine observers, using κ statistics, showed that 10 of the investigated features (dimension >6 mm, poor circumscription, irregular confluent nests, single melanocytes predominating, absence of maturation, suprabasal melanocytes, asymmetrical melanin, melanin in deep cells, mitoses, and necrosis) showed high reproducibility (κ > 0.75); asymmetry (κ  =  0.74), cytological atypia (κ  =  0.65), and dermal lymphocytic infiltrate (κ  =  0.47) were less reproducible (table 4). The relatively low κ value for cytological atypia (κ  =  0.65) resulted from a strict interpretation of the definition by some pathologists, who considered such a parameter absent in cases where the melanocytes did not show all the alterations required (enlargement of the nuclei, nuclear variation of size and shape, nuclear hyperchromatism, eosinophilic or amphophilic nucleoli). The relatively lower reproducibility of the dermal lymphocytic infiltrate was confirmed by comparison with the majority diagnosis (κ  =  0.70; table 5). In a previous study, the interobserver reproducibility of some features (symmetry, sharp circumscription, pagetoid infiltration, and nuclear atypia) was lower than that obtained in our present study, but results are not comparable, because the parameter definitions and methods used were different.23 The good degree of reproducibility obtained in our present study resulted from an accurate definition and a preliminary discussion about each parameter; this excellent reproducibility may also reflect the fact that the pathologists in our study were all dermatopathologists with a special interest in melanoma. However, in conclusion, the relatively high degree of agreement between observers suggests that diagnostic discord in melanoma diagnosis is not the result of the poor reproducibility of current parameters.

Take home messages.

  • We assessed the degree of interobserver agreement for 13 histological diagnostic criteria in 64 malignant melanomas and found that agreement between the nine observers was excellent (κ > 0.75) for 10 of the 13 features, one feature had a κ value close to excellence, and the other two were slightly lower, but in the fair to good agreement range

  • Thus, the relatively high degree of agreement between observers suggests that diagnostic discord in melanoma diagnosis is not the result of the poor reproducibility of current parameters

REFERENCES

  • 1.Tan MAL, Ackerman AB. Criteria for histopathologic diagnosis of melanoma, 1947–2000: a critique in historical perspective. Dermatopathology: Practical and Conceptual 2001;7:39–53. [Google Scholar]
  • 2.Lever WF. Histopathology of the skin. Philadelphia: JB Lippincott, 1949.
  • 3.Milne JA. An introduction to the diagnostic histopathology of the skin. Baltimore: Williams & Wilkins, 1972.
  • 4.Montgomery H. Dermatopathology. New York: Harper & Row Publishers, 1967.
  • 5.Percival GH, Montgomery GL, Dodds TC. Atlas of histopathology of the skin. 2nd ed. Maryland: Williams & Wilkins, 1962.
  • 6.Pinkus H, Mehregan AH. A guide to dermatohistopathology. New York: Meredith Co, 1969.
  • 7.Wayte DM. Pathology of nevi and melanomas. In: Helwig EB, Mostofi FK, eds. The skin by 30 authors. Maryland: Williams & Wilkins, 1971:522–4.
  • 8.Kamino H, Ackerman AB. Malignant melanoma in situ: the evolution of malignant melanoma within the epidermis. In: Ackerman AB, ed. Pathology of malignant melanoma. New York: Masson, 1981:59–91.
  • 9.Price NM, Rywlin AM, Ackerman AB. Histologic criteria for the diagnosis of superficial spreading malignant melanoma formulated on the basis of proven metastatic lesions. Cancer 1976;38:2434–41. [DOI] [PubMed] [Google Scholar]
  • 10.Dewan M, Ackerman AB. What is your diagnosis of these melanocytic neoplasms? Dermatopathology: Practical and Conceptual 1999;5:328–35. [Google Scholar]
  • 11.Roses DF, Harris MN, Ackerman AB. Diagnosis and management of cutaneous malignant melanoma. Philadelphia: WB Saunders Co, 1983.
  • 12.Corona R, Mele A, Amini M, et al. Interobserver variability on the histopathologic diagnosis of cutaneous melanoma and other pigmented skin lesions. J Clin Oncol 1996;14:1218–23. [DOI] [PubMed] [Google Scholar]
  • 13.Farmer ER, Gonin R, Hanna MO. Discordance in the histopathologic diagnosis of melanoma and melanocytic neoplasms between expert pathologists. Hum Pathol 1996;27:528–31. [DOI] [PubMed] [Google Scholar]
  • 14.Glusac EJ. Under the microscope: doctors, lawyers, and melanocytic lesions. J Cutan Pathol 2003;30:287–93. [DOI] [PubMed] [Google Scholar]
  • 15.Elder DE, Murphy GF. Melanocytic tumors of the skin. Atlas of tumor pathology, 3rd Series, Fascicle 2. Washington, DC: AFIP, 1991.
  • 16.Fleiss JL. Statistical methods for rates and proportions. New York: John Wiley, 1981.
  • 17.Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics 1977;33:159–74. [PubMed] [Google Scholar]
  • 18.Okun MR. Silhouette symmetry. An unsupportable histologic criterion for distinguishing Spitz nevi and compound nevi from malignant melanoma. Arch Pathol Lab Med 1997;121:48–53. [PubMed] [Google Scholar]
  • 19.Okun MR. Histologic demarcation of lateral borders. An unsupportable histologic criterion for distinguishing malignant melanoma from Spitz naevus and compound naevus. Histopathology 1998;33:158–62. [DOI] [PubMed] [Google Scholar]
  • 20.Urso C, Rongioletti F, Innocenzi D, et al. Histological features used in the diagnosis of melanoma are frequently found in benign melanocytic naevi. J Clin Pathol 2005;58:409–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Heenan PJ, Matz LR, Blackwell JB, et al. Inter-observer variation between pathologists in the classification of cutaneous malignant melanoma in Western Australia. Histopathology 1984;8:717–29. [DOI] [PubMed] [Google Scholar]
  • 22.de Wit PEJ, van’t Hof-Grootenboer B, Ruiter DJ, et al. Validity of the histopathological criteria used for diagnosis of dysplastic nevi. An interobserver study by the pathology subgroup of the EORTC malignant melanoma cooperative group. Eur J Cancer 1993;29A:831–9. [DOI] [PubMed] [Google Scholar]
  • 23.Lock-Andersen J, Hou-Jensen K, Hansen JPH, et al. Observer variation in histological classification of cutaneous malignant melanoma. Scand J Plast Reconstr Surg Hand Surg 1995;29:141–8. [DOI] [PubMed] [Google Scholar]
  • 24.Cook MG, Clarke TJ, Humphrey S, et al. The evaluation of diagnostic and prognostic criteria and the terminology of thin cutaneous malignant melanoma by the CRC melanoma pathology panel. Histopathology 1996;28:497–512. [DOI] [PubMed] [Google Scholar]
  • 25.Scolyer RA, Shaw HM, Thompson JF, et al. Interobserver reproducibility of histopathologic prognostic variables in primary cutaneous melanoma. Am J Surg Pathol 2003;27:1571–6. [DOI] [PubMed] [Google Scholar]

Articles from Journal of Clinical Pathology are provided here courtesy of BMJ Publishing Group

RESOURCES