Skip to main content
Cureus logoLink to Cureus
. 2020 Mar 11;12(3):e7239. doi: 10.7759/cureus.7239

Ultrasound Classification of Thyroid Nodules: A Systematic Review

Rakesh Mistry 1,, Christopher Hillyar 2, Anjan Nibber 3, Thushanth Sooriyamoorthy 4, Nirmal Kumar 5
Editors: Alexander Muacevic, John R Adler
PMCID: PMC7067371  PMID: 32190531

Abstract

Ultrasound (US) based classification systems exist for the stratification of thyroid nodules based on the risk for malignancy. This systematic review aimed to assess the evidence for the performance of US-based thyroid nodule classification systems through correlation with fine needle aspiration biopsy (FNAB). PubMed and Scopus were searched using keywords that included ‘ultrasound classification’, ‘thyroid nodules’, ‘fine needle aspiration’, and ‘malignancy’. Inclusion criteria were as follows: studies/reviews reporting on US imaging for the classification of thyroid nodules. Exclusion criteria were as follows: no comparison between US imaging findings and histology reports based on FNAB, no full English text available/accessible. The database searches identified 66 publications. After evaluation, 12 studies met the inclusion criteria. Two US-based classification systems for thyroid nodules were assessed: the Thyroid Imaging Reporting and Data System (TIRADS) and the American Thyroid Association (ATA) guidelines. For TIRADS, the sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) ranged from 70.6% to 97.4%, 29.3% to 90.4%, 23.3% to 64.3%, and 87.1% to 99.0%, respectively. The median sensitivity, specificity, PPV, and NPV for TIRADS was 90.0%, 57.4%, 49.0%, and 91.0%, respectively. One study comparing TIRADS with the ATA guidelines demonstrated that TIRADS was superior in terms of sensitivity, whereas the ATA guidelines were superior in terms of specificity and PPV. The high sensitivity and NPV of the US-based TIRADS classification system have excellent utility for correctly classifying nodules as positive for malignant disease and for predicting the absence of malignant disease. The paucity of studies assessing the ATA guidelines highlights avenues for further research comparing TIRADS with other systems of thyroid nodule classification.

Keywords: thyroid, nodules, ultrasound, fine needle aspiration biopsy, tirads, ata, bethesda, thyroid cancer

Introduction and background

High-resolution ultrasound (US) is the gold standard test for the identification of thyroid nodules. Despite the high prevalence of thyroid nodules (up to 12% of adults), the incidence of thyroid cancer is relatively low (3.2 per 100,000) [1]. Although the majority of thyroid nodules are asymptomatic, it is recommended that all patients with palpable nodules undergo US imaging to determine whether the nodule requires a fine needle aspiration biopsy (FNAB), US follow-up, or no further evaluation. However, because of a lack of correlation between clinical symptoms and malignancy, the American Association of Clinical Endocrinologists recommends that all nodules smaller than 10 mm or any suspicious nodules on US imaging should be investigated further using FNAB. This recommendation is based on studies that established that prognosis is inversely related to nodule size [2,3].

Recently, guidelines have been developed in order to permit US imaging to be used for the identification and stratification of nodules based on the risk of malignancy [4-10]. These guidelines include the Thyroid Imaging Reporting and Data System (TIRADS), which was developed and validated based on existing multi-institutional data and expert opinion [11]. The risk stratification of thyroid nodules not only serves to identify patients that require FNAB but also reduces the unnecessary risk and cost associated with performing invasive procedures, such as FNAB, in patients with low-risk nodules that require either US follow-up or no further investigation. Therefore, the decision to perform FNAB should be based on the risk of malignancy rather than the size of the nodule per se.

Currently, multiple systems are used worldwide for the risk stratification of thyroid nodule features on US scanning. Many of these systems use complex algorithms based on several US imaging features, which may be difficult to use depending on the experience of the individual performing the US scan. The aim of this study was to review the current evidence for US classification systems of thyroid nodules and their correlation to subsequent FNAB findings, with a view of providing suggestions for avenues for future research.

Review

Methods

Searches

Recommendations from the Preferred Reporting Items for Systematic Reviews and Meta-Analyses were incorporated into this review [12]. Keywords included ‘ultrasound classification’, ‘thyroid nodules’, ‘fine needle aspiration’, and ‘malignancy’. Electronic searches were performed on PubMed and Scopus databases for English language studies published between September 2012 and September 2017. The search term used in each database were as follows: PubMed (ultrasound classification AND thyroid nodules AND fine needle aspiration AND malignancy); Scopus (TITLE-ABS-KEY [ultrasound classification AND thyroid nodules AND fine needle aspiration AND malignancy]). No filters for journal, study design, or subject were applied to the search, although conference proceedings and abstracts were excluded.

Study Selection

Two authors (Rakesh Mistry and Thushanth Sooriyamoorthy) independently assessed all studies from both searches against the eligibility criteria. Studies were included that identified the use of US imaging for the classification of thyroid nodules. The following studies were excluded in this study: studies with no full English text available/accessible and studies with no US-based system of classification of thyroid nodules and/or no attempt to correlate US findings with histology based on FNAB.

Data Extraction

Two authors (R.M. and T.S.) independently extracted data from the included studies into a self-designed template, referring to the Cochrane Handbook for Systematic Reviews of Interventions as a guide [13]. Study information and clinical characteristics were extracted from all studies (where reported), including the performance (sensitivity, specificity, positive predictive value [PPV], and negative predictive value [NPV]) of US imaging for the classification of thyroid nodules. Multiple authors (Christopher Hillyar, Anjan Nibber, R.M., and T.S.) evaluated the extracted data for accuracy and agreement.

Data Analysis

Not all studies reported all variables. Items that were not reported or were unclear were not included in the analysis. Data were analyzed using Microsoft Excel. Two authors (C.H. and A.N.) independently conducted data analysis.

Results

Study Characteristics

The database searches identified 100 publications. After evaluation, 12 studies met the inclusion criteria (Figure 1). A summary of the results extracted from these studies is presented in Table 1.

Figure 1. Identification of the eligible studies for analysis.

Figure 1

Table 1. Summary of data extracted from the included studies.

TIRADS, Thyroid Imaging Reporting and Data System; ATA, American Thyroid Association; PPV, positive predictive value; NPV, negative predictive value; NR, not reported

Study type Population characteristics Nodule classification system Ultrasound scan performance Reference
Participants (nodules); n Male:female participants (nodules); n:n, % male Mean/median age (range); years Sensitivity (%) Specificity (%) PPV (%) NPV (%)
Guidelines NR NR NR TIRADS 88.0 49.0 49.0 88.0 Gharib et al. [14]
Multi-center study 3315 participants (3822 nodules) NR (766:3056, 20% male) Mean: 54 (18-97) TIRADS NR NR NR NR Middleton et al. [15]
Prospective study 2921 participants (3980 nodules) 951:1970, 33% male (NR) Mean: 51 (16-78) TIRADS 97.0 90.0 40.0 91.0 Zhang et al. [16]
Prospective study 238 participants (272 nodules) NR (78:194, 29% male) Patients with benign nodules, mean of 42 (NR); patients with malignant nodules, mean 43 (NR) TIRADS NR NR 63.9 NR Chandramohan et al. [17]
Prospective study 105 participants (NR) 15:90, 14% male (NR) Median: 46 (16-80) ATA guidelines NR NR NR NR Rosário et al. [18]
Prospective study 100 participants (157 nodules) 14:86, 14% male (NR) Mean: 49 (NR) Bethesda NR NR NR NR Puno-Ramos et al. [19]
Retrospective study 1241 participants (1293 nodules) NR (NR) Mean: 50 (18-87) TIRADS vs ATA guidelines 97.4 vs 95.3 29.3 vs 37.4 23.3 vs 98.1 98.1 vs 97.3 Yoon et al. [20]
Retrospective study 906 participants (1000 nodules) NR (NR) NR (NR) TIRADS NR NR NR NR Rahal Jr et al. [21]
Retrospective study 100 participants (NR) NR (NR) NR (NR) TIRADS 70.6 90.4 NR 93.8 Singaporewalla et al. [22]
Retrospective study 84 participants (87 nodules) 33:51, 18% male (NR) Mean: 59 (34-85) TIRADS 90.0   57.4 64.3 87.1 Yoon et al. [23]
Review NR NR (NR) NR (NR) Bethesda NR NR NR NR Heller et al. [24]
Review NR NR (NR) NR (NR) NR NR NR NR NR Papini et al. [25]

The included studies consisted mainly of prospective (n=4) and retrospective studies (n=4) followed by reviews/guidelines (n=3) and a multi-center study (n=1). In terms of population characteristics, four larges studies consisted of 3,315 participants (3,822 nodules), 2921 participants (3,980 nodules), 1,241 participants (1,293 nodules), and 906 participants (1,000 nodules) [15,16,20,21]. Five comparatively small studies included 238 participants (272 nodules), 105 participants (nodules not reported), 100 participants (157 nodules), 100 participants (nodules not reported), and 84 participants (87 nodules) [17-19, 22,23]. A total of 9,010 participants (10,611 nodules) were reported from all studies [14-25]. The proportion of participants who were male (nodules from male participants) reported from studies ranged from 14% to 33% (20-29%) [15-19,22,23]. The mean age of all participants was relatively consistent and reported to be 46-59 years [15,16,18-20,23]. One study reported the mean age only for participants with benign and malignant nodules: 42 and 43 years, respectively [17]. Another study reported the median age for all participants (46 years) [18]. By comparison, the reported age range varied to a greater extent, with a lower limit of 16-34 years and an upper limit of 78-97 [15,16,18,20,23].

Thyroid Nodule Classification Systems

In total, two US-based classification systems for thyroid nodules were identified: TIRADS and the American Thyroid Association (ATA) guidelines. In addition, the Bethesda classification system of histological reporting for FNAB of thyroid tissue was also reported. Eight (66%) studies reported the use of TIRADS [14-17,20-23] followed by two studies (8%) that used the ATA guidelines [18,20] and two (17%) studies that used the Bethesda system [19,24].

Performance of Ultrasound Imaging for the Classification of Thyroid Nodules

Of the eight studies using TIRADS, six reported performance parameters for US-based classification of thyroid nodules (e.g. sensitivity, specificity, PPV, and NPV) [14,16,17,20,22,23]. Of the two studies reporting on the ATA guidelines, only one included US imaging performance parameters [20]. No publications reported US imaging performance parameters with the Bethesda system. Table 2 summarizes the performance of US imaging for the classification of thyroid nodules using the TIRADS system.

Table 2. Performance of ultrasound-based classification of thyroid nodules using TIRADS.

PPV, positive predictive value; NPV, negative predictive value; TIRADS, Thyroid Imaging Reporting and Data System

Ultrasound performance parameter Participants (nodules) Median (%) Range (%) References
Sensitivity 4346 participants (5360 nodules) 90.0 70.6-97.4 [14,16,20,22,23]
Specificity 4346 participants (5360 nodules) 57.4 29.3-90.4 [14,16,20,22,23]
PPV 4484 participants (5632 nodules) 49.0 23.3-64.3 [14,16,17,20,23]
NPV 4346 participants (5360 nodules) 91.0 87.1-99.0 [14,16,20,22,23]

The sensitivity, specificity, PPV, and NPV ranged from 70.6% to 97.4%, 29.3% to 90.4%, 23.3% to 64.3%, and 87.1% to 99.0%, respectively. The median sensitivity reported for TIRADS was 90.0%, with three studies reporting a sensitivity for TIRADS of 90.0% or above [16,20,23]. Although one study reported that the sensitivity of TIRADS was 70.6%, this retrospective study assessed a small cohort of only 100 participants [22]. In contrast, analysis of the sensitivity of TIRADS from larger studies gave sensitives of 97.0%-97.4%, when a total of 5,273 nodules (from 4,162 participants) were assessed [16,20]. Due to a wide range of reported results, the median specificity (57.5%) for TIRADS was considerably poorer than the sensitivity. Only one large study assessing 3,980 nodules (from 2,921 participants) reported favorable results with a specificity of 90.0% [16], whereas two studies, including one assessing 1,293 nodules (from 1,241 participants), reported specificities of less than 50%, with the lowest specificity reported being 29.3% [14,20]. The median PPV (49.0%) for TIRADS was similarly poor, with two large studies including a total of 4,162 nodules (from 5,273 participants) reporting PPVs of 23.3%-40.0% [16,20]. Finally, the median NPV (91.0%) of TIRADS was excellent and was the most consistently reported performance parameter with the lowest range. Three of five studies reported NPVs of greater than 90.0% [16,20,22], with two of these being large studies including a total of 5,273 nodules (from 4,162 participants), which reported an NPV of 91.1%-98.1% [16,20].

In one study, a direct comparison was made between the TIRADS and the ATA guidelines [20]. Yoon et al. reported that TIRADS was superior to the ATA guidelines in terms of sensitivity (97.4% vs 95.3%; p<0.001), although the ATA guidelines were superior to TIRADS in terms of specificity (37.4% vs 29.3%; p<0.001) and PPV (98.1% vs 23.3%; p<0.001). No statistically significant difference was observed between TIRADS and the ATA guidelines in terms of NPV. This study also reported that, unlike TIRADS, some nodules could not be classified using the ATA guidelines [20].

Conclusions

This study assessed the literature on US-based thyroid nodule classification systems, which demonstrated that TIRADS has utility at classifying thyroid nodules. The variability of the specificity of TIRADS, which was borne out by large studies assessing thousands of thyroid nodules, suggests that the performance of US, especially at classifying nodules as negative for disease, is highly dependent on the skill of the operator. In clinical practice, the poor PPV of TIRADS may be associated with an excess number of FNABs of benign nodules and represents a source of procedural risk, reduced cost-effectiveness, and unnecessary discomfort and concern for the patient. Although mild pain from FNAB can be controlled with paracetamol, future research should focus on quantifying the pain and stress encountered by patients undergoing FNAB for thyroid nodules. The favorable NPV of TIRADS may offset the impact of the PPV and help reduce the number of unnecessary FNABs of benign thyroid nodules. The paucity of studies assessing the ATA guidelines makes any comparison with TIRADS a tentative assessment at best and represents a significant opportunity for further research. Thus, research directed at improving the TIRADS system using powered studies with large patient populations is required to compare TIRADS with other classification systems (ATA guidelines/Bethesda) in order to demonstrate superiority. This may be used to inform and update the British Thyroid Association (BTA) guidelines, an area of particular importance in the UK.

Acknowledgments

Authors Rakesh Mistry and Christopher Hillyar contributed equally to this work and should be considered joint-first authors.

The content published in Cureus is the result of clinical experience and/or research by independent individuals or organizations. Cureus is not responsible for the scientific accuracy or reliability of data or conclusions published herein. All content published within Cureus is intended only for educational, research and reference purposes. Additionally, articles published within Cureus should not be deemed a suitable substitute for the advice of a qualified health care professional. Do not disregard or avoid professional medical advice due to content published within Cureus.

Footnotes

The authors have declared that no competing interests exist.

References

  • 1.Thyroid cancer statistics. [Feb;2020 ];Cancer Research UK. https://www.cancerresearchuk.org/health-professional/cancer-statistics/statistics-by-cancer-type/thyroid-cancer/ 2020
  • 2.The prognostic value of primary tumor size in papillary and follicular thyroid cancer. Machens A, Holzhausen HJ, Dralle H. Cancer. 2005;103:2269–2273. doi: 10.1002/cncr.21055. [DOI] [PubMed] [Google Scholar]
  • 3.Cancer stat facts: thyroid cancer. [Feb;2020 ];https://seer.cancer.gov/statfacts/html/thyro.html 2020
  • 4.Management of thyroid nodules detected at US: Society of Radiologists in Ultrasound consensus conference statement. Frates MC, Benson CB, Charboneau JW, et al. Radiology. 2005;237:794–800. doi: 10.1148/radiol.2373050220. [DOI] [PubMed] [Google Scholar]
  • 5.2015 American Thyroid Association management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: the American Thyroid Association Guidelines Task Force on Thyroid Nodules and Differentiated Thyroid Cancer. Haugen BR, Alexander EK, Bible KC, et al. Thyroid. 2016;26:1–133. doi: 10.1089/thy.2015.0020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Risk stratification of thyroid nodules on ultrasonography with the French TI-RADS: description and reflections. Russ G. Ultrasonography. 2016;35:25–38. doi: 10.14366/usg.15027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Ultrasonography diagnosis and imaging-based management of thyroid nodules: revised Korean Society of Thyroid Radiology consensus statement and recommendations. Shin JH, Baek JH, Chung J, et al. Korean J Radiol. 2016;17:370–395. doi: 10.3348/kjr.2016.17.3.370. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.NCCN guidelines: thyroid carcinoma. [Feb;2020 ];https://www.nccn.org/professionals/physician_gls/default.aspx 2020
  • 9.Guidelines for the management of thyroid cancer. Perros P, Boelaert K, Colley S, et al. Clin Endocrinol (Oxf) 2014;81:1–122. doi: 10.1111/cen.12515. [DOI] [PubMed] [Google Scholar]
  • 10.Guidelines of the French Society of Endocrinology for the management of thyroid nodules. Wémeau JL, Sadoul JL, d'Herbomez M, et al. Ann Endocrinol (Paris) 2011;72:251–281. doi: 10.1016/j.ando.2011.05.003. [DOI] [PubMed] [Google Scholar]
  • 11.Multiinstitutional analysis of thyroid nodule risk stratification using the American College of Radiology thyroid imaging reporting and data system. Middleton WD, Teefey SA, Reading CC, Langer JE, Beland MD, Szabunio MM, Desser TS. AJR Am J Roentgenol. 2017;208:1331–1341. doi: 10.2214/AJR.16.17613. [DOI] [PubMed] [Google Scholar]
  • 12.Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement. [Mar;2020 ];Moher D, Shamseer L, Clarke M, et al. Syst Rev. 2015 4:1. doi: 10.1186/2046-4053-4-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Higgins JPT, Green Sally, eds eds. Vol. 1. The Cochrane Collaboration; [Feb;2020 ]. 2019. Cochrane Handbook for Systematic Reviews of Interventions; p. 2019. [Google Scholar]
  • 14.American Association of Clinical Endocrinologists, American College of Endocrinology, and Assoziazione Medici Endocrinologi medical guidelines for the diagnosis and management of thyroid nodules - 2016 update. Gharib H, Paini E, Garber JR, et al. Enocr Pract. 2016;22:1–60. doi: 10.4158/EP161208.GL. [DOI] [PubMed] [Google Scholar]
  • 15.Multiinstitutional analysis of thyroid nodule risk stratification using the American College of Radiology Thyroid Imaging Reporting and Data System. Middleton WD, Teefey SA, Reading CC, Langer JE, Beland MD, Szabunio MM, Desser TS. AJR Am J Roentgenol. 2017;208:1331–1341. doi: 10.2214/AJR.16.17613. [DOI] [PubMed] [Google Scholar]
  • 16.Prospective validation of an ultrasound-based thyroid imaging reporting and data system (TI-RADS) on 3980 thyroid nodules. [Mar;2020 ];Zhang J, Liu BJ, Xu HX, et al. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4484032/ Int J Clin Exp Med. 2015 8:5911–5917. [PMC free article] [PubMed] [Google Scholar]
  • 17.Is TIRADS a practical and accurate system for use in daily clinical practice? Chandramohan A, Khurana A, Pushpa BT, et al. Indian J Radiol Imaging. 2016;26:145–152. doi: 10.4103/0971-3026.178367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Thyroid nodules with highly suspicious ultrasonographic features, but with benign cytology on two occasions: is malignancy still possible? Rosário PW, Calsolari MR. Arch Endocrinol Metab. 2016;60:402–404. doi: 10.1590/2359-3997000000176. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Ultrasound features of thyroid nodules predictive of thyroid malignancy as determined by fine needle aspiration biopsy. Puno-Ramos MPG, Villa ML, Kasala RG, Arzadon J, Alcazaren ENS. https://www.pcp.org.ph/files/PJIM%20Vol53%20No2/Ultrasound_Features_of_Thyroid_Nodules_Predictive_of_Thyroid_Malignancy_as_determined_by_Fine_Needle_Aspiration_Biopsy.pdf Philippine J Internal Med. 2015;53:1–8. [Google Scholar]
  • 20.Malignancy risk stratification of thyroid nodules: comparison between the thyroid imaging reporting and data system and the 2014 American Thyroid Association management guidelines. Yoon JH, Lee HS, Kim EK, Moon HJ, Kwak JY. Radiology. 2015;278:917–924. doi: 10.1148/radiol.2015150056. [DOI] [PubMed] [Google Scholar]
  • 21.Correlation of thyroid imaging reporting and data system (TI-RADS) and fine needle aspiration: experience in 1,000 nodules. Rahal Jr A, Faldarella PM, Rocha RD, et al. Einstein (Sao Paulo) 2016;14:119–123. doi: 10.1590/S1679-45082016AO3640. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Clinico-pathological correlation of thyroid nodule ultrasound and cytology using the TIRADS and Bethesda Classifications. Singaporewalla RM, Hwee J, Lang TU, Desai V. World J Surg. 2017;41:1807–1811. doi: 10.1007/s00268-017-3919-5. [DOI] [PubMed] [Google Scholar]
  • 23.Thyroid incidentalomas detected on 18F-fluorodeoxyglucose-positron emission tomography/computed tomography: thyroid imaging reporting and data system (TIRADS) in the diagnosis and management of patients. Yoon JH, Cho A, Lee HS, Kim EK, Moon HJ, Kwak JY. Surgery. 2015;158:1314–1322. doi: 10.1016/j.surg.2015.03.017. [DOI] [PubMed] [Google Scholar]
  • 24.Correlation of ultrasound findings with the Bethesda cytopathology classification for thyroid nodule fine-needle aspiration: a primer for radiologists. Heller MT, Gilbert C, Ohori NP, Tublin ME. AJR Am J Roentgenol. 2013;201:0. doi: 10.2214/AJR.12.9071. [DOI] [PubMed] [Google Scholar]
  • 25.Diagnosis of endocrine disease: thyroid ultrasound (US) and US-assisted procedures: from the shadows into an array of applications. Papini E, Pacella CM, Hegedus L. Eur J Endocrinol. 2014;170:0. doi: 10.1530/EJE-13-0917. [DOI] [PubMed] [Google Scholar]

Articles from Cureus are provided here courtesy of Cureus Inc.

RESOURCES