Introduction
Over the last two decades the use of sonographic risk stratification systems (SRSS) has become an integral part in the evaluation of patients with thyroid nodules.1–4 The aim of these systems is to standardize the evaluation of thyroid cancer risk according to ultrasound features, in order to individualize the management of patients with thyroid nodules.1–3,5,6 Clinical evidence and practice guidelines further support the routine use of risk stratification, as it can decrease the number of unnecessary thyroid biopsies and improve reporting of thyroid cancer risk while avoiding missed diagnoses of clinically relevant thyroid cancer.7–10
More recently, artificial intelligence (AI) methodologies have been applied to improve thyroid cancer risk stratification and the care of patients with thyroid nodules with promising results.11,12 This article aims to review ultrasound features, SRSS and current clinical evidence incorporating AI in the diagnostic evaluation of patients with thyroid nodules.
Association of thyroid nodules ultrasound features and thyroid cancer
Certain individual US features are specific for malignancy while others are associated with benign histology.1,2,13 US features of benign and malignant nodules may overlap, limiting their diagnostic properties as a single “rule-in” or “rule-out” test for thyroid cancer diagnosis. However, they are extremely useful to stratify the risk for thyroid malignancy.1,2,13 Suspicious features include micro-calcifications, local invasion, taller-than-wide shape, irregular/infiltrative margins, and markedly reduced echogenicity. However, the absence of a halo, ill-defined margins, solid composition, and vascularity, are less specific.14–16 Cystic nodules and spongiform nodules have a high diagnostic odds ratio to predict benign nodules.17 While a solitary suspicious finding may be associated with variable rates of cancer, the accumulation of suspicious findings significantly increases the malignancy risk. Indeed, a retrospective study of 1658 thyroid nodules measuring ≥ 1 cm showed in a univariate analysis that there was a significant association of malignancy with a single suspicious ultrasound feature and on multivariate analysis the malignancy risk was further increased with an escalating number of high risk sonographic features.18
Table 1 summarizes thyroid ultrasound features, their associated histologic findings and diagnostic odds ratios to predict benign or malignant thyroid nodules.2,13,15,19
Table 1:
Thyroid ultrasound features, definitions, and diagnostic odds ratio
| US feature | Definition | Descriptors | Histological background | Diagnostic Odds Ratio (DOR)13 |
|---|---|---|---|---|
| Composition | Proportion of solid tissue and fluid in a nodule | Solid Predominantly solid Predominantly cystic Cystic |
Hyperplastic nodules have abundant colloid which appears cystic on sonography. Neoplasms can undergo cystic degeneration and have cystic areas. |
Solid (DOR-M) (4.45, 2.63–7.5) Cystic (DOR-B) (6.78, 2.26–20.3) Spongiform (DOR-B) (12, 0.61–234.3) |
| Echogenicity | Brightness relative to normal thyroid parenchyma | Hyperechoic Isoechoic Hypoechoic Very hypoechoic |
Classic PTC and MTC produce less acoustic interfaces than micro-follicles and may appear hypoechoic | Hypoechoic (DOR-M) (4.5, 3.2–6.4) Isoechoic (DOR-B) (3.6, 2–6.3) |
| Borders | Interface between nodule and surrounding thyroid parenchyma | Smooth Ill-defined Irregular Infiltrative Extra-thyroidal extension |
Infiltrative or lobular borders may be concerning for invasive thyroid carcinoma. Hyperplastic nodules may have ill-defined borders due to similar echogenicity to surrounding parenchyma, which may not be suspicious for malignancy |
Infiltrative margins (DOR-M) (6.89, 3.35–14.1) |
| Shape | Evaluated in the axial plane by comparing the height (“tallness”) and width of a nodule measured parallel and perpendicular to the ultrasound beam | Taller-than-wide Wider-than-tall |
Disproportionate growth in anteroposterior dimension is considered an aggressive growth pattern | Taller-than-wide (DOR-M) (11.14, 6.6–18.9) |
| Echogenic foci | Hyperechoic foci | Large comet tail artifacts Macrocalcifications Peripheral rim calcifications Punctate echogenic foci |
Large comet tail artifacts are associated with colloid. Punctuate echogenic foci have been associated with Psammoma bodies, which may be associated with PTC. |
Internal microcalcifications (DOR-M) (6.78, 4.48–10.24) |
DOR-M, diagnostic odds ratio to predict malignant nodules and DOR-B, diagnostic odds ratio to predict benign nodules, presented as estimate and 95% Confidence Interval. PTC, papillary thyroid cancer; MTC, medullary thyroid cancer.
Contemporary thyroid ultrasound risk stratification systems - overview
Numerous SRSS exist with the goal of standardizing the evaluation of patients with thyroid nodules.1,3,20,21 Fundamentally, these systems aim to separate thyroid nodules according to thyroid cancer risk and propose a nodule size at which biopsy should be recommended, according to the estimated risk for thyroid cancer.1,3,5,21
Evaluation of individual thyroid ultrasound features provides insight into the nature of the underlying lesion and its estimated cancer risk. Moreover, the presence of ultrasound features suspicious for malignancy are rarely independent of each other.15 For this reason, establishing sonographic patterns to aid the management of patients with thyroid nodules is based on the concept that sonographic features are present concurrently. While pattern-based SRSS are associated with improved inter-observer variability in assigning risk classification1, these systems1,3,22 have the disadvantage that 5–17% of nodules are non-classifiable.23,24 Therefore, point based systems, such as the American College of Radiology Thyroid Imaging Reporting & Data System (ACR TIRADS) have the advantage of being inclusive of all nodule types.2 However, classification of nodules using the ACR TIRADS is limited by interobserver variability in assigning scores for each feature and can be time-consuming in the setting of classifying multiple nodules.2
Table 2 summarizes contemporary thyroid nodule SRSS, their diagnostic performance and clinical implications.1–3,5,7,21,23,25,26
Table 2:
Diagnostic performance of SRSS
| SRSS | System type | Categories | Biopsy cut-off | POM5 | Unnecessary biopsy rate7 (95% CI) | Missed malignancy rate23 % | Sensitivity25 (95% CI) | Specificity25 (95% CI) | Likelihood ratio for positive results (LR+)25 (95% CI) | DOR25 (95% CI) | Inter-observer agreement26 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| ATA | Ultrasound pattern based | Benign | No FNA | 0 | 51% (44–58) | 4.1% | 87% (75–94) | 31% (24–40) | 1.2 (1.0–1.4) | 3.1 (1.3–7.1) | 0.5 (0.4–0.6) |
| Very low risk | > 2cm or no FNA | <3% | |||||||||
| Low risk | 1.5 cm | 5–10% | |||||||||
| Intermediate risk | ≥ 1 cm | 10–20% | |||||||||
| High Risk | ≥ 1 cm | 70–90% | |||||||||
| ACR-TIRADS | Weighted point-based system | Benign (TIRADS-1) | No FNA | <2% | 25% (22–29) | 2.2% | 74% (61–83) | 64% (56–70) | 1.9 (1.6–2.3) | 4.9 (3.1–7.7) | 0.5 (0.4–0.6) |
| Not suspicious (TIRADS-2) | No FNA | <2% | |||||||||
| Mildly suspicious (TIRADS-3) | ≥ 2.5 cm | 5% | |||||||||
| Moderately suspicious (TIRADS-4) | ≥ 1.5 cm | 5–20% | |||||||||
| Highly suspicious (TIRADS-5) | ≥ 1 cm | >20% | |||||||||
| ETA | Ultrasound pattern based | EU-TIRADS 1 - normal | - | 0 | 38% (16–66) | 3.2% | 54% (51–57) | 53% (51–55) | 1.4 (1.0–1.8) | 2.2 (0.9–5.1) | 0.6 (0.5–0.6) |
| EU-TIRADS 2- benign | No FNA | 0 | |||||||||
| EU-TIRADS 3- low risk | > 2 cm | 2–4% | |||||||||
| EU-TIRADS 4- intermediate risk | > 1.5 cm | 6–17% | |||||||||
| EU-TIRADS 5- high risk | > 1 cm | 26–87% | |||||||||
| AACE/ ACE/ AME | Ultrasound pattern based | Low risk | > 2 cm, increase in size, clinical features | 1% | N/A | 2.9% | 74% (71–78) | 53 (51–55) | 1.5 (1.1–2.1) | 3.1 (1.0–9.4) | 0.4 (0.4–0.6) |
| Intermediate risk | > 2 cm | 5–15% | |||||||||
| High risk | ≥ 1 cm | 50–90% | |||||||||
| K-TIRADS | Ultrasound pattern based | K-TIRADS 1 – no nodule | NA | - | 55% (42–67) | 3.5% | 86 (73–94) | 28 (20–38) | 1.2 (1.0–1.4) | 2.5 (1.1–5.5) | 0.5 (0.4–0.6) |
| K-TIRADS 2 – benign | ≥ 2 cm | <3% | |||||||||
| K-TIRADS 3 – low suspicion | ≥ 1.5 cm | 3–15% | |||||||||
| K-TIRADS 4 – Intermediate suspicion | ≥ 1 cm | 15–50% | |||||||||
| K-TIRADS 5 – High suspicion | ≥ 1 cm (>0.5 cm selective) | >60% |
AACE, American Association of Clinical Endocrinologists; ACE, American College of Endocrinology; ACR, American College of Radiology; AME, Associazione Medici Endocrinologi (AME); ATA, American Thyroid Association; CI, confidence interval; DOR, diagnostic odds ratio; ETA, European Thyroid Association, ETA; EU-TIRADS, European Thyroid Imaging Reporting and Data System; FNA, fine needle aspiration; K-TIRADS, Korean Thyroid Imaging Reporting and Data System; LR, likelihood ratio; TIRADS, - Thyroid Imaging Reporting and Data System; N/A, not available; POM, prevalence of malignancy
Clinical benefits of thyroid nodule risk stratification
The use of thyroid nodule SRSS has revolutionized the management of patients with thyroid nodules by 1) improving the quality of thyroid ultrasound assessment and communication between clinicians, 2) potentially decreasing the number of unnecessary biopsies and 3) using thyroid cancer risk to personalize patient care.
While the details included in thyroid and neck ultrasound reports are variable, SRSS provide a means to standardize reporting and communication among clinicians. For example, a retrospective study of ultrasound reports of 478 thyroid nodules, found that most reports did not include the size of the dominant nodule (71%), comment on suspicious features (91%) or the presence of suspicious lymph nodes (83%), with 46% of the reports lacking a description of malignancy risk.27 Multiple studies have shown a positive effect of implementation of thyroid nodule risk stratification in the quality of the ultrasound report, with increased inclusion of ultrasound features of interest, thyroid cancer risk and suggestions of management strategies to address this risk.10,28,29
Moreover, better reporting of thyroid ultrasound features and cancer risk translates to improved clinical care. For example, observational data suggest that strict implementation of recommendations for thyroid biopsy according to the different SRSS can result in decreased rates of unnecessary biopsies.5,7,23,30 However, most commonly, these studies consider nodules with benign cytology as unnecessary. This is a rather narrow definition, given that the necessity and appropriateness of a thyroid biopsy depends on additional factors beyond thyroid nodule risk and size. 5,6,31
Comparisons of current thyroid nodule risk stratification systems
Castellana et al performed a systematic review and meta-analysis to compare five thyroid nodule SRSS and found that American Thyroid Association (ATA) guidelines had the highest pooled sensitivity and ACR-TIRADS the highest pooled specificity.25 Moreover, when performing a direct comparison analysis, the diagnostic odds ratio for selecting patients for thyroid biopsy was higher for ACR-TIRADS than for ATA (5.6 vs 2.9 p=0.002) or Korean Society of Thyroid Radiology’s Thyroid Imaging Reporting and Data System (K-TIRADS) (4.5 vs 2.5, p=0.002). The review included 12 studies and 18,750 thyroid nodules.
Kim et al performed a systematic review of 8 articles including 13,092 nodules and compared unnecessary biopsy rates among four thyroid nodule SRSS.7 ACR-TIRADS had the lowest unnecessary biopsy rates (25%). However, low unnecessary biopsy rates may exist at the expense of higher missed malignancy rates.5,32 ACR-TIRADS is associated with the higher proportion of malignant nodules that do not receive a recommendation for biopsy; however, for nodules that do not meet biopsy criteria but carry a relevant risk of malignancy, active surveillance instead of biopsy is recommended, potentially decreasing the risk of a significant adverse outcome.33 Overall, SRSS improve the prediction of benign or malignant disease, with high risk categories having a strong association to malignant or suspicious histology.20,23,25,34
Size threshold
Current guidelines offer variable size cutoffs according to risk stratification category, with a lower threshold for biopsy as the risk for thyroid cancer increases, although limited clinical evidence guides the selection of these thresholds.1–3,21 In fact, the selection of size thresholds for thyroid nodule fine needle aspiration (FNA) has remained controversial given that 1) the association of thyroid nodule size and malignancy risk is not linear (but could result in higher risk for false negative cytology) and 2) size only partly correlates with thyroid cancer prognosis.5,35–38 Machens et al reported increased cumulative risk for distant metastases from papillary thyroid cancer and follicular thyroid cancer above 2 cm.39 Based on this, ACR-TIRADS selects a slightly higher threshold for biopsy in the lower risk categories of 2.5 cm, after considering the reported discrepancy of pathologic tumor size and estimated sonographic size.2
Ha et al evaluated the effect of size thresholds using the current biopsy indications for ATA, K-TIRADS, and ACR-TIRADS, in 3323 consecutive thyroid nodules. Overall, the sensitivity for detecting malignancy was higher for ATA and K-TIRADS compared to ACR-TIRADS at 89.6% and 94.5%, versus 74.7%, respectively. ACR-TIRADS, on the other hand, was associated with higher specificity (67.3%) compared to ATA (33.2%) and K-TIRADS (26.4%). This improved specificity with ACR-TIRADS translated to the lowest rate of unnecessary FNA at 25.3%, in comparison with ATA (51.7%) and K-TIRADS (56.9%). However, if a higher threshold for thyroid biopsy of 1.5 cm for intermediate risk thyroid nodules (instead of the existing 1 cm threshold) and of 2.5 cm for low suspicion thyroid nodules (as opposed to the existing 1.5 cm threshold) were used, the specificity and rates of unnecessary biopsies for this ‘revised’ ATA system would be similar to those for ACR-TIRADS, highlighting the effects of size thresholds on the performance of these SRSS.5,40,41
Limitations of thyroid nodule risk stratification
The clinical evidence supporting the prevalence of malignancy associated with the different thyroid cancer risk categories is mostly derived from observational studies that are enriched with malignant cases, which can lead to over-estimation of the diagnostic performance.5,42 In addition, SRSS have been validated mostly among thyroid nodules potentially harboring PTC, however the diagnostic performance for patients with medullary or follicular thyroid carcinoma might differ.25,43 Another limitation when using risk stratification in practice is the inability to classify all nodules (when using a pattern-based system), with an unknown percentage of these nodules harboring malignancy.34,43,44
Structured evaluation and report of thyroid ultrasounds requires expertise and availability of clinical time, both potential barriers to implementation.10 Similarly, reproducibility of ultrasound assessments is pivotal, yet most of the clinical evidence reporting moderate interobserver agreement is derived from small group of expert thyroid examiners. 26,45–48
Finally, another potential unintended consequence of the implementation of thyroid nodule risk stratification is the over-reliance in the radiology report and recommendations (of observation and/or biopsy), that may not take into consideration the clinical situation of the patient.6
Clinical factors and risk stratification
Because thyroid nodules are such a common clinical entity, individual risk stratification based on patient characteristics is an important component of the evaluation. Various clinical and demographic factors should be assessed to further refine the malignancy risk and aid in the next steps of clinical management.
Age at diagnosis of a nodule influences the risk that any nodule is cancerous; younger age is associated with an increased malignancy risk.49,50 In a single center study of 6391 adult patients with thyroid nodules, the risk of malignancy was highest (22.9%) in the group aged 20–30 years, and declined by 2.2% per year until the risk reached a plateau of 12.6% at age 60. It is important to note, however, that in spite of the overall lower likelihood of malignancy in the older patients, there was a greater risk of a more aggressive histologic phenotype when a cancer was identified.49
Sex also plays an important role in risk stratification of patients with thyroid nodules. Although not consistently reported,51,52 a greater risk of malignancy in those with a thyroid nodule has been suggested in men compared to women.53 In the largest reported experience with thyroid nodules, a single-study center of 20,001 consecutive thyroid nodules determined that the odds ratio for malignancy in men was 1.7 (1.5–1.9, CI, p<0.0001), compared to women.54
While the etiology of most thyroid malignancies is unclear, a small proportion of differentiated cancers may be attributable to an inherited condition. Because thyroid cancer may be seen in up to 10–15% of the population at autopsy,55,56 the occurrence of two cases of thyroid cancer among first degree relatives in most families is attributable to chance, rather than a genetic predisposition.57 However, the identification of nonmedullary thyroid cancer in three or more first-degree relatives is associated with a 96% chance of an inherited condition.57
Exposure to radiation in childhood is a well-recognized cause of differentiated thyroid carcinoma and the impact may be seen for decades after therapy.58–60 Additionally, those patients exposed during childhood or adolescence to ionizing radiation from a nuclear fallout are at increased risk of malignancy.61 Consequently, clinicians should consider these risk factors in the decision-making process of whether to perform FNA.1
Another important consideration in the evaluation of a patient with a thyroid nodule is the presence of comorbid conditions. It is well recognized that thyroid cancers are often associated with an indolent clinical course. If the patient has a low-risk thyroid nodule in the setting of another, more aggressive primary malignancy or uncontrolled chronic condition, consideration may be given to postponing the evaluation of an identified thyroid nodule. A recent study examined the likelihood of identification of a clinically significant thyroid cancer (defined as anaplastic, medullary, or poorly differentiated carcinoma, or the presence of distant metastases) in a cohort of 1129 patients over the age of 70 years. They found only 17 (1.5%) significant risk thyroid cancers (SRTC) among 2527 nodules, all of which were readily identifiable by initial imaging and cytology. During a median follow up of four years, there were only 10 (0.9%) thyroid cancer deaths, all of which occurred in patients with SRTC. Interestingly, they found that among all other patients, there were 160 deaths (14.4%), and there was a more than two-fold increased risk of death among those with a separate non-thyroidal malignancy or coronary artery disease at the time of nodule evaluation. The authors concluded that the evaluation of thyroid nodules in patients over the age of 70 years without high risk imaging or cytologic findings should be tempered in the setting of significant comorbid illness.62
Artificial intelligence and thyroid nodule risk stratification
Background and definition of artificial intelligence
AI refers to the imitation of human abilities such as learning and problem solving by computer systems.63–65 Although the idea of AI is not new, the application to enhance clinical care has expanded recently given the availability of computers that are fast and able to analyze large datasets.65,66 Within AI, machine learning is the use of mathematical operations to solve problems and for systems to learn from the evaluation of data over time.65,67 Machine learning algorithms can be classified according to how the data is provided and whether the outcome of interest is labeled during the development of the model.65,67,68 Specifically, in thyroid nodule risk stratification, the use of images in which the ultrasound features of interest have been manually extracted by a human operator (e.g. echogenicity, composition) is considered conventional machine learning. On the other hand, if thyroid nodule images are analyzed as raw data (pixels) by the computer model, this is considered deep learning. Similarly, if the nodule is labeled as thyroid cancer or a benign nodule during the model development, this is considered supervised machine learning where the goal is for the system to learn to classify/predict the desired outcome (e.g. malignancy). Alternatively, if the thyroid nodule is not labeled for the outcome of interest (thyroid cancer or benign), this is considered unsupervised learning and the goal is the identification of new clusters/groups (Figure 1).67–69
Figure 1.

Structure of conventional machine learning and deep learning (according to human or computer model extraction of the images or pixels) for thyroid nodule risk stratification. Supervised machine learning as during development classification/prediction of thyroid cancer/benign is provided to the model.
A particular modality of interest for imaging-based diagnosis is computer-aided diagnosis (CAD). These methods are based on machine learning models for the evaluation of images and can be used to complement and enhance the human operator assessment.70 Augmented intelligence refers to AI models that seek to support human cognition, instead of replacing it. In addition, models of “human in the loop” refer to models in which the outputs from machine learning algorithms require the review from human operators.64
Overview of how AI can improve ultrasound risk stratification
Thyroid nodule risk stratification allows clinicians to estimate the risk for thyroid cancer and personalize the care of patients with thyroid nodules.6,31 However, concerns in terms of thyroid nodule ultrasound feature assessment reproducibility, time requirement for these assessments and overall precision of the current frameworks can limit the clinical benefits.10,33,71 The use of AI can help resolve these concerns by: 1) improving our frameworks for thyroid nodule risk stratification and 2) increasing the automation of thyroid nodule risk stratification.11,69,72 In addition, these systems can be used in areas in which experts in thyroid ultrasonography are not available, possibly improving the quality of care across different healthcare systems.72
1). Improving frameworks for thyroid nodule risk stratification
The use of machine learning algorithms offers an opportunity to re-evaluate the weight of the ultrasound features that we currently use for thyroid cancer risk stratification and to increase the number of variables that are evaluated.11
For example, Zhang et al aimed to develop a machine learning model for the diagnosis of thyroid nodules, based on the evaluation of 11 ultrasound features that were manually extracted and real-time elastography.73 Their model was developed using a sample of 2064 nodules with histological diagnosis of thyroid cancer in 36%. The diagnostic properties were compared to a radiologist. The machine learning model was able to integrate information from the provided variables, with higher weight given to the presence of calcification, hypoechoic halo and elastography.73 In fact, the random forest algorithm had a higher area under the curve than the radiologist (AUC = 0.92 [95% confidence interval (CI) 0.90–0.95] vs. 0.83 [95% CI 0.82–0.85]).73 Interestingly, size was included as a potential variable and was given lower importance by the machine-learning model, highlighting the need to understand potential biases in the datasets used to train machine-learning models and the importance of collaboration between expert clinicians and machine-learning data scientists. The evidence that links thyroid nodule size and malignancy risk is somewhat controversial and instead, this variable may provide more prognostic information.5,35,36,39,74
Similarly, the study by Wildman-Tobriner et al, aimed to optimize the diagnostic performance of ACR-TIRADS using AI.75 A dataset of 1425 biopsy proven thyroid nodules (malignancy rate of 11%) was evaluated for the five ACR-TIRADS components and a genetic AI algorithm used to optimize the classification.75 The optimized AI ACR-TIRADS, assigned new weights to 8 features and for 6 the new system assigned 0 points. The optimized AI ACR- TIRADS had similar receiver operator curves and sensitivity, but higher specificity than the radiologists using the original ACR-TIRADS.75
Multiple additional studies have explored the use of AI to improve our understanding of what ultrasound features are associated with thyroid cancer risk, what weight should be assigned to each of the features and taking advantage of the ability of the algorithms to integrate variables in order to provide a more accurate and precise estimate of thyroid cancer.11,12 Future studies, should include large datasets of patients with thyroid nodules, with different risk for thyroid cancer and representative of the populations of interest, in order to take full advantage of AI methods and enhancing thyroid cancer risk estimation.
2). Increasing automation of thyroid nodule risk stratification
Another barrier to routine implementation of thyroid nodule risk stratification is the need for accurate and reproducible assessment of thyroid nodule features.33,71 These assessments have shown that highly trained evaluators and time are important considerations that can affect reproducibility, given that ultrasound assessment is an operator-dependent task.10,48 AI algorithms could help solve this clinical gap by supporting highly reproducible, fast and automated thyroid nodule risk stratification.
Xu et al, conducted a systematic review that included 19 studies (4781 nodules) assessing the diagnostic performance of machine learning algorithms for the diagnosis of thyroid nodules.76 Only the results from the external validation cohorts were included in the analysis. There were 6 studies that evaluated conventional machine learning and 13 that evaluated deep learning models. Overall, the studies were considered to be at moderate risk of bias and the prevalence of thyroid cancer cases was high.76 The conventional machine learning models had a sensitivity of 0.86 (95% CI 0.79-.92), specificity 0.85 (95% CI 0.77–0.91), and diagnostic odds ratio (DOR) 37.4 (95% CI 24.9–56.2). Models based on deep learning had a sensitivity of 0.89 (95% CI 0.81–0.93), specificity 0.84 (95% CI 0.75–0.90), DOR 40.9 (95% CI 18.1–92.1). The performance of deep learning based algorithms was comparable to the diagnostic performance of radiologists [DOR 40.1 (95%CI 15.6–103.3) vs DOR 44.9 (95%CI 30.7–65.6), respectively].76 These findings are consistent with the systematic review by Chambara et al, that evaluated 14 studies of CAD models for thyroid nodules, demonstrating comparable diagnostic performance of the AI models to human evaluators.70 These reviews suggest good diagnostic performance of machine learning algorithms for the identification of thyroid cancer.76
The study by Li et al, was a large multicenter study conducted in China including >300,000 images from 17 627 patients with thyroid cancer and 25 325 patients with benign thyroid nodules.69 The area under the curve in each external validation site was 0.91 (95% CI .87–0·96) and 0.91 (95% CI 0.89–0.93). Moreover, the sensitivity of the machine learning model was 84.3% vs 92.9% (p=0.048) in one site and 84.7% vs 89% (p=0.25), while the specificity was 86.9% vs 56.1% (p<0·0001) and 87.8% vs 68.6% in the second site (p<0·0001), when compared to radiologists.69
Thomas et al, developed a similarity algorithm for thyroid nodule risk stratification and used heat maps to attempt to explain the algorithm output. Heatmaps are visual representation that can help identify the areas (in an image in this case) that were used to support the output of the AI model.77 Their development cohort included 2025 images from 482 patients and the overall malignancy prevalence was 32%.77 In this model, the physician is always “in the loop” as they select the images of interest and are presented with the final output that they can accept/reject. The similarity model was used to classify test images and demonstrated an accuracy of 81.5%.77 Interestingly, the heatmap was not able to completely explain the rationale for the model outputs.77
This body of evidence suggests that machine learning algorithms can enhance thyroid nodule risk stratification by analyzing thyroid nodule images without the need for expert extraction of ultrasound features.
Lessons learned and next steps for the use of AI for thyroid nodule risk stratification
The ultimate goal of applying AI methods to improve thyroid nodule risk stratification is to improve the care of patients with thyroid nodules (Figure 2). The preliminary evidence presented suggests reasonable diagnostic performance of machine learning algorithms. However, a few concerns arise. Similar to the existent thyroid nodule diagnostic literature, the datasets that are used to train these models suffer from bias related to: high prevalence of thyroid cancer (improves diagnostic performance), over-representation of papillary thyroid cancer, variable gold standards and inappropriate exclusions.5,11,42 These biases cannot be completely overcome by large datasets or by advanced machine learning algorithms and limit the validity and generalizability of findings.78
Figure 2.

Potential applications of AI to improve thyroid nodule risk stratifications, areas that need to be addressed in future studies in order to achieve the goal of improved clinical outcomes.
A large number of studies have focused on deep learning algorithms, but our understanding of the features and/or rationale used by the algorithm to reach conclusions continues to be poorly understood for most models.12 This “black box” and limited interpretability significantly hinder the reliability, trust and adoption of these methods in practice.79
Similarly, as is the case for other areas of AI for diagnosis based on imaging, there is paucity of randomized, controlled trials focused not only on diagnostic performance but that also assess effects on clinical care and outcomes (i.e. increased reproducibility, accuracy/discussion of thyroid cancer risk, unnecessary biopsies). Although most models have focused on providing a dichotomous answer of benign/malignant classification for thyroid nodules, models that provide an estimate of thyroid cancer risk can also assist and support personalization of care for patients with thyroid nodules.79 Moreover, the possible unintended consequences of implementation of these models in clinical practice have rarely been assessed (i.e. Clinician/patient distrust, legal implications).64,68
Summary
Ultrasound risk stratification is an essential tool in the management of patients with thyroid nodules. The application of AI to thyroid nodule risk stratification can help improve the diagnostic accuracy of current SRSS and the reproducibility of assessments. The results of preliminary studies using AI to enhance thyroid nodule risk stratification are promising, supporting the need for future research that moves from diagnostic performance to effects on clinical outcomes.
Key points.
Sonographic risk stratification of thyroid nodules is the first step to personalize the care of patients with thyroid nodules
Multiple systems are available for sonographic risk stratification of thyroid nodules, each associated with different advantages and limitations
The use of artificial intelligence to improve thyroid cancer risk estimation and assessment reproducibility is promising
Future studies using artificial intelligence should focus on improving patient outcomes and use rigorous scientific methods
Synopsis.
Clinical evidence supports the association of ultrasound features with benign or malignant thyroid nodules and serves as the basis for sonographic stratification of thyroid nodules, according to an estimated thyroid cancer risk. Contemporary guidelines recommend management strategies according to thyroid cancer risk, thyroid nodule size and the clinical scenario. Yet, reproducible and accurate thyroid nodule risk stratification requires expertise, time and understanding of the weight different ultrasound features have on thyroid cancer risk. The application of artificial intelligence to overcome these limitations is promising and has the potential to improve the care of patients with thyroid nodules.
Clinics Care Points –
The use of risk stratification systems in the evaluation of thyroid nodules can lead to a decrease of unnecessary thyroid biopsies without missed diagnosis of clinically relevant thyroid cancer
Reproducible and accurate risk stratification of thyroid nodules can be time-consuming and requires expertise, which represent potential limitations for widespread implementation
Deep learning models for thyroid nodule risk stratification have shown good diagnostic properties for thyroid cancer, yet their impact on patient care requires further evaluation
Footnotes
Disclosure statement: The authors have nothing to disclose
References
- 1.Haugen BR, Alexander EK, Bible KC, et al. 2015 American Thyroid Association Management Guidelines for Adult Patients with Thyroid Nodules and Differentiated Thyroid Cancer: The American Thyroid Association Guidelines Task Force on Thyroid Nodules and Differentiated Thyroid Cancer. Thyroid. 2016. doi: 10.1089/thy.2015.0020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Tessler FN, Middleton WD, Grant EG, et al. ACR Thyroid Imaging, Reporting and Data System (TI-RADS): White Paper of the ACR TI-RADS Committee. J Am Coll Radiol. 2017;14(5):587–595. doi: 10.1016/j.jacr.2017.01.046 [DOI] [PubMed] [Google Scholar]
- 3.Pearce SHS, Brabant G, Duntas LH, et al. 2013 ETA Guideline: Management of Subclinical Hypothyroidism. Eur Thyroid J. 2014;2:215–228. doi: 10.1159/000356507 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Tessler FN, Middleton WD, Hoang JK. Commentary on a direct comparison of the ATA and TI-RADS ultrasound scoring systems. Endocr Pract. 2019;25(5):503–505. doi: 10.4158/EP-2019-006 [DOI] [PubMed] [Google Scholar]
- 5.Singh Ospina N, Iñiguez-Ariza NM, Castro MR. Thyroid nodules: Diagnostic evaluation based on thyroid cancer risk assessment. BMJ. 2020;368. doi: 10.1136/bmj.l6670 [DOI] [PubMed] [Google Scholar]
- 6.Ospina NS, Genere N, Hoang JK, Brito JP. ACR TI-RADSRecommendations: A Call to Contextualize Radiologists’ Recommendations for Thyroid Nodules With the Clinical Scenario. J Am Coll Radiol. 2021. doi: 10.1016/j.jacr.2021.04.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Kim PH, Suh CH, Baek JH, Chung SR, Choi YJ, Lee JH. Unnecessary thyroid nodule biopsy rates under four ultrasound risk stratification systems: a systematic review and meta-analysis. Eur Radiol. 2021. doi: 10.1007/s00330-020-07384-6 [DOI] [PubMed] [Google Scholar]
- 8.Zhang Q, Ma J, Sun W, Zhang L. Comparison of diagnostic performance between the American college of radiology thyroid imaging reporting and data system and American thyroid association guidelines: A systematic review. Endocr Pract. 2020;26(5):552–563. doi: 10.4158/EP-2019-0237 [DOI] [PubMed] [Google Scholar]
- 9.Li W, Wang Y, Wen J, Zhang L, Sun Y. Diagnostic performance of American college of radiology TI-RADS: A systematic review and meta-analysis. Am J Roentgenol. 2021;216(1):38–47. doi: 10.2214/AJR.19.22691 [DOI] [PubMed] [Google Scholar]
- 10.Hamour AF, Yang W, Lee JJW, et al. Association of the Implementation of a Standardized Thyroid Ultrasonography Reporting Program with Documentation of Nodule Characteristics. JAMA Otolaryngol - Head Neck Surg. 2021;147(4):343–349. doi: 10.1001/jamaoto.2020.5233 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Li LR, Du B, Liu HQ, Chen C. Artificial Intelligence for Personalized Medicine in Thyroid Cancer: Current Status and Future Perspectives. Front Oncol. 2021;10. doi: 10.3389/fonc.2020.604051 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Thomas J, Ledger GA, Mamillapalli CK. Use of artificial intelligence and machine learning for estimating malignancy risk of thyroid nodules. Curr Opin Endocrinol Diabetes Obes. 2020;27(5):345–350. doi: 10.1097/MED.0000000000000557 [DOI] [PubMed] [Google Scholar]
- 13.Brito JP, Gionfriddo MR, Al Nofal A, et al. The accuracy of thyroid nodule ultrasound to predict thyroid cancer: Systematic review and meta-analysis. J Clin Endocrinol Metab. 2014;99(4):1253–1263. doi: 10.1210/jc.2013-2928 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Wienke JR, Chong WK, Fielding JR, Zou KH, Mittelstaedt CA. Sonographic Features of Benign Thyroid Nodules. J Ultrasound Med. 2003;22(10):1027–1031. doi: 10.7863/jum.2003.22.10.1027 [DOI] [PubMed] [Google Scholar]
- 15.Moon W-J, Lee YH, Kim J, Kim HS. Benign and Malignant Thyroid Purpose : Methods : Results : Conclusion : Radiology. 2008;247(3):762–770. [DOI] [PubMed] [Google Scholar]
- 16.Hoang JK, Wai KL, Lee M, Johnson D, Farrell S. US features of thyroid malignancy: Pearls and pitfalls. Radiographics. 2007;27(3):847–860. doi: 10.1148/rg.273065038 [DOI] [PubMed] [Google Scholar]
- 17.Kim DW, Lee EJ, In HS, Kim SJ. Sonographic differentiation of partially cystic thyroid nodules: A prospective study. Am J Neuroradiol. 2010;31(10):1961–1966. doi: 10.3174/ajnr.A2204 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Kwak JY, Han KH, Yoon JH, et al. Thyroid imaging reporting and data system for us features of nodules: A step in establishing better stratification of cancer risk. Radiology. 2011;260(3):892–899. doi: 10.1148/radiol.11110206 [DOI] [PubMed] [Google Scholar]
- 19.Mandel SJ, Langer JE. Ultrasound of Thyroid Nodules. Thyroid Parathyr Ultrasound Ultrasound-Guided FNA. 2018:189–223. doi: 10.1007/978-3-319-67238-0_7 [DOI] [Google Scholar]
- 20.Trimboli P, Durante C. Ultrasound risk stratification systems for thyroid nodule: between lights and shadows, we are moving towards a new era. Endocrine. 2020;69(1):20–23. doi: 10.1007/s12020-020-02196-6 [DOI] [PubMed] [Google Scholar]
- 21.Gharib H, Papini E, Garber JR, et al. American association of Clinical Endocrinologists, American college of endocrinology, and Associazione Medici Endocrinologi medical guidelines for clinical practice for the diagnosis and management of thyroid nodules - 2016 update. Endocr Pract. 2016;22(May):1–60. doi: 10.4158/EP161208.GL [DOI] [PubMed] [Google Scholar]
- 22.Yi KH. The revised 2016 Korean thyroid association guidelines for thyroid nodules and cancers: Differences from the 2015 american thyroid association guidelines. Endocrinol Metab. 2016;31(3):373–378. doi: 10.3803/EnM.2016.31.3.373 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Grani G, Lamartina L, Ascoli V, et al. Reducing the number of unnecessary thyroid biopsies while improving diagnostic accuracy: Toward the “Right” TIRADS. J Clin Endocrinol Metab. 2019;104(1):95–102. doi: 10.1210/jc.2018-01674 [DOI] [PubMed] [Google Scholar]
- 24.Lauria Pantano A, Maddaloni E, Briganti SI, et al. Differences between ATA, AACE/ACE/AME and ACR TI-RADS ultrasound classifications performance in identifying cytological high-risk thyroid nodules. Eur J Endocrinol. 2018;178(6):595–603. doi: 10.1530/EJE-18-0083 [DOI] [PubMed] [Google Scholar]
- 25.Castellana M, Castellana C, Treglia G, et al. Performance of five ultrasound risk stratification systems in selecting thyroid nodules for FNA. J Clin Endocrinol Metab. 2020;105(5):1–32. doi: 10.1210/clinem/dgz170 [DOI] [PubMed] [Google Scholar]
- 26.Grani G, Lamartina L, Cantisani V, Maranghi M, Lucia P, Durante C. Interobserver agreement of various thyroid imaging reporting and data systems. Endocr Connect. 2018;7(1):1–7. doi: 10.1530/EC-17-0336 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Karkada M, Costa AF, Imran SA, et al. Incomplete thyroid ultrasound reports for patients with thyroid nodules: Implications regarding risk assessment and management. Am J Roentgenol. 2018;211(6):1348–1353. doi: 10.2214/AJR.18.20056 [DOI] [PubMed] [Google Scholar]
- 28.Griffin AS, Mitsky J, Rawal U, Bronner AJ, Tessler FN, Hoang JK. Improved Quality of Thyroid Ultrasound Reports After Implementation of the ACR Thyroid Imaging Reporting and Data System Nodule Lexicon and Risk Stratification System. J Am Coll Radiol. 2018;15(5):743–748. doi: 10.1016/j.jacr.2018.01.024 [DOI] [PubMed] [Google Scholar]
- 29.Ghazizadeh S, Kelly TL, Khajanchee YS, et al. Standardization of thyroid ultrasound reporting in the community setting decreases biopsy rates. Clin Endocrinol (Oxf). 2021;94(6):1035–1042. doi: 10.1111/cen.14431 [DOI] [PubMed] [Google Scholar]
- 30.Persichetti A, Di Stasio E, Guglielmi R, et al. Predictive Value of Malignancy of Thyroid Nodule Ultrasound Classification Systems: A Prospective Study. J Clin Endocrinol Metab. 2018;103(4):1359–1368. doi: 10.1210/jc.2017-01708 [DOI] [PubMed] [Google Scholar]
- 31.Singh Ospina N, Papaleontiou M. Thyroid Nodule Evaluation and Management in Older Adults: A Review of Practical Considerations for Clinical Endocrinologists. Endocr Pract. 2021;27(3):261–268. doi: 10.1016/j.eprac.2021.02.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Cawood TJ, Mackay GR, Hunt PJ, OShea D, Skehan S, Ma Y. TIRADS management guidelines in the investigation of thyroid nodules; illustrating the concerns, costs, and performance. J Endocr Soc. 2020;4(4):1–13. doi: 10.1210/JENDSO/BVAA031 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Hoang JK, Middleton WD, Tessler FN. Update on ACR TI-RADS: Successes, Challenges, and Future Directions, From the AJR Special Series on Radiology Reporting and Data Systems. Am J Roentgenol. 2021:1–9. doi: 10.2214/ajr.20.24608 [DOI] [PubMed] [Google Scholar]
- 34.Middleton WD, Teefey SA, Reading CC, et al. Comparison of performance characteristics of American college of radiology TI-RADS, Korean society of thyroid radiology TIRADS, and American thyroid association guidelines. Am J Roentgenol. 2018;210(5):1148–1154. doi: 10.2214/AJR.17.18822 [DOI] [PubMed] [Google Scholar]
- 35.Han K, Kim EK, Kwak JY. 1.5–2 cm tumor size was not associated with distant metastasis and mortality in small thyroid cancer: A population-based study. Sci Rep. 2017;7. doi: 10.1038/srep46298 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Shin JJ, Caragacianu D, Randolph GW. Impact of thyroid nodule size on prevalence and post-test probability of malignancy: A systematic review. Laryngoscope. 2014;125(1):263–272. doi: 10.1002/lary.24784 [DOI] [PubMed] [Google Scholar]
- 37.Park VY, Lee HS, Kim EK, Kwak JY, Yoon JH, Moon HJ. Frequencies and malignancy rates of 6-tiered Bethesda categories of thyroid nodules according to ultrasound assessment and nodule size. Head Neck. 2018;40(9):1947–1954. doi: 10.1002/hed.25179 [DOI] [PubMed] [Google Scholar]
- 38.Hong MJ, Na DG, Baek JH, Sung JY, Kim JH. Impact of nodule size on malignancy risk differs according to the ultrasonography pattern of thyroid nodules. Korean J Radiol. 2018;19(3):534–541. doi: 10.3348/kjr.2018.19.3.534 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Machens A, Holzhausen HJ, Dralle H. The prognostic value of primary tumor size in papillary and follicular thyroid carcinoma: A comparative analysis. Cancer. 2005;103(11):2269–2273. doi: 10.1002/cncr.21055 [DOI] [PubMed] [Google Scholar]
- 40.Ha SM, Baek JH, Na DG, et al. Diagnostic performance of practice guidelines for thyroid nodules: Thyroid nodule size versus biopsy rates. Radiology. 2019;291(1):92–99. doi: 10.1148/radiol.2019181723 [DOI] [PubMed] [Google Scholar]
- 41.Yim Y, Baek JH, Chung SR, Choi YJ, Lee JH. Recurrence and additional treatment of cystic thyroid nodules after ethanol ablation: validation of three proposed criteria. Ultrasonography. 2021;40(3):378–386. doi: 10.14366/usg.20039 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Spencer-Bonilla G, Singh Ospina N, Rodriguez-Gutierrez R, et al. Systematic reviews of diagnostic tests in endocrinology: an audit of methods, reporting, and performance. Endocrine. 2017;57(1):18–34. doi: 10.1007/s12020-017-1298-1 [DOI] [PubMed] [Google Scholar]
- 43.Trimboli P, Castellana M, Piccardo A, et al. The ultrasound risk stratification systems for thyroid nodule have been evaluated against papillary carcinoma. A meta-analysis. Rev Endocr Metab Disord. 2021;22(2):453–460. doi: 10.1007/s11154-020-09592-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Castellana M, Piccardo A, Virili C, et al. Can ultrasound systems for risk stratification of thyroid nodules identify follicular carcinoma? Cancer Cytopathol. 2020;128(4):250–259. doi: 10.1002/cncy.22235 [DOI] [PubMed] [Google Scholar]
- 45.Itani M, Assaker R, Moshiri M, Dubinsky TJ, Dighe MK. Inter-observer Variability in the American College of Radiology Thyroid Imaging Reporting and Data System: In-Depth Analysis and Areas for Improvement. Ultrasound Med Biol. 2019;45(2):461–470. doi: 10.1016/j.ultrasmedbio.2018.09.026 [DOI] [PubMed] [Google Scholar]
- 46.Basha MAA, Alnaggar AA, Refaat R, et al. The validity and reproducibility of the thyroid imaging reporting and data system (TI-RADS) in categorization of thyroid nodules: Multicentre prospective study. Eur J Radiol. 2019;117(February):184–192. doi: 10.1016/j.ejrad.2019.06.015 [DOI] [PubMed] [Google Scholar]
- 47.Persichetti A, Di Stasio E, Coccaro C, et al. Inter- and Intraobserver Agreement in the Assessment of Thyroid Nodule Ultrasound Features and Classification Systems: A Blinded Multicenter Study. Thyroid. 2020;30(2):237–242. doi: 10.1089/thy.2019.0360 [DOI] [PubMed] [Google Scholar]
- 48.Liu H, Ma AL, Zhou YS, et al. Variability in the interpretation of grey-scale ultrasound features in assessing thyroid nodules: A systematic review and meta-analysis. Eur J Radiol. 2020;129(July 2019):109050. doi: 10.1016/j.ejrad.2020.109050 [DOI] [PubMed] [Google Scholar]
- 49.Kwong N, Medici M, Angell TE, et al. The influence of patient age on thyroid nodule formation, multinodularity, and thyroid cancer risk. J Clin Endocrinol Metab. 2015;100(12):4434–4440. doi: 10.1210/jc.2015-3100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Cherella CE, Feldman HA, Hollowell M, et al. Natural History and Outcomes of Cytologically Benign Thyroid Nodules in Children. J Clin Endocrinol Metab. 2018;103(9):3557–3565. doi: 10.1210/jc.2018-00895 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.He LZ, Zeng TS, Pu L, Pan SX, Xia WF, Chen LL. Thyroid Hormones, Autoantibodies, Ultrasonography, and Clinical Parameters for Predicting Thyroid Cancer. Int J Endocrinol. 2016;2016. doi: 10.1155/2016/8215834 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Al Dawish MA, Robert AA, Thabet MA, Braham R. Thyroid nodule management: Thyroid-stimulating hormone, ultrasound, and cytological classification system for predicting malignancy. Cancer Inform. 2018;17. doi: 10.1177/1176935118765132 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Frates MC, Benson CB, Doubilet PM, et al. Prevalence and distribution of carcinoma in patients with solitary and multiple thyroid nodules on sonography. J Clin Endocrinol Metab. 2006;91(9):3411–3417. doi: 10.1210/jc.2006-0690 [DOI] [PubMed] [Google Scholar]
- 54.Angell TE, Maurer R, Wang Z, et al. A Cohort Analysis of Clinical and Ultrasound Variables Predicting Cancer Risk in 20,001 Consecutive Thyroid Nodules. J Clin Endocrinol Metab. 2019;104(11):5665–5672. doi: 10.1210/jc.2019-00664 [DOI] [PubMed] [Google Scholar]
- 55.Pazaitou-Panayiotou K, Capezzone M, Pacini F. Clinical features and therapeutic implication of papillary thyroid microcarcinoma. Thyroid. 2007;17(11):1085–1092. doi: 10.1089/thy.2007.0005 [DOI] [PubMed] [Google Scholar]
- 56.Furuya-Kanamori L, Bell KJL, Clark J, Glasziou P, Doi SAR. Prevalence of differentiated thyroid cancer in autopsy studies over six decades: A meta-analysis. J Clin Oncol. 2016;34(30):3672–3679. doi: 10.1200/JCO.2016.67.7419 [DOI] [PubMed] [Google Scholar]
- 57.Charkes ND. On the prevalence of familial nonmedullary thyroid cancer in multiply affected kindreds. Thyroid. 2006;16(2):181–186. doi: 10.1089/thy.2006.16.181 [DOI] [PubMed] [Google Scholar]
- 58.Schneider AB. Dose-response relationships for radiation-induced thyroid cancer and thyroid nodules: evidence for the prolonged effects of radiation on the thyroid. J Clin Endocrinol Metab. 1993;77(2):362–369. doi: 10.1210/jc.77.2.362 [DOI] [PubMed] [Google Scholar]
- 59.Somerville HM, Steinbeck KS, Stevens G, Delbridge LW, Lam AH, Stevens MM. Thyroid neoplasia following irradiation in adolescent and young adult survivors of childhood cancer. Med J Aust. 2002;176(12):584–587. doi: 10.5694/j.1326-5377.2002.tb04589.x [DOI] [PubMed] [Google Scholar]
- 60.White MG, Cipriani NA, Abdulrasool L, et al. Radiation-Induced Differentiated Thyroid Cancer Is Associated with Improved Overall Survival but Not Thyroid Cancer-Specific Mortality or Disease-Free Survival. Thyroid. 2016;26(8):1053–1060. doi: 10.1089/thy.2015.0634 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Pacini F, Vorontsova T, Demidchik EP, et al. Post-chernobyl thyroid carcinoma in Belarus children and adolescents: Comparison with naturally occurring thyroid carcinoma in Italy and France. J Clin Endocrinol Metab. 1997;82(11):3563–3569. doi: 10.1210/jc.82.11.3563 [DOI] [PubMed] [Google Scholar]
- 62.Wang Z, Vyas CM, Van Benschoten O, et al. Quantitative Analysis of the Benefits and Risk of Thyroid Nodule Evaluation in Patients ≥70 Years Old. Thyroid. 2018;28(4):465–471. doi: 10.1089/thy.2017.0655 [DOI] [PubMed] [Google Scholar]
- 63.Stead WW. Clinical implications and challenges of artificial intelligence and deep learning. JAMA - J Am Med Assoc. 2018;320(11):1107–1108. doi: 10.1001/jama.2018.11029 [DOI] [PubMed] [Google Scholar]
- 64.Matheny M, Israni ST, Ahmed M, Whicher D. AI in Health Care-The Hope, the Hype, the Promise, the Peril. Natl Acad Med. 2019. [PubMed] [Google Scholar]
- 65.Gubbi S, Hamet P, Tremblay J, Koch CA, Hannah-Shmouni F. Artificial intelligence and machine learning in endocrinology and metabolism: The dawn of a new era. Front Endocrinol (Lausanne). 2019;10(MAR). http://www.embase.com/search/results?subaction=viewrecord&from=export&id=L627959380%0A 10.3389/fendo.2019.00185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Hinton G Deep learning-a technology with the potential to transform health care. JAMA - J Am Med Assoc. 2018;320(11):1101–1102. doi: 10.1001/jama.2018.11100 [DOI] [PubMed] [Google Scholar]
- 67.Liu Y, Chen P-HC, Krause J, Peng L. How to Read Articles That Use Machine Learning. Jama. 2019;322(18):1806. doi: 10.1001/jama.2019.16489 [DOI] [PubMed] [Google Scholar]
- 68.Nagendran M, Chen Y, Lovejoy CA, et al. Artificial intelligence versus clinicians: Systematic review of design, reporting standards, and claims of deep learning studies in medical imaging. BMJ. 2020;368. doi: 10.1136/bmj.m689 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Li X, Zhang S, Zhang Q, et al. Diagnosis of thyroid cancer using deep convolutional neural network models applied to sonographic images: a retrospective, multicohort, diagnostic study. Lancet Oncol. 2019;20(2):193–201. doi: 10.1016/S1470-2045(18)30762-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Chambara N, Ying M. The diagnostic efficiency of ultrasound computer–aided diagnosis in differentiating thyroid nodules: A systematic review and narrative synthesis. Cancers (Basel). 2019;11(11). doi: 10.3390/cancers11111759 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Edwards MK, Singh Ospina N. Implementation of Thyroid Nodule Risk Stratification in a High Volume Clinic. Clin Thyroidol. 2021;33(5):221–224. doi: 10.1089/ct.2021;33.221-224 [DOI] [Google Scholar]
- 72.Buda M, Wildman-Tobriner B, Hoang JK, et al. Management of thyroid nodules seen on us images: Deep learning may match performance of radiologists. Radiology. 2019;292(3):695–701. doi: 10.1148/radiol.2019181343 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Zhang B, Tian J, Pei S, et al. Machine Learning-Assisted System for Thyroid Nodule Diagnosis. Thyroid. 2019;29(6):858–867. doi: 10.1089/thy.2018.0380 [DOI] [PubMed] [Google Scholar]
- 74.Hammad ARY, Noureldine SI, Hu T, Ibrahim Y, Masoodi HM, Kandil E. A meta-analysis examining the independent association between thyroid nodule size and malignancy. Gland Surg. 2016;5(3):312–317. doi: 10.21037/gs.2015.11.05 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Wildman-Tobriner B, Buda M, Hoang JK, et al. Using artificial intelligence to revise ACR TI-RADS risk stratification of thyroid nodules: Diagnostic accuracy and utility. Radiology. 2019;292(1):112–119. doi: 10.1148/radiol.2019182128 [DOI] [PubMed] [Google Scholar]
- 76.Xu L, Gao J, Wang Q, et al. Computer-Aided Diagnosis Systems in Diagnosing Malignant Thyroid Nodules on Ultrasonography: A Systematic Review and Meta-Analysis. Eur Thyroid J. 2020;9(4):186–193. doi: 10.1159/000504390 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Thomas J, Haertling T. AIBx, Artificial Intelligence Model to Risk Stratify Thyroid Nodules. Thyroid. 2020;30(6):878–884. doi: 10.1089/thy.2019.0752 [DOI] [PubMed] [Google Scholar]
- 78.Doshi-Velez F, Perlis RH. Evaluating Machine Learning Articles. JAMA - J Am Med Assoc. 2019;322(18):1777–1779. doi: 10.1001/jama.2019.17304 [DOI] [PubMed] [Google Scholar]
- 79.Amann J, Blasimme A, Vayena E, Frey D, Madai VI. Explainability for artificial intelligence in healthcare: a multidisciplinary perspective. BMC Med Inform Decis Mak. 2020;20(1). doi: 10.1186/s12911-020-01332-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
