Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2024 Oct 17.
Published in final edited form as: IEEE Access. 2024 Apr 11;12:53277–53292. doi: 10.1109/access.2024.3387702

A Framework for Interpretability in Machine Learning for Medical Imaging

ALAN Q WANG 1,2, BATUHAN K KARAMAN 1,2, HEEJONG KIM 2, JACOB ROSENTHAL 2,3, RACHIT SALUJA 1,2, SEAN I YOUNG 4,5, MERT R SABUNCU 1,2
PMCID: PMC11486155  NIHMSID: NIHMS1987669  PMID: 39421804

Abstract

Interpretability for machine learning models in medical imaging (MLMI) is an important direction of research. However, there is a general sense of murkiness in what interpretability means. Why does the need for interpretability in MLMI arise? What goals does one actually seek to address when interpretability is needed? To answer these questions, we identify a need to formalize the goals and elements of interpretability in MLMI. By reasoning about real-world tasks and goals common in both medical image analysis and its intersection with machine learning, we identify five core elements of interpretability: localization, visual recognizability, physical attribution, model transparency, and actionability. From this, we arrive at a framework for interpretability in MLMI, which serves as a step-by-step guide to approaching interpretability in this context. Overall, this paper formalizes interpretability needs in the context of medical imaging, and our applied perspective clarifies concrete MLMI-specific goals and considerations in order to guide method design and improve real-world usage. Our goal is to provide practical and didactic information for model designers and practitioners, inspire developers of models in the medical imaging field to reason more deeply about what interpretability is achieving, and suggest future directions of interpretability research.

Keywords: Interpretability, explainability, medical imaging, machine learning

1. INTRODUCTION

Machine learning (ML) has seen remarkable advancement in recent years. ML’s intersection with medical imaging (which we abbreviate as MLMI) is amongst the most promising, offering potential advances to quality of patient care [1], [2], [3]. However, the most performant machine learning models like those in computer vision and deep learning are generally regarded as black boxes – they output predictions without revealing to human users how they arrived at those predictions. As such, there has been a surge of papers calling for and proposing methods that make their decision-making interrogable, understandable, or explainable by users. This subfield has gone by names like “interpretable,” “explainable,” and “transparent” ML. There is a clear need for such methods [4], [5], and the rising interest in this field is a reflection of the safety-critical, high-stakes setting in which medical imaging applications are deployed.

However, in the medical imaging field, there is a general sense of murkiness in how these words are or can be used. Many works claim a certain approach to be “non-interpretable” and thus sub-optimal, while others claim to increase interpretability, without a formalism that we can commonly agree on. Our motivation in this paper is to introduce a formal framework to use for considering, motivating, studying, validating, and discussing interpretability in MLMI. Central to our work is answering these questions:

  1. Why does the need for interpretability in MLMI arise?

  2. What does one actually seek when interpretability is needed? That is, what are the underlying elements of interpretability in MLMI?

  3. Informed by the above two points, how can we formalize them into a framework for interpretability in MLMI?

This paper attempts to address these three points. In particular, within the domain of medical imaging, we pinpoint elements from a purely applications-based perspective, starting from enumerating common tasks in medical image analysis and its intersection with ML, reasoning about their real world goals and considerations, and distilling from these five underlying elements: localizability, visual recognizability, physical attribution, model transparency, and actionability. This culminates in a framework for interpretability in MLMI, which serves as a step-by-step guide to approaching interpretability in this context.

We believe this paper can act as a guide for users and designers of interpretability in the medical imaging field by clarifying concepts in this context, suggesting future directions of research, and reasoning more deeply about how to evaluate the efficacy of their proposed methods. In addition, we hope that our work can provide concrete and practical utility for clinicians and researchers who may benefit from these models in their own workflows.

The paper is organized as follows. In Section II, we overview works similar to ours from both general machine learning and medical imaging perspectives, and identify the gap which our work aims to fill. In Section III, we broadly describe the goals and measurements which are sought in medical image analysis. In Section IV, we enumerate real-world tasks which comprise medical image analysis and which ML methods are employed to solve, as well as list possible real-world objectives which might be sought in the context of medical imaging and which cannot be explicitly optimized for. In Section V, we formally describe the elements of interpretable MLMI distilled from these real world tasks and goals. In Section VI, we connect these elements to existing literature by reframing existing methods. These sections culminate in Section VII, which provides a step-by-step guide to practitioners on approaching interpretability for MLMI. Finally, in Section VIII, we discuss further implications, including possible future directions of exploration and limitations of interpretability as a whole.

To summarize, our contributions in this work are as follows:

  • We introduce a framework for interpretability in MLMI, starting from goals of medical image analysis and motivated by the need for various real-world considerations in this context, including trustworthiness, continual adaptation, and fairness.

  • In doing so, we identify key elements of interpretability in MLMI and tie them to previously-proposed interpretability methods, in an effort to enable practical utility for clinicians and researchers.

  • We discuss implications, limitations, and potential future exploration directions, aiming to foster deeper understanding and evaluation of interpretability methods in MLMI.

II. RELATED WORKS

Interpretability in machine learning has been discussed and argued for by many articles [6], [7], [8]. Several papers in interpretable machine learning attempt to define desiderata [9], [10], [11]. Fundamentally, these surveys identify the need for interpretability arising from a mismatch [9] or incompleteness [10] between the objectives optimized during training (i.e. test set predictive performance) and the real world costs and benefits in a deployment setting. Lipton enumerates several desiderata for interpretability, including trust, causality, informativeness, transferability, and fair and ethical decision-making [9]. Murdoch et al. view interpretability from the context of the life-cycle of models, including the relationship of the model with the practitioner and the audience [11]. From their vantage point of general machine learning, these above surveys strive to cover all aspects of interpretability in this domain. While broad, they are necessarily vague in order to encompass all facets of ML, sacrificing practical utility in the process.

Our work follows the guidelines of Rudin [6], who argues that interpretability should be defined in a domain-specific manner. However, even in the narrower context of MLMI, there is not a consensus in the literature on what precisely defines interpretability. Many review papers survey the landscape of interpretability methods available to medical imaging practitioners like clinicians and researchers [12], [13], [14]. For example, Huff et al. categorize models according to whether they seek to understand model structure and function or whether they seek to understand model output [13]. van der Valden et al. propose a categorization based on model-based vs. post-hoc, model-specific vs. model-agnostic, and global vs. local methods [14]. While comprehensively categorizing available methodologies, these surveys are not focused on obtaining or distilling from these methods any underlying elements.

Some works go beyond categorizing methodologies and propose higher-level frameworks for interpretability in this narrower context. Schuhmacher et al. propose a framework for falsifiable explanations in medical imaging contexts [15]. Their framework requires that any interpretability method connect to the physical reality of the sample from which the data originate, and also require an explanation to be a falsifiable hypothesis. Their framework can be seen as a middle ground between the inductive nature of machine learning training and the hypothetico-deductive nature of empirical research. Our framework can be seen as an elaboration upon this framework with additional elements which we believe render it more comprehensive.

Common to all aforementioned works is the fact that any core properties which are gleaned from the literature are distilled from previously proposed papers and methods. Typically, these works draw a distinction between models that are inherently interpretable by design, and models that are interpretable post-hoc, possibly by a separate trained model [6], [9], [11]. We believe such a meta-analytical approach runs counter to the applied nature of the field. Instead, in this work, we pinpoint elements by first considering real-world tasks and considerations surrounding medical image analysis and its intersection with ML. We distill elements of interpretability from this applied perspective, without being influenced by current literature. An approach which starts from available methodologies is liable to restrict itself to elements which are currently feasible, instead of those which are truly desired.

III. GOALS OF MEDICAL IMAGE ANALYSIS

Medical image analysis is a multifaceted discipline, comprising many different imaging modalities, anatomies, and end goals. To provide a solid foundation upon which to clarify goals and elements for interpretable MLMI, we start by enumerating the use cases of medical image analysis, as well as the underlying measurements which are derived from medical imaging.

A. USE CASES

Here, we enumerate several real-world use cases of medical image analysis. The ultimate goals for these use cases could be clinical or scientific in nature. On the clinical side, some example goals might include pathology detection and characterization such as in bone fracture assessment, screening high risk groups such as with lung imaging, and assessing treatment outcomes such as with molecular imaging. On the scientific side, example use cases include cell biology, neuroscience, and drug development.

1). DIAGNOSIS

Medical imaging can help in identifying the nature of an illness [16], [17], [18], which may require revealing abnormalities such as tumors, fractures, infections, or organ damage that may not be apparent through physical examinations alone.

2). DISEASE STAGING

The stage and severity of diseases can be facilitated by medical imaging [19]. For example, medical images in oncology can show the size and spread of tumors, enabling identification of the cancer severity and furthermore enable treatment planning.

3). TREATMENT PLANNING

Treatment planning is concerned with tailoring the application of available treatment resources to each patient’s individual goals and needs [20], [21]. Common examples include planning surgical procedures and radiation therapy. With medical imaging, surgeons can visualize the precise location and extent of abnormalities, helping them make informed decisions about the best approach to treatment.

4). TREATMENT MONITORING

Similar to treatment planning, treatment monitoring seeks to assess the effectiveness of treatments over time [22]. Medical imaging can monitor changes in the size and characteristics of tumors or other disease markers, which can indicate how well a treatment is working.

5). FORECASTING

Also known as prognostication, forecasting is concerned with predicting the likely course of a disease and its potential outcomes [23], [24]. As a prerequisite, early detection of diseases can be facilitated through routine screening using imaging techniques like mammography or computed tomography (CT) angiography.

6). ASSESSMENT

Medical imaging is useful for assessing function of different parts of the body using imaging techniques such as functional magnetic resonance imaging (fMRI) and positron emission tomography (PET) scans. This is valuable for understanding brain activity, blood flow, and metabolic processes. Medical images also offer ways to interrogate structural health, such as in x-rays and bone density scans (DXA) used to evaluate bone health, detect fractures, and assess the risk of osteoporosis.

7). REAL-TIME GUIDANCE

Real-time guidance for minimally invasive procedures [25], [26], such as biopsies, catheter insertions, and needle aspirations helps to ensure precision and minimize risks.

8). RESEARCH AND EDUCATION

Medical imaging data is invaluable for clinicians and scientists for purposes of research and education. For example, histopathology imaging is an important tool for researchers studying the human body [27], allowing them to investigate diseases and develop new treatments. Additionally, medical imaging is an important tool for educating medical students and other healthcare professionals.

9). POPULATION HEALTH STUDIES

Population health studies utilize aggregated medical imaging data to identify trends, risk factors, and patterns of disease within communities [28], [29].

B. UNDERLYING MEASUREMENTS

Fulfilling these use cases may require various physical measurements which are uncovered through medical imaging. In this section, we enumerate several types of underlying signals that are measured in and derived from various imaging modalities. In subsequent sections, we consider how connecting and referencing to such measurements are, amongst other factors, central to interpretable MLMI.

1). SIZE, SHAPE AND DIMENSION

Medical imaging can precisely measure the size, shape, and dimensions of organs, tumors, lesions, and other anatomical structures [30], [31], [32]. Techniques like ultrasound, X-ray, CT, and magnetic resonance imaging (MRI) are commonly used for this purpose.

2). DENSITY AND COMPOSITION

Imaging techniques such as X-ray and CT scan provide information about tissue density and composition [33], [34]. Different tissues have varying X-ray absorption characteristics, allowing the differentiation of bone, soft tissue, and air, for example.

3). PERFUSION

Perfusion imaging measures blood flow to tissues and organs [35], [36]. Techniques like dynamic contrast-enhanced MRI and nuclear medicine scans (e.g. PET) can assess tissue perfusion, helping diagnose conditions like stroke or assess the viability of transplanted organs.

4). METABOLISM AND FUNCTION

PET and fMRI are imaging techniques that can probe metabolic activity and functional processes in the body [37], [38]. They are used to study brain function, detect cancer, and assess organ function.

5). TISSUE ELASTICITY

Elastography is a specialized ultrasound technique that measures tissue elasticity or stiffness [39], [40]. It is valuable in assessing the stiffness of liver tissue in conditions like cirrhosis and for breast cancer diagnosis.

6). OXYGENATION

Near-infrared spectroscopy (NIRS) and functional MRI can measure tissue oxygenation and oxygen levels in the blood, aiding in the assessment of brain function and detecting oxygenation problems in tissues [38], [41].

7). TEMPERATURE

Infrared thermography uses thermal imaging to measure the temperature of the skin’s surface [42]. It has applications in detecting inflammation and circulatory problems [43].

8). CHEMICAL COMPOSITION

Magnetic Resonance Spectroscopy (MRS) is a specialized MRI technique that can provide information about the chemical composition of tissues, including the presence of specific metabolites [44].

9). RADIOTRACER UPTAKE

PET scans use radiotracers labeled with radioactive isotopes to measure the uptake of specific substances in tissues, such as glucose or neurotransmitters [45], [46].

IV. MLMI OBJECTIVES

A. PRIMARY ML OBJECTIVES

As evidenced by the previous section, medical image analysis represents an array of use cases as well as imaging-derived measurements which are necessary to fulfill these use cases. MLMI is therefore comprised of many different ML tasks. In ML, tasks are primarily characterized by the structures of their input features and output predictions. In MLMI, inputs are images or features derived from images, sometimes combined with meta-data such as patient information. The structure of output predictions are determined by the task. In this section, we detail some of the most common tasks, and visualize them in Fig. 2.

FIGURE 2.

FIGURE 2.

Common tasks in MLMI. Tasks are primarily characterized by the structures of their input features and output predictions. In MLMI, inputs are images or features derived from images, sometimes combined with meta-data such as patient information. The structure of output predictions are determined by the task.

  • Image registration [47] seeks to align scans of the same anatomy. The output is typically a transformed image aligned to another fixed image.

  • Image segmentation [48] seeks to delineate different anatomical structures into discrete categories. The output is pixel-level class predictions.

  • Image enhancement, reconstruction, or restoration predict a clean version of a noisy image [49]. The output is pixel-level scalar predictions.

  • Object detection [50], [51] seeks to detect or isolate specific objects in the image, like organs or lesions. The outputs can be image coordinates (e.g. landmarks) or bounding boxes.

  • Image (or patient)-level classification seeks to categorize an image at the patient-level into one of several distinct categories (e.g. diagnosis) [52], [53]. The output is a single class label.

  • Image (or patient)-level regression seeks to associate with an image a real number that may represent some quantity (e.g. disease severity or some physical measurement) [54]. The output is a scalar real number.

The task and its corresponding structure of output prediction typically imply a prescribed objective which is optimized (e.g. mean-squared-error or cross-entropy). This objective is mathematically defined and thus can be optimized explicitly. However, in the real world, we often have considerations that go beyond these objectives. This is especially true in the context of medical imaging, where image-based tasks form one component of a larger clinical workflow, and where real world considerations such as safety, trust, fairness, and ease-of-use must be balanced alongside mathematical loss functions [55]. In the next section, we turn to these real world goals.

B. OTHER REAL-WORLD OBJECTIVES

In the previous section, we enumerated common tasks in medical image analysis and characterized them by the structure of their predictions; this structure lends itself to natural objectives that quantify performance and can be explicitly optimized (such as Dice overlap score for image segmentation or accuracy for classification). However, often times in real world deployment, we have goals beyond this natural objective. In this section, we enumerate several such (non-exhaustive) goals.

1). FACILITATING TROUBLESHOOTING AND/OR AUDITING.

Clinicians and researchers may require that models enable some way of facilitating troubleshooting and/or auditing. This is especially necessary when models are performing poorly, or when the robustness of models are called into question. As an example, Zech et al. used activation maps to uncover models leveraging spurious features (e.g. “shortcuts”) in chest radiographs [56]. In particular, activation maps showed that image artifacts (such as indicators of R and L sides) that were heavily correlated with the outcome of interest were driving the prediction. Such a finding might lead to removing such artifacts or other changes to remove model dependence on this shortcut.

2). ENABLING CONTINUAL IMPROVEMENT AND ADAPTATION.

Related but distinct from troubleshooting and auditing, clinicians and researchers may require that models continually improve and adapt over time. Data is a reflection of the real world and is therefore dynamic and constantly evolving. This can stem from availability of more data over time, but also may stem from shifts in the underlying statistics of the images due to changes in prevalence of diseases, changes in imaging protocols, or upgrades in measurement technologies [57]. A model should enable users to identify and correct errors due to these changes.

3). ENSURING FAIRNESS

Fairness seeks to guarantee that a model’s performance is balanced or invariant across patient groups, and in particular that it does not disadvantage minority or otherwise protected groups [58]. A clinician or researcher should be able to ensure and validate fairness of a model by, for example, interrogating and comparing the model’s performance across groups. As an example, Puyol-Anton et al. identified statistically significant differences in Dice scores between white population and minority ethnic groups for state-of-the-art models that segmented ventricles and the myocardium in cine cardiac magnetic resonance images [59]. As another example in brain MRI segmentation, Ioannou et al. observed a statistically significant drop in Dice scores when models trained on data from white subjects were evaluated on black female subjects [60]. Finally, Abbasi-Sureshjani et al. showed that skin lesion classifiers exhibited large differences in performance between male and female patients even though their training data was relatively balanced [61].

4). UNCOVERING SCIENTIFIC INSIGHTS AND ADVANCING MECHANISTIC UNDERSTANDING.

An exciting value proposition is the possibility of uncovering or revealing novel scientific insights. Science and medicine are often concerned with discovering and characterizing causal and/or mechanistic relationships. These relationships can be characterized on physical, physiological, or bio-chemical grounds via known mechanisms, phenomena, or pathways [62].

As an example, a model trained to predict survival from H&E histological sections for colorectal cancer was interrogated using a combination of model interpretability approaches and manual review by expert human pathologists, and a prognostic histological feature, tumor adipose feature (TAF), was identified [63]. Human pathologists were then able to learn to independently identify the TAF feature as an effective prognostic marker [64]. Models could play an important role in multiple parts of the scientific process, from discovery of new image-based patterns to improving human performance on critical tasks.

5). PROVIDING EDUCATIONAL VALUE

Clinical trainees often need to learn to analyze several kinds of images to extract key clinical imaging features to identify patterns and produce differential diagnosis corresponding to those patterns. However, their training and education may be hampered due to time constraints, limited training opportunities, or a lack of access to educational images. Machine learning based teaching platforms can augment their educational experience and provide key insights to improve them as clinicians. [65] presents an AI-augmented medical education platform to detect hip fractures by showcasing a heat-map to indicate locations of abnormality. They demonstrated that students who were trained by their system showed an improvement in understanding of the key features of a hip fracture in radiographs. Adaptive Radiology Interpretation and Education System (ARIES) is another platform presented by [66] which uses ML models to extract clinical imaging features from various brain MRI sequences and present it to neuroradiology residents to perform differential diagnosis. They observed that the residents performed significantly better with ARIES than without.

6). ENGENDERING TRUST

Perhaps the least tangible but equally important auxiliary objective is engendering trust. The emphasis on trust is most needed in clinical settings, where one of the major barriers to integrating ML models into clinical workflows is a lack of trust among clinicians; from this perspective, interpretability plays a role in MLMI as a means by which to improve trust among humans who are using these models [4], [5], [12], [67]. The concept of trust can further be decomposed into two components: trusting individual predictions produced by the model, and trusting the model itself as an entity for producing predictions [68]. Notably, this implies that explainability is principally valuable as an instrument to achieve other end goals, such as building trust or increasing usability of models, but does not necessarily have intrinsic value in and of itself [69].

V. ELEMENTS OF INTERPRETABLE MLMI

In the previous section, we enumerated various ML tasks and several real world goals for MLMI models, which in conjunction with the explicit objective, can be seen as constituting the combined end goals of the model. From the preceding section, it is apparent that real world models require insights that go beyond a single, aggregated, and quantifiable performance metric which encodes the explicit objective. The need for such insights brings about the need for interpretability [9], [10].

Yet, it is clear that not all these goals are sought after for every task, and furthermore that each one of these goals demands a different “element” of interpretability. That is, there is not one unique element of interpretability that satisfies all real world goals in all scenarios. Thus, we seek to answer the question: what are the core underlying elements of interpretability in MLMI? In particular, we seek to distill these objectives to be as minimal as possible.

We pinpoint five (possibly non-exhaustive) underlying elements: localizability, visual recognizability, physical attribution, model transparency, and actionability. The first three are concerned with characterizing the features of the image. The fourth element, model transparency, is concerned with the model itself. The fifth element, actionability, is concerned with both the model and influence on the human user. These elements are further detailed below and depicted in Fig. 3.

FIGURE 3.

FIGURE 3.

Graphical overview of the elements of interpretable MLMI.

A. LOCALIZABILITY

Where are the features, either spatially or temporally, that are driving the prediction? Medical images are high-dimensional spatio-temporal data that simultaneously exhibit relevant and redundant information, at various scales and locations. Localizability of features which are driving the prediction and conveying this location to the human is a central element of interpretability in MLMI.

B. VISUAL RECOGNIZABILITY

What are the visual features that drive the prediction? We refer to a model which exhibits visual recognizability as one that can provide a description of these image features in a human-recognizable way. This might involve conveying to the user features related to brightness, certain pixel intensity patterns, or certain textural patterns.

C. PHYSICAL ATTRIBUTION

This element seeks to reveal to the human salient features of the image which are attributed to some underlying physical meaning. This element connects image features to real-world entities of or related to the image formation process. These entities could be quantitative (measurements like those enumerated in Section III-B) or conceptual (e.g. semantic or human-derived concepts).

D. MODEL TRANSPARENCY

Model transparency elucidates to the human some degree of understanding of how the model produces its outputs. In contrast to the previous three properties, transparency is concerned with inputs, architectures, and/or algorithms and does not characterize image features.

Prior works in the ML literature have discussed different notions of transparency, including simulatability, decomposability, algorithmic transparency, sparsity, modularity, and intelligibility [9], [11], [70]. These notions are applicable broadly to any ML model and are not special to medical image analysis.

By contrast, a more MLMI-specific notion of transparency are approaches which incorporate information related to known entities or mechanisms with respect to the problem being solved or other domain-specific information. Inductive biases can then be injected which constrain the model to operate within the bounds of that domain. For example, the model designer may have domain knowledge about the data-generating and/or image-formation process [71], [72], [73], or have prior knowledge of the mechanisms or anatomy that underlie the problem being studied [74], [75], [76]. See Section VI-H for further details.

E. ACTIONABILITY

Actionability elucidates to the human information that leads to some course of action. The action may be some form of falsifiability or recourse [15]. We identify two types of actionability.

The first type refers to recourse with respect to the model. Here, actionability refers to interpretable information that makes apparent a solution which involves changing or editing some aspect of the model or algorithm. This is related to model transparency, in that models which are transparent can naturally lead to actionable recourse.

The second type refers to recourse with respect to the human user. Here, actionability refers to interpretable information which leads to actionable insights for the human user. For example, making apparent certain pathologies in the image might correspond to prescribed protocols that the human should undertake. As another example, an interpretation may indicate to the human to run a follow-up experiment to further interrogate the problem in question.

VI. CONNECTING EXISTING METHODS TO ELEMENTS

In this section, we discuss several popular interpretability methods and describe how they relate to our aforementioned elements. For more comprehensive overviews of methodologies, we refer the reader to dedicated surveys [12], [13], [14].

A. RADIOMICS AND FEATURE ENGINEERING

Radiomics and other clinically-derived features seek to reduce high dimensional features like shape, intensity and texture into features using image processing or clinically-informed techniques [77], [78], [79], [80], [81], and is particularly prevalent in oncological applications. Typical radiomics workflows require a segmentation step to first localize the region of interest (e.g. tumor), and thus may be seen to exhibit localizability. Furthermore, since many engineered features are designed to capture intensity or textural information, they may be seen to exhibit some notion of recognizability. Features focused on domain-specific and clinically-relevant features like shape, size, or orientation may be seen as exhibiting physical attribution. Finally, because many features involve relatively simple mathematical expressions to compute, humans can typically contemplate the computation of these features and can thus be seen to exhibit model transparency.

B. VISUALIZATION TECHNIQUES

In this section, we detail several visualization techniques that seek to convey to humans the inner workings of a model via information that can be plotted. Since these methods are all seeking to overcome the black-box nature of models, they can all be seen as exhibiting model transparency.

Feature importance techniques such as partial dependence plots (PDPs) [82], individual conditional expectation (ICE) plots [83], and SHapley Additive exPlanations (SHAP) [84], [85], [86] have been used for analysing partial dependency of breast tumor malignancy on ultrasound image features [87], and uncovering important features in predicting cognitive decline in Alzheimer’s disease [88].

A class of techniques allows for analysis of high-dimensional latent space of deep networks. Techniques like t-SNE [89] and UMAP [90] project high-dimensional embeddings into a low-dimensional space that can be plotted (e.g. in 2 dimensions). Doing so can roughly uncover how models organize datapoints in the latent space.

C. SALIENCY MAPS

Saliency maps reveal salient features within an image such as lesions or changes over time in longitudinal studies by outputting heatmaps which can be overlaid on top of the image(s) [91], [92], [93]. Popular techniques include CAM-based methods [94], [95], [96], [97], [98], [99], [100] and methods that incorporate attention mechanisms to make predictions [92], [101], [102], [103].

Saliency maps may be seen to exhibit localizability, since they highlight salient features locally.

D. CASE-BASED METHODS AND EXPLANATION BY EXAMPLE

This class of methods present evidence for predictions for a query sample by identifying similar samples. This is sometimes referred to as “explanation by example” [9], [104]. Prototypical networks [105], [106], [107], [108] classify features or images (e.g. chest X-ray scans) by finding their nearest class prototypes (e.g. images of typical cardiomegaly, nodule, etc) in a learned embedding space. Lamy et al. propose a visual case-based reasoning method for classification and applied it to breast cancer [109].

Case-based methods may be seen to exhibit model transparency, since they can reveal to the user which images the model deems most (dis)similar to one another, which in turn reveals some understanding of its inner workings. In some cases, case-based methods may be paired with other techniques which exhibit other elements, like saliency maps for localizability [110].

E. INFLUENCE FUNCTIONS

Influence functions quantify the most influential training images (either positive or negative) which lead to the prediction for a given sample [108], [111], [112]. For example, Wang et al. used influence functions in the context of liver tumor diagnosis in order to quantify the radiological features that were most influential for classifying a particular lesion [113].

Influence functions may be seen to exhibit model transparency, since they reveal the most “helpful” or “harmful” training images for a model to make a prediction for a given image.

F. IMAGE PERTURBATION TECHNIQUES

Image perturbation techniques directly perturb the input image. These perturbations can identify changes in intensity patterns and deformations that highlight underlying biophysical and biomechanical processes. For example, counterfactuals [114], [115], [116] introduce the minimum amount of change to a given image to change its predicted class. Examples of counterfactual image generation for medical image analysis include chest [93], [117], and knee [118] X-ray scans for the prediction of pulmonary disorders, and severity of osteoarthritis, respectively, as well as brain [119], and breast [116] MRI for Alzheimer’s disease and breast tumor analyses, respectively.

Other popular perturbation techniques include adversarial attacks [120], [121], [122], [123], [124], occlusion sensitivity [125], and local interpretable model-agnostic explanations (LIME) [68].

Perturbation techniques may be seen to exhibit visual recognizability, since they may alter features in meaningful ways that humans can recognize. If these perturbations are meaningfully localized, they may also be seen to exhibit localizability.

G. TEXT-BASED METHODS

Many methods approach interpretability by incorporating human-derived and annotated concepts. In MLMI, these concepts could be clinically-relevant text or phrases, like “sclerosis”, “bone spurs”, or “narrow joint space” in the context of arthritis grading. Concept bottleneck models (CBMs) [126], [127], [128] aim to decompose end-to-end prediction models into non-linear concept prediction and linear or logistic task prediction from the predicted concept values. Concept whitening (CW) models [129] extend CBMs by additionally othogonalizing and rescaling the concept activation vectors during training for improved interpretability. Applications of CBMs to medical image analysis include skin lesion diagnosis [130], [131], pediatric appendicitis [132], and chest radiology [133]. Quantitative testing with concept activation vectors (TCAV) [134] employs concept activation vectors to interpret the internal state of neural networks in terms of user-defined concepts; the authors apply it to a deep learning model that predicts diabetic retinopathy (DR) level from retinal fundus images.

Other than concepts, text-based methods also may output justifications, reports, and captions. These methods typically provide long-form text along with the predictions. Zhang et al. propose MDNet, which leverages a vision and language model to visualize attention as well as provide justifications of the predicted diagnosis [91]. Lee et al. propose a visual word constraint model to improve the accuracy of a justification generator for diagnostic interpretations [135].

Concept-based methods may be seen to exhibit physical attribution, since concepts are typically human-annotated phrases that map to specific real-world characteristics associated with a diagnosis. Additionally, text might point out textural or image-derived information (e.g. “irregular dots” or “bright contrast”) which may be seen to exhibit recognizability.

H. ARCHITECTURAL INDUCTIVE BIASES

In medical imaging contexts, many techniques and methods exist which leverage domain knowledge to inject domain-specific inductive biases into model architectures. These methods use information specific to the problem being solved, for example specific to image content (e.g. anatomy, geometry), or leveraging information about the data-generating process (e.g. a known forward model). Since they directly alter architectures and algorithms used to solve the problem, models of this type may be seen to exhibit model transparency.

In reconstruction, unrolled architectures [71] and physics-based models [72] incorporate knowledge of the underlying forward model, data-generating process, or physical constraints. In classification, mechanistic models [136] and anatomical priors [74] incorporate knowledge of specific objects of interest into the architecture, for example brain midline shift [76], models for tumor growth [75], and cardiac image enhancement and segmentation [137]. In registration, the relative homogeneity in geometry of medical images has been leveraged [138]. Another line of research seeks to apply causal inference to various contexts in MLMI [73], [139], [140]. Works in this vein inject prior information into the model by reasoning about the causal relationships in the data-generating process.

VII. A FRAMEWORK FOR INTERPRETABLE MLMI

In this section, we provide a guide to practitioners of MLMI models for reasoning about interpretability. This guide follows the ordering of sections in the paper, and we reference the corresponding section throughout. A diagram is depicted in Fig. 1.

FIGURE 1.

FIGURE 1.

A framework for interpretability in MLMI.

  1. Problem and goals. Consider the medical use case being addressed (Section III-A) and any underlying measurements necessary for that use case (Section III-B).

  2. ML objectives. From the use case, determine the appropriate ML task and its appropriate objective function (Section IV-A).

  3. Real-world objectives. In addition to the objective function, consider other real-world objectives pertinent to the use case being addressed (Section IV-B).

  4. Elements of interpretability. From the other real-world objectives, consider the underlying elements which capture these objectives (Section V).

  5. Methods. Given these elements, select the appropriate interpretability method which addresses it (Section VI).

In the remainder of the section, we provide two case studies to illustrate our proposed framework.

A. CASE STUDY 1: AUTOMATED, INITIAL SCREENING FOR DIABETIC RETINOPATHY

  1. Problem and goals. Diabetic retinopathy (DR) is a complication of diabetes that can ultimately lead to blindness. Early-stage detection of DR is essential for taking measures to reverse the disease’s progression, making it important for people living with diabetes to be screened regularly, with retinal photography to assess for blood vessel damage or retinal lesions being a key imaging modality used for screening [141], [142], [143]. However, screening programs often face difficulties due to a shortage of clinical specialists [142]. Therefore, a machine learning solution as an initial triage tool which flags suspicious or high-risk cases and sends them to a clinician for further assessment would reduce clinician burden and enable more widespread screening, which is especially useful in resource-limited environments.

  2. ML objective. Since this task involves categorization into one of several distinct labels, this is a classification task and the loss function can be a classification loss like cross-entropy.

  3. Real-world objectives. Beyond classification, as this model will be deployed in a clinical setting, we would demand the ability to troubleshoot and audit the model, and ensure that predictions are fair across patient groups. Additionally, user trust of the model is imperative.

  4. Elements. Of utmost importance amongst the elements is localizability of the lesions or blood vessels which are suspicious. Probing a model’s ability to localize and recognize the suspicious parts of the image will satisfy the various real-world objectives. As this tool will be used as an initial screening tool and the nature of the disease being studied is well understood, the value of physical attribution and actionability is relatively small.

  5. Methods. Saliency maps may be used to address localizability.

B. CASE STUDY 2: BREAST CANCER DETECTION

  1. Problems and goals. Early detection of breast cancer is important for increasing the chances of patients’ survival [144]. A large part of clinical diagnosis in this area is concerned with characterizing various features of the tumor from mammograms, including masses, calcification, architectural distortion, asymmetries, and other related signs [145]. These and other factors have been standardized in the BI-RADS format [146]. Thus, there exists a list of human-recognizable and physically-understood predictive features.

  2. ML objective. As the goal of this tool is to make a discrete diagnostic decision, the ML task is classification and the loss function can be a classification loss like cross-entropy.

  3. Real-world objectives. Similar to Case Study 1, as this model will be deployed in a clinical setting, we would demand the ability to troubleshoot and audit the model, and ensure that predictions are fair across patient groups. Additionally, user trust of the model is imperative.

  4. Elements. Similar to Case Study 1, localizability of the tumors is an important element to consider. As this tool will be used as a diagnostic aid by clinicians, it is important to connect to known features that are widely used for breast cancer diagnosis, e.g. as codified by the BI-RADS format. Thus, visual recognizability and physical attribution of features are desired.

  5. Methods. Similar to Case Study 1, saliency maps may be used to address localizability. Visual recognizability and physical attribution may be addressed by incorporating feature engineering and radiomics [147]. The importance of these features can be further interrogated using visualization techniques like ICE and SHAP.

C. CASE STUDY 3: VISUAL ENCODING

  1. Problem and goals. Visual encoding models that predict functional MRI brain responses to visual stimuli are of scientific interest, as they enable quantification of population heterogeneity in brain stimuli-response mapping, which may allow insight into variability in bottom-up neural systems that can in turn be related to individual’s behavior or pathological state [148], [149], [150].

  2. ML objective. Since this task requires predicting real-valued time-series signals, the task is regression and an appropriate ML loss function might be mean-squared error.

  3. Real-world objectives. The ultimate objective in this case is scientific discovery. As this model will be employed for uncovering aspects of brain function and how it is linked to individual behavior, we are primarily interested in mechanistic understandings.

  4. Elements. Important elements include model transparency and actionability, as we are interested in the information and potential scientific understanding which can be uncovered by the model, as well as actionable insights for future experiments.

  5. Methods. For model transparency, specific architectures can be designed to mimic the brain, like enforcing intermediate layers of the deep network to map to different areas of the visual cortex. These layers may be visualized, for example using t-SNE plots, to understand how the network organizes information in the latent space. This can also address actionability, as being able to diagnose issues in some intermediate layers may uncover issues with certain parts of the network.

VIII. DISCUSSION

A. INHERENT INTERPRETABILITY OF PIXEL-LEVEL TASKS

Pixel-level tasks output predictions directly on the image grid, where the level of prediction is localized to the smallest level of granularity. Often, these outputs represent human-recognizable entities, like region-of-interest labels for segmentation or dense deformation fields for registration. Thus, it may be seen that such pixel-level tasks exhibit an inherent interpretability because the structures of their predictions happen to be localizable and visually recognizable. This may explain why there is a relatively fewer number of works focused on interpretability in this context (to the best of our knowledge, there are no survey papers discussing the subject).

B. INTENDED USERS

Interpretability is a subjective enterprise, and often times a useful interpretation is in the eye of the beholder. Thus, keeping in mind intended users is important when considering interpretability methods. Methods most useful to clinicians and researchers will differ greatly from methods most useful to model developers and auditors, for example. Even amongst clinicians, their expertise might vary and will affect what one is looking for in terms of an interpretation. For example, experienced radiologists might be satisfied with one interpretation, whereas a treating physician might not.

In this work, we have focused on the case where interpretability is sought for deployed systems and where the primary users are clinicians or researchers who may not be experts in ML. Generally, we argue that localizability, visual recognizability, and physical attribution are the most important elements for these users, as it connects the prediction directly to the images and abstracts away modeling-specific choices. By contrast, transparency might be more important for model designers who have knowledge of the ML specifics.

C. FAITHFULNESS

One area of interpretability research not yet mentioned is that of faithfulness of the interpretability method [151], [152], [153]. A faithful interpretable method is one that accurately reflects the underlying predictive model, such that the method is internally coherent [152]. Since faithfulness pertains only to the relationship between the explanation and the model, it is relatively removed from connections to the real-world and is thus out-of-scope with respect to this work. However, practitioners are advised to consider this aspect of interpretability in their own use cases.

D. LIMITATIONS OF INTERPRETABLE MLMI

While interpretability is a promising direction for the end goal of improving patient care, it can potentially introduce biases or hinder model performance. Here, we list two potential sources of limitation to consider.

First, interpretable MLMI might induce or contribute to human biases during decision-making processes. When users’ levels of trust in algorithmic decision-making aids are miscalibrated, the result manifests as either automation bias (too much trust) or algorithmic aversion (too much distrust) [154], [155], [156]. False senses of trust can also be induced when models’ high-dimensional internal representations are projected into low-dimensional space for human review (e.g., 2-dimensional heatmaps), leading users to the false impression of true understanding while they in fact only have an approximation of “ersatz” understanding of the model’s inner workings [157]. Because one of the core objectives of interpretable MLMI is to modulate users’ trust in ML-powered decision aids, these biases represent pitfalls which must be considered carefully when deploying these models in practice.

Second and relatedly, interpretability in MLMI might run counter to the goal of ultimately augmenting the human user’s capabilities. From this perspective, “uninterpretability” might be reflective of a means of performing decision-making orthogonal to the capabilities of humans. As medicine becomes increasingly complex, the amount of information available to be integrated into the decision-making process may exceed the capacities of the human mind [158]. Indeed, constraining ML models to be human-understandable may result in models which overlap with or simply recapitulate patterns already identified by humans. Leveraging the complementary strengths of human perception and machine perception, including incorporating and processing signals which cannot be easily rationalized by humans, may ultimately be necessarily towards the goal of improving decision-making.

E. NOVEL APPROACHES TO INTERPRETABILITY

While there have been many attempts at interpretability in MLMI, there is still a need for further research and new perspectives. For example, a nascent approach to interpretability that seeks to reason about the mechanisms by which neural networks arrive at predictions is called mechanistic interpretability [159]. Works in this line of research interpret models trained on simple tasks like modular arithmetic and decompose it into individual neurons and circuits which compute different parts necessary to arrive at the answer. This field has found promise in understanding the inner workings of transformer-based large language models [160]. While underexplored in the context of medical imaging, mechanistic interpretability is directly aligned with the real-world goal of connecting to and revealing mechanisms and mechanistic understanding. We believe works along this vein will be important in the future.

While this work is focused on interpretable ML in medical imaging, we do recognize that similar ideas may be applicable to other domains like law and finance, and hope that our work can inspire others to reason similarly in a domain-specific manner about interpretability.

IX. CONCLUSION

We arrived at five elements of interpretability in machine learning for medical imaging through reasoning about real-world goals, tasks, and objectives in medical image analysis and its intersection with machine learning. These elements are localizability, visual recognizability, physical attribution, model transparency, and actionability. Furthermore, we connect our elements to the current literature by summarizing various methods and approaches in interpretable MLMI and attributing each to our elements. Taken together, these components constitute a framework for interpretability in MLMI, a guide for practitioners on how to reason and utilize interpretability in their own use cases.

ACKNOWLEDGMENT

The authors would like to thank Kayhan Batmanghelich for insightful discussions and suggesting the actionability element.

This work was supported in part by NIH under Grant R01AG053949 and Grant T32GM007739, and in part by the NSF CAREER under Grant 1748377.

Biographies

graphic file with name nihms-1987669-b0004.gif

ALAN Q. WANG received the B.S. degree in computer engineering from the University of Illinois at Urbana–Champaign (UIUC). He is currently pursuing the Ph.D. degree with the School of Electrical and Computer Engineering, Cornell University.

He is also affiliated with Cornell Tech and the Department of Radiology, Weill Cornell Medical School. His research interests include machine learning and computer vision, especially applied to medical imaging, bridging the gap between deep learning models, and human experts like doctors and clinicians, which leads him to be interested in research in interpretability, explainability, interactivity, and robustness of deep learning models.

graphic file with name nihms-1987669-b0005.gif

BATUHAN K. KARAMAN received the M.S. degree in electrical and computer engineering from Cornell University, where he is currently pursuing the Ph.D. degree with the School of Electrical and Computer Engineering.

He is also affiliated with Cornell Tech and the Department of Radiology, Weill Cornell Medicine. His current research interests include develops interpretable and explainable deep learning techniques for the early prediction of future cognitive decline toward dementia and analysis of associated biomarkers. His research interests include machine learning and biomedical data analysis, with a focus on Alzheimer’s.

graphic file with name nihms-1987669-b0006.gif

HEEJONG KIM received the Ph.D. degree in computer science from New York University, New York City, NY, USA. She is currently a Postdoctoral Researcher in radiology with Weill Cornell Medicine. Her research interests include discovering machine learning techniques and their application to biomedical images.

graphic file with name nihms-1987669-b0007.gif

JACOB ROSENTHAL received the M.Sc. degree in health data science from the Harvard T. H. Chan School of Public Health, Boston, MA, USA, and the B.A. degree in biology from the Oberlin College, Oberlin, OH, USA. He is currently pursuing the integrated M.D. and Ph.D. degrees with the Weill Cornell/Rockefeller/Sloan Kettering Tri-Institutional M.D.-Ph.D. Program, New York City, NY, USA.

His research interests include the domain of machine learning for medicine, specifically in the development and deployment of machine learning methods to improve accuracy, efficiency, and fairness of clinical workflows.

graphic file with name nihms-1987669-b0008.gif

RACHIT SALUJA received the B.E. degree in electrical and electronics engineering from the PES Institute of Technology, India, and the M.S. degree in electrical engineering from the University of Pennsylvania. He is currently pursuing the Ph.D. degree with the School of Electrical and Computer Engineering, Cornell University. He is also affiliated with Cornell Tech and the Department of Radiology, Weill Cornell Medical School. His research interests include building multi-modal AI models for clinical radiology and developing robust radiology AI systems.

graphic file with name nihms-1987669-b0009.gif

SEAN I. YOUNG received the Ph.D. degree in electrical engineering from The University of New South Wales, Sydney, Australia. He is currently an Instructor with the Martinos Center for Biomedical Imaging, Harvard Medical School, and a Research Affiliate with CSAIL, MIT. Previously, he was a Postdoctoral Scholar with Stanford University, Stanford, CA, USA. He received the APRS/IAPR Best Paper Award at DICTA 2018, together with David Taubman.

graphic file with name nihms-1987669-b0010.gif

MERT R. SABUNCU received the Ph.D. degree in electrical engineering from Princeton University, in 2006. His Ph.D. dissertation focused on entropy-based approaches to image registration. Then, he moved to the Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, as a Postdoctoral Researcher, where he worked on a range of biomedical image analysis problems, including the segmentation of brain MRI scans. After his postdoctoral research with MIT, he spent a few years with the A. A. Martinos Center for Biomedical Imaging (Massachusetts General Hospital and Harvard Medical School) as a Junior Faculty Member, where he built a research program on algorithmic tools for integrating genetics and medical imaging. He is currently a Professor in electrical and computer engineering with Cornell University—Cornell Tech, New York City. He also holds a faculty appointment in radiology with Weill Cornell Medicine. His research group develops machine learning-based computational tools for biomedical imaging applications. He was a recipient of an NSF CAREER Award, in 2018, and an NIH Early Career Development Award, in 2011.

REFERENCES

  • [1].Dembrower K, Crippa A, Colón E, Eklund M, and Strand F, “Artificial intelligence for breast cancer detection in screening mammography in sweden: A prospective, population-based, paired-reader, non-inferiority study,” Lancet Digit. Health, vol. 5, no. 10, pp. e703–e711, Oct. 2023. [DOI] [PubMed] [Google Scholar]
  • [2].Erickson BJ, Korfiatis P, Akkus Z, and Kline TL, “Machine learning for medical imaging,” Radiographics, vol. 37, no. 2, pp. 505–515, Mar./Apr. 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [3].Shen D, Wu G, and Suk HI, “Deep learning in medical image analysis,” Annu. Rev. Biomed. Eng, vol. 19, no. 1, pp. 221–248, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [4].Rajpurkar P. and Lungren MP, “The current and future state of AI interpretation of medical images,” Obstetrical Gynecol. Surv, vol. 78, no. 11, pp. 634–635, 2023. [DOI] [PubMed] [Google Scholar]
  • [5].Reyes M, Meier R, Pereira S, Silva CA, Dahlweid F-M, Tengg-Kobligk HV, Summers RM, and Wiest R, “On the interpretability of artificial intelligence in radiology: Challenges and opportunities,” Radiol., Artif. Intell, vol. 2, no. 3, May 2020, Art. no. e190043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [6].Rudin C, “Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead,” Nature Mach. Intell, vol. 1, pp. 206–215, May 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [7].Lipton ZC, “The doctor just won’t accept that!” 2017, arXiv:1711.08037. [Google Scholar]
  • [8].Kundu S, “AI in medicine must be explainable,” Nature Med., vol. 27, no. 8, p. 1328, Aug. 2021. [DOI] [PubMed] [Google Scholar]
  • [9].Lipton ZC, “The mythos of model interpretability,” in Proc. ICML Workshop Human Interpretability Mach. Learn. (WHI; ), New York, NY, USA, 2017, pp. 96–100. [Google Scholar]
  • [10].Doshi-Velez F. and Kim B, “Towards a rigorous science of interpretable machine learning,” 2017, arXiv:1702.08608. [Google Scholar]
  • [11].Murdoch WJ, Singh C, Kumbier K, Abbasi-Asl R, and Yu B, “Definitions, methods, and applications in interpretable machine learning,” Proc. Nat. Acad. Sci. USA, vol. 116, no. 44, pp. 22071–22080, Oct. 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Salahuddin Z, Woodruff HC, Chatterjee A, and Lambin P, “Transparency of deep neural networks for medical image analysis: A review of interpretability methods,” Comput. Biol. Med, vol. 140, Jan. 2022, Art. no. 105111. [DOI] [PubMed] [Google Scholar]
  • [13].Huff DT, Weisman AJ, and Jeraj R, “Interpretation and visualization techniques for deep learning models in medical imaging,” Phys. Med. Biol, vol. 66, no. 4, Feb. 2021, Art. no. 04TR01. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [14].van der Velden BHM, Kuijf HJ, Gilhuijs KGA, and Viergever MA, “Explainable artificial intelligence (XAI) in deep learning-based medical image analysis,” Med. Image Anal, vol. 79, Jul. 2022, Art. no. 102470. [DOI] [PubMed] [Google Scholar]
  • [15].Schuhmacher D, Schörner S, Küpper C, Großerueschkamp F, Sternemann C, Lugnier C, Kraeft A-L, Jütte H, Tannapfel A, Reinacher-Schick A, Gerwert K, and Mosig A, “A framework for falsifiable explanations of machine learning models with an application in computational pathology,” Med. Image Anal, vol. 82, Nov. 2022, Art. no. 102594. [DOI] [PubMed] [Google Scholar]
  • [16].Montenegro JMF, Villarini B, Angelopoulou A, Kapetanios E, Garcia-Rodriguez J, and Argyriou V, “A survey of Alzheimer’s disease early diagnosis methods for cognitive assessment,” Sensors, vol. 20, no. 24, p. 7292, Dec. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [17].Nooreldeen R. and Bach H, “Current and future development in lung cancer diagnosis,” Int. J. Mol. Sci, vol. 22, no. 16, p. 8661, Aug. 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [18].Bussani R, Castrichini M, Restivo L, Fabris E, Porcari A, Ferro F, Pivetta A, Korcova R, Cappelletto C, Manca P, Nuzzi V, Bessi R, Pagura L, Massa L, and Sinagra G, “Cardiac tumors: Diagnosis, prognosis, and treatment,” Current Cardiol. Rep, vol. 22, no. 12, p. 169, Oct. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [19].Conklin JE, Lieberman JV, Barnes CA, and Louis DZ, “Disease staging: Implications for hospital reimbursement and management,” in Health Care Financing Review. Baltimore, MD, USA: Medicare and Medicaid Research Review, 1984, pp. 13–22. [PMC free article] [PubMed] [Google Scholar]
  • [20].Johnson SL, “Treatment planning,” in Therapist’s Guide to Pediatric Affect and Behavior Regulation (Practical Resources for the Mental Health Professional), Miller PM, Ed. Johnson SL, Ed. San Diego, CA, USA: Academic, 2013, Ch. 3, pp. 131–236. [Google Scholar]
  • [21].Noel NE, “Evidence-based treatment planning for substance abusetherapy,” in Evidence-Based Addiction Treatment. San Diego, CA, USA: Academic, 2009, ch. 25, pp. 399–417. [Google Scholar]
  • [22].Aronson JK, “Monitoring therapy,” Brit. J. Clin. Pharmacol, 60, 3, 229–230, 2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [23].Chu C, White N, and Stone P, “Prognostication in palliative care,” Clin. Med, vol. 19, no. 4, pp. 306–310, Jul. 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [24].Hui D, Paiva CE, Del Fabbro EG, Steer C, Naberhuis J, van de Wetering M, Fernández-Ortega P, Morita T, Suh S-Y, Bruera E, and Mori M, “Prognostication in advanced cancer: Update and directions for future research,” Supportive Care Cancer, vol. 27, no. 6, pp. 1973–1984, Jun. 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [25].Antico M, Sasazawa F, Wu L, Jaiprakash A, Roberts J, Crawford R, Pandey AK, and Fontanarosa D, “Ultrasound guidance in minimally invasive robotic procedures,” Med. Image Anal, vol. 54, pp. 149–167, May 2019. [DOI] [PubMed] [Google Scholar]
  • [26].Stull JD, Mangan JJ, Vaccaro AR, and Schroeder GD, “Robotic guidance in minimally invasive spine surgery: A review of recent literature and commentary on a developing technology,” Current Rev. Musculoskeletal Med, vol. 12, no. 2, pp. 245–251, Jun. 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [27].Srinidhi CL, Ciga O, and Martel AL, “Deep neural network models for computational histopathology: A survey,” Med. Image Anal, vol. 67, Jan. 2021, Art. no. 101813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [28].Gourevitch MN, Curtis LH, Durkin MS, Fagerlin A, Gelijns AC, Platt R, Reininger BM, Wylie-Rosett J, Jones K, and Tierney WM, “The emergence of population health in U.S. Academic medicine: A qualitative assessment,” JAMA Netw. Open, vol. 2, no. 4, pp. e192200–e192200, Apr. 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [29].Shahzad M, Upshur R, Donnelly P, Bharmal A, Wei X, Feng P, and Brown AD, “A population-based approach to integrated healthcare delivery: A scoping review of clinical care and public health collaboration,” BMC Public Health, vol. 19, no. 1, p. 708, Dec. 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [30].Liu Y, Yan P, Cao X, Shi Y, Li S, Wang Q, Wang L, and Zhuang X, Eds., Shape in Medical Imaging. Switzerland: Springer, 2020. [Google Scholar]
  • [31].Singh Y, Farrelly CM, Hathaway QA, Leiner T, Jagtap J, Carlsson GE, and Erickson BJ, “Topological data analysis in medical imaging: Current state of the art,” Insights Imag., vol. 14, no. 1, p. 58, Apr. 2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [32].Van Nguyen S, Tran HM, and Le TS, “Application of geometric modeling in visualizing the medical image dataset,” SN Comput. Sci, vol. 1, no. 8, p. 254, 2020. [Google Scholar]
  • [33].Mesbah Oskui S, Bhakta HC, Diamante G, Liu H, Schlenk D, and Grover WH, “Measuring the mass, volume, and density of microgram-sized objects in fluid,” PLoS ONE, vol. 12, no. 4, Apr. 2017, Art. no. e0174068. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [34].Yaffe MJ, “Mammographic density. Measurement of mammographic density,” Breast Cancer Res., vol. 10, no. 3, p. 209, Jun. 2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [35].Demeestere J, Wouters A, Christensen S, Lemmens R, and Lansberg MG, “Review of perfusion imaging in acute ischemic stroke,” Stroke, vol. 51, no. 3, pp. 1017–1024, Mar. 2020. [DOI] [PubMed] [Google Scholar]
  • [36].Kamphuis ME, Greuter MJW, Slart RHJA, and Slump CH, “Quantitative imaging: Systematic review of perfusion/flow phantoms,” Eur. Radiol. Exp, vol. 4, no. 1, p. 15, Mar. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [37].Judge A. and Dodd MS, “Metabolism,” Essays Biochem., vol. 64, no. 4, pp. 607–647, Oct. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [38].Glover GH, “Overview of functional magnetic resonance imaging,” Neurosurg. Clinics, vol. 22, no. 2, pp. 133–139, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [39].Sigrist RMS, Liau J, Kaffas AE, Chammas MC, and Willmann JK, “Ultrasound elastography: Review of techniques and clinical applications,” Theranostics, vol. 7, no. 5, pp. 1303–1329, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [40].Sarvazyan A, Hall TJ, Urban MW, Fatemi M, Aglyamov SR, and Garra BS, “An overview of elastography–An emerging branch of medical imaging,” Current Med. Imag. Rev, vol. 7, no. 4, pp. 255–282, Nov. 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [41].Dantzker DR, “Adequacy of tissue oxygenation,” Crit. Care Med, vol. 21, pp. S40–43, Feb. 1993. [DOI] [PubMed] [Google Scholar]
  • [42].Sarawade AA and Charniya NN, “Infrared thermography and its applications: A review,” in Proc. 3rd Int. Conf. Commun. Electron. Syst. (ICCES; ), Oct. 2018, pp. 280–285. [Google Scholar]
  • [43].Lahiri BB, Bagavathiappan S, Jayakumar T, and Philip J, “Medical applications of infrared thermography: A review,” Infr. Phys. Technol, vol. 55, no. 4, pp. 221–235, Jul. 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [44].Tognarelli JM, Dawood M, Shariff MIF, Grover VPB, Crossey MME, Cox IJ, Taylor-Robinson SD, and McPhail MJW, “Magnetic resonance spectroscopy: Principles and techniques: Lessons for clinicians,” J. Clin. Exp. Hepatol, vol. 5, no. 4, pp. 320–328, Dec. 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [45].O’Sullivan GJ, “Imaging of bone metastasis: An update,” World J. Radiol, vol. 7, no. 8, pp. 202–211, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [46].Fortunati E, Argalia G, Zanoni L, Fanti S, and Ambrosini V, “New PET radiotracers for the imaging of neuroendocrine neoplasms,” Current Treatment Options Oncol., vol. 23, no. 5, pp. 703–720, May 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [47].Zitová B. and Flusser J, “Image registration methods: A survey,” Image Vis. Comput, vol. 21, no. 11, pp. 977–1000, Oct. 2003. [Google Scholar]
  • [48].Minaee S, Boykov Y, Porikli F, Plaza A, Kehtarnavaz N, and Terzopoulos D, “Image segmentation using deep learning: A survey,” IEEE Trans. Pattern Anal. Mach. Intell, vol. 44, no. 7, pp. 3523–3542, Jul. 2022. [DOI] [PubMed] [Google Scholar]
  • [49].Ahishakiye E, Van Gijzen MB, Tumwiine J, Wario R, and Obungoloch J, “A survey on deep learning in medical image reconstruction,” Intell. Med, vol. 1, no. 3, pp. 118–127, 2021. [Google Scholar]
  • [50].Kaur A, Singh Y, Neeru N, Kaur L, and Singh A, “A survey on deep learning approaches to medical images and a systematic look up into real-time object detection,” Arch. Comput. Methods Eng, vol. 29, no. 4, pp. 2071–2111, Jun. 2022. [Google Scholar]
  • [51].Ganatra N, “A comprehensive study of applying object detection methods for medical image analysis,” in Proc. 8th Int. Conf. Comput. Sustain. Global Develop. (INDIACom), Mar. 2021, pp. 821–826. [Google Scholar]
  • [52].Yadav SS and Jadhav SM, “Deep convolutional neural network based medical image classification for disease diagnosis,” J. Big Data, vol. 6, no. 1, p. 113, Dec. 2019. [Google Scholar]
  • [53].Doi K, “Computer-aided diagnosis in medical imaging: Historical review, current status and future potential,” Computerized Med. Imag. Graph, vol. 31, nos. 4–5, pp. 198–211, Jun. 2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [54].Cohen AD, Chen Z, Parker Jones O, Niu C, and Wang Y, “Regression-based machine-learning approaches to predict task activation using resting-state fMRI,” Human Brain Mapping, vol. 41, no. 3, pp. 815–826, Feb. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [55].Varoquaux G. and Cheplygina V, “Machine learning for medical imaging: Methodological failures and recommendations for the future,” npj Digit. Med, vol. 5, no. 1, p. 48, Apr. 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [56].Zech JR, Badgeley MA, Liu M, Costa AB, Titano JJ, and Oermann EK, “Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study,” PLOS Med., vol. 15, no. 11, Nov. 2018, Art. no. e1002683. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [57].Finlayson SG, Subbaswamy A, Singh K, Bowers J, Kupke A, Zittrain J, Kohane IS, and Saria S, “The clinician and dataset shift in artificial intelligence,” New England J. Med, vol. 385, no. 3, pp. 283–286, Jul. 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [58].Caton S. and Haas C, “Fairness in machine learning: A survey,” ACM Comput. Surv, vol. 56, no. 7, p. 38, Apr. 2024, Art. no. 166, doi: 10.1145/3616865. [DOI] [Google Scholar]
  • [59].Puyol-Antón E, Ruijsink B, Mariscal Harana J, Piechnik SK, Neubauer S, Petersen SE, Razavi R, Chowienczyk P, and King AP, “Fairness in cardiac magnetic resonance imaging: Assessing sex and racial bias in deep learning-based segmentation,” Frontiers Cardiovascular Med., vol. 9, Apr. 2022, Art. no. 859310. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [60].Ioannou S, Chockler H, Hammers A, and King AP, “A study of demographic bias in CNN-based brain MR segmentation,” in Proc. Int. Workshop Mach. Learn. Clin. Neuroimag. Switzerland: Springer, 2022, pp. 13–22. [Google Scholar]
  • [61].Abbasi-Sureshjani S, Raumanns R, Michels BEJ, Schouten G, and Cheplygina V, “Risk of training diagnostic algorithms on data with demographic bias,” in Proc. Interpretable Annotation-Efficient Learn. Med. Image Comput., 3rd Int. Workshop, iMIMIC, 2nd Int. Workshop, MIL3ID, 5th Int. Workshop, LABELS, Held Conjunct (MICCAI). Lima, Peru: Springer, 2020, pp. 183–192. [Google Scholar]
  • [62].Ross LN, “Causal concepts in biology: How pathways differ from mechanisms and why it matters,” Brit. J. Philosophy Sci, vol. 72, no. 1, pp. 131–158, Mar. 2021. [Google Scholar]
  • [63].Wulczyn E, Steiner DF, Moran M, Plass M, Reihs R, Tan F, Flament-Auvigne I, Brown T, Regitnig P, Chen P-HC, Hegde N, Sadhwani A, MacDonald R, Ayalew B, Corrado GS, Peng LH, Tse D, Müller H, Xu Z, Liu Y, Stumpe MC, Zatloukal K, and Mermel GH, “Interpretable survival prediction for colorectal cancer using deep learning,” npj Digital Medicine, vol. 4, no. 1, p. 71, Apr. 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [64].L’Imperio V, Wulczyn E, Plass M, Müller H, Tamini N, Gianotti L, Zucchini N, Reihs R, Corrado GS, Webster DR, Peng LH, Chen P-H-C, Lavitrano M, Liu Y, Steiner DF, Zatloukal K, and Pagni F, “Pathologist validation of a machine learning–derived feature for colon cancer risk stratification,” JAMA Netw. Open, vol. 6, no. 3, Mar. 2023, Art. no. e2254891. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [65].Cheng C-T, Chen C-C, Fu C-Y, Chaou C-H, Wu Y-T, Hsu C-P, Chang C-C, Chung I-F, Hsieh C-H, Hsieh M-J, and Liao C-H, “Artificial intelligence-based education assists medical students’ interpretation of hip fracture,” Insights into Imag., vol. 11, no. 1, pp. 1–8, Dec. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [66].Rudie JD, Duda J, Duong MT, Chen P-H, Xie L, Kurtz R, Ware JB, Choi J, Mattay RR, Botzolakis EJ, Gee JC, Bryan RN, Cook TS, Mohan S, Nasrallah IM, and Rauschecker AM, “Brain MRI deep learning and Bayesian inference system augments radiology resident performance,” J. Digit. Imag, vol. 34, no. 4, pp. 1049–1058, Aug. 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [67].Shen MW, “Trust in AI: Interpretability is not necessary or sufficient, while black-box interaction is necessary and sufficient,” 2022, arXiv:2202.05302. [Google Scholar]
  • [68].Ribeiro MT, Singh S, and Guestrin C, “‘Why should i trust you?’: Explaining the predictions of any classifier,” in Proc. 22nd ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining, San Francisco, CA, USA, Aug. 2016, pp. 1135–1144. [Google Scholar]
  • [69].McCoy LG, Brenna CTA, Chen SS, Vold K, and Das S, “Believing in black boxes: Machine learning for healthcare does not need explainability to be evidence-based,” J. Clin. Epidemiol, vol. 142, pp. 252–257, Feb. 2022. [DOI] [PubMed] [Google Scholar]
  • [70].Alvarez-Melis D. and Jaakkola TS, “Towards robust interpretability with self-explaining neural networks,” in Proc. Adv. Neural Inf. Process. Syst, 2018, pp. 1–10. [Google Scholar]
  • [71].Monga V, Li Y, and Eldar YC, “Algorithm unrolling: Interpretable, efficient deep learning for signal and image processing,” IEEE Signal Process. Mag, vol. 38, no. 2, pp. 18–44, Mar. 2021. [Google Scholar]
  • [72].Arias Chao M, Kulkarni C, Goebel K, and Fink O, “Fusing physics-based and deep learning models for prognostics,” Rel. Eng. Syst. Saf, vol. 217, Jan. 2022, Art. no. 107961. [Google Scholar]
  • [73].Castro DC, Walker I, and Glocker B, “Causality matters in medical imaging,” Nature Commun., vol. 11, no. 1, p. 3673, Jul. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [74].Dalca AV, Guttag J, and Sabuncu MR, “Anatomical priors in convolutional networks for unsupervised biomedical segmentation,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., Jun. 2018, pp. 9290–9299. [Google Scholar]
  • [75].Gaw N, Hawkins-Daarud A, Hu LS, Yoon H, Wang L, Xu Y, Jackson PR, Singleton KW, Baxter LC, Eschbacher J, Gonzales A, Nespodzany A, Smith K, Nakaji P, Mitchell JR, Wu T, Swanson KR, and Li J, “Integration of machine learning and mechanistic models accurately predicts variation in cell density of glioblastoma using multiparametric MRI,” Sci. Rep, vol. 9, no. 1, p. 10063, Jul. 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [76].Pisov M, Goncharov M, Kurochkina N, Morozov S, Gombolevskiy V, Chernina V, Vladzymyrskyy A, Zamyatina K, Chesnokova A, Pronin I, Shifrin M, and Belyaev M, “Incorporating task-specific structural knowledge into CNNs for brain midline shift detection,” in Interpretability of Machine Intelligence in Medical Image Computing and Multimodal Learning for Clinical Decision Support. Switzerland: Springer, 2019, pp. 30–38. [Google Scholar]
  • [77].Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RGPM, Granton P, Zegers CML, Gillies R, Boellard R, Dekker A, and Aerts HJWL, “Radiomics: Extracting more information from medical images using advanced feature analysis,” Eur. J. Cancer, vol. 48, no. 4, pp. 441–446, Mar. 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [78].Gillies RJ, Kinahan PE, and Hricak H, “Radiomics: Images are more than pictures, they are data,” Radiology, vol. 278, no. 2, pp. 563–577, Feb. 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [79].Lambin P, Leijenaar RTH, Deist TM, Peerlings J, de Jong EEC, van Timmeren J, Sanduleanu S, Larue RTHM, Even AJG,Jochems A, van Wijk Y, Woodruff H, van Soest J, Lustberg T, Roelofs E, van Elmpt W, Dekker A, Mottaghy FM, Wildberger JE, and Walsh S, “Radiomics: The bridge between medical imaging and personalized medicine,” Nature Rev. Clin. Oncol, vol. 14, no. 12, pp. 749–762, Dec. 2017. [DOI] [PubMed] [Google Scholar]
  • [80].Rauschecker AM, Rudie JD, Xie L, Wang J, Duong MT, Botzolakis EJ, Kovalovich AM, Egan J, Cook TC, Bryan RN, Nasrallah IM, Mohan S, and Gee JC, “Artificial intelligence system approaching neuroradiologist-level differential diagnosis accuracy at brain MRI,” Radiology, vol. 295, no. 3, pp. 626–637, Jun. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [81].Bera K, Braman N, Gupta A, Velcheti V, and Madabhushi A, “Predicting cancer outcomes with radiomics and artificial intelligence in radiology,” Nature Rev. Clin. Oncol, vol. 19, no. 2, pp. 132–146, Feb. 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [82].Friedman JH, “Greedy function approximation: A gradient boosting machine,” Ann. Statist, vol. 29, no. 5, pp. 1189–1232, Oct. 2001. [Google Scholar]
  • [83].Goldstein A, Kapelner A, Bleich J, and Pitkin E, “Peeking inside the black box: Visualizing statistical learning with plots of individual conditional expectation,” J. Comput. Graph. Statist, vol. 24, no. 1, pp. 44–65, Jan. 2015. [Google Scholar]
  • [84].Lundberg SM and Lee S-I, “A unified approach to interpreting model predictions,” in Proc. Adv. Neural Inf. Process. Syst, 2017, pp. 1–10. [Google Scholar]
  • [85].Bang M, Eom J, An C, Kim S, Park YW, Ahn SS, Kim J, Lee S-K, and Lee S-H, “An interpretable multiparametric radiomics model for the diagnosis of schizophrenia using magnetic resonance imaging of the corpus callosum,” Transl. Psychiatry, vol. 11, no. 1, p. 462, Sep. 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [86].Zhang H, Chen H, Zhang C, Cao A, Lu Q, Wu H, Zhang J, and Geng D, “A radiomics feature-based machine learning models to detect brainstem infarction (RMEBI) may enable early diagnosis in non-contrast enhanced CT,” Eur. Radiol, vol. 33, no. 2, pp. 1004–1014, Sep. 2022. [DOI] [PubMed] [Google Scholar]
  • [87].Zhang S, “Partial dependence of breast tumor malignancy on ultrasound image features derived from boosted trees,” J. Electron. Imag, vol. 19, no. 2, Apr. 2010, Art. no. 023004. [Google Scholar]
  • [88].Karaman BK, Mormino EC, and Sabuncu MR, “Machine learning based multi-modal prediction of future decline toward Alzheimer’s disease: An empirical study,” PLoS ONE, vol. 17, no. 11, Nov. 2022, Art. no. e0277322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [89].van der Maaten L. and Hinton G, “Visualizing data using t-SNE,” J. Mach. Learn. Res, vol. 9, no. 86, pp. 2579–2605, 2008. [Google Scholar]
  • [90].McInnes L, Healy J, and Melville J, “UMAP: Uniform manifold approximation and projection for dimension reduction,” 2020, arXiv:1802.03426. [Google Scholar]
  • [91].Zhang Z, Xie Y, Xing F, McGough M, and Yang L, “MDNet: A semantically and visually interpretable medical image diagnosis network,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), Jul. 2017, pp. 3549–3557. [Google Scholar]
  • [92].Böhle M, Fritz M, and Schiele B, “Convolutional dynamic alignment networks for interpretable classifications,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2021, pp. 10024–10033. [Google Scholar]
  • [93].Sun S, Woerner S, Maier A, Koch LM, and Baumgartner CF, “Inherently interpretable multi-label classification using class-specific counterfactuals,” 2023, arXiv:2303.00500. [Google Scholar]
  • [94].Zhou B, Khosla A, Lapedriza A, Oliva A, and Torralba A, “Learning deep features for discriminative localization,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), Las Vegas, NV, USA, 2016, pp. 2921–2929. [Google Scholar]
  • [95].Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, and Batra D, “Grad-CAM: Visual explanations from deep networks via gradient-based localization,” Int. J. Comput. Vis, vol. 128, no. 2, pp. 336–359, Feb. 2020. [Google Scholar]
  • [96].Lea Draelos R. and Carin L, “Use HiResCAM instead of grad-CAM for faithful explanations of convolutional neural networks,” 2020, arXiv:2011.08891. [Google Scholar]
  • [97].Feng X, Provenzano FA, and Small SA, “A deep learning MRI approach outperforms other biomarkers of prodromal Alzheimer’s disease,” Alzheimer’s Res. Therapy, vol. 14, p. 45, Mar. 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [98].Lee Y-W, Huang S-K, and Chang R-F, “CheXGAT: A disease correlation-aware network for thorax disease diagnosis from chest X-ray images,” Artif. Intell. Med, vol. 132, Oct. 2022, Art. no. 102382. [DOI] [PubMed] [Google Scholar]
  • [99].Draelos RL and Carin L, “Explainable multiple abnormality classification of chest CT volumes,” Artif. Intell. Med, vol. 132, Oct. 2022, Art. no. 102372. [DOI] [PubMed] [Google Scholar]
  • [100].Kim H. and Sabuncu MR, “Learning to compare longitudinal images,” 2023, arXiv:2304.02531. [Google Scholar]
  • [101].Yan Y, Kawahara J, and Hamarneh G, “Melanoma recognition via visual attention,” in Proc. Inf. Process. Med. Imag., 26th Int. Conf. (IPMI). Hong Kong: Springer, 2019, pp. 793–804. [Google Scholar]
  • [102].Yang H, Kim J-Y, Kim H, and Adhikari SP, “Guided soft attention network for classification of breast cancer histopathology images,” IEEE Trans. Med. Imag, vol. 39, no. 5, pp. 1306–1315, May 2020. [DOI] [PubMed] [Google Scholar]
  • [103].Yin C, Imms P, Cheng M, Amgalan A, Chowdhury NF, Massett RJ, Chaudhari NN, Chen X, Thompson PM, Bogdan P, and Irimia A, “Anatomically interpretable deep learning of brain age captures domain-specific cognitive impairment,” Proc. Nat. Acad. Sci. USA, vol. 120, no. 2, 2023, Art. no. e2214634120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [104].Kolodner J,” Case-Based Reasoning. San Mateo, CA, USA: Morgan Kaufmann, 2014. [Google Scholar]
  • [105].Snell J, Swersky K, and Zemel R, “Prototypical networks for few-shot learning,” in Proc. Adv. Neural Inf. Process. Syst, 2017, pp. 1–11. [Google Scholar]
  • [106].Chen C, Li O, Tao D, Barnett A, Rudin C, and Su JK, “This looks like that: Deep learning for interpretable image recognition,” in Proc. Adv. Neural Inf. Process. Syst, 32, 2019. [Google Scholar]
  • [107].Kim E, Kim S, Seo M, and Yoon S, “XProtoNet: Diagnosis in chest radiography with global and local explanations,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2021, pp. 15719–15728. [Google Scholar]
  • [108].Wang AQ and Sabuncu MR, “A flexible Nadaraya-Watson head can offer explainable and calibrated classification,” Trans. Mach. Learn. Res, pp. 2835–8856, 2023. [Online]. Available: https://openreview.net/forum?id=iEq6lhG4O3 [Google Scholar]
  • [109].Lamy J-B, Sekar B, Guezennec G, Bouaud J, and Séroussi B, “Explainable artificial intelligence for breast cancer: A visual case-based reasoning approach,” Artif. Intell. Med, vol. 94, pp. 42–53, Mar. 2019. [DOI] [PubMed] [Google Scholar]
  • [110].Hu B, Vasu B, and Hoogs A, “X-MIR: EXplainable medical image retrieval,” in Proc. IEEE/CVF Winter Conf. Appl. Comput. Vis. (WACV), Jan. 2022, pp. 440–450. [Google Scholar]
  • [111].Cook RD and Weisberg S, Residuals and Influence in Regression. Sydney, NSW, Australia: JMLR.org, 1982. [Google Scholar]
  • [112].Koh PW and Liang P, “Understanding black-box predictions via influence functions,” in Proc. Int. Conf. Mach. Learn., 2017, pp. 1–10. [Google Scholar]
  • [113].Wang CJ, Hamm CA, Savic L, Ferrante M, Schobert I, Schlachter T, Lin M, Weinreb JC, Duncan JS, Chapiro J, and Letzen B, “Deep learning for liver tumor diagnosis Part II: Convolutional neural network interpretation using radiologic imaging features,” Eur. Radiol, vol. 29, no. 7, pp. 3348–3357, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [114].Wachter S, Mittelstadt B, and Russell C, “Counterfactual explanations without opening the black box: Automated decisions and the GDPR,” Harvard J. Law Technol, vol. 31, no. 2, pp. 841–887, 2018. [Google Scholar]
  • [115].Nemirovsky D, Thiebaut N, Xu Y, and Gupta A, “CounteRGAN: Generating realistic counterfactuals with residual generative adversarial nets,” 2021, arXiv:2009.05199. [Google Scholar]
  • [116].Zhou S, Islam UJ, Pfeiffer N, Banerjee I, Patel BK, and Iquebal AS, “SCGAN: Sparse CounterGAN for counterfactual explanations in breast cancer prediction,” IEEE Trans. Automat. Sci. Eng, pp. 1–12, 2023, doi: 10.1109/TASE.2023.3333788. [DOI] [Google Scholar]
  • [117].Sun S, Koch LM, and Baumgartner CF, “Right for the wrong reason: Can interpretable ML techniques detect spurious correlations?” 2023, arXiv:2307.12344. [Google Scholar]
  • [118].Schutte K, Moindrot O, Hérent P, Schiratti J-B, and Jégou S, “Using StyleGAN for visual interpretability of deep learning models on medical images,” 2021, arXiv:2101.07563. [Google Scholar]
  • [119].Baumgartner CF, Koch LM, Tezcan KC, Ang JX, and Konukoglu E, “Visual feature attribution using Wasserstein GANs,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., Jun. 2018, pp. 8309–8319. [Google Scholar]
  • [120].Szegedy C, Zaremba W, Sutskever I, Bruna J, Erhan D, Goodfellow I, and Fergus R, “Intriguing properties of neural networks,” 2013, arXiv:1312.6199. [Google Scholar]
  • [121].Du-Harpur X, Arthurs C, Ganier C, Woolf R, Laftah Z, Lakhan M, Salam A, Wan B, Watt FM, Luscombe NM, and Lynch MD, “Clinically relevant vulnerabilities of deep machine learning systems for skin cancer diagnosis,” J. Investigative Dermatol, vol. 141, no. 4, pp. 916–920, Apr. 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [122].Paul R, Schabath M, Gillies R, Hall L, and Goldgof D, “Mitigating adversarial attacks on medical image understanding systems,” in Proc. IEEE 17th Int. Symp. Biomed. Imag. (ISBI), Apr. 2020, pp. 1517–1521. [Google Scholar]
  • [123].Goodfellow IJ, Shlens J, and Szegedy C, “Explaining and harnessing adversarial examples,” 2014, arXiv:1412.6572. [Google Scholar]
  • [124].Su J, Vargas DV, and Sakurai K, “One pixel attack for fooling deep neural networks,” IEEE Trans. Evol. Comput, vol. 23, no. 5, pp. 828–841, Oct. 2019. [Google Scholar]
  • [125].Zeiler MD and Fergus R, Visualizing and Understanding Convolutional Networks. Switzerland: Springer, 2013. [Google Scholar]
  • [126].Koh PW, Nguyen T, Tang YS, Mussmann S, Pierson E, Kim B, and Liang P, “Concept bottleneck models,” in Proc. Int. Conf. Mach. Learn., 2020, pp. 5338–5348. [Google Scholar]
  • [127].Santurkar S, Tsipras D, Elango M, Bau D, Torralba A, and Madry A, “Editing a classifier by rewriting its prediction rules,” in Proc. Adv. Neural Inf. Process. Syst, vol. 34, 2021, pp. 23359–23373. [Google Scholar]
  • [128].Patrício C, Neves JC, and Teixeira LF, “Coherent concept-based explanations in medical image and its application to skin lesion diagnosis,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR) Workshops, Jun. 2023, pp. 3799–3808. [Google Scholar]
  • [129].Chen Z, Bei Y, and Rudin C, “Concept whitening for interpretable image recognition,” Nature Mach. Intell, vol. 2, no. 12, pp. 772–782, Dec. 2020. [Google Scholar]
  • [130].Patrício C, Neves JC, and Teixeira LF, “Coherent concept-based explanations in medical image and its application to skin lesion diagnosis,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., Jun. 2023, pp. 3798–3807. [Google Scholar]
  • [131].Yuksekgonul M, Wang M, and Zou J, “Post-hoc concept bottleneck models,” 2022, arXiv:2205.15480. [Google Scholar]
  • [132].Klimiene U, Marcinkevičs R, Wolfertstetter PR, Elsen EÖ, Paschke A, Niederberger D, Wellmann S, Knorr C, and Vogt JE, “Multiview concept bottleneck models applied to diagnosing pediatric appendicitis,” in Proc. 2nd Workshop Interpretable Mach. Learn. Healthcare (IMLH), 2022, pp. 1–16. [Google Scholar]
  • [133].Chauhan K, Tiwari R, Freyberg J, Shenoy P, and Dvijotham K, “Interactive concept bottleneck models,” in Proc. AAAI Conf. Artif. Intell, vol. 37, 2023, pp. 5948–5955. [Google Scholar]
  • [134].Kim B, Wattenberg M, Gilmer J, Cai C, Wexler J, Viegas F, and Sayres R, “Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (TCAV),” in Proc. 35th Int. Conf. Mach. Learn., Jul. 2018, pp. 2668–2677. [Google Scholar]
  • [135].Lee H, Kim ST, and Ro YM, “Generation of multimodal justification using visual word constraint model for explainable computer-aided diagnosis,” in Interpretability of Machine Intelligence in Medical Image Computing and Multimodal Learning for Clinical Decision Support. Switzerland: Springer, 2019, pp. 21–29. [Google Scholar]
  • [136].Baker RE, Peña J-M, Jayamohan J, and Jérusalem A, “Mechanistic models versus machine learning, a fight worth fighting for the biological community?” Biol. Lett, vol. 14, no. 5, May 2018, Art. no. 20170660. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [137].Oktay O, Ferrante E, Kamnitsas K, Heinrich M, Bai W, Caballero J, Cook SA, de Marvao A, Dawes T, O’Regan DP, Kainz B, Glocker B, and Rueckert D, “Anatomically constrained neural networks (ACNNs): Application to cardiac image enhancement and segmentation,” IEEE Trans. Med. Imag, vol. 37, no. 2, pp. 384–395, Feb. 2018. [DOI] [PubMed] [Google Scholar]
  • [138].He Y, Yang G, Ge R, Chen Y, Coatrieux J-L, Wang B, and Li S, “Geometric visual similarity learning in 3D medical image self-supervised pre-training,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit, Jun. 2023, pp. 9538–9547. [Google Scholar]
  • [139].Pearl J, Causality, 2nd ed. Cambridge, U.K.: Cambridge Univ. Press, 2009. [Google Scholar]
  • [140].Vlontzos A, Rueckert D, and Kainz B, “A review of causality for learning algorithms in medical image analysis,” 2022, arXiv:2206.05498. [Google Scholar]
  • [141].Oh K, Kang HM, Leem D, Lee H, Seo KY, and Yoon S, “Early detection of diabetic retinopathy based on deep learning and ultra-wide-field fundus images,” Sci. Rep, vol. 11, no. 1, p. 1897, Jan. 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [142].Beede E, Baylor E, Hersch F, Iurchenko A, Wilcox L, Ruamviboonsuk P, and Vardoulakis LM, “A human-centered evaluation of a deep learning system deployed in clinics for the detection of diabetic retinopathy,” in Proc. CHI Conf. Hum. Factors Comput. Syst., New York, NY, USA, 2020, pp. 1–12. [Google Scholar]
  • [143].Alyoubi WL, Shalash WM, and Abulkhair MF, “Diabetic retinopathy detection through deep learning techniques: A review,” Informat. Med. Unlocked, vol. 20, 2020, Art. no. 100377. [Google Scholar]
  • [144].Nasser M. and Yusof UK, “Deep learning based methods for breast cancer diagnosis: A systematic review and future direction,” Diagnostics, vol. 13, no. 1, p. 161, Jan. 2023, doi: 10.3390/diagnostics13010161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [145].Ma M, Liu R, Wen C, Xu W, Xu Z, Wang S, Wu J, Pan D, Zheng B, Qin G, and Chen W, “Predicting the molecular subtype of breast cancer and identifying interpretable imaging features using machine learning algorithms,” Eur. Radiol, vol. 32, no. 3, pp. 1652–1662, Mar. 2022, doi: 10.1007/s00330-021-08271-4. [DOI] [PubMed] [Google Scholar]
  • [146].Spak DA, Plaxco JS, Santiago L, Dryden MJ, and Dogan BE, “BI-RADS fifth edition: A summary of changes,” Diagnostic Interventional Imag., vol. 98, no. 3, pp. 179–190, Mar. 2017, doi: 10.1016/j.diii.2017.01.001. [DOI] [PubMed] [Google Scholar]
  • [147].Karatza P, Dalakleidi K, Athanasiou M, and Nikita KS, “Interpretability methods of machine learning algorithms with applications in breast cancer diagnosis,” in Proc. 43rd Annu. Int. Conf. IEEE Eng. Med. Biol. Soc. (EMBC; ), Nov. 2021, pp. 2310–2313, doi: 10.1109/EMBC46164.2021.9630556. [DOI] [PubMed] [Google Scholar]
  • [148].Gu Z, Jamison K, Sabuncu M, and Kuceyeski A, “Personalized visual encoding model construction with small data,” Commun. Biol, vol. 5, no. 1, p. 1382, Dec. 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [149].S. R. Oota, M. Gupta, R. S. Bapi, G. Jobard, F. Alexandre, and X. Hinaut, “ Deep neural networks and brain alignmenT: Brain encoding and decoding (Survey),” 2023, arXiv:2307.10246. [Google Scholar]
  • [150].Wen H, Shi J, Zhang Y, Lu K-H, Cao J, and Liu Z, “Neural encoding and decoding with deep learning for dynamic natural vision,” Cerebral Cortex, vol. 28, no. 12, pp. 4136–4160, Dec. 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [151].Jacovi A. and Goldberg Y, “Towards faithfully interpretable NLP systems: How should we define and evaluate faithfulness?” in Proc. 58th Annu. Meeting Assoc. Comput. Linguistics, Jurafsky D, Chai J, Schluter N, and Tetreault J, Eds. PA, USA: Association for Computational Linguistics, Jul. 2020, pp. 4198–4205. [Google Scholar]
  • [152].Dasgupta S, Frost N, and Moshkovitz M, “Framework for evaluating faithfulness of local explanations,” in Proc. Int. Conf. Mach. Learn., 2022, pp. 1–22. [Google Scholar]
  • [153].Amorim JP, Abreu PH, Santos J, Cortes M, and Vila V, “Evaluating the faithfulness of saliency maps in explaining deep learning models using realistic perturbations,” Inf. Process. Manage, vol. 60, no. 2, Mar. 2023, Art. no. 103225. [Google Scholar]
  • [154].Dietvorst BJ, Simmons JP, and Massey C, “Algorithm aversion: People erroneously avoid algorithms after seeing them err,” J. Exp. Psychol., Gen, vol. 144, no. 1, pp. 114–126, 2015. [DOI] [PubMed] [Google Scholar]
  • [155].Goddard K, Roudsari A, and Wyatt JC, “Automation bias: A systematic review of frequency, effect mediators, and mitigators,” J. Amer. Med. Inform. Assoc, vol. 19, no. 1, pp. 121–127, Jan. 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [156].Parasuraman R. and Manzey DH, “Complacency and bias in human use of automation: An attentional integration,” Hum. Factors: J. Hum. Factors Ergonom. Soc, vol. 52, no. 3, pp. 381–410, Jun. 2010. [DOI] [PubMed] [Google Scholar]
  • [157].Babic B, Gerke S, Evgeniou T, and Cohen IG, “Beware explanations from AI in health care,” Science, vol. 373, no. 6552, pp. 284–286, Jul. 2021. [DOI] [PubMed] [Google Scholar]
  • [158].Obermeyer Z. and Lee TH, “Lost in thought—The limits of the human mind and the future of medicine,” New England J. Med, vol. 377, no. 13, pp. 1209–1211, Sep. 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [159].Olah C, Cammarata N, Schubert L, Goh G, Petrov M, and Carter S, “oom in: An introduction to circuits,” Distill, vol. 5, no. 3, 2020, Art. no. e00024. [Online]. Available: https://distill.pub/2020/circuits/zoom-in [Google Scholar]
  • [160].Elhage N, Nanda N, Olsson C, Henighan T, Joseph N, Mann B, Askell A, Bai Y, Chen A, Conerly T, DasSarma N, Drain D, Ganguli D, Hatfield-Dodds Z, Hernandez D, Jones A, Kernion J, Lovitt L, Ndousse K, Amodei D, Brown T, Clark J, Kaplan J, McCandlish S, and Olah C, “A mathematical framework for transformer circuits,” in Transformer Circuits Thread, 2021. [Online]. Available: https://transformer-circuits.pub/2021/framework/index.html [Google Scholar]

RESOURCES