Patterns. 2022 Feb 24;3(4):100455. doi: 10.1016/j.patter.2022.100455

How transparency modulates trust in artificial intelligence

John Zerilli 1, Umang Bhatt 2,3, Adrian Weller 2,3
PMCID: PMC9023880  PMID: 35465233

Summary

The study of human-machine systems is central to a variety of behavioral and engineering disciplines, including management science, human factors, robotics, and human-computer interaction. Recent advances in artificial intelligence (AI) and machine learning have brought the study of human-AI teams into sharper focus. An important set of questions for those designing human-AI interfaces concerns trust, transparency, and error tolerance. Here, we review the emerging literature on this important topic, identify open questions, and discuss some of the pitfalls of human-AI team research. We present opposition (extreme algorithm aversion or distrust) and loafing (extreme automation complacency or bias) as lying at opposite ends of a spectrum, with algorithmic vigilance representing an ideal mid-point. We suggest that, while transparency may be crucial for facilitating appropriate levels of trust in AI and thus for counteracting aversive behaviors and promoting vigilance, transparency should not be conceived solely in terms of the explainability of an algorithm. Dynamic task allocation, as well as the communication of confidence and performance metrics—among other strategies—may ultimately prove more useful to users than explanations from algorithms and significantly more effective in promoting vigilance. We further suggest that, while both aversive and appreciative attitudes are detrimental to optimal human-AI team performance, strategies to curb aversion are likely to be more important in the longer term than those attempting to mitigate appreciation. Our wider aim is to channel disparate efforts in human-AI team research into a common framework and to draw attention to the ecological validity of results in this field.

Keywords: artificial intelligence, machine learning, human-computer interaction, human-AI teams, human factors, transparency, explainable AI, trust

The bigger picture

Recent advances in artificial intelligence (AI) and machine learning have brought the study of human-AI (HAI) teams into sharper focus. An important set of questions for those designing HAI interfaces concerns trust—specifically, human trust in the AI systems with which they form teams. We review the literature on how perceiving an AI making mistakes violates trust and how such violations might be repaired. In doing so, we discuss the role played by various forms of algorithmic transparency in the process of trust repair, including explanations of algorithms, uncertainty estimates, and performance metrics.


It is important to explore alternative forms of transparency besides "explainable AI" in machine learning, since there is emerging evidence that some of these are better at promoting optimal human-AI team performance. We review these alternatives and discuss their relative performance. In addition, we note a tendency to overgeneralize the applicability of results in this area. We call for the adoption of a task-specific (as opposed to domain-specific) paradigm, using examples from human factors and ergonomics.

Introduction

The study of human-machine systems is central to a variety of behavioral and engineering disciplines, including management science,1, 2, 3 human factors,4, 5, 6, 7 robotics,8, 9, 10, 11, 12, 13 and human-computer interaction.14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 Recent advances in artificial intelligence (AI) and machine learning have brought the study of human-AI (HAI) teams into sharper focus. An important set of questions for those designing HAI interfaces concerns trust: specifically, human trust in the algorithmic systems with which they form teams. Trust in machines has been defined as “the attitude that an agent will help achieve an individual’s goals in a situation characterized by uncertainty and vulnerability.”17,25 More precisely, trust is “a psychological state comprising the intention to accept vulnerability based on positive expectations of the intentions or behavior of another.”26 Trust is therefore a subjective attitude and attribute of the vulnerable party, to be distinguished from trustworthiness, which is an objective attribute of the trustee. Just as human collaboration would be impossible without some degree of trust between team members, some form of trust in algorithmic systems is necessary for HAI teams to perform smoothly and effectively. It follows too that if trust is ever violated, its repair will be crucial in any attempt to rehabilitate team performance.

Here, we briefly review the literature on how perceiving an AI make mistakes violates trust and how such violations might be repaired. In doing so, we discuss the role played by various forms of algorithmic transparency in the process of trust repair. We then identify and discuss two important questions left open in this literature: first, concerning what effects the size, frequency, type, and distribution of errors have in the violation and repair of trust, and second, concerning how various forms of transparency—in particular explanations of algorithms, confidence and performance metrics, and dynamic allocation strategies—fare comparatively in the process of trust repair. We suggest that while transparency may be crucial for facilitating trust in AI and thus for counteracting aversive behaviors, transparency should not be conceived solely in terms of explainability. Our final section discusses some of the pitfalls of HAI team research. In particular, we worry that the ecological validity of results in this field is not sufficiently appreciated—at least in practice.

We should lodge three important caveats at the outset. The first concerns the nature of the trust in question, given that trust is, in the first instance, an interpersonal attitude between humans, not between humans and machines. Interpersonal trust has been the subject of investigation in organizational and social psychology for several decades,27, 28, 29, 30, 31, 32, 33 and in these fields, trust is understood to be influenced by at least two factors: (1) the competence of the trustee and (2) the degree to which the trustee exhibits good faith/benevolent intentions—e.g., in a contractual setting, the desire to support the other party’s efforts in performing the contract—but, more generally, the absence of ill will or ulterior motives in the trustee.29, 30, 31, 32, 33 Recast into language more appropriate for artificial agents, we can take competence to denote a system’s accuracy and good faith to denote a system’s transparency, as judged by a range of criteria including, but not limited to, its explainability. It is true that good faith is not, strictly speaking, the same thing as transparency, and that transparency is often a means of verifying good faith (as well as accuracy). However, it is also true that transparency can itself be an expression of good faith on the trustee’s part, as when someone who is “open” or “forthright” is understood to harbor no ill will or hidden agenda. In other words, while good faith encompasses more than transparency, it often encompasses at least that much. Note also that, throughout this paper, we take transparency to mean any information provided about an AI system beyond its model outputs. By explainability, we mean information that specifically helps to understand how or why a system produced its outputs.34

Secondly, accuracy and transparency are by no means the only antecedents of trust in embedded AIs.35,36 Other important, if less marked, determinants of trust in automation include ergonomic and demographic factors, team size and composition (e.g., in terms of active versus passive users), and task type and complexity.

Finally, we note that the AIs considered in this paper are all examples of what some have termed "embedded AIs," as opposed to AI-enabled virtual agents (e.g., Siri or Alexa) and robots (e.g., Pepper or Roomba).35 Embedded AIs are forms of AI that are "invisible to the user, embedded inside of a computer or other tool" and which thus lack "a visual representation or a distinguished identity."35 Common examples would be smartphone apps, e-mail spam filters, ranking algorithms, and recommender systems. Less obvious examples include business systems and automated decision software (e.g., customer credit rating algorithms, offender recidivism risk tools, etc.).

The effects of error and transparency on trust

In an ideal world, only systems that are trustworthy would be trusted. Distrust may be justified whenever a system performs considerably worse than a human (or human team) acting alone, or whenever a system is opaque or ethically suspect. But distrust is problematic when the distrusting behavior to which it leads—what has been termed algorithm "aversion"—is really an overreaction to having witnessed the system's mistakes.5,14,15,37 In the most extreme case, algorithm aversion results in a refusal to engage with a system at all or a blatant disregard of its recommendations—an attitude we term "opposition."

Conversely, there is such a thing as too much trust—algorithm “appreciation”3—or overtrust, where a human is so impressed by a system that they cease actively monitoring its outputs4,5 and in the limiting case follow its every recommendation without question—an attitude we term “loafing.” As one might have guessed, appreciation is not a problem for systems that pass a very high threshold of accuracy38,39 (see Box 1). Accordingly, the AIs of interest to HAI team research are generally trustworthy in the sense that they are adept at performing a particular task, but not so adept that overtrust ceases to be a problem (cf., Bansal et al.37) and yet not so error prone that algorithm aversion becomes rational. Both aversion and its opposite, appreciation, are inappropriate attitudes toward systems that are generally trustworthy in this sense.4,14

Box 1. Which AIs are the target of human-AI team research?

Human-AI (HAI) team researchers hail from a variety of behavioral and engineering disciplines, including management science, human factors, robotics, and human-computer interaction. HAI team research is concerned with the alleviation of user distrust and overtrust in AI, where such attitudes are likely to impede optimal HAI team performance. An AI that fares worse than a human (or human team) acting alone will rightly arouse distrust. An AI that is vastly superior to a human (or human team) acting alone will unproblematically elicit overtrust. But systems of the first kind are unlikely to be deployed, unless the HAI team deploying them can still outperform humans acting alone, while systems of the second kind are rare in team settings, since humans may be superfluous once a machine can perform so much better than a human (or human team) acting alone.37 That leaves a wide range of AI systems as the focus of HAI team research. Humans acting alone will be better than some of these, but not better than the HAI team comprising them; the rest of these systems will be better than the humans acting alone, but not better than the HAI team comprising them. For many systems in this range, the attitude conducive to optimal HAI team performance will be vigilance, since both aversion and opposition, as well as appreciation and loafing, will impede optimal HAI team performance (see Figure 1 for the meaning of these terms). Is there a way of schematically demarcating the range of such systems? Perhaps surprisingly, no one has ever attempted to specify the class of systems that is the proper target of HAI team research. But without a clear, shared understanding of which systems require vigilance, which do not, and which should not be used at all, investigations into a large array of systems, each having different levels of reliability, make for a cluttered and confusing terrain.

Assume (plausibly) that every human (or human team) interacting with an AI in the specified range will introduce human errors (e.g., Dietvorst et al.15 and Bansal et al.37). Assume further (for simplicity) that all errors are equally significant, be they human or AI. Let the rate at which humans introduce errors be denoted H, and the rate at which humans spot AI errors be denoted S. As we said, either the AI acting alone will fare better than the humans acting alone, but not better than the HAI team; or the humans acting alone will fare better than the AI acting alone, but not better than the HAI team (we ignore the case where humans acting alone can outperform both the AI and the HAI team, as the AI here would simply not be deployed). Then whenever H < S, humans will be spotting more AI errors than the errors they themselves introduce. By contrast, whenever H > S, the AI will fare better than both the humans acting alone and the HAI team, because in this case the humans will be introducing more errors than those they are able to spot in the AI's outputs. Systems performing at or higher than the level at which H > S will be best served by removing humans from the loop altogether.15,37 If for whatever reason humans are kept in the loop, however, the emergence of appreciation and loafing will not be detrimental to HAI team performance.

To illustrate, we plot user trust as a function of system reliability in Figure 3. The plot depicts a well-calibrated user trust function over a range of system performance levels (trust is said to be “well calibrated” when user expectations match system capabilities). Assume that performance at the H < S level marks the point at which a system performs better than a user alone, but not better than an HAI team (e.g., assume that a human alone makes 200 errors, an AI alone makes 100 errors, but that the HAI team will make only 40 errors, because the human spots all 100 AI errors and introduces only 40 of their own for a net total of 40 HAI team errors). As a system’s performance gradually improves on this benchmark, S falls because there are progressively fewer errors for the user to spot (e.g., AI2 will make 99 errors, AI3 will make 98 errors, etc., while—we assume for simplicity—a user will continue to introduce 40 errors). Eventually a system will reach the point at which H = S (e.g., AI61 will make 40 errors). Any system whose performance exceeds this level (i.e., when H > S) will perform better than the HAI team (e.g., AI62 will make 39 errors, but while the human will spot all 39 they will introduce 40 of their own, for a net total of 40 HAI team errors). Thus, when a system performs at the H < S level, vigilance will be the ideal user response. When a system performs at the H > S level, loafing will be the ideal user response.
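The arithmetic above can be made concrete with a minimal sketch (our own illustration, reusing the error counts assumed in the example; the human is assumed to spot every AI error and to introduce a fixed 40 errors of their own):

```python
def hai_team_errors(ai_errors: int, human_introduced: int, human_spotted: int) -> int:
    """Net team errors: AI errors the human fails to spot, plus errors the human introduces."""
    return (ai_errors - human_spotted) + human_introduced

# Illustrative counts from the text: the human introduces 40 errors (H) and,
# for simplicity, spots every AI error (so S equals the AI's error count).
for ai_errors in (100, 99, 40, 39):
    team = hai_team_errors(ai_errors, human_introduced=40, human_spotted=ai_errors)
    print(f"AI alone: {ai_errors:3d} errors | HAI team: {team} errors | "
          f"team strictly better than AI alone: {team < ai_errors}")
```

Running this reproduces the pattern described above: the team beats the AI alone down to the point where the AI makes 40 errors (H = S), and falls behind once the AI makes 39 (H > S).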

We said that systems performing at or higher than the H > S level will be best served by removing humans from the loop altogether. However, this may not be technically, ethically, or politically feasible. In any event, as we noted, the emergence of appreciation and loafing in such cases will not be detrimental to HAI team performance. But to the extent that these systems are not invulnerable to errors that a human might witness, the risk of aversion and opposition will persist. Strategies to mitigate this risk, such as allowing humans to manipulate the algorithm even if doing so may degrade the system's performance,15 are still preferable to giving aversion and opposition free rein (again, so long as the HAI team performs better than the humans acting alone).15

To our knowledge, these various attitudes have never been cast within a single frame of reference. Papers overwhelmingly tend to problematize either overtrust or distrust, failing to appreciate that both phenomena should be understood as part of a broader inquiry into HAI teams, and that any one system can engender any of the above attitudes. Hence, we envisage opposition and loafing as lying at opposite ends of a spectrum, with algorithmic "vigilance" representing an ideal mid-point between them and aversion and appreciation lying mid-way between this ideal and each of the two extremes (Figure 1). Algorithmic vigilance, as we will use the term, is an attitude of active user engagement and healthy skepticism. It marks the level of trust that a human (or human team) should display toward an AI from the point of view of optimal HAI team performance. Confusingly, this attitude is sometimes given the name "complementarity," presumably to indicate that some ideal division of labor has been struck between human and machine, such that humans will focus on tasks too difficult for machines and vice versa.37

Figure 1. Scale of user attitudes toward AI in human-AI teams

But complementarity in this sense may be compatible with human loafing (see Box 1), so we prefer the term vigilance.

What counts as vigilance may differ from case to case depending on the AI under consideration. If vigilance is observed over time t, each non-ideal attitude of trust (v_oppose, v_avert, v_appreciate, and v_loaf) might be modeled as a related function (Figure 2A).

Figure 3. Trust versus reliability

Figure 2. User trust in automation after witnessing system failures

(A) Five possible trust trajectories over time. Notice that the default attitude toward automation is generally one of high trust that falls by some measure in response to seeing a system err. The vigilant user of AI recalibrates their initially unrealistic estimate of a system’s capabilities gradually, but not to the point where their attitude becomes aversive.

(B) The hypothesized role of transparency in trust calibration
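As a purely illustrative sketch of the trajectories described for Figure 2A, one might plot trust curves that start high, drop when an error is witnessed, and then settle at different levels; the functional forms, drop sizes, and recovery rates below are our own assumptions, chosen only to echo the qualitative descriptions above.

```python
import numpy as np
import matplotlib.pyplot as plt

t = np.linspace(0, 10, 500)
t_err = 3.0        # moment the user witnesses a system error
calibrated = 0.6   # trust level that would match the system's true capability

def trajectory(initial, post_error, settle, rate):
    """Start at `initial` trust, drop to `post_error` when the error is seen,
    then approach `settle` exponentially at speed `rate`."""
    trust = np.full_like(t, initial)
    late = t >= t_err
    trust[late] = settle + (post_error - settle) * np.exp(-rate * (t[late] - t_err))
    return trust

curves = {
    "oppose":     trajectory(0.95, 0.05, 0.05, 1.0),       # trust collapses and never recovers
    "avert":      trajectory(0.95, 0.25, 0.35, 1.0),       # settles well below the calibrated level
    "vigilant":   trajectory(0.95, 0.50, calibrated, 0.8), # recalibrates to the system's capability
    "appreciate": trajectory(0.95, 0.80, 0.85, 0.5),       # dips only slightly
    "loaf":       trajectory(0.95, 0.93, 0.95, 0.5),       # barely registers the error
}

for name, curve in curves.items():
    plt.plot(t, curve, label=name)
plt.axvline(t_err, color="gray", linestyle="--")
plt.xlabel("time"); plt.ylabel("trust"); plt.legend(); plt.show()
```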

In human factors engineering and human-computer interaction, overtrust has been extensively researched for close to four decades.4 In human factors, the phenomenon goes by the names of “automation complacency” and “automation bias.”40 Although similar, these effects are not the same. Automation complacency describes the state of passivity, diffidence, or deference into which the user of a system falls when uncritically relying on a technology they deem more proficient than themselves.41 In effect, it is the failure to attend to the possibility that a system may be wrong through failure to seek out either confirmatory or disconfirmatory evidence.7 Automation bias is a more extreme variant of this attitude and manifests when a human user actively prefers a system’s signals over actual—i.e., overtly—contradictory information, including information from more reliable sources such as the user’s own senses.7,41 Crucially, it is the perception of a system’s superior performance that induces these states: they are rarely observed when a system is considered liable to even occasional error.5,7,42, 43, 44

By contrast, algorithm aversion has not been nearly as well researched or theorized. But some results are notable. Users of AI in many lab-based settings have been shown to display unrealistically high levels of trust initially, only for that trust to drop precipitously in response to seeing a system err.5,14,15 Users then typically retreat to human judgment, even when doing so leads demonstrably to even more errors.5,14,15 For example, during an incentivized task, when given the choice between relying on their own judgment exclusively or relying on an algorithm’s forecasts exclusively, most participants who had not seen the algorithm perform chose to rely on the algorithm exclusively, while most of those who had seen the algorithm perform (and hence err) chose to rely on human judgment, despite observing the algorithm’s better performance.14 It has been suggested that this effect is greater for obvious errors than for subtle ones, because obvious errors can quite drastically upset a user’s initially high expectations of a system’s competence.5 Moreover, a user’s expertise can affect their perception of machine errors.3 Users who are expert or self-confident in tasks that have been delegated to automation tend to ignore machine advice45 and, as a result, make less-accurate predictions relative to lay people willing to follow machine advice.5,14,15

The pattern of trust → error → distrust, in which trust becomes difficult to restore despite impressive system performance, could be explained by users' “diminishing sensitivity to error.” Over the course of five studies, Dietvorst and Bharti46 found that participants displayed error intolerance when confronted with decision makers that were highly reliable on average but incapable of perfect forecasts, and error tolerance when confronted with decision makers that were less reliable on average but that had at least a chance of making near-perfect forecasts. If users have diminishing sensitivity to error, it would plausibly explain why AIs that make even a single error are penalized so harshly: users' hopes for near-perfect automated forecasting having thus been dashed, the more volatile and error-prone decision-making option (human judgment) suddenly looks like the most appealing one (human forecasters can at least stumble on near-perfect forecasts after all). In any event, errors seem to have a stronger impact on trust than correct outputs.5,7 This phenomenon is indeed so pronounced that cumulative feedback about a system’s superior performance presented at the end of a task session may not be enough to counteract users' misgivings after having had their expectations disappointed over the course of a task session.5

Curiously, while higher levels of trust generally lead to greater reliance, trust and reliance do not always move together. An untrustworthy system may rightly arouse distrust (measured subjectively by self-evaluation and report) and yet continue to be relied upon (judging by actual usage data).5,7,47 The converse of this situation has also been observed, so that even when the subjective feeling of trust eventually recovered after witnessing a system failure, immediate post-failure behavior (e.g., scrupulous cross-checking) did not revert to the pre-failure norm.7

In the remainder of this section, we single out four categories of transparency—explanations, performance metrics, dynamic allocation strategies, and confidence information—that we think have special significance in HAI team coordination.

Explanation as a form of transparency

No doubt the most pertinent form of transparency is explanation, which can enhance a user's understanding of how an algorithm works and hence why it might commit the sorts of errors it does.5,37,48 While important, and a well-attested means of establishing appropriate levels of trust,49 explanations can easily backfire. Some explanations of AI systems, for example, appear to induce automation complacency.5,37,50 Feature importance explanations—which identify which input features exert the most influence on a model's outputs—are particularly prone to misleading users in this regard,51, 52, 53 although similar example-based explanation methods have, admittedly, been shown to be conducive to HAI team performance.54,55 In the same vein, when explanations are provided before users are in a position to assess a situation for themselves, users may be led to anchor on the first data they receive, conditioning subsequent deliberation.37 More perversely, "too much transparency can cause people to incorrectly follow a model when it makes a mistake, due to information overload."24 On other occasions, poor or confusing explanations can lead to algorithm aversion.24
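For concreteness, the following sketch produces one common kind of feature-importance explanation, permutation importance, using scikit-learn on a stock dataset. It is an illustrative stand-in rather than the specific method evaluated in the studies cited above; the point is simply to show the sort of ranked feature scores that end up in front of users.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

# Train a stand-in model; any classifier with a .predict method would do.
X, y = load_breast_cancer(return_X_y=True, as_frame=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
model = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_train, y_train)

# Permutation importance: how much does shuffling each feature degrade held-out accuracy?
result = permutation_importance(model, X_test, y_test, n_repeats=10, random_state=0)
ranked = sorted(zip(X.columns, result.importances_mean), key=lambda pair: -pair[1])
for name, score in ranked[:5]:
    print(f"{name:30s} {score:.3f}")
```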

Performance metrics

Many of these results could easily lead one to the cynical conclusion that the best way for AI systems to promote the right amount of trust is simply by shielding users from information about the system’s decisions—in effect, by being less transparent.5,14 (Dzindolet et al.5 report that “eliminating operators' awareness of an automated decision aid’s obvious errors [through blinding the participants to the decisions of the aid] was useful in promoting appropriate automation reliance if participants were continually reminded of their and their aid’s performance. Unfortunately, applying these techniques outside the laboratory is problematic. It would not be reasonable to provide someone with an automated decision aid but not allow them to see the decisions the aid has made.”) Yet there is reason to believe that a better calibration of trust to a system’s actual level of accuracy can be achieved by providing more of the right kind of transparency: not just cumulative performance feedback (delivered at the end of a task session), but continuous performance feedback that allows the user to maintain a better picture of the system’s relative superiority in real time5,56 (see Figure 2B). Some researchers have even noticed a pattern in the way accuracy information interacts with user attitudes. Metainformation about low-reliability automation runs the risk of promoting overtrust (as measured by higher trust ratings), but metainformation about high-reliability automation seems to have the opposite effect. Presumably this is because, in the first case, users are placed on notice, ready to step in and override the system when it fails, which could, perversely, contribute to a sense that the system is actually more reliable than it is; while, in the second case, metainformation may consolidate users' unrealistic expectations, which are inevitably contradicted on witnessing errors, with the attendant fallout.57
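To picture the difference between cumulative and continuous performance feedback, here is a minimal sketch of a hypothetical feedback widget (our own illustration, not an interface from the cited studies) that keeps running accuracy for both the aid and the user, so the comparison can be surfaced after every trial rather than only at the end of a session.

```python
from dataclasses import dataclass

@dataclass
class RunningPerformance:
    """Tracks running accuracy for the automated aid and the human user,
    so the comparison can be shown continuously rather than only as
    end-of-session cumulative feedback."""
    ai_correct: int = 0
    human_correct: int = 0
    trials: int = 0

    def update(self, ai_was_correct: bool, human_was_correct: bool) -> str:
        self.trials += 1
        self.ai_correct += int(ai_was_correct)
        self.human_correct += int(human_was_correct)
        return (f"Trial {self.trials}: aid {self.ai_correct / self.trials:.0%} correct so far, "
                f"you {self.human_correct / self.trials:.0%} correct so far")

feedback = RunningPerformance()
# Simulated trial outcomes: (AI correct?, human correct?)
for ai_ok, human_ok in [(True, True), (True, False), (False, True), (True, False)]:
    print(feedback.update(ai_ok, human_ok))
```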

User control and dynamic allocation

Because explanations ultimately satisfy a need to be in control, an effective alternative strategy may be to allow users a degree of latitude over whether to accept an algorithm’s outputs at face value. For instance, provided that they can modify its forecasts, users are apparently willing to take an algorithm seriously even after seeing it make occasional mistakes. What is more, the precise degree of control seems to be irrelevant: the ability to modify a forecast even slightly may be sufficient to induce appropriate reliance.15 Control can be exercised in various ways, including through cognitive “forcing” functions that prompt users to request additional information in the form of explanations should they desire them.50

The static versus dynamic nature of task allocation is also important, because tasks in which control flexibly shifts between human and machine in accordance with user needs are better at sustaining operator vigilance.47 HAI teams in which allocation is dynamic can be further divided between those in which the allocation is adaptable, where users dictate the allocation, and those in which the allocation is adaptive, where the allocation is automated.47,58 Allocation can then proceed along several lines, but perhaps the most intuitive is along lines of difficulty. A human is likely to find some tasks easy that a machine will find hard and others hard that a machine will find easy. (From the machine's perspective, difficulty can be understood in terms of the degree of uncertainty exhibited in regard to a specific prediction.)59 Generally, human trust in AI is higher when tasks involve objective calculation—to the point of trusting the AI even after seeing it make mistakes60—and lower when tasks involve social and emotional intelligence.2 Both adaptive and adaptable forms of allocation can go some way toward achieving an optimal division of labor from the point of view of difficulty. For example, under adaptable allocation, humans can reserve all the tasks they consider easy for themselves and delegate the remaining ones to a machine. Under adaptive allocation, a machine could vary the difficulty of the tasks it reserved for the human, so that it referred both moderately difficult and easy tasks to them, in an attempt to keep users vigilant (e.g., via so-called "catch trials"). In one study, adaptable allocation was found to have a marginal advantage over adaptive allocation, and (unsurprisingly) happens to be easier to design.58 However, adaptive systems may be able to leverage uncertainty information in ways that are more effective than adaptable systems (catch trials for one)61 (see Box 2).
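As a rough sketch of what an adaptive, uncertainty-driven allocation policy might look like (the confidence threshold, catch-trial rate, and routing labels are our own assumptions, not a documented system):

```python
import random

rng = random.Random(0)

def adaptive_allocate(model_confidence: float,
                      threshold: float = 0.80,
                      catch_trial_rate: float = 0.05) -> str:
    """Adaptive allocation: refer hard (low-confidence) cases to the human,
    plus an occasional easy case as a catch trial to keep the user vigilant."""
    if model_confidence < threshold:
        return "refer to human (machine uncertain)"
    if rng.random() < catch_trial_rate:
        return "refer to human (catch trial)"
    return "machine handles, human monitors"

for conf in (0.95, 0.62, 0.99, 0.81):
    print(f"confidence {conf:.2f} -> {adaptive_allocate(conf)}")
```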

Box 2. Example of dynamic task allocation.

Allocation strategies can help a human maintain algorithmic vigilance.47 In an adaptable HAI team, the user dictates the allocation a priori.58 This allows humans to select which tasks they want to outsource to machines. Humans may elect to keep easy tasks for themselves, leaving harder tasks for machines, or may instead keep the difficult tasks (e.g., tasks requiring the exercise of discretion), allowing machines to focus on rote tasks. In an adaptive HAI team, by contrast, the machine dynamically determines the allocation strategy.58

A large body of research in aviation demonstrates the potential advantages of adaptive allocation.62 Air traffic controllers manage aircraft flow and intervene if aircraft separation is too low.63 The controller is provided with an automated decision aid to handle multiple tasks. In these scenarios, an adaptive allocation strategy is usually preferred.62,64 One advantage of adaptive strategies is that they can accommodate the use of “catch trials.” The point of a catch trial is to ensure that the controller is alert and situationally aware.65,66 They may take the form of randomly generated system errors to “catch out” the user or (more commonly) abstentions in which the system declines to recommend a course of action in a specific instance, leaving the user to fall back on their own skills.

When both the human and machine find a task easy, it likely does not matter which agent provides a response (although decision fatigue is an ever-present risk).67,68 More interesting are cases in which both machine and human struggle with a task. One approach here would be to select an agent at random. If the human is selected, then the human must make a decision without the machine’s recommendation; if the machine is selected, then the human would be shown the machine’s recommendation before making a decision (i.e., the human would have a choice whether to accept the machine’s recommendation). Future work might explore the efficacy of similar tie-breaking strategies when machines and humans both struggle with the same tasks.
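A minimal sketch of the random tie-break just described (how a case comes to be flagged as hard for both agents is left out; in practice it might combine model uncertainty with some estimate of human difficulty):

```python
import random

def allocate_hard_case(rng: random.Random) -> str:
    """Random tie-break for a case both agents find hard: either the human decides
    unaided, or the human sees the machine's recommendation first and may accept it."""
    if rng.random() < 0.5:
        return "human decides without machine recommendation"
    return "human sees machine recommendation, then decides"

rng = random.Random(42)
print([allocate_hard_case(rng) for _ in range(4)])
```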

Confidence information

A different form of transparency involves presenting users with system confidence information. There is growing evidence that suitably formatted confidence data (e.g., in the form of uncertainty estimates, confidence intervals, confidence levels, etc.) may improve trust calibration.6,55,64 To the extent that humans have the capacity to incorporate an AI system's uncertainty appropriately, this will result in better performance. However, we highlight two significant challenges: (1) humans are often poor at handling numeric information, so presentation and design may be important.69,70 Indeed, there is evidence that humans may be prone to "information overload," so that providing confidence measures might lead to worse performance.37 (2) It is typically challenging to provide reliable, well-calibrated uncertainty estimates. An unfortunate property of current AI systems is that they are prone to being overconfident on examples where they might perform poorly.71
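To make "well calibrated" concrete, the sketch below computes the expected calibration error (ECE), a standard diagnostic for the gap between reported confidence and realized accuracy; the simulated data mimic an overconfident model. This is our own illustration, not a measure prescribed by the works cited here.

```python
import numpy as np

def expected_calibration_error(confidences: np.ndarray, correct: np.ndarray, n_bins: int = 10) -> float:
    """Average |accuracy - confidence| across confidence bins, weighted by bin size.
    A well-calibrated model is right about 70% of the time when it reports 70% confidence."""
    bins = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(bins[:-1], bins[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            gap = abs(correct[mask].mean() - confidences[mask].mean())
            ece += mask.mean() * gap
    return float(ece)

rng = np.random.default_rng(0)
conf = rng.uniform(0.5, 1.0, size=1000)
# Simulate an overconfident model: realized accuracy lags reported confidence by ~15 points.
correct = rng.random(1000) < np.clip(conf - 0.15, 0.0, 1.0)
print(f"ECE = {expected_calibration_error(conf, correct):.3f}")
```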

Open questions

There are at least two important sets of issues whose resolution is outstanding. First, it is unclear what effects the size, frequency, type, and distribution of errors have in the loss and recovery of trust after users witness automation errors. Second, we know little about how different forms of transparency compare in the course of rebuilding that trust. In particular, almost nothing is known about how explanations, confidence data, performance metrics, and dynamic allocation strategies measure up against each other from the standpoint of optimal HAI team performance.

Error size, frequency, type, and distribution

Beyond common intuitions, little is known about the precise effects of an error's size on trust violation and repair. It is reasonable to suppose that an error's size need not refer simply to its deviation from an ideal quantity or range, as in the case of risk scores that are off by some measure. By referring to an error's size one could equally well intend to convey, more generally, how surprising the error is given widely held assumptions among users about how the world ought to be. As we already suggested, mistakes on easy tasks (i.e., obvious mistakes) may be judged more harshly, and be more corrosive of trust, than those on tasks perceived to be more difficult. We also noted evidence that continuous performance feedback may be an effective means of encouraging appropriate reliance after users witness automation errors. But it is not clear whether this kind of feedback is powerful enough to withstand the blow dealt to trust by the commission of large or obvious automation errors (e.g., Dzindolet et al.5 found that such feedback is only effective when users are shielded from seeing obvious errors altogether). Again, beyond common intuitions, little can be said about the precise effects of the frequency of errors either. But, as one might expect, users do seem able to recover more readily from isolated or acute system failures than they do from chronic ones.48,72

Less still is known about the effects of distinct types of error on trust. Some studies purport to show that false alarms and misses affect trust differently, with false alarms having a greater negative impact than misses, while others report no significant difference along this dimension; one review36 interprets these conflicting results by suggesting that the consequences of false alarms versus misses determine the effects observed. In a contest between a false alarm that poses only a "minor inconvenience" (e.g., a trigger-happy smoke alarm) and a miss that could be lethal (a smoke alarm that operates intermittently), it is the former that will have less deleterious effects on trust than the latter. But as its authors note: "the relative influence of other types of automation failures, such as breakdowns and error messages, has yet to be determined" (our emphasis).

Perhaps least understood of all is the effect of the distribution of system errors over time. For example, are two large errors in quick succession as detrimental to trust as two large errors spaced apart (e.g., one at the beginning and one in the middle of a task session)? If so, are such “clustered” errors also more difficult to repair than temporally dispersed ones? We do not know. There are some indications that the earlier during a session that an error occurs, the sharper and more significant the decline in trust and the more difficult it will be to recover, despite reliable performance otherwise.7 This makes sense—if an acquaintance betrays your trust very early on in your dealings with them, you may find it harder to “forgive and forget” a single infraction than if you had been friends for 20 years. Nonetheless, such adverse events can be beneficial too, inducing appropriate reliance (as against algorithm aversion). The studies by Manzey et al.,7 for instance, revealed that participants exposed to automation failures earlier on in a task session were less susceptible to both automation complacency and automation bias. But beyond this we know little.

Comparative performance of transparency regimes

We already noted some of the drawbacks of AI-generated explanations in fostering well-calibrated user trust. Most notable among these is the risk of overtrust. What requires further investigation is whether the merits of various alternatives to explanations, on balance, make them more suitable than explanations. In particular, which forms of transparency are most effective in mitigating the risk of aversion and opposition after seeing an AI make a mistake? This latter question is more important than the question over which forms of transparency will best mitigate the risk of appreciation and loafing, because AI systems can be expected to improve over time, and perhaps radically. In that event, a trust surfeit arising from the use of explainable algorithms will not prove nearly as hazardous as a trust deficit arising from the use of alternative algorithms—at least in safety-critical domains. Hence the bar that any of the alternatives to explainable algorithms will have to meet may need to be set progressively higher, roughly in line with gains to system accuracy.

Be that as it may, model confidence data (e.g., uncertainty estimates) have been shown to be more helpful to users than explanations in at least one study.55 In another, confidence data "helped pilots make better decisions about task allocation and compliance with [system] recommendations and thus resulted in improved performance and safety."6 Even so, the precise experimental setup was limited to a restricted range of confidence levels (high, low, and variable) and a binary solution space (the presence of ice on the jet wing or jet tail). As the study's authors noted, more realistic experimental conditions are necessary before one is warranted in drawing firmer conclusions. Indeed, greater comparative investigation of the efficacy of confidence data and explanations—under as close to real-life scenarios as possible—is what is really needed.55 The same goes for dynamic performance metrics displaying an AI's superior "running average" against its human counterpart(s). As we noted earlier in this section, whether continuous performance feedback of this sort mitigates aversive tendencies emerging after users witness large or obvious errors is not known. Allowing users to manipulate algorithmic outputs may be all it takes to reverse these tendencies.15 It is possible, too, that adaptive allocation paradigms, which exploit the full possibilities of model uncertainty, will prove more effective overall in promoting vigilance than adaptable allocation. But again, whether any of these paradigms are preferable to explanations, and to what extent, remains unclear.

Incorrect, deceptive, or misleading transparency

Recall our definition of transparency as any information provided about an AI system beyond its model outputs. While transparency is often beneficial, we briefly note several potential dangers.73 Just as model outputs can be wrong, so too can additional transparency information. Since this information might be relied upon in making decisions, incorrect transparency can cause harm. Incorrect transparency might be unintentional74 or could be deliberately deceptive.75, 76, 77 Furthermore, even correct information might be misleading. In human communication, we often leave certain points unsaid, assuming our counterpart has background knowledge of the context. This creates the potential for information to be misleading if it is not carefully presented.78 Hence, ideally, algorithmic transparency should satisfy what linguists would call pragmatic desiderata. However, these are not easy to measure or satisfy in practice and remain an important focus of machine learning research.

Concluding remarks and future perspectives

We have considered how various forms of algorithmic transparency may promote user vigilance. More broadly, however, we have sought to provide a practical framework for the study of HAI teams that (1) brings the same phenomena investigated by a variety of fields under a unified descriptive apparatus, (2) clarifies the scope of the technical systems that are the proper target of these investigations, and (3) identifies the overriding concern of these investigations with the maintenance of algorithmic vigilance. Our hope is that, by presenting the above research within this framework, we might inspire those who study HAI teams to seek to forge stronger connections despite the persistence of disciplinary boundaries (in practice if not in principle). At the moment, HAI team research is siloed. To take just one case, the authors of a recent (and high-quality) peer-reviewed study took themselves to be challenging “the widespread assertion that people are averse to algorithms” on the basis that the participants in their study “were quite willing to rely on algorithmic advice before seeing the algorithm err.”3 Human factors engineers would be unmoved by the finding that humans are prepared to trust—indeed overtrust—algorithms, having invested great efforts over the years in dealing with the problematic consequences of this very tendency. In our view, HAI team research should comprise a unified branch of study with a basic modus operandi and lingua franca, albeit drawing from expertise across several autonomous subfields. Our framework offers a pragmatic way forward.

Perhaps the greatest challenge in the study of HAI teams, however, is simply resisting the urge to overgeneralize experimental results.47 Indeed, we think that ecological validity is an underappreciated problem in this area. Findings in aviation and shipping contexts are of questionable value in court and law enforcement contexts, which in turn may have little bearing on how the automation of medical diagnoses should be approached.38 In legal and medical contexts, initial trust in automation is actually quite low, presumably due to the expertise of the users involved.79,80 This is at odds with the general findings we reviewed above.

Insofar as ecological validity is acknowledged, too often it features as an afterthought: a mere warning to readers of the limitations of the study concerned, along with a reminder to keep those limitations in mind when applying results in real-world settings.7 This is a good start, but it has not prevented occasional sweeping claims from being made about how "people" using "algorithms" react in this or that situation3,5,14,15 (cf. Carton et al.51). To illustrate, we can take an otherwise excellent and justly influential study whose authors fell into this trap. At one point, the authors state their take-home message as follows: "observing an automated decision aid make errors leads to distrust of the automated decision aid, unless an explanation is provided explaining why the aid might err."5 A little further down the same page (p. 715), however, one finds the customary discussion of limitations. First, they noted that "the task was very simple and artificial." Second, the study necessarily ignored "[t]he effect of one person's view of the automated aid's trustworthiness on other group members' reliance decisions," because the study limited itself to examining the dyad of a single user with an automated aid; and so on. When the findings of a branch of study are taken up with the vim and vigor typical in HAI team research, ecological concerns become too important to squeeze into general disclaimers. How can we be certain that the limitations do not vitiate the generalizations entirely? Ideally, authors should qualify all substantive claims so that even such rudiments as titles and abstracts are expressed tentatively. In the illustration just given, the take-home message cannot quite be: people distrust automated aids whose errors they witness unless an explanation is provided. Something more tentative is called for: in very simple automated tasks involving a single person, people tend to distrust automated aids whose errors they witness, unless an explanation is provided. Every algorithm, every interface, every task, is unique after all.

Perhaps the most effective way to meet the ecological challenge is for HAI team research to proceed in a task-specific fashion that takes account of the precise nature of the task and its setting. Note that task specificity is distinct from domain specificity. Domain-specific investigation would confine research and its results to a more or less widely defined domain of activity (such as maritime shipping or criminal justice). Task-specific investigation, by contrast, would confine research by the nature of the task under consideration (such as adjudication between disputing parties, regardless of whether it is carried out by a court of law, a mediator, or a human resources officer). Since the basis of investigation and extrapolation in the latter case is the similarity of the tasks undertaken, regardless of domain, task-specific investigation may harness results from research conducted across what are in fact very distinct domains of activity (as the examples just given show). Conversely, a task-specific orientation may mean that results from one experiment are not presumed to generalize to another setting, despite the fact that both tasks occur within the same domain (e.g., results from an experiment testing the behavior of judges using recidivism risk algorithms in sentencing or bail applications may fail to generalize to a setting in which judges use algorithms to determine the likelihood of a repeat psychotic episode in a parent suing for child custody).

Our impression is that sweeping claims are more typical in the literature of organizational behavior and machine learning than they are in those of, say, ergonomics and human factors. The latter fields have always had several parallel streams of inquiry running alongside one another (e.g., one for ocean navigation, one for aviation and air traffic control, one for autonomous vehicles, another for nuclear power, etc.), and this has meant that conclusions in these fields have always been implicitly circumscribed. It is in the nature of task-specific research to constrain the applicability of results.

An emphasis on task-specific inquiry may seem in tension with our call for HAI team research to espouse greater cross-disciplinary cohesion and coordination. But what we are calling for in the latter case is simply an end to the kind of siloed research in which differences in terminology serve no purpose, and where people from one field are unaware of discoveries in another relating to the exact same subject matter. Cross-disciplinary activity, as such, is compatible with task-specific investigation: the field of human factors itself offers an excellent model of domain- and task-specific research worth emulating at a larger scale. This transition may not be easy to achieve. HAI team researchers whose main experience is in machine learning may find it especially difficult. The machine learning community on the whole values task-independent, model-agnostic, scalable, and general models that solve as many variations of a problem as possible. This work is not misconceived. Indeed, there is a delicate balance to be struck between the necessity of controlled and (to a sometimes considerable extent) contrived experimental conditions on the one hand, and real-world applicability on the other. We appreciate that experimental conditions must strive to isolate the psychological processes underlying team behaviors, and that without a certain amount of artifice in experimental design there can be no generalizable results at all. However, our aim here is to direct attention to the importance of real-world applicability and, more specifically, to intra-ecological generalizability. We propose that task specificity is an effective means of securing this form of generalizability. Then, within a task-specific orientation, the familiar give-and-take between laboratory and real life can proceed in accordance with the principles of sound applied science. But task specificity is an imperative if the machine learning community is to contribute meaningfully to HAI team research.

Acknowledgments

J.Z. is part-funded by the Leverhulme Trust (ECF-2020-428). U.B. acknowledges support from DeepMind and the Leverhulme Trust via the Leverhulme Center for the Future of Intelligence (CFI) and from the Mozilla Foundation. A.W. acknowledges support from a Turing AI Fellowship under grant EP/V025379/1, The Alan Turing Institute, and the Leverhulme Trust via CFI. We are grateful to Simone Schnall and Reuben Binns for very helpful discussion and comments.

Author contributions

Conceptualization, J.Z., U.B., and A.W.; methodology, J.Z., U.B., and A.W.; investigation, J.Z. and U.B.; writing – original draft, J.Z., with U.B. and A.W. contributing parts to middle sections; writing – review & editing, J.Z.; Box 1, conceptualization and writing, J.Z.; Box 2, conceptualization and writing, U.B.; project administration, J.Z., U.B., and A.W.

References

  • 1.Lewandowsky S., Mundy M., Tan G. The dynamics of trust: comparing humans to automation. J. Exp. Psychol. Appl. 2000;6:104. doi: 10.1037//1076-898x.6.2.104. [DOI] [PubMed] [Google Scholar]
  • 2.Lee M.K. Understanding perception of algorithmic decisions: fairness, trust, and emotion in response to algorithmic management. Big Data Soc. 2018;5 [Google Scholar]
  • 3.Logg J.M., Minson J.A., Moore D.A. Algorithm appreciation: people prefer algorithmic to human judgment. Organ. Behav. Hum. Decis. Process. 2019;151:90–103. [Google Scholar]
  • 4.Parasuraman R., Riley V. Humans and automation: use, misuse, disuse, abuse. Hum. Factors. 1997;39:230–253. [Google Scholar]
  • 5.Dzindolet M.T., Peterson S.A., Pomranky R.A., Pierce L.G., Beck H.P. The role of trust in automation reliance. Int. J. Human Comput. Stud. 2003;58:697–718. [Google Scholar]
  • 6.McGuirl J.M., Sarter N.B. Supporting trust calibration and the effective use of decision aids by presenting dynamic system confidence information. Hum. Factors. 2006;48:656–665. doi: 10.1518/001872006779166334. [DOI] [PubMed] [Google Scholar]
  • 7.Manzey D., Reichenbach J., Onnasch L. Human performance consequences of automated decision aids: the impact of degree of automation and system experience. J. Cogn. Eng. Decis. Making. 2012;6:57–87. [Google Scholar]
  • 8.Bainbridge W.A., Hart J.W., Kim E.S., Scassellati B. The benefits of interactions with physically present robots over video-displayed agents. Int. J. Soc. Robot. 2011;3:41–52. [Google Scholar]
  • 9.Desai M., Kaniarasu P., Medvedev M., Steinfeld A., Yanco H. 2013 8th ACM/IEEE International Conference on Human-Robot Interaction (HRI) IEEE; 2013. Impact of robot failures and feedback on real-time trust; pp. 251–258. [Google Scholar]
  • 10.Gombolay M.C., Gutierrez R.A., Clarke S.G., Sturla G.F., Shah J.A. Decision-making authority, team efficiency and human worker satisfaction in mixed human-robot teams. Aut. Robots. 2015;39:293–312. [Google Scholar]
  • 11.Robinette P., Howard A.M., Wagner A.R. International Conference on Social Robotics. Springer; 2015. Timing is key for robot trust repair; pp. 574–583. [Google Scholar]
  • 12.Salem M., Lakatos G., Amirabdollahian F., Dautenhahn K. 2015 10th ACM/IEEE International Conference on Human-Robot Interaction (HRI) IEEE; 2015. Would you trust a (faulty) robot? Effects of error, task type and personality on human-robot cooperation and trust; pp. 1–8. [Google Scholar]
  • 13.Andrist S., Bohus D., Yu Z., Horvitz E. 2016 11th ACM/IEEE International Conference on Human-Robot Interaction (HRI) IEEE; 2016. Are you messing with me? Querying about the sincerity of interactions in the open world; pp. 409–410. [Google Scholar]
  • 14.Dietvorst B.J., Simmons J.P., Massey C. Algorithm aversion: people erroneously avoid algorithms after seeing them err. J. Exp. Psychol. Gen. 2015;144:114. doi: 10.1037/xge0000033. [DOI] [PubMed] [Google Scholar]
  • 15.Dietvorst B.J., Simmons J.P., Massey C. Overcoming algorithm aversion: people will use imperfect algorithms if they can (even slightly) modify them. Manag. Sci. 2018;64:1155–1170. [Google Scholar]
  • 16.Montague E., Xu J. Understanding active and passive users: the effects of an active user using normal, hard and unreliable technologies on user assessment of trust in technology and co-user. Appl. Ergon. 2012;43:702–712. doi: 10.1016/j.apergo.2011.11.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Jacovi A., Marasović A., Miller T., Goldberg Y. Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency. 2021. Formalizing trust in artificial intelligence: prerequisites, causes and goals of human trust in AI; pp. 624–635. [Google Scholar]
  • 18.Schmidt P., Biessmann F., Teubner T. Transparency and trust in artificial intelligence systems. J. Decis. Syst. 2020;29:260–278. [Google Scholar]
  • 19.De-Arteaga M., Fogliato R., Chouldechova A. Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. 2020. A case for humans-in-the-loop: decisions in the presence of erroneous algorithmic scores; pp. 1–12. [Google Scholar]
  • 20.Amershi S., Weld D., Vorvoreanu M., Fourney A., Nushi B., Collisson P., Suh J., Iqbal S., Bennett P.N., Inkpen K., et al. Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery; 2019. Guidelines for human-AI interaction; pp. 1–13. [Google Scholar]
  • 21.Yang F., Huang Z., Scholtz J., Arendt D.L. Proceedings of the 25th International Conference on Intelligent User Interfaces. 2020. How do visual explanations foster end users’ appropriate trust in machine learning? pp. 189–201. [Google Scholar]
  • 22.Suresh H., Lao N., Liccardi I. 12th ACM Conference on Web Science. 2020. Misplaced trust: measuring the interference of machine learning in human decision-making; pp. 315–324. [Google Scholar]
  • 23.Weerts H.J., van Ipenburg W., Pechenizkiy M. Proceedings of KDD Workshop on Explainable AI. 2019. A human-grounded evaluation of shap for alert processing. [Google Scholar]
  • 24.Kaur H., Nori H., Jenkins S., Caruana R., Wallach H., Wortman Vaughan J. Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. 2020. Interpreting interpretability: understanding data scientists’ use of interpretability tools for machine learning; pp. 1–14. [Google Scholar]
  • 25.Lee J.D., See K.A. Trust in automation: designing for appropriate reliance. Hum. Factors. 2004;46:50–80. doi: 10.1518/hfes.46.1.50_30392. [DOI] [PubMed] [Google Scholar]
  • 26.Rousseau D.M., Sitkin S.B., Burt R.S., Camerer C. Not so different after all: a cross-discipline view of trust. Acad. Manag. Rev. 1998;23:393–404. [Google Scholar]
  • 27.Siegrist M., Earle T.C., Gutscher H. Test of a trust and confidence model in the applied context of electromagnetic field (EMF) risks. Risk Anal. Int. J. 2003;23:705–716. doi: 10.1111/1539-6924.00349. [DOI] [PubMed] [Google Scholar]
  • 28.Siegrist M., Gutscher H., Earle T.C. Perception of risk: the influence of general trust, and general confidence. J. Risk Res. 2005;8:145–156. [Google Scholar]
  • 29.Epley N., Waytz A., Cacioppo J.T. On seeing human: a three-factor theory of anthropomorphism. Psychol. Rev. 2007;114:864. doi: 10.1037/0033-295X.114.4.864. [DOI] [PubMed] [Google Scholar]
  • 30.Evans A.M., Krueger J.I. The psychology (and economics) of trust. Social Personal. Psychol. Compass. 2009;3:1003–1017. [Google Scholar]
  • 31.Thielmann I., Hilbig B.E. Trust: an integrative review from a person-situation perspective. Rev. Gen. Psychol. 2015;19:249–277. [Google Scholar]
  • 32.Lewicki R.J., Brinsfield C. Trust repair. Annu. Rev. Organ. Psychol. Organ. Behav. 2017;4:287–313. [Google Scholar]
  • 33.Fiske S.T. Stereotype content: warmth and competence endure. Curr. Dir. Psychol. Sci. 2018;27:67–73. doi: 10.1177/0963721417738825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Bhatt U., Xiang A., Sharma S., Weller A., Taly A., Jia Y., Ghosh J., Puri R., Moura J.M., Eckersley P. Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency. 2020. Explainable machine learning in deployment; pp. 648–657. [Google Scholar]
  • 35.Glikson E., Woolley A.W. Human trust in artificial intelligence: review of empirical research. Acad. Manag. Ann. 2020;14:627–660. [Google Scholar]
  • 36.Hoff K.A., Bashir M. Trust in automation: integrating empirical evidence on factors that influence trust. Hum. Factors. 2015;57:407–434. doi: 10.1177/0018720814547570. [DOI] [PubMed] [Google Scholar]
  • 37.Bansal G., Wu T., Zhou J., Fok R., Nushi B., Kamar E., Ribeiro M.T., Weld D. Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. 2021. Does the whole exceed its parts? The effect of AI explanations on complementary team performance; pp. 1–16. [Google Scholar]
  • 38.Goddard K., Roudsari A., Wyatt J.C. Automation bias: empirical results assessing influencing factors. Int. J. Med. Inform. 2014;83:368–375. doi: 10.1016/j.ijmedinf.2014.01.001. [DOI] [PubMed] [Google Scholar]
  • 39.Zerilli J., Knott A., Maclaurin J., Gavaghan C. Algorithmic decision-making and the control problem. Minds Mach. 2019;29:555–578. [Google Scholar]
  • 40.Parasuraman R., Manzey D.H. Complacency and bias in human use of automation: an attentional integration. Hum. Factors. 2010;52:381–410. doi: 10.1177/0018720810376055. [DOI] [PubMed] [Google Scholar]
  • 41.Pazouki K., Forbes N., Norman R.A., Woodward M.D. Investigation on the impact of human-automation interaction in maritime operations. Ocean Eng. 2018;153:297–304. [Google Scholar]
  • 42.Bagheri N., Jamieson G.A. Considering subjective trust and monitoring behavior in assessing automation-induced “complacency”. Hum. Perform. Situat. Aware. Autom. Curr. Res. Trends. 2004;1:54–59. [Google Scholar]
  • 43.Banks V.A., Eriksson A., O’Donoghue J., Stanton N.A. Is partially automated driving a bad idea? Observations from an on-road study. Appl. Ergon. 2018;68:138–145. doi: 10.1016/j.apergo.2017.11.010. [DOI] [PubMed] [Google Scholar]
  • 44. Banks V.A., Plant K.L., Stanton N.A. Driver error or designer error: using the perceptual cycle model to explore the circumstances surrounding the fatal Tesla crash on 7th May 2016. Saf. Sci. 2018;108:278–285.
  • 45. Lee J.D., Moray N. Trust, self-confidence, and operators’ adaptation to automation. Int. J. Human Comput. Stud. 1994;40:153–184.
  • 46. Dietvorst B.J., Bharti S. People reject algorithms in uncertain decision domains because they have diminishing sensitivity to forecasting error. Psychol. Sci. 2020;31:1302–1314. doi: 10.1177/0956797620948841.
  • 47. Chavaillaz A., Wastell D., Sauer J. System reliability, performance and trust in adaptable automation. Appl. Ergon. 2016;52:333–342. doi: 10.1016/j.apergo.2015.07.012.
  • 48. Lee J., Moray N. Trust, control strategies and allocation of function in human-machine systems. Ergonomics. 1992;35:1243–1270. doi: 10.1080/00140139208967392.
  • 49. Lai V., Tan C. On human predictions with explanations and predictions of machine learning models: a case study on deception detection. In: Proceedings of the Conference on Fairness, Accountability, and Transparency. 2019, pp. 29–38.
  • 50. Buçinca Z., Malaya M.B., Gajos K.Z. To trust or to think: cognitive forcing functions can reduce overreliance on AI in AI-assisted decision-making. Proc. ACM Human Comput. Interact. 2021;5:1–21.
  • 51. Carton S., Mei Q., Resnick P. Feature-based explanations don’t help people detect misclassifications of online toxicity. In: Proceedings of the International AAAI Conference on Web and Social Media, Vol. 14. 2020, pp. 95–106.
  • 52. Shen H., Huang T.-H. How useful are the machine-generated interpretations to general users? A human evaluation on guessing the incorrectly predicted labels. In: Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, Vol. 8. 2020, pp. 168–172.
  • 53. Kenny E.M., Ford C., Quinn M., Keane M.T. Explaining black-box classifiers using post-hoc explanations-by-example: the effect of explanations and error-rates in XAI user studies. Artif. Intell. 2021;294:103459.
  • 54. Jeyakumar J.V., Noor J., Cheng Y.-H., Garcia L., Srivastava M. How can I explain this to you? An empirical study of deep neural network explanation methods. Adv. Neural Inf. Process. Syst. 2020;33:4211–4222.
  • 55. van der Waa J., Nieuwburg E., Cremers A., Neerincx M. Evaluating XAI: a comparison of rule-based and example-based explanations. Artif. Intell. 2021;291:103404.
  • 56. Wang L., Jamieson G.A., Hollands J.G. Trust and reliance on an automated combat identification system. Hum. Factors. 2009;51:281–291. doi: 10.1177/0018720809338842.
  • 57. Seong Y., Bisantz A.M. The impact of cognitive feedback on judgment performance and trust with decision aids. Int. J. Ind. Ergon. 2008;38:608–625.
  • 58. Sauer J., Kao C.-S., Wastell D. A comparison of adaptive and adaptable automation under different levels of environmental stress. Ergonomics. 2012;55:840–853. doi: 10.1080/00140139.2012.676673.
  • 59. Bhatt U., Antoran J., Zhang Y., Liao Q.V., Sattigeri P., Fogliato R., Melancon G., Krishnan R., Stanley J., Tickoo O., et al. Uncertainty as a form of transparency: measuring, communicating, and using uncertainty. In: Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society (AIES ’21). Association for Computing Machinery; 2021, pp. 401–413.
  • 60. Dijkstra J.J. User agreement with incorrect expert system advice. Behav. Inf. Technol. 1999;18:399–411.
  • 61. De A., Okati N., Zarezade A., Rodriguez M.G. Classification under human assistance. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 2021, pp. 5905–5913.
  • 62. Parasuraman R., Mouloua M., Molloy R. Effects of adaptive task allocation on monitoring of automated systems. Hum. Factors. 1996;38:665–679. doi: 10.1518/001872096778827279.
  • 63. Metzger U., Parasuraman R. Automation in future air traffic management: effects of decision aid reliability on controller performance and mental workload. Hum. Factors. 2005;47:35–49. doi: 10.1518/0018720053653802.
  • 64. Papenmeier A., Englebienne G., Seifert C. How model accuracy and explanation fidelity influence user trust. In: IJCAI Workshop on Explainable Artificial Intelligence. 2019.
  • 65. Davies D.R., Parasuraman R. The Psychology of Vigilance. Academic Press; 1982.
  • 66. Gugerty L.J., Tirre W.C. Individual differences in situation awareness. Situat. Aware. Anal. Meas. 2000:249–276.
  • 67. Chaparro A., Groff L., Tabor K., Sifrit K., Gugerty L.J. Maintaining situational awareness: the role of visual attention. In: Proceedings of the Human Factors and Ergonomics Society Annual Meeting, Vol. 43. SAGE Publications; 1999, pp. 1343–1347.
  • 68. Warm J.S., Dember W.N., Hancock P.A. Vigilance and workload in automated systems. In: Parasuraman R., Mouloua M., eds. Automation and Human Performance: Theory and Applications. Lawrence Erlbaum Associates; 1996, pp. 183–200.
  • 69. Reyna V.F., Brainerd C.J. Numeracy, ratio bias, and denominator neglect in judgments of risk and probability. Learn. Individ. Differ. 2008;18:89–107. https://linkinghub.elsevier.com/retrieve/pii/S1041608007000428
  • 70. Spiegelhalter D., Pearson M., Short I. Visualizing uncertainty about the future. Science. 2011;333:1393–1400. doi: 10.1126/science.1191181. https://www.sciencemag.org/lookup/doi/10.1126/science.1191181
  • 71. Guo C., Pleiss G., Sun Y., Weinberger K.Q. On calibration of modern neural networks. In: International Conference on Machine Learning. 2017, pp. 1321–1330.
  • 72. Biros D.P., Daly M., Gunsch G. The influence of task load and automation trust on deception detection. Group Decis. Negot. 2004;13:173–189.
  • 73. Weller A. Transparency: motivations and challenges. In: Explainable AI: Interpreting, Explaining and Visualizing Deep Learning. Springer; 2019, pp. 23–40.
  • 74. Ehsan U., Riedl M.O. Explainability pitfalls: beyond dark patterns in explainable AI. 2021. https://arxiv.org/abs/2109.12480
  • 75. Heo J., Joo S., Moon T. Fooling neural network interpretations via adversarial model manipulation. Adv. Neural Inf. Process. Syst. 2019;32:2925–2936.
  • 76. Dimanov B., Bhatt U., Jamnik M., Weller A. You shouldn’t trust me: learning models which conceal unfairness from multiple explanation methods. In: Proceedings of the 2020 European Conference on AI. 2020.
  • 77. Slack D., Hilgard S., Jia E., Singh S., Lakkaraju H. Fooling LIME and SHAP: adversarial attacks on post hoc explanation methods. In: Proceedings of the AAAI/ACM Conference on AI, Ethics and Society. 2020, pp. 180–186.
  • 78. Gigerenzer G., Wegwarth O., Feufel M. Misleading communication of risk. 2010.
  • 79. Linkov F., Sanei-Moghaddam A., Edwards R.P., Lounder P.J., Ismail N., Goughnour S.L., Kang C., Mansuria S.M., Comerci J.T. Implementation of hysterectomy pathway: impact on complications. Women’s Health Issues. 2017;27:493–498. doi: 10.1016/j.whi.2017.02.004.
  • 80. Christin A. Algorithms in practice: comparing web journalism and criminal justice. Big Data Soc. 2017;4.
