Visual aids improve diagnostic inferences and metacognitive judgment calibration

Rocio Garcia-Retamero; Edward T Cokely; Ulrich Hoffrage

doi:10.3389/fpsyg.2015.00932

. 2015 Jul 16;6:932. doi: 10.3389/fpsyg.2015.00932

Visual aids improve diagnostic inferences and metacognitive judgment calibration

Rocio Garcia-Retamero ^1,^2,^3,^*, Edward T Cokely ^4,^2,³, Ulrich Hoffrage ⁵

PMCID: PMC4504147 PMID: 26236247

Abstract

Visual aids can improve comprehension of risks associated with medical treatments, screenings, and lifestyles. Do visual aids also help decision makers accurately assess their risk comprehension? That is, do visual aids help them become well calibrated? To address these questions, we investigated the benefits of visual aids displaying numerical information and measured accuracy of self-assessment of diagnostic inferences (i.e., metacognitive judgment calibration) controlling for individual differences in numeracy. Participants included 108 patients who made diagnostic inferences about three medical tests on the basis of information about the sensitivity and false-positive rate of the tests and disease prevalence. Half of the patients received the information in numbers without a visual aid, while the other half received numbers along with a grid representing the numerical information. In the numerical condition, many patients–especially those with low numeracy–misinterpreted the predictive value of the tests and profoundly overestimated the accuracy of their inferences. Metacognitive judgment calibration mediated the relationship between numeracy and accuracy of diagnostic inferences. In contrast, in the visual aid condition, patients at all levels of numeracy showed high-levels of inferential accuracy and metacognitive judgment calibration. Results indicate that accurate metacognitive assessment may explain the beneficial effects of visual aids and numeracy–a result that accords with theory suggesting that metacognition is an essential part of risk literacy. We conclude that well-designed risk communications can inform patients about healthrelevant numerical information while helping them assess the quality of their own risk comprehension.

Keywords: visual aids, Bayesian reasoning, natural frequencies, numeracy, risk literacy, medical decision making, diagnostic inferences

Introduction

Visual aids are graphical representations of numerical expressions of probability. They include, among others, icon arrays, bar and line charts, and grids (Paling, 2003; Spiegelhalter et al., 2011). Visual aids provide an effective means of risk communication when they are transparent (Garcia-Retamero and Cokely, 2013)—that is, when their elements are well defined and they accurately and clearly represent the relevant risk information by making part-to-whole relationships in the data visually available (Gillan et al., 1998; Ancker et al., 2006; Reyna and Brainerd, 2008; Fischhoff et al., 2012; Trevena et al., 2012).

Transparent visual aids improve comprehension of risks associated with different lifestyles, screenings, and medical treatments, and they promote consideration of beneficial treatments despite side-effects (Feldman-Stewart et al., 2000; Paling, 2003; Waters et al., 2007; Zikmund-Fisher et al., 2008a; Zikmund-Fisher, 2015). Transparent visual aids also increase appropriate risk-avoidance behaviors, they promote healthy behaviors (Garcia-Retamero and Cokely, 2011, 2014a), they reduce errors and biases induced by anecdotal narratives and framed messages (Fagerlin et al., 2005; Schirillo and Stone, 2005; Garcia-Retamero and Galesic, 2009, 2010a; Cox et al., 2010; Garcia-Retamero et al., 2010) and they aid comprehension of complex concepts such as incremental risk (Zikmund-Fisher et al., 2008b). Risk information presented visually is also judged as easier to understand and recall than the same information presented numerically (Feldman-Stewart et al., 2007; Goodyear-Smith et al., 2008; Gaissmaier et al., 2012; Zikmund-Fisher et al., 2014; Okan et al., 2015).

However, not all visual aids are equally effective for all tasks (see Garcia-Retamero and Cokely, 2013, for a review). For instance, bar graphs are useful for comparing data points (Lipkus and Hollands, 1999; Lipkus, 2007; Fischhoff et al., 2012); line graphs are helpful for depicting trends over time; magnifier risk scales (including magnifying lenses) are useful for depicting small numbers (Ancker et al., 2006); icon arrays can be helpful for communicating treatment risk reduction and risk of side effects (Feldman-Stewart et al., 2000; Garcia-Retamero and Galesic, 2009, 2010b; Ancker et al., 2011; Okan et al., 2012); logic trees can be useful for visually depicting argument structure (Mandel, 2014); and grids can help depict large numbers when communicating the predictive value of medical tests (Garcia-Retamero and Hoffrage, 2013).

Grids displaying numerical information graphically have been found to boost the accuracy of perceptions of health-related benefits and risks beyond the effect of other transparent information formats. To illustrate, doctors and patients often have difficulties inferring the predictive value of a medical test from information about the sensitivity and false-positive rate of the test and the prevalence of the disease. In an influential study on how doctors process information about the results of mammography, Eddy (1982) gave 100 doctors the following information: “The probability that a woman has breast cancer is 1%. When a woman has breast cancer, it is not sure that she will have a positive result on the mammography: she has an 80% probability of having a positive result on the mammography. When a woman does not have breast cancer, it is still possible that she will have a positive result on the mammography: she has a 10% probability of having a positive result on the mammography.”

After having read this information, doctors were required to estimate the probability that a woman with a positive mammography actually has breast cancer. Eddy (1982) reported that 95 of 100 doctors estimated this probability to be about 80% (see Gigerenzer, 2013; Ellis et al., 2014, for similar results in patients). If one inserts the numbers presented above into a Bayes’ theorem, however, one gets a value of 8%, which is one order of magnitude smaller.

Gigerenzer and Hoffrage (1995, 1999) showed that communicating information about medical tests in natural frequencies as compared to probabilities improves diagnostic inferences (see also Sedlmeier and Gigerenzer, 2001; Kurzenhäuser and Hoffrage, 2002; Mandel, 2015). Natural frequencies are final tallies in a set of objects or events randomly sampled from the natural environment (Hoffrage et al., 2000, 2002). For the mammography task the statistical information provided in terms of natural frequencies reads: “100 out of every 10,000 women have breast cancer. When a woman has breast cancer, it is not sure that she will have a positive result on the mammography: 80 of every 100 such women will have a positive result on the mammography. When a woman does not have breast cancer, it is still possible that she will have a positive result on the mammography: 990 out of every 9,900 such women will have a positive result on the mammography.”

Even though the effect of numerical format (probabilities vs. natural frequencies) is substantial, performance in the natural frequency condition still leaves room for improvement. A study conducted by Garcia-Retamero and Hoffrage (2013) showed that grids displaying numerical information graphically improved diagnostic inferences in both doctors and their patients beyond the effect of natural frequencies (see also Brase, 2014, for similar results in young adults). The authors showed that these grids not only increased objective accuracy but also increased perceived usefulness of information and decreased perceived task difficulty. The aim of the current research was to extend this literature by investigating whether visual aids also help decision makers accurately assess their risk comprehension (metacognitive judgment calibration). In particular, we followed the method used by Garcia-Retamero and Hoffrage (2013) and investigated whether grids graphically displaying information about the predictive value of medical tests improve self-assessment of diagnostic inferences in patients.

Previous research showed that people can be highly overconfident when assessing the accuracy of their own judgments (Griffin and Brenner, 2004). For example, Dunning et al. (2004) conducted a systematic review of the literature on the topic and concluded that people’s self-views hold only a tenuous to modest relationship with their actual behavior and performance. On average, people say that they are “above average” in skill—a conclusion that defies statistical possibility for symmetric distributions of individuals (however, this conclusion is plausible if the mean and the median of a distribution are not identical; Gigerenzer et al., 2012). People also overestimate the likelihood that they will engage in desirable behaviors and achieve favorable outcomes, they furnish overly optimistic estimates of when they will complete future projects, and they reach judgments with too much confidence.

People tend to be highly overconfident at low levels of accuracy yet relatively well calibrated at higher levels of accuracy—a result that suggests the presence of an “unskilled and unaware effect” (Ehrlinger and Dunning, 2003; Ehrlinger et al., 2008). This result is consistent with research on individuals with low numeracy (i.e., the ability to accurately interpret numerical information about risk; Ancker and Kaufman, 2007; Fagerlin et al., 2007; Reyna et al., 2009; Galesic and Garcia-Retamero, 2010; Cokely et al., 2012; Peters, 2012). This research shows that people with low numeracy are especially inaccurate when evaluating the accuracy of their own judgments, showing overconfidence (Ghazal et al., 2014), and are not able to use risk reduction information to adjust their estimates (Schwartz et al., 1997). Overconfidence mediates, at least in part, the effect of numeracy on judgment accuracy (Ghazal et al., 2014). Thus, people with low numeracy may struggle to grasp numerical concepts that are essential for understanding health-relevant information because they have difficulties assessing the accuracy of their own estimates.

Our hypothesis is that visual aids can improve both accuracy of diagnostic inferences and metacognitive judgment calibration (i.e., how well patients assess the accuracy of these inferences) (H₁). We also hypothesize that visual aids may be especially useful for patients with low numeracy (H₂). Visual aids can increase the likelihood that less numerate patients deliberate on the available risk information, elaborating more on the problem at hand and on their own understanding of the problem (Garcia-Retamero and Cokely, 2013, 2014b). Deliberation tends to be important for risk understanding because it promotes more thorough, complex, and durable information representations (Cokely and Kelley, 2009)—an important component of metacognitive judgment calibration (Thompson et al., 2011). By influencing encoding and representation, visual aids can increase metacognitive judgment calibration, reducing overconfidence. Improvements in metacognitive processes can, in turn, improve the accuracy of inferences (H₃).

Materials and Methods

Participants

Participants included 108 patients recruited from four hospitals in the cities of Jaén and Granada (Spain) during treatment consultation. To be eligible for recruitment, patients had to have no previous formal medical training. If they agreed to participate, they were provided with an introductory letter describing the purpose of the study and their questions were answered. Eighty four percent of the patients who had been approached (n = 128) agreed to participate in the study. Those who refused mentioned one or more of the following reasons: respondent burden, lack of interest in research, and/or busy schedules. Patients had an average age of 52 years (range 19–76), and 78% were females. Most of the patients (86%) had a high school degree or less, and only 14% had a university education before participating in the study. Twenty-three percent of the patients had a chronic condition (e.g., allergies or diabetes). Patients received €20 for participating in the study and were assigned randomly to one of two groups. Male and female patients were evenly distributed in the groups. The Ethics Committee of the University of Granada approved the methodology, and all patients consented to participation through a consent form at the beginning of the study.

Materials and Procedure

Patients completed a two-part paper-and-pencil questionnaire. In the first part, they were presented with three tasks involving different diagnostic inferences: inferring breast cancer from a positive mammogram, colon cancer from a positive hemoccult test, and insulin-dependent diabetes from a genetic test. The order of the three tasks was randomized, independently for each patient. Wording and length of the tasks were comparable to the variant of the breast cancer task that we provided in the introduction of the current article. The information about the sensitivity and false-positive rate of the tests and prevalence of the diseases was taken from published studies (Hoffrage and Gigerenzer, 1998; Garcia-Retamero and Hoffrage, 2013) and was reported in natural frequencies (see Table 1). There were no time constraints, but the questionnaire took approximately 15 min to complete.

TABLE 1.

Information about prevalence of the diseases, and sensitivity and false-positive rate of the tests.

Diagnostic task	Base rate	Sensitivity	False-positive rate	Positive predictive value
Breast cancer	100 of 10,000	80 of 100	990 of 9,900	80 of 1,070
Colon cancer	30 of 10,000	15 of 30	299 of 9,970	15 of 314
Diabetes	50 of 10,000	48 of 50	4,975 of 9,950	48 of 5,023

Open in a new tab

Note that the false-positive rate is the complement of the specificity.

Half of the patients received the information about the sensitivity and false-positive rate of the tests and prevalence of the diseases in numbers without a visual aid. The other half received numbers along with a grid representing the numerical information. Figure 1 presents the grid that patients received in the mammography task. The visual display represented the number of women who obtained a positive mammogram, the number of women who have breast cancer, and the overall number of women at risk. Women were depicted as squares as previous research has found no differences in effects of arrays with faces compared to more abstract symbols such as squares or circles (Stone et al., 2003; Gaissmaier et al., 2012).

**Visual aid representing the overall number of women at risk, the number of women who have breast cancer, and the number of women who obtained a positive mammogram**.

After having received the information about the sensitivity and false-positive rate of the test and the base rate of the disease for a given task, patients made a diagnostic inference. In the breast cancer task, patients were told: “Imagine a representative sample of women who got a positive result on the mammography. Give your best guess: how many of these women do you expect to have breast cancer?” Patients were asked to provide two numbers such as X out of Y (leaving it up to them which denominator to use). After making the three diagnostic inferences, patients estimated accuracy of their diagnostic inferences. In particular, they estimated the number of correct inferences that they thought they had made on a scale ranging from 0 to 3. The second part of the questionnaire included a measure of numerical skills using twelve items taken from Schwartz et al. (1997) and Lipkus et al. (2001; see Cokely et al., 2012, for a review).

Design and Dependent Variables

We employed a mixed design with one independent variable manipulated experimentally between-groups: information format (numerical only vs. numerical and visual). In addition, we considered one independent variable that was not manipulated experimentally but measured, namely numeracy. We split patients into two groups according to the median of their numeracy scores. The low-numeracy group (n = 52) included patients with eight or fewer correct answers, while the high-numeracy group (n = 56) included those with nine or more correct answers (see Peters et al., 2006; Garcia-Retamero and Galesic, 2010b; and Garcia-Retamero and Cokely, 2014a, for a similar procedure).

Patients answered questions about the three tasks involving different diagnostic inferences. We used patients’ answers to the questions to determine our three dependent variables. Objective accuracy was measured as the percentage of correct inferences in the three tasks. Following Gigerenzer and Hoffrage (1995; see also Hoffrage et al., 2000), a response was considered accurate if it matched the value specified in the last column of Table 1 plus/minus one percentage point. A more liberal criterion than the one that we used in our analyses yielded similar findings to those reported in the results section. Estimated accuracy was measured as the estimated percentage of correct inferences in the three tasks. Finally, metacognitive judgment calibration was determined for each patient by computing the difference between estimated accuracy and objective accuracy (see Ghazal et al., 2014, for a similar method).

Analyses

First, we conducted analyses of variance (ANOVAs) to assess the effect of information format and numeracy on objective accuracy, estimated accuracy, and metacognitive judgment calibration (H₁ and H₂). Second, we assessed whether metacognitive judgment calibration explains the effect of information format and numeracy on objective accuracy (H₃). In particular, we conducted an analysis of covariance (ANCOVA) to assess the effect of information format and numeracy on objective accuracy after controlling for metacognitive judgment calibration. We also conducted mediational analyses to assess whether the effect of information format and numeracy on objective accuracy was mediated by metacognitive judgment calibration.

Finally, to find additional support of our hypothesis (H₃) and address an alternative explanation of our results, we investigated whether objective accuracy explains the effect of information format and numeracy on metacognitive judgment calibration. In particular, we conducted an ANCOVA to assess the effect of information format and numeracy on metacognitive judgment calibration after controlling for objective accuracy. In addition, we conducted mediational analyses to assess whether the effect of information format and numeracy on metacognitive judgment calibration was mediated by objective accuracy. As this alternative model seems plausible, we compared the size of its indirect effect (i.e., the amount of mediation) with that of the model with metacognitive judgment calibration as a mediator. Numeracy was included as a dichotomous variable in the ANOVAs and ANCOVAs and as a continuous variable in the mediation analyses. We found consistent results in these analyses (for a similar method, see Peters et al., 2006; Garcia-Retamero and Galesic, 2009, 2010b; and Garcia-Retamero and Cokely, 2014a).

Results

How did patients perform in the diagnostic inference tasks? And how did they think they had performed in the tasks? The percentage of patients who answered correctly three, two, one, and zero tasks was 24, 19, 18, and 39% respectively. In contrast, 50, 25, 19, and 6% of the patients estimated that they had made three, two, one, and zero correct diagnostic inferences, respectively. Only 34% of the patients were accurate when assessing the accuracy of their inferences (i.e., they were well calibrated); 38% overestimated accuracy in one task (33%); 18% overestimated accuracy in two tasks (67%); 6% overestimated accuracy in three tasks (100%); and 4% underestimated accuracy. Patients who achieved higher levels of accuracy were well calibrated, whereas patient with low levels of accuracy were highly overconfident (see Figure 2).

**Estimated accuracy by objective accuracy.** Error bars indicate one standard error of the mean.

Do visual aids and numeracy affect objective accuracy? Are visual aids especially useful for patients with low numeracy? Patients made more accurate inferences when the information was presented both numerically and visually (55% correct inferences) as compared to numerically only (32%) (H₁). In addition, patients with high numeracy were more accurate (51% correct inferences) as compared to low-numerate patients (35%). Finally, grids displaying numerical information were particularly useful additions for patients with low numeracy (see Figure 3). In contrast, there was only a minor increase in accuracy in patients with high numeracy when they received the additional visual display (H₂). In line with these results, the ANOVA with information format and numeracy as between-subjects factors and objective accuracy across the three tasks as the dependent variable revealed a main effect of information format, F_1,104 = 10.77, p = 0.001, $η_{p}^{2}$ = 0.09, and numeracy, F_1,104 = 5.79, p = 0.02, $η_{p}^{2}$ = 0.05. The interaction between information format and numeracy was also significant, F_1,104 = 3.82, p = 0.05, $η_{p}^{2}$ = 0.04.

**Objective accuracy, estimated accuracy, and metacognitive judgment calibration across the three diagnostic tasks by information format and numeracy.** Error bars indicate one standard error of the mean.

Do visual aids and numeracy affect estimated accuracy and metacognitive judgment calibration? Are visual aids especially useful for patients with low numeracy? Estimates of accuracy were not influenced by information format or numeracy. On average, patients estimated that 72% of their inferences were correct (see Figure 3). In contrast, both information format and numeracy had an effect on accuracy of estimates (i.e., metacognitive judgment calibration) (H₁). Grids displaying numerical information improved metacognitive judgment calibration in patients with low numeracy. These patients more accurately estimated the accuracy of their own inferences when they received the visual aid. However, the beneficial effect of the visual aid could not be observed in patients with high numeracy (H₂). These patients were relatively well calibrated regardless of information format. In line with these results, the ANOVA with information format and numeracy as between-subjects factors and estimated accuracy of diagnostic inferences across the three tasks as a dependent variable did not reveal any significant results (F < 1). In contrast, the ANOVA with information format and numeracy as between-subjects factors and metacognitive judgment calibration as a dependent variable revealed a main effect of information format, F_1,104 = 14.62, p = 0.001, $η_{p}^{2}$ = 0.12, and numeracy, F_1,104 = 5.28, p = 0.02, $η_{p}^{2}$ = 0.05, and an interaction between information format and numeracy, F_1,104 = 10.22, p = 0.002, $η_{p}^{2}$ = 0.09.

Does metacognitive judgment calibration explain the effect of information format and numeracy on objective accuracy? Visual aids do not improve objective accuracy in patients with low numeracy when metacognitive judgment calibration has been controlled for statistically (see Figure 4). In line with these results, the ANCOVA with information format and numeracy as between-subjects factors, objective accuracy across the three tasks as the dependent variable, and metacognitive judgment calibration as a covariate only revealed a main effect of metacognitive judgment calibration, F_1,103 = 37.25, p = 0.001, $η_{p}^{2}$ = 0.27. The main effect of information format and numeracy and the interaction between information format and numeracy was no longer significant (F < 1).

**Objective accuracy across the three diagnostic tasks by information format and numeracy after controlling for the effect of metacognitive judgment calibration.** Metacognitive judgment calibration across the three diagnostic tasks by information format and numeracy after controlling for the effect of objective accuracy. Error bars indicate one standard error of the mean.

To ensure comparability with results in the ANOVA and ANCOVA, in mediational analyses we first modeled objective accuracy when patients received information in numbers and then when they received an additional visual display representing the numerical information. In the numerical condition, regression analyses showed that numeracy influenced both metacognitive judgment calibration, β = –0.56, t₅₃ = –4.97, p = 0.001, and objective accuracy, β = 0.46, t₅₃ = 3.82, p = 0.001, whereby patients who were more numerate more accurately assessed the accuracy of their inferences (i.e., were better calibrated) and made more accurate inferences (see Figure 5A). In addition, metacognitive judgment calibration was related to objective accuracy, β = –0.52, t₅₂ = –4.04, p = 0.001. Patients who more accurately assessed the accuracy of their inferences also made more accurate inferences. When metacognitive judgment calibration was included in the regression analyses, the effect of numeracy on objective accuracy was significantly reduced and was no longer significant, β = 0.17, t₅₂ = 1.30, p = 0.20. The results of the Sobel test indicated that metacognitive judgment calibration mediates the relationship between numeracy and objective accuracy, z = 3.135, p = 0.001 [Effect = 0.30, 95% CI (0.27,0.33); AIC (Akaike Information Criterion) = 998.80]. When patients received the additional visual aid representing the numerical information, numeracy did not influence metacognitive judgment calibration, β = 0.10, t₅₁ = 0.70, p = 0.49, or objective accuracy, β = 0.14, t₅₁ = 1.01, p = 0.32 (see Figure 5B). As expected, metacognitive judgment calibration was again related to objective accuracy, β = –0.53, t₅₀ = –4.48, p = 0.001.

**Path analyses.** Effect of numeracy on objective accuracy and the mediational effect of metacognitive judgment calibration when **(A)** patients received information only in numbers and when **(B)** they received an additional visual display representing the numerical information. Effect of numeracy on metacognitive judgment calibration and the mediational effect of objective accuracy when **(C)** patients received information only in numbers and when **(D)** they received an additional visual display representing the numerical information. Note: Standardized coefficients are shown. *p < 0.05.

Does objective accuracy explain the effect of information format and numeracy on metacognitive judgment calibration? Visual aids improve metacognitive judgment calibration in patients with low numeracy after objective accuracy has been controlled for statistically (see Figure 4). The ANCOVA with information format and numeracy as between-subjects factors, metacognitive judgment calibration across the three tasks as the dependent variable, and objective accuracy as a covariate revealed a main effect of objective accuracy, F_1,103 = 37.25, p = 0.001, $η_{p}^{2}$ = 0.27, and information format, F_1,103 = 5.56, p = 0.020, $η_{p}^{2}$ = 0.05, and an interaction between information format and numeracy, F_1,103 = 6.24, p = 0.014, $η_{p}^{2}$ = 0.06.

As expected, in the numerical condition, regression analyses showed that objective accuracy was related to metacognitive judgment calibration, β = –0.46, t₅₂ = –4.04, p = 0.001 (see Figure 5C). Patients who made more accurate inferences also more accurately assessed the accuracy of these inferences. When objective accuracy was included in the regression analyses, the effect of numeracy on metacognitive judgment calibration was reduced but it was still significant, β = –0.35, t₅₂ = –3.12, p = 0.003. The results of the Sobel test indicated that objective accuracy mediates the relationship between numeracy and metacognitive judgment calibration, z = –2.78, p = 0.003. However, the size of the indirect effect [Effect = –0.21, 95% CI (–0.24, –0.18)] was smaller and AIC (AIC = 1057.30) was larger to that of the previous model. These results suggest that the model including objective accuracy as a mediator is a worse model than the model including metacognitive judgment calibration as a mediator.

In line with previous results, when patients received the additional visual aid representing the numerical information, objective accuracy was related to metacognitive judgment calibration, β = –0.54, t₅₀ = –4.48, p = 0.001 (see Figure 5D). In sum, results in ANCOVAs and mediational analyses suggest that metacognitive judgment calibration mediates the effect of numeracy on objective accuracy (H₃) and not the other way around. Thus these analyses suggest that, in the numerical condition, highly numerate patients make more accurate inferences than patients with low numeracy because they more accurately evaluate the accuracy of their own inferences. In contrast, in the visual condition, patients at all levels of numeracy showed similar high-levels of metacognitive judgment calibration and, in turn, high-levels of inferential accuracy.

Discussion

We investigated patients’ diagnostic inferences about the predictive value of medical tests from information about the sensitivity and false-positive rate of the tests and the prevalence of several diseases. Our results showed that many patients—especially those with low numeracy—made incorrect inferences about the predictive value of the tests and dramatically overestimated the accuracy of these inferences. High overestimates at low levels of accuracy become more calibrated at higher levels of accuracy—a result that suggests the presence of an “unskilled and unaware effect” (see also Ehrlinger and Dunning, 2003; Ehrlinger et al., 2008; Ghazal et al., 2014).

Our results are compatible with previous evidence on the role of numeracy in understanding health-relevant risk communications and medical decision making (Fagerlin et al., 2007; Apter et al., 2008; Reyna et al., 2009; Peters, 2012; Garcia-Retamero and Galesic, 2013; Johnson and Tubau, 2015). Patients with low levels of numeracy have more difficulties interpreting numerical risks of side effects (Gardner et al., 2011), and they are more susceptible to being influenced by the way the health information is framed in problems involving probabilities (Peters et al., 2006; Peters and Levin, 2008; Garcia-Retamero and Galesic, 2010a, 2011; Galesic and Garcia-Retamero, 2011a)—presumably because they are more influenced by non-numerical information (e.g., mood states; Peters et al., 2007; Petrova et al., 2014). Compared to patients with high numeracy, less-numerate patients also tend to overestimate their risk of suffering from several diseases (Davids et al., 2004; Gurmankin et al., 2004), they are less able to use risk reduction information to adjust their risk estimates (e.g., screening data; Schwartz et al., 1997), they tend to overestimate benefits of uncertain treatments (Weinfurt et al., 2003; Garcia-Retamero and Galesic, 2010b), and they have more deficits in understanding the information necessary to follow dietary recommendations (Rothman et al., 2006). Compared to patients with high numeracy, less-numerate patients also tend to search for less information about their disease (Portnoy et al., 2010), and they often choose lower-quality health options (e.g., health insurance plans; Hibbard et al., 2007; Hanoch et al., 2010). As a consequence, they tend to suffer more comorbidity and take more prescribed drugs (Garcia-Retamero et al., 2015). Less-numerate doctors and patients also favor a paternalistic model of medical decision making, in which doctors are dominant and autonomous (Garcia-Retamero et al., 2014), and patients prefer not to participate and instead delegate decision making (Galesic and Garcia-Retamero, 2011b). This is troubling given that the paternalistic model of medical decision making is increasingly being questioned (Kaplan and Frosch, 2005).

Our research suggests a potential explanation of the link between numeracy and understanding of health-relevant quantitative information. Highly numerate patients might make more accurate inferences as compared to patients with low numeracy because they more accurately evaluate the accuracy of their own inferences (i.e., they show better metacognitive judgment calibration). Thus metacognitive judgment calibration might drive, at least in part, the numeracy-to-performance relationship. Previous research suggests that the link between numeracy and superior judgment and decision making might reflect differences in heuristic-based deliberation (e.g., deep elaborative processing; Cokely and Kelley, 2009; Cokely et al., 2012), affective numerical intuition (e.g., precise symbolic number mapping; Peters et al., 2006; Peters, 2012), and meaningful intuitive understanding (e.g., gist-based representation and reasoning; Reyna, 2004; Reyna et al., 2009; see Cokely et al., 2014, for a review). Our research extends this literature suggesting that there is also a tight link between numeracy, metacognition, and understanding of health-relevant numerical information (see Ghazal et al., 2014, for similar results in highly educated samples).

Our results are also compatible with a variety of studies indicating that judgment self-assessment can operate as a domain-general skill that correlates with—but that can also be seen as an independent predictor of—general abilities, personality traits, and cognitive performance (Stankov, 2000; Stankov and Lee, 2008; Schraw, 2010). Overall our results accord with metacognitive theory suggesting that metacognitive judgment calibration tends to be useful because it is instrumental in self-regulation—i.e., the monitoring and control of cognition (Nelson, 1990; Metcalfe and Finn, 2008). Related studies of factors like “feeling of correctness” show that confidence-type judgments predict differences in information search and elaboration. In addition to predicting judgments about the correctness of one’s answer, one’s feeling of correctness tends to be related to “rethinking” times and the likelihood of changing one’s initial answer during reasoning (Thompson et al., 2011). These studies suggest that factors related to how one uses and assesses judgment accuracy may often be essential components determining the extent to which one deliberates during judgment and decision making (Ghazal et al., 2014). For these and other reasons it seems likely that metacognition is an essential component of the ability to understand and make good decisions about risk (i.e., risk literacy; see www.RiskLiteracy.org).

Finally, our results can have important implications for medical practice as they suggest suitable ways to communicate quantitative medical data—especially to patients lacking numerical skills. Our research shows that visual aids improve both objective accuracy and metacognitive judgment calibration, especially in less numerate patients, eliminating differences between this group of patients and the more numerate group. In addition, our research suggests that visual aids increase objective accuracy by improving metacognitive judgment calibration. As we mentioned above, calibration can mediate the relationship between numeracy and superior performance. In the current research, however, this result only holds when patients received numerical information without a visual display. In contrast, metacognitive judgment calibration did not mediate the effect of numeracy on objective accuracy when patients received the additional visual aid representing the numerical information because numeracy was no longer as robustly related to accuracy of inferences. In the visual condition, both patients with low and high numeracy were often well calibrated and, in turn, often made accurate inferences. These results suggest that visual aids might improve risk understanding, at least in part, by improving metacognitive judgment calibration and reducing overestimates of accuracy.

It is also possible that the effect of visual aids on both judgment accuracy and metacognitive judgment calibration follow from the development of better cognitive representations, which, in turn, facilitate reasoning and metacognitive monitoring (see Cosmides and Tooby, 1996; Brase et al., 1998; Brase, 2009). For instance, more cues available in memory can be used to explore essential relationships or to recognize that one has some missing knowledge. This conclusion is compatible with previous research indicating that visual aids help less numerate people identify and infer essential aspects of the risk information (e.g., “gross-level information”; Feldman-Stewart et al., 2000; Zikmund-Fisher et al., 2010). Visual aids also increase the ability of less numerate people to recognize superordinate classes, making part-to-whole relations in the data visually available (Ancker et al., 2006; Reyna and Brainerd, 2008). Moreover, visual aids improve risk comprehension by increasing the likelihood that less numerate people deliberate on the available risk information (Garcia-Retamero and Cokely, 2013, 2014b). By influencing memory encoding and representation, visual aids can also give rise to enduring changes in attitudes and behavioral intentions, which in turn affect behavior and risky decision making (Garcia-Retamero and Cokely, 2011, 2014a, 2015). Thus visual aids can improve judgment and decision making and help promote healthy behavior by improving understanding of health-relevant numerical information, by improving assessments of the accuracy of inferences about this information, and by establishing enduring attitudes and fostering intentions to perform the behavior, which may further promote understanding and self-assessment.

As with any research, our study has some limitations and leaves open several questions for future research. For instance, objective accuracy and metacognitive judgment calibration were correlated as the former was included in the measurement of the latter. To the extent that judgment calibration cannot be defined independently of objective accuracy, these concepts are not independent. So any results in this area need to be benchmarked accordingly. Nevertheless, our analyses showed that information format and numeracy have a significant effect on metacognitive judgment calibration even after objective accuracy has been controlled for statistically.

It is important to mention that our conclusions are based on patients’ diagnostic inferences and estimates when they received information about prevalence of several diseases, and the sensitivity and false-positive rate of the tests in natural frequencies (Hoffrage and Gigerenzer, 1998; Hoffrage et al., 2000, 2002). Future research could investigate these inferences and estimates when the information is reported in other numerical formats (e.g., probabilities). In addition, future research could also investigate whether these inferences and estimates affect behavioral intentions and actual behavior (e.g., whether patients indicate that they would take a medical test depending on the way the information about the test is communicated and if expressed interest exceeds actual uptake). Our sample of patients was older and less educated than the general population in Spain and other countries. Future research could also examine whether visual aids confer similar results in more educated participants (e.g., physicians) in different countries. Finally, future research could investigate whether the general findings hold across different types of visual aids (e.g., icon arrays, bar charts, and line plots), when visual aids are provided instead of rather than in addition to numerical information, and when visual aids differ in iconicity (i.e., when they are more or less abstract). In accord with the growing body of research, we predict that simple, well-designed visual aids will show substantial benefits in many situations, especially when communicating with less numerate individuals.

Author Contributions

All authors listed on the manuscript have contributed sufficiently to the project to be included as authors. All authors conceptualized the study, obtained funding, and wrote the paper. All authors approved the final version of the manuscript.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The current research was funded by the Ministerio de Economía y Competitividad (Spain) (PSI2011-22954 and PSI2014-51842-R), the National Science Foundation (USA) (SES-1253263), and the Swiss National Science Foundation (100014_140503). The authors declare independence from these funding agencies and do not have conflicts of interest including financial interests, activities, relationships, and affiliations.

References

Ancker J. S., Kaufman D. (2007). Rethinking health numeracy: a multidisciplinary literature review. J. Am. Med. Inform. Assoc. 14, 713–721. 10.1197/jamia.M2464 [DOI] [PMC free article] [PubMed] [Google Scholar]
Ancker J. S., Senathirajah Y., Kukafka R., Starren J. B. (2006). Design features of graphs in health risk communication: a systematic review. J. Am. Med. Inform. Assoc. 13, 608–618. 10.1197/jamia.M2115 [DOI] [PMC free article] [PubMed] [Google Scholar]
Ancker J. S., Weber E. U., Kukafka R. (2011). Effect of arrangement of stick figures on estimates of proportion in risk graphics. Med. Decis. Mak. 31, 143–150. 10.1177/0272989X10369006 [DOI] [PMC free article] [PubMed] [Google Scholar]
Apter A. J., Paasche-Orlow M. K., Remillard J. T., Bennett I. M., Ben-Joseph E. P., Batista R. M., et al. (2008). Numeracy and communication with patients: they are counting on us. J. Gen. Intern. Med. 23, 2117–2224. 10.1007/s11606-008-0803-x [DOI] [PMC free article] [PubMed] [Google Scholar]
Brase G. L. (2009). Pictorial representations in statistical reasoning. Appl. Cogn. Psychol. 23, 369–381. 10.1002/acp.1460 [DOI] [Google Scholar]
Brase G. L. (2014). The power of representation and interpretation: doubling statistical reasoning performance with icons and frequentist interpretations of ambiguous numbers. J. Cogn. Psychol. 26, 81–97. 10.1080/20445911.2013.861840 [DOI] [Google Scholar]
Brase G. L., Cosmides L., Tooby J. (1998). Individuation, counting, and statistical inference: the role of frequency and whole-object representations in judgment under uncertainty. J. Exp. Psychol. Gen. 127, 3–21. 10.1037/0096-3445.127.1.3 [DOI] [Google Scholar]
Cokely E. T., Galesic M., Schulz E., Ghazal S., Garcia-Retamero R. (2012). Measuring risk literacy: the Berlin Numeracy Test. Judgm. Decis. Mak. 7, 25–47. [Google Scholar]
Cokely E. T., Ghazal S., Garcia-Retamero R. (2014). “Measuring numeracy,” in Numerical Reasoning in Judgments and Decision Making about Health, eds Anderson B. L., Schulkin J. (Cambridge: Cambridge University Press; ), 11–38. [Google Scholar]
Cokely E. T., Kelley C. M. (2009). Cognitive abilities and superior decision making under risk: a protocol analysis and process model evaluation. Judgm. Decis. Mak. 4, 20–33. [Google Scholar]
Cosmides L., Tooby J. (1996). Are humans good intuitive statisticians after all? rethinking some conclusions from the literature on judgment under uncertainty. Cognition 58, 1–73. 10.1016/0010-0277(95)00664-8 [DOI] [Google Scholar]
Cox D. S., Cox A. D., Sturm L., Zimet G. (2010). Behavioral interventions to increase HPV vaccination acceptability among mothers of young girls. Health Psychol. 29, 29–39. 10.1037/a0016942 [DOI] [PubMed] [Google Scholar]
Davids S. L., Schapira M. M., McAuliffe T. L., Nattinger A. B. (2004). Predictors of pessimistic breast cancer risk perceptions in a primary care population. J. Gen. Intern. Med. 19, 310–315. 10.1111/j.1525-1497.2004.20801.x [DOI] [PMC free article] [PubMed] [Google Scholar]
Dunning D., Heath C., Suls J. M. (2004). Flawed self-assessment implications for health, education, and the workplace. Psychol. Sci. Public Interest 5, 69–106. 10.1111/j.1529-1006.2004.00018.x [DOI] [PubMed] [Google Scholar]
Eddy D. M. (1982). “Probabilistic reasoning in clinical medicine: problems and opportunities,” in Judgment Under Uncertainty: Heuristics and Biases, ed. Kahneman D. (Cambridge: Cambridge University Press; ), 249–267. 10.1017/CBO9780511809477.019 [DOI] [Google Scholar]
Ehrlinger J., Dunning D. (2003). How chronic self-views influence (and potentially mislead) estimates of performance. J. Pers. Soc. Psychol. 84, 5–17. 10.1037/0022-3514.84.1.5 [DOI] [PubMed] [Google Scholar]
Ehrlinger J., Johnson K., Banner M., Dunning D., Kruger J. (2008). Why the unskilled are unaware: further explorations of (absent) self-insight among the incompetent. Organ. Behav. Hum. Decis. Process. 105, 98–121. 10.1016/j.obhdp.2007.05.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
Ellis K. M., Cokely E. T., Ghazal S., Garcia-Retamero R. (2014). Do people understand their home HIV test results? risk literacy and information search. Proc. Hum. Fact. Ergon. Soc. Ann. Meet. 58, 1323–1327. 10.1177/1541931214581276 [DOI] [Google Scholar]
Fagerlin A., Ubel P. A., Smith D. M., Zikmund-Fisher B. J. (2007). Making numbers matter: present and future research in risk communication. Am. J. Health Behav. 31, S47–S56. 10.5993/ajhb.31.s1.7 [DOI] [PubMed] [Google Scholar]
Fagerlin A., Wang C., Ubel P. A. (2005). Reducing the influence of anecdotal reasoning on people’s health care decisions: is a picture worth a thousand statistics? Med. Decis. Mak. 25, 398–405. 10.1177/0272989X05278931 [DOI] [PubMed] [Google Scholar]
Feldman-Stewart D., Brundage M. D., Zotov V. (2007). Further insight into the perception of quantitative information: judgments of gist in treatment decisions. Med. Decis. Mak. 27, 34–43. 10.1177/0272989X06297101 [DOI] [PubMed] [Google Scholar]
Feldman-Stewart D., Kocovski N., McConnell B. A., Brundage M. D., Mackillop W. J. (2000). Perception of quantitative information for treatment decisions. Med. Decis. Mak. 20, 228–238. 10.1177/0272989X0002000208 [DOI] [PubMed] [Google Scholar]
Fischhoff B., Brewer N. T., Downs J. S. (2012). Communicating Risks and Benefits: An Evidence Based User’s Guide. Silver Spring, MD: US Department of Health and Human Service, Food and Drug Administration. [Google Scholar]
Gaissmaier W., Wegwarth O., Skopec D., Müller A., Broschinski S., Politi M. C. (2012). Numbers can be worth a thousand pictures: individual differences in understanding graphical and numerical representations of health-related information. Health Psychol. 31, 286–296. 10.1037/a0024850 [DOI] [PubMed] [Google Scholar]
Galesic M., Garcia-Retamero R. (2010). Statistical numeracy for health: a cross-cultural comparison with probabilistic national samples. Arch. Intern. Med. 170, 462–468. 10.1001/archinternmed.2009.481 [DOI] [PubMed] [Google Scholar]
Galesic M., Garcia-Retamero R. (2011a). Communicating consequences of risky behaviors: life expectancy versus risk of disease. Patient Educ. Couns. 82, 30–35. 10.1016/j.pec.2010.02.008 [DOI] [PubMed] [Google Scholar]
Galesic M., Garcia-Retamero R. (2011b). Do low-numeracy people avoid shared decision making? Health Psychol. 30, 336–341. 10.1037/a0022723 [DOI] [PubMed] [Google Scholar]
Garcia-Retamero R., Andrade A., Sharit J., Ruiz J. G. (2015). Is patient’s numeracy related to physical and mental health? Med. Decis. Mak. 35, 501–511. 10.1177/0272989X15578126 [DOI] [PubMed] [Google Scholar]
Garcia-Retamero R., Cokely E. T. (2011). Effective communication of risks to young adults: using message framing and visual aids to increase condom use and STD screening. J. Exp. Psychol. Appl. 17, 270–287. 10.1037/a0023677 [DOI] [PubMed] [Google Scholar]
Garcia-Retamero R., Cokely E. T. (2013). Communicating health risks with visual aids. Curr. Dir. Psychol. Sci. 22, 392–399. 10.1177/0963721413491570 [DOI] [Google Scholar]
Garcia-Retamero R., Cokely E. T. (2014a). The influence of skills, message frame, and visual aids on prevention of sexually transmitted diseases. J. Behav. Decis. Mak. 27, 179–189. 10.1002/bdm.1797 [DOI] [Google Scholar]
Garcia-Retamero R., Cokely E. T. (2014b). “Using visual aids to help people with low numeracy make better decisions,” in Numerical Reasoning in Judgments and Decision Making about Health, eds Anderson B. L., Schulkin J. (Cambridge: Cambridge University Press; ), 153–174. [Google Scholar]
Garcia-Retamero R., Cokely E. T. (2015). Simple but powerful health messages for increasing condom use in young adults. J. Sex Res. 52, 30–42. 10.1080/00224499.2013.806647 [DOI] [PubMed] [Google Scholar]
Garcia-Retamero R., Galesic M. (2009). Communicating treatment risk reduction to people with low numeracy skills: a cross-cultural comparison. Am. J. Public Health 99, 2196–2202. 10.2105/AJPH.2009.160234 [DOI] [PMC free article] [PubMed] [Google Scholar]
Garcia-Retamero R., Galesic M. (2010a). How to reduce the effect of framing on messages about health. J. Gen. Intern. Med. 25, 1323–1329. 10.1007/s11606-010-1484-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
Garcia-Retamero R., Galesic M. (2010b). Who profits from visual aids: overcoming challenges in people’s understanding of risks. Soc. Sci. Med. 70, 1019–1025. 10.1016/j.socscimed.2009.11.031 [DOI] [PubMed] [Google Scholar]
Garcia-Retamero R., Galesic M. (2011). Using plausible group sizes to communicate information about medical risks. Patient Educ. Couns. 84, 245–250. 10.1016/j.pec.2010.07.027 [DOI] [PubMed] [Google Scholar]
Garcia-Retamero R., Galesic M. (2013). Transparent Communication of Health Risks: Overcoming Cultural Differences. New York: Springer; 10.1007/978-1-4614-4358-2 [DOI] [Google Scholar]
Garcia-Retamero R., Galesic M., Gigerenzer G. (2010). Do icon arrays help reduce denominator neglect? Med. Decis. Mak. 30, 672–684. 10.1177/0272989X10369000 [DOI] [PubMed] [Google Scholar]
Garcia-Retamero R., Hoffrage U. (2013). Visual representation of statistical information improves diagnostic inferences in doctors and their patients. Soc. Sci. Med. 83, 27–33. 10.1016/j.socscimed.2013.01.034 [DOI] [PubMed] [Google Scholar]
Garcia-Retamero R., Wicki B., Cokely E. T., Hanson B. (2014). Factors predicting surgeons’ preferred and actual roles in interactions with their patients. Health Psychol. 33, 920–928. 10.1037/hea0000061 [DOI] [PubMed] [Google Scholar]
Gardner P. H., McMillan B., Raynor D. K., Woolf E., Knapp P. (2011). The effect of numeracy on the comprehension of information about medicines in users of a patient information website. Patient Educ. Couns. 83, 398–403. 10.1016/j.pec.2011.05.006 [DOI] [PubMed] [Google Scholar]
Ghazal S., Cokely E. T., Garcia-Retamero R. (2014). Predicting biases in very highly educated samples: numeracy and metacognition. Judgm. Decis. Mak. 9, 15–34. [Google Scholar]
Gigerenzer G. (2013). How I got started teaching physicians and judges risk literacy. Appl. Cogn. Psych. 28, 612–614. 10.1002/acp.2980 [DOI] [Google Scholar]
Gigerenzer G., Fiedler K., Olsson H. (2012). “Rethinking cognitive biases and environmental consequences,” in Ecological Rationality: Intelligence in the World, eds Todd P. M., Gigerenzer G., the ABC Research Group (New York: Oxford University Press; ), 80–110. [Google Scholar]
Gigerenzer G., Hoffrage U. (1995). How to improve Bayesian reasoning without instruction: frequency formats. Psychol. Rev. 102, 684–704. 10.1037/0033-295X.102.4.684 [DOI] [Google Scholar]
Gigerenzer G., Hoffrage U. (1999). Overcoming difficulties in Bayesian reasoning: a reply to Lewis and Keren (1999) and Mellers and McGraw (1999). Psychol. Rev. 106, 425–430. 10.1037/0033-295X.106.2.425 [DOI] [Google Scholar]
Gillan D. J., Wickens C. D., Hollands J. G., Carswell C. M. (1998). Guidelines for presenting quantitative data in HFES publications. Hum. Fact. 40, 28–41. [Google Scholar]
Goodyear-Smith F., Arroll B., Chan L., Jackson R., Wells S., Kenealy T. (2008). Patients prefer pictures to numbers to express cardiovascular benefit from treatment. Ann. Fam. Med. 6, 213–217. 10.1370/afm.795 [DOI] [PMC free article] [PubMed] [Google Scholar]
Griffin D., Brenner L. (2004). “Perspectives on probability judgment calibration,” in Blackwell Handbook of Judgment and Decision Making, eds Koehler D. J., Harvey N. (Oxford: Blackwell; ), 177–199. 10.1002/9780470752937.ch9 [DOI] [Google Scholar]
Gurmankin A. D., Baron J., Armstrong K. (2004). Intended message versus message received in hypothetical physician risk communications: exploring the gap. Risk Anal. 24, 1337–1347. 10.1111/j.0272-4332.2004.00530.x [DOI] [PubMed] [Google Scholar]
Hanoch Y., Miron-Shatz T., Cole H., Himmelstein M., Federman A. D. (2010). Choice, numeracy, and physicians-in-training performance: the case of medicare part D. Health Psychol. 29, 454–459. 10.1037/a0019881 [DOI] [PMC free article] [PubMed] [Google Scholar]
Hibbard J. H., Peters E., Dixon A., Tusler M. (2007). Consumer competencies and the use of comparative quality information: it isn’t just about literacy. Med. Care Res. Rev. 64, 379–394. 10.1177/1077558707301630 [DOI] [PubMed] [Google Scholar]
Hoffrage U., Gigerenzer G. (1998). Using natural frequencies to improve diagnostic inferences. Acad. Med. 73, 538–540. 10.1097/00001888-199805000-00024 [DOI] [PubMed] [Google Scholar]
Hoffrage U., Gigerenzer G., Krauss S., Martignon L. (2002). Representation facilitates reasoning: what natural frequencies are and what they are not. Cognition 84, 343–352. 10.1016/S0010-0277(02)00050-1 [DOI] [PubMed] [Google Scholar]
Hoffrage U., Lindsey S., Hertwig R., Gigerenzer G. (2000). Communicating statistical information. Science 290, 2261–2262. 10.1126/science.290.5500.2261 [DOI] [PubMed] [Google Scholar]
Johnson E. D., Tubau E. (2015). Comprehension and computation in Bayesian problem solving. Front. Psychol. 6:938. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kaplan R. M., Frosch D. L. (2005). Decision making in medicine and health care. Annu. Rev. Clin. Psychol. 1, 525–556. 10.1146/annurev.clinpsy.1.102803.144118 [DOI] [PubMed] [Google Scholar]
Kurzenhäuser S., Hoffrage U. (2002). Teaching Bayesian reasoning: an evaluation of a classroom tutorial for medical students. Med. Teach. 24, 516–521. 10.1080/0142159021000012540 [DOI] [PubMed] [Google Scholar]
Lipkus I. M. (2007). Numeric, verbal, and visual formats of conveying health risks: suggested best practices and future recommendations. Med. Decis. Mak. 27, 696–713. 10.1177/0272989X07307271 [DOI] [PubMed] [Google Scholar]
Lipkus I. M., Hollands J. G. (1999). The visual communication of risk. J. Natl. Cancer Inst. Monogr. 25, 149–163. 10.1093/oxfordjournals.jncimonographs.a024191 [DOI] [PubMed] [Google Scholar]
Lipkus I. M., Samsa G., Rimer B. K. (2001). General performance on a numeracy scale among highly educated samples. Med. Decis. Mak. 21, 37–44. 10.1177/0272989X0102100105 [DOI] [PubMed] [Google Scholar]
Mandel D. R. (2014). Visual representation of rational belief revision: another look at the sleeping beauty problem. Front. Psychol. 5:1232. 10.3389/fpsyg.2014.01232 [DOI] [PMC free article] [PubMed] [Google Scholar]
Mandel D. R. (2015). Instruction in information structuring improves Bayesian judgment in intelligence analysts. Front. Psychol. 6:387. 10.3389/fpsyg.2015.00387 [DOI] [PMC free article] [PubMed] [Google Scholar]
Metcalfe J., Finn B. (2008). Evidence that judgments of learning are causally related to study choice. Psychon. Bull. Rev. 15, 174–179. 10.3758/PBR.15.1.174 [DOI] [PubMed] [Google Scholar]
Nelson T. O. (1990). Metamemory: a theoretical framework and new findings. Psychol. Learn. Motiv. 26, 125–173. 10.1016/S0079-7421(08)60053-5 [DOI] [Google Scholar]
Okan Y., Garcia-Retamero R., Cokely E. T., Maldonado A. (2012). Individual differences in graph literacy: overcoming denominator neglect in risk comprehension. J. Behav. Decis. Mak. 25, 390–401. 10.1002/bdm.751 [DOI] [Google Scholar]
Okan Y., Garcia-Retamero R., Cokely E. T., Maldonado A. (2015). Improving risk understanding across ability levels: encouraging active processing with dynamic icon arrays. J. Exp. Psychol. Appl. 21, 178–194. 10.1037/xap0000045 [DOI] [PubMed] [Google Scholar]
Paling J. (2003). Strategies to help patients understand risks. BMJ 327, 745–748. 10.1136/bmj.327.7417.745 [DOI] [PMC free article] [PubMed] [Google Scholar]
Peters E. (2012). Beyond comprehension the role of numeracy in judgments and decisions. Curr. Dir. Psychol. Sci. 21, 31–35. 10.1177/0963721411429960 [DOI] [Google Scholar]
Peters E., Dieckmann N., Dixon A., Hibbard J. H., Mertz C. K. (2007). Less is more in presenting quality information to consumers. Med. Care Res. Rev. 64, 169–190. 10.1177/10775587070640020301 [DOI] [PubMed] [Google Scholar]
Peters E., Levin I. P. (2008). Dissecting the risky-choice framing effect: numeracy as an individual-difference factor in weighting risky and riskless options. Judgm. Decis. Mak. 3, 435–448. [Google Scholar]
Peters E., Vastfjall D., Slovic P., Mertz C. K., Mazzocco K., Dickert S. (2006). Numeracy and decision making. Psychol. Sci. 17, 407–413. 10.1111/j.1467-9280.2006.01720.x [DOI] [PubMed] [Google Scholar]
Petrova D. G., Pligt J., Garcia-Retamero R. (2014). Feeling the numbers: on the interplay between risk, affect, and numeracy. J. Behav. Decis. Mak. 27, 191–199. 10.1002/bdm.1803 [DOI] [Google Scholar]
Portnoy D. B., Roter D., Erby L. H. (2010). The role of numeracy on client knowledge in BRCA genetic counseling. Patient Educ. Couns. 81, 131–136. 10.1016/j.pec.2009.09.036 [DOI] [PMC free article] [PubMed] [Google Scholar]
Reyna V. F. (2004). How people make decisions that involve risk: a dual-processes approach. Curr. Dir. Psychol. Sci. 13, 60–66. 10.1111/j.0963-7214.2004.00275.x [DOI] [Google Scholar]
Reyna V. F., Brainerd C. J. (2008). Numeracy, ratio bias, and denominator neglect in judgments of risk and probability. Learn. Individ. Differ. 18, 89–107. 10.1016/j.lindif.2007.03.011 [DOI] [Google Scholar]
Reyna V. F., Nelson W. L., Han P. K., Dieckmann N. F. (2009). How numeracy influences risk comprehension and medical decision making. Psychol. Bull. 135, 943–973. 10.1037/a0017327 [DOI] [PMC free article] [PubMed] [Google Scholar]
Rothman R. L., Housam R., Weiss H., Davis D., Gregory R., Gebretsadik T., et al. (2006). Patient understanding of food labels: the role of literacy and numeracy. Am. J. Prev. Med. 31, 391–398. 10.1016/j.amepre.2006.07.025 [DOI] [PubMed] [Google Scholar]
Schirillo J. A., Stone E. R. (2005). The greater ability of graphical versus numerical displays to increase risk avoidance involves a common mechanism. Risk Anal. 25, 555–566. 10.1111/j.1539-6924.2005.00624.x [DOI] [PubMed] [Google Scholar]
Schraw G. (2010). Measuring self-regulation in computer-based learning environments. Educ. Psychol. 45, 258–266. 10.1080/00461520.2010.515936 [DOI] [Google Scholar]
Schwartz L. M., Woloshin S., Black W. C., Welch H. G. (1997). The role of numeracy in understanding the benefit of screening mammography. Ann. Intern. Med. 127, 966–972. 10.7326/0003-4819-127-11-199712010-00003 [DOI] [PubMed] [Google Scholar]
Sedlmeier P., Gigerenzer G. (2001). Teaching Bayesian reasoning in less than two hours. J. Exp. Psychol. Gen. 130, 380–400. 10.1037/0096-3445.130.3.380 [DOI] [PubMed] [Google Scholar]
Spiegelhalter D., Pearson M., Short I. (2011). Visualizing uncertainty about the future. Science 333, 1393–1400. 10.1126/science.1191181 [DOI] [PubMed] [Google Scholar]
Stankov L. (2000). Structural extensions of a hierarchical view on human cognitive abilities. Learn. Individ. Differ. 12, 35–51. 10.1016/S1041-6080(00)00037-6 [DOI] [Google Scholar]
Stankov L., Lee J. (2008). Confidence and cognitive test performance. J. Educ. Psychol. 100, 961–976. 10.1037/a0012546 [DOI] [Google Scholar]
Stone E. R., Sieck W. R., Bull B. E., Yates F. J., Parks S. C., Rush C. J. (2003). Foreground: background salience: explaining the effects of graphical displays on risk avoidance. Organ. Behav. Hum. Decis. Process. 90, 19–36. 10.1016/S0749-5978(03)00003-7 [DOI] [Google Scholar]
Thompson V. A., Prowse Turner J. A., Pennycook G. (2011). Intuition, reason, and metacognition. Cogn. Psychol. 63, 107–140. 10.1016/j.cogpsych.2011.06.001 [DOI] [PubMed] [Google Scholar]
Trevena L., Zikmund-Fisher B., Edwards A., Gaissmaier W., Galesic M., Han P., et al. (2012). “Presenting probabilities,” in Update of the International Patient Decision Aids Standards (IPDAS) Collaboration’s Background Document, eds Volk R., Llewellyn-Thomas H. Available at: http://ipdas.ohri.ca/IPDAS-Chapter-C.pdf [Google Scholar]
Waters E. A., Weinstein N. D., Colditz G. A., Emmons K. M. (2007). Reducing aversion to side effects in preventive medical treatment decisions. J. Exp. Psychol. Appl. 13, 11–21. 10.1037/1076-898X.13.1.11 [DOI] [PubMed] [Google Scholar]
Weinfurt K. P., Castel L. D., Li Y., Sulmasy D. P., Balshem A. M., Benson A. B., et al. (2003). The correlation between patient characteristics and expectations of benefit from phase I clinical trials. Cancer 98, 166–175. 10.1002/cncr.11483 [DOI] [PubMed] [Google Scholar]
Zikmund-Fisher B. J. (2015). Stories of MDM: from a conversation to a career of making less data more useful. Med. Decis. Mak. 35, 1–3. 10.1177/0272989X14563576 [DOI] [PubMed] [Google Scholar]
Zikmund-Fisher B. J., Fagerlin A., Ubel P. A. (2008a). Improving understanding of adjuvant therapy options by using simpler risk graphics. Cancer 113, 3382–3390. 10.1002/cncr.23959 [DOI] [PMC free article] [PubMed] [Google Scholar]
Zikmund-Fisher B. J., Ubel P. A., Smith D. M., Derry H. A., McClure J. B., Stark A., et al. (2008b). Communicating side effect risks in a tamoxifen prophylaxis decision aid: the debiasing influence of pictographs. Patient Educ. Couns. 73, 209–214. 10.1016/j.pec.2008.05.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
Zikmund-Fisher B. J., Fagerlin A., Ubel P. A. (2010). A demonstration of “less can be more” in risk graphics. Med. Decis. Mak. 30, 661–671. 10.1177/0272989X10364244 [DOI] [PMC free article] [PubMed] [Google Scholar]
Zikmund-Fisher B. J., Witteman H. O., Dickson M., Fuhrel-Forbis A., Kahn V. C., Exe N. L., et al. (2014). Blocks, ovals, or people? icon type affects risk perceptions and recall of pictographs. Med. Decis. Mak. 34, 443–453. 10.1177/0272989X13511706 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B1] Ancker J. S., Kaufman D. (2007). Rethinking health numeracy: a multidisciplinary literature review. J. Am. Med. Inform. Assoc. 14, 713–721. 10.1197/jamia.M2464 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B2] Ancker J. S., Senathirajah Y., Kukafka R., Starren J. B. (2006). Design features of graphs in health risk communication: a systematic review. J. Am. Med. Inform. Assoc. 13, 608–618. 10.1197/jamia.M2115 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B3] Ancker J. S., Weber E. U., Kukafka R. (2011). Effect of arrangement of stick figures on estimates of proportion in risk graphics. Med. Decis. Mak. 31, 143–150. 10.1177/0272989X10369006 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B4] Apter A. J., Paasche-Orlow M. K., Remillard J. T., Bennett I. M., Ben-Joseph E. P., Batista R. M., et al. (2008). Numeracy and communication with patients: they are counting on us. J. Gen. Intern. Med. 23, 2117–2224. 10.1007/s11606-008-0803-x [DOI] [PMC free article] [PubMed] [Google Scholar]

[B5] Brase G. L. (2009). Pictorial representations in statistical reasoning. Appl. Cogn. Psychol. 23, 369–381. 10.1002/acp.1460 [DOI] [Google Scholar]

[B6] Brase G. L. (2014). The power of representation and interpretation: doubling statistical reasoning performance with icons and frequentist interpretations of ambiguous numbers. J. Cogn. Psychol. 26, 81–97. 10.1080/20445911.2013.861840 [DOI] [Google Scholar]

[B7] Brase G. L., Cosmides L., Tooby J. (1998). Individuation, counting, and statistical inference: the role of frequency and whole-object representations in judgment under uncertainty. J. Exp. Psychol. Gen. 127, 3–21. 10.1037/0096-3445.127.1.3 [DOI] [Google Scholar]

[B8] Cokely E. T., Galesic M., Schulz E., Ghazal S., Garcia-Retamero R. (2012). Measuring risk literacy: the Berlin Numeracy Test. Judgm. Decis. Mak. 7, 25–47. [Google Scholar]

[B9] Cokely E. T., Ghazal S., Garcia-Retamero R. (2014). “Measuring numeracy,” in Numerical Reasoning in Judgments and Decision Making about Health, eds Anderson B. L., Schulkin J. (Cambridge: Cambridge University Press; ), 11–38. [Google Scholar]

[B10] Cokely E. T., Kelley C. M. (2009). Cognitive abilities and superior decision making under risk: a protocol analysis and process model evaluation. Judgm. Decis. Mak. 4, 20–33. [Google Scholar]

[B11] Cosmides L., Tooby J. (1996). Are humans good intuitive statisticians after all? rethinking some conclusions from the literature on judgment under uncertainty. Cognition 58, 1–73. 10.1016/0010-0277(95)00664-8 [DOI] [Google Scholar]

[B12] Cox D. S., Cox A. D., Sturm L., Zimet G. (2010). Behavioral interventions to increase HPV vaccination acceptability among mothers of young girls. Health Psychol. 29, 29–39. 10.1037/a0016942 [DOI] [PubMed] [Google Scholar]

[B13] Davids S. L., Schapira M. M., McAuliffe T. L., Nattinger A. B. (2004). Predictors of pessimistic breast cancer risk perceptions in a primary care population. J. Gen. Intern. Med. 19, 310–315. 10.1111/j.1525-1497.2004.20801.x [DOI] [PMC free article] [PubMed] [Google Scholar]

[B14] Dunning D., Heath C., Suls J. M. (2004). Flawed self-assessment implications for health, education, and the workplace. Psychol. Sci. Public Interest 5, 69–106. 10.1111/j.1529-1006.2004.00018.x [DOI] [PubMed] [Google Scholar]

[B15] Eddy D. M. (1982). “Probabilistic reasoning in clinical medicine: problems and opportunities,” in Judgment Under Uncertainty: Heuristics and Biases, ed. Kahneman D. (Cambridge: Cambridge University Press; ), 249–267. 10.1017/CBO9780511809477.019 [DOI] [Google Scholar]

[B16] Ehrlinger J., Dunning D. (2003). How chronic self-views influence (and potentially mislead) estimates of performance. J. Pers. Soc. Psychol. 84, 5–17. 10.1037/0022-3514.84.1.5 [DOI] [PubMed] [Google Scholar]

[B17] Ehrlinger J., Johnson K., Banner M., Dunning D., Kruger J. (2008). Why the unskilled are unaware: further explorations of (absent) self-insight among the incompetent. Organ. Behav. Hum. Decis. Process. 105, 98–121. 10.1016/j.obhdp.2007.05.002 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B18] Ellis K. M., Cokely E. T., Ghazal S., Garcia-Retamero R. (2014). Do people understand their home HIV test results? risk literacy and information search. Proc. Hum. Fact. Ergon. Soc. Ann. Meet. 58, 1323–1327. 10.1177/1541931214581276 [DOI] [Google Scholar]

[B19] Fagerlin A., Ubel P. A., Smith D. M., Zikmund-Fisher B. J. (2007). Making numbers matter: present and future research in risk communication. Am. J. Health Behav. 31, S47–S56. 10.5993/ajhb.31.s1.7 [DOI] [PubMed] [Google Scholar]

[B20] Fagerlin A., Wang C., Ubel P. A. (2005). Reducing the influence of anecdotal reasoning on people’s health care decisions: is a picture worth a thousand statistics? Med. Decis. Mak. 25, 398–405. 10.1177/0272989X05278931 [DOI] [PubMed] [Google Scholar]

[B21] Feldman-Stewart D., Brundage M. D., Zotov V. (2007). Further insight into the perception of quantitative information: judgments of gist in treatment decisions. Med. Decis. Mak. 27, 34–43. 10.1177/0272989X06297101 [DOI] [PubMed] [Google Scholar]

[B22] Feldman-Stewart D., Kocovski N., McConnell B. A., Brundage M. D., Mackillop W. J. (2000). Perception of quantitative information for treatment decisions. Med. Decis. Mak. 20, 228–238. 10.1177/0272989X0002000208 [DOI] [PubMed] [Google Scholar]

[B23] Fischhoff B., Brewer N. T., Downs J. S. (2012). Communicating Risks and Benefits: An Evidence Based User’s Guide. Silver Spring, MD: US Department of Health and Human Service, Food and Drug Administration. [Google Scholar]

[B24] Gaissmaier W., Wegwarth O., Skopec D., Müller A., Broschinski S., Politi M. C. (2012). Numbers can be worth a thousand pictures: individual differences in understanding graphical and numerical representations of health-related information. Health Psychol. 31, 286–296. 10.1037/a0024850 [DOI] [PubMed] [Google Scholar]

[B25] Galesic M., Garcia-Retamero R. (2010). Statistical numeracy for health: a cross-cultural comparison with probabilistic national samples. Arch. Intern. Med. 170, 462–468. 10.1001/archinternmed.2009.481 [DOI] [PubMed] [Google Scholar]

[B26] Galesic M., Garcia-Retamero R. (2011a). Communicating consequences of risky behaviors: life expectancy versus risk of disease. Patient Educ. Couns. 82, 30–35. 10.1016/j.pec.2010.02.008 [DOI] [PubMed] [Google Scholar]

[B27] Galesic M., Garcia-Retamero R. (2011b). Do low-numeracy people avoid shared decision making? Health Psychol. 30, 336–341. 10.1037/a0022723 [DOI] [PubMed] [Google Scholar]

[B28] Garcia-Retamero R., Andrade A., Sharit J., Ruiz J. G. (2015). Is patient’s numeracy related to physical and mental health? Med. Decis. Mak. 35, 501–511. 10.1177/0272989X15578126 [DOI] [PubMed] [Google Scholar]

[B29] Garcia-Retamero R., Cokely E. T. (2011). Effective communication of risks to young adults: using message framing and visual aids to increase condom use and STD screening. J. Exp. Psychol. Appl. 17, 270–287. 10.1037/a0023677 [DOI] [PubMed] [Google Scholar]

[B30] Garcia-Retamero R., Cokely E. T. (2013). Communicating health risks with visual aids. Curr. Dir. Psychol. Sci. 22, 392–399. 10.1177/0963721413491570 [DOI] [Google Scholar]

[B31] Garcia-Retamero R., Cokely E. T. (2014a). The influence of skills, message frame, and visual aids on prevention of sexually transmitted diseases. J. Behav. Decis. Mak. 27, 179–189. 10.1002/bdm.1797 [DOI] [Google Scholar]

[B32] Garcia-Retamero R., Cokely E. T. (2014b). “Using visual aids to help people with low numeracy make better decisions,” in Numerical Reasoning in Judgments and Decision Making about Health, eds Anderson B. L., Schulkin J. (Cambridge: Cambridge University Press; ), 153–174. [Google Scholar]

[B33] Garcia-Retamero R., Cokely E. T. (2015). Simple but powerful health messages for increasing condom use in young adults. J. Sex Res. 52, 30–42. 10.1080/00224499.2013.806647 [DOI] [PubMed] [Google Scholar]

[B34] Garcia-Retamero R., Galesic M. (2009). Communicating treatment risk reduction to people with low numeracy skills: a cross-cultural comparison. Am. J. Public Health 99, 2196–2202. 10.2105/AJPH.2009.160234 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B35] Garcia-Retamero R., Galesic M. (2010a). How to reduce the effect of framing on messages about health. J. Gen. Intern. Med. 25, 1323–1329. 10.1007/s11606-010-1484-9 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B36] Garcia-Retamero R., Galesic M. (2010b). Who profits from visual aids: overcoming challenges in people’s understanding of risks. Soc. Sci. Med. 70, 1019–1025. 10.1016/j.socscimed.2009.11.031 [DOI] [PubMed] [Google Scholar]

[B37] Garcia-Retamero R., Galesic M. (2011). Using plausible group sizes to communicate information about medical risks. Patient Educ. Couns. 84, 245–250. 10.1016/j.pec.2010.07.027 [DOI] [PubMed] [Google Scholar]

[B38] Garcia-Retamero R., Galesic M. (2013). Transparent Communication of Health Risks: Overcoming Cultural Differences. New York: Springer; 10.1007/978-1-4614-4358-2 [DOI] [Google Scholar]

[B39] Garcia-Retamero R., Galesic M., Gigerenzer G. (2010). Do icon arrays help reduce denominator neglect? Med. Decis. Mak. 30, 672–684. 10.1177/0272989X10369000 [DOI] [PubMed] [Google Scholar]

[B40] Garcia-Retamero R., Hoffrage U. (2013). Visual representation of statistical information improves diagnostic inferences in doctors and their patients. Soc. Sci. Med. 83, 27–33. 10.1016/j.socscimed.2013.01.034 [DOI] [PubMed] [Google Scholar]

[B41] Garcia-Retamero R., Wicki B., Cokely E. T., Hanson B. (2014). Factors predicting surgeons’ preferred and actual roles in interactions with their patients. Health Psychol. 33, 920–928. 10.1037/hea0000061 [DOI] [PubMed] [Google Scholar]

[B42] Gardner P. H., McMillan B., Raynor D. K., Woolf E., Knapp P. (2011). The effect of numeracy on the comprehension of information about medicines in users of a patient information website. Patient Educ. Couns. 83, 398–403. 10.1016/j.pec.2011.05.006 [DOI] [PubMed] [Google Scholar]

[B43] Ghazal S., Cokely E. T., Garcia-Retamero R. (2014). Predicting biases in very highly educated samples: numeracy and metacognition. Judgm. Decis. Mak. 9, 15–34. [Google Scholar]

[B44] Gigerenzer G. (2013). How I got started teaching physicians and judges risk literacy. Appl. Cogn. Psych. 28, 612–614. 10.1002/acp.2980 [DOI] [Google Scholar]

[B45] Gigerenzer G., Fiedler K., Olsson H. (2012). “Rethinking cognitive biases and environmental consequences,” in Ecological Rationality: Intelligence in the World, eds Todd P. M., Gigerenzer G., the ABC Research Group (New York: Oxford University Press; ), 80–110. [Google Scholar]

[B46] Gigerenzer G., Hoffrage U. (1995). How to improve Bayesian reasoning without instruction: frequency formats. Psychol. Rev. 102, 684–704. 10.1037/0033-295X.102.4.684 [DOI] [Google Scholar]

[B47] Gigerenzer G., Hoffrage U. (1999). Overcoming difficulties in Bayesian reasoning: a reply to Lewis and Keren (1999) and Mellers and McGraw (1999). Psychol. Rev. 106, 425–430. 10.1037/0033-295X.106.2.425 [DOI] [Google Scholar]

[B48] Gillan D. J., Wickens C. D., Hollands J. G., Carswell C. M. (1998). Guidelines for presenting quantitative data in HFES publications. Hum. Fact. 40, 28–41. [Google Scholar]

[B49] Goodyear-Smith F., Arroll B., Chan L., Jackson R., Wells S., Kenealy T. (2008). Patients prefer pictures to numbers to express cardiovascular benefit from treatment. Ann. Fam. Med. 6, 213–217. 10.1370/afm.795 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B50] Griffin D., Brenner L. (2004). “Perspectives on probability judgment calibration,” in Blackwell Handbook of Judgment and Decision Making, eds Koehler D. J., Harvey N. (Oxford: Blackwell; ), 177–199. 10.1002/9780470752937.ch9 [DOI] [Google Scholar]

[B51] Gurmankin A. D., Baron J., Armstrong K. (2004). Intended message versus message received in hypothetical physician risk communications: exploring the gap. Risk Anal. 24, 1337–1347. 10.1111/j.0272-4332.2004.00530.x [DOI] [PubMed] [Google Scholar]

[B52] Hanoch Y., Miron-Shatz T., Cole H., Himmelstein M., Federman A. D. (2010). Choice, numeracy, and physicians-in-training performance: the case of medicare part D. Health Psychol. 29, 454–459. 10.1037/a0019881 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B53] Hibbard J. H., Peters E., Dixon A., Tusler M. (2007). Consumer competencies and the use of comparative quality information: it isn’t just about literacy. Med. Care Res. Rev. 64, 379–394. 10.1177/1077558707301630 [DOI] [PubMed] [Google Scholar]

[B54] Hoffrage U., Gigerenzer G. (1998). Using natural frequencies to improve diagnostic inferences. Acad. Med. 73, 538–540. 10.1097/00001888-199805000-00024 [DOI] [PubMed] [Google Scholar]

[B55] Hoffrage U., Gigerenzer G., Krauss S., Martignon L. (2002). Representation facilitates reasoning: what natural frequencies are and what they are not. Cognition 84, 343–352. 10.1016/S0010-0277(02)00050-1 [DOI] [PubMed] [Google Scholar]

[B56] Hoffrage U., Lindsey S., Hertwig R., Gigerenzer G. (2000). Communicating statistical information. Science 290, 2261–2262. 10.1126/science.290.5500.2261 [DOI] [PubMed] [Google Scholar]

[B57] Johnson E. D., Tubau E. (2015). Comprehension and computation in Bayesian problem solving. Front. Psychol. 6:938. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B58] Kaplan R. M., Frosch D. L. (2005). Decision making in medicine and health care. Annu. Rev. Clin. Psychol. 1, 525–556. 10.1146/annurev.clinpsy.1.102803.144118 [DOI] [PubMed] [Google Scholar]

[B59] Kurzenhäuser S., Hoffrage U. (2002). Teaching Bayesian reasoning: an evaluation of a classroom tutorial for medical students. Med. Teach. 24, 516–521. 10.1080/0142159021000012540 [DOI] [PubMed] [Google Scholar]

[B60] Lipkus I. M. (2007). Numeric, verbal, and visual formats of conveying health risks: suggested best practices and future recommendations. Med. Decis. Mak. 27, 696–713. 10.1177/0272989X07307271 [DOI] [PubMed] [Google Scholar]

[B61] Lipkus I. M., Hollands J. G. (1999). The visual communication of risk. J. Natl. Cancer Inst. Monogr. 25, 149–163. 10.1093/oxfordjournals.jncimonographs.a024191 [DOI] [PubMed] [Google Scholar]

[B62] Lipkus I. M., Samsa G., Rimer B. K. (2001). General performance on a numeracy scale among highly educated samples. Med. Decis. Mak. 21, 37–44. 10.1177/0272989X0102100105 [DOI] [PubMed] [Google Scholar]

[B63] Mandel D. R. (2014). Visual representation of rational belief revision: another look at the sleeping beauty problem. Front. Psychol. 5:1232. 10.3389/fpsyg.2014.01232 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B64] Mandel D. R. (2015). Instruction in information structuring improves Bayesian judgment in intelligence analysts. Front. Psychol. 6:387. 10.3389/fpsyg.2015.00387 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B65] Metcalfe J., Finn B. (2008). Evidence that judgments of learning are causally related to study choice. Psychon. Bull. Rev. 15, 174–179. 10.3758/PBR.15.1.174 [DOI] [PubMed] [Google Scholar]

[B66] Nelson T. O. (1990). Metamemory: a theoretical framework and new findings. Psychol. Learn. Motiv. 26, 125–173. 10.1016/S0079-7421(08)60053-5 [DOI] [Google Scholar]

[B67] Okan Y., Garcia-Retamero R., Cokely E. T., Maldonado A. (2012). Individual differences in graph literacy: overcoming denominator neglect in risk comprehension. J. Behav. Decis. Mak. 25, 390–401. 10.1002/bdm.751 [DOI] [Google Scholar]

[B68] Okan Y., Garcia-Retamero R., Cokely E. T., Maldonado A. (2015). Improving risk understanding across ability levels: encouraging active processing with dynamic icon arrays. J. Exp. Psychol. Appl. 21, 178–194. 10.1037/xap0000045 [DOI] [PubMed] [Google Scholar]

[B69] Paling J. (2003). Strategies to help patients understand risks. BMJ 327, 745–748. 10.1136/bmj.327.7417.745 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B70] Peters E. (2012). Beyond comprehension the role of numeracy in judgments and decisions. Curr. Dir. Psychol. Sci. 21, 31–35. 10.1177/0963721411429960 [DOI] [Google Scholar]

[B71] Peters E., Dieckmann N., Dixon A., Hibbard J. H., Mertz C. K. (2007). Less is more in presenting quality information to consumers. Med. Care Res. Rev. 64, 169–190. 10.1177/10775587070640020301 [DOI] [PubMed] [Google Scholar]

[B72] Peters E., Levin I. P. (2008). Dissecting the risky-choice framing effect: numeracy as an individual-difference factor in weighting risky and riskless options. Judgm. Decis. Mak. 3, 435–448. [Google Scholar]

[B73] Peters E., Vastfjall D., Slovic P., Mertz C. K., Mazzocco K., Dickert S. (2006). Numeracy and decision making. Psychol. Sci. 17, 407–413. 10.1111/j.1467-9280.2006.01720.x [DOI] [PubMed] [Google Scholar]

[B74] Petrova D. G., Pligt J., Garcia-Retamero R. (2014). Feeling the numbers: on the interplay between risk, affect, and numeracy. J. Behav. Decis. Mak. 27, 191–199. 10.1002/bdm.1803 [DOI] [Google Scholar]

[B75] Portnoy D. B., Roter D., Erby L. H. (2010). The role of numeracy on client knowledge in BRCA genetic counseling. Patient Educ. Couns. 81, 131–136. 10.1016/j.pec.2009.09.036 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B76] Reyna V. F. (2004). How people make decisions that involve risk: a dual-processes approach. Curr. Dir. Psychol. Sci. 13, 60–66. 10.1111/j.0963-7214.2004.00275.x [DOI] [Google Scholar]

[B77] Reyna V. F., Brainerd C. J. (2008). Numeracy, ratio bias, and denominator neglect in judgments of risk and probability. Learn. Individ. Differ. 18, 89–107. 10.1016/j.lindif.2007.03.011 [DOI] [Google Scholar]

[B78] Reyna V. F., Nelson W. L., Han P. K., Dieckmann N. F. (2009). How numeracy influences risk comprehension and medical decision making. Psychol. Bull. 135, 943–973. 10.1037/a0017327 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B79] Rothman R. L., Housam R., Weiss H., Davis D., Gregory R., Gebretsadik T., et al. (2006). Patient understanding of food labels: the role of literacy and numeracy. Am. J. Prev. Med. 31, 391–398. 10.1016/j.amepre.2006.07.025 [DOI] [PubMed] [Google Scholar]

[B80] Schirillo J. A., Stone E. R. (2005). The greater ability of graphical versus numerical displays to increase risk avoidance involves a common mechanism. Risk Anal. 25, 555–566. 10.1111/j.1539-6924.2005.00624.x [DOI] [PubMed] [Google Scholar]

[B81] Schraw G. (2010). Measuring self-regulation in computer-based learning environments. Educ. Psychol. 45, 258–266. 10.1080/00461520.2010.515936 [DOI] [Google Scholar]

[B82] Schwartz L. M., Woloshin S., Black W. C., Welch H. G. (1997). The role of numeracy in understanding the benefit of screening mammography. Ann. Intern. Med. 127, 966–972. 10.7326/0003-4819-127-11-199712010-00003 [DOI] [PubMed] [Google Scholar]

[B83] Sedlmeier P., Gigerenzer G. (2001). Teaching Bayesian reasoning in less than two hours. J. Exp. Psychol. Gen. 130, 380–400. 10.1037/0096-3445.130.3.380 [DOI] [PubMed] [Google Scholar]

[B84] Spiegelhalter D., Pearson M., Short I. (2011). Visualizing uncertainty about the future. Science 333, 1393–1400. 10.1126/science.1191181 [DOI] [PubMed] [Google Scholar]

[B85] Stankov L. (2000). Structural extensions of a hierarchical view on human cognitive abilities. Learn. Individ. Differ. 12, 35–51. 10.1016/S1041-6080(00)00037-6 [DOI] [Google Scholar]

[B86] Stankov L., Lee J. (2008). Confidence and cognitive test performance. J. Educ. Psychol. 100, 961–976. 10.1037/a0012546 [DOI] [Google Scholar]

[B87] Stone E. R., Sieck W. R., Bull B. E., Yates F. J., Parks S. C., Rush C. J. (2003). Foreground: background salience: explaining the effects of graphical displays on risk avoidance. Organ. Behav. Hum. Decis. Process. 90, 19–36. 10.1016/S0749-5978(03)00003-7 [DOI] [Google Scholar]

[B88] Thompson V. A., Prowse Turner J. A., Pennycook G. (2011). Intuition, reason, and metacognition. Cogn. Psychol. 63, 107–140. 10.1016/j.cogpsych.2011.06.001 [DOI] [PubMed] [Google Scholar]

[B89] Trevena L., Zikmund-Fisher B., Edwards A., Gaissmaier W., Galesic M., Han P., et al. (2012). “Presenting probabilities,” in Update of the International Patient Decision Aids Standards (IPDAS) Collaboration’s Background Document, eds Volk R., Llewellyn-Thomas H. Available at: http://ipdas.ohri.ca/IPDAS-Chapter-C.pdf [Google Scholar]

[B90] Waters E. A., Weinstein N. D., Colditz G. A., Emmons K. M. (2007). Reducing aversion to side effects in preventive medical treatment decisions. J. Exp. Psychol. Appl. 13, 11–21. 10.1037/1076-898X.13.1.11 [DOI] [PubMed] [Google Scholar]

[B91] Weinfurt K. P., Castel L. D., Li Y., Sulmasy D. P., Balshem A. M., Benson A. B., et al. (2003). The correlation between patient characteristics and expectations of benefit from phase I clinical trials. Cancer 98, 166–175. 10.1002/cncr.11483 [DOI] [PubMed] [Google Scholar]

[B92] Zikmund-Fisher B. J. (2015). Stories of MDM: from a conversation to a career of making less data more useful. Med. Decis. Mak. 35, 1–3. 10.1177/0272989X14563576 [DOI] [PubMed] [Google Scholar]

[B93] Zikmund-Fisher B. J., Fagerlin A., Ubel P. A. (2008a). Improving understanding of adjuvant therapy options by using simpler risk graphics. Cancer 113, 3382–3390. 10.1002/cncr.23959 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B94] Zikmund-Fisher B. J., Ubel P. A., Smith D. M., Derry H. A., McClure J. B., Stark A., et al. (2008b). Communicating side effect risks in a tamoxifen prophylaxis decision aid: the debiasing influence of pictographs. Patient Educ. Couns. 73, 209–214. 10.1016/j.pec.2008.05.010 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B95] Zikmund-Fisher B. J., Fagerlin A., Ubel P. A. (2010). A demonstration of “less can be more” in risk graphics. Med. Decis. Mak. 30, 661–671. 10.1177/0272989X10364244 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B96] Zikmund-Fisher B. J., Witteman H. O., Dickson M., Fuhrel-Forbis A., Kahn V. C., Exe N. L., et al. (2014). Blocks, ovals, or people? icon type affects risk perceptions and recall of pictographs. Med. Decis. Mak. 34, 443–453. 10.1177/0272989X13511706 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Visual aids improve diagnostic inferences and metacognitive judgment calibration

Rocio Garcia-Retamero

Edward T Cokely

Ulrich Hoffrage

Abstract

Introduction