Improving Present Practices in the Visual Display of Interactions

Connor J McCabe; Dale S Kim; Kevin M King

doi:10.1177/2515245917746792

. Author manuscript; available in PMC: 2021 Apr 27.

Published in final edited form as: Adv Methods Pract Psychol Sci. 2018 Mar 28;1(2):147–165. doi: 10.1177/2515245917746792

Improving Present Practices in the Visual Display of Interactions

Connor J McCabe ¹, Dale S Kim ^2,³, Kevin M King ¹

PMCID: PMC8078831 NIHMSID: NIHMS1553023 PMID: 33912789

Abstract

Interaction plots are used frequently in psychology research to make inferences about moderation hypotheses. A common method of analyzing and displaying interactions is to create simple-slopes or marginal-effects plots using standard software programs. However, these plots omit features that are essential to both graphic integrity and statistical inference. For example, they often do not display all quantities of interest, omit information about uncertainty, or do not show the observed data underlying an interaction, and failure to include these features undermines the strength of the inferences that may be drawn from such displays. Here, we review the strengths and limitations of present practices in analyzing and visualizing interaction effects in psychology. We provide simulated examples of the conditions under which visual displays may lead to inappropriate inferences and introduce open-source software that provides optimized utilities for analyzing and visualizing interactions.

Keywords: interaction, moderation, simple slopes, regions of significance, data visualization, open data, open materials

Moderation hypotheses are common in psychological research. For instance, researchers often test whether a given effect differs across groups, such as gender or racial groups, or examine how environmental or individual difference factors such as adversity or biological traits modify risk indicators of psychopathology (e.g., Luthar, Cicchetti, & Becker, 2000). In ordinary least squares (OLS) regression, moderation is tested by including a linear interaction term (e.g., XZ) with its constituent first-order terms (X and Z). The magnitude of the interaction coefficient provides the estimated change in the effect of the focal predictor X on the outcome Y for a 1-unit change in the moderator Z (or, equivalently, the estimated change in Z for a 1-unit change in X). The significance value and confidence interval for the interaction coefficient are then typically used to determine the degree of support for the moderation hypothesis.

To date, considerable work has offered guidelines on statistical improvements for testing and interpreting interactions (e.g., Brambor, Clark, & Golder, 2006). For instance, it is now common practice for researchers to mean-center continuous X and Z variables when testing interactions, in order to facilitate the interpretation of intercepts and conditional effects (Dalal & Zickar, 2012). Researchers also regularly conduct simple-slopes analyses to test the conditional effect of X at multiple levels of Z (Aiken & West, 1991), or use the Johnson-Neyman (J-N) technique (Johnson & Neyman, 1936) to assess the conditional effect of X on Y across a range of values of Z. These approaches have been extended to multiple analytic frameworks, such as hierarchical linear modeling and structural equation modeling (Preacher, Curran, & Bauer, 2006). Researchers have also identified a number of factors that substantially reduce power to detect interaction effects (for reviews, see Aguinis, 1995, and Frazier, Tix, & Barron, 2004). These include poor reliability of and range restriction in the predictor variables (Aguinis, 1995), substantial between-groups disparities in sample size and variance when the potential moderation is a categorical variable (Aguinis, 1995; Stone-Romero, Alliger, & Aguinis, 1994), reduced variability in a continuous moderator as a result of artificially dichotomizing the moderator (e.g., by a median split) prior to analysis (which can also increase risk for detecting spurious effects; Bissonnette, Ickes, Bernstein, & Knowles, 1990; MacCallum, Browne, & Sugawara, 1996), and testing the statistical significance of a categorical-variable interaction by analyzing the focal predictor’s effect on the dependent variable separately for each category (i.e., subgroup analysis; Stone-Romero & Anderson, 1994).

However, considerably less attention has been devoted to providing recommendations for the substantive evaluation of interactions, which may have implications for conclusions regarding competing theoretical claims (see, e.g., Berry, Golder, & Milton, 2012; Roisman et al., 2012). For instance, in developmental psychopathology, some researchers have proposed a diathesis-stress model (Monroe & Simons, 1991). This model posits that individuals with a vulnerability (e.g., difficult temperament) fare worse than those without the vulnerability when exposed to environmental stressors (such as poor parenting), but may look no different from their nonvulnerable peers in low-stress conditions. An alternative theory proposes that these same vulnerabilities can also help children thrive in protective environments (Ellis & Boyce, 2008). These theories imply two different forms of an interaction, and determining whether the data support one theory or the other depends on how researchers evaluate the nature of the interaction. Roisman et al. (2012) highlighted analytic and visual approaches that can help determine which hypothesis is better supported by the data, such as visual inspection of interaction plots, regions-of-significance analyses, and tests of nonlinear (e.g., quadratic) effects, among others. In this article, we aim to provide similar recommendations for social science more generally, to improve scientific inference for evaluating moderated effects.

The goal of this article is to demonstrate how theoretical inference in tests of moderation can be improved by improving visual displays. A major aim in the social sciences is to increase transparency of the scientific process to ensure rigor and replicability (Cumming, 2014). Visual displays can substantially aid in attaining this goal. Displays provide efficient and nuanced information about univariate and multivariate relations in data that may not be readily apparent from tables or text descriptions of results. They also help identify misspecified models and influential data points (e.g., outliers) and facilitate how key analytic findings are communicated between researchers and their audiences (Tay, Parrigon, Huang, & LeBreton, 2016). Optimizing the visual display of interactions can thus improve the scientific rigor of moderation tests.

The Utility of Interaction Displays

Why are visual displays important for evaluating moderated effects? First, interpreting interaction coefficients is not necessarily easy or straightforward (see also Dawson, 2014; Preacher et al., 2006). Consider the following multiple regression equation with a single two-way interaction term:¹

y_{i} = b_{0} + b_{1} x_{i} + b_{2} z_{i} + b_{3} x_{i} z_{i} + ε_{i} .

Output from a regression analysis (assuming continuous and standardized predictors) might provide us with the following (Example 1), which shows coefficient estimates in place of the b values:

\hat{y} = 40 + 15 x + 11 z + 10 x z .

For applied researchers, the interpretation of 15 or 11 may not be intuitive, largely because each of these effects depends (or is conditional) on the value of the other interacting variable given the inclusion of the interaction term, XZ. For instance, one can interpret the coefficient b₁ (in this case, 15) as the effect of a 1-unit change in X on the value of Y when all other predictors are equal to zero. In other words, because an interaction term is specified in this model, b₁ is also the effect of X only when Z = 0. Rearranging the formula can show this more explicitly:

\hat{y} = 40 + 11 z + (15 + 10 z) x .

When Z is 0, 10Z is also 0, and the slope of Y on X reduces to 15. Because 11Z in the remaining term also reduces to 0 in this case, we are left with the regression equation $\hat{y} = 40 + 15 x$ , which gives the intercept and simple slope of X for Y when Z is zero. Because Z is standardized (i.e., the mean of Z is 0), the slope for X is synonymous with the slope of Y on X at the mean of Z. Similarly, b₃ (in this case, 10) might be interpreted as the effect of a 1-unit change in Z on the coefficient of X (or, conversely, the effect of a 1-unit change in X on the coefficient of Z). With a bit of algebra, one can use the estimates of the conditional effects, b₂ and b₁, and the interaction term, b₃, to extract the meaning of the interaction in relatively plain language that can help us to confirm or disconfirm the hypothesized effect. Following Aiken and West (1991), readers could use the information in a regression table to construct simple-slopes equations for a range of values of Z. For instance, when Z = −1 in the present example, the equation becomes $\hat{y} = 29 + 5 x$ ; when Z = 1, the equation becomes $\hat{y} = 51 + 25 x$ ; and so on. However, we argue that relying on algebra to understand a substantive finding unduly burdens readers.

Instead, we suggest that researchers use visual displays to aid themselves and readers in making inferences about moderation hypotheses. Figure 1 illustrates three different forms of interaction, each of which may support different theories about how the interplay between two factors influences an outcome (see also Luthar et al., 2000; Roisman et al., 2012). Figure 1a illustrates a synergistic interaction, in which higher levels of Z enhance the effect of X on Y (as in Example 1). In contrast, Figure 1b illustrates a buffering effect, in which higher levels of Z instead reduce the effect of X on Y. Figure 1c illustrates an antagonistic interaction: The two predictors affect Y in the same direction, though X is associated with Y only at lower levels of Z or in the absence of Z. Such displays are common in psychological research and are typically the basis for making inferences about whether the data support a specific moderation hypothesis.

Fig. 1. — Standard visual displays illustrating (a) synergistic, (b) buffering, and (c) antagonistic interactions.

However, these displays lack a number of core features that are key to both effective communication and statistical integrity (G. King, Tomz, & Wittenberg, 2000; Tufte, 2001). First, these plots limit the number of displayed quantities of substantive interest, in that they communicate only the values of $\hat{Y}$ at selected values of X and Z, and any effects outside this range remain undetected (Preacher et al., 2006; Roisman et al., 2012). Second, they do not communicate a reasonable measure of uncertainty in the displayed estimates. Including a measure of uncertainty is either recommended or required by journal editors for text descriptions of results (Cumming, 2014), but uncertainty is seldom addressed in the analysis of interactions (Brambor et al., 2006). For example, although the American Psychological Association publication manual (American Psychological Association, 2010) recommends reporting 95% confidence intervals (CIs) for regression coefficients whenever possible, most current interaction displays do not include confidence regions for simple slopes. Third, these graphics do not show the observed data on which the model estimates are based. Including these data is essential for diagnosing model misspecification—that is, determining whether a model appropriately captures patterns that are suggested by the observed data (Cohen, Cohen, West, & Aiken, 2003). A common means of including the data in the bivariate case is the scatterplot, which helps one determine visually how well a regression line quantifies a theoretically linear bivariate relation (e.g., Tufte, 2001), yet observed data are rarely included in displays of interaction effects.

Here, we first review present practices for the visual display of interactions in psychology and describe the strengths and limitations of these approaches. We then describe an open-source software utility that allows users to apply several graphic solutions that address these limitations.

Disclosures

All simulation code, simulated data, and code for creating the key figures in this article, as well as source code and instructions for using the Web utility described, are available online in a GitHub repository (https://github.com/connorjmccabe/InterActive). Code for generating the simulated examples used in this article and the simulated data can be found at https://github.com/connorjmccabe/InterActive/tree/master/Simulated%20Data. Code for reproducing the key figures can be found at https://github.com/connorjmccabe/InterActive/tree/master/Manuscript%20Figures%20Code. Source code for the Web utility, called interActive, introduced in this manuscript can be found at https://github.com/connorjmccabe/InterActive/blob/master/interActive_OLS.R.

Present Practices

The simple-slopes approach

The most common method for probing interaction effects is simple-slopes analysis, typically conducted using the pick-a-point approach (Aiken & West, 1991). In the pick-a-point approach, researchers select values of interest of the moderator variable (Z) and examine the effect of the focal predictor (X) on the outcome (Y) at each of these values (Cohen et al., 2003). When Z is continuous, the interaction is probed by recentering Z around values of interest. That is, Z is shifted by subtracting a value—typically +1 SD or −1 SD—from each observation, such that the distribution of Z is centered about this value. The model is reestimated with these recentered variables to obtain coefficient estimates, CIs, and p values for the effect of X on Y at these selected values of Z. Table 1 illustrates this process in a simulated example of a synergistic interaction involving continuous variables (the parameterization is based on the model in Example 1). The table suggests that when Z is low (i.e., centered at 1 SD below its mean), a 1-unit increase in X is not significantly associated with change in Y (b = 4.01, 95% CI = [−7.60, 15.61]), β = 0.06). However, when Z is at the mean, a 1-unit increase in X is associated with a 22.17-unit (95% CI = [12.79, 31.54]) increase in Y (β = 0.34), and when Z is high (i.e., centered at 1 SD above its mean), a 1-unit increase in X is associated with a 40.32-unit (95% CI = [27.70, 52.95]) increase in Y (β = 0.62). In the case of binary categorical moderators, analyses can be carried out using identical steps, but instead the moderator is recentered around each dummy-coded category. Testing interactions is more analytically cumbersome with multiple categories because it involves testing a set of interaction terms using multiple dummy-coded variables (e.g., Dawson, 2014). However, a simple-slopes estimate for each dummy-coded category can be derived by reestimating the effect of X on Y separately for each category (Cohen et al., 2003).

Table 1.

Simulated Results of a Synergistic Interaction Probed Using the Pick-a-Point Approach (N = 150)

Parameter	Estimate	SE	95% CI	β	t	P
	Z centered at 1 SD below its mean
Intercept	19.80	5.99	[7.96, 31.65]		3.30	.001
X coefficient	4.01	5.87	[−7.60, 15.61]	0.06	0.68	.496
Z coefficient	17.29	4.00	[9.38, 25.20]	0.32	4.32	< .001
XZ coefficient	17.11	3.67	[9.86, 24.36]	0.26	4.67	< .001
	Z centered at its mean
Intercept	38.15	4.11	[30.03, 46.27]		9.28	< .001
X coefficient	22.17	4.74	[12.79, 31.54]	0.34	4.67	< .001
Z coefficient	17.29	4.00	[9.38, 25.20]	0.32	4.32	< .001
XZ coefficient	17.11	3.67	[9.86, 24.36]	0.26	4.67	< .001
	Z centered at 1 SD above its mean
Intercept	56.50	5.82	[44.99, 68.00]		9.70	< .001
X coefficient	40.32	6.39	[27.70, 52.95]	0.62	6.31	< .001
Z coefficient	17.29	4.00	[9.38, 25.20]	0.32	4.32	< .001
XZ coefficient	17.11	3.67	[9.86, 24.36]	0.26	4.67	< .001

Open in a new tab

Note: X is mean-centered in all the models. Note that the intercepts and estimates of the coefficient of X differ across transformations of Z. CI = confidence interval.

Figure 2 provides a simple-slopes plot for Example 1, based on the simulated results in Table 1. Plots like this one can be created with software programs such as Excel (e.g., Dawson, 2014), either by using coefficients derived using the pick-a-point approach or by computing simple slopes based on coefficient estimates from the original model. By default, plots are typically constructed displaying the effect at 1 SD above and below the mean of both X and Z.

Fig. 2. — A simple-slopes plot of the simulated XZ interaction corresponding with the results provided in Table 1. Note that *low* refers to 1 SD below the mean and *high* refers to 1 SD above the mean for both X and Z.

Simple-slopes plots are limited in several ways. First, they often do not represent the full nature of an interaction effect because they typically display the data only at 1 SD above and below the means of the predictors. For instance, each of the interactions in Figure 1 may also take the form of a disordinal (or crossover) interaction if the depicted lines are within an appropriate range of X. Disordinal interactions suggest that the rank order of Y given Z changes with X (or the rank order of Y given X changes with Z). The value of X at which this change happens (i.e., the crossover point) can be computed as follows (Cohen et al., 2003):

x = - \frac{b_{2}}{b_{3}} .

(1)

A crossover point may provide evidence that an interaction is disordinal. However, all interactions are disordinal interactions across an infinite range of X, and interactions should not be interpreted as disordinal if the crossover point is outside the observed range of X (Cohen et al., 2003). Thus, on one hand, extending the horizontal axis beyond the observed range of the focal variable may suggest a disordinal interaction that is unsupported by the data (a Type I error), but on the other hand, restricting the range of X (to within 1 SD above and below its mean, e.g.) may obscure evidence of this effect if it truly exists (a Type II error; e.g., Roisman et al., 2012). Providing a depiction of an interaction within a meaningful range of X can substantially aid in evaluating this effect.

Range restriction in moderator variables is also a concern. Displaying an interaction at several levels of a continuous moderator does not necessarily describe the full nature of an interaction across all relevant levels of that moderator. This limitation may be especially important when the significance or direction of the simple-slopes effect changes at more extreme levels of the moderator. For example, if a moderator is skewed or if a sample is particularly large, there may be a substantial number of participants represented at values higher than 1 SD above the mean or lower than 1 SD below the mean, and simple-slopes plots may not accurately represent the nature of the predictors’ effects at those values. Using the J-N technique addresses this concern analytically (Johnson & Fay, 1950; Potthoff, 1964). This approach provides an estimate of the range of the moderator variable at which the focal predictor is significantly associated with the outcome. In effect, this technique is an extension of the pick-a-point approach: Interactions are probed not only at a limited number of levels of a moderator, but rather across the full range of values of the moderator observed in the data. The statistical significance and direction of these simple slopes are then used to better characterize the interaction. Tools for conducting and plotting the results of J-N analyses have been developed for standard statistical software programs (Hayes & Matthes, 2009; Preacher et al., 2006). Simple-slopes plots do not accommodate the analytic strengths of the J-N technique.

Plots of simple slopes typically show only the simple-slopes estimates, with little direct indication of the uncertainty in the estimates or whether the slopes differed significantly from zero. Showing the uncertainty in simple slopes would provide a depiction of how precisely each effect was estimated, which is influenced by factors such as sample size. For instance, Figure 3a displays simple slopes and confidence regions associated with the predicted values of Y for the simulated example in Figure 2 and Table 1 (N = 150), whereas Figure 3b displays simple slopes and confidence regions from an identical parameterization in a larger sample size (N = 500). Although the estimates for the two sample sizes are nearly identical, the confidence regions are notably wider in the smaller sample, which suggests less certainty in the predicted outcome values compared with the larger sample. Moreover, whereas the simple slope at low levels of the moderator (1 SD below the mean) was not significant in the smaller sample (b = 4.01, 95% CI = [−7.60, 15.61], β = 0.06), this effect was statistically different from zero in the larger sample (b = 6.64, 95% CI = [0.64, 12.64], β = 0.12). This is because, holding all else constant, the standard error of the simple slope will be smaller at larger sample sizes, which results in more precise estimation of the regression line and increased power to detect the slope effect.

Fig. 3. — Illustration of the effect of sample size on uncertainty in simple-slopes estimates: simple slopes with 95% confidence regions for (a) the simulated example in Table 1 (N = 150) and (b) the same parameterization with a larger sample size (N = 500). For *Z, low* refers to 1 SD below the mean, and *high* refers to 1 SD above the mean.

Finally, most simple-slopes plots do not display the observed data. This omission prevents the use of these plots to diagnose whether the interaction effect is appropriately specified (e.g., Tay et al., 2016) or whether the simple slopes selected represent actual data. For instance, Figure 4 displays three scenarios for Example 1 in which the observed data for the predictor variables were simulated from different population distributions (N = 500). Although the simple slopes are nearly identical in the three scenarios, the observed data provide markedly different information as to how well the estimated simple slopes fit the data. For example, in Figure 4a, both X and Z are multivariate normally distributed predictors, and the simple-slopes estimates capture the data fairly well and provide evidence for a disordinal interaction. However, in Figure 4b, X and Z were simulated from exponential distributions (skew of X = 2.61, skew of Z = 2.35). The positive skew in both variables means that almost no data existed at 1 or 2 SD below the mean of X, and the simple-slopes effect when Z is 1 SD below the mean in fact represents no individuals in the observed data. Finally, Figure 4c shows how simple slopes can suggest an interaction effect that does not exist in the data. The data for this plot were instead generated from a model in which X had a quadratic effect but the interaction coefficient was zero, which is suggested by the parabolic pattern shown in this display. Because X and Z were strongly correlated in this example (r = .52), XZ and X² were confounded, and a significant interaction was erroneously detected (Cohen et al., 2003; MacCallum & Mar, 1995). It has been strongly recommended that researchers carefully consider their data and plot interactions at meaningful levels of the moderator to ensure that the simple-slopes effects they report appropriately reflect real data (Aiken & West, 1991; Dawson, 2014); plotting observed data may help researchers meet this recommendation.

Fig. 4. — Illustration of the impact of nonnormality on the interpretation of simple slopes. The three scatterplots show regression lines and observed data for the XZ interaction in Example 1, simulated from different predictor distributions. Although the simple slopes are nearly identical in the three graphs, the interaction patterns evident in the observed data differ according to whether the predictors (a) are normally distributed, (b) have skewed distributions, or (c) are strongly correlated (i.e., the interaction is confounded with a quadratic effect of the focal predictor). For *Z, low* refers to 1 SD below the mean, and *high* refers to 1 SD above the mean.

The marginal-effects approach

Marginal-effects (or regions-of-significance) plots (e.g., Berry et al., 2012; Preacher et al., 2006) are commonly used in combination with the J-N analytic approach to interactions. These plots depict the simple-slope coefficient of the focal variable and its 95% confidence region against values of the moderator. They indicate the significance, uncertainty, magnitude, and direction of the simple slope across a full hypothetical range of the moderator variable, often a range from 3 SD below to 3 SD above the mean (e.g., Fig. 2 in Preacher et al., 2006). Whereas simple-slopes plots display conditional effects at only select levels, marginal-effects plots ensure that an interaction is fully explored by showing how the simple-slope coefficient of a focal predictor changes across the entire range of the moderator. For instance, Figure 5 is a marginal-effects plot of the interaction in Example 1. When the moderator is 2.30 SD below the mean or lower, or 0.60 SD below the mean or higher, the 95% CI does not contain 0. The plot thus indicates that X is negatively associated with Y when Z is very low (at least 2.30 SD below the mean) and positively associated with Y when Z is −0.60 SD below the mean or greater. Note that, by comparison, Figure 2 fails to provide evidence of the reverse effect in the same data when Z is “low” (1 SD below the mean) because the simple negative slope is significant only when Z is 2.30 SD below the mean or lower.

Fig. 5. — A marginal-effects (or regions-of-significance) plot of Example 1. The plot shows the marginal effect of X on Y (i.e., the simple slope of X) across a range of the moderator variable Z; the shaded area indicates the 95% confidence region for the marginal effect. The marginal rug of Z on the horizontal axis indicates the frequency of observed values of Z across its displayed range.

Although marginal-effects plots aid in detecting and communicating interaction effects that may otherwise be missed, they fail to indicate whether or how much data are represented within the graphed regions and therefore may suggest effects that are not (or are very minimally) supported by the data. For instance, in the data from Example 1, very few data points are observed when Z is lower than 2.30 SD below the mean (see Fig. 5). Only two observations (1.3% of the sample) fall within the region of significance where the slope is negative, and there is no indication of the extent to which these observations are consistent with the estimated model. Indeed, in an often-cited article describing the use of the J-N technique in psychology, the authors provided a figure in which one of the tails of the displayed moderator Z had few to no observations, but this was noted only in the text (Preacher et al., 2006). Restricting the range of the horizontal axis to the observed range of Z and including a marginal rug of the observed data, as in Figure 5, helps address this concern (e.g., Berry et al., 2012). The inclusion of the observed data might lead us to infer that they do not support the conclusion that X has a significant effect on Y when Z is 2.30 SD below the mean or lower.

Moreover, despite their utility, marginal-effects plots have been provided less often than simple-slopes plots in publications reporting tests of moderation. For instance, we randomly sampled 50 of the 253 articles that were published in 2016 and cited the article in which Preacher et al. (2006) described the use of the J-N technique and marginal-effects plots. In brief, of these 50 articles, 38 (76%) provided a simple-slopes plot at two or more levels of the moderator to supplement their analyses. Only 10 (20%) made any mention of conducting a regions-of-significance analysis, and only 4 (8%) provided a marginal-effects plot depicting regions-of-significance results. In other words, among a sample of 50 publications citing a seminal manuscript describing the use of marginal-effects plots, only 8% actually used that kind of display in their article, and 76% used a less descriptive display. These results illustrate a substantial gap between the development of an advanced approach to analysis and its implementation.

We suspect that the unintuitive nature of marginal-effects plots has limited their widespread adoption. Marginal-effects plots are ultimately used to understand the range of a moderator for which a focal predictor is statistically significantly associated with an outcome (as well as the degree of uncertainty in the association); thus, this information is somewhat redundant with information provided by a text description of results obtained using the J-N technique. Moreover, readers are mostly interested in using displays to infer the predicted value of the dependent variable at meaningful values of X and Z. Common visuals for presenting effects (e.g., bar graphs, histograms, bivariate scatter-plots, and simple-slope plots) meet this need by assigning the values of greatest interest (i.e., values of the dependent variable) to the vertical axis and the primary predictor variable to the horizontal axis. However, marginal-effects plots allocate a relatively unintuitive value (the simple slope) to the vertical axis and the predictor of lesser importance (the moderator) to the horizontal axis, and omit information about the model-predicted value of the dependent variable entirely. We encourage researchers to use marginal-effects plots to support regions-of-significance analyses and, when doing so, to include overlaid data on these plots. However, given the limitations of such plots, providing additional displays may further aid in communicating an interaction effect.

Summary of present practices

Visual displays of interactions can substantially improve inferences, communication, and transparency, and also can help in diagnosing problems in data analysis. Although simple-slopes and marginal-effects plots strengthen the interpretation of moderation analyses in some ways, they have several limitations. In the next section, we describe an approach to create displays of interactions that utilize the strengths of present practices and address the concerns we have highlighted in our critiques.

Improving the Visual Display of Interactions: interActive

We created an open-source analysis and data-visualization application that builds on simple-slopes and marginal-effects plots to display all quantities of interest, uncertainty in the displayed estimates, and the data underlying an interaction (https://connorjmccabe.shinyapps.io/interactive/). We created this application, called interActive, using the freely available statistical program R (R Development Core Team, 2016) in the Shiny Web application framework (Chang, Cheng, Allaire, Xie, & McPherson, 2017). The graphics were created using the ggplot2 graphics package (Wickham, 2009). The interActive application provides data-upload functionality and allows users to specify and analyze OLS regression models with two-way interaction effects. The present functionality allows for either continuous linear or quadratic focal predictors and either continuous or binary categorical moderator predictors. The application accommodates the specification of covariates (i.e., control variables) and was designed to be usable by researchers at all levels of quantitative expertise. It can be used to conduct regions-of-significance analyses for interactions of continuous variables and creates marginal-effects plots of the results, with marginal rugs indicating observed data (e.g., Fig. 5).

The interActive application is based on the concept of small multiples (Tufte, 2001). An individual plot is created for each of several simple slopes (e.g., Fig. 6). This facilitates the display of a broad range of simple-slope effects, observed data, and measures of uncertainty. Because the design of all plots is identical except for the level of the moderator, a viewer’s attention is directed toward the change in pattern across multiples, which enables the viewer to understand the nature of the interaction depicted. Using small multiples allows indicators of observed data and measurement uncertainty to be included in each plot. Additionally, users can specify the level of the moderator for each multiple. This provides users with flexibility in deciding the number of levels of the moderator and the specific values of the moderator at which they will probe the interaction, so as to best characterize the observed data.

Fig. 6. — Illustration of small multiples created by interActive for Example 1 using multivariate normal predictors. Simple slopes are provided for levels of the moderator 2 SD and 1 SD below the mean, at the mean, and 1 SD and 2 SD above the mean. Each graphic shows the computed 95% confidence region (shaded area), the observed data (gray circles), the maximum and minimum values of the outcome (dashed horizontal lines), and the crossover point (diamond). The x-axes represent the full range of the focal predictor. CI = confidence interval; PTCL = percentile.

The functionality of interActive is leveraged to display the observed data that are most representative of each simple slope. For each small multiple representing a given moderator value, the displayed data points reflect the bivariate relation between the focal predictor and the dependent variable. This relation is shown within a range of the moderator that begins at half the distance from the value of the moderator at the next lower multiple and ends at half the distance from the value of the moderator at the next higher multiple. For instance, in a series of multiples depicting simple slopes at −1, −0.5, 0, +1, and +2 standardized units from the mean of the moderator, the −1-SD multiple would depict only observations with values of the moderator at −0.75 SD or lower; the −0.5-SD multiple would correspond with observations between −0.25 and −0.75 SD; and so forth. Limiting each multiple to a particular data space makes it possible to evaluate each simple slope on whether it represents real data.

We implemented additional design choices to allow for more nuanced evaluation of the depicted effect. For instance, interActive specifies the limits of the x-axis of each multiple as the minimum and maximum values of the focal variable, in order to display each simple slope across the full range of the focal predictor. Also, interActive computes the crossover point (using Equation 1) and displays it on each small multiple so that viewers can use it in conjunction with the observed data to determine the degree of evidence for a crossover effect. In addition, each multiple displays the coefficient and 95% CI of the simple slope to provide readers with a text description of the displayed slope. Each multiple also displays the percentile corresponding with the specified level of the moderator to aid readers in evaluating whether the depicted simple-slope effect is sensible given the data. Finally, we added horizontal dashed lines at the minimum and maximum observed values of the outcome variable to aid extrapolation of each prediction line.

For each simple slope, interActive computes the 95% CI for the predicted value of Y conditional on each observed value of X. If covariates are included, they are held constant at their respective means when these estimates are derived. These CI estimates are used to plot a confidence region, the area in which the true regression line is expected to fall 95% of the time. The CI for ${\hat{y}}_{i}$ conditional on the value of the predictor variables is given by the following equations (Cohen et al., 2003):

{CI}_{{\hat{y}}_{i}} = {\hat{y}}_{i} \pm t (\frac{1 - α}{2}, n - k - 1) S E_{{\hat{y}}_{i}}

(2)

And

S E_{{\hat{y}}_{i}} = \hat{σ} \sqrt{x_{i}^{T} {(X^{T} X)}^{- 1} x_{i}},

(3)

where $\frac{1 - α}{2}$ is the two-tailed probability for a given α, n−k− 2 is the degrees of freedom, X is an n × k matrix of the predictor variables, x_i is a p × 1 column vector of individual observations, $\hat{σ}$ is the residual standard error, and T is defined as the transpose operator. For each simple slope, interActive computes 95% CIs for ${\hat{y}}_{i}$ for hypothetical values of the focal predictor across the observed range. These values are then used as the bounds of the shaded polygon that depicts the confidence region of the regression line for that simple slope.

Appendix A provides an example of the computation of the confidence interval for a predicted value in a bivariate case. Equivalently, and perhaps more intuitively for many researchers, this interval can also be understood as the 95% CI of the intercept in a regression model, and one could center the focal predictor around different values to derive points that follow the edges of the confidence region. Table 2 and Figure 7 illustrate this point in a bivariate case using Example 1. Note that in the standard regression output provided in Table 2, the intercept value when X is centered at zero (i.e., the value of ${\hat{y}}_{i}$ ) is 44.46, and the standard error is 4.32. Using Equation 2, given 148 (n − k − 1) degrees of freedom and a corresponding t value of 1.98, we can compute that the 95% CI for $\hat{y}$ when X is zero ranges from 35.91 to 52.98. We can compute the CI in the case when X is centered at any other value; for instance, when X is centered at 1 (see Table 2), the intercept $({\hat{y}}_{i})$ is 71.95, the standard error of this value is 6.49, and the corresponding CI ranges from 59.13 to 84.77. Note that the lower and upper limits of $\hat{y}$ when X is centered at 0 and 1 correspond with the points denoted on the confidence region in Figure 7; if this process were repeated across the full displayed range of X, these values would circumscribe the 95% confidence region.

Table 2.

Simulated Results of a Bivariate Regression With the Predictor Centered at Different Values (N = 150)

Parameter	Estimate	SE	95% CI	β	t	P
		X centered at 0
Intercept	44.46	4.32	[35.91, 52.98]		10.29	< .001
X coefficient	27.51	4.84	[17.94, 37.07]	0.423	5.682	< .001
		X centered at 1
Intercept	71.95	6.49	[59.13, 84.77]		11.09	< .001
X coefficient	27.51	4.84	[17.94, 37.07]	0.423	5.682	< .001

Open in a new tab

Note: The intercept values differ across transformations of X. The confidence intervals (CIs) in boldface correspond with the points indicated on the confidence region in Figure 7.

Fig. 7. — Plot of the bivariate relation between X and Y in Example 1 and the confidence region of the linear estimate. The small squares indicate the lower and upper limits of the 95% confidence interval of $\hat{y}$ when X is (centered at) 0 and (centered at) 1 (see Table 2).

The interActive plot in Figure 6 was simulated from Example 1 using multivariate normal predictors. Note that this display provides much of the same information found in Table 1 (e.g., estimates of slopes, intercepts, and confidence) while also showing more thoroughly how well the model represents the data. Figure 8a displays corresponding estimates simulated from exponentially distributed predictors. Note that the plot elements added by interActive make it readily apparent that the data are not well represented by these graphs; they suggest that the researcher should consider values of the moderator that are more representative of the data (e.g., Fig. 8b). Appendix B provides an example of how interActive can enhance understanding of an interaction effect observed in real data.

Fig. 8. — Small multiples created by interActive for Example 1 in a simulation with exponentially distributed predictors. In (a), simple slopes are provided for levels of the moderator 2 SD and 1 SD below the mean, at the mean, and 1 SD and 2 SD above the mean. The observed data are better characterized in (b), which provides simple slopes for a more restricted range of values of the moderator. Each graphic shows the computed 95% confidence region (shaded area), the observed data (gray circles), the maximum and minimum values of the outcome (dashed horizontal lines), and the crossover point (diamond). The x-axes represent the full range of the focal predictor. CI = confidence interval; PTCL = percentile.

Discussion

We have reviewed current practices for graphically displaying interaction effects and provided tools and guidelines for improving displays to affirm and communicate statistical results of moderation analyses. We have included simulated examples showing the conditions under which improper visual displays can affect inference and have shown how the interActive application can be used to address these concerns. To facilitate understanding of the full nature of interaction effects, we recommend the J-N technique and marginal-effects plots as standard analytic strategies for probing interactions of continuous variables across the full range of the moderator (Preacher et al., 2006; Roisman et al., 2012). We also encourage researchers to continue providing displays of simple-slopes effects to communicate the substantive nature of interactions and to construct these displays bearing in mind the principles of graphic integrity we have described here. These practices will support the validity of inferences made while also communicating them with appropriate precision and clarity. When advances in both statistical and graphic approaches are employed, researchers and readers alike can evaluate the nature of an interaction effect with greater understanding and confidence.

We consider the practices we have described to be a first step toward improving visual displays of interactions. For instance, we aim to extend these practices to displaying interactions in nonlinear models given that interpreting simple effects is even less straightforward in nonlinear than in linear cases (Ai & Norton, 2003; Karaca-Mandic, Norton, & Dowd, 2012). Similarly, these principles should also be extended to interactions in structural equation and multilevel modeling (Preacher et al., 2006). We hope that educators, editors, and researchers will use the interActive application and the principles we have detailed to improve understanding and methodological rigor in moderation analyses. We urge researchers to consider data visualization as a crucial (rather than auxiliary) step in the scientific process.

Funding

This research was partially supported by a grant from the National Institute on Drug Abuse (DA040376) to C. J. McCabe. The content of this article is solely the responsibility of the authors and does not necessarily represent the official views of the funding agency.

Appendix A: Computing the Confidence Interval of a Predicted Value of a Dependent Variable

In this appendix, we illustrate how to compute an estimate and 95% confidence region for ${\hat{y}}_{i}$ We use a randomly sampled subset (n = 5) from the data generated in Example 1 to illustrate computations in matrix-algebra form.

Assume the following data:

X	Y
−1.29	48.80
0.64	4.78
1.20	81.38
1.37	112.00
−0.70	67.79

Open in a new tab

We are regressing Y on X, as follows:

y_{i} = {\hat{b}}_{0} + {\hat{b}}_{1} x_{i} + e_{i} .

In matrix form, this can be equivalently understood as

\hat{y} = X \hat{b} + e,

where b is a 2 × 1 vector of unstandardized coefficients and e is an n × 1 vector of residuals. Estimating this model in ordinary least squares regression yields the estimates shown in Table A1, which can be obtained directly from standard statistical software.

Suppose we arrange our predictor variable into a 5 × 2 matrix X, which includes a column of 1s so that the intercept is included in the linear combination:

X = [\begin{matrix} 1 & - 1.53 \\ 1 & 0.40 \\ 1 & 0.95 \\ 1 & 1.13 \\ 1 & 0.95 \end{matrix}]

We can then create a row of hypothetical values of our predictor to obtain the value and 95% confidence interval of ${\hat{y}}_{i}$ Assuming we wish to use the hypothetical value of X = 0.5, we represent this as a 2 × 1 column vector:

x_{i} = [\begin{array}{r} 1 \\ 0.5 \end{array}],

where the first row of this vector is a placeholder value of b₀ and the second row represents the hypothetical value of the predictor X. Note that we can multiply $x_{i}^{T}$ by a 2 × 1 vector of our model’s coefficients to obtain ${\hat{y}}_{i}$ given X = 0.5:

{\hat{y}}_{i} = [\begin{matrix} 1 & 0.5 \end{matrix}] [\begin{matrix} 60.03 \\ 11.99 \end{matrix}] = 66.03.

Table A1.

Linear-Model Results for the Effect of X on Y (n = 5)

Parameter	Estimate	SE	t	p
Intercept (b₀)	60.03	19.75	3.04	.06
X coefficient (b₁)	11.99	18.20	0.66	.56

Open in a new tab

Note: The residual standard error is 43.03, and R² is .126.

The formula for the standard error of this estimate (Equation 3) is

S E_{{\hat{y}}_{i}} = \hat{σ} \sqrt{x_{i}^{T} {(X^{T} X)}^{- 1} x_{i}} .

Obtaining the inverse of the inner product (X^TX)⁻¹, rounded to two decimal places, yields

{(X^{T} X)}^{- 1} = {([\begin{array}{r} 1 & 1 & 1 & 1 & 1 \\ - 1.53 & 0.40 & 0.95 & 1.13 & 0.95 \end{array}] [\begin{array}{l} 1 & - 1.53 \\ 1 & 0.40 \\ 1 & 0.95 \\ 1 & 1.13 \\ 1 & 0.95 \end{array}])}^{- 1} = [\begin{array}{r} 0.20 & 0 \\ 0 & 0.18 \end{array}] .

Once this matrix is computed, we can apply this quantity to our formula, in concert with the values of $\hat{σ}$ and x_i obtained earlier:

S E_{{\hat{y}}_{i}} = 43.03 \sqrt{[\begin{array}{r} 1 \\ 0.5 \end{array}] [\begin{array}{r} 0.20 & 0 \\ 0 & 0.18 \end{array}] [\begin{array}{r} 1 & 0.5 \end{array}]} = 19.80.

Given ${\hat{y}}_{i} = 66.03$ and $t (\frac{1 - α}{2}, n - k - 1) = t (0.025, 3) = 3.18$ , we can now use Equation 2 to compute a 95% confidence region for this value:

{\hat{y}}_{i} \pm t (\frac{1 - α}{2}, n - k - 1) S E_{{\hat{y}}_{i}} = 66.03 \pm 3.18 (19.80) = [3.02, 129.03] .

These results suggest that when X = 0.5, we are 95% confident that the predicted value of Y falls between 3.02 and 129.03. Conducting these computations across a range of x_i quantities would create points that circumscribe the confidence region of a plotted linear estimate.

The R code for recreating these computations and generating a plot depicting these values is available at https://github.com/connorjmccabe/InterActive/blob/master/Appendix%20A%20code/AppendixA_code.R.

Appendix B: Example of Using interActive With Real Data

Here we illustrate how using interActive to depict a previously reported interaction effect can enhance interpretation of the data. The example is drawn from a study of 491 young adults who were undergraduate students in the Pacific Northwest region of the United States. The study examined whether the effect of sensation seeking on the frequency of alcohol-related problems differed across levels of alcohol use (see K. M. King, Karyadi, Luk, & Patock-Peckham, 2011, for more details). Note that the original authors used semicontinuous regression given that the outcome variable was zero inflated and overdispersed, and violated ordinary least squares (OLS) assumptions. Therefore, in this example, we use parametric bootstrapping to simulate a new conditionally normal alcohol-problems variable that was based on an OLS model from the original data. Moreover, we do not include the covariates included in the original report because these variables were unavailable in the current data set. Nonetheless, the estimate of the interaction effect we obtained (b = 0.082, 95% confidence interval, CI = [0.025, 0.140]) was nearly identical to that of the original report (b = 0.071, 95% CI = [0.016, 0.126]).

Figure B1a is an adaptation of the simple-slopes display from the original article. In this graphic, the slope for the effect of sensation seeking on alcohol-related problems was plotted at low (1 SD below the mean), mean, and high (1 SD above the mean) levels of alcohol use. The reader can infer several effects. First, the fact that the graphed lines are higher on the vertical axis as the level of alcohol use increases suggests a strong main effect of alcohol use on problems. Second, this plot shows that when alcohol use was low, a 1-unit increase in sensation seeking corresponded with about a 1-unit decrease in the level of alcohol-related problems, suggesting that sensation seeking may be protective against alcohol-related problems when alcohol use is low. At mean levels of alcohol use, this effect diminished. Because this simple slope is approximately flat, the graphic suggests that sensation seeking was unrelated to alcohol-related problems at average levels of alcohol use. In contrast, at high levels of drinking, sensation seeking appeared to increase the risk of alcohol-related problems (specifically, a 1-unit increase in sensation seeking at high levels of use corresponded with about a 3-unit increase in problems). Third, this graphic can be used to examine estimates of alcohol-related problems conditional on specific hypothetical values of sensation seeking and alcohol use. For instance, the model predicts that an individual reporting an average level of both drinking and sensation seeking would experience about 18 alcohol problems per year. An individual who has a high level of alcohol use and is 1 SD above the mean in sensation seeking is predicted to experience about 39 alcohol problems per year, and so on.

Fig. B1. — The relation between sensation seeking and the number of alcohol-related problems experienced per year across multiple levels of alcohol use. The graph in (a) shows the simple slopes for the relation between the standardized level of sensation seeking (on the x-axis) and the number of alcohol-related problems (on the y-axis) at low (1 SD below the mean), mean, and high (1 SD above the mean) levels of alcohol use (adapted with permission from K. M. King, Karyadi, Luk, & Patock-Peckham, 2011). The graph in (b) provides a marginal-effects display for the same interaction effect. The x-axis indicates the standardized level of the moderator, and the vertical dashed lines indicate the levels of the moderator at which the focal variable becomes significantly associated with the dependent variable. The 95% confidence region is indicated by the shaded area. A marginal rug showing the frequency of different levels of alcohol use is included.

Using this graphic alone, one might infer that the effect of sensation seeking on alcohol-related problems reverses depending on how much one drinks. That is, sensation seeking appears to be protective when use is low and a risk factor when use is high, and it may or may not be associated with alcohol-related problems when use is at average levels. This effect is ostensibly supported by a regions-of-significance analysis: The slope of the effect of sensation seeking on alcohol-related problems is significant and negative when alcohol use is approximately 2.15 SD below its mean and is significant and positive when use is 0.35 SD higher than its mean (Fig. B1b).

Although the effects presented using interActive’s small multiples (see Fig. B2) are similar to those depicted in Figure B1a, more nuanced information about these effects is available from the small multiples because they include confidence regions and more simple slopes and also display the observed data. For instance, as does Figure B1a, Figure B2 indicates that greater alcohol use is associated with more negative alcohol consequences and that the effect of sensation seeking on alcohol-related problems becomes stronger at higher levels of use. Also, as in the case of Figure B1a, the simple slopes provided can be used to determine the predicted level of alcohol-related problems conditional on specific values of sensation seeking and alcohol use. But in the case of Figure B2, predictions across a greater range of conditional values are possible because the x-axis is extended to include the full observed range of the data.

The added elements in the display also increase the transparency of the data and clarify the inferences that can be made from the results. For instance, though J-N analyses indicated a significant protective effect of sensation seeking when alcohol use was 2.15 SD below the mean (b = −2.93, 95% CI = [−5.84, −0.02]), Figure B2a shows that this effect in fact represents no observations of alcohol use (i.e., there are no data at 2 SD below the mean, which corresponds with the 0th percentile of alcohol use). Similarly, whereas Figure B1 suggests that sensation seeking was protective against alcohol-related problems at 1 SD below the mean of alcohol use, Figure B2a shows that this effect was not significant (b = −0.94, 95% CI = [−2.72, 0.84]). In contrast, the figure shows that when alcohol use was at the mean, sensation seeking had no observed effect on alcohol-related problems (b = 0.80, 95% CI = [−0.52, 2.11]), and when use alcohol use was at 1 SD above the mean, the effect was significant and positive (b = 2.53, 95% CI = [0.72, 4.34]). The graphic also shows that among individuals 2 SD above the mean of alcohol use, a 1-unit increase in sensation seeking was associated with slightly more than 4 additional reported alcohol-related problems per year (b = 4.27, 95% CI = 1.48, 7.05]). Note that although no observed data existed at 2 SD below the mean of alcohol use, real data are represented at 2 SD above the mean. However, the simple slope does not capture the data range in this small multiple particularly well, and a researcher may opt to interpret this slope tentatively.

In summary, Figure B2a suggests a substantively different conclusion compared with the graphics in Figure B1. Whereas the plots in Figure B1 suggest that sensation seeking was protective against alcohol-related problems at low levels of alcohol use, the revised graphic shows that this inference is in fact unsupported. Instead, they suggest that the effect of sensation seeking on alcohol-related problems is not significantly different from zero at mean levels of alcohol use or lower, but is significant and positive at both 1 and 2 SD above the mean level of alcohol use. We can consider revising the interActive graphic to depict this effect across a more representative range of the moderator, as in Figure B2b.

Footnotes

Declaration of Conflicting Interests

The author(s) declared that there were no conflicts of interest with respect to the authorship or the publication of this article.

Open Practices

All data and materials have been made publicly available via GitHub and can be accessed at https://github.com/connorjmccabe/InterActive. The complete Open Practices Disclosure for this article can be found at http://journals.sagepub.com/doi/suppl/10.1177/2515245917746792. This article has received badges for Open Data and Open Materials. More information about the Open Practices badges can be found at http://www.psychologicalscience.org/publications/badges.

^1.

Though we do not include covariates (control variables) in this example, any number of covariates can be included to explain additional error variance in the outcome variable or to examine the effect of variables of interest on a dependent variable above and beyond the effects of these covariates. As is the case with any other predictor, the interaction coefficient will be affected by the inclusion of a covariate to the degree that (a) the interaction term and its constituent lower-order variables are highly intercorrelated with the covariate (i.e., multicollinearity) and (b) the variance in the dependent variable is explained by the covariate (Cohen, Cohen, West, & Aiken, 2003).

References

Aguinis H (1995). Statistical power with moderated multiple regression in management research. Journal of Management, 21, 1141–1158. doi: 10.1177/014920639502100607 [DOI] [Google Scholar]
Ai C, & Norton EC (2003). Interaction terms in logit and probit models. Economics Letters, 80, 123–129. doi: 10.1016/S0165-1765(03)00032-6 [DOI] [Google Scholar]
Aiken LS, & West SG (1991). Multiple regression: Testing and interpreting interactions. Thousand Oaks, CA: Sage. [Google Scholar]
American Psychological Association. (2010). Publication manual of the American Psychological Association (6th ed.). Washington, DC: Author. [Google Scholar]
Berry WD, Golder M, & Milton D (2012). Improving tests of theories positing interaction. The Journal of Politics, 74, 653–671. doi: 10.1017/S0022381612000199 [DOI] [Google Scholar]
Bissonnette V, Ickes W, Bernstein I, & Knowles E (1990). Personality moderating variables: A warning about statistical artifact and a comparison of analytic techniques. Journal of Personality, 58, 567–587. [Google Scholar]
Brambor T, Clark WR, & Golder M (2006). Understanding interaction models: Improving empirical analyses. Political Analysis, 14, 63–82. doi: 10.1093/pan/mpi014 [DOI] [Google Scholar]
Chang W, Cheng J, Allaire JJ, Xie Y, & McPherson J (2017). shiny: Web application framework for R (R Package Version 1.0.5) [Computer software]. Retrieved from https://CRAN.R-project.org/package=shiny
Cohen J, Cohen P, West SG, & Aiken LS (2003). Applied multiple regression/correlation analysis for the behavioral sciences (3rd ed.). Hillsdale, NJ: Erlbaum. [Google Scholar]
Cumming G (2014). The new statistics: Why and how. Psychological Science, 25, 7–29. doi: 10.1177/0956797613504966 [DOI] [PubMed] [Google Scholar]
Dalal DK, & Zickar MJ (2012). Some common myths about centering predictor variables in moderated multiple regression and polynomial regression. Organizational Research Methods, 15, 339–362. doi: 10.1177/1094428111430540 [DOI] [Google Scholar]
Dawson JF (2014). Moderation in management research: What, why, when, and how. Journal of Business and Psychology, 29, 1–19. doi: 10.1007/s10869-013-9308-7 [DOI] [Google Scholar]
Ellis BJ, & Boyce WT (2008). Biological sensitivity to context. Current Directions in Psychological Science, 17, 183–187. doi: 10.1111/j.1467-8721.2008.00571.x [DOI] [Google Scholar]
Frazier PA, Tix AP, & Barron KE (2004). Testing moderator and mediator effects in counseling psychology research. Journal of Counseling Psychology, 51, 115–134. doi: 10.1037/0022-0167.51.1.115 [DOI] [Google Scholar]
Hayes AF, & Matthes J (2009). Computational procedures for probing interactions in OLS and logistic regression: SPSS and SAS implementations. Behavior Research Methods, 41, 924–936. doi: 10.3758/BRM.41.3.924 [DOI] [PubMed] [Google Scholar]
Johnson PO, & Fay LC (1950). The Johnson-Neyman technique, its theory and application. Psychometrika, 15, 349–367. doi: 10.1007/BF02288864 [DOI] [PubMed] [Google Scholar]
Johnson PO, & Neyman J (1936). Tests of certain linear hypotheses and their application to some educational problems. Statistical Research Memoirs, 1, 57–93. doi: 10.2307/302397 [DOI] [Google Scholar]
Karaca-Mandic P, Norton EC, & Dowd B (2012). Interaction terms in nonlinear models. Health Services Research, 47, 255–274. doi: 10.1111/j.1475-6773.2011.01314.x [DOI] [PMC free article] [PubMed] [Google Scholar]
King G, Tomz M, & Wittenberg J (2000). Making the most of statistical analyses: Improving interpretation and presentation. American Journal of Political Science, 44, 347–361. doi: 10.2307/2669316 [DOI] [Google Scholar]
King KM, Karyadi KA, Luk JW, & Patock-Peckham JA (2011). Dispositions to rash action moderate the associations between concurrent drinking, depressive symptoms, and alcohol problems during emerging adulthood. Psychology of Addictive Behaviors, 25, 446–454. doi: 10.1037/a0023777 [DOI] [PubMed] [Google Scholar]
Luthar SS, Cicchetti D, & Becker B (2000). The construct of resilience: A critical evaluation and guidelines for future work. Child Development, 71, 543–562. doi: 10.1111/1467-8624.00164 [DOI] [PMC free article] [PubMed] [Google Scholar]
MacCallum RC, Browne MW, & Sugawara HM (1996). Power analysis and determination of sample size for covariance structure modeling. Psychological Methods, 1, 130–149. doi: 10.1037//1082-989X.1.2.130 [DOI] [Google Scholar]
MacCallum RC, & Mar CM (1995). Distinguishing between moderator and quadratic effects in multiple regression. Psychological Bulletin, 118, 405–421. doi: 10.1037/0033-2909.118.3.405 [DOI] [Google Scholar]
Monroe SM, & Simons AD (1991). Diathesis-stress theories in the context of life stress research: Implications for the depressive disorders. Psychological Bulletin, 110, 406–425. doi: 10.1037/0033-2909.110.3.406 [DOI] [PubMed] [Google Scholar]
Potthoff JF (1964). On the Johnson-Neyman technique and some extensions thereof. Psychometrika, 29, 241–256. [Google Scholar]
Preacher K, Curran PJ, & Bauer DJ (2006). Computational tools for probing interactions in multiple linear regression, multilevel modeling, and latent curve analysis. Journal of Educational and Behavioral Statistics, 31, 437–448. [Google Scholar]
R Development Core Team. (2016). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. [Google Scholar]
Roisman GI, Newman DA, Fraley RC, Haltigan JD, Groh AM, & Haydon KC (2012). Distinguishing differential susceptibility from diathesis-stress: Recommendations for evaluating interaction effects. Development and Psychopathology, 24, 389–409. doi: 10.1017/S0954579412000065 [DOI] [PubMed] [Google Scholar]
Stone-Romero EF, Alliger GM, & Aguinis H (1994). Type II error problems in the use of moderated multiple regression for the detection of moderating effects of dichotomous variables. Journal of Management, 20, 167–178. [Google Scholar]
Stone-Romero EF, & Anderson LE (1994). Relative power of moderated multiple regression and the comparison of subgroup correlation coefficients for detecting moderating effects. Journal of Applied Psychology, 79, 354–359. [Google Scholar]
Tay L, Parrigon S, Huang Q, & LeBreton JM (2016). Graphical descriptives: A way to improve data transparency and methodological rigor in psychology. Perspectives on Psychological Science, 11, 692–701. doi: 10.1177/1745691616663875 [DOI] [PubMed] [Google Scholar]
Tufte ER (2001). The visual display of quantitative information. Cheshire, CT: Graphics Press. [Google Scholar]
Wickham H (2009). ggplot2: Elegant graphics for data analysis. doi: 10.1007/978-0-387-98141-3 [DOI] [Google Scholar]

[R1] Aguinis H (1995). Statistical power with moderated multiple regression in management research. Journal of Management, 21, 1141–1158. doi: 10.1177/014920639502100607 [DOI] [Google Scholar]

[R2] Ai C, & Norton EC (2003). Interaction terms in logit and probit models. Economics Letters, 80, 123–129. doi: 10.1016/S0165-1765(03)00032-6 [DOI] [Google Scholar]

[R3] Aiken LS, & West SG (1991). Multiple regression: Testing and interpreting interactions. Thousand Oaks, CA: Sage. [Google Scholar]

[R4] American Psychological Association. (2010). Publication manual of the American Psychological Association (6th ed.). Washington, DC: Author. [Google Scholar]

[R5] Berry WD, Golder M, & Milton D (2012). Improving tests of theories positing interaction. The Journal of Politics, 74, 653–671. doi: 10.1017/S0022381612000199 [DOI] [Google Scholar]

[R6] Bissonnette V, Ickes W, Bernstein I, & Knowles E (1990). Personality moderating variables: A warning about statistical artifact and a comparison of analytic techniques. Journal of Personality, 58, 567–587. [Google Scholar]

[R7] Brambor T, Clark WR, & Golder M (2006). Understanding interaction models: Improving empirical analyses. Political Analysis, 14, 63–82. doi: 10.1093/pan/mpi014 [DOI] [Google Scholar]

[R8] Chang W, Cheng J, Allaire JJ, Xie Y, & McPherson J (2017). shiny: Web application framework for R (R Package Version 1.0.5) [Computer software]. Retrieved from https://CRAN.R-project.org/package=shiny

[R9] Cohen J, Cohen P, West SG, & Aiken LS (2003). Applied multiple regression/correlation analysis for the behavioral sciences (3rd ed.). Hillsdale, NJ: Erlbaum. [Google Scholar]

[R10] Cumming G (2014). The new statistics: Why and how. Psychological Science, 25, 7–29. doi: 10.1177/0956797613504966 [DOI] [PubMed] [Google Scholar]

[R11] Dalal DK, & Zickar MJ (2012). Some common myths about centering predictor variables in moderated multiple regression and polynomial regression. Organizational Research Methods, 15, 339–362. doi: 10.1177/1094428111430540 [DOI] [Google Scholar]

[R12] Dawson JF (2014). Moderation in management research: What, why, when, and how. Journal of Business and Psychology, 29, 1–19. doi: 10.1007/s10869-013-9308-7 [DOI] [Google Scholar]

[R13] Ellis BJ, & Boyce WT (2008). Biological sensitivity to context. Current Directions in Psychological Science, 17, 183–187. doi: 10.1111/j.1467-8721.2008.00571.x [DOI] [Google Scholar]

[R14] Frazier PA, Tix AP, & Barron KE (2004). Testing moderator and mediator effects in counseling psychology research. Journal of Counseling Psychology, 51, 115–134. doi: 10.1037/0022-0167.51.1.115 [DOI] [Google Scholar]

[R15] Hayes AF, & Matthes J (2009). Computational procedures for probing interactions in OLS and logistic regression: SPSS and SAS implementations. Behavior Research Methods, 41, 924–936. doi: 10.3758/BRM.41.3.924 [DOI] [PubMed] [Google Scholar]

[R16] Johnson PO, & Fay LC (1950). The Johnson-Neyman technique, its theory and application. Psychometrika, 15, 349–367. doi: 10.1007/BF02288864 [DOI] [PubMed] [Google Scholar]

[R17] Johnson PO, & Neyman J (1936). Tests of certain linear hypotheses and their application to some educational problems. Statistical Research Memoirs, 1, 57–93. doi: 10.2307/302397 [DOI] [Google Scholar]

[R18] Karaca-Mandic P, Norton EC, & Dowd B (2012). Interaction terms in nonlinear models. Health Services Research, 47, 255–274. doi: 10.1111/j.1475-6773.2011.01314.x [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] King G, Tomz M, & Wittenberg J (2000). Making the most of statistical analyses: Improving interpretation and presentation. American Journal of Political Science, 44, 347–361. doi: 10.2307/2669316 [DOI] [Google Scholar]

[R20] King KM, Karyadi KA, Luk JW, & Patock-Peckham JA (2011). Dispositions to rash action moderate the associations between concurrent drinking, depressive symptoms, and alcohol problems during emerging adulthood. Psychology of Addictive Behaviors, 25, 446–454. doi: 10.1037/a0023777 [DOI] [PubMed] [Google Scholar]

[R21] Luthar SS, Cicchetti D, & Becker B (2000). The construct of resilience: A critical evaluation and guidelines for future work. Child Development, 71, 543–562. doi: 10.1111/1467-8624.00164 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] MacCallum RC, Browne MW, & Sugawara HM (1996). Power analysis and determination of sample size for covariance structure modeling. Psychological Methods, 1, 130–149. doi: 10.1037//1082-989X.1.2.130 [DOI] [Google Scholar]

[R23] MacCallum RC, & Mar CM (1995). Distinguishing between moderator and quadratic effects in multiple regression. Psychological Bulletin, 118, 405–421. doi: 10.1037/0033-2909.118.3.405 [DOI] [Google Scholar]

[R24] Monroe SM, & Simons AD (1991). Diathesis-stress theories in the context of life stress research: Implications for the depressive disorders. Psychological Bulletin, 110, 406–425. doi: 10.1037/0033-2909.110.3.406 [DOI] [PubMed] [Google Scholar]

[R25] Potthoff JF (1964). On the Johnson-Neyman technique and some extensions thereof. Psychometrika, 29, 241–256. [Google Scholar]

[R26] Preacher K, Curran PJ, & Bauer DJ (2006). Computational tools for probing interactions in multiple linear regression, multilevel modeling, and latent curve analysis. Journal of Educational and Behavioral Statistics, 31, 437–448. [Google Scholar]

[R27] R Development Core Team. (2016). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. [Google Scholar]

[R28] Roisman GI, Newman DA, Fraley RC, Haltigan JD, Groh AM, & Haydon KC (2012). Distinguishing differential susceptibility from diathesis-stress: Recommendations for evaluating interaction effects. Development and Psychopathology, 24, 389–409. doi: 10.1017/S0954579412000065 [DOI] [PubMed] [Google Scholar]

[R29] Stone-Romero EF, Alliger GM, & Aguinis H (1994). Type II error problems in the use of moderated multiple regression for the detection of moderating effects of dichotomous variables. Journal of Management, 20, 167–178. [Google Scholar]

[R30] Stone-Romero EF, & Anderson LE (1994). Relative power of moderated multiple regression and the comparison of subgroup correlation coefficients for detecting moderating effects. Journal of Applied Psychology, 79, 354–359. [Google Scholar]

[R31] Tay L, Parrigon S, Huang Q, & LeBreton JM (2016). Graphical descriptives: A way to improve data transparency and methodological rigor in psychology. Perspectives on Psychological Science, 11, 692–701. doi: 10.1177/1745691616663875 [DOI] [PubMed] [Google Scholar]

[R32] Tufte ER (2001). The visual display of quantitative information. Cheshire, CT: Graphics Press. [Google Scholar]

[R33] Wickham H (2009). ggplot2: Elegant graphics for data analysis. doi: 10.1007/978-0-387-98141-3 [DOI] [Google Scholar]

PERMALINK

Improving Present Practices in the Visual Display of Interactions

Connor J McCabe

Dale S Kim

Kevin M King

Abstract

The Utility of Interaction Displays

Fig. 1.

Disclosures