In 2013, we reported in Psychological Science on a longitudinal field experiment in which we and our coauthors randomly assigned participants to receive or not to receive positive-emotions training over 6 weeks, in order to test theory-driven hypotheses regarding the pathways by which positive emotions might build physical health (Kok et al., 2013). Our results revealed that, amidst the unpredictability of field settings, positive-emotions training produced statistically significant improvements in a marker of physical health, and that these improvements were mediated by psychological processes. On the basis of her reanalyses of our data, Nickerson (2018) claims that we overstated our findings. As we detail here, we find the empirical basis for Nickerson’s claim to be questionable.
The Influence of Statistical Outliers
In discussing “unusual” participants, Nickerson argues that extreme and implausible values for high-frequency heart rate variability (HF-HRV) for 7 participants led us to overstate the effects of the positive-emotions training deployed, known as loving-kindness meditation (LKM). However, Nickerson’s method of identifying influential observations does not fit the analysis we conducted. Specifically, although Nickerson characterizes seven data points as “unusual,” she spotlights 5 participants with the highest scores for final vagal tone (see her Fig. 1). Given that the ultimate variable of interest in the analysis we reported in our 2013 article represented change in vagal tone from baseline to the end of the study (see Kok et al., 2013, Fig. 2), Nickerson’s focus on final vagal tone is inappropriate.
Closer inspection of Nickerson’s Figure 1 reveals that of the five data points that she characterizes as extreme, the three rightmost points show end-of-study values that are consistent with their corresponding baseline values. Even though these three values are high relative to the other HF-HRV scores, the change in vagal tone that they represent is not itself disproportionately high. Thus, in our statistical model, which rested on change in vagal tone, these three data points were not statistical outliers.
Nickerson also wonders how our team could disagree with the authors of a previously published Commentary on our 2013 article (Heathers, Brown, Coyne, & Friedman, 2015) regarding the biological validity of the seven vagal-tone values she refers to as unusual. It is because, as stated in our published response to Heathers et al. (Kok & Fredrickson, 2015), our determination of biological validity was based on having an outside expert (blind to experimental condition) inspect the raw electrocardiogram (ECG) data for each participant in question—a step that Heathers et al. did not take. The expert we consulted was James Long, of the James Long Company, who has provided software and hardware solutions for psychophysiological research since 1979. Long determined that the values for all participants in our sample, except the two identified and excluded in our response to Heathers et al. (participants 557004 and 557027), were the result of biologically plausible ECG recordings and not of error.1 Therefore, it is inappropriate to exclude additional participants from analysis.
Nickerson’s approach to the 7 participants she designated as unusual was to exclude them from her reanalysis of our data. A truism in statistics is that statistically significant findings can be rendered nonsignificant with sufficient reductions in sample size and, correspondingly, in statistical power. In this light, it is noteworthy that Nickerson’s approach reduced the available sample by more than 13%.
Figure 1 presents a histogram for change in vagal tone (calculated by subtracting baseline HF-HRV from end-of-study HF-HRV) for each experimental condition (participants 557004 and 557027 excluded).2 We agree with Nickerson that the rightmost value in the LKM histogram may plausibly function as an influential statistical outlier. We do not agree, however, with her strategy of addressing outliers by expunging data. An alternative analytic method for removing the disproportionate influence of outliers, one that does not reduce statistical power, is to Winsorize any extreme values by replacing them with the value from the 95th percentile (Rivest, 1994). This approach simultaneously reduces the distortion that can be caused by an outlying value while retaining most of the information provided by the data point (in this case, that the individual had a high value for change in vagal tone).
After Winsorizing the data (and again excluding the two cases identified as biologically implausible in Kok & Fredrickson, 2015), we found that empirical support for our three original hypotheses stood steady. Specifically, we reran our original model (see Fig. 2 in Kok et al., 2013) using the 95th-percentile method of Winsorizing extreme data points for change in vagal tone. Six outlying values,3 including the rightmost value of concern to Nickerson (see Fig. 1), were adjusted. The model fit continued to be acceptable (root-mean-square error of approximation = 0.084, 90% confidence interval = [0.063, 0.103]; comparative fit index = 0.945; Tucker-Lewis index = 0.943), and the results continued to support our three hypotheses: As predicted by Hypothesis 1, the practice of LKM increased positive emotions (b = 0.055, z = 3.770, p < .001), and baseline vagal tone moderated this effect such that participants who began the study with higher values of HF-HRV showed greater increases in positive emotions over 9 weeks’ time (b = 0.047, z = 3.003, p = .003). As predicted by Hypothesis 2, increased positive emotions predicted increased perceived social connections (b = 1.065, z = 4.184, p < .001). And as predicted by Hypothesis 3, greater increase in perceived social connections predicted larger changes in vagal tone over the course of the study (b = 3.795, z = 2.008, p = .045).
Within-Person Analyses?
Nickerson argues that evidence for upward spirals needs to rest on within-person analyses rather than between-person analyses. We agree with Nickerson that within-person tests are of considerable interest. We find fault, however, with her claim that she has provided a within-person test and show that her between-person test rests on data stripped of much of its information value.
A key strength of our original study is its experimental design, in which participants were randomly assigned to either a control condition or an intervention condition (LKM). Both our original analysis and the reanalysis Nickerson offers in the latter section of her Commentary are between-person tests that compare these two groups. Even though both her approach and ours include derived variables to represent within-person changes (i.e., computed indices of change in her reanalysis; latent scores in our analysis), they both ultimately compare these within-person variables using between-person tests. Statistical experts argue that any effects assessed using a single derived value per person to represent within-person change on a repeated measure—whether as an independent or a dependent variable—are unambiguously between-person effects (Curran, Howard, Bainter, Lane, & McGinley, 2014). So, although Nickerson claims that her reanalysis provides a test of within-person effects, it does not.
The primary difference between our original analysis and Nickerson’s reanalysis is that she chose to drastically reduce the information value of our data before conducting her reanalysis. She did this by dichotomizing the data. (We note that only 1 of Nickerson’s 195 computed indices of change—across three variables for each of 65 participants—resulted in a value of “no change.” Therefore, Nickerson’s trichotomization strategy in effect reduces to dichotomization.) Dichotomizing continuous data is strongly discouraged by statistical experts because it has been decisively linked to a wide variety of adverse statistical consequences, including (a) loss of information about individual differences, (b) loss of effect size and power, and (c) loss of measurement reliability (MacCallum, Zhang, Preacher, & Rucker, 2002). Thus, Nickerson’s tallies (in the rightmost column of her Table 1) drastically diminish the information value of the original data. With her extreme approach to data reduction, it is foreseeable that statistical significance would be lost in almost any analysis of these tallies.
In sum, Nickerson’s approach of conducting a between-person test on dichotomized data is far less sensitive than our original analysis, which used the full information within our continuous data.
Future Directions
Although the trio of hypotheses we tested in our study continue to be supported by the initial data we collected in 2007, we acknowledge that the scientific study of upward-spiral processes remains in its infancy (for a review, see Fredrickson & Joiner, in press). Only a handful of studies, for instance, have incorporated biological variables into tests of the prospective and reciprocal relations that form upward-spiral dynamics (i.e., Burns et al., 2008; Kok & Fredrickson, 20104). Whereas multiple laboratories have reported cross-sectional positive associations between cardiac vagal tone and well-being (both physical and mental; e.g., Bhattacharyya, Whitehead, Rakhit, & Steptoe, 2008; Geisler, Vennewald, Kubiak, & Weber, 2010; Marsland et al., 2007; Oveis et al., 2009; Wang, Lu, & Qin, 2013; for nonlinear associations, see Kogan, Gruber, Shallcross, Ford, & Mauss, 2013; yet for null results, see Silvia, Jackson, & Sopko, 2014, and Sloan et al., 2017), to date, no direct or conceptual replications of the longitudinal field experiment we reported in 2013 have been published. Of relevance for further longitudinal investigations of upward-spiral dynamics are new statistical tools that can be used for rigorous and simultaneous tests of between-person and within-person effects over time (e.g., group iterative multiple-model estimation: see Beltz, Wright, Sprague, & Molenaar, 2016; latent curve models with structured residuals: see Curran et al., 2014). These advanced statistical tools, however, require larger samples (e.g., N = 250) and more frequent repeated assessments (e.g., five time points) than are available in the data set on which our study was based. Rigorous and well-powered tests are thus still needed to further examine whether and how self-generated positive emotions improve vagal tone and other objective markers of physical health, and whether they do so in an upward-spiral dynamic.
Supplementary Material
Acknowledgments
The authors wish to thank Aaron Boulton, Sara Algoe, Ann Firestine, and James Long for their assistance with preparing this response.
We thank Nickerson for bringing to our attention an additional participant (557055) whose data we had inadvertently failed to send to James Long for evaluation in 2015. Upon inspecting the ECG data for this participant in 2017, Long noted a single ectopic beat near the beginning of the 2-min resting phase used to determine end-of-study vagal tone. This single ectopic beat had artifactually produced an atypically high HF-HRV value (i.e., 0.0106). Long recommended that we recompute this participant’s HF-HRV value by excluding the first 16 s of data, which contained the ectopic beat. Doing so resulted in a corrected value more typical for our sample (i.e., 0.000167). All data representations and analyses reported here use this corrected value.
We chose to illustrate change in vagal tone using difference scores in this figure to retain the interpretability of x-axis values above and below zero as positive and negative change over time, respectively. We note that the statistical models reported here and in our original article predict change in vagal tone as indexed by residualized change scores. We have provided a supplementary figure that presents corresponding condition-specific histograms based on residualized change scores at the Open Science Framework (https://osf.io/jazfy/). The distributions, outliers, and condition-specific means in that figure are comparable to those reported here.
Values for change in vagal tone were Winsorized for participants 557012, 557016, 557037, 557059, 557065, and 557070.
We note that in this earlier study, we examined individual differences in upward-spiral dynamics using the same data set as used in Kok et al. (2013), with statistical controls for group assignment. Across these two publications, we concluded that upward spirals between positive emotions and cardiac vagal tone occur naturally to greater or lesser degrees (Kok & Fredrickson, 2010) and that when individuals are randomly assigned to receive positive-emotions training, their upward spirals can be nudged to advance more swiftly (Kok et al., 2013).
Footnotes
Action Editor: D. Stephen Lindsay served as action editor for this article.
Author Contributions: B. L. Fredrickson and B. E. Kok each drafted sections of the manuscript and provided critical revisions. Both authors approved the final version of the manuscript for submission.
Declaration of Conflicting Interests: B. L. Fredrickson served as a paid consultant to Happify.com on the subject of evidence-based well-being interventions. B. L. Fredrickson also serves as an uncompensated advisory board member for several nonprofit organizations, including the Positive Coaching Alliance, The Psych Report, and the VIA Institute on Character. The authors declared that they had no conflicts of interest with respect to the authorship or the publication of this article.
Funding: The data reanalyzed in this article were collected with support from a research grant (MH59615) awarded to Principal Investigator B. L. Fredrickson by the National Institute of Mental Health of the U.S. National Institutes of Health (NIH). Support for B. L. Fredrickson’s time came from a Research and Study Assignment from the College of Arts and Sciences at the University of North Carolina at Chapel Hill, a Cattell Sabbatical Award from the Association for Psychological Science, and three research grants awarded by the NIH: a National Institute for Nursing Research Grant (NR012899), an award supported by the NIH Common Fund, which is managed by the NIH Office of the Director/Office of Strategic Coordination; a National Cancer Institute Research Grant (CA170128); and a National Center for Complementary and Integrative Health Research Grant (AT007884). These funding agencies played no role in the decision to publish this Commentary or in the preparation of the manuscript.
Open Practices:
All data, along with James Long’s assessments of the disputed cases, have been made publicly available via the Open Science Framework and can be accessed at https://osf.io/jazfy/. The complete Open Practices Disclosure for this article can be found at http://journals.sagepub.com/doi/suppl/10.1177/0956797617707319. This article has received the badge for Open Data. More information about the Open Practices badges can be found at http://www.psychologicalscience.org/publications/badges.
References
- Beltz A. M., Wright A. G. C., Sprague B. N., Molenaar P. C. M. (2016). Bridging the nomothetic and ideographic approaches to the analysis of clinical data. Assessment, 23, 447–458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bhattacharyya M. R., Whitehead D. L., Rakhit R., Steptoe A. (2008). Depressed mood, positive affect, and heart rate variability in patients with suspected coronary artery disease. Psychosomatic Medicine, 70, 1020–1027. [DOI] [PubMed] [Google Scholar]
- Burns A. B., Brown J. S., Sachs-Ericsson N., Plant E. A., Curtis J. T., Fredrickson B. L., Joiner T. E. (2008). Upward spirals of positive emotion and coping: Replication, extension, and initial exploration of neurochemical substrates. Personality and Individual Differences, 44, 360–370. doi: 10.1016/j.paid.2007.08.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Curran P. J., Howard A. L., Bainter S. A., Lane S. T., McGinley J. S. (2014). The separation of between-person and within-person components of individual change over time: A latent curve model with structured residuals. Journal of Consulting and Clinical Psychology, 82, 879–894. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fredrickson B. L., Joiner T. (in press). Reflections on positive emotions and upward spirals. Perspectives on Psychological Science. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Geisler F. C. M., Vennewald N., Kubiak T., Weber H. (2010). The impact of heart rate variability on subjective well-being is mediated by emotion regulation. Personality and Individual Differences, 49, 723–728. [Google Scholar]
- Heathers J. A. J., Brown N. J. L., Coyne J. C., Friedman H. L. (2015). The elusory upward spiral: A reanalysis of Kok et al. (2013). Psychological Science, 26, 1140–1143. doi: 10.1177/0956797615572908 [DOI] [PubMed] [Google Scholar]
- Kogan A., Gruber J., Shallcross A. J., Ford B. Q., Mauss I. B. (2013). Too much of a good thing? Cardiac vagal tone’s nonlinear relationship with well-being. Emotion, 13, 599–604. [DOI] [PubMed] [Google Scholar]
- Kok B. E., Coffey K. A., Cohn M. A., Catalino L. I., Vacharkulksemsuk T., Algoe S. B., . . . Fredrickson B. L. (2013). How positive emotions build physical health: Perceived positive social connections account for the upward spiral between positive emotions and vagal tone. Psychological Science, 24, 1123–1132. [DOI] [PubMed] [Google Scholar]
- Kok B. E., Fredrickson B. L. (2010). Upward spirals of the heart: Autonomic flexibility, as indexed by vagal tone, reciprocally and prospectively predicts positive emotions and social connectedness. Biological Psychology, 85, 432–436. doi: 10.1016/j.biopsycho.2010.09.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kok B. E., Fredrickson B. L. (2015). Evidence for the upward spiral stands steady: A response to Heathers, Brown, Coyne, and Friedman (2015). Psychological Science, 26, 1144–1146. doi: 10.1177/0956797615584304 [DOI] [PubMed] [Google Scholar]
- MacCallum R. C., Zhang S., Preacher K. J., Rucker D. D. (2002). On the practice of dichotomization of quantitative variables. Psychological Methods, 7, 19–40. [DOI] [PubMed] [Google Scholar]
- Marsland A. L., Gianaros P. J., Prather A. A., Jennings J. R., Neumann S. A., Manuck S. B. (2007). Stimulated production of proinflammatory cytokines covaries inversely with heart rate variability. Psychosomatic Medicine, 69, 709–716. [DOI] [PubMed] [Google Scholar]
- Nickerson C. A. (2018). Overstating the effects of loving-kindness meditation: Comment on Kok et al. (2013). Psychological Science, 29, 463–466. [DOI] [PubMed] [Google Scholar]
- Oveis C., Cohen A. B., Gruber J., Shiota M. N., Haidt J., Keltner D. (2009). Resting respiratory sinus arrhythmia is associated with tonic positive emotionality. Emotion, 9, 265–270. [DOI] [PubMed] [Google Scholar]
- Rivest L. (1994). Statistical properties of Winsorized means for skewed distributions. Biometrika, 81, 373–383. [Google Scholar]
- Silvia P. J., Jackson B. A., Sopko R. S. (2014). Does baseline heart rate variability reflect stable positive emotionality? Personality and Individual Differences, 70, 183–187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sloan R. P., Schwarz E., McKinley P. S., Weinstein M., Love G., Ryff C., . . . Seeman T. (2017). Vagally-mediated heart rate variability and indices of well-being: Results of a nationally representative study. Health Psychology, 36, 73–81. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Z., Lu W., Qin R. (2013). Respiratory sinus arrhythmia is associated with trait positive affect and positive emotional expressivity. Biological Psychology, 93, 190–196. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.