Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Mar 1.
Published in final edited form as: J Exp Soc Psychol. 2022 Jan 4;99:104278. doi: 10.1016/j.jesp.2021.104278

Application of sentence-level text analysis: The role of emotion in an experimental learning intervention

Manyu Li 1
PMCID: PMC8803271  NIHMSID: NIHMS1770696  PMID: 35110783

Abstract

This registered study aimed at testing the role of emotion in the intervention effect of an experimental intervention study in academic settings. Previous analyses of the National Study of the Learning Mindset (Yeager et al., 2019) showed that in a randomized controlled trial, high school students who were given the growth mindset intervention had, on average higher GPA than did students in the control condition. Previous analyses also showed that school achievement levels moderated the intervention effect. This study applied a sentence-level text analysis strategy to detect participants’ attentional focus in five emotional dimensions (valence, arousal, dominance/control, approach-avoidant, and uncertainty) across three writing prompts students wrote during the intervention. Linear mixed models were conducted to test if emotional dimension scores computed using the text analysis predicted a higher intervention effect (i.e., higher post-intervention GPA given pre-intervention GPA). The moderating role of school achievement levels was also examined. The results of this study have implications on the possibility of applying text analysis strategies on open-ended questions in interventions or experimental studies to examine the role of the emotion-attentional focus of participants during intervention or experimental studies on the intervention or experimental outcomes, especially those that are conducted in academic settings.

Keywords: text analysis, emotion, natural language processing, learning intervention, academic performance


Human cognitive processes are tied to emotion (Pekrun, 2006; Tyng, Amin, Saad, & Malik, 2017). Emotion was found to be related to attention and engagement (Ballenghein, Megalakaki, & Baccino, 2019; Meinhardt & Pekrun, 2003; Yiend, Barnicot, Williams, & Fox, 2018), perception (Pourtois, Schettino, & Vuilleumier, 2013), reward motivation (Padmala, Sambuco, & Pessoa, 2019), and memory (Schweizer et al., 2019). Recent evidence showed that emotion also plays a role in learning, such as learning in academic settings (Mayer, 2019), and multimedia learning (Um, Plass, Hayward, & Homer, 2012). To expand our knowledge on the role of emotion in experimental settings and learning intervention, this study examined the relationship between emotion and learning outcome in a large-scale, e-learning intervention study in educational settings. Specifically, this study uses data from the National Study of the Learning Mindset (NSLM) to explore whether the use of affectively connotative lexicon (as measured along five emotional dimensions) detected in students’ writing during the intervention study were associated with the effectiveness of a computer-based learning mindset intervention. A text analysis approach will be used to score the emotional dimensions reflected in students’ writing.

Emotion and Learning in Experimental/Intervention Studies in Academic Settings

In this study, emotion is conceptually defined as consisting of “a feeling state/process that motivates and organizes cognition and action… [and] antecedent cognitive appraisals and ongoing cognition… and may motivate approach or avoidant behavior, exercise control/regulation of responses” (Izard, 2010, p. 397). In this study, emotion is operationalized as multidimensional, including valence, arousal/activation, dominance/control (VAD), uncertainty, and approach-avoidance, all of which are commonly included in classic and large-scale cross-cultural studies on emotion and emotion semantics (e.g., Fontaine, Scherer, Roesch, & Ellsworth, 2007; Jackson et al., 2019).

According to the Control-Value Theory (CVT) of achievement emotions (Pekrun, Frenzel, Goetz, & Perry, 2007; Pekrun & Perry, 2014), the emotion experienced in a learning or academic setting (i.e. emotions tied to achievement situations or achievement emotion) may contribute to one’s learning outcomes or academic performance (Artino & Pekrun, 2014; Duffy, Lajoie, Pekrun, & Lachapelle, 2018; Peterson, Brown, & Jun, 2015). The theory posits that learners’ emotion comes from their judgment or appraisal of the value of the task and their control over the task, resulting in emotion including two main dimensions: Valence (evaluation of pleasantness, or positive vs. negative feelings), and activation/arousal (physiologically activating or deactivating emotion) (Pekrun, Goetz, Titz, & Perry, 2002; Pekrun & Perry, 2014). Generally, increased appraisal of positive value and control, and increased positive valence are associated with positive outcomes in academic settings (Goetz, Pekrun, Hall, & Haag, 2006; Pekrun, Goetz, Frenzel, & Barchfeld, 2011; Pekrun, Lichtenfeld, Marsh, Murayama, & Goetz, 2017; Pekrun & Perry, 2014), experimental studies (Heidig, Müller, & Reichelt, 2014; Münchow & Bannert, 2019; Park, Knörzer, Plass, & Brünken, 2015; Plass, Heidig, Hayward, Homer, & Um, 2014), and in intervention/training programs (Duffy et al., 2018). Results for the second dimension, arousal were mixed. While some studies found that emotional arousal directly predicted performance (e.g., Chung, Cheon, & Lee, 2015) and memory (Otani, Libkuman, Widner, & Graves, 2007) in academic settings, other studies found that emotional arousal may impair memory functioning and learning outcomes (e.g., Madan, Fujiwara, Caplan, & Sommer, 2017).

Another model of emotional dimensions, the Valence-Arousal-Dominance (VAD) model (or Pleasure-Arousal-Dominance model, PAD) aligns with the CVT. The VAD highlights that the three fundamental emotion dimensions are valence, arousal, and dominance/control (Russell & Mehrabian, 1977). According to the VAD (and consistent with CVT), arousal is the degree of alertness and engagement people experience. Positive activating emotion includes joy and surprise, and negative activating emotion includes anger, disgust, anxiety, and fear, while sadness is conceptualized as a negative deactivating emotion (Frijda, 1986; Goudbeek & Scherer, 2010; Russell & Mehrabian, 1977). Recent studies found similar mapping of basic emotion to the VAD model (e.g., Fontaine et al., 2007). Similar to the CVT, VAD sees dominance/control as an important determinant of emotion. There is only a slight difference between the two theories: While CVT places control as an antecedent of emotional valence and arousal, the VAD includes control as a part of the emotion appraisal process (consistent with the emotion definition in this study). Such feeling of control was found to map to basic emotion of joy (+), anger (+), anticipation (+), sad (−), fear (−), and anxiety (−) (Fontaine et al., 2007; Goudbeek & Scherer, 2010; Russell & Mehrabian, 1977). Research on learning and the perception of control found that control was predictive of learning outcomes (Goetz et al., 2006). Although not directly referring to control emotion, many studies examining students’ perceived academic control or how participants “felt” (Hall, 2008, p. 1134) about their control in academic settings found that such cognitive appraisal of control predicts academic success (e.g., Hall, 2008; Perry, Hladkyj, Pekrun, Clifton, & Chipperfield, 2005; Respondek, Seufert, Hamm, & Nett, 2020; Stark, Brünken, & Park, 2018). Synthesizing the CVT and the VAD, this study explored the relationship between the three emotional dimensions (V-A-D) and the intervention effect (Research Questions – RQ 1 – 3). It was hypothesized that valance (RQ1 or hypothesis 1) and dominance/control detected in students’ writing (RQ3 or hypothesis 3) positively predicted the intervention effect, moderated by school achievement levels (moderation explained further below). The relationship between arousal and intervention effect is less clear because of the mixed research findings, so the research question for arousal detected in students’ writing and the intervention effect (RQ 2) was exploratory rather than confirmatory.

Another relatively less studied emotional dimension is uncertainty or unpredictability. Although accounting for less variance than valence and arousal in dimension reduction analyses of emotion language structures (Fontaine et al., 2007; Jackson et al., 2019), uncertainty is still regarded as an important emotion appraisal process (Ellsworth, 2003; Fontaine et al., 2007). Uncertainty is defined as one’s appraisal of novelty and familiarity (Ellsworth, 2003), and is mapped to basic emotion of anticipation (+), surprise (+), fear (+), disgust (−), anger (−), and joy (−) (Smith & Ellsworth, 1985). However, it is unclear how uncertainty connects to academic experiences. The increasing feeling of uncertainty (operationalized as the lack of information in learning settings) was found to predict higher curiosity and more positive learning outcome than certainty (Campion, Martins, & Wilhelm, 2009; Lamnina & Chase, 2019; Loibl & Rummel, 2014), although uncertain emotion (unexpected and uncertain) was found to relate to higher negative affect (Lamnina & Chase, 2019). In animal research, uncertain stimuli (tones and shocks) were found to contribute to learned helplessness (Overmier & Wielkiewicz, 1983). In humans, uncertain stimuli (tones) were associated with higher anxiety and poorer behavioral adjustment and performance in gain-loss tasks (Nelson, Kessel, Jackson, & Hajcak, 2016). Uncertainty is also suggesting to be disruptive, preventing individuals to attend to cues that help them effectively cope with the negative situation (Grupe & Nitschke, 2013). Since previous findings on uncertainty and learning are mixed, this study explored (but did not conduct confirmatory testing on) the relations between uncertainty detected in students’ writing and the learning intervention effect (RQ 4).

A final emotion dimension examined in this study is approach-avoidance which is another fundamental emotion process (Elliot, Eder, & Harmon-Jones, 2013). Approach and avoidance are motivational directions (or goal orientations) that are closely linked to emotion (Elliot et al., 2013; Harmon-Jones, Harmon-Jones, & Summerell, 2017), with an approach defined as striving for and eagerly approaching a desirable state and avoidance defined as eagerly moving away from undesirable state (Scholer, Cornwell, & Higgins, 2019). Approach-avoidance is similar to emotional arousal because both may activate or deactivate behaviors, although researchers still see the two as distinctive concepts (Harmon-Jones et al., 2017). Approach-motivated emotion used to be linked only to positive emotion (approaching rewards, e.g., joy), and avoidance-motivated emotion used to be linked only to negative emotion (avoiding punishment, e.g., fear, disgust) (Cacioppo, Gardner, & Berntson, 1999; Russell & Carroll, 1999); however, studies in the past decades have suggested that approach-motivated emotion also involves negative affect, anger (Carver & Harmon-Jones, 2009; Harmon-Jones, Harmon-Jones, & Price, 2013; Marsh, Ambady, & Kleck, 2005). In terms of the link between approach-avoidance and learning or academic success, studies generally support that approach-orientation plays a positive role whereas avoidance-orientation plays a negative role (Elliot, 2020; Kirikkanat & Soyer, 2018; Lüftenegger et al., 2016). Approach (vs. avoidance) is also seen as more effective in academic settings as it encourages individuals to make plans and manage tasks (e.g., self-regulated learning) (Bartels, Magun-Jackson, & Kemp, 2009; Elliot & Moller, 2003). This study tested whether approach-avoidance detected in texts predicted the intervention effect (RQ 5). It was hypothesized that approach-avoidance detected in students’ writing (i.e. higher approach emotion) predicted a higher intervention effect.

Table 1 summarized the emotion dimensions, conceptual definitions, and RQs, and confirmatory/exploratory nature of each RQ examined in this study. This study extended previous findings on emotion and learning to explore whether emotion detected in students’ writing (by chosen lexicons and text analysis algorithm) during the intervention study was associated with the degree of intervention effect through secondary data analysis of an effective computer-based learning intervention in a national sample in the United States (for the original intervention study, see Yeager et al., 2019).

Table 1.

Emotion dimensions examined in this study.

Emotion
dimensions
Conceptual Definitions Lexicon Research Questions (RQ1 – RQ5)
Valence Positive and negative feelings, evaluation-pleasantness (Pekrun & Perry, 2014; Russell & Mehrabian, 1977) hash_sentiment_jockers_rinkers (Rinker, 2018), inclusive of NRC and other lexicons Does valence detected in texts using the lexicon predict the degree of intervention effect? (RQ1)*
Arousal Degree of alertness and engagement (Frijda, 1986) that is often related to physiological activation (heart rate) or sympathetic arousal (Fontaine et al., 2007; Pekrun, 2006). NRC VAD Does arousal detected in texts using the lexicon predict the degree of intervention effect? (RQ2)
Control Sense of power or control over the eliciting event; feelings of power or weak/submission (Osgood, May, & Miron, 1975) NRC VAD Does control emotion detected in texts using the lexicon predict the degree of intervention effect? (RQ3)*
Uncertainty Uncertainty-certainty (uc-c) is defined as the appraisals of novelty and unpredictability (Ellsworth, 2003) Generated from NRC emotion using lexicons for anticipation (uc), surprise (uc), fear (uc), disgust (c), anger (c), and joy (c) (Smith & Ellsworth, 1985) Does uncertainty emotion detected in text using the lexicon predict the degree of intervention effect? (RQ4)
Approach-Avoidance Motivational directions (or goal orientations); approach (ap) defined as striving for and eagerly approaching a desirable state and avoidance (av) defined as eagerly moving away from undesirable state (Elliot et al., 2013; Scholer et al., 2019) Generated from NRC emotion using lexicons for joy (ap), anger (ap), fear (av), and disgust (av) (Cacioppo et al., 1999; Elliot et al., 2013; Marsh et al., 2005; Russell & Carroll, 1999) Does approach-avoidant emotion detected in text using the lexicon predict the degree of intervention effect? (RQ5)*
*

Only RQ1, 3 and 5 contain confirmatory hypotheses such that higher scores in the emotional dimensions (detected by text analysis) predicted higher intervention effect. The other dimensions (RQ2 and 4) are exploratory research questions because of mixed research findings in the literature. An added research question (RQ 6) examined the confirmatory hypothesis that there will be a higher number of significant relationships between emotional dimension scores and the intervention effect in the scores from WP2 and 3 than WP1.

Automated Text Analysis of Emotion

Emotion is reflected not only through one’s non-verbal communication, but also one’s verbal communication, that is, language (for a recent review on language and emotion, see Lindquist, Gendron, Satpute, & Lindquist, 2018). The foundation of automated text analysis (particularly a word frequency approach) is the assumption that emotion, or to be more exact, the attentional focus of the writers can be determined from narratives (Bestgen, 1994; Dyer, 1983; Tausczik & Pennebaker, 2010). Attentional focus is one’s “priorities, intentions, and thoughts” (Tausczik & Pennebaker, 2010, p. 30). It reflects one’s selection of experiences (Croft & Cruse, 2004) and internal feelings, and how one processes the environmental stimuli (Tausczik & Pennebaker, 2010). The examination of attentional focus using a linguistic approach is based on the idea that “different words in a semantic frame or domain focus our attention on the different elements in the frame” (Croft & Cruse, 2004, p. 47). This approach of understanding writers’ attentional focus using text analysis is termed as “word as attention” (Boyd & Schwartz, 2021, p. 25). This word as attention approach examines word frequencies to estimate individuals’ attention to different domains (e.g., examining frequencies of emotional words to estimate writers’ attention to different emotional dimensions). Despite some drawbacks in such an approach (e.g., explorative, descriptive nature), word as attention approach allows extension of language analysis to different research domains, such as politics, personality, and in this study, intervention/educational studies (see Boyd & Schwartz, 2021 for a review of the approach). This study was particularly interested in examining students’ attentional focus to affectively-connotative concepts (measured using the lexicons tied to the five emotional dimensions defined above) and how this emotion-attentional focus correlated with the intervention effect.

One of the most common methods in “word as attention”, automated text analysis is the user-defined dictionary (UDD) method (E. E. Chen & Wojcik, 2016). In the UDD approach, dictionaries (or lexicons) are developed based on existing common dictionaries and prior research (e.g., Pennebaker, Boyd, Jordan, & Blackburn, 2015). Among all UDD methods, the most common tool used by psychological researchers is Linguistic Inquiry Word Count (LIWC; Tausczik & Pennebaker, 2010) which helps users identify the frequency of words that reflect different psychological categories. LIWC is considered a valid tool to look up psychological aspects of texts (Jones, Wojcik, Sweeting, & Silver, 2016). It is also an easy-to-use tool for psychologists as it does not require users to have any computer programing knowledge. An alternative to word-level text analysis like LIWC is a sentence-level text analysis. While methods like LIWC break down unstructured text by identifying frequencies of words, sentence-level text analysis breaks down unstructured text into sentences (by identifying punctuations). This study did not aim at replacing the word-level method with a sentence-level method or demonstrating whether sentence-level is superior to word-level analysis. This study hoped to provide an alternative to existing text analysis for research that needs to consider words before and after the target words through the use of a convenient R package.

Lexicon for Emotion Words

In UDD text analysis, the key is to select a lexicon that is appropriate to the study context. Table 1 summarized the lexicons that are used in this study. Most text analysis of emotion words focuses on valence alone; therefore, there are a number of valence lexicons available, such as AFINN (Nielsen, 2011), bing (Hu & Liu, 2004), and NRC (Mohammad & Turney, 2013). LIWC also contains a lexicon for positive and negative emotion (Pennebaker et al., 2015). These dictionaries used different methods to score the valence of words. Recently, computer scientists noticed the limitation of independent lexicons and started to develop a lexicon that combines these three common dictionaries (e.g., Jockers, 2015). This study employed one of the most recent developments, hash_sentiment_jockers_rinker lexicon (Rinker, 2018b) that combines Jockers’ (2015) wrapper for the three common lexicons, as well as the SentiWord lexicon (Baccianella, Esuli, & Sebastiani, 2010) and Rinker’s augmented version of the bing (Hu & Liu, 2004) lexicon. The final improved version of this valence lexicon consisted of 11,710 English words and phrases that are scored according to the polarity of valence from −1 (negative) to +1 (positive).

For emotion arousal and dominance/control, there are two main options. One was developed and expanded from the Affective Norms for English Words (ANEW) VAD lexicon by asking a large Amazon Turk sample to rate a subset of the words (Warriner, Kuperman, & Brysbaert, 2013). The scores of each word in the lexicon come from the mean ratings of the Amazon Turk (MTurk) sample. Therefore, researchers can extract the average ratings from their sample and use them as scores for calculating valence, arousal, and dominance. The final updated lexicon contains 14,000 words. Another (more recent) lexicon is the NRC VAD lexicon, containing not only the aforementioned valence lexicon but also a lexicon for arousal and dominance (Mohammad, 2018). The NRC VAD lexicon includes words from the NRC Emotion lexicon (valence, see above), ANEW, the updated ANEW by Warriner et al. (2013), and a few other sets, totaling 20,007 words. The scores in NRC VAD were generated in a similar way as Warriner et al. (2013); however, NRC used a wider sample of native-English speakers on a crowdsourcing platform, CrowdFlower, instead of MTurk. Also, instead of asking participants to rate each term, they gave participants a set of four randomly selected terms (4-tuple) and asked them to choose the best and the worst level of valence (V), arousal (A), and dominance per 4-tuple (D; i.e., best-worst sets/BWS). Participants were allowed to do as many or as few 4-tuples as they wanted. Using the total of 778,085 BWS obtained, V/A/D scores were computed by counting how many times a word was nominated as the best minus how many times the word was nominated as the worst and then transforming into a score between 0 (lowest V/A/D) and 1 (highest V/A/D). The NRC VAD showed good split-half reliability and generally lower correlations among the VAD dimensions (i.e. discriminant validity) than Warriner et al. (2013) lexicon. In this study, the NRC arousal and dominance lexicon were used.

In addition to these developed lexicons on arousal, one may also use existing studies in affective science and put together an arousal or dominance lexicon from basic emotion. For example, studies show that joy, anger, fear, surprise, and anxiety reflect high arousal emotions, whereas sadness and disgust are low arousal emotions (e.g., Fontaine et al., 2007; Goudbeek & Scherer, 2010). Similarly, dominance is represented by joy, anger, anticipation, whereas submissive (low dominance) is represented by sadness, fear, and anxiety (Fontaine et al., 2007; Goudbeek & Scherer, 2010). However, because there are already reliable and valid arousal and dominance lexicons available, existing validated lexicons that include more words (i.e., NRC VAD) are preferred.

Unlike VAD, other emotional dimensions have received less attention in lexicon development. Therefore, putting together a lexicon using a basic emotion approach is probably the only option (another option is to develop a new lexicon, which is beyond the scope of this study). The most popular basic emotion lexicon is the NRC emotion lexicon (Mohammad & Turney, 2010). It consists of eight emotion types, including anger, fear, anticipation, trust, surprise, sadness, joy, and disgust (Mohammad & Turney, 2010, 2013). Other lexicons, such as the aforementioned AFINN and bing only detect valence, but not discrete emotion types. For uncertainty, a review of the literature revealed that uncertainty is represented by anticipation, surprise, and fear, and certainty associated with anger, disgust, and joy (Fontaine et al., 2007; Smith & Ellsworth, 1985; Tiedens & Linton, 2001). For approach-avoidance, the emotions of joy and anger were suggested to be approach motivation, and fear and disgust were suggested to be avoidant motivation (Elliot et al., 2013; Harmon-Jones et al., 2017; Marsh et al., 2005; Russell & Carroll, 1999).

Valence shifters, Double Negative, and More

In sentence-level text analysis for emotion, it is important to identify sentence structure that may alter the meaning of an emotional word. For example, valence shifters, words that change or enhance the direction of a sentence, are key information that tells researchers the actual meaning of the text and thus need to be considered by a text analysis algorithm (Polanyi & Zaenen, 2006). For example, the word “never” in “I am never satisfied” changes the valence of the sentence from positive to negative). Other than negation (e.g., “never” and “not”), valence shifters can also be intensifier/amplifier (e.g., “really” and “very”), downtoner/de-amplifier (e.g., “barely” and “rarely”), or adversative conjunction (e.g., “but” and “however”). Therefore, in the text analysis algorithm, researchers may include a lexicon for these valence shifters (e.g., hash_valence_shifters; Rinker, 2018) in the final computation of emotional dimension scores.

Adding to the complexity of valence shifters is the issue of the location of valence shifters, the effect of punctuations, and the number of valence shifters. Regarding the location of valence shifters, some valence shifters may appear right before the word of interests (“I am not happy”), while others may precede by more than one position (e.g. “I do not feel good”, “not that I like her…”). Valence shifters may also appear after the word of interest (e.g., “I fear no man.”). In addition, punctuation may change the role of valence shifters in a sentence. For example, for the sentence “I am happy, not sad”, the valence shifters “not” should be tied to “sad”, rather than “happy” which is located before the comma. Also, a sentence may have more than one valence shifters. Specifically, double negative in a sentence means positive valence (e.g., “I did not say I did not like her”, with two negators near the targeted polarized word “like”). These issues can be categorized as the n-gram problem (Jurafsky & Martin, 2019). The “n” in n-gram refers to the number of words to be considered in each instance of text analysis. To analyze n-gram, researchers first have to tokenize the text. For example, bigram tokenization breaks down texts into two-word pairs for analysis (e.g., “I am”, “am not”, “not happy”). Then, using a valence shifter lexicon, valence shifters (negator, amplifier, de-amplifier, or adversative conjunction) can be detected and the score of the valence reflected in the same sentence can be adjusted. The score should be adjusted based on the combinations of valence shifters in a sentence, as well as the position of punctuation (Open Data Science, 2018; Raja, 2017). Finally, computer science research has suggested that English-language text analysis should consider at least trigrams (e.g., two words before and one word after the target emotional word, or three words before and no word after the target word) (Yu, Shang, Hsu, Castellanos, & Han, 2016). Therefore, an improved algorithm should be able to 1) search for valence shifters surrounding the target word of interests (e.g., at least three words surrounding the target emotional word), 2) consider more than one valence shifters (e.g., double-negative), and 3) consider the effect of punctuation in valence shifters.

Sentimentr

This study introduced an innovative text analysis that has a great potential for psychological research, sentimentr (Rinker, 2019). Sentimentr was originally developed to examine valence; however, the developer has recently added a wrapper to analyze basic emotion (via the function emotion_by, see below). With some modifications, this study further applied sentimentr in analyzing other emotion dimensions. In reviewing various recent developments in text analysis in R (syuzhet, Rsentiment, sentimentr, and SentimentAnalysis), Naldi (2019) suggested that although all packages contain a good lexicon and the flexibility to update lexicons, only sentimentr accurately considered valence shifters which is a very critical issue in text analysis (see the section above on valence shifters). Sentimentr also addresses the valence shifters problems discussed earlier by 1) examining six words surrounding the target emotional word (four before and two after), 2) considering multiple valence shifters (e.g., double negative), and 3) considering the effect of punctuation in valence shifters (i.e., using punctuation to separate the effect of valence shifters).

Although there is a steep learning curve in learning sentimentr (just as other programming-based analyses), sentimentr contains a very powerful algorithm that has already taken into account the language issues mentioned above. Therefore, users do not have to write detailed source code. Its default setting can also be easily changed to update lexicons. In addition, sentimentr uses the R-programming language, which is already popular in psychology. Therefore, researchers do not need to learn another programming language to use sentimentr.

Sentimentr has mostly been applied by computer scientists in studying big data, such as consumers’ online reviews (Kawate & Patil, 2017), emotion that YouTubers convey (Kleinberg, Mozes, & van der Vegt, 2018), and online polls (Kassraie, Modirshanechi, & Aghajan, 2017). Although sentimentr has not been used in mainstream psychology, it has been applied to many areas of research involving the study of emotion, such as public opinions in business and management (Ikoro, Sharmina, Malik, & Batista-Navarro, 2018; Sinha, Choudhury, & Agrawal, 2014), political science (Sanders, 2018) and even predicting stock trend from sentiments reflected in the market and social media (J. Chen et al., 2015; Samuel & Kretinin, 2018).

In sentimentr, the valence score is computed by first considering the polarity of words and by default, four words preceding and two words after the polarity word (i.e. words that contain valence or polar cluster cijl; [1.1]). Then, valence shifters, punctuations, and their positions in the cluster are considered, resulting in the sentence-level valence scores (sij; [1.2]).

cijl=((1+wamp+wdeamp)(wijkp)(1)2+wneg) [1.1]
valencesij=cijlnwi,jn [1.2]

where wijkp is the polarity score of a polar word (i.e. words that contain valence) within the kth word in the sentence j within paragraph i (sij). The average sentence-level sentiment is computed by taking into account the weighted number of the amplifiers (wamp) and de-amplifiers (wdeamp), multiplied by the polarity scores of the words and the functions (odd vs. even) of the negators (wneg), by the square root of the number of words in a sentence (nw). For details of the algorithm and its source code, see https://github.com/trinker/sentimentr.

Similarly, arousal (arousalpi) and control/dominance (controlpi) are computed by detecting arousal and control emotion words and assigning the arousal scores (wijka) and control/dominance (wijkc) scores with the NRC VAD lexicon (formulas 2 and 3 respectively). The valence shifters used in the analyses of arousal and control emotion are the same as the ones used in the analysis of valence (i.e., wamp, wdeamp, wneg).

arousalsij=((1+wamp+wdeamp)(wijka)(1)2+wneg)nwi,jn [2]
controlsij=((1+wamp+wdeamp)(wijkd)(1)2+wneg)nwi,jn [3]

For the two dimensions that employ basic emotion lexicons, uncertainty-certainty and approach-avoidant, words relating to the emotion at each end of the dimension are detected and counted (e.g., anger is detected for certainty, disgust is detected for avoidant, see Table 1). The emotion_by function in sentimentr (Rinker, 2019) allows the easy computation of the rate of occurrences of non-negated emotional words within a sentence. This approach first screens out any negated emotional words and counts only those words that do not contain negation within a cluster (i.e. in this study, 4 words preceding and 2 words following the emotional words). For example, to compute the rate of uncertainty in a sentence, emotion_by counts the number of non-negated approach words (nuncertaintyj+), divided by the number of words in a sentence (nw) such that a score of 0 represents no occurrences of non-negated emotional words (i.e. approach emotion) in a sentence, and 1 represents that all words in the sentence contain non-negated emotional words. A single score for each emotional dimension (uncertainty – certainty sij and approach – avoidant sij) is obtained by subtraction:

uncertaintycertaintysij=nuncertaintyj+ncertaintyj+nwj [4]
approachavoidantsij=naproachj+navoidantj+nwj [5]

Current Study and Research Questions

This study used the National Study of Learning Mindsets (NSLM) data. NSLM is the largest-ever randomized controlled trial of an online growth mindset intervention program in the U.S. in k-12 settings (Yeager et al., 2019). Using the text analysis equations above, scores for each emotional dimension were computed based on students’ writings during the learning intervention study. This secondary registered report used students’ writing in response to three writing prompts as a proxy measure for the emotional state of the mind or the emotion-attentional focus (Boyd & Schwartz, 2021; Tausczik & Pennebaker, 2010) of students during the study. In other words, using text analysis strategies, this study examined students’ scope of attention while responding to each writing prompt during three different stages of interventions. This study was particularly interested in the affectively connotative concepts (measured along five emotional dimensions using established lexicons) being attended during each writing prompt. The primary goal of this study was to investigate whether students’ emotional attentional foci are associated with the intervention effect, moderated by school achievement levels. The five specific research questions were listed in Table 1. Since in the original study, the intervention effect was found to be the largest in low and medium achieving schools rather than high achieving school (Yeager et al., 2019), the variable of school achievement level was added to the hypothesis testing as a moderator to examine whether school achievement levels moderated the relations between emotional dimensions and intervention outcome. To enhance the reproducibility of the method used in this paper, all the steps of text analysis for emotion were explained in the method section. R programming codes used in this study can be found on https://osf.io/xtdc3.

Method

All procedures and analytic methods were registered during Stage 1 of the Registered Report. A 10% random sample from the complete dataset was provided by the data gatekeeper of the NSLM dataset to validate the automated text analysis method, evaluate the feasibility of the proposed methods, and establish proof of concept. Both the pilot study and the main study (i.e., analyses of the 90% remaining sample) followed the same set of procedures and measures described in this section. All measures, manipulations, and exclusions in the study are disclosed, as well as the method of determining the final sample size (with particular reference to whether the collection was continued after data analysis).

Sample

Data Source and Original Study Design

The data used in this study comes from the NSLM dataset (doi:10.3886/ICPSR37353.v1). The testing of the effectiveness of the Mindset intervention was pre-registered and later showed that the intervention program was effective in increasing students’ GPA and growth mindset (Yeager et al., 2019). The NSLM data were collected from 9th graders during the 2015-2016 academic year using a probability sampling strategy (i.e., stratified random sampling) stratifying school achievement levels (low, medium, high) and minority composition (i.e., percentage of Black/African-American or Hispanic/Latino/Latina students). Based on the stratification plan, 139 schools were randomly selected without replacement among approximately 12,000 public high schools in the United States. Among them, 76 schools participated in the study, but only 65 schools provided students’ records (e.g., GPA). Two schools requested to remove their data for public use; therefore, a total of 63 schools were included in the final sample. The final dataset contained 21,364 participants. Despite that some schools did not respond, Tipton generalizability index for comparing experimental samples and populations (.98) showed that the representativeness of the sample was not compromised (Tipton, 2014; Yeager et al., 2019).

During the study, participating 9th graders were randomly assigned to either the intervention condition or the control condition through a computer algorithm provided by a third-party technology vendor. The intervention introduced ideas and examples of the growth mindset and the application of the growth mindset to the purpose of life. Two intervention sessions were administered 21 to 27 days apart. In the first session, students in the intervention condition “read an article on scientific evidence for neural plasticity [and] complete writing exercises to internalize article’s lessons]; in the second session, students in the intervention “learn more about how students/celebrities have put a growth mindset into practice [and] complete writing exercises” (Yeager et al., 2019, p. s10). The control condition was similar to the intervention condition, except that participants read ideas and examples of the brain functions. In addition to the intervention or control materials, students also filled out a survey consisting of various measures on mindset, and social and academic variables. (For detailed methodology, see Dweck & Yeager, 2019; Yeager et al., 2019, 2016).

Pilot Sample

The pilot sample was a random sample of 10% data (N = 2,269) provided by the data gatekeeper. Since only students who were in the intervention group (i.e., growth mindset intervention) were asked to write their reflection of the intervention, only students in the intervention group were included in the current study (N = 834). Missing data were excluded from the analysis. Students whose pre-intervention GPA (n = 77), post-intervention GPA (n = 19) or both pre- and post-intervention GPA (n = 173) was not available were excluded. Fifty-two students did not write any reflection (among them 33 also had missing GPA data). Generalized linear model results showed that students who did not write any reflection were not different from those who wrote reflection by pre-intervention GPA (B = −.19, S.E. = .17, t = −1.11, p = .267), post-intervention GPA (B = −.23, S.E. = .15, t = −1.50, p = .133), or sex (B = .18, S.E. = .33, t = .54, p = .590). Similarly, students missing pre-intervention GPA were not different from those without missing GPA by sex (B = .29, S.E. = .27, z = 1.09, p = .28), or post-intervention GPA (B = −.14, S.E. = .13, z = −1.08, p = .279). Students missing post-intervention GPA were not different by sex (B = −.31, S.E. = .38, z = −.82, p = .414) or pre-intervention GPA (B = −.36, S.E. = .23, z = −1.57, p = .116).

The final sample of the pilot data consisted of 550 participants from 61 schools across the United States, and a total of 58,826 words in 3,680 sentences for analysis across three writing prompts. About half of the participants were female (nfemale = 234, 48.1%; nmale = 252, 51.9%). Sensitivity power analysis using G*Power 3.1.9.2 was conducted on the pilot sample using an alpha significance criterion of .05 (two-tailed), a power criterion of .90, N = 550 to test the minimum effect size for t-tests for differences between two dependent means. Results showed that the minimum effect size (Cohen’s d) is .14 or, converting to partial R used in this study (Ruscio, 2008), the minimum effect size is partial R = .07. Results of the pilot sample were included in the supplementary document (sections 3.1 - 3.5).

Main Study Sample

The same exclusion criteria for the pilot data were used in screening the main study sample. Participants were excluded if 1) they were in the control condition (i.e. did not write reflections of the intervention), and if 2) any of their written reflection, or pre-test and post-test GPAs were missing in the data. The main study sample was the remaining data, consisting of a total of 6,886 student participants in the treatment condition (who were not included in the pilot sample). Missing data excluded from the study included students whose pre-intervention GPA (n = 573), post-intervention GPA (n = 194) or both pre- and post-intervention GPA (n = 1,681) were missing. A total of 114 students did not write any reflection (among them 43 also had missing GPA data). Generalized linear model results showed that students who did not write any reflection had significantly lower pre-intervention GPA (B = −.81, S.E. = .11, z = −7.33, p < .001), and post-intervention GPA (B = −.75, S.E. = .11, z = −6.88, p < .001) than had students who wrote reflection. Male students (n = 51) were more likely than female students (n = 25) to have missing data on the writing reflection prompts (B = .72, S.E. = .25, z = 2.92, p = .004). For missing GPA, sex was not predictive of missing pre-intervention GPA (B = −.07, S.E. = .10, z = −.77, p = .444), or post-intervention GPA (B = .18, S.E. = .13, z = 1.42, p = .155). Students with missing data on pre-intervention GPA tended to have a lower post-intervention GPA (B = −.13, S.E. = .05, t = −2.65, p = .008), and vice versa, i.e., students with missing data on post-intervention GPA tended to have a lower pre-intervention GPA (B = −.58, S.E. = .07, t = −7.98, p < .001).

The final sample of the pilot data consisted of 4,372 participants from 63 schools across the United States, and a total of 481,624 words in 20,624 sentences for analysis across three writing prompts. About half of the participants were female (nfemale = 1,851, 50%; nmale = 1837, 50%). Sensitivity power analysis using G*Power 3.1.9.2 was conducted for the main sample using an alpha significance criterion of .05 (two-tailed), a power criterion of .90, N = 4,372 to test the minimum effect size for t-tests for differences between two dependent means. Results showed that the minimum effect size (Cohen’s d) is .05 or converting to partial R used in this study (Ruscio, 2008), the minimum effect size was partial R = .03.

Measures

Writing Prompts

Students were asked to write their thoughts at three different time points: At the beginning of the first intervention session (before main intervention materials are presented), at the end of the first session, and during the second intervention session. The two intervention sections were 21-27 days apart (Yeager et al., 2019). The writing prompts were used to check students’ engagement with the growth mindset intervention in the original study through counting the number of words students wrote (for detail, see supplementary information of the original study Yeager et al., 2019). The original study did not examine the contents of students’ writing; however, the secondary analysis of the writing prompts to analyze students’ emotion in this study context was approved by NSLM. The detailed instructions of the three writing prompts were included in the supplementary file. In the main study data, there were a total of 481,624 words spread across WP1 (142,957), WP2 (216,827), and WP3 (121,840) words. Among them, there was a total of 10,691 unique tokens. At sentence level, there were a total of 7,602 (WP1), 13,679 (WP2), and 8,188 (WP3) sentences.

Scores of Emotional Dimensions

All writing prompts are analyzed in the same way; specifically, texts are cleaned and for each emotional dimension, a score is computed for each sentence in each writing prompt using the formula listed in the introduction. Because of the concerns of the violation of independence in text analysis (Winter, 2020), instead of averaging or summing sentence scores to generate participants’ scores, raw sentence scores will be used in computing the predictive models in linear mixed models (see data analysis plan). All scores are on a continuous scale. Scores computed by sentiment_by range from −1 to 1, and scores computed by emotion_by range from 0 to 1. To maintain consistency in the report of the emotional dimensions, all scores computed by sentiment_by were rescaled to 0 to 1 using rescale in package scales without affecting their distribution.

Learning outcome – GPA

The learning outcome used in this study was the same as the original NSLM intervention study (Yeager et al., 2019). Grade-point averages (GPAs) were the average scores of the core courses GPA (i.e. mathematics, English/language arts, science, and social studies) and were administrative data reported by the schools. The GPA was calculated by an external company, MDRC (Zhu, Garcia, Boxer, Wadhera, & Alonzo, 2019). Raw score GPA was ranged from 1.00 to 4.30 (pre-intervention GPA: M = 2.79, SD = .98; post-intervention GPA: M = 2.59, SD = 1.07). GPA was collected before the intervention and the year after the intervention. Specifically, pre-intervention GPA was the 8th grade GPA if the intervention was conducted in semester 1 (fall), or 9th grade semester 1 if the intervention was conducted in semester 2 (spring; i.e., most recent GPA at the time of the intervention/writing prompts). Post-intervention GPA was the 9th grade fall and spring GPA if the intervention was conducted in the fall, or 9th grade spring for students who took the intervention in the spring (i.e., end of the school year but after the intervention/writing prompts). To account for school variations in school grading norm, all GPA used in the analysis was centered by school means and standard deviations in the overall sample. The GPA used in this study and the calculation was the same as the intervention effect computed in Yeager et al. (2019).

School Achievement Level

School achievement level was computed by the investigators of the original NSLM dataset (see Yeager et al., 2019). Specifically, based on composite PSAT scores, state standardized tests, and AP participation and scores, each school is assigned a category of low achievement (i.e. below 25th percentile), medium achievement (i.e. 25th to 75th percentile), and high achievement (i.e. above 75th percentile). Since the NSLM intervention treatment effect was found to be moderated by school achievement level (Yeager et al., 2019), in this study, school achievement level was added as a covariate and the interaction term school achievement level x valence x survey phase is added. In the main study, 678 students (16%) coming from 12 different schools belonged to the low achievement category, while 2,509 students (57%) coming from 36 schools and 1,185 students (27%) coming from 15 different schools belonged to the middle and high achievement category respectively.

Survey Weights

Weights were provided by the NSLM data manager and were previously computed by the survey firm and were used in previous analyses of the intervention effects (Yeager et al., 2019).

Data Treatment Procedure

Text Cleaning

The first step was to prepare the text for accurate analysis. First, all texts (stored in the variable x) were converted to lower case using the sapply(x, tolower) function in base r. Then, using the R package textclean (Rinker, 2018a), common symbols were replaced with text (e.g., “@” are replaced by “at”, “%” are replaced by percent and “#” are replaced by “number”, and “1st” to “first”). Contractions were expanded (e.g., from “don’t” to “do not”). Any remaining unconverted special characters were removed from the texts. Only punctuations were kept in the texts, as punctuation is crucial in breaking sentences for sentence-level sentiment analysis. Specifically, sentimentr considers pause locations (commas, colons, and semicolons specifically) and uses pause locations to calculate the boundary of a word cluster surrounding a polarized word. Punctuations that are used to end a sentence (period, exclamation marks, and question marks) are used to identify sentences within paragraphs. Typos and non-interpretable words were screened using the package hunspell (Ooms, 2018). The hunspell package in R (Ooms, 2018) enables researchers to input their texts (in a data frame) and screen the texts using the default dictionary. Any words that do not exist in the dictionary are stored in a list object. These included names (e.g., places, specific terms) and misspelled words/random letters (names of the participants were removed by the NSLM data manager, but any remaining names in the text were identified with hunspell). A total of 3,840 counts of non-interpretable words (typos and names) were identified out of the total 481,624 words across the three WPs (i.e., 0.7%). When counting only unique tokens, the total unique non-interpretable words were 3,218 out of a total of 10,691 unique tokens (i.e., 30%). The package mgsub (Ewing, 2019) was then used to remove names and replace any misspelled words.

Modalities

Modalities or modal verbs may change the meaning of the sentence, e.g., “it could be better” is certainly not as positive as “it is better”. Therefore, modal verbs are added to the valence shifters used in sentimentr when computing valence scores. There is no clear agreement on the treatment of modalities (Gelbukh, 2015; Liu, Yu, Liu, & Chen, 2014; Schulder, Wiegand, & Ruppenhofer, 2020). Therefore, only a limited amount of modal verbs that have empirical findings to support are added to the valence shifters based on recent literature (e.g., Jang & Kim, 2017). Specifically, “can/could/would/may/might” are added as de-amplifier, and “can/could/would/may/might not” remained as negator in the valence shifters list in sentimentr.

Data Analysis Plan

Both the pilot analysis (Stage 1) and the main study data analysis (Stage 2) followed the same data analysis plan. Results of the pilot analysis were included in the supplementary document (sections 3.1 - 3.5).

Descriptive statistics

Descriptive statistics (mean and standard deviation) for all emotional dimension scores were reported. Then, frequencies of top emotional words for each writing prompt were computed. Since approach-avoidance and certainty-uncertainty scores were generated based on basic emotion, top word frequencies for basic emotion were reported.

Analysis plan for Research Questions 1 – 5 (i.e., effects of each emotional dimension on the intervention effect)

Then, predictive modeling of each of the research questions/hypotheses listed in Table 1 was tested using separate linear mixed models estimating the fixed effect of the emotional dimension and accounting for by-subject variability:

y=β1x1+β2x2+β3x3+β4x1x2+β5x2x3+β6x1x3+β7x1x2x3+β8x4+b1z1+ε [6]

where y is the intervention effect (GPA), β1 and x1 are the fixed-effect coefficient and regressor of the emotional dimension (valence, arousal, etc.), β2 and x2 are the fixed-effect coefficient and regressor of the survey phase (pre vs. post-intervention), β3 and x3 are the fixed-effect coefficient and regressor of school achievement level, b1 and z1 are the random-effect coefficient and regressor for individual participants, and ε is the error term. In addition, because students were prompted by the writing prompt when writing their responses, it is possible that the words mentioned in the writing prompt may have primed students to use those terms more frequently. Therefore, their emotional dimension scores (attentional focus) may be affected. To control for the effect of the writing prompt, emotional scores of students’ usage of writing prompt words were controlled (β8x4). Finally, survey weights (see measures) were used when computing the model. The results were computed using the package lme4 (Bates, Mächler, Bolker, & Walker, 2015) and emmeans (Lenth, 2019).

To determine whether each emotional dimension was related to the intervention effect, the coefficients of the interaction terms involving survey phase and valence and t-values were computed (i.e., mainly the interaction terms emotional dimension x survey phase and emotional dimension x survey phase x school achievement level). A 95% confidence interval was used. Effect sizes for fixed effects were calculated using semi-partial R, a recommended method for computing effect sizes for generalized linear mixed models (Jaeger, Edwards, Das, & Sen, 2017). The R package, r2glmm were used to calculate all effect sizes (Jaeger, 2017). Finally, sensitivity analyses were conducted if the coefficients of the interaction terms x1x2 or x1x2x3 was significant. In this case, the package emmeans (Lenth, 2019) was used to see if each school's achievement levels (low, medium, and high) were significantly different in the relations between the emotional dimension and the intervention effects (i.e., Pre-post GPA).

Outlier Treatment Plan

After the linear mixed model is computed, outliers of the model were detected following the suggestions of Nieuwenhuis, te Grotenhuis, and Pelzer (2012), and tests were conducted to see if the model without outliers is significantly different from the model with outliers. Specifically, the R package is specifically designed for mixed model outlier detection, influence.ME (Nieuwenhuis et al., 2012) was used to calculate Cook’s Distance for each of the testing variables. Cook’s distance greater than 4/n, where n is the number of grouping, was considered to be influential outliers (Van der Meer, Te Grotenhuis, & Pelzer, 2010). If outliers are present, sigtest function is used to test whether the removal of outliers results in significant changes in the significance levels of the testing variables in the model (i.e. whether the absence of outliers yields a significantly different result). A cut-off t-value of -1.96 was used (Nieuwenhuis et al., 2012). If the model without outliers was found to be significantly different from the model with outliers, the model without outliers will be recalculated and the findings would be presented. In the main study sample, a conservative Cook’s distance was used such that any Cook’s distance greater than (4/7602) was considered influential outliers (i.e., n = 7,602, the smallest sample size across all three writing prompts, resulting in the highest Cook’s distance cut-off).

Results

Descriptive statistics

Descriptive statistics (mean and standard deviation) for all emotional dimension scores were reported in Table 2. Frequencies of top emotional words for each writing prompt were presented in the supplementary document. Specifically, most frequent words for valence, arousal, and dominance/control were reported separately (supplementary document section 4.1). Since approach-avoidance and certainty-uncertainty scores were generated based on basic emotion, top word frequencies for basic emotion were reported (supplementary document section 4.2).

Table 2.

Means and standard deviations of emotional dimension scores obtained through text analyses. Results are organized by sentences across the three writing prompts (WPs).

Emotional dimensions WP1 WP2 WP3
Sentence n = 7,602 Sentence n = 13,679 Sentence n = 8,188
By sentences M (SD) M (SD) M (SD)
Valence .46 (.09) .40 (.08) .52 (.10)
Arousal .65 (.09) .49 (.08) .56 (.10)
Control/Dominance .53 (.11) .42 (.08) .34 (.10)
Uncertainty-Certainty .52 (.08) .35 (.06) .51 (.07)
Approach-Avoidance .43 (.08) .41 (.06) .42 (.07)

Note: For easier understanding and comparison across variables, the variables presented in this table are scaled to a (0, 1) scale.

Linear Mixed Model Analyses for Research Questions 1-5

A summary of the results for the main study were presented as a summary in Table 3. Tables presenting detailed estimates and effect sizes for individual tests were presented in the supplementary file (Section 4.4). Again, the estimates relevant to our research questions are mainly the interaction term emotional dimension x survey (x1x2) phase and the three-way interaction of emotional dimension x survey phase x school achievement level (x1x2x3). For Research Question 1 (valence), only WP2 showed significant effect. Particularly, sensitivity analyses showed that contrary to the hypothesis, for WP2, valence negatively predicted the intervention effect in low school achievement (b = −.48, S.E. = .15, z = −3.16, p = .002). However, consistent with the hypothesis, valence positively predicted the intervention effect in middle school achievement (b = .16, S.E. = .06, z = −2.61, p = .009). For students in high school achievement group, valence did not predict intervention effect (b = .00, S.E. = .10, z = −0.03, p = .977).

Table 3.

Summary of linear mixed model fixed effect results of the effect of emotional dimensions on students’ pre- and post-intervention GPA. Only major predictors relating to the research questions are shown here. Detailed results for each of the model can be found in the supplementary file.

VAL ARS CRT UNC ApAv
Writing prompt 1 B SE B SE B SE B SE B SE
Main effect of emotional dimension (x1) −0.06 0.19 0.04 0.22 0.14 0.15 0.11 0.18 −0.25 0.22
Interaction effect of the emotional dimension and survey phase (Post-intervention) (x1x2) 0.23 0.20 0.07 0.23 −0.07 0.16 −0.18 0.19 0.33 0.23
Interaction effect of the emotional dimension and survey phase (Post-intervention) with school achievement level (x1x2x3)
  School Achievement Level: Medium −0.19 0.22 −0.06 0.25 0.08 0.17 0.28 0.21 −0.56* 0.25
  School Achievement Level: High 0.26 0.25 0.09 0.28 0.23 0.20 0.20 0.25 −0.15 0.30
Writing prompt 2
Main effect of emotional dimension (x1) 0.23# 0.12 0.38 0.12 0.26* 0.13 0.30# 0.18 −0.03 0.17
Interaction effect of the emotional dimension and survey phase (Post-intervention) (x1x2) −0.48** 0.15 −0.70*** 0.14 −0.48** 0.16 −0.52* 0.21 −0.02 0.21
Interaction effect of the emotional dimension and survey phase (Post-intervention) with school achievement level (x1x2x3)
  School Achievement Level: Medium 0.64*** 0.16 0.68*** 0.15 0.53** 0.17 0.54* 0.23 0.09 0.23
  School Achievement Level: High 0.48** 0.18 0.64*** 0.18 0.48* 0.19 0.74** 0.27 −0.17 0.26
Writing prompt 3
Main effect of emotional dimension (x1) −0.18 0.16 0.10 0.17 0.30 0.19 0.16 0.20 −0.70** 0.24
Interaction effect of the emotional dimension and survey phase (Post-intervention) (x1x2) 0.38 0.17 −0.14 0.18 −0.24 0.19 −0.41 0.23 1.67*** 0.27
Interaction effect of the emotional dimension and survey phase (Post-intervention) with school achievement level (x1x2x3)
  School Achievement Level: Medium −0.17 0.19 −0.02 0.20 0.25 0.21 0.28 0.25 −1.57*** 0.29
  School Achievement Level: High −0.10 0.22 0.31 0.23 0.41 0.24 0.53# 0.29 −1.75*** 0.33

Note: VAL = emotional valence, ARS = arousal, CRT = control, ApAv = approach-avoidant, UNC = uncertainty-certainty

#

p < .10

*

p < .05

**

p < .01

***

p < .001

For arousal (Research Question 2) and control/dominance (Research Question 3), again, significant findings were only observed in WP2. Both arousal (b = −.26, S.E. = .06, z = −4.16, p < .001) and control (b = −.14, S.E. = .07, z = −2.14, p = .033) negatively predicted the intervention effect regardless of school achievement levels. When breaking down into the three school achievement levels, arousal negatively predicted the intervention effect in low school achievement level (b = −.70, S.E. = .14, z = −4.96, p < .001), but did not predict the intervention effect in middle (b = −.02, S.E. = .06, z = −0.35, p = .724) or high school achievement levels (b = −.06, S.E. = .11, z = −0.58, p = .559). For control, contrary to the hypothesis, control negatively predicted the intervention effect regardless of school achievement levels (b = −.14, S.E. = .07, z = −2.14, p = .033). The negative significant effect was only found in low school achievement level (b = −.48, S.E. = .16, z = −3.03, p = .002), but not in middle (b = .06, S.E. = .06, z = −0.88, p = .380) or high school achievement level (b = .00, S.E. = .10, z = −0.02, p = .987).

For uncertainty-certainty (Research Question 4), significant results were observed in WP2 only. Specifically, in WP2, uncertainty negatively predicted the intervention effect only in low school achievement level (b = −.52, S.E. = .21, z = −2.43, p = .015), but not in middle (b = .02, S.E. = .09, z = −0.22, p = .823) or high school achievement levels (b = .22, S.E. = .16, z = −1.39, p = .165). It is worth noting that for uncertainty-certainty in WP3, the interaction term was marginally significant (b = 0.53, S.E. = 0.29, t = 1.82, p = .069). A sensitivity analysis was performed to explore the pattern. Results showed that similar to the results in WP2, the predictive relationship between uncertainty-certainty and intervention effect was more negative in low achievement schools (b = −0.41, S.E. = .23, z = −1.81, p = .071) than in middle (b = −0.14, S.E. = .10, z = −1.31, p = .189) or high achievement schools (b = 0.11, S.E. = .18, z = 0.64, p = .524).

For approach-avoidant (Research Question 5), significant results were observed in WP1 and WP3. Sensitivity analyses revealed that as hypothesized, approach-avoidant predicted intervention effect negatively for middle achievement school (b = −0.24, S.E. = .10, z = −2.54, p = .011) such that higher approach scores predicted lower intervention effect. However, for low (b = −.33, S.E. = .23, z = −1.45, p = .148) and high achievement school (b = .19, S.E. = .19, z = 0.99, p = .323), approach-avoidant showed a non-significant positive trend between approach-avoidant and the intervention effect. For WP3, approach-avoidant predicted positive intervention effect only significantly in low school achievement (b = 1.67, S.E. = .27, z = 6.30, p <.001) such that the higher students’ approach scores, the higher the intervention effect. However, approach-avoidant did not predict intervention effect in the middle (b = 0.10, S.E. = .12, z = 0.87, p = .397) or high achievement school (b = 0.08, S.E. = .20, z = 0.42, p = .677).

Finally, across the three writing prompts (WP), WP1 (completed before presenting any intervention materials) yielded almost no significant results. On the contrary, WP2 (completed after the first intervention session) yielded the most significant results.

Exploratory analyses

Since valence and control were found in an opposite direction than hypothesized, for exploratory purposes, interactions among valence, arousal, and control on the intervention effect were examined using the same linear mixed model methods but without considering school achievement level (to avoid doing a 4-way interaction and generating overcomplicated findings). Results showed that valence and arousal interacted to predict the intervention effect (b = −1.71, S.E. = .52, t = −3.28, p = .001). Sensitivity analysis showed that valence positively predicted the intervention outcome only when arousal is low (−1SD) (b = 0.17, S.E. = .07, z = −2.53, p = .011), and did not predict intervention outcome when arousal is high (+1SD) or at the mean level. The interaction between arousal and control were also significant (b = −2.03, S.E. = .52, t = −3.93, p < .001). Particularly, arousal played an antagonistic role, that is, when arousal is low (−1SD), control predicted intervention effects positively (b = .15, S.E. = .06, z = 2.25, p = .024), but when arousal is high (+1SD), control predicted intervention effects negatively (b = −.18, S.E. = .07, z = −2.70, p = .007).

Outliers treatment

The R package influence.ME (Nieuwenhuis et al., 2012) was used to calculate Cook’s Distance for each of the testing variables. Cook’s distance greater than (4/7602) was considered influential outliers (i.e., n = 7,602, the smallest sample size across all three writing prompts, resulting in the highest Cook’s distance cut-off). The number of outliers and significant outliers identified was presented in the supplementary material section 4.5. Significant outliers were identified using the sigtest() function in the influence.ME package (Nieuwenhuis et al., 2012). A datapoint was determined to be influential when removing it resulted in a change in the statistical significance of the tested variables in the linear mixed model (Nieuwenhuis et al., 2012). For the main study sample, despite having some outliers identified, none were significant outliers. The (lack of) impact of the outliers was further confirmed by re-running the linear mixed model with all outliers removed. The results showed similar patterns and the statistical significance of the variables tested in the model remained unchanged. The findings reported in the result session were based on the full sample with no outliers removed.

Discussion

The purpose of this registered secondary analysis was to test the role of five emotional dimensions on the intervention effect of the National Study of the Learning Mindset (NSLM) by analyzing the texts students wrote in three open-ended writing prompts (written across three time points during the study). Specifically, text analyses were used to detect students’ scope of attention or the emotion-attentional focus (Boyd & Schwartz, 2021; Tausczik & Pennebaker, 2010) while responding to each writing prompt during three different stages of interventions. Five research questions were asked, covering five different aspects of emotion. Each research question dealt with whether emotional dimension scores predicted the intervention effect (i.e., pre-post GPA change), moderated by the three school achievement levels.

Results showed that arousal, control, and uncertainty negatively predicted students’ intervention effect, but only in students coming from schools with a low achievement level. On the other hand, high scores on the approach-avoidant spectrum (i.e., high approach scores) positively predicted the intervention effect, but again, only in students coming from a low school achievement level. The results for valence were mixed. Valence positively predicted the intervention effect in middle school achievement but negatively predicted the intervention effect in low school achievement.

It was originally hypothesized that valence, control, and approach-avoidant would positively predict the intervention effect (no confirmatory hypotheses were made for arousal and uncertainty). Only the hypothesis for approach-avoidant was supported. Control, on the other hand, showed opposite results than expected. Control was defined in this study conceptually as a feeling of power and operationally as joy (+), anger (+), anticipation (+), sad (−), fear (−), and anxiety (−) (Fontaine et al., 2007; Goudbeek & Scherer, 2010; Russell & Mehrabian, 1977). It is possible that anger may have contributed to the negative predictive relationship between control and intervention effect. Further, similar to valence, joy might have played a negative role in the intervention effect for students from low achievement school level, contributing to the overall negative relationship. Another contradictory finding was valence. Valence was expected to play a positive role in students; however, this only happened in students in middle school achievement level. Students’ valence negatively predicted the intervention effect in low school achievement level. Therefore, it is possible that valence could play a very different role in academic outcomes depending on students’ backgrounds. However, exploratory analyses showed that valence may interact with arousal to predict the intervention effect. Specifically, positive valence may help students learn the intervention materials, but only when their arousal level is low. This is consistent with recent studies that emphasized the interacting role of valence and arousal (Chung et al., 2015; Schweizer et al., 2019).

As expected, WP1 (pre-intervention prompt) did not generate many significant findings. On the other hand, WP2 (after the first intervention session) generated significant findings in four out of five emotional dimensions. WP2 is also the richest data, containing the highest numbers of sentences and words. Another finding worth noting is that most of the results were found in low achievement schools, meaning that emotional focus of students in low achievement schools predicted the intervention effect the most. On the other hand, no significant findings were found in high-achieving schools at all. This may be because the intervention effect was found to be smallest in high-achieving schools (Yeager et al., 2019). Further, schools with high achievements may already have extensive resources to help students to succeed; therefore, this intervention may not have contributed much to these students’ GPA improvement (Yeager et al., 2019). High achievement schools may also have cultivated positive emotional learning environments and may have taught ideas similar to the growth mindset; therefore, students’ emotional focus at the time of the intervention (i.e., exposure to familiar materials) may not produce an effect. On the contrary, students in low achievement schools might be exposed to the ideas of the growth mindset intervention for the very first time. Thus, their emotional focus when interacting with the intervention materials might have determined whether they internalized the materials and whether the intervention contributed to their academic achievements in the following semesters. Specifically, the results from this study suggested that for students in low achievement schools, having approach emotional focus (i.e., defined as approaching a desirable state) during the learning process predicted better learning outcomes. The lack of control/dominance (i.e., lack of sense of power, feeling submission) and the lack of arousal emotional focus (i.e., low alertness/physiological arousal) during the learning process seemed to predict better (not worse) learning outcomes in students in low achievement schools.

As with any research studies, this study is limited in several ways. This study used existing data. Students’ emotional experiences were not directly measured in the original NSLM. Instead, the open-ended questions (used originally to check compliance) were used to serve as an indicator of students’ emotional experiences and they did not directly ask students to report how they were feeling. This is based on the assumption that emotion can be reflected in one’s language (which is also the foundation of emotional analysis in natural language processing) (Bestgen, 1994; Dyer, 1983). It is possible that students’ writing may jointly reflect (confound with) their level of compliance and engagement during the intervention, language skills, or academic performance. Another assumption is that how students felt during the study (i.e., at any time point when they sat in front of the computer completing the study) would be expressed as they write their responses to the questions. The benefit of this method is that it avoided the self-reporting nature of emotion measurements.

Another limitation is that text analysis in behavioral sciences is work-in-progress (Boyd & Schwartz, 2021), and naturally, there are limitations in the text analysis approach chosen in this study. Delimitation is necessary to clarify the scope of this approach. First, texts are not independent (Winter, 2020). Each sentence represents an idea, and the next sentence depends on the ideas expressed in previous sentences. For example, in WP2, students were asked to imagine a future student that was struggling. They were then asked to use the things they learned in the intervention to encourage them. Therefore, it is possible that students started with negatively valenced statements (reflecting the initial state of anxiety of the imagined 9th grader) and then moved to positively valenced statements showing encouragement. One way to resolve potential dependency between sentences is to apply the linear mixed model approach used in this study to analyze sentences separately and then account for the varying slopes for each participant (Winter, 2020). This avoids averaging texts written by a participant and taking away the unique valence of each sentence. Despite this adjustment, the approach used in this study cannot be used to understand the change of tones from one sentence to another (i.e., examining the frequency of emotional dimensions reflected in texts, not change in emotional degree within a person). Second, the algorithm and text processing approach in this study did not differentiate the use of formal/regular expression versus idiomatic expression, slang, or metaphor. Also, only five modal verbs are considered. Some of these issues can be addressed easily in the text cleaning stage using lexicons for idioms and modal auxiliaries to code the frequency of use, or simply add these words to the emotion lexicons or valence shifters lexicons and provide them with a score for each emotional dimension. However, to implement these strategies, a clear evidence-based mechanism needs to be identified. In this study, only limited number of modal verbs are addressed, and idiomatic expression, slangs, and metaphor are not accounted for by the algorithm. Third, pronunciations in this study are only used to identify sentences and setting boundaries for a word cluster. The use of exclamation marks, question marks, or commas in hypotaxis is not considered in the present algorithm and text processing procedure.

Finally, narratives are communicative and social in nature, containing an experiencer and a target recipient. In this study, students responded to a prompt asking them to think of a time they feel challenged (WP1), write a letter to a future 9th grader based on what they learned in the intervention (WP2), and how they can use their stronger brain to address an issue they care (WP3). Therefore, we could not equate the text as the students’ experiences of emotion in the intervention or learning processes. Instead, the scores computed from the texts merely reflect the attentional focus of the students during that hour of intervention (Tausczik & Pennebaker, 2010). There are merits and pitfalls of this “Words as Attention” approach (Boyd & Schwartz, 2021), yet the “words as attention” approach can still serve as a way for psychologists to understand the description of correlational relationships (e.g., in this study, the relationship between emotional dimensions detected in text and intervention effect). Also, publishing in-progress studies are crucial to the future development of text analysis application in psychology. This study added to existing studies by exploring how the analysis of five different emotional dimensions can be done with a single R package, sentimentr.

Despite the limitations, this study made a step forward to introduce text analysis of emotional dimensions in intervention and experimental studies in social psychology. Potential values of this study include 1) the application of an R-based text analysis method for researchers who want to include open-ended questions in their experimental design with large samples, 2) the exploration of emotional dimensions (not only valence) in text analysis, and 3) results that demonstrate the data pattern of how emotional dimensions (detected through text analysis) may be associated with an experimental study outcome.

Supplementary Material

1

Highlights.

  • Methods and analytic plans are registered.

  • Automated text analysis is used to detect emotion-attentional focus of students in a national intervention.

  • Results showed that five emotional dimensions detected via text analyses predicted the intervention effect, moderated by school achievement levels.

Acknowledgment and Funding Sources

The research reported in this manuscript was supported by the National Study of Learning Mindsets Early Career Fellowship with funding generously provided by the Bezos Family Foundation to the Mindset Scholars Network and the University of Texas at Austin Population Research Center. The University of Texas at Austin receives core support from the National Institute of Child Health and Human Development under the award number 5R24 HD042849.

This manuscript uses data from the National Study of Learning Mindsets (doi:10.3886/ICPSR37353.v1) (PI: D. Yeager; Co-Is: R. Crosnoe, C. Dweck, C. Muller, B. Schneider, & G. Walton), which was made possible through methods and data systems created by the Project for Education Research That Scales (PERTS), data collection carried out by ICF International, meetings hosted by the Mindset Scholars Network at the Center for Advanced Study in the Behavioral Sciences at Stanford University, assistance from C. Hulleman, R. Ferguson, M. Shankar, T. Brock, C. Romero, D. Paunesku, C. Macrander, T. Wilson, E. Konar, M. Weiss, E. Tipton, and A. Duckworth, and funding from the Raikes Foundation, the William T. Grant Foundation, the Spencer Foundation, the Bezos Family Foundation, the Character Lab, the Houston Endowment, the National Institutes of Health under award number R01HD084772-01, the National Science Foundation under grant number 1761179, Angela Duckworth (personal gift), and the President and Dean of Humanities and Social Sciences at Stanford University.

The content is solely the responsibility of the authors and does not necessarily represent the official views of the Bezos Family Foundation, the Mindset Scholars Network, the University of Texas at Austin Population Research Center, the National Institutes of Health, the National Science Foundation, or other funders.

The author would like to thank Prof. Barbara Schneider, Drs. Pratik Mhatre and Alexander Browman, Dr. Shannette Porter, and the Mindset Scholars Network staff for their comments on the early drafts of the manuscript. The author would also like to thank Drs. Pratik Mhatre, Jenny Buontempo, and Robert Reynolds for providing data assistance and Nadia Turki, Trey Delcambre, Wade Johnson, and Annie Vu for their help in testing the reliability of the text analysis.

Footnotes

Conflict of Interest

There is no conflict of interest to be declared.

Ethical Statement

The author confirms that the article submitted has not been published previously except in the form of an electronic preprint. The article submitted is not under consideration for publication elsewhere. Its publication is approved by the author and explicitly by the responsible authorities where the work was carried out. If the article is accepted for publication, it will not be published elsewhere including electronically in the same form, in English or in any other language, without the consent of the copyright-holder.

The data in the paper was not part of a previous submission to JESP.

Reporting Guidelines

The author has followed the instruction to indicate that “all measures, manipulations, and exclusions in the study are disclosed, as well as the method of determining the final sample size (with particular reference to whether collection was continued after data analysis).” (p. 15 of the manuscript)

The author also confirms that effect sizes, exact p-values, means, standard deviations, and inter-variable correlations relevant to the analyses have been reported. Sensitivity power analysis has also been performed for existing data analysis.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Artino AR, & Pekrun R (2014). Using control-value theory to understand achievement emotions in medical education. Academic Medicine, 89(12), 1696. 10.1097/ACM.0000000000000536 [DOI] [PubMed] [Google Scholar]
  2. Baccianella S, Esuli A, & Sebastiani F (2010). SentiWordNet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining. In Proceedings of the International Conference on Language Resources and Evaluation, LREC; (pp. 2200–2204). Pisa, Italy. [Google Scholar]
  3. Ballenghein U, Megalakaki O, & Baccino T (2019). Cognitive engagement in emotional text reading: concurrent recordings of eye movements and head motion. Cognition and Emotion, 33(7), 1448–1460. 10.1080/02699931.2019.1574718 [DOI] [PubMed] [Google Scholar]
  4. Bartels JM, Magun-Jackson S, & Kemp AD (2009). Volitional Regulation and Self-regulated Learning: An Examination of Individual Differences in Approach-Avoidance Achievement Motivation. Electronic Journal of Research in Educational Psychology, 7(2), 605–626. [Google Scholar]
  5. Bates D, Mächler M, Bolker B, & Walker S (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1–48. 10.18637/jss.v067.i01 [DOI] [Google Scholar]
  6. Bestgen Y (1994). Can emotional valence in stories be determined from words? Cognition and Emotion, 8(1), 21–36. 10.1080/02699939408408926 [DOI] [Google Scholar]
  7. Boyd RL, & Schwartz HA (2021). Natural Language Analysis and the Psychology of Verbal Behavior: The Past, Present, and Future States of the Field. Journal of Language and Social Psychology, 40(1), 21–41. 10.1177/0261927X20967028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Cacioppo JT, Gardner WL, & Berntson GG (1999). The affect system has parallel and integrative processing components: Form follows function. Journal of Personality and Social Psychology, 76(5), 839–855. 10.1037/0022-3514.76.5.839 [DOI] [Google Scholar]
  9. Campion N, Martins D, & Wilhelm A (2009). Contradictions and predictions: Two sources of uncertainty that raise the cognitive interest of readers. Discourse Processes, 46(4), 341–368. 10.1080/01638530802629125 [DOI] [Google Scholar]
  10. Carver CS, & Harmon-Jones E (2009). Anger Is an Approach-Related Affect: Evidence and Implications. Psychological Bulletin, 135(2), 183–204. 10.1037/a0013965 [DOI] [PubMed] [Google Scholar]
  11. Chen EE, & Wojcik SP (2016). A practical guide to big data research in psychology. Psychological Methods, 21(4), 458–474. 10.1037/met0000111.supp [DOI] [PubMed] [Google Scholar]
  12. Chen J, Chai A, Goel M, Lieu D, Mohamed F, Nahm D, & Wu B (2015). Predicting stock prices from news articles. In The Undergraduate Statistics Association - Project Committee (pp. 1–5). Berkeley, CA. [Google Scholar]
  13. Chung S, Cheon J, & Lee KW (2015). Emotion and multimedia learning: an investigation of the effects of valence and arousal on different modalities in an instructional animation. Instructional Science, 43(5), 545–559. 10.1007/s11251-015-9352-y [DOI] [Google Scholar]
  14. Croft W, & Cruse A (2004). Cognitive Linguistics. Cambridge University Press. [Google Scholar]
  15. Duffy MC, Lajoie SP, Pekrun R, & Lachapelle K (2018). Emotions in medical education: Examining the validity of the medical emotion scale (MES) across authentic medical learning environments. Learning and Instruction, 1–13. 10.1016/j.learninstruc.2018.07.001 [DOI] [Google Scholar]
  16. Dweck CS, & Yeager DS (2019). Mindsets: A View From Two Eras. Perspectives on Psychological Science (Vol. 14). 10.1177/1745691618804166 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Dyer MG (1983). The role of affect in narratives. Cognitive Science, 7(3), 211–242. 10.1016/S0364-0213(83)80012-3 [DOI] [Google Scholar]
  18. Elliot AJ (2020). Competition and achievement outcomes: A hierarchical motivational analysis. Motivation Science, 6(1), 3–11. 10.1037/mot0000164 [DOI] [Google Scholar]
  19. Elliot AJ, Eder AB, & Harmon-Jones E (2013). Approach-avoidance motivation and emotion: Convergence and divergence. Emotion Review, 5(3), 308–311. 10.1177/1754073913477517 [DOI] [Google Scholar]
  20. Elliot AJ, & Moller AC (2003). Performance-approach goals: Good or bad forms of regulation? International Journal of Educational Research, 39(4–5), 339–356. 10.1016/j.ijer.2004.06.003 [DOI] [Google Scholar]
  21. Ellsworth PC (2003). Appraisal processes in emotion. Handbook of Affective Sciences. [Google Scholar]
  22. Ewing M (2019). Mgsub: Safe, multiple, simultaneous string substitution. Retrieved from https://cran.r-project.org/package=mgsub
  23. Fontaine JRJ, Scherer KR, Roesch EB, & Ellsworth PC (2007). The world of emotions is not two-dimensional. Psychological Science, 18(12), 1050–1057. 10.1111/j.1467-9280.2007.02024.x [DOI] [PubMed] [Google Scholar]
  24. Frijda NH (1986). The Emotions. Cambridge, UK: Cambridge University Press. [Google Scholar]
  25. Gelbukh A (2015). Computational Linguistics and Intelligent Text Processing: 16th International Conference, CICLing 2015 Cairo, Egypt, April 14-20, 2015 Proceedings, Part II. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 9042, 49–65. 10.1007/978-3-319-18117-2 [DOI] [Google Scholar]
  26. Goetz T, Pekrun R, Hall N, & Haag L (2006). Academic emotions from a social-cognitive perspective: Antecedents and domain specificity of students’ affect in the context of Latin instruction. British Journal of Educational Psychology, 76(2), 289–308. 10.1348/000709905X42860 [DOI] [PubMed] [Google Scholar]
  27. Goudbeek M, & Scherer K (2010). Beyond arousal: Valence and potency/control cues in the vocal expression of emotion. The Journal of the Acoustical Society of America, 128(3), 1322. 10.1121/1.3466853 [DOI] [PubMed] [Google Scholar]
  28. Grupe DW, & Nitschke JB (2013). Uncertainty and anticipation in anxiety: an integrated neurobiological and psychological perspective. Nature Reviews Neuroscience, 14(7), 488–501. 10.1038/nrn3524 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Hall NC (2008). Self-regulation of primary and secondary control in achievement settings: A process model. Journal of Social and Clinical Psychology, 27(10), 1126–1164. 10.1521/jscp.2008.27.10.1126 [DOI] [Google Scholar]
  30. Harmon-Jones E, Harmon-Jones C, & Price TF (2013). What is approach motivation? Emotion Review, 5(3), 291–295. 10.1177/1754073913477509 [DOI] [Google Scholar]
  31. Harmon-Jones E, Harmon-Jones C, & Summerell E (2017). On the importance of both dimensional and discrete models of emotion. Behavioral Sciences, 7(4). 10.3390/bs7040066 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Heidig S, Müller J, & Reichelt M (2014). Emotional design in multimedia learning: Differentiation on relevant design features and their effects on emotions and learning. Computers in Human Behavior, 44, 81–95. 10.1016/j.chb.2014.11.009 [DOI] [Google Scholar]
  33. Hu M, & Liu B (2004). Mining and Summarizing Customer Reviews. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD-2004). Seattle, WA. [Google Scholar]
  34. Ikoro V, Sharmina M, Malik K, & Batista-Navarro RT (2018). Analysing sentiments expressed on Twitter by UK energy company’s consumers. Management & Security. [Google Scholar]
  35. Izard CE (2010). The many meanings/aspects of emotion: Definitions, functions, activation, and regulation. Emotion Review, 2(4), 363–370. 10.1177/1754073910374661 [DOI] [Google Scholar]
  36. Jackson JC, Watts J, Henry TR, List J, Forkel R, Mucha PJ, … Lindquist KA (2019). Emotion semantics show both cultural variation and universal structure. Science, 366(6472), 1517–1522. 10.1126/science.aaw8160 [DOI] [PubMed] [Google Scholar]
  37. Jaeger BC (2017). r2glmm: Computes R Squared for Mixed (Multilevel) Models. Retrieved from https://cran.r-project.org/package=r2glmm [Google Scholar]
  38. Jaeger BC, Edwards LJ, Das K, & Sen PK (2017). An R2 statistic for fixed effects in the generalized linear mixed model. Journal of Applied Statistics, 44(6), 1086–1105. 10.1080/02664763.2016.1193725 [DOI] [Google Scholar]
  39. Jang J, & Kim K-E (2017). Sentiment analysis utilizing modal expressions. Journal of British and America Studies, 36(8), 179–200. [Google Scholar]
  40. Jockers ML (2017). Syuzhet: Extract sentiment and plot arcs from yext. Retrieved from https://github.com/mjockers/syuzhet
  41. Jones NM, Wojcik SP, Sweeting J, & Silver RC (2016). Tweeting negative emotion: An investigation of twitter data in the aftermath of violence on college campuses. Psychological Methods, 21(4), 526–541. 10.1037/met0000099 [DOI] [PubMed] [Google Scholar]
  42. Jurafsky D, & Martin J (2019). Naive bayes and sentiment classification. Speech and Language Processing, 1–21. [Google Scholar]
  43. Kassraie P, Modirshanechi A, & Aghajan HK (2017). Election vote share prediction using a sentiment-based fusion of Twitter data with google trends and online polls (pp. 363–370). 10.5220/0006484303630370 [DOI] [Google Scholar]
  44. Kawate S, & Patil K (2017). An approach for reviewing and ranking the customers’ reviews through quality of review. ICTACT Journal on Soft Computing, 7(2), 1390–1401. 10.21917/ijsc.2017.0193 [DOI] [Google Scholar]
  45. Kirikkanat B, & Soyer MK (2018). A path analysis model pertinent to undergraduates’ academic success: Examining academic confidence, psychological capital and academic coping factors. European Journal of Educational Research, 7(1), 133–150. 10.12973/eu-jer.7.1.133 [DOI] [Google Scholar]
  46. Kleinberg B, Mozes M, & van der Vegt I (2018). Identifying the sentiment styles of YouTube’s vloggers. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (pp. 3581–3590). Stroudsburg, PA, USA: Association for Computational Linguistics. 10.18653/v1/D18-1394 [DOI] [Google Scholar]
  47. Lamnina M, & Chase CC (2019). Developing a thirst for knowledge: How uncertainty in the classroom influences curiosity, affect, learning, and transfer. Contemporary Educational Psychology, 59(June). 10.1016/j.cedpsych.2019.101785 [DOI] [Google Scholar]
  48. Lenth R (2019). emmeans: Estimated Marginal Means, aka Least-Squares Means. R package version 1.3.5. Retrieved from https://cran.r-project.org/package=emmeans [Google Scholar]
  49. Lindquist KA, Gendron M, Satpute AB, & Lindquist K (2018). Language and emotion: Putting words into feelings and feelings into words. In Barrett LF, Lewis M, & Haviland-Jones JM (Eds.), The Handbook of Emotions (4th ed., pp. 579–594). Guiford Press. [Google Scholar]
  50. Liu Y, Yu X, Liu B, & Chen Z (2014). Sentence-level sentiment analysis in the presence of modalities. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8404 LNCS(PART 2), 1–16. 10.1007/978-3-642-54903-8_1 [DOI] [Google Scholar]
  51. Loibl K, & Rummel N (2014). Knowing what you don’t know makes failure productive. Learning and Instruction, 34, 74–85. 10.1016/j.learninstruc.2014.08.004 [DOI] [Google Scholar]
  52. Lüftenegger M, Klug J, Harrer K, Langer M, Spiel C, & Schober B (2016). Students’ achievement goals, learning-related emotions and academic achievement. Frontiers in Psychology, 7(MAY), 1–10. 10.3389/fpsyg.2016.00603 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Madan CR, Fujiwara E, Caplan JB, & Sommer T (2017). Emotional arousal impairs association-memory: Roles of amygdala and hippocampus. NeuroImage, 156, 14–28. 10.1016/j.neuroimage.2017.04.065 [DOI] [PubMed] [Google Scholar]
  54. Marsh AA, Ambady N, & Kleck RE (2005). The Effects of Fear and Anger Facial Expressions on Approach- and Avoidance-Related Behaviors. Emotion, 5(1), 119–124. 10.1037/1528-3542.5.1.119 [DOI] [PubMed] [Google Scholar]
  55. Mayer RE (2019). Searching for the role of emotions in e-learning. Learning and Instruction, 1–3. 10.1016/j.learninstruc.2019.05.010 [DOI] [Google Scholar]
  56. Meinhardt J, & Pekrun R (2003). Attentional resource allocation to emotional events: An ERP study. Cognition and Emotion, 17(3), 477–500. 10.1080/02699930244000039 [DOI] [PubMed] [Google Scholar]
  57. Mohammad SM (2018). Obtaining reliable human ratings of valence, arousal, and dominance for 20,000 English words. ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), 1, 174–184. 10.18653/v1/p18-1017 [DOI] [Google Scholar]
  58. Mohammad SM, & Turney PD (2010). Emotions evoked by common words and phrases: using mechanical turk to create an emotion lexicon. CAAGET ’10 Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text, (June), 26–34. [Google Scholar]
  59. Mohammad SM, & Turney PD (2013). Crowdsourcing a word-emotion association lexicon. In Computational Intelligence (Vol. 29, pp. 436–465). 10.1111/j.1467-8640.2012.00460.x [DOI] [Google Scholar]
  60. Münchow H, & Bannert M (2019). Feeling good, learning better? Effectivity of an emotional design procedure in multimedia learning. Educational Psychology, 39(4), 530–549. 10.1080/01443410.2018.1524852 [DOI] [Google Scholar]
  61. Nelson BD, Kessel EM, Jackson F, & Hajcak G (2016). The impact of an unpredictable context and intolerance of uncertainty on the electrocortical response to monetary gains and losses. Cognitive, Affective and Behavioral Neuroscience, 16(1), 153–163. 10.3758/s13415-015-0382-3 [DOI] [PubMed] [Google Scholar]
  62. årup Nielsen F. (2011). A new ANEW: Evaluation of a word list for sentiment analysis in microblogs. In Proceedings of the ESWC2011 Workshop on “Making Sense of Microposts”: Big things come in small packages. (pp. 93–98). [Google Scholar]
  63. Nieuwenhuis R, te Grotenhuis M, & Pelzer B (2012). Influence.ME: Tools for detecting influential data in mixed effects models. R Journal, 4(2), 38–47. 10.32614/RJ-2012-011 [DOI] [Google Scholar]
  64. Ooms J (2018). Hunspell: High-performance stemmer, tokenizer, and spell checker. r package version 3.0. Retrieved from https://cran.r-project.org/package=hunspell
  65. Open Data Science. (2018). An introduction to sentence-Level sentiment analysis with sentimentr. Retrieved August 8, 2019, from https://medium.com/@ODSC/an-introduction-to-sentence-level-sentiment-analysis-with-sentimentr-ac556bd7f75a [Google Scholar]
  66. Osgood CE, May WH, & Miron MS (1975). Cross-Cultural Universals in Affective Meaning. Urbana, IL: University of Illinois Press. [Google Scholar]
  67. Otani H, Libkuman TM, Widner RJ, & Graves EI (2007). Memory for emotionally arousing stimuli: A comparison of younger and older adults. Journal of General Psychology, 134(1), 23–42. 10.3200/GENP.134.1.23-42 [DOI] [PubMed] [Google Scholar]
  68. Overmier JB, & Wielkiewicz RM (1983). On unpredictability as a causal factor in “learned helplessness.” Learning and Motivation, 14(3), 324–337. 10.1016/0023-9690(83)90020-6 [DOI] [Google Scholar]
  69. Padmala S, Sambuco N, & Pessoa L (2019). Interactions between reward motivation and emotional processing 1. In Emotion and Cognition (pp. 1–21). 10.1016/bs.pbr.2019.03.023 [DOI] [PubMed] [Google Scholar]
  70. Park B, Knörzer L, Plass JL, & Brünken R (2015). Emotional design and positive emotions in multimedia learning: An eyetracking study on the use of anthropomorphisms. Computers & Education, 86, 30–42. 10.1016/j.compedu.2015.02.016 [DOI] [Google Scholar]
  71. Pekrun R (2006). The control-value theory of achievement emotions: Assumptions, corollaries, and implications for educational research and practice. Educational Psychology Review, 18(4), 315–341. 10.1007/s10648-006-9029-9 [DOI] [Google Scholar]
  72. Pekrun R, Frenzel AC, Goetz T, & Perry RP (2007). The control-value theory of achievement emotions: An integrative approach to emotions in education. In Schutz PA & Pekrun R, (Eds.), Emotion in Education (pp. 13–36). Amsterdam: Academic Press. [Google Scholar]
  73. Pekrun R, Goetz T, Frenzel AC, & Barchfeld P (2011). Measuring emotions in students’ learning and performance: The Achievement Emotions Questionnaire (AEQ). Contemporary Educational Psychology, 36(1), 36–48. [Google Scholar]
  74. Pekrun R, Goetz T, Titz W, & Perry RP (2002). Academic emotions in students’ self-regulated learning and achievement: A program of qualitative and quantitative research. Educational Psychologist, 37(2), 91–105. 10.1207/S15326985EP3702_4 [DOI] [Google Scholar]
  75. Pekrun R, Lichtenfeld S, Marsh HW, Murayama K, & Goetz T (2017). Achievement emotions and academic performance: Longitudinal models of reciprocal effects. Child Development, 88(5), 1653–1670. 10.1111/cdev.12704 [DOI] [PubMed] [Google Scholar]
  76. Pekrun R, & Perry RP (2014). The control-value theory of achievement emotions in education. In Pekrun R & Linnenbrink-Garcia L (Eds.), International Handbook of Emotions in Education (pp. 13–36). New York: Routledge. 10.4324/9780203148211 [DOI] [Google Scholar]
  77. Pennebaker JW, Boyd RL, Jordan K, & Blackburn K (2015). The development and psychometric properties of LIWC2015. (U. of T. at Austin, Ed.). Austin, TX. [Google Scholar]
  78. Perry RP, Hladkyj S, Pekrun RH, Clifton RA, & Chipperfield JG (2005). Perceived academic control and failure in college students: A three-year study of scholastic attainment. Research in Higher Education, 46(5), 535–569. 10.1007/s11162-005-3364-4 [DOI] [Google Scholar]
  79. Peterson ER, Brown GTL, & Jun MC (2015). Achievement emotions in higher education: A diary study exploring emotions across an assessment event. Contemporary Educational Psychology, 42, 82–96. 10.1016/j.cedpsych.2015.05.002 [DOI] [Google Scholar]
  80. Plass JL, Heidig S, Hayward EO, Homer BD, & Um E (2014). Emotional design in multimedia learning: Effects of shape and color on affect and learning. Learning and Instruction, 29, 128–140. 10.1016/j.learninstruc.2013.02.006 [DOI] [Google Scholar]
  81. Polanyi L, & Zaenen A (2006). Contextual valence shifters. In Computing Attitude and Affect in Text: Theory and Applications (pp. 1–10). Berlin/Heidelberg: Springer-Verlag. 10.1007/1-4020-4102-0_1 [DOI] [Google Scholar]
  82. Pourtois G, Schettino A, & Vuilleumier P (2013). Brain mechanisms for emotional influences on perception and attention: What is magic and what is not. Biological Psychology, 92(3), 492–512. 10.1016/j.biopsycho.2012.02.007 [DOI] [PubMed] [Google Scholar]
  83. Raja AM (2017). Handling ‘happy’ vs ‘not happy’: Better sentiment analysis with sentimentr in R ∣ DataScience+. Retrieved August 8, 2019, from https://datascienceplus.com/handling-happy-vs-not-happy-better-sentiment-analysis-with-sentimentr-in-r/ [Google Scholar]
  84. file:///C:/Users/c00301945/Downloads/fpsyg-08-00243. pd. Respondek L, Seufert T, Hamm JM, & Nett UE (2020). Linking changes in perceived academic control to university dropout and university grades: A longitudinal approach. Journal of Educational Psychology, 112(5), 987–1002. 10.1037/edu0000388 [DOI] [Google Scholar]
  85. Rinker TW (2018a). {textclean}: Text Cleaning Tools}. [Google Scholar]
  86. Rinker TW (2018b). Lexicon: Lexicon data version 1.2.1. Retrieved from http://github.com/trinker/lexicon
  87. Rinker TW (2019). sentimentr: Calculate text polarity sentimentr. Retrieved June 2, 2019, from https://cran.r-project.org/web/packages/sentimentr/readme/README.html [Google Scholar]
  88. Ruscio J (2008). A probability-based measure of effect size: Robustness to base rates and other factors. Psychological Methods, 13(1), 19–30. 10.1037/1082-989X.13.1.19 [DOI] [PubMed] [Google Scholar]
  89. Russell JA, & Carroll JM (1999). On the bipolarity of positive and negative affect. Psychological Bulletin, 125(1), 3–30. 10.1037//0033-2909.125.1.3 [DOI] [PubMed] [Google Scholar]
  90. Russell JA, & Mehrabian A (1977). Evidence for a three-factor theory of emotions. Journal of Research in Personality, 11(3), 273–294. 10.1016/0092-6566(77)90037-X [DOI] [Google Scholar]
  91. Samuel J, & Kretinin A (2018). When the going gets tough, the tweets get going! An exploratory analysis of tweets sentiments in the stock market. American Journal of Management, 18(5), 2012–2017. 10.33423/ajm.v18i5.251 [DOI] [Google Scholar]
  92. Sanders JE (2018). Similarities and differences in the argumentative characteristics of the official brexit campaigns. LSE Undergraduate Political Review, 1, 1–21. 10.21953/lse.naec6c7u8de9 [DOI] [Google Scholar]
  93. Scholer AA, Cornwell JFM, & Higgins ET (2019). Should We Approach Approach and Avoid Avoidance? An Inquiry from Different Levels. Psychological Inquiry, 30(3), 111–124. 10.1080/1047840X.2019.1643667 [DOI] [Google Scholar]
  94. Schulder M, Wiegand M, & Ruppenhofer J (2020). Automatic generation of lexica for sentiment polarity shifters. Natural Language Engineering, (2020), 1–27. 10.1017/S135132492000039X [DOI] [Google Scholar]
  95. Schweizer S, Satpute AB, Atzil S, Field AP, Hitchcock C, Black M, … Dalgleish T (2019). The impact of affective information on working memory: A pair of meta-analytic reviews of behavioral and neuroimaging evidence. Psychological Bulletin, 145(6), 566–609. 10.1037/bul0000193 [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Sinha P, Choudhury AD, & Agrawal AK (2014). Sentiment analysis of wimbledon tweets. In CEUR Workshop Proceedings (Vol. 1141, pp. 51–52). [Google Scholar]
  97. Smith CA, & Ellsworth PC (1985). Patterns of Cognitive Appraisal in Emotion. Journal of Personality and Social Psychology, 48(4), 813–838. 10.1037/0022-3514.48.4.813 [DOI] [PubMed] [Google Scholar]
  98. Stark L, Brünken R, & Park B (2018). Emotional text design in multimedia learning: A mixed-methods study using eye tracking. Computers & Education, 120, 185–196. 10.1016/j.compedu.2018.02.003 [DOI] [Google Scholar]
  99. Tausczik YR, & Pennebaker JW (2010). The psychological meaning of words: LIWC and computerized text analysis methods. Journal of Language and Social Psychology, 29(1), 24–54. 10.1177/0261927X09351676 [DOI] [Google Scholar]
  100. Tiedens LZ, & Linton S (2001). Judgment under emotional certainty and uncertainty: The effects of specific emotions on information processing. Journal of Personality and Social Psychology, 81(6), 973–988. 10.1037/0022-3514.81.6.973 [DOI] [PubMed] [Google Scholar]
  101. Tipton E (2014). How Generalizable Is Your Experiment? An Index for Comparing Experimental Samples and Populations. Journal of Educational and Behavioral Statistics, 39(6), 478–501. 10.3102/1076998614558486 [DOI] [Google Scholar]
  102. Tyng CM, Amin HU, Saad MNM, & Malik AS (2017). The influences of emotion on learning and memory. Frontiers in Psychology, 8, 1–22. 10.3389/fpsyg.2017.01454 [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Um E. “Rachel,” Plass JL, Hayward EO, & Homer BD (2012). Emotional design in multimedia learning. Journal of Educational Psychology, 104(2), 485–498. 10.1037/a0026609 [DOI] [Google Scholar]
  104. Van der Meer T, Te Grotenhuis M, & Pelzer B (2010). Influential cases in multilevel modeling: A methodological comment. American Sociological Review, 75(1), 173–178. 10.1177/0003122409359166 [DOI] [Google Scholar]
  105. Warriner AB, Kuperman V, & Brysbaert M (2013). Norms of valence, arousal, and dominance for 13,915 English lemmas. Behavior Research Methods, 45(4), 1191–1207. 10.3758/s13428-012-0314-x [DOI] [PubMed] [Google Scholar]
  106. Winter B (2020). Mixed Models 1: Conceptual Introduction. In Statistics for Linguists: An Introduction Using R. New York: Routledge. [Google Scholar]
  107. Yeager DS, Hanselman P, Walton GM, Murray JS, Crosnoe R, Muller C, … Dweck CS (2019). A national experiment reveals where a growth mindset improves achievement. Nature, 573(7774), 1–6. 10.1038/s41586-019-1466-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Yeager DS, Romero C, Paunesku D, Hulleman CS, Schneider B, Hinojosa C, … Dweck CS (2016). Using design thinking to improve psychological interventions: The case of the growth mindset during the transition to high school. Journal of Educational Psychology, 108(3), 374–391. 10.1007/s13398-014-0173-7.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  109. Yiend J, Barnicot K, Williams M, & Fox E (2018). The influence of positive and negative affect on emotional attention. Journal of Behavior Therapy and Experimental Psychiatry, 61(March 2017), 80–86. 10.1016/j.jbtep.2018.06.008 [DOI] [PubMed] [Google Scholar]
  110. Yu H, Shang J, Hsu M, Castellanos M, & Han J (2016). Data-driven contextual valence shifter quantification for multi-theme sentiment analysis. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management (Vol. 176, pp. 939–948). New York, New York, USA: ACM Press. 10.1145/2983323.2983793 [DOI] [PMC free article] [PubMed] [Google Scholar]
  111. Zhu P, Garcia I, Boxer K, Wadhera S, & Alonzo E (2019). Using a growth mindset intervention to help ninth-graders: An independent evaluation of the National Study of Learning Mindsets. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES