Skip to main content
International Journal of Environmental Research and Public Health logoLink to International Journal of Environmental Research and Public Health
. 2021 Apr 26;18(9):4591. doi: 10.3390/ijerph18094591

Perceiving Social-Emotional Volatility and Triggered Causes of COVID-19

Si Jiang 1,2, Hongwei Zhang 3, Jiayin Qi 1,4,*, Binxing Fang 1,5, Tingliang Xu 6
Editor: Paul B Tchounwou
PMCID: PMC8123597  PMID: 33926072

Abstract

Health support has been sought by the public from online social media after the outbreak of novel coronavirus disease 2019 (COVID-19). In addition to the physical symptoms caused by the virus, there are adverse impacts on psychological responses. Therefore, precisely capturing the public emotions becomes crucial to providing adequate support. By constructing a domain-specific COVID-19 public health emergency discrete emotion lexicon, we utilized one million COVID-19 theme texts from the Chinese online social platform Weibo to analyze social-emotional volatility. Based on computed emotional valence, we proposed a public emotional perception model that achieves: (1) targeting of public emotion abrupt time points using an LSTM-based attention encoder-decoder (LAED) mechanism for emotional time-series, and (2) backtracking of specific triggered causes of abnormal volatility in a cognitive emotional arousal path. Experimental results prove that our model provides a solid research basis for enhancing social-emotional security outcomes.

Keywords: social-emotional volatility, COVID-19, cause detection, deep learning

1. Introduction

The pandemic of the novel coronavirus disease 2019 (COVID-19) highlights the vital role of online social media in the development of sudden public health. When the public is in home isolation (to avoid outdoor gathering activities and to reduce chances of infection), people are more likely to share on social networks benign symptoms and conditions (e.g., headache, stress, and cough), as well as inconveniences associated with those conditions [1]. These psychological pressures affect people’s physical and mental health and pose a potential threat to normal and stable society.

Several scholars have been concerned about the psychological impact and have identified that the media played a vital role during the COVID-19 pandemic because the public sought security through excessive use of Internet information related to the events [2,3,4]. To supplement previous research on psychological effects studying the polarity of emotions [5], which are normally considered to be positive, negative, or neutral [6], discrete emotions have predictable effects on subsequent cognitive effort, depending on an individual’s underlying appraisal of certainty [7]. Thus, we utilized eight discrete emotional dimensions to measure changes in the public’s psychological state on Weibo, a Chinese version of Twitter and a platform that appeals to the 500 million people sharing their opinions. Consequently, we were able to dynamically perceive the emotional volatility of a discrete emotional time-series of collective cognition since social network data are the most effective and accurate indicators for studying public sentiment [8].

We integrated both abnormal emotional volatility detection and triggered causes into a perception model to dynamically monitor the public’s emotional reactions. The volatility is reflected in emotional valence, which is generally considered to be an expression of emotional intensity and strength. We first constructed a discrete emotion lexicon that for measured and monitored the public’s emotional state with regard to COVID-19 public health emergencies. In addition, we proposed a public emotional perception model that integrated deep learning to detect abrupt timing by the stimulus. Figure 1 shows the overall model process. First, we constructed a series of datasets, including the computed emotional valences, topics in the pandemic period, and tags for events that aroused a high degree of public attention. These labeled data served as the basis for the model for evaluating emotion. Second, we input each continuous emotional time-series and their valences into our proposed LSTM-based attention encoder-decoder (LAED) mechanism on emotional burst time points estimation. Third, we traced back the time window before that time point, followed the emotional arousal path forward according to the emotion appraisal model, ranked the specific corresponding causes of emotional volatility, and finally accurately positioned the largest risks affecting public emotions. We also compared the Pearson correlation coefficient between infected persons and discrete emotions.

Figure 1.

Figure 1

Overall Public Emotional Perception Model.

Experimental results show that our domain-specific emotional lexicon can capture the public emotional situation, and the perception model not only provides a solution to real-time dynamic monitoring of the public’s emotional state but also serves as an aid for detecting actual emotional causes. Public health emergencies concern the safety of people’s livelihood, and our research provides a solution using the online social media context to identify in advance potential emotional risks.

The remainder of this paper is organized as follows: Section 2 summarizes related research work, Section 3 describes the methods of domain-specific discrete emotion lexicon construction and our proposed time-point detection and cause perception model, Section 4 reports the experimental process and results, and the conclusions are presented in Section 5.

2. Related Work

In contrast to the existing studies on event evolution and emotion classification, we managed to solve how specific discrete emotions depict the psychological level of the public during an emergency. We refined the relevant research in the two subsequent sections: Section 2.1 public emotion and discrete emotion lexicon and Section 2.2 emotional abrupt and cause detection.

2.1. Public Emotion and Discrete Emotion Lexicon

Lexicon-based sentiment analysis has shown promise for analyzing the impact of user emotions on social networks [9,10,11]. Therefore, it is imperative to build an adaptive emotional lexicon to capture the public’s discrete emotions. TensiStrength detects stress and relaxation strength in social web texts associated with human-annotated corpora [12]. Using the correlation of online users’ stress states from Twitter interactions, a hybrid model comprising a factor graph model and a convolutional neural network (CNN) was proposed to improve detection performance [13]. For new words in the field, a large unlabeled corpus by Sogou, a Chinese search engine, was utilized to correct and expand current Chinese sentiment dictionaries [14], and a deep learning neural network extracted keywords to predict and identify potential default risk platforms [15]. However, none of these studies were conducted on specific Chinese discrete emotions. The most relevant Chinese fine-grained scheme initially proposed by Ren-CECps [16] ensures the accuracy of emotive words but lacks the ability to adapt to the domain. They constructed a new lexicon, Ren-CECps-SWB 2.0, for measuring the subjective well-being (SWB) of Chinese [17].

To this end, it was necessary to construct an emotional lexicon based on human cognitive psychology, which is suitable for accurately describing the changes of public emotional and stress states in online social media during the COVID-19 pandemic.

2.2. Emotional Abrupt and Cause Detection

There is a lack of research on the specific factors influencing public mentality during the COVID-19 pandemic [3], and the research has failed to solve the problem of tracking and identifying the stressors to explain the role of anxiety from the perspective of psychological factors [18]. Traditional cause extraction involves finding the event corresponding to a particular emotion by using the grammatical rules of emotional sentences, using a multi-attention-based neural network that was grounded in the context of words to locate emotional clauses [19].

However, our definition of the cause of emotion is different from the prior ones. We maintain that emotion at the semantic level is the description of the emotional body and emotional target object, which can be achieved by syntactic and lexical analysis. The emotional cause was proposed based on the OCC model of emotion [20], a rule constructed by Bayesian probability to recognize different emotional components under different emotional factors [21]. Some researchers analyzed emotional changes, extracted foreground and background latent dirichlet allocation (LDA) models to extract foreground topics, and proposed cause candidate and background LDA models to rank the popularity of extracted foreground topics in the emotional change period [22]. Others adopted a multivariate Gaussian distribution with a power-law distribution to model and analyze the users’ emotion of microblogs and to detect the reactive emotion state [23], focusing on the sentiment tracking of the needs of different entities and detecting sentiment spikes on the problem of extracting and ranking the causes of a sentiment spike [24].

Therefore, we detected the most obvious type of emotions among the public due to their different psychological acceptance of media information and government measures, analyzed the effect of different causes of abnormal emotion, and emphasized the most extensive public risk factors.

3. Methods

3.1. Dataset Analysis

The experiment selected the 230 COVID-19 themes dataset (https://www.datafountain.cn/competitions/423, accessed on 3 March 2020) for the period between 1 January 2020 and 20 February 2020, during which time a total of 1 million Chinese Weibo data were collected. We calculated the emotional categories and emotional valences of different individual public posts within the unit time interval to integrate the public collective emotions and detect hot events. To cover all of the candidate events, we referred to an emerging social media site, zhiweidata.com, as well as the traditional media Nanfang Metropolis Daily “Special COVID-19”. We performed the LDA topic model in the test corpus to make the detection of epidemic events more representative. For the evaluation of perplexity and coherence score, the optimal number of topics equaled 7. Other relevant reports had different divisions on the number and type of topics according to the contents and objects. Our research team individually labeled the major COVID-19 events into topics by personal cognition of real events, in addition to topic classifications made by news reports. Finally, we summarized 7 topics (listed in Table 2) based on the content from the open data we used and used the LDA topic model to label the corpus by the dominant percentage score. Meanwhile, the time points were labeled in the form of text similarity to major events and served as the basis for our response to the public.

3.2. Measurement of Public Emotions

Regarding the COVID-19 public health emergency, a suitable emotion lexicon was the basis for measuring the public’s emotional characteristics. Ren_CECps lexicon consists of four positive emotions—expect, joy, love, surprise—and four negative emotions—anxiety, sorrow, angry, hate—providing a Chinese annotation model with eight emotional dimensions. We extracted labeled keywords and emotional phrases from the original Ren_CECps lexicon and loaded them on a 10,000 randomly extracted test corpus. The results indicated that the coverage was only 14.85% of the corpus, which suggests that the original lexicon lacked the capability of understanding the actual public emergency emotional situation. To better adapt and measure public opinion of the COVID-19 outbreak, we adapted a domain discrete emotion lexicon, optimized emotional volatility, and finally constructed a better lexicon that could more accurately describe the public emotional phenomenon.

3.2.1. Discrete Emotion Lexicon Construction

We extracted labeled keywords and phrases with eight-dimensional emotion tags from 1487 Chinese blogs in Ren-CECps, checked word length, part of speech, and emotional intensity and eliminated repeated words that had no clear meaning when the eight-dimensional emotional scores were all equal to 0. Then, extending synonyms from HIT Tongyici Cilin were extended by bootstrapping to add synonyms to the list of seeds, and the list was continuously updated. To adjust for repeated words, we adopted the method of multiple sampling, which is updated in Equation (1):

ei=1Nj=1MeValencej (1)

where, ei represents valence of the emotional type, N is the total number under the emotional type, eValence is the original annotated emotional strength, and M is the total number of words appearing. Previous experimental experience proves that our approach of computing emotional valence for eight emotional dimensions within the time interval can express emotion volatility [17,25].

Secondly, we combined context representation by using the Google open source tool Word2vec and the CBOW language model. We trained the semantic word vectors from 3.5G SogouCA (http://www.sogou.com/labs/resource/ca.php, accessed on 20 June 2017) and satisfied the maximum conditional probability. In addition, we extracted high-frequency word sequences of the test corpus and expanded the domain emotion lexicon by word vector similarity. On the basis of calculating the cosine distance, we used the k-NN algorithm to locate the most relevant candidate words that matched the target words. By adjusting the threshold range of k, we determined the emotional category of the new candidate words and assigned value to them. After repeated experiments and manual verification, we chose k = 5 given the premise of similarity greater than 0.6, and finally expanded to 2455 domain-specific extra words on the topic of the COVID-19 public health pandemic.

After the above steps, we constructed a total of 45,096 emotion vectors with 110,387 overlapping dimensions. Figure 2 shows the lexicon scale comparison of expansion and origin in the number of word count in each emotion. The scale of the expanded emotional lexicon was nearly 4-fold of the original.

Figure 2.

Figure 2

Comparison of the Lexicon Scale of Expansion and the Original.

3.2.2. Optimization Strategy of Social-Emotional Volatility

To verify the expanded lexicon validity, we extracted an extra 10,000 test corpus from the open dataset. First, we cleaned low-frequency words (word frequency less than 5) from the lexicon. Following the experience of previous work, volatility was considered with different levels of valence. With the increase in valence, the number of words in the lexicon and domain words decreased continuously, but the effect of emotional volatility was more noticeable under the same situation. Because a value of 0 in a dimension means there is no emotional expression in the corresponding dimension, we removed the non-zero valence lower than a certain threshold for all eight dimensions. Then, we statistically compared the original with the expansion, as well as with the words retrieved in the emotion lexicon, the coverage of the test corpus, and the count and coverage of COVID-19 domain words under different valence thresholds in Table 1.

Table 1.

Coverage Comparison in Different Lexicons and Valences.

Original Expansion ≥0.1 ≥0.2 ≥0.3 ≥0.4 ≥0.5 ≥0.6 ≥0.7 ≥0.8 ≥0.9
Count in Lexicon 17,530 45,096 29,681 24,044 20,601 16,892 13,950 9052 4541 2002 458
Coverage in Corpus 0.1485 0.7484 0.6531 0.5186 0.4825 0.4129 0.3371 0.2431 0.1645 0.1292 0.0835
Count in Domain 24 1837 1625 1164 1044 833 600 432 186 75 19
Coverage in Domain 0.0097 0.7482 0.6619 0.4741 0.4252 0.3393 0.2443 0.1759 0.0757 0.0305 0.0077

The experiment showed that the expanded domain lexicon significantly improved the matching of the COVID-19 corpus compared with the original. The coverage of both the domain words and the test corpus increased 5-fold, reaching 74.84%. After testing and observing the change in emotional volatility under each threshold, the coverage rate of the corpus and field decreased significantly after 0.4. Therefore, we chose 0.3 as our optimized threshold, where the coverage rate of the corpus was still close to 50% under the condition of 42.52% coverage rate of field words, and the emotional volatility of eight dimensions was still relatively obvious.

We observed whether the improved lexicon was consistent with the emotional fluctuations of the public. For each microblog content, there is also the phenomenon of emotional co-occurrence. This causes the valence in each dimension to weaken, and it is the reason why we optimized emotional valence to 0.3 to intensify the emotional volatility. The calculation method of emotional valence within a unit time interval E is formulated in Equation (2):

E=1Mm=1M[1Nn=1Nenm(j)]=1MNm=1Mn=1Nenm(j) (2)

where enm(j) represents the emotional valence of the jth valence of the nth word in a total of m texts per unit of time, so m=1Mn=1Nenm(j) denotes the emotional valence of all words in m texts in the jth dimension in a unit of time.

We outline the data timeline in Figure 3 to show eight emotional volatilities and their distribution, from 1 January to 18 February. We selected a part of major public opinion events to observe the emotional volatility that reflected the public emotional reactions. We identified the time point, as shown in Figure 3, where the emotion volatility was obvious at the moment of major events. Thus, volatility can accurately depict the emotional state of the public during the pandemic. For example, on 20 January, a news press conference was received with the news of novel coronavirus “Human-to-human transmission”. Afterward, feelings of hate and anxiety dramatically increased and reached an all-time high on 21 January. Overall, negative emotions experienced a significant process of rising emotional valence throughout the entire time period; the most prevalent were hate and anxiety. Love, which had the largest coverage in the lexicon and a high valence, showed an overall downward trend.

Figure 3.

Figure 3

Summary of Eight Emotional Volatilities and Major Events.

The results show that the adapted emotional lexicon is scientific and effective in psychological quantification. Therefore, as far as discrete emotion research is concerned, it is necessary to analyze the volatility of each emotion carefully to find the types of emotions that are specifically relevant to the events.

3.3. Public Emotional Perception Model

3.3.1. Abrupt Time Point Perception

In the midst of a public health security emergency, the public’s emotional state is more sensitive and fluctuates at the time point where the reaction occurs after being stimulated. It is difficult to locate the abrupt time points of public emotions because the data are sparse. This poses a great challenge when trying to create an effective model. Thus, we propose an LSTM-based attention encoder-decoder (LAED) mechanism for emotional time-series to solve this problem, as illustrated in Figure 4.

Figure 4.

Figure 4

Architecture of LAED Model.

The model is trained to reconstruct “normal” emotional time-series by taking the most recent emotional time-series instance as the encoder input and jointly predicting several future emotional time-series instances as the decoder output. Then, the reconstruction error between the real target and the decoder output is obtained and used to compute the likelihood of an anomaly at this time point. In the training process, the model only considers a normal instance and predicts a normal instance. The intuition is that when the model encounters new data, including normal and abnormal instances, the reconstruction error of the abnormal emotional valences would be higher than that of the normal ones.

We built a continuous time-series on H={x(1), x(2),,x(D)} using January data. Every x(i) denotes one hour at a time, computed from Section 3.3, where each sequence x(i) consisted of eight-dimensional emotion vectors at a time instance. Here, we obtained multiple time-series Xip={x(i+1),x(i+2),,x(i+p)}(pD) by taking a window of length p over a larger time-series H. We first trained the LAED model with a training set that contained no anomalies to reconstruct instances of a normal emotional time-series. The model took the most recent p values as inputs and jointly reconstructed the output future q values X^i+pq={x^(i+p+1),x^(i+p+2),,x^(i+p+q)} with the target emotion time-series Xi+pq={x(i+p+1),x(i+p+2),,x(i+p+q)}. The model only reconstructed x^(i+p+1) (i.e., q = 1) if the prediction accuracy was low. In this process, he(m) is the hidden state of the encoder at step time m for each m{1, 2, , p}, and we utilized the attention mechanism he(m) to obtain a context vector representing the sum of different weights on hidden states and to improve prediction accuracy, so the context vector C is represented as follows in Equations (3)–(5):

C=m=1pαihe(m) (3)
α=Softmax(heW+b) (4)
he=[he(1); he(2);;he(p)], where [;] represents concatation. (5)

The context vector C was used for the initial state in the decoder. A fully connected layer on top of the LSTM decoder layer was used to predict the target emotion time-series. During the training process, the decoder used x(i+p+n) as input to obtain the hidden state hd(n+1), n{1, 2, , q1} and predicted x^(i+p+n+1) corresponding to the target x(i+p+n+1). However, during inference, the predicted value x^(i+p+n) was input to the decoder to obtain hd(n+1) to predict x^(i+p+n+1). Our model was trained by mean squared error (MSE) as the loss function to measure the distance between x(i+p+n+1) and x^(i+p+n+1).

Similar to Malhotra [26], we divided the emotional time-series into four parts: two for normal emotions, N and VN, and the other two for abnormal emotions, VA and T. Here, N refers to the training set for the LSTM encoder-decoder model, and VN is used for early stopping to prevent the model from overfitting the training set on N. The reconstruction error vector for step n in the decoder is given by Equation (6):

e(n)=|x(i+p+n)x^(i+p+n)| (6)

Then, we estimated the parameters μ and Σ of a Gaussian distribution N(μ, Σ) by maximum likelihood estimation (MLE) from eight-dimensional reconstruction errors in the normal set VN, and set a log probability densities (PD) of errors on VA as anomaly scores, where the lower value indicates a greater likelihood of anomalous emotions. For any x(i+p+n), the anomaly score is calculated in Equation. (7):

Sn=(e(n)μ)TΣ1(e(n)μ) (7)

where Sn is a manual threshold on log PD to separate anomalies from normal data, preventing the possibility of false positives from the test set T. Finally, a separate test set T was used to evaluate the model.

To separate the abrupt emotional time points from emotional time-series, we utilized a simple and effective threshold adjustment method to find the optimal log PD. Then, we used these log PD against all eight emotional sequences to ensure all the emotions were at the same level for detecting abrupt points. The performance of different log PD on VA in detecting the number of abrupt points is illustrated in Section 4.1.

3.3.2. Triggered Causes Tracking

The purpose of our study was to identify major events that caused an emotion outburst. Because of the limited information available while quarantined at home, the public paid significant attention to the pandemic news. However, depending on the release time of media reports and the transfer behaviors of online users, emotional volatility inevitably delayed or persisted. For this reason, we pinpointed the cause of the sudden change in the preceding period of the time window. Based on what we found, the most obvious emotion types inspired the public in Section 3.3.1, we verified the tracking results from the perspective of semantic features in the context and the perspective of emotion theory, respectively.

From the perspective of semantic features, each event has a unique corresponding eight-dimensional emotional valence; thus, we compared the difference in emotional distribution of the event and of the day. The distribution distance was measured by the Kullback–Leibler divergence [27]. Then, we located the cause ranking, which comes first in the order.

Emotion theory provides a psychological foundation for human cognition so that the exact discrete emotion is ruled in its elicitation condition. In particular, the OCC theory contains detailed discrete emotions in a context concerning the different consequences of events, actions of agents, and aspects of objects, and OCC theory is widely used as a computational model to depict the human cognition process. It identifies the underlying cognitive structure and emotional dimensions [28]. This approach argues that emotional reactions to an event are a product of personal interpretations and appraisals in a situational environment [29,30]. Therefore, we aligned and divided the eight discrete emotions in our lexicon with the emotional types in the hierarchical structure OCC model, as shown in Figure 5. In the emotional arousal path, we defined the eight emotional types as the emotional pathways of three different concerns in the OCC model, which reflected the three-appraisal event-to-emotion mapping processes and emotional responses. The eight-dimensional emotional response to the valenced reactions in the OCC model provided an important theoretical basis for tracing abnormalities.

Figure 5.

Figure 5

Eight Discrete Emotions Corresponding to the OCC Model.

Each discrete emotion is elicited by a unique pattern of cognitive appraisals, and situations with the same appraisal pattern will induce the same emotion [31]. At the same time, we marked the candidate events into seven topics and their focus in the OCC model in Table 2.

Table 2.

Seven Topics and Three Emotional Arousal Paths.

Topics Emotional Arousal Path
Pandemic news and data report Aspects of Objects
Medical care was on the front line Actions of Agents
Nation- and local-issued policy measures Consequence of Events
Public protection initiative Aspects of Objects
Scientific breakthroughs and knowledge dissemination Consequence of Events
Peripheral symptoms cause inner anxiety Actions of Agents
Be objective in daily life Aspects of Objects

We computed the emotional distribution of public opinion events and topics according to the abnormal emotions found by the LAED model. We tracked the emotion arousal path to ensure each topic was related to the emotional elicitation conditions. Thus, we obtained the emotional distribution of the candidate scenario in DC=d1C,d2C,,d8C, C{event,topic, day}. The influence of the corresponding emotional situation of the event on the overall distribution is calculated by KL divergence in Equations (8)–(9):

KL(Devent,Dday)=iDevent(i)logDevent(i)Dday(i) (8)
KL(Dtopic,Dday)=iDtopic(i)logDtopic(i)Dday(i) (9)

The method of emotion arousal path highlights the properties of emotion, which identify the issue of finding the most influential event according to the content. Because events are diverse, the emotion arousal path has a fixed induced path from the cognition process, so it more accurately positions the induced causes. The larger the KL value, the greater the difference between the two distributions. In brief, while the situational distribution was the most similar to the overall distribution of the day, we obtained the smallest KL value, which meant that this situation was more likely to influence the public emotion. Thus, we clearly identified emotional spikes when events occurred using the emotional arousal path and target-triggered causes.

For the triggered reasons, we chose the order of emotion-induced ranking metrics in precision in Equation (10) and mean average accuracy (MAP) in Equation (11):

Precison@K=YrelevantK (10)

where K refers to the top K texts of the real and predicted ranking, and the number of matched texts in prediction is Yrelevant. Therefore, assuming that the order of the real matched texts in the prediction K1,K2,,Kr, where r is the number of all matched texts in the entire list, then:

MAP=i=1rP@K|r| (11)

4. Experiment Results and Analyses

In this section, we provide the results of this study. Our emotional perception model was able to position an abrupt time point during COVID-19 that reflected the superiority of our model. Additionally, we include a visual display of pertinent research and reach important conclusions.

4.1. Detection Effect of Abrupt Time Points

The evaluation criteria for the model were geared toward the emotional change detection task. We used the LAED model to detect abnormal emotional valence. Depending on the training of regular “normal” emotions on N, we used the validation set VA to verify the feasibility of the model. Table 3 below shows the average result of different log PD on VA to detect the number of abrupt points in eight emotional time-series.

Table 3.

Performance Comparison of Different Thresholds.

log PD Predictive Abrupt Points True Value Accuracy Recall
Abnormal Normal
−1.0 24 13 11 0.541 0.812
−1.5 17 12 5 0.705 0.75
−2.0 13 10 3 0.769 0.625
−2.5 9 7 2 0.777 0.437
−3.0 4 4 0 1.0 0.25

The set VA contained 1000 emotional time-series with 16 abnormal points. From the table above, the number of predictive abrupt time points decreased as log PD declined, and the true value of abnormal also reduced. The aim was for the model to detect the real abnormal points as much as possible. Although all the abnormal points were found when log PD was set at −3.0, the recall (4/16) was quite low to fit the real points. Therefore, we balanced the accuracy and recall and verified the optimal threshold when log PD was equal to −2.0, where the accuracy of model prediction was 76.9% (10/13) and the recall was 62.5% (10/16).

Before we present the LAED model performance, we demonstrate eight emotional volatilities in terms of the day. As shown in in Figure 6, to observe daily emotional volatility, we used the black line to represent the volatility curve of the original emotional valence, and the red dotted line is the polynomial trend line.

Figure 6.

Figure 6

Eight Emotional Volatilities in January and February.

On the top of the figure, all four volatilities are positive emotions, and the bottom are negative emotions. Overall, most emotional valences showed a downward trend in the early two months, as shown by the red dotted line, except for anxiety and hate. There were two main findings:

(1) There was an upward trend of the negative emotions: hate and anxiety. Hate maintained a higher valence from 20 January, when academician Nanshan Zhong announced the phenomenon of “human-to-human” transmission, where we pinpoint a red dot in the figure. This news also triggered anxiety to an obvious increase, with the greatest anxiety identified on 29 January, when all 31 provinces in mainland China launched a first-level response to public health emergencies; thus, we marked them as well.

Notably, these two events belong to the topic of pandemic news and data reports and scientific breakthroughs and knowledge dissemination, which correspond to the emotion arousal path of Aspects of Objects and Consequence of Events. Thus, we believe that hate and anxiety, which differ from other emotions in volatility trends, are more sensitive and accurate in emotional cause tracing.

(2) Almost all positive emotions showed a significant downward trend, where the red dotted lines appear in a similar direction in January. It was on 20 January that all positive emotions dropped sharply in the black volatility line, and almost all of them reached the lowest valence as a result. The difference is that the spike of surprise was quite early on 4 January, when Wuhan reported a total of 44 unexplained cases of viral pneumonia. In the emotion arousal path, surprise was identified as a compound appraisal both of Actions of Agents and Consequence of Events; thus, tied to the reaction of agents and events, where Wuhan was the agent for the public in focus. Furthermore, we also found that the trends of sorrow and anger were similar to those of the positives, possibly because the national pandemic prevention measures eased the emotion of the public.

We need to determine the reasons for these abrupt emotional changes based on the LAED model and cognitive emotion arousal path. Although we observed valence according to the results of daily statistics, the abnormal time point cannot be specifically identified owing to the lack of data on a daily basis. We present the model effect of emotional abrupt time point detection based on hourly intervals in months, and use the January data, a total of 744 h, as an example. The reason we selected January data was because COVID-19 was unknown during the initial stage. It provided a comparison of emotional volatility before and after policy control and social governance.

Based on the results of our previous analysis, hate, anxiety, and surprise were found to be attractive because they are different from other emotions. In our model detection results (shown in Figure 7, Figure 8 and Figure 9), the lower yellow part represents the threshold position based on the logPD, the orange and purple mark shows the real and predicted emotional valence, and the error is expressed as a shaded green area where there is the abnormal time interval.

Figure 7.

Figure 7

Hate Volatility and Abrupt Time Points.

Figure 8.

Figure 8

Anxiety Volatility and Abrupt Time Points.

Figure 9.

Figure 9

Surprise Volatility and Abrupt Time Points.

Hate has an obvious valence rise process, and the overall trend of volatility shows several key event detections in Figure 7. In the position of the x axis around 380, the first higher position of hate, the National Health Commission did not rule out the possibility of human-to-human transmission and thus triggered a significant hate valence increase among the public. Then, around the 490 position, the emotion surge was reignited owing to the human-to-human transmission. The follow-up was identified around position 580, when the global high-risk events were defined by the WHO, and the Red Cross Society reported more events. The positions of the emotional time points were seen to correspond completely with the pandemic public opinions.

As pointed out in Section 2, anxiety was the main emotion seen in public psychology during COVID-19. We can see in Figure 8 that the overall trend is gradually decreasing and slowing down. The initial spike occurred on the timeline around 160, when the WHO declared it was not SARS and MERS. Subsequently, news of “human-to-human” and news of bat origin sparked a rise that led to public anxiety.

Although the intuitive sense in COVID-19 is the dominance of negative emotions, our government is always issuing policies, opening information, and vigorously promoting positive emotions. Surprise, the volatility varying greatly in Figure 9, shows the public exhibited surprise at the timely release of prevention and control measures.

Many spikes and abnormalities appear after each data update and the diagnosis treatment, which indicates that the public paid attention to the world around them as well as the pandemic prevention plan when quarantined at home. In addition, we found that the valence at night (in a day) is much greater than during daytime trading. Although the government announced relevant news and policies in the daytime, the media generally reported it to the public on the evening news. Considering the spread of information transmission, the fluctuations of public emotions were especially apparent between 10:00 p.m. and 12:00 p.m. This situation was identified in all eight emotions.

4.2. Tracking Effect of Triggered Causes

We show that the emotional distribution of events is an important factor that can be traced back to the overall emotional response on each particular day. We can also visualize the distribution of eight-dimensional emotions (as shown in Figure 10). In particular, on 23 January, the daily distribution was almost the same as that of the Wuhan closure event, with a KL value of 0.0098, which indicates that this event was the biggest factor affecting the public emotion on that day.

Figure 10.

Figure 10

Emotional Distribution of the Days and Public Events.

Table 4 illustrates the ranking of typical public opinion events corresponding to emotional arousal paths. We randomly selected three days of candidate events to demonstrate the validity of the emotional path.

Table 4.

Performance of KL Value of Candidate Events for Three Days.

Time Point Emotion Type Public Opinion Events Topics Path KL Value
20-Jan surprise
sorrow
joy
hate
anxiety
Inclusion into class B infectious disease and class A preventive and control measures Nation and local issued policy measures Consequence of Events 0.0982
The high-level panel of the National Health Commission answered questions Epidemic news and data report Aspects of Objects 0.0375
National Health Commission confirmed “human-to-human” transmission Scientific breakthroughs and dissemination Consequence of Events 0.0674
23-Jan anger
joy
expect
hate
At 10 a.m., Wuhan locked down Nation and local issued policy measures Consequence of Events 0.0098
Traffic in Wuhan was suspended Nation and local issued policy measures Consequence of Events 0.0786
Guangdong activated a first-level response to a major public health emergency The public took the initiative to protection Aspects of Objects 0.0907
Zhejiang activated a first-level response to a major public health emergency The public took the initiative to protection Aspects of Objects 0.0667
Civil railway announced waive cancellation fees State and local issued policy measures Consequence of Events 0.2835
The Ministry of Education would redeploy epidemic prevention and control work State and local issued policy measures Consequence of Events 0.0747
Star fans donated to Wuhan Medical care was on the front line Actions of Agents 0.0992
28-Jan surprise
hate
expect
anxiety
WHO declared the Novel Coronavirus a high risk globally Epidemic news and data report Aspects of Objects 0.0354
Novel coronavirus mRNA vaccine was officially under development State and local issued policy measures Consequence of Events 0.1568
The epidemic peaked in about a week or 10 days Scientific breakthroughs and dissemination Consequence of Events 0.1736
The Chinese Academy of Medical Sciences announced that bats are the origin of the virus Scientific breakthroughs and dissemination Consequence of Events 0.0245
The Ministry of Education postponed the opening of the school State and local issued policy measures Consequence of Events 0.0651

From the perspective of tracking triggered events, emotional valence can only be calculated from the content, and the similar emotional distribution can be accurately tracked to locate induced events. However, the premise exists that we have adequate candidate events; otherwise, there would be a lack of content of rank and the track of reason.

Therefore, we solved the problem of insufficient data in the form of an emotion arousal path by monitoring the abnormal emotions. Based on the comparison of candidate events and target topics from the emotion arousal path over the preceding three days, we compared the results after human cognition of emotional psychology (Table 5). This showed the ranking to be effective and similar to the actual situation using this cognitive method.

Table 5.

The Ranking Comparison of Events and Emotional Paths.

Abrupt Time Point Events in Media Events in Dataset Topics Event MAP Topic MAP
20-Jan 6 3 3 0.333 0.667
23-Jan 9 7 3 0.429 1
28-Jan 18 5 3 0.6 0.667

4.3. Correlation between the Number of Infected People and Emotions Exhibited

To more accurately measure the role of specific discrete emotions in the study, we explored the emotional volatility through algorithms and found the spike point in a previous study. Next, we attempted to find the exact discrete emotions according to the number of infected people. For this, we used data on the number of infected persons released by the National Health Commission (since 13 January). We compared the Pearson correlation coefficient between infected persons and discrete emotions and found some interesting phenomena.

We first construct the correlation heat map for January in Figure 11 (shown on the left). The deeper shade of red represents positive correlation, and the deeper shade of blue represents negative correlation. By comparing the cumulative number of infected persons, the number of deaths, the number of cured persons, and the number of suspected cases, the calculation results showed that anxiety was significantly positively correlated with the number of infected persons, love was significantly negatively correlated (p < 0.01), and surprise was negatively correlated with the number of infected persons (p < 0.05). When time shifts to late January, on 31 January, the total number of confirmed cases was over 10,000, while the number of suspected cases exceeded 20,000.

Figure 11.

Figure 11

Heat map of correlation between the number of infected people and emotions exhibited. ** p < 0.01, * p < 0.05.

Next, we show the correlation for February in Figure 11 (shown on the right), and the calculation results show that sorrow and anxiety were positively correlated with the cumulative number of infections, while love was still significantly negatively correlated. However, there was an inversion of the results in suspected cases, and love presented a significant positive correlation. This conclusion suggests that in the initial stage of the pandemic, anxiety was dominant among the public because of the sudden incidence and strange variations caused. However, as the pandemic continued for a period of time, the increase in the number of confirmed cases led to increased sadness, and the correlation of anxiety decreased from the perspective of numerical and color responses. At the same time, the increase in suspected cases also indicated that the suspected cases were still under medical observation and not confirmed, which meant that the country was stepping up nucleic acid testing, in addition to implementing prevention and control efforts, and so the psychological feeling of security promoted correlation of the positive emotions.

5. Discussion

Our research summarized the evolution the law of public emotions in public health emergencies. However, the labeled anomaly of the emotion abrupt time point is rare, which is always a major challenge to the anomaly detection task. Moving forward, we will stick to our social-emotional perception model to discover other potentials in public health, and conduct extensive experiments on other domains and tasks, such as public safety, to further validate the model’s feasibility and stability.

6. Conclusions

Our study focused on monitoring and measuring public emotions embedded in the social media context during public health emergencies, and provided three contributions: (1) the construction of a discrete emotion lexicon representing the public under the state of the COVID-19 pandemic, (2) the accurate detection of emotion abrupt time points for different discrete emotions affecting the public, and (3) the location of specific emotional causes that induced abnormal emotion following the emotional arousal path, using the OCC emotion model. The public emotional perception model has a more prepared traceability scheme for the induced causes and has confirmed that information disclosure including prevention and control measures have driven positive emotions, such as love and expect.

Author Contributions

S.J.: conceptualization, methodology, software, writing—original draft; H.Z.: software, validation, writing—reviewing and editing; J.Q.: supervision, writing—reviewing and editing, project administration, funding acquisition; B.F.: supervision; T.X.: validation, visualization, writing—reviewing and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China (No. 72042004).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data can be found here: ai.suibe.net (accessed on 22 April 2021).

Conflicts of Interest

The authors declare no conflict of interest.

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Zhang Y., Sun Y., Kim Y. The influence of individual differences on consumer’s selection of online sources for health information. Comput. Hum. Behav. 2017;67:303–312. doi: 10.1016/j.chb.2016.11.008. [DOI] [Google Scholar]
  • 2.Jiménez Ó., Sánchez-Sánchez L.C., García-Montes J.M. Psychological Impact of COVID-19 Confinement and Its Relationship with Meditation. Int. J. Environ. Res. Public Health. 2020;17:6642. doi: 10.3390/ijerph17186642. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Jungmann S.M., Witthft M. Health anxiety, cyberchondria, and coping in the current COVID-19 pandemic: Which factors are related to coronavirus anxiety? J. Anxiety Disord. 2020;73:102239. doi: 10.1016/j.janxdis.2020.102239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Garfin D.R., Silver R.C., Holman E.A. The novel coronavirus (COVID-2019) outbreak: Amplification of public health consequences by media exposure. Health Psychol. 2020;39:355–357. doi: 10.1037/hea0000875. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Jelodar H., Wang Y., Orji R., Huang H. Deep Sentiment Classification and Topic Discovery on Novel Coronavirus or COVID-19 Online Discussions: NLP Using LSTM Recurrent Neural Network Approach. IEEE J. Biomed. Health Inform. 2020;24:2733–2742. doi: 10.1109/JBHI.2020.3001216. [DOI] [PubMed] [Google Scholar]
  • 6.Yang Z., Ouyang T., Xiangling F.U., Peng X. A decision-making algorithm for online shopping using deep-learning–based opinion pairs mining and q-rung orthopair fuzzy interaction Heronian mean operators. Int. J. Intell. Syst. 2020;35:783–825. doi: 10.1002/int.22225. [DOI] [Google Scholar]
  • 7.Lerner J.S., Tiedens L.Z. Portrait of the angry decision maker: How appraisal tendencies shape anger’s influence on cognition. J. Behav. Decis. Mak. 2010;19:115–137. doi: 10.1002/bdm.515. [DOI] [Google Scholar]
  • 8.Hridoy S.A.A., Ekram M.T., Islam M.S., Ahmed F., Rahman R.M. Localized twitter opinion mining using sentiment analysis. Decis. Anal. 2015;2:8. doi: 10.1186/s40165-015-0016-4. [DOI] [Google Scholar]
  • 9.Chung W., Zeng D. Dissecting emotion and user influence in social media communities: An interaction modeling approach. Inf. Manag. 2020;57:103108. doi: 10.1016/j.im.2018.09.008. [DOI] [Google Scholar]
  • 10.A New ANEW: Evaluation of a Word List for Sentiment Analysis in Microblogs. [(accessed on 20 September 2017)]; Available online: https://arxiv.org/abs/1103.2903.
  • 11.Peng H., Cambria E., Hussain A. A Review of Sentiment Analysis Research in Chinese Language. Cogn. Comput. 2017;9:423–435. doi: 10.1007/s12559-017-9470-8. [DOI] [Google Scholar]
  • 12.Thelwall M. TensiStrength: Stress and relaxation magnitude detection for social media texts. Inf. Process. Manag. 2016;53:106–121. doi: 10.1016/j.ipm.2016.06.009. [DOI] [Google Scholar]
  • 13.Lin H., Jia J., Qiu J., Zhang Y., Shen G., Xie L., Jie T., Ling F., Chua T.S. Detecting Stress Based on Social Interactions in Social Networks. IEEE Trans. Knowl. Data Eng. 2017;29:1820–1833. doi: 10.1109/TKDE.2017.2686382. [DOI] [Google Scholar]
  • 14.Xu H., Kai Z., Qiu L., Hu C. Expanding Chinese sentiment dictionaries from large scale unlabeled corpus; Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation; Sendai, Japan. 4–7 November 2010; pp. 301–310. [Google Scholar]
  • 15.Fu X., Ouyang T., Chen J., Luo X. Listening to the investors: A novel framework for online lending default prediction using deep learning neural networks. Inf. Process. Manag. 2020;57:102236. doi: 10.1016/j.ipm.2020.102236. [DOI] [Google Scholar]
  • 16.Quan C., Ren F. Construction of a blog emotion corpus for Chinese emotional expression analysis; Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing; Singapore. 6–7 August 2009; pp. 1446–1454. [Google Scholar]
  • 17.Qi J., Fu X., Ge Z. Subjective Well-being Measurement based on Chinese Grassroots Blog Text Sentiment Analysis. Inf. Manag. 2015;52:859–869. doi: 10.1016/j.im.2015.06.002. [DOI] [Google Scholar]
  • 18.Asmundson G.J.G., Taylor S. How health anxiety influences responses to viral outbreaks like COVID-19: What all decision-makers, health authorities, and health care professionals need to know. J. Anxiety Disord. 2020;71:102211. doi: 10.1016/j.janxdis.2020.102211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Li X., Feng S., Wang D., Zhang Y. Context-aware emotion cause analysis with multi-attention-based neural network. Knowl. -Based Syst. 2019;174:205–218. doi: 10.1016/j.knosys.2019.03.008. [DOI] [Google Scholar]
  • 20.Ortony A., Clore G.L., Collins A. The Cognitive Structure of Emotions. Contemp. Sociol. 1988;18:2147–2153. [Google Scholar]
  • 21.Gao K., Xu H., Wang J. A rule-based approach to emotion cause detection for Chinese micro-blogs. Expert Syst. Appl. 2015;42:4517–4528. doi: 10.1016/j.eswa.2015.01.064. [DOI] [Google Scholar]
  • 22.Tan S., Yang L., Sun H., Guan Z., He X. Interpreting the Public Sentiment Variations on Twitter. IEEE Trans. Knowl. Data Eng. 2014;26:1158–1170. [Google Scholar]
  • 23.Sun X., Zhang C., Li G., Sun D., Ren F., Zomaya A., Ranjan R. Detecting Users’ Anomalous Emotion Using Social Media for Business Intelligence. J. Comput. Sci. 2018;25:193–200. doi: 10.1016/j.jocs.2017.05.029. [DOI] [Google Scholar]
  • 24.Giachanou A., Mele I., Crestani F. Explaining Sentiment Spikes in Twitter; Proceedings of the 25th ACM International Conference on Information and Knowledge Management, CIKM 2016; Indianapolis, IN, USA. 24–28 October 2016. [Google Scholar]
  • 25.Sheridan D.P., Decker H.K., Kloumann I.M., Bliss C.A., Danforth C.M., Johan B. Temporal patterns of happiness and information in a global social network: Hedonometrics and Twitter. PLoS ONE. 2011;6:e26752. doi: 10.1371/journal.pone.0026752. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.LSTM-Based Encoder-Decoder for Multi-Sensor Anomaly Detection. [(accessed on 5 May 2020)]; Available online: https://arxiv.org/abs/1607.00148.
  • 27.Kullback S., Leibler R.A. On information and sufficiency. Ann. Math. Stat. 1951;22:79–86. doi: 10.1214/aoms/1177729694. [DOI] [Google Scholar]
  • 28.Jiang S., Qi J. Cognitive Detection of Multiple Discrete Emotions from Chinese Online Reviews; Proceedings of the 2016 IEEE 1st International Conference on Data Science in Cyberspace, DSC 2016; Changsha, China. 27 February 2017; pp. 137–142. [Google Scholar]
  • 29.Frijda N.H. Emotion, Cognitive Structure, and Action Tendency. Cogn. Emot. 1987;1:115–143. doi: 10.1080/02699938708408043. [DOI] [Google Scholar]
  • 30.Roseman I.J. Cognitive determinants of emotion: A structural theory. Rev. Personal. Soc. Psychol. 1984;5:11–36. [Google Scholar]
  • 31.Richins M.L. Measuring Emotions in the Consumption Experience. J. Consum. Res. 1997;24:127–146. doi: 10.1086/209499. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data can be found here: ai.suibe.net (accessed on 22 April 2021).


Articles from International Journal of Environmental Research and Public Health are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES