Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2022 Sep 16;17(9):e0274247. doi: 10.1371/journal.pone.0274247

Emotional discourse analysis of COVID-19 patients and their mental health: A text mining study

Yu Deng 1,#, Minjun Park 2,#, Juanjuan Chen 3, Jixue Yang 4, Luxue Xie 4, Huimin Li 4, Li Wang 5,#, Yaokai Chen 6,*
Editor: Giuseppe Carrà7
PMCID: PMC9481002  PMID: 36112638

Abstract

COVID-19 has caused negative emotional responses in patients, with significant mental health consequences for the infected population. The need for an in-depth analysis of the emotional state of COVID-19 patients is imperative. This study employed semi-structured interviews and the text mining method to investigate features in lived experience narratives of COVID-19 patients and healthy controls with respect to five basic emotions. The aim was to identify differences in emotional status between the two matched groups of participants. The results indicate generally higher complexity and more expressive emotional language in healthy controls than in COVID-19 patients. Specifically, narratives of fear, happiness, and sadness by COVID-19 patients were significantly shorter as compared to healthy controls. Regarding lexical features, COVID-19 patients used more emotional words, in particular words of fear, disgust, and happiness, as opposed to those used by healthy controls. Emotional disorder symptoms of COVID-19 patients at the lexical level tended to focus on the emotions of fear and disgust. They narrated more in relation to self or family while healthy controls mainly talked about others. Our automatic emotional discourse analysis potentially distinguishes clinical status of COVID-19 patients versus healthy controls, and can thus be used to predict mental health disorder symptoms in COVID-19 patients.

Introduction

A deadly global pandemic is expected to be associated with various emotions in different people, and might cause negative emotional responses, with serious consequences for mental health [1, 2]. The emergence of the COVID-19 (coronavirus disease 2019) has raised significant questions about the mental health burden of the population relating to social restrictions, lockdowns, school and business closures, loss of livelihoods, decreases in economic activity, and shifting priorities of governments in their attempt to control COVID-19. Notably, COVID-19 may result in significant physical and mental consequences among those who become infected. While the physical treatment modalities for COVID-19 have received extensive attention, the need for exploration of the mental health impact of SARS-CoV-2 infection on patients has never been more urgent [3].

After the outbreak of the current COVID-19 pandemic, a growing body of survey studies concerning the emotional state and mental health of COVID-19 patients have shown that patients afflicted by COVID-19 experience post-traumatic stress disorder, boredom, loneliness, anger, insomnia, anxiety, depression, and distress [410]. What the preceding studies have in common is that they show that the acquisition of COVID-19 leads to an increase in negative emotions and a decrease in positive emotions for infected patients.

While a considerable number of surveys have been conducted to investigate the mental health problems of COVID-19 patients, it seems that the lived experience narrative discourse concerning the dynamic psychological and emotional experiences of COVID-19 patients are currently underestimated (e.g., [1120]). Notably, Missel et al. [14] used telephone interviews to explore the lived experiences of 15 individuals infected with COVID-19 in Denmark. Qualitative analysis demonstrated that being diagnosed with COVID-19 was seen as a threat to patients’ existence and to their bodily perception, as well as to social relationships. COVID-19 patients experience unpleasant emotions given the negative social interactions that COVID-19 causes. Venturas et al. [19] conducted inductive, in-depth interviews with 11 hospitalized COVID-19 patients in Spain, and their phenomenological qualitative results indicated that although patients were uncertain about the coronavirus and felt frustrated about hospital isolation, they positively adapted to the situation and felt confident and safe in the care of medical staff. Efficient high-tech communication with relatives had somehow alleviated their negative emotions. Son et al. [16] interviewed 16 discharged COVID-19 patients in South Korea, and the phenomenological interpretation of their patient narratives showed that COVID-19 patients experienced social stigma and feelings of guilt due to negative attitudes from society and social media. However, the patients developed positive emotions through the social support of families and friends. In the context of China, Sun et al. [17] conducted semi-structured interviews with 16 COVID-19 patients from Henan, China. Patients talked about their feelings during hospitalization and isolation. Results revealed diachronic changes to their emotional experiences. Specifically, patients’ emotional responses toward the infection, which included fear, denial, and stigma during the acute phase of infection, gradually changed into positive emotions (i.e., acceptance, trust) in the later recovery phase. Through a qualitative study of the lived experience narratives of 16 hospitalized COVID-19 survivors from Nanning, China, Wu et al. [20] identified anxiety, trauma, and self-stigmatization among COVID-19 patients. Their negative emotional responses were strongly associated with social interactions, suggesting that it is urgent to minimize negative social impacts on COVID-19 infected individuals. Li et al. [13] used a phenomenological approach in a qualitative study of 13 hospitalized COVID-19 patients in Wuhan. Inductive thematic analysis of patient narrative discourse showed that COVID-19 patients experienced negative emotions such as confusion, uncertainty, worry, and guilt. The negative emotions mainly concerned anxiety regarding social discrimination and poor financial security. Their positive emotions were aligned with expectations of life after recovery, abundant social support from healthcare workers and family, as well as the strength of the government. Deng et al. [11] conducted semi-structural interviews with 6 re-positive patients from Wuhan and Chongqing to elicit their narrative discourse of recurrent COVID-19 infections. Their results revealed that SARS-CoV-2 nucleic test re-positive patients demonstrated emotional problems such as anxiety, depression, irritation, stress, mistrust, insomnia, suicidal tendency, grief, panic, and worry. Social isolation, lack of emotional support, and negative news in the mass media were the main risk factors leading to their mental health problems.

Clearly, the narrative discourse of the emotional experience of COVID-19 acquisition encompasses rich personal information in linguistic form [21]. Published literature concerning the lived experiences of COVID-19 patients reveals that emotional discourse is an important marker providing information about patients’ internal mental representations of the COVID-19 pandemic. To date, the emotional discourse analysis of patients’ lived experiences of COVID-19 extensively focuses on the qualitative approach [22], whereas the quantitative approach is currently underrepresented. Recent studies have used natural language processing (NLP) techniques to extract real-world text data concerning COVID-19 from online resources such as Twitter, Facebook, and Reddit, in order to provide automated population-level health surveillance [2326]. For instance, Crocamo et al. [23] conducted sentiment analysis of 3,308,476 English tweets concerning COVID-19 discussions between January 19 and March 3, 2020, by computing a polarity compound score and using a transformer-based model. Their results demonstrated an increasing trend towards negative sentiment as the COVID-19 pandemic proceeded. Low et al. [24] used NLP techniques to detect changes in mental health support groups (e.g., Schizophrenia, Suicide Watch, Depression) and non-mental health groups (e.g., Personal Finance, Conspiracy Theory) at the outbreak of COVID-19. Drawing on the Reddit data of 826,961 users by using unsupervised methods such as topic modeling and clustering, Low et al. found that support groups related to attention-deficit/hyperactivity disorder, eating disorders, and anxiety showed the most negative semantic changes during COVID-19. Health anxiety was found to be a salient theme across Reddit through machine learning analyses. Patel et al. [25] used data mining methods to compute 739,434 COVID-19 related posts by 53,134 users from online health forums such as Health Boards and Inspire & Health Unlocked from 1 January 2020 to 31 May 2020. Their results showed that the global lockdown policy had led to an increase of discussion related to COVID-19-related disorders, and 25% of the COVID-19-related titles mentioned an associated physical or mental health comorbidity. Viviani et al. [26] conducted word identity and sentiment analysis of vulnerability to psychological distress in the COVID-19 context by gathering texts related to COVID-19 on the Twitter microblogging platform between August and December 2020. The automatic analysis indicated that the social distancing scenario was associated with ‘annoyance’ and ‘protest’, while the vaccine scenario was linked to ‘killing’ and ‘murder’. Regarding anxiety, the target scenarios of social distancing and symptoms & hospitalization were closely related to risk, worry, and fear. As for vulnerability scores in the word categories, the lexicon identifiers regarding greater psychological vulnerability fell into the ‘anger’ and ‘risk’ categories, in particular for social distancing and vaccines & vaccinations. There was a large increase in vulnerability identifiers in the ‘death’ category shared by the ‘symptoms & hospitalization’ scenario.

Text mining based on real-world data is a sensitive method to reveal emotional and mental health burden during COVID-19, given that it provides an effective marker to identify the mental status of vulnerable groups and alarming themes during the ongoing pandemic [24]. It is noteworthy that the automatic analysis of COVID-19 patients’ texts is somehow overlooked in the literature. Along this line of enquiry, the present study combined in-depth semi-structured interviews with text mining methods to investigate features in lived experience narratives regarding COVID-19 in persons with COVID-19 and in healthy controls related to five basic emotions. Our aim was to identify differences in emotionality between COVID-19 patients and healthy controls based on narrative discourse, in which the subjects describe their past emotional experiences of COVID-19. We hypothesize that mental health variables can be inferred across different emotions through linguistic markers in the narrative discourse [27, 28].

Materials and methods

Study design

The present study combined qualitative and quantitative approaches in data collection and text analysis. The narrative data emerged from a larger cross-sectional project, i.e., “Lived-Experience Narratives and Mental Health of People in China during the COVID-19 Pandemic” [1112, 22, 29]. The project used semi-structured interviews to collect narrative discourses concerning lived experiences of COVID-19 in China, from the period from 30 June 2020 to 31 January 2021. The project teams recruited participants by convenience sampling. COVID-19 patients were randomly recruited from Chongqing Public Health Medical Center, China. The inclusion criteria for this study included those COVID-19 patients who had been hospitalized for at least 14 days and who were required to quarantine at home or at a hotel for a further 14 days after being discharged from hospital [12]. The healthy controls, who had experienced the first wave of the COVID-19 pandemic but were not infected by SARS-CoV-2, were randomly selected from citizens of Wuhan, China. Due to the COVID-19 prevention policies, telephone interviews were employed to ensure the personal safety of the interviewer and participants [20]. Participants’ narrative discourses of lived experiences involved multiple dimensions of their emotional and mental health status. The term “lived experiences” designates the phenomenological tradition regarding experiences of the everyday life world. Such lived experiences are prereflective and less available to our awareness [14]. Thus, the lived experience narratives are capable of capturing areas of participants’ mental health status that cannot be explored by the survey method [12, 14]

The lived experience narrative discourse was then annotated qualitatively according to the five primary emotions [21] and the 21 emotional subcategories in the Chinese Affective Lexicon Ontology [30]. In the discourse that follows, the generic features of the emotional narratives (i.e., mean word-length, words per sentence, sentences per narrative, and words per narrative), frequency of emotional words, and significant word identity features between the two matched groups were computed quantitatively by text mining methods. Based on text analysis of emotional narratives, differences between persons infected with COVID-19 and healthy controls were measured with respect to each of their mental health statuses during the COVID-19 pandemic.

Data collection

We collected lived experience narratives from 58 participants (34 persons infected with COVID-19 and 24 healthy controls). Demographic and clinical information for this cohort is provided in Table 1. Written informed consent was obtained from all individual participants before their interview. Ethics committee approval was received for this study from the ethics committee of the Ethics Committee of Chongqing Public Health Medical Center (ID: 2020-048-02-KY). All procedures involving human participants were in accordance with the ethical standards of the institutional and national research committees and with the Helsinki declaration and its later amendments, or with comparable ethical standards.

Table 1. Participant information.

Variables Full sample (N = 58) COVID-19 patients (N = 34) Healthy Controls (N = 24)
Gender: male 29 18 11
Gender: female 29 16 13
Marital status: married 34 24 10
Marital status: unmarried 22 8 14
Marital status: divorced 2 2 N/A
Mean age (S.D.) 34.42 (10.70) 38.82 (9.95) 27.91 (8.28)
Mean time in hospital (S.D.) 17.29 days (13.67) N/A

Participants attended the in-depth semi-structural interview by telephone, as paid volunteers. They were asked to narrate their personal experiences and feelings concerning COVID-19 during the interview. The interview domain concerned multiple dimensions of their lived experience during COVID-19, such as work and daily life, social isolation, family support, social relationships, attitude towards death, comments on medical service, and national policy [11, 12, 22, 29]. Patient interview questions included “How were you infected with COVID-19”, “What did you do before, during, and after hospitalization?”, “How were your work and daily life impacted due to the infection?”, “How did you feel when you were isolated in the hospital?”, “What gave you comfort when you were isolated?”, “What changes in perceptions of the world do you have?”, “How did you deal with social relationship after discharge”, “What was your attitude toward death after the contraction of COVID-19?”, “What was the most unforgettable event you experienced during the pandemic?”, “What is your opinion about the role our country and medical staff play in combating COVID-19?”[11, 12, 22]. Healthy controls’ interview questions included the following: “What did you do at the outbreak of COVID-19?”, “How was your work and daily life impacted by the pandemic?”, “What gave you comfort when you felt frustrated?”, “What changes in perceptions of the world did you have?”, “How did you feel in witnessing death every day?”, “How did you feel when the city was locked down?”, “What was the most unforgettable event you experienced during the pandemic?”, “What was your attitude toward death during COVID-19?”, “What is your opinion about the role our country and medical staff play in combating COVID-19?” [29].

The telephone interviews were conducted during two periods, namely from June 2020 to August 2020 and from October 2020 to January 2021, by two researchers specializing in psychology and psycholinguistics. The interviews were arranged during summer and autumn-winter as dictated by participants’ requirement and their pandemic situation. Each interview lasted approximately 40–60 minutes, and was audio-recorded. The recorded audio files were transcribed verbatim by the interviewers to allow for text analysis.

Data processing

All the recorded interview files were transcribed verbatim. The raw COVID-19 interview corpus consisted of 326,202 words. The raw data encompassed conversations between the interviewer and interviewees. Given that the aim of this study was to investigate the narrative discourse of participants, we used a Python script to exclude interviewer questions and utterances, leaving behind only those narratives of the interviewees. Furthermore, typographical errors and inconsistencies occurring among different transcribers were corrected. We obtained a corpora sample size of 167,795 word tokens for the 34 COVID-19 patients, and a sample size of 121,372 word tokens for the 24 healthy controls. Emotional narrative tagging and text preprocessing were conducted for the purpose of measuring sentiment distribution and linguistic features between the two matched groups.

Emotional narrative tagging

Based on the text of the interviewees discourses, three researchers conducted sentiment classification and manual annotation of the emotional narratives [22]. In distinguishing universal emotions, Ekman [31] classified the six basic emotions as anger, disgust, fear, happiness, sadness, and surprise. Hong et al. [21] optimized Ekman’s classification as five types—happy, angry, sad, fear, and disgust. Based on Ekman’s six basic emotions, Xu et al. [30] established the Chinese Affective Lexicon Ontology, which includes seven basic traditional Chinese emotions called qi qing七情 (i.e., fear 惧, anger 怒, joy 乐, sadness 哀, disgust 恶, surprise 惊, and good 好), and 21 emotional subcategories over 27,466 lexical entries. The 21 emotional subcategories have largely extended the range of emotional experience analysis. To minimalize the basic emotions and enlarge the subcategories of emotions, the present study combined the five primary emotions of Hong et al. [21] and the 21 emotional subcategories of Xu et al. [30] in emotional narrative tagging (Table 2). We first manually coded the emotional narratives, which referred to each discourse unit that encompassed congruent topics or emotions within the interview texts [21, 22]. Three researchers coded the discourse unit of emotional narratives together. Inconsistencies and disagreements were settled by a fourth researcher. Five translated exemplars of emotional narratives are shown below.

Table 2. Emotional categorization and labels [22].

No Subcategorization [30] Primary Emotions [21] Polarity
1 Joy (PA) HAPPY Positive
2 Comfort (PE) HAPPY Positive
3 Respect (PD) HAPPY Positive
4 Praise (PH) HAPPY Positive
5 Trust (PG) HAPPY Positive
6 Like (PB) HAPPY Positive
7 Wish (PK) HAPPY Positive
8 Angry (NA) ANGRY Negative
9 Upset (NB) SAD Negative
10 Disappointed (NJ) SAD Negative
11 Guilty (NH) SAD Negative
12 Grief (PF) SAD Negative
13 Panic (NI) FEAR Negative
14 Dread (NC) FEAR Negative
15 Shame (NG) FEAR Negative
16 Depressed (NE) FEAR Negative
17 Hate (ND) DISGUST Negative
18 Criticize (NN) DISGUST Negative
19 Envious (NK) DISGUST Negative
20 Suspect (NL) FEAR Negative
21 Surprise (PC) FEAR/HAPPY Negative/Positive

Note: The initials in brackets are the original ID tags in Xu et al.,’s [30] emotion dictionary. P stands for positive and N for negative.

++HAPPY (PH/PD) ++: The medical staffs were so great. I found that they were young girls when they took off the protective equipment. They persisted in helping the COVID-19 patients and were so nice to patients. They are so great and selfless in our eyes. (Patient 2)

++SAD (NB) ++: On the way to the isolation hospital, I cried all the way and felt extremely upset. (Patient 3)

++ANGRY (NA) ++: The internet is convenient. My personal information was exposed on the net. This made me irritated. I showed them the official document of negative test result for COVID-19. However, people in our community still refused to let me in. I felt extremely angry at that time. They showed discrimination to persons who had contracted COVID-19. (Patient 27)

++FEAR (NC) ++: I felt very fearful, extremely fearful. My heart beat fiercely. I felt that my heart beat the fastest in my life. I was very nervous and scared. I felt at a loss. My heat beat even 240 times per minute at that time. (Patient 6)

++DISGUST (ND) ++: I vomited when I smelled the flavor of the medicine. I kept taking the medicine, but vomited all the time. The doctor prescribed some anti-omitting medicine for me, but it did not work. Every time I saw the medicine, I felt disgusted and uncomfortable. (Patient 7)

As these excerpts show, emotional narratives were classified in terms of five basic emotions and 21 emotional subcategories, by using five primary emotional tags plus subcategories, namely ++HAPPY (x)++, ++SAD(x)++, ++ANGRY(x)++, ++FEAR(x)++, and ++DISGUST(x)++. For narratives lacking emotion, the tag ++EMPTY++ was used.

Text preprocessing

The structured narrative corpus was then subjected to grammatical processing. First, individual discourses were divided into sentences and paragraphs based on punctuation marks. Word tokens, which are fundamental linguistic units, were obtained through word segmentation of individual sentences. It is worth noting that Chinese word segmentation differs from English tokenization due to the absence of a distinct boundary between individual Chinese words. Hence, the Chinese word segmentation task was performed using the LTP toolkit of Harbin Institute of Technology [32], and incorrectly segmented parts of sentences were corrected manually. The segmented corpus was used for the analysis of generic features and word identity features of emotional narratives between the two matched groups. Second, regarding the analysis of sentiment polarity and 5/21 types of emotional words, irrelevant functional words such as prepositions, auxiliary verbs, and conjunctions were removed from the corpus in order to maximize the effect of sentiment analysis. Functional words are syntactically meaningful, but undermine the macroscopic semantic interpretation of the emotional words. By removing the preceding ‘noise‘ elements, we obtained a sample size of 141,878 words (functional words excluded) for the patient group, and a sample size of 106, 499 words (functional words excluded) for the healthy controls.

Measures

Based on the preprocessed corpus labeled with emotional tags, quantitative analysis of the emotional narrative followed. The first step involved automated analysis of generic features of the emotional narratives of COVID-19 patients and healthy controls, including mean word-length, words per sentence, sentences per narrative, and words per narrative. In the second step, sentiment analysis was implemented by calculating the frequency of emotional words. In doing so, differences regarding polarity and intensity of emotions were exposed between patients and controls. Lastly, significant features of word identity across the emotional narratives were compared between the two matched groups.

Analysis of generic features of emotional narratives

In the present study, generic features of emotional narratives, including Type-Token ratio, mean word-length, average number of words per sentence, number of sentences per narrative, and number of words per narrative were measured among patients and healthy controls. The Type-Token ratio shows the number of unique words divided by the total number of words in the emotional narratives corpus. A lower ratio indicates more use of the same words in the text. Mean word-length refers to the average number of Chinese characters per word. Longer words are usually more complex and indicates better use of language. Regarding average number of words per sentence, longer sentences tend to be syntactically more complex and express more complex ideas. As for number of sentences per narrative, more sentences in a specific emotional narrative of similar length indicate the use of shorter, less complex sentences. Total number of words per narrative specifies word frequency in a specific emotional narrative [21].

The emotional narratives corpus was loaded into Python for generic feature analysis, drawing on the ‘CorpusReader’ class in NLTK (Natural Language Toolkit)—an open source Python library for natural language processing [33]. This package made file opening and stream control robust, and lent support to large-scale text retrieving and operating of the current corpus. As CorpusReader class in NLTK provides various text processing measurements (e.g., paras(), sents(), words()), it was used to automatically measure grammatical categories, such as word, sentence, and paragraph, in specific narratives of patients and controls (with fileids, categories argument). In this regard, average X per Y = len(X) / len(Y) was computed with various sub-functions of CorpusReader classes including reader.words(), reader.sents(), reader.paras() in conditions such as reader.words (categories = ‘patient’).

Analysis of frequency of emotional words

To compute sentiment statistics, we used the package called CategorizedCorpusReader.words(categories = ‘group’) in NLTK to calculate frequency of emotional words from specific group of interest (e.g., patient or control, positive or negative, HAPPY or SAD). First, words with no substantive meaning (functional words, punctuations) were filtered out, based on the stopwords list of Harbin Institute of Technology [32]. Next, frequency of emotional polarity and 5/21 categories of emotional words were extracted according to the Chinese Affective Lexicon ontology [30], by utilizing ‘probability.ConditionalFreqDist’ class in NLTK. By doing so, a conditional frequency distribution table was generated. ConditionalFreqDist (CFD) is essentially word frequency data stored in key:value pairs, where ‘key’ is the user-defined condition and ‘value’ is word frequency distribution (a ‘FreqDist’ class) of a group satisfying the condition. CFD is able to create individual ‘FreqDist’ objects with specific conditions of various sentiments (i.e., polarity, primary, or subcategorized emotions) and source of observation (i.e., patient or control).

Utilizing these tools, we conducted sentiment analysis by selecting the frequency of emotional vocabulary for each group according to different conditions. For instance, the occurrence of the vocabulary group of the primary emotion ‘DISGUST’ among healthy controls was retrieved with the query (CFD[’control’][’disgust’]). Other emotional types were measured in a similar way (e.g., CFD [‘patient’][‘NC’] calculated the frequency of fear related words among the patient group). Apart from searching for individual observation values, we used the plot function in ‘FreqDist’ and the Python library, ‘Matplotlob’, to visualize the frequency of different types of emotional words by groups.

Analysis of significant word identity features of emotional narratives

Significant word identity features, such as the probabilistic tendency of specific words, can mark mental status of COVID-19 patients as compared to healthy controls. In this study, the Log-likelihood ratio (LL) was adopted to measure the difference in word identity between the two sub-corpora. Table 3 shows the rationales of the comparison and mathematical formula [3436] that were implemented via ‘AntConc’ software. Table 4 shows a comparison of word identity in SAD narratives between the two matched groups. Looking at ‘not’ and ‘no’ in Table 4, the high LL value suggests that patients narrated more negative experiences or attitudes compared to healthy controls. The same is true of words such as ‘isolation’, ‘days’, ‘want’, ‘worry’, and ‘infectious’, which concerned patients’ personal experiences of infection, quarantine, and treatment. In contrast, words such as ‘Wuhan’, ‘life’, ‘world’, and ‘patient’ were used more frequently in healthy controls compared to the patient group, suggesting that healthy controls were more concerned with external circumstances such as the community and the country rather than themselves.

Table 3. Rationale of Log-likelihood ratio (LL) for word identity comparison [3436].
Patient Narratives (i*) Control Narratives (i*) Total
Frequency of target word (O*) a b a+b
Frequency of other words c-a d-b c+d-a-b
Total (*N) c d c+d
Table 4. Most distinctive words of SAD narratives between the two matched groups.
Word Feature Log-likelihood Patient (N = 34) Control (N = 24) Overuse/Underuse
Frequency Relative Frequency Frequency Relative Frequency
then 52.53 178 0.63% 226 1.30% -
not 48.14 609 2.16% 224 1.29% +
Wuhan 46.30 11 0.04% 48 0.28% -
anyway 43.98 70 0.25% 4 0.02% +
possibly 42.66 79 0.28% 123 0.71% -
isolation 41.81 80 0.28% 7 0.04% +
no 39.47 160 0.57% 34 0.20% +
for sure 32.94 61 0.22% 5 0.03% +
days 29.95 130 0.46% 29 0.17% +
want 28.81 192 0.68% 55 0.32% +
life 27.14 9 0.03% 32 0.18% -
sister 25.99 27 0.10% 0 0.00% +
world 25.98 1 0.00% 17 0.10% -
patient 25.64 7 0.02% 28 0.16% -
worry 25.57 34 0.12% 1 0.01% +
infectious 25.02 26 0.09% 0 0.00% +

Note: Log-likelihood values of 15.13 or higher are significant (d.f. = 1, p<0.0001)

2lnλ=2iOiln(OiEi),whereEi=NiiOiiNi

Results

Generic features of emotional narratives

The statistical results of generic features in the emotional narratives between COVID-19 patients and healthy controls are shown in Table 5. The type-token ratio (TTR) of vocabulary was used to measure lexical diversity [37]. Mean word-length was also an essential factor in detecting psychological processes of speakers in communication. With regard to mean word-length in sentences, emotionally traumatized patients tended to use longer sentences and irrelevant words for expression of a simple idea. In the literature, a smaller number of sentences in patient narratives and smaller number of words per narrative suggest possible frequent changes of topic caused by an unstable emotional state [21].

Table 5. Average value of generic features for COVID-19 patients and healthy controls.

Features Patient (N = 34) Control (N = 24) Welch’s t p
Type/token ratio 0.199 0.191 0.60 0.551 (df = 56.0)
Normalized 0.307 0.342 0.60 0.551 (df = 56.0)
Mean word-length 1.49 1.51 1.99 0.051 (df = 55.1)
Normalized 0.419 0.532 1.99 0.051 (df = 55.1)
Words/sentence 23.7 19.5 -5.75 < .001 (df = 52.6)
Normalized 0.510 0.247 -5.75 < .001 (df = 52.6)
Sentences/narrative 2.41 4.11 6.12 < .001 (df = 30.3)
Normalized 0.18 0.50 6.12 < .001 (df = 30.3)
Words/narrative 48.4 70.3 3.62 < .001 (df = 37.4)
Normalized 0.27 0.49 3.62 < .001 (df = 37.4)

Note: After min-max normalization, two-tailed, independent samples t-test

In our dataset, COVID-19 patients and healthy controls did not differ in the type-token ratio (t = 0.6, df = 56, p = 0.551). There was a slight difference in mean word-length, which was under the marginal significance level (t = 1.99, df = 55.1, p = 0.051). Notably, COVID-19 patients employed significantly more words per sentence (t = -5.75, df = 52.6, p<0.001), had significantly fewer sentences per narrative (t = 6.12, df = 30.3, p<0.001), and used significantly fewer words per narrative (t = 3.62, df = 37.4, p<0.001).

Detailed information for the number of words per narrative by the five basic emotions between COVID-19 patients and healthy controls is shown in Table 6 and Fig 1. When comparing overall narrative length by emotion, COVID-19 patient narratives of fear, happiness, and sadness were significantly shorter as compared to healthy controls. It is noteworthy that although narratives of anger and disgust were not statistically significant, as a whole, healthy controls were likely to express different emotions through a higher number of words as compared with patients, except for narratives of anger.

Table 6. The average number of words per narrative across five emotions.

Narrative types Patient (N = 34) Control (N = 24) Welch’s t p
ANGRY 91.8 71.8 0.990 0.327
DISGUST 73.6 90.8 -1.51 0.133
FEAR 72 102 -8.80 < .001
HAPPY 62.8 80.6 -5.71 < .001
SAD 62.9 96.5 -5.56 < .001

Fig 1. The average number of words per narrative across five emotions.

Fig 1

Frequency of emotional words

We focused on examining frequency distribution of emotional words related to sentiment polarity and the 5/21 emotional classification. The frequency distribution of emotional words between the two matched groups is shown in Figs 24. Regarding the overall distribution of the emotionality, COVID-19 patients generally used more emotional words than healthy controls, regardless of using negative or positive words (see Fig 2). For emotional words across the five emotions of happiness, fear, disgust, sadness, and anger, COVID-19 patients generally used more words of fear, disgust, and happiness, as compared to healthy controls (see Fig 3). To calculate the frequency of emotional words precisely, we computed the frequency of emotional words in terms of Xu et al.’s [30] 21-type classification. The results are shown in Fig 4. It is clear that COVID-19 patients used more emotional words of NC (dread), NE (depressed), NN (criticize), PA (joy), PG (trust), and PB (like) than the healthy controls. According to Table 2, emotional words of NC and NE fall into the emotion of fear, NN belongs to the emotion of disgust, and PA, PG, and PB are classified into the emotion of happiness. The result of the 21-type classification of emotional words was consistent with the 5-type classification, as seen in Figs 3 and 4. Taking the 5-type and 21-type classifications together, mental health issues of COVID-19 patients seem to have been concentrated on the emotions of fear and disgust. Given that COVID-19 patients had experienced the processes of infection, isolation, and recovery, their emotions fluctuated more dramatically when compared to healthy controls, as depicted in Fig 4. Although patients initially experienced frustrated emotional phases at the time of confirmed infection, their final recovery made COVID-19 patients happy and optimistic, and they trusted the government and medical staff in a positive emotional manner.

Fig 2. Frequency of positive and negative words between COVID-19 patients and healthy controls.

Fig 2

Fig 4. Frequency of the 21 types of emotional words between COVID-19 patients and healthy controls.

Fig 4

The 21-type ID tags: Joy (PA), Comfort (PE), Respect (PD), Praise (PH), Trust (PG), Like (PB), Wish (PK), Angry (NA), Upset (NB), Disappointed (NJ), Guilty (NH), Grief (PF), Panic (NI), Dread (NC), Shame (NG), Depressed (NE), Hate (ND), Criticize (NN), Envious (NK), Suspect (NL), Surprise (PC).

Fig 3. Frequency of emotional words across happy, fear, disgust, sad, and angry between COVID-19 patients and healthy controls.

Fig 3

Significant word identity features of emotional narratives

To demonstrate the significant features of word identity between COVID-19 patients and healthy controls, we computed the word identity features across narratives of five basic emotions by means of the Log-likelihood difference test. Results are shown in Table 7.

Table 7. Significant word identity features between COVID-19 patients and healthy controls.

Narratives of Anger
Features more common in Patients
    Words We
Feature more common in Controls
    Words Some, news, hmm, anger, Red Cross, very, will, should
Narratives of Disgust
Features more common in Patients
    Words We, eat, I
Feature more common in Controls
    Words Some, buy, Wuhan citizen, think, ah, more
Narratives of Fear
Features more common in Patients
    Words No, test, I, again, days, positive, return, nucleic acid, negative, illness, fever, infection, husband, isolation, nucleic acid test, scared, unknown, test result, nucleic acid test, result, hotel, always, nothing, reinfection, eat, airplane, confirmed infection, diagnose, night, mind, test, problems, blood, know, end, sister, call, Hubei, blood test, medicine, Blood Transfusion, now, positive result, infectious, others, return home, Nepal, blood test, Singapore, pass, cure, negative test result, inquiry, time, domestic, public health center, hotel, suspect, report, sleep, Antibody, month, worry, scream, shout, there, reinfection, indoor, airline
Feature more common in Controls
    Words Uh, is, will, some, um, patient, then, all, actually, possibly, probably, lockdown, suddenly, pandemic, you, medical staff, volunteers, remember, special, one, Wuhan, city, whole, locked, goods, sight, that, particularly, more, people, patients, works, pause, vegetables, community, graduation, living condition, only, city, one month, dwellers, protective equipment, many, inside, seafood, feeling
Narratives of Happiness
Features more common in Patients
    Words I, good, doctor, report, now, should, not, anyway, nurses, nothing, mood, question, what, isolation, ill, medical staff, examine, positive for COVID-19,body, ah, also, call, eat, discharge, domestic, blood test, negative for COVID-19,cooperate, problem, days, musical instruments, music, mind, antibody, result, really, clothing, event, suppose, public health, sleep, symptoms, later, here, never, sister, all right, think, no, fear
Feature more common in Controls
     Words Ah, then, Wuhan, is, some, pandemic, actually, possible, all, um, volunteers, goods, special, community, sudden, include, will, one, medical staff, pandemic hospital, city, people, organization, later, better, medical team, building, but, protective, news, students, cat, highway, patients
Narratives of Sadness
Features more common in Patients
    Words I, no, anyway, isolation, not, for sure, again, ah, my god, think, sister, worry, infectious, others, husband, then, result, return, should, sister, two, fear
Feature more common in Controls
    Words Then, ah, later, Wuhan, city, possibly, he, one, um, first aid, that, life, world, patient, like, Wuhan people, one, some, car

Looking at Table 7, we observed that COVID-19 patients were more likely to use the first person pronouns “I” and “we” to describe their five emotions, suggesting that patient emotional narratives were more about self or family surviving the pandemic, while healthy controls mainly talked about others (e.g., some people, Wuhan citizens, Red Cross, medical staff). Patients’ negative emotions (anger, disgust, fear, sadness) were mainly concerned with the topics of infection and reinfection, disease symptoms, nucleic acid testing, isolation in hospital and hotel, worry about infecting family and friends, lack of social support, and stigma. In contrast, negative emotions for healthy controls were mainly about the lockdown of the city, the community, medical staff, volunteers, patients, the prevailing living conditions, and the protective measures. With respect to the positive emotion (i.e., happiness), COVID-19 patients talked more about their journey of recovery, namely about how they were cured by medical staff and how they dealt with pressure successfully. The positive emotions of healthy controls were mainly concerned with how the city, the community, and the hospitals operated in an orderly way by collective community efforts.

It is worth noting that patient narratives of happiness encompass words belonging to the ‘fear’ and ‘disgust’ domains. The reason is that word identity feature was extracted from a bag-of-words model, based on the following examples:

++HAPPY(PE)++I don’t think I showed fear of death. I think death is not dreadful. We have elderly family members, so I don’t think death is dreadful. We can keep a positive view about life and treat death a natural thing. I think everyone will die, and will finally reach the end of life. This is a natural process. For me, death is not dreadful. (Patient 13)

++HAPPY(PE)++ I never felt irritable in mood; I felt peaceful during hospital treatment. At the beginning of hospital isolation, I showed severe symptoms and event felt difficult to breathe and to eat. However, three days later I felt like eating an ox, and I could eat meals of two persons every day. I felt happy with good appetite and my immunity enhanced. I recovered very soon. (Patient 27)

In the preceding two excerpts, Patient 13 and Patient 27 described their positive emotion toward COVID-19 infection. They demonstrated no fear toward death, and felt ‘peaceful’ with the virus. These two narratives of happiness encompass negative words in the domain of fear and disgust, such as ‘fear’, ‘dreadful’,’die’, ‘irritable’, ‘severe symptoms’, and ‘difficult’. However, they are syntagmatically combined with words of negation and represent patients’ positive emotions concerning their confidence in recovery.

Discussion

Research into mental and physical health relating to the COVID-19 pandemic has applied text analysis to social media data such as Twitter, Facebook, and Reddit to forecast the possible emergence of emotional disorders and mental illness [2326]. Emotional narratives of patients with confirmed COVID-19 disease have proven to be an important marker to gain a deeper understanding of emotion-related mental health issues due to COVID-19 [22]. Drawing on the text mining approach, this study aimed to examine differences in emotional status between matched groups of COVID-19 patients and healthy controls, who narrated their lived experiences of COVID-19. As part of our established procedure to assess evoked emotional discourse, we tagged narratives of happy, sad, angry, fearful, and disgusted emotions of both groups. Automated analysis of generic features, sentiment word frequency, and significant word features in the emotional narratives between COVID-19 patients and healthy controls reflected their prevailing mental health status. As for generic features of emotional narratives, we observed that persons afflicted by COVID-19 employed significantly more words per sentence, had significantly fewer sentences per narrative, and used significantly fewer words per narrative, as compared to healthy controls. Patients demonstrated generally lower complexity of emotional language across different types of emotional narratives as opposed to healthy controls, other than anger. It is noteworthy that patients’ emotional narratives of fear, happiness, and sadness were significantly shorter compared to healthy controls. Regarding different categories of emotional words, patients generally used more emotional words, in particular, more words of fear (i.e., dread and depression), disgust (i.e., criticism), and happiness (i.e., joy, trust, and like), as compared to healthy controls. This suggests that patients’ emotional symptoms at the lexical level were concentrated on the emotions of fear and disgust. The analysis of the significant word features of emotional narratives showed that COVID-19 patients talked more about themselves or their family surviving COVID-19 in terms of unexpected infection, isolation, and macro-identity [22], while healthy controls mainly talked about others.

At the sentence level, patients used longer sentences to describe their emotion, revealing that their attention to topic was not as fixed as that of healthy controls, and that their emotions were unstable. This was associated with their emotional perturbation due to the COVID-19 infection. At the narrative level, healthy controls demonstrated generally higher complexity and more expressive emotional language (i.e., narratives of fear, sadness, happiness, and disgust) than COVID-19 patients, suggesting that patients’ mental health disruptions had limited the linguistic range of expression of their emotional experiences. This research finding echoes the observations of prior psychiatric studies (e.g., of schizophrenia) concerning abnormalities in the production of coherent discourse due to cognitive dysfunction involving attention and thoughts [21]. A surprising finding in the present study is that narratives of anger were longer in COVID-19 patients than in healthy controls, which was in sharp contrast to the narratives of fear, happiness, sadness, and disgust. One explanation for this finding is that the long period of quarantine and relative lack of social support led to severe symptoms of worry, anxiety, depression, anger, and irritation among COVID-19 patients, given the perception of social isolation [11]. In our interviews, most patients were infected by SARS-CoV-2 passively by their colleagues, neighbors, or family members. They denied the fact that they were infected and tended to attribute the infection to other people. As a result, the salient ‘angry’ emotion emerged in the minds of COVID-19 patients during the first stage of infection [17]. Furthermore, when COVID-19 patients experienced social stigmatization and extreme loneliness, it is possible that their anger would evolve into an extreme emotional disorder. Overall, our results are consistent with those of published literature, which shows that COVID-19 patients are particularly vulnerable to the emotional impact of coronavirus [1114, 16, 1920, 22, 29].

Our findings at the lexical level support the fact that COVID-19 patients demonstrate a higher risk of psychiatric problems such as anxiety, fear, depression and stress, as compared to healthy controls [8, 9]. The emotional words used by COVID-19 patients were denser in volume than in healthy controls. In particular, the frequency of negative emotional words of fear and disgust was much higher in COVID-19 patients than in healthy controls. It is noteworthy that COVID-19 patients used more emotional words of happiness as compared to healthy controls. This is consistent with Sun et al. [17] and Deng et al.’s [22] findings regarding the dynamic path of the emotional and mental health journey of COVID-19 patients. Specifically, at the beginning of infection, the first salient emotion that emerged in patients’ minds was fear, denial, and shame. After a period of isolation in hospital, COVID-19 patients gradually accepted the reality of their diagnosis and cooperated with medical staff during their treatment. In the recovery stage, patients looked forward to a favorable test result. When they finally tested negative via nucleic acid testing, their emotions changed to happiness [17]. In our interviews, the infection and the long period of quarantine led to severe symptoms of worry, anxiety, depression, anger, and irritation among COVID-19 patients, resulting in the perception of social isolation [11, 22]. Notably, most patients were infected with COVID-19 passively by others. They felt reluctant to accept their confirmed infection and might ascribe the “tragedy” to external circumstances. Consequently, negative emotions of fear and sadness increased in COVID-19 patients at the time of infection. When patients experienced extreme loneliness and social discrimination, their negative emotions had the potential to result in the development of an overt mental health disorder. However, after patients were cured, they felt released and extremely excited. Their emotions were transformed into positive ones. This possibly accounts for the use of more emotional words of happiness in COVID-19 patients than in healthy controls. Given the dynamic psychological and emotional journey traversed by COVID-19 patients, it can well be understood why COVID-19 patients concentrated exclusively on themselves and their own well-being, while healthy controls mainly discussed others. Thus, the complex interaction of different embodied experiences, the environment, and psychological experiences interacted together to jointly mold the different emotional experiences and emotional responses in the two matched groups of participants [29].

At the general level and in terms of clinical research, generic features, emotional word frequency, and significant word identity features in the emotional narratives seen in the present study may suggest the presence of overt emotional disorders in COVID-19 patients. These linguistic features may be closely related to patient psychological trauma. In clinical management, emotional care for COVID-19 patients is likely to be equally as important as medical care to holistically manage and cure physical illness. Positive emotions play a critical role in adjustment and rehabilitation of psychological trauma associated with infection by COVID-19 [13]. In the present study, positive emotions of COVID-19 patients were associated with enthusiastic support of medical staff and the government, as well as their own expectations around their new life after recovery. This finding is consistent with that of previous studies [13, 17, 19, 22], and suggests that it is essential to reinforce positive emotions and to relieve negative emotions of patients by providing adequate and timely social support and accurate scientific information to patients [13, 22, 3840]. Firstly, positive social interactions with health care workers, patient peers, relatives, friends, and family members are critical for regulating patient emotional responses to COVID-19 infection. Adequate emotional care should be allocated to COVID-19 patients and within their ordinary social circle. The responsibility and duty of care for emotional healing of COVID-19 patients should not be confined to their family and friends only. It is particularly necessary that health care professionals take responsibility for follow-up support of COVID-19-infected people during their recovery journey [14]. Secondly, the government should provide accurate and timeous scientific information regarding COVID-19, and establish functionally viable policies to eliminate COVID-19-related discrimination regarding employment and isolation. Policies should also be implemented to protect the privacy of COVID-19 patients. Lastly, it is necessary for the mental healthcare system to track, assess, treat, and protect the mental health status of COVID-19-infected patients [13, 40]. Specific measures may include provision of one-on-one consultations, psychological counseling sessions, interactive group support, network consultations, and telephone consultations. These psychological interventional channels should be made available to COVID-19 patients in order to share problems and enhance emotional support [4042].

In the era of big data, emotional discourse analysis of COVID-19 patient data based on novel machine learning algorithms, and recent developments based on natural language processing techniques are promising approaches for the design of a critical surveillance system to manage pandemic-like health scenarios. To assess the emotional status of COVID-19 patients in a comprehensive manner, large-scale emotional discourse analysis based on interviews and stored social media data can be integrated into essential surveillance tools for the management of pandemic-related mental health issues. These measures represent a novel prediction and preventive approach to effectively mitigate emotional disorders resulting from COVID-19, and also allow timely preventive interventions to be implemented for COVID-19 patients [22, 43].

Study limitations and future research

This study has several limitations. First, we collected the narrative discourses of 34 COVID-19 patients and 24 healthy controls by convenience sampling. Our sample size was small and not entirely balanced, and our findings may thus not be generalizable to large populations. Furthermore, the interviews were conducted during two different waves of the COVID-19 pandemic within China, namely summer and autumn-winter, across 2020–2021. The dynamic change of emotional status among the two groups across the two periods may have long-term effects. A second limitation has to do with the techniques of sentiment analysis, which largely rested on the Chinese Affective Lexicon Ontology [30] and the NLTK (Natural Language Toolkit) in Python [33]. It should be noted that semantic analysis based on deep learning methods of NLP was underestimated. Third, we investigated the emotional disorders of COVID-19 patients under collective emotions as opposed to healthy controls. Individual-level correlates, such as clinical therapy, quarantine period, education level, occupation, and income, were therefore underestimated and not presented in our data.

In light of these challenges, we hope and anticipate that similar future investigations could draw on larger sample sizes of patient narratives. Enquires along this line can control the intervals of different interviews and compare emotional symptoms of COVID-19 patients across different periods. Furthermore, recent developments in NLP techniques, such as transformer-based approaches (e.g., BERT, BioBert) and deep neural network learning can be utilized to conduct sentiment analysis of COVID-19 patient narratives. These models are able to measure both the semantics of emotional words and the context that these words occur in robustly. Finally, future sentiment analysis of COVID-19 patients can consider different individual-level, socio-demographic, and pre-existing clinical correlates, given that these factors have the potential to be used as markers to predict different emotional and mental burdens among different categories of patients [3, 43, 44].

Conclusion

The global COVID-19 pandemic is an overt threat to the emotional well-being and the mental health status of persons afflicted by COVID-19. Our text-mining-based emotional discourse analysis focused on generic features, emotional word frequency, and significant word identity features of emotional narratives in COVID-19 patients and healthy controls. Our results indicate generally lower complexity and less expressive emotional narratives in COVID-19 patients than in healthy controls. This reveals that psychological disruptions appear to inhibit the production of a detailed and coherent emotional discourse in COVID-19 patients. At the lexical level, COVID-19 patients used more words of fear, disgust, and happiness, as compared to healthy controls. Furthermore, patients narrated more words with respect to self or family while healthy controls talked mainly about others. Overall, while the emotional symptoms of COVID-19 patients at the lexical level focused on the negative emotions of fear and disgust concerning their confirmed infection and long period of treatment, their emotion was positively regulated by their recovery and adequate support from medical staff [22].

Our emotional discourse analysis may be used to distinguish the clinical status of COVID-19 patients, and can thus be used to predict the extent of emotional disorder symptoms in these patients. In clinical practice, emotional care and psychological counseling for COVID-19 patients must be an essential component of patient management, and should be considered to be an integral central element of current and future COVID-19 management strategies [11, 40]. During the present COVID-19 era, large-scale text mining of real-world data, such as interview data, social media data, and online health forum data, reveals the possibility of the design of a central surveillance system to manage emotional and mental health issues concerning COVID-19 (and possibly other future pandemic-like) infection. This system can be harnessed to inform and guide public healthcare policy and provide useful insights into developing novel therapeutics related to mental health symptoms [25].

Data Availability

The minimal dataset is available within the paper. Additional data is available in figshare: https://doi.org/10.6084/m9.figshare.20522775.

Funding Statement

This research was funded by Humanities and Social Sciences Research Project of Chongqing Education Commission in the form of a grant (21SKGH143) awarded to YD. This study was also funded by Foundation of First-class Discipline of Foreign Languages & Literature, Chongqing in the form of a grant (SISUWYJY202104) awarded to YD. This study was also funded by Teaching reform project of Sichuan International Studies University in the form of a grant (JY2296294) awarded to YD. This study's APC was funded by Chongqing Talent Cultivation Program in the form of a grant (cstc2021ycjh-bgzxm0275) awarded to YC. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Mzelewska A, Imbir KK. Interoceptive awareness and beliefs about health and the body as predictors of the intensity of emotions experienced at the beginning of the pandemic. PeerJ. 2021; 9, e12542. doi: 10.7717/peerj.12542 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Schelhorn I, Schlüter S, Paintner K, Shiban Y, Lugo R, Meyer M, et al. Emotions and emotion up-regulation during the COVID-19 pandemic in Germany. PLOS ONE 2022; 17(1), e0262283. doi: 10.1371/journal.pone.0262283 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Santomauro DF, Herrera A, Shadid J, Zheng P, Ashbaugh C, Pigott DM et al. Global prevalence and burden of depressive and anxiety disorders in 204 countries and territories in 2020 due to the COVID-19 pandemic. The Lancet. 2021; 398, 10312, 1700–1712. doi: 10.1016/S0140-6736(21)02143-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Bo HX, Li W, Yang Y, Wang Y, Zhang Q, Cheung T, et al. Posttraumatic stress symptoms and attitude toward crisis mental health services among clinically stable patients with COVID-19 in China. Psychol. Med. 2021; 51(6),1052–1053. doi: 10.1017/S0033291720000999 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Dong H, Hu R, Lu C, Huang D, Cui D, Huang G, et al. Investigation on the mental health status of pregnant women in China during the pandemic of COVID-19. Arch. Gynecol. and Obstet 2020; (2), 303. doi: 10.1007/s00404-020-05805-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Kaseda ET, Levine AJ. Post-traumatic stress disorder: A differential diagnostic consideration for COVID-19 survivors. Clin Neuropsychol. 2020; 34(7–8):1498–1514. doi: 10.1080/13854046.2020.1811894 [DOI] [PubMed] [Google Scholar]
  • 7.Luo X, Estill J, Wang Q, Lv M, Liu Y, Liu E, et al. The psychological impact of quarantine on coronavirus disease 2019 (covid-19). Psychiatry Res. 2020; 291,113193. doi: 10.1016/j.psychres.2020.113193 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Muruganandam P, Neelamegam S, Menon V, Alexander J, Chaturvedi SK. COVID-19 and severe mental illness: Impact on patients and its relation with their awareness about COVID-19. Psychiatry Res. 2020; 291, 113265. doi: 10.1016/j.psychres.2020.113265 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Taquet M, Luciano S, Geddes JR, Harrison PJ. Bidirectional associations between COVID-19 and psychiatric disorder: Retrospective cohort studies of 62354 COVID-19 cases in the USA. Lancet Psychiatry 2021; 8(2),130–140. doi: 10.1016/S2215-0366(20)30462-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Vahedian-Azimi A, Moayed MS, Rahimibashar F, Shojaei S, Ashtari S, Pourhoseingholi MA. Comparison of the severity of psychological distress among four groups of an Iranian population regarding COVID-19 pandemic. BMC Psychiatry 2020; 20, 402. doi: 10.1186/s12888-020-02804-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Deng Y, Wang L, Yang J, Xie L, Chen Y. How covid-19 patient narratives concerning reinfection mirror their mental health: A case series. Psychiatria Danubina 2021; 33(1), 114–119. doi: 10.24869/psyd.2021.114 [DOI] [PubMed] [Google Scholar]
  • 12.Deng Y, Yang J, Wang L, Chen Y. How lived experiences of COVID-19 shape mental health: A case series of COVID-19 patients from Wuhan, China. Alpha Psychiatry 2021; 22(5),269–274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Li T, Hu Y, Xia L, Wen L, Ren W, Xia W, et al. Psychological experience of patients with confirmed COVID-19 at the initial stage of pandemic in Wuhan, China: A qualitative study. BMC Public Health 2021; 21, 2257. doi: 10.1186/s12889-021-12277-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Missel M, Bernild C, Christensen SW, Dagyaran I, Berg SK. It’s Not just a virus! Lived experiences of people diagnosed with COVID-19 infection in Denmark. Qualitative Health Research 2021; 31(5), 822–834. doi: 10.1177/1049732321990360 [DOI] [PubMed] [Google Scholar]
  • 15.Sahoo S, Mehra A, Suri V, Malhotra P, Yaddanapudi LN, Dutt Puri G, et al. Lived experiences of the corona survivors (patients admitted in COVID wards): A narrative real-life documented summaries of internalized guilt, shame, stigma, anger. Asian J. Psychiatr. 2020; 53,102187. doi: 10.1016/j.ajp.2020.102187 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Son HM, Choi WH, Hwang YH.; Yang HR. The lived experiences of COVID-19 patients in South Korea: A qualitative study. Int.J. Environ. Res. Public Health. 2021; 18, 7419. doi: 10.3390/ijerph18147419 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Sun N, Wei L, Wang H, Wang X, Gao M, Hu X, et al. Qualitative study of the psychological experience of COVID-19 patients during hospitalization. Journal of Affective Disorders. 2021; 278, 15–22. doi: 10.1016/j.jad.2020.08.040 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Toulabi T, Pour FJ, Veiskramian A, Heydari H. Exploring COVID-19 patients’ experiences of psychological distress during the disease course: a qualitative study. BMC Psychiatry. 2021; 21, 625. doi: 10.1186/s12888-021-03626-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Venturas M, Prats J, Querol E, Zabalegui A, Fabrellas N, Rivera P, et al. Lived experiences of hospitalized COVID-19 patients: A qualitative study. Int. J. Environ. Res. Public Health. 2021, 18, 10958. doi: 10.3390/ijerph182010958 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wu C, Cheng J, Zou J, Duan L. Campbell J. E. Health-related quality of life of hospitalized COVID-19 survivors: An initial exploration in Nanning city, China. Social Science & Medicine. 2021; 274, 113748. doi: 10.1016/j.socscimed.2021.113748 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Hong K, Nenkova A, March ME, Parker AP, Verma R, Kohler CG. Lexical use in emotional autobiographical narratives of persons with schizophrenia and healthy controls. Psychiatry Research. 2015; 225(1–2),40–49. doi: 10.1016/j.psychres.2014.10.002 [DOI] [PubMed] [Google Scholar]
  • 22.Deng Y, Li H, Park M. Emotional experiences of COVID-19 patients in China: A qualitative study. Int. J. Environ. Res. Public Health. 2022; 19, 9491. doi: 10.3390/ijerph19159491 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Crocamo C, Viviani M, Famiglini L, Bartoli F, Pasi G, Carrà G. Surveilling COVID-19 emotional contagion on Twitter by sentiment analysis. European Psychiatry. 2021; 64(1), e17, 1–6. doi: 10.1192/j.eurpsy.2021.3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Low DM, Rumker L, Talkar T, Torous J, Cecchi G, Ghosh SS. Natural language processing reveals vulnerable mental health support groups and heightened health anxiety on Reddit during COVID-19: Observational study. J Med Internet Res. 2020. Oct 12; 22(10):e22635. doi: 10.2196/22635 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Patel R, Smeraldi F, Abdollahyan M, Irving J, Bessant C. Analysis of mental and physical disorders associated with COVID-19 in online health forums: a natural language processing study. BMJ Open. 2021. Nov 5; 11(11): e056601. doi: 10.1136/bmjopen-2021-056601 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Viviani M, Crocamo C, Mazzola M, Bartoli F, Carrà G, Pasi G. Assessing vulnerability to psychological distress during the COVID-19 pandemic through the analysis of microblogging content. Future Gener Comput Syst. 2021. Dec; 125: 446–459. doi: 10.1016/j.future.2021.06.044 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Chung CK, Pennebaker JW. Revealing dimensions of thinking in open-ended self-descriptions: An automated meaning extraction method for natural language. Journal of Research in Personality. 2008; 42 (1), 96–132. doi: 10.1016/j.jrp.2007.04.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Frattaroli J. Experimental disclosure and its moderators: A meta-analysis. Psychological Bulletin 2006; 132(6), 823. doi: 10.1037/0033-2909.132.6.823 [DOI] [PubMed] [Google Scholar]
  • 29.Deng Y, Yang J, Wan W. Embodied metaphor in communication about lived experiences of the COVID-19 pandemic in Wuhan, China. PLOS ONE. 2021; 16(12), e0261968. doi: 10.1371/journal.pone.0261968 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Xu L, Lin H, Pan Y, Ren H, Chen J. Constructing the affective lexicon ontology. Journal of the China Society for Scientific and Technical Information. 2008; 27(2), 180–185. [Google Scholar]
  • 31.Ekman P. An argument for basic emotions. Cognition Emotion. 1992; 6(3–4), 169–200. [Google Scholar]
  • 32.Che W, Feng Y, Qin L, Liu T. N-LTP: An open-source neural language technology platform for Chinese. https://arxiv.org/abs/2009.11616.
  • 33.Bird S, Klein E, Loper E. Natural Language Processing with Python. O’Reilly Media Inc, 2009. https://www.nltk.org/book/ [Google Scholar]
  • 34.Williams K. The failure of Pearson’s goodness of fit statistic. Journal of the Royal Statistical Society. 1976, 25(1) 49. [Google Scholar]
  • 35.Rayson PE. Matrix: A statistical method and software tool for linguistic analysis through corpus comparison. Lancaster University, 2003. [Google Scholar]
  • 36.Rayson P. Garside R. Comparing corpora using frequency profiling. In The Workshop on Comparing Corpora. 2000, 1–6.
  • 37.Kettunen K. Can type-token ratio be used to show morphological complexity of languages? Journal of Quantitative Linguistics. 2014; 21(3), 223–245. [Google Scholar]
  • 38.Bao Y, Sun Y, Meng S, Shi J, Lu L. 2019-nCoV epidemic: Address mental health care to empower society. Lancet. 2020; 395(10224), e37–e38. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Waugh CE. The roles of positive emotion in the regulation of emotional responses to negative events. Emotion. 2020; 20(1), 54–58. doi: 10.1037/emo0000625 [DOI] [PubMed] [Google Scholar]
  • 40.Xiang YT, Yang Y, Li W, Zhang L, Zhang Q, Cheung T, et al. Timely mental health care for the 2019 novel coronavirus outbreak is urgently needed. Lancet Psychiat. 2020; 7(3), 228–229. doi: 10.1016/S2215-0366(20)30046-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Deng Y, Yang J. Psychological status of frontline healthcare professionals at the outbreak of COVID-19 in Wuhan: A narrative case series. Psychiatry and Clinical Psychopharmacology. 2021; 31(2), 233–237. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Duan L, Zhu G. Psychological interventions for people affected by the COVID-19 epidemic. Lancet Psychiat. 2020; 7(4), 300–302. doi: 10.1016/S2215-0366(20)30073-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Crocamo C, Bachi B, Calabrese A, Callovini T, Cavaleri D, Cioni RM, et al. Some of us are most at risk: Systematic review and meta-analysis of correlates of depressive symptoms among healthcare workers during the SARS-CoV-2 outbreak. Neurosci Biobehav Rev. 2021;131:912–922. doi: 10.1016/j.neubiorev.2021.10.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Brooks SK, Webster RK, Smith LE, et al. The psychological impact of quarantine and how to reduce it: rapid review of the evidence. The Lancet, 2020; 395(10227):912–920. 10.1016/S0140-6736(20)30460-8 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Giuseppe Carrà

19 May 2022

PONE-D-22-06307Emotional Discourse Analysis of COVID-19 Patients and their Mental Health: An NLP-based StudyPLOS ONE

Dear Dr. Chen,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jul 03 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Giuseppe Carrà, PhD

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf.

2. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: No

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: No

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: In the manuscript entitled “Emotional Discourse Analysis of COVID-19 Patients and their Mental Health: An NLP based Study”, the authors aimed to study potential differences in the emotional status between people who reported COVID-19 infection and healthy controls, benefiting from novel approaches based on natural language processing methods.

The topic is very interesting and the specific methodological approach appears promising in the mental health field, possibly contributing to explore emotional status and the potential link with a differential mental health burden.

Nevertheless, some changes are required to improve the manuscript quality:

• In the Introduction section, the study justification should be clearly presented, by focusing on the need to explore mental health burden or at least proxies of it ( https://doi.org/10.1016/S0140-6736(21)02143-7 )

Similarly, in the Abstract, at least one sentence is needed to describe the study justification.

• In the Materials and Methods section, the authors should better clarify the nature of the study (e.g., cross-sectional design). When were narratives collected for people who experienced COVID-19? The authors may wish to take this into account also in the Discussion section.

• Further details should be added in terms of sampling strategies and potentially sample size calculations with additional comments in the Limitations section, where appropriate.

• Similarly, NLP should be described more in detail, thus providing the reader with additional information regarding the methodological approach for features computation. For example, the authors should report whether they benefited from a specific software (and related packages) and they should add a more in-depth description of chosen techniques. In addition, they should clarify whether they combined qualitative and quantitative approaches.

• In the current version of the manuscript, the study design paragraph includes some details that are specific of the chosen methodological approach. Therefore, it would be useful to add a reference to the paragraph in which the authors describe the current approach in detail.

• More importantly, a subparagraph Measures should be added to the Materials and Methods section.

• Page 3 lines 146-149: It seems more like a result rather than a methodological description. Please check.

• Lines 162-180: How were interview's domains defined? Please clarify.

• The rationale of choosing two data collection periods (i.e., Summer and Autumn-Winter) should be clarified. Did the authors check data for differences across the two periods? I suggest discussing this issue further.

• In Table 2, what do the initials in brackets mean? Please add relevant footnotes.

• Some details mentioned at the beginning of the Results section need to be disclosed in the Materials and Methods section. Please revise the logical flow of the Methods section to include all relevant methodological details in order to let the reader fully understand the analyses.

• Moreover, the techniques to implement sentiment analyses are not clearly presented in the Methods. Please revise related sentences, also referring to commonly used approaches such as the Valence Aware Dictionary for sEntiment Reasoning (VADER) and the Bidirectional Encoder Representations from Transformers (BERT) ( https://doi.org/10.1192/j.eurpsy.2021.3 )

• There is some ambiguity between the two paragraphs of the Results section. For example lines 274-284 seem to fit best the next heading (Sentiment Frequency of emotional words and word identity). Please clarify.

I suggest providing a clearer definition of the measures in the Materials and Methods section (see my previous comment about an additional “Measures” paragraph) and then following an unequivocal logical flow in the Results section according to what is described in the Methods. Moreover, types of emotional words (see Figure 4 footnotes) should be described in the Materials and Methods section.

• When interpreting Table 4, the authors focused on some emotions according to statistically significance (0.05 threshold). However, table 4 seems to show an additional perspective about potential emotional differences between patients and controls. In particular, although some estimates were not statistically significant, it shows that, as a whole, controls were likely to express different emotions through a higher number of words as compared with patients, except for anger. Although these results should be interpreted with caution, this perspective should be taken into account. I suggest the authors to revise relevant sentences in the Results section and add a comment on that in the Discussion section. Do you think that this might be related to the time interval between COVID-19 infection and the interview? In other words, can this be considered a short- or also a long-term effect? What are potential explanations? Please add a comment in the Discussion section, also considering available information from collected data.

• Furthermore, the authors should clarify whether they collected meta data such as information on therapy/recovery, but also gender, age, etc… or they should acknowledge the lack of key information as a study limitation. Individual-level correlates may play a role that needs to be taken into account for future research. Indeed, based on available evidence, different individual-level, socio-demographic and pre-existing clinical correlates are likely to be associated with an increased mental health burden and should be taken into account ( https://doi.org/10.1016/S0140-6736(20)30460-8 ; https://doi.org/10.1016/j.neubiorev.2021.10.010 ; https://doi.org/10.1016/S0140-6736(21)02143-7 )

• Lines 286-287: “We focused exclusively on examining lexical features in the emotional narratives related to the emotionality.” Please clarify. In addition, did the authors mean frequency distribution or rather sentiment polarity?

• Qualitative results are not adequately justified through representative excerpts from the data. Therefore, some excerpts from the qualitative analysis should be provided and discussed in order to let the reader better understand the results, along with those provided next to Table 2.

• The Discussion section appear to jump to conclusions. These need to be clarified and justified. For example lines 351-352; 356-358 (Sentiment or content analysis?); 362-364; 371-373. Please give additional explanations and consider providing adequate references. As a result, the first paragraph of the Discussion needs to be extensively revised.

• How was happiness combined with emotional words belonging to fear and disgust domains? In other words, happiness is somehow opposite to fear and disgust. How did the authors interpret these findings? (Consistently, please report the name of the emotion.)

• Furthermore, based on available evidence, the authors should add a comment in the Discussion section on the use of novel machine learning algorithms and relevant methods grounded on natural language processing techniques as a promising approach for the design of a critical surveillance system to manage pandemic-like scenarios. These considerations should contribute to emphasize the importance of an emotional discourse analysis that may possibly benefit also from social media integration for timely preventive interventions ( https://doi.org/10.1192/j.eurpsy.2021.3 )

• I suggest revising the Conclusion paragraph, by making the first part more compact in order to let the reader better understand the take home message.

Reviewer #2: In the study presented in this article, the authors used semi-structured interviews and affirm to use natural language processing to analyze the characteristics of lived experience narratives between patients with COVID-19 and a group of healthy patients related to five basic emotions. The aim of the work was to identify differences in emotional state between the two groups of participants. The results obtained indicate generally higher complexity and more expressive emotional language in the healthy patient group than in the COVID-19 patients.

The work has a rather relevant objective but, in my opinion, it fails to be convincing with respect to the results obtained, this due to several weaknesses and naive solutions used in the interpretation of the results.

- First of all, semi-structured telephone interviews were used in the study. It is not discussed sufficiently how it is ensured that the interview modality and the questions asked do not in any way imply bias with respect to the answers provided. Has it been thought to also consider other content generated by these users independently of the telephone interview? There are works that have tried to assess the psychological vulnerability of users with respect to this aspect, and other works more similar to the proposed work, which were not mentioned in the work, for example:

* Low, Daniel M., et al. "Natural language processing reveals vulnerable mental health support groups and heightened health anxiety on reddit during covid-19: Observational study." Journal of medical Internet research 22.10 (2020): e22635.

* Patel, Rashmi, et al. "Analysis of mental and physical disorders associated with COVID-19 in online health forums: a natural language processing study." BMJ open 11.11 (2021): e056601.

* Viviani, Marco, et al. "Assessing vulnerability to psychological distress during the COVID-19 pandemic through the analysis of microblogging content." Future Generation Computer Systems 125 (2021): 446-459.

It is true that the authors take into consideration specific groups of users affected and not affected by COVID-19, however there are interesting aspects of the previous works that could be taken into consideration, the lack of which constitutes another weak point of the presented work.

- In fact, the authors claim to use "the natural language processing method". Regardless of the fact that the sentence itself is incorrect, as NLP includes a series of techniques for natural language processing (and it is not a "method"), the fact remains that the text analysis method used by the authors is rather simple and may in any case refer to techniques of "text mining" rather than "natural language processing". In fact, no semantic or natural language understanding analysis is carried out, as frequentist analyzes of the appearance of particular textual characteristics within the interviews are essentially taken into consideration.

- In evaluating users' emotions, the authors therefore follow an approach based essentially on the lexicon, without making any reference to recent developments in the NLP field which take into account both the semantics of words and the context in which these words appear (BERT, BioBERT , transformer-based approaches, use of deep neural networks).

Ultimately I can say that the work from the point of view of language analysis with respect to the problem in question is not particularly innovative compared to previous works of literature, and, in any case it refers to concepts that are ultimately not really used within the work (for example NLP).

I also believe that on a rather limited group of patients it is difficult to draw conclusions that are statistically relevant, even this aspect should have been discussed more carefully.

Finally, I must say that the level of English should also be improved, because very often there are sentences that are not understandable or at least not in standard English.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Decision Letter 1

Giuseppe Carrà

25 Aug 2022

Emotional Discourse Analysis of COVID-19 Patients and their Mental Health: A Text Mining Study

PONE-D-22-06307R1

Dear Dr. Chen,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Giuseppe Carrà, PhD

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Giuseppe Carrà

8 Sep 2022

PONE-D-22-06307R1

Emotional Discourse Analysis of COVID-19 Patients and their Mental Health: A Text Mining Study

Dear Dr. Chen:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Giuseppe Carrà

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Attachment

    Submitted filename: Response to Reviewer comments.docx

    Data Availability Statement

    The minimal dataset is available within the paper. Additional data is available in figshare: https://doi.org/10.6084/m9.figshare.20522775.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES