Abstract
Latent Dirichlet Allocation (LDA) is an unsupervised learning approach that investigates the semantics among words in a document as well as the influence of a topic on a word. As an LDA-based model, Joint Sentiment-Topic (JST) examines the impact of topics and sentiments on words. The sentiment parameter alone is insufficient, and additional parameters can play a valuable role in achieving better performance. In this study, two new topic models, Weighted Joint Sentiment-Topic (WJST) and Weighted Joint Sentiment-Topic 1 (WJST1), are presented; they extend and improve JST through two new parameters and can generate a sentiment dictionary. In the proposed methods, each word in a document affects its neighbors, and a word may be affected simultaneously by several neighboring words. The proposed models therefore consider the effect of words on each other, which, in our view, is an important factor that can increase the performance of the baseline methods. According to the evaluation results, the new parameters have a substantial effect on model accuracy. Although they require no labeled data, the proposed methods are more accurate than discriminative models such as SVM and logistic regression. The proposed methods are simple and have few parameters. While providing a broad view of the connections between words in documents of a single collection (single-domain) or multiple collections (multidomain), they offer solutions for both situations: WJST is suitable for multidomain datasets, and WJST1 is a version of WJST suited to single-domain datasets. Besides detecting sentiment at the document level, the proposed models improve on the evaluation outcomes of the baseline approaches. Thirteen datasets of different sizes are used in the experiments, and perplexity, document-level opinion mining, and topic_coherency are employed for assessment. The Friedman test is also used to check whether the results of the proposed models differ statistically from those of the other algorithms. The accuracy of the proposed methods is above 80% for most datasets: WJST1 achieves its highest accuracy on the Movie dataset (97%), and WJST achieves its highest accuracy on the Electronic dataset (86%). The proposed models also outperform Adaptive Lexicon learning using Genetic Algorithm (ALGA), which employs an evolutionary approach to build a sentiment dictionary. Results show that the proposed methods perform well across different topic-number settings, with WJST1 in particular reaching 97% accuracy at |Z| = 5 on the Movie dataset.
1. Introduction
Opinion extraction is one of the main branches of natural language processing (NLP) research. Comment extraction (emotion analysis) is now widely used on websites offering different types of merchandise. Online product reviews can help customers decide on a purchase and help manufacturers discover new opportunities by analyzing user feedback. Consequently, automated analysis of reviews is critical. An emotion analyzer can browse comments on the web and tag large numbers of them as positive or negative. This line of research matters because it makes managing customer requests easier and more efficient: product owners can automatically extract customer feedback and use it to improve their products and sales. There are different methods for extracting and analyzing opinions, and in this research, an intelligent method has been used [1–7]. Topic modeling presumes that the input set of text documents contains several unknown subjects that need to be recognized. Each subject (topic) is an unknown distribution over words, and each review (text document) is a distribution over subjects. The aim is to detect knowledge concealed in textual data related to users' comments. Several methods perform topic modeling, such as Latent Dirichlet Allocation (LDA) and Probabilistic Latent Semantic Analysis (PLSA). PLSA is a method that models the data observed in a document-term matrix. LDA is a probabilistic method because it is expressed in a probabilistic language, and it is a generative model because it describes how documents are produced. LDA is based on the premise that a review is a combination of subjects in which each topic is a distribution over words. The linear growth of PLSA parameters indicates that the method is prone to overfitting. LDA, by contrast, can be easily extended to new documents, and increasing the training data size does not lead to growth in the number of LDA parameters [7].
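For readers unfamiliar with topic models, the following minimal sketch shows the LDA view described above (documents as distributions over topics, topics as distributions over words) using the gensim library; gensim and the toy corpus are illustrative choices of ours, not part of this study's implementation.

```python
from gensim.corpora import Dictionary
from gensim.models import LdaModel

# Toy corpus: each review is a list of preprocessed tokens.
docs = [["battery", "life", "great"], ["screen", "bright", "great"],
        ["plot", "boring", "slow"], ["acting", "plot", "great"]]

dictionary = Dictionary(docs)                   # word <-> id mapping
corpus = [dictionary.doc2bow(d) for d in docs]  # bag-of-words counts

# Fit LDA: each document becomes a distribution over topics,
# and each topic a distribution over words.
lda = LdaModel(corpus=corpus, id2word=dictionary, num_topics=2, passes=50)
for topic_id, words in lda.print_topics():
    print(topic_id, words)
```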
In LDA, subjects are related to documents, and words are related to subjects. To model the sentiment of reviews, Joint Sentiment-Topic (JST) [8] introduces an extra sentiment layer between the document and subject layers, where sentiment labels are related to documents, subjects are related to sentiment labels, and words are tagged with sentiments and related topics. This study assumes that each word in a document affects its neighbors and that different words in a document may be affected simultaneously by several neighboring words. Thus, the proposed models consider the effect of words on each other. The proposed models add two parameters (weight and window) to JST: the window parameter represents the range of a word's effect, and the weight parameter represents the strength of that effect. These two parameters play an important role in better classification, as shown in the evaluation section. Using the weight and window parameters, two new methods are introduced that show notable improvements over baseline algorithms such as JST, Topic Sentiment modeling (TS) [9], Reverse-JST (RJST) [10], and the Tying-JST model (TJST) [8].
More and more refined algorithms and strategies are being used to solve sentiment analysis problems. However, none of the prior work improves accuracy while also generating a sentiment dictionary. Unlike other related studies, the proposed models improve topic-model-based sentiment classification using two parameters (weight and window). The proposed models consider the effect of words on each other. They can also generate a sentiment dictionary containing words and scores that specify positive and negative labels and their weights. Accuracy is calculated using two formulas. Finally, evaluating the proposed methods against other algorithms on thirteen datasets of different sizes shows that the algorithms presented in this study are superior in terms of accuracy, perplexity, and topic_coherency.
The rest of this article is arranged as follows: Section 2 shows a summarized overview of previous works in emotion analysis and the use of topic modeling in emotion analysis. The proposed models are provided in Section 3. The evaluation results are discussed in Section 4, and Section 5 concludes this article.
2. Related Works
The value of emotion analysis may be highlighted by analyzing customer satisfaction with online services such as email. It is also feasible to employ emotion mining to evaluate the opinions of various people in order to make them aware of items that have favorable reviews. The major levels of classification in emotion analysis are the document, sentence, and aspect levels. An opinion is a quadruple (g, s, h, t), where g is the target, s is the sentiment, h is the opinion holder, and t is the time the opinion was expressed [11–13]. Many attempts have been made to detect emotions and explore the knowledge embedded in text data. Topic modeling obtains the concealed subjects of documents; the aim is to discover the set of hidden variables that best explains the observed data. LDA has been used as a topic model to effectively explore subjects in documents [7], and it has motivated countless extensions for different problems [14–17]. In [18], the authors exhibit three topic models that improve LDA using date, helpfulness, and subtopic parameters. Articles [8, 10, 19] describe the JST methodology. This model extends LDA with a sentiment layer; it cannot accurately identify the different emotions and is used as a baseline method in most articles. Several methods are similar to JST [8, 10, 20]. The Aspect and Sentiment Unification Model (ASUM) [20] is similar to JST: JST assumes that each word represents an aspect, whereas ASUM assumes that each sentence describes an aspect. A variation of the JST model is TJST [8]. The main difference between JST and TJST is that, to sample a word during the generative process, JST selects a subject-document distribution for each document, whereas TJST uses one subject-document distribution for all documents. According to [10], the emotion influences the subject in JST, whereas in RJST, the subject influences the emotion. According to [9], there is only one topic-sentiment distribution for all documents in TS, while there is one distribution per document in RJST.
Several methods have been introduced for text emotion analysis that use topic modeling [21–23, 78]. In [24], the authors introduce an algorithm that models a review as containing both shared subjects and subjects distributed over words as special data. Two topic models are proposed in [79]: the Multilabel Supervised Topic Model (MSTM) and the Sentiment Latent Topic Model (SLTM); both can be used to categorize social emotions. In [25], the authors introduce Sentiment Enriched and Latent-Dirichlet-Allocation-based review rating Prediction (SELDAP) to predict ratings using the topics and sentiments of reviews. In [26], the authors introduce Hierarchical Clinical Embeddings combined with Topic modeling (HCET), which can integrate five types of Electronic Health Record (EHR) data over several visits to predict depression. The authors of [80] presented the word-Sense-aware LDA (SLDA) approach, which uses word sense in topic formation. In [27], the authors present a survey of short text topic modeling methods, providing a detailed analysis of the algorithms and discussing their performance. The authors of [81] proposed a segment-level joint topic-sentiment model (STSM), where each sentence is divided into parts by conjunctions under the assumption that all terms in a segment convey the same emotion. In [28], the authors provided a thorough examination of topic modeling methods.
Deep learning provides a way to exploit large volumes of computation and data with little manual engineering. Recently, deep learning approaches to emotion analysis have achieved considerable success [29, 30, 47, 77]. Optimization methods have also developed significantly in recent years [31–37] and are widely used for feature selection, notably for text. In [38], the authors proposed a multiobjective grey wolf optimization algorithm to categorize sentiments. In [39], the authors proposed a binary grey wolf optimizer to classify labels in text. In [40], the authors introduced a new optimization method that mimics the model of a successful person in society; their article used this method to categorize emotions and achieved very good results. There are several works on using user behavior for sentiment analysis. The Tag sentiment aspect (TSA) framework, a new probabilistic generative topic framework based on LDA, was presented in [48] with three implementation editions. In [41], the authors concentrate on user-based methods on social networks, where users create text data to express their views on different topics and connect with other users to form a social network. In [42], the authors used a signed social network to detect the emotions of reviews in an unsupervised approach. Various works use other techniques for sentiment analysis problems [43–45]. In Adaptive Lexicon learning using a Genetic Algorithm (ALGA) [46], emotion dictionaries for a dataset are constructed in the training stage using a genetic method and then utilized in the testing stage. Each lexicon comprises words and their scores. A chromosome is modeled as a vector of emotional words and scores, with scores ranging from the lowest to the highest score of an emotional word. The main goal of ALGA is to create a lexicon that minimizes the error in the training stage.
In [47], the authors proposed a deep-learning-based topic-level opinion mining method. The approach is novel in that it works at the level of the sentence to explore the subject using online latent semantic indexing and then employs a subject-level attention mechanism in a long short-term memory network to detect emotion. In [62], the authors proposed a joint aspect-based sentiment topic model that extracts multigrained aspects and emotions. In [49], parts-of-speech (POS) tagging is performed via a hidden Markov model, and unigram, bigram, and bi-tagged features are extracted; the nonparametric hierarchical Dirichlet process is also employed to extract joint sentiment-topic features. In [50], the authors used an unsupervised machine learning method to extract emotion at the document and word levels. In [51], the authors proposed a new framework for joint sentiment-topic modeling based on the Restricted Boltzmann Machine (RBM), a type of neural network. In [52], the authors proposed a probabilistic method that incorporates textual reviews and overall ratings, considering their natural connection, for joint sentiment-topic prediction. In [53], the authors proposed a hybrid topic-model-based method for aspect extraction and emotion categorization of reviews: LDA is used for aspect extraction and a two-layer bidirectional long short-term memory network for emotion categorization. In [54], the authors proposed a joint sentiment-topic model that uses a Markov Random Field regularizer and can extract more coherent and diverse topics from short texts. In [55], the authors proposed a topic model with a new document-level latent sentiment variable for each topic, which moderates the word frequency within a topic. In [56], the authors proposed a new method for text emotion detection that improves the LSTM network by integrating emotional intelligence and an attention mechanism. In [57], the authors proposed a new model for aspect-based emotion detection, a novel adaptation of the LDA algorithm for product aspect extraction.
In [58], the authors introduced a new deep-learning-based algorithm for emotion detection that uses available ratings as weak supervision signals. In [59], the authors introduced a deep-learning-based algorithm for emotion detection with two hidden layers: the first layer learns sentence vectors to represent the semantics of sentences, and the second layer encodes the relations between sentences. In [60], the authors introduced a transformer-based model for emotion detection that encodes representations from a transformer and applies deep embedding to improve the quality of tweets. In [61], the authors introduced an attention-based deep method using two independent layers; by considering temporal information flow in both directions, it retrieves both past and future contexts.
In this study, the proposed methods aim to increase accuracy with fewer parameters while remaining simpler than existing methods. The proposed methods analyze emotions at the document level and create a sentiment dictionary. They are, to our knowledge, the first methods that create a sentiment dictionary through a topic modeling technique automatically and accurately, and the first that consider the words in a text and their effect on each other in a dynamic, weighted way.
Table 1 compares a number of articles on emotion analysis from recent years in terms of method, language, and dataset. As the method column shows, combinations of topic modeling and deep learning have recently received attention. The language column specifies the language in which each method was tested, and the dataset column names the datasets used.
Table 1.
A general comparison of similar methods in recent years.
| References | Method | Language | Dataset | General result |
|---|---|---|---|---|
| Pathak et al. [47] | Deep learning + topic modeling | English | Facebook, Ethereum, Bitcoin, SemEval-2017 | Facebook-0.79, Ethereum-0.844, Bitcoin-0.817, SemEval-2017-0.889 |
| Tang et al. [62] | Topic modeling | English | Amazon, Yelp | Amazon-0.82, Yelp-0.84 |
| Kalarani and Selva Brunda [49] | Joint sentiment-topic features + POS tagging + SVM and ANN | English | Balanced dataset, unbalanced data | SVM-0.84, ANN-0.87 |
| Farkhod et al. [50] | Topic modeling | English | IMDB | IMDB-F1 score-70.0 |
| Fatemi and Safayani [51] | Topic modeling + restricted Boltzmann machine | English | 20-Newsgroups (20NG), movie review (MR), multidomain sentiment (MDS) | Perplexity: MR: 406.74 |
| Pathik and Shukla [53] | Deep learning + topic modeling | English | Yelp, Amazon, IMDB | Yelp-0.75, Amazon-0.76, IMDB-0.82 |
| Sengupta et al. [54] | Topic modeling | English | Movies, Twitter | Perplexity: Movies-3834.7, Twitter-280.75 |
| Huang et al. [56] | Deep learning | English | IMDB, Yelp | IMDB-0.963, Yelp-0.735 |
| Özyurt and Akcayol [57] | Topic modeling | English + Turkish | User reviews in Turkish language about smartphones, SemEval-2016, Task-5 Turkish restaurant reviews | Precision-81.36 Recall-83.43 F-score-82.39 |
| Zhao et al. [58] | Deep learning | English | Amazon review | CNN-87.7, LSTM-87.9 |
| Rao et al. [59] | Deep learning | English | Yelp 2014, 2015, IMDb | Yelp2014–63.9 Yelp2015–63.8, IMDb-44.3 |
| Naseem et al. [60] | Deep learning | English | Airline dataset | Airline dataset = 0.95 |
| Basiri et al. [61] | Attention-based deep learning | English | Sentiment140, Airline, Kindle dataset, movie review | Kindle dataset = 0.93, Airline = 0.92, movie review = 0.90, Sentiment140 = 0.81 |
The proposed models are deeply described step-by-step in the next section.
3. Proposed Models
This study proposes two novel topic sentiment models called Weighted Joint Sentiment-Topic (WJST) and Weighted Joint Sentiment-Topic 1 (WJST1). The proposed models improve JST using two extra parameters (weight and window).
According to Figure 1, the dataset consists of text. Preprocessing is performed by lowercasing all words; removing stop words and words with too low or too high frequency; stemming; removing digits; and removing nonalphabetic characters such as #, !, etc. (a preprocessing sketch follows Figure 1). The proposed models can be summarized as follows: (1) the Generative Model part illustrates the procedure of generating a word in a document under a topic model; (2) the Plate Notation part provides a graphical representation of the topic model (in the style of plate notation); (3) the Model Inference part uses Gibbs sampling to perform approximate inference. In the Evaluation phase, the model's performance is measured using accuracy, perplexity, and topic_coherency.
Figure 1.

The framework chart of the proposed methods.
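As a concrete illustration of the preprocessing pipeline in Figure 1, the sketch below applies the listed steps with NLTK; the frequency thresholds (min_df, max_df_ratio) are illustrative assumptions of ours, since the paper does not fix their values.

```python
import re
from collections import Counter
from nltk.corpus import stopwords      # run nltk.download("stopwords") once
from nltk.stem import PorterStemmer

def preprocess(reviews, min_df=2, max_df_ratio=0.9):
    """Lowercase, strip digits/nonalphabetic characters, remove stop words,
    stem, and drop words with too low or too high document frequency."""
    stops = set(stopwords.words("english"))
    stemmer = PorterStemmer()
    tokenized = []
    for review in reviews:
        text = re.sub(r"[^a-z\s]", " ", review.lower())   # digits, #, !, ...
        tokenized.append([stemmer.stem(t) for t in text.split()
                          if t not in stops])
    df = Counter(t for doc in tokenized for t in set(doc))
    n = len(tokenized)
    keep = {t for t, c in df.items() if c >= min_df and c / n <= max_df_ratio}
    return [[t for t in doc if t in keep] for doc in tokenized]

print(preprocess(["My phone has a small memory!",
                  "Its pictures quality is low..."], min_df=1))
```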
3.1. Motivation
The proposed models add two parameters to JST as latent variables. We assume that the words in a document affect their neighbors and that different words in the document may be affected simultaneously by several neighboring words. For example, in the sentence "My phone has a small memory, and its pictures quality is low," the unigram "small" affects the unigram "memory," and the bigram "small memory" affects the unigrams "phone" and "pictures." So, the unigram "small" affects the unigrams "memory," "phone," and "pictures."
According to Figure 2, reviews are the input text data used for sentiment classification. The proposed models consider the effect of words on each other and adopt Gibbs sampling to perform approximate inference of the distributions. After the sampling in the Gibbs sampling algorithm completes, the distributions of the latent variables can be calculated. Sentiment classification at the document level is based on the probability of a sentiment label given a document.
Figure 2.

The general architecture of the proposed methods.
As in the above example, a word can affect neighboring words in many sentences. So, in the proposed models, we consider the effect of words on each other using two parameters: the window parameter represents the range of a word's impact, and the weight parameter represents the strength of its effect. In the proposed models, each word has a weight, a sentiment label, and a topic, and it affects its neighbors as far as its window size reaches, which means that each word has a window. For instance, as can be seen in Figure 3, word w3 has a window size of 1 and affects words w2 and w4, and w6 has a window size of 2 and affects words w4, w5, w7, and w8 (a small sketch of this neighborhood computation follows Figure 3). If word w3 had weight h and negative sentiment, words w2 and w4 would receive weight h and negative sentiment as well. Each word is affected by its neighbors, so different words in a document may be affected simultaneously by several neighboring words.
Figure 3.

An example of a sentence with different windows.
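A minimal sketch of the neighborhood relation in Figure 3 follows; positions are 0-based here, and the function name is our own.

```python
def affected_neighbors(position, window, length):
    """Return the indices a word affects, given its position and window size."""
    lo, hi = max(0, position - window), min(length - 1, position + window)
    return [i for i in range(lo, hi + 1) if i != position]

sentence = ["w1", "w2", "w3", "w4", "w5", "w6", "w7", "w8"]
# w3 (index 2) with window 1 affects w2 and w4:
print([sentence[i] for i in affected_neighbors(2, 1, len(sentence))])
# w6 (index 5) with window 2 affects w4, w5, w7, and w8:
print([sentence[i] for i in affected_neighbors(5, 2, len(sentence))])
```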
3.2. The Problem Statement
In this study, given a corpus of |R| documents R={r1, r2, r3,…, r|R|}, a document r consists of {w1, w2, w3,…, wNr} words, and each word belongs to a vocabulary set with |V| distinct elements. Furthermore, |Q| is the number of distinct windows, |E| is the number of distinct weights, |S| is the number of distinct sentiment labels, and |Z| is the number of distinct topics. Five latent distributions θ, φ, π, ψ, and ξ need to be inferred. The hyperparameters α, β, γ, δ, and μ are set based on experience and can be viewed as prior observation counts before observing any actual words, where α is the Dirichlet prior for θ, β is the Dirichlet prior for φ, γ is the Dirichlet prior for π, δ is the Dirichlet prior for ψ, and μ is the Dirichlet prior for ξ. The latent parameters z, s, q, e, φ, θ, π, ξ, and ψ need to be approximated from the observed variables, where z is the topic variable, s is the sentiment variable, q is the window variable, and e is the weight variable. The proposed models describe the process of generating words in documents and can approximate the latent variables. The main aim of the proposed topic models is to categorize sentiments at the document level.
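For concreteness, the notation above can be gathered into a small configuration object; this is a sketch of ours, with the numeric values taken from Table 5 (any other setting is equally valid).

```python
from dataclasses import dataclass

@dataclass
class WJSTConfig:
    n_topics: int = 5          # |Z|, number of distinct topics
    n_sentiments: int = 2      # |S|, positive and negative
    n_windows: int = 6         # |Q|, window sizes 1..6
    n_weights: int = 11        # |E|, integer weights in [-5, +5]
    alpha: float = 0.3         # Dirichlet prior for theta
    beta: float = 0.01         # Dirichlet prior for phi
    gamma: float = 1.0         # Dirichlet prior for pi (0.016 x avg doc length)
    mu: float = 3.0            # Dirichlet prior for xi
    delta: float = 9.0         # Dirichlet prior for psi
```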
3.2.1. The Problem We Are Trying to Solve or Improve
Analyzing user satisfaction with various services, products, or movies is the main problem in this study, and it is mainly reflected in users' comments. A user's comment is a text message on the Internet, such as a tweet or a simple message on a website. For example, it is feasible to employ emotion mining to evaluate the opinions of various people in order to make them aware of items that have favorable reviews.
3.2.2. The Solution to the Problem
Many attempts have been made to detect emotions and explore the knowledge embedded in text data. Topic modeling, as a well-known method, can obtain the concealed subjects of documents, and LDA has been used as a topic model to effectively explore subjects in documents. As an LDA-based model, JST examines the impact of topics and emotions on words. The emotion parameter alone is insufficient, and additional parameters may play valuable roles in achieving better performance.
This study presents two new topic models that extend and improve JST through two new parameters and generate a sentiment dictionary. The proposed models consider the effect of words on each other, which, in our view, is an important factor that can increase the performance of the baseline methods. Several methods have been introduced for text emotion analysis that use topic modeling. However, none of the prior work improves accuracy while also generating a sentiment dictionary. Unlike other related studies, the proposed models improve topic-model-based sentiment classification using two parameters (weight and window). The proposed models are described step-by-step in the following sections.
3.3. The General Structure of WJST
This subsection introduces a new model named WJST, which improves JST using two parameters (weight and window). The primary goal of WJST is to classify sentiments at the document level. A summary of the symbols used in the model is given in Table 2. The process of generating a word of a document in WJST can be outlined as follows (a pseudocode sketch follows Table 2): (1) For each document, an author first decides the distribution of sentiments, for example, 70% positive and 30% negative, so the proposed model chooses a sentiment label from the per-document sentiment distribution. (2) After determining the sentiment label, the author writes a review about a product according to the distribution of topics, for example, 70% about memory, 20% about speed, and 10% about battery, so WJST chooses a topic from the per-document topic distribution, which depends on the sentiment label. (3) After determining the sentiment label and the topic, the author decides the distribution of weights and the distribution of windows. WJST then chooses a weight from the per-document weight distribution, which depends on the sentiment label and topic, and a window size from the per-document window distribution, which depends on the topic. (4) Finally, the author chooses words to express an opinion under the identified topic, sentiment label, weight, and window, so WJST draws a word from the per-corpus word distribution, which depends on the topic, sentiment label, weight, and window. Words with different topics may have different window sizes. For example, a word with topic memory has a smaller window size than a word with topic mobile because topic mobile is more general than topic memory and can cover it. So, the topic affects the window size.
Table 2.
A summary of notations used in WJST.
| Symbol | Description |
|---|---|
| Collections | |
| R | Set of all documents |
| V | Vocabulary set |
| Q | Set of all distinct windows (with different sizes) |
| Z | Set of all topics |
| E | Set of all distinct weights |
| S | Set of all sentiment labels |
| Init parameters | |
| q | Window variable |
| e | Weight variable |
| r | Document variable |
| z | Topic variable |
| w | Word variable |
| s | Sentiment variable |
| Distributions | |
| θ | Probability of z given s and r |
| φ | Probability of w given z, s, q, and e |
| π | Probability of s given r (s0 = Positive label, s1 = Negative label) |
| ψ | Probability of e given z, s, and r |
| ξ | Probability of q given z and r |
| Hyper parameters | |
| α | Dirichlet prior distribution for θ |
| β | Dirichlet prior distribution for φ |
| γ | Dirichlet prior distribution for π |
| δ | Dirichlet prior distribution for ψ |
| μ | Dirichlet prior distribution for ξ |
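The four-step generative story above can be sketched directly in code; this is an illustrative rendering of ours (array shapes and names are assumptions), not the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

def generate_document(n_words, pi_r, theta_r, psi_r, xi_r, phi):
    """Generate one document under WJST's generative story (Section 3.3).
    pi_r:    per-document sentiment distribution, shape (S,)
    theta_r: per-document topic distribution given sentiment, shape (S, Z)
    psi_r:   weight distribution given sentiment and topic, shape (S, Z, E)
    xi_r:    window distribution given topic, shape (Z, Q)
    phi:     per-corpus word distribution, shape (Z, S, Q, E, V)"""
    words = []
    for _ in range(n_words):
        s = rng.choice(len(pi_r), p=pi_r)                # 1. sentiment label
        z = rng.choice(theta_r.shape[1], p=theta_r[s])   # 2. topic given s
        e = rng.choice(psi_r.shape[2], p=psi_r[s, z])    # 3. weight given s, z
        q = rng.choice(xi_r.shape[1], p=xi_r[z])         # 3. window given z
        w = rng.choice(phi.shape[4], p=phi[z, s, q, e])  # 4. word
        words.append((w, z, s, q, e))
    return words

# Demo with random (normalized) distributions for S=2, Z=3, Q=2, E=3, V=10.
def rand_dist(*shape):
    x = rng.random(shape)
    return x / x.sum(axis=-1, keepdims=True)

doc = generate_document(5, rand_dist(2), rand_dist(2, 3), rand_dist(2, 3, 3),
                        rand_dist(3, 2), rand_dist(3, 2, 2, 3, 10))
print(doc)
```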
Words with different topics may have different weights. For example, the word size in topic memory is significant and has considerable weight because all customers like memories with larger capacity. The word size in topic mobile is not as important as in topic memory and has a small weight there, because some customers may like mobile phones with a small size (iPhone 6s) and others may like mobile phones with a large size (iPhone 6s+). So, the topic affects the weight. Words with different sentiment labels may also have different weights. For example, suppose topic memory contains the two words size and cost. If the word size is positive, positive size will be more important than the word cost, and its weight will be larger than that of cost. If the word size is negative, positive cost will be more important than the word size, and its weight will be larger than that of size. Positive size means using words like large and big, because customers like memories with larger capacity. Negative size means using words like small, because customers do not like memories with smaller capacity. Positive cost means using words like low and cheap, because customers like low-priced memories. Negative cost means using words like high and expensive, because customers do not like high-priced memories. So, the sentiment label affects the weight. The proposed model in this study is parametric [63], and the number of topics is constant. The generative model of WJST is demonstrated in Figure 4.
Figure 4.

The formal definition of the process of generating words in WJST.
The symbols Multi and Dir denote Multinomial and Dirichlet distributions, respectively. Five sets of latent variables θ, φ, π, ψ, and ξ need to be inferred. The hyperparameters α, β, γ, δ, and μ are set based on experience and can be viewed as prior observation counts before observing any actual words. The latent parameters z, s, q, e, θ, φ, π, ψ, and ξ need to be approximated from the observed variables. The plate notation of WJST is exhibited in Figure 5. Plate notation is a method for expressing repeating variables in a graphical model; such a probabilistic model shows the conditional dependencies among the random variables as a graph.
Figure 5.

The plate notation of WJST.
According to Figure 5, the joint probability distributions for the model WJST can be factored as follows:
$$P(w, z, s, q, e) = P(w \mid z, s, q, e)\, P(z \mid s, r)\, P(s \mid r)\, P(q \mid z, r)\, P(e \mid z, s, r) \qquad (1)$$
where by integrating out φ, we achieve:
$$P(w \mid z, s, q, e) = \left(\frac{\Gamma(|V|\beta)}{\Gamma(\beta)^{|V|}}\right)^{|Z||S||Q||E|} \prod_{z,s,q,e} \frac{\prod_{w}\Gamma(N_{w,z,s,q,e}+\beta)}{\Gamma(N_{z,s,q,e}+|V|\beta)} \qquad (2)$$
where |V| is the vocabulary size, |S| is the number of sentiment labels, |Z| is the number of topics, |Q| is the number of distinct windows, and |E| is the number of weights. The symbol Nw,z,s,q,e is the number of times the word w has been assigned to topic z, window q, weight e, and sentiment s; Nz,s,q,e is the number of words with topic z, window q, weight e, and sentiment s; β is the Dirichlet prior for φ; and Γ is the gamma function. In addition, by integrating out θ, we achieve:
$$P(z \mid s, r) = \left(\frac{\Gamma(|Z|\alpha)}{\Gamma(\alpha)^{|Z|}}\right)^{|S||R|} \prod_{s,r} \frac{\prod_{z}\Gamma(N_{z,s,r}+\alpha)}{\Gamma(N_{s,r}+|Z|\alpha)} \qquad (3)$$
where |R| is the number of documents, Nz,s,r is the number of words with topic z and sentiment s in document r, Ns,r is the number of words with sentiment s in document r, and α is the Dirichlet prior for θ. And by integrating out π, we achieve:
$$P(s \mid r) = \left(\frac{\Gamma(|S|\gamma)}{\Gamma(\gamma)^{|S|}}\right)^{|R|} \prod_{r} \frac{\prod_{s}\Gamma(F_{s,r}+\gamma)}{\Gamma(F_{r}+|S|\gamma)} \qquad (4)$$
where Fs,r is the effect of the words with sentiment s in document r, computed from ew,s,r and qw,s,r, where ew,s,r is the weight of word w with sentiment s in document r and qw,s,r is the window size of word w with sentiment s in document r. The symbol Fr is the sum of the effects of words with different sentiments (positive and negative) in document r, which is equal to ∑s∈{positive, negative} Fs,r. The symbol γ is the Dirichlet prior for π. And by integrating out ξ, we achieve:
$$P(q \mid z, r) = \left(\frac{\Gamma(|Q|\mu)}{\Gamma(\mu)^{|Q|}}\right)^{|Z||R|} \prod_{z,r} \frac{\prod_{q}\Gamma(N_{q,z,r}+\mu)}{\Gamma(N_{z,r}+|Q|\mu)} \qquad (5)$$
where |Q| is the number of distinct windows, Nq,z,r is the number of words with topic z and window q in document r, Nz,r is the number of words with topic z in document r, and μ is the Dirichlet prior for ξ. And by integrating out ψ, we achieve:
$$P(e \mid z, s, r) = \left(\frac{\Gamma(|E|\delta)}{\Gamma(\delta)^{|E|}}\right)^{|Z||S||R|} \prod_{z,s,r} \frac{\prod_{e}\Gamma(N_{e,z,s,r}+\delta)}{\Gamma(N_{z,s,r}+|E|\delta)} \qquad (6)$$
where |E| is the number of weights, Ne,z,s,r is the number of words with topic z, weight e, and sentiment s in document r, Nz,s,r is the number of words with sentiment s and topic z in document r, and δ is the Dirichlet prior for ψ. To estimate the parameters φ, θ, π, ξ, and ψ, we need to evaluate the above distributions. These distributions are difficult to assess directly, so we adopt Gibbs sampling to perform approximate inference. Gibbs sampling is a widely used inference technique and a popular approach for parameter estimation in many topic models such as LDA [7]; its advantage is that it is simple and easy to implement. In this study, Gibbs sampling is used to estimate the distributions of the latent variables. The pseudocode of the Gibbs sampling algorithm for the proposed model is given in Figure 6 (a condensed code sketch follows it), and the meanings of all variables are listed in Table 2. The algorithm samples each variable (z, s, q, and e) according to the following formula, obtained by replacing the terms in equation (1) with those from equations (2)–(6) and canceling factors:
$$P(z_i, s_i, q_i, e_i \mid z_{r,\neg i}, s_{r,\neg i}, q_{r,\neg i}, e_{r,\neg i}, w) \propto \frac{N_{w,z,s,q,e}^{\neg i}+\beta}{N_{z,s,q,e}^{\neg i}+|V|\beta} \cdot \frac{N_{z,s,r}^{\neg i}+\alpha}{N_{s,r}^{\neg i}+|Z|\alpha} \cdot \frac{F_{s,r}^{\neg i}+\gamma}{F_{r}^{\neg i}+|S|\gamma} \cdot \frac{N_{q,z,r}^{\neg i}+\mu}{N_{z,r}^{\neg i}+|Q|\mu} \cdot \frac{N_{e,z,s,r}^{\neg i}+\delta}{N_{z,s,r}^{\neg i}+|E|\delta} \qquad (7)$$
where z_{r,¬i}, s_{r,¬i}, q_{r,¬i}, and e_{r,¬i} denote the topic, sentiment, window, and weight assignments, respectively, for all words in the collection except the word at position i in document r, and the superscript ¬i indicates that the counts exclude that word. Posterior inference of the parameters is performed using Gibbs sampling, as demonstrated in Figure 6.
Figure 6.

Adopted Gibbs sampling for WJST1.
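A condensed sketch of the per-word sampling step in equation (7) is given below; the dictionary of count tensors, their names, and the bookkeeping around the call (decrementing the old assignment, incrementing the new one) are our own assumptions, not the authors' code.

```python
import numpy as np

def gibbs_step(w, r, N, hp, dims, rng):
    """Draw (z, s, q, e) for word w in document r from the full conditional
    in equation (7). Assumed count-tensor shapes: wzsqe (V,Z,S,Q,E),
    zsqe (Z,S,Q,E), zsr (Z,S,R), sr (S,R), Fsr (S,R), Fr (R,),
    qzr (Q,Z,R), zr (Z,R), ezsr (E,Z,S,R); all counts exclude the word's
    current assignment (the "minus i" counts)."""
    V, Z, S, Q, E = dims
    p = ((N["wzsqe"][w] + hp["beta"]) / (N["zsqe"] + V * hp["beta"])       # P(w|z,s,q,e)
         * ((N["zsr"][:, :, r] + hp["alpha"])
            / (N["sr"][:, r] + Z * hp["alpha"]))[:, :, None, None]         # P(z|s,r)
         * ((N["Fsr"][:, r] + hp["gamma"])
            / (N["Fr"][r] + S * hp["gamma"]))[None, :, None, None]         # P(s|r)
         * ((N["qzr"][:, :, r].T + hp["mu"])
            / (N["zr"][:, r, None] + Q * hp["mu"]))[:, None, :, None]      # P(q|z,r)
         * ((N["ezsr"][:, :, :, r].transpose(1, 2, 0) + hp["delta"])
            / (N["zsr"][:, :, r, None] + E * hp["delta"]))[:, :, None, :]) # P(e|z,s,r)
    p = p.ravel() / p.sum()                  # normalize over all (z, s, q, e)
    return np.unravel_index(rng.choice(p.size, p=p), (Z, S, Q, E))
```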
In the initialization step, the method sets the parameters randomly. A sentiment dictionary is employed for initializing the sentiment labels; it contains words and scores that specify positive and negative labels and their weights. In this study, AFINN [64] is used as the sentiment dictionary, which improves the model's accuracy. At the end of the sampling algorithm, each word has a weight and a sentiment label, so a dictionary of words and sentiment scores (weights and sentiment labels) can be generated. The scores are extracted from a dataset based on P(w| s, e): for each word, the weight and sentiment with the highest probability across all documents are selected as its sentiment scores. Adaptive Lexicon learning using Genetic Algorithm (ALGA) [46] uses a genetic algorithm to generate a sentiment dictionary, whereas WJST uses topic modeling to generate it. In WJST, the window size differs between words. At each step of the sampling algorithm, count variables such as Fs,r and Fr are updated after sampling the sentiment label, weight, and window size. After sampling completes, the distributions of the latent variables (φ, θ, π, ξ, and ψ) can be calculated as follows:
$$\varphi_{w,z,s,q,e} = \frac{N_{w,z,s,q,e}+\beta}{N_{z,s,q,e}+|V|\beta} \qquad (8)$$

$$\theta_{z,s,r} = \frac{N_{z,s,r}+\alpha}{N_{s,r}+|Z|\alpha} \qquad (9)$$

$$\pi_{s,r} = \frac{F_{s,r}+\gamma}{F_{r}+|S|\gamma} \qquad (10)$$

$$\xi_{q,z,r} = \frac{N_{q,z,r}+\mu}{N_{z,r}+|Q|\mu}, \qquad \psi_{e,z,s,r} = \frac{N_{e,z,s,r}+\delta}{N_{z,s,r}+|E|\delta} \qquad (11)$$
The probability of a word given a topic is obtained from φ, and the probability of a sentiment label given a document, which is used for sentiment classification at the document level, is calculated using π.
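A small sketch of this document-level classification is shown below; it evaluates π via equation (10) and labels each document by the larger of P(+) and P(−). The array layout is an assumption of ours.

```python
import numpy as np

def classify_documents(F_sr, gamma, n_sentiments=2):
    """pi_{s,r} = (F_{s,r} + gamma) / (F_r + |S| * gamma)  (equation (10));
    F_sr has shape (S, R), with row 0 = positive and row 1 = negative."""
    pi = (F_sr + gamma) / (F_sr.sum(axis=0, keepdims=True)
                           + n_sentiments * gamma)
    return np.where(pi[0] > pi[1], "positive", "negative")

# Two toy documents: the first dominated by positive effects, the second negative.
print(classify_documents(np.array([[12.0, 3.0], [4.0, 9.0]]), gamma=1.0))
```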
The time complexity of the proposed method is determined by the Gibbs sampling algorithm, its main procedure. Given the number of words in all documents wALL (wALL=∑r∈R Nr, where Nr is the number of words in document r), the number of topics |Z|, the number of distinct windows |Q|, the number of weights |E|, and the number of sentiment labels |S|, the time complexity of each Gibbs sampling iteration is O(wALL·|S|·|Z|·|Q|·|E|). Given the number of iterations G, the total time complexity of WJST is O(G·wALL·|S|·|Z|·|Q|·|E|). Table 3 compares the different methods in terms of time complexity.
Table 3.
The time complexity of different models.
| Model | Time complexity |
|---|---|
| JST, RJST, TJST, and TS | O(G·wALL·|S|·|Z|) |
| WJST | O(G·wALL·|S|·|Z|·|Q|·|E|) |
3.4. The General Structure of WJST1
A version of WJST called WJST1 is presented in Figure 7. The distributions θ, ξ, and ψ in WJST depend on the document, but in WJST1 they do not. The dependency between documents within a domain is greater than between documents in different domains, and a pattern present in the documents of one domain may not exist in the documents of another. Therefore, calculations on multidomain datasets should be local rather than covering all domains. For example, consider the distributions P(z| s) and P(z| s, r), where z is the topic, s is the sentiment, and r is the document. In the first case, P(z| s), the topic depends only on the sentiment, and the distribution covers all documents in all domains; a topic may be positive in one domain and negative in another. It is therefore better to tie the topic to the documents of a domain rather than to all domains; the topic is then limited to the document (and domain), and contradictions between different domains are eliminated. So, WJST is suitable for multidomain datasets, and WJST1 is a version of WJST suitable for single-domain datasets. According to Figure 7, ξ is the probability of q given z, θ is the probability of z given s, and ψ is the probability of e given z and s, and the joint probability distribution for WJST1 can be factored as follows:
$$P(w, z, s, q, e) = P(w \mid z, s, q, e)\, P(z \mid s)\, P(s \mid r)\, P(q \mid z)\, P(e \mid z, s) \qquad (12)$$
where by integrating out θ, we achieve:
$$P(z \mid s) = \left(\frac{\Gamma(|Z|\alpha)}{\Gamma(\alpha)^{|Z|}}\right)^{|S|} \prod_{s} \frac{\prod_{z}\Gamma(N_{z,s}+\alpha)}{\Gamma(N_{s}+|Z|\alpha)} \qquad (13)$$
where Nz,s is the number of words with topic z and sentiment s, Ns is the number of words with sentiment s, and α is the Dirichlet prior for θ. And by integrating out ξ, we achieve:
$$P(q \mid z) = \left(\frac{\Gamma(|Q|\mu)}{\Gamma(\mu)^{|Q|}}\right)^{|Z|} \prod_{z} \frac{\prod_{q}\Gamma(N_{q,z}+\mu)}{\Gamma(N_{z}+|Q|\mu)} \qquad (14)$$
where Nq,z is the number of words with topic z and window q, Nz is the number of words with topic z, and μ is the Dirichlet prior for ξ. And by integrating out ψ, we achieve:
$$P(e \mid z, s) = \left(\frac{\Gamma(|E|\delta)}{\Gamma(\delta)^{|E|}}\right)^{|Z||S|} \prod_{z,s} \frac{\prod_{e}\Gamma(N_{e,z,s}+\delta)}{\Gamma(N_{z,s}+|E|\delta)} \qquad (15)$$
where Ne,z,s is the number of words with topic z, weight e, and sentiment s, Nz,s is the number of words with sentiment s and topic z, and δ is the Dirichlet prior for ψ. The distributions P(w| z, s, q, e) and P(s| r) are calculated using equations (2) and (4), respectively. After sampling completes, the distributions of the latent variables (θ, ξ, and ψ) are calculated as follows:
$$\theta_{z,s} = \frac{N_{z,s}+\alpha}{N_{s}+|Z|\alpha}, \qquad \xi_{q,z} = \frac{N_{q,z}+\mu}{N_{z}+|Q|\mu}, \qquad \psi_{e,z,s} = \frac{N_{e,z,s}+\delta}{N_{z,s}+|E|\delta} \qquad (16)$$
Figure 7.

The graphical model of WJST1.
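A sketch of these corpus-level estimates, written to match the Dirichlet-multinomial form of equations (8)–(11), is shown below; the tensor layout is an assumption of ours.

```python
import numpy as np

def wjst1_estimates(N_zs, N_qz, N_ezs, alpha, mu, delta):
    """Estimate theta, xi, and psi for WJST1 (equation (16)); unlike WJST,
    none of these distributions condition on the document r.
    N_zs: (Z, S), N_qz: (Q, Z), N_ezs: (E, Z, S) count tensors."""
    Z, Q, E = N_zs.shape[0], N_qz.shape[0], N_ezs.shape[0]
    theta = (N_zs + alpha) / (N_zs.sum(axis=0) + Z * alpha)   # P(z|s)
    xi = (N_qz + mu) / (N_qz.sum(axis=0) + Q * mu)            # P(q|z)
    psi = (N_ezs + delta) / (N_ezs.sum(axis=0) + E * delta)   # P(e|z,s)
    return theta, xi, psi
```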
And φ and π are computed through equations (8) and (10), respectively. Experimental results are demonstrated in the next section.
4. Experimental Results
The present study executes the methods on a computer with an Intel Core i7 CPU and 8 GB of RAM. The proposed models are compared on 13 datasets. Four datasets crawled from Amazon (https://www.amazon.com) reviews are Electronic, Movie, Android, and Automotive. Two MDS datasets [65] are Magazines and Sports. MR is a dataset crawled from the IMDB movie archive [3]. Three UCI datasets [66] are Amazon, Yelp, and IMDB, and three Twitter datasets [46] are STS-Test, SOMD, and Sanders. Data preprocessing consists of (1) lowercasing all words, (2) removing digits, nonalphabetic characters, stop words, and words with too low or too high frequency, and (3) stemming. The details of the datasets are provided in Table 4.
Table 4.
Description of datasets.
| # | Dataset | Number of reviews | Vocabulary size | Number of words |
|---|---|---|---|---|
| 1 | Movie | 400 | 6592 | 41540 |
| 2 | Electronic | 400 | 4501 | 29117 |
| 3 | Automotive | 400 | 3590 | 19733 |
| 4 | Android | 400 | 2173 | 9723 |
| 5 | STS | 359 | 1489 | 3784 |
| 6 | SOMD | 916 | 2013 | 7772 |
| 7 | Sanders | 1224 | 3221 | 14100 |
| 8 | Magazines | 1800 | 8040 | 125387 |
| 9 | Sports | 2000 | 8582 | 113921 |
| 10 | MR | 2000 | 33054 | 733022 |
| 11 | Amazon | 1000 | 1521 | 7296 |
| 12 | IMDB | 1000 | 2556 | 9706 |
| 13 | Yelp | 1000 | 1679 | 7726 |
The number of topics is unknown and is provided as a constant at the start of the Gibbs sampling algorithm. In this study, the Dirichlet priors α, γ, β, and δ are symmetric, and we set the parameter values empirically; this setting demonstrates fairly good performance in our experiments. Table 5 exhibits the initialization of the parameters used in the different algorithms.
Table 5.
Initial values of parameters.
| Model | Parameters |
|---|---|
| JST | Max_iteration:5000; |Z| = 5,10,15,20; α=0.1; γ=0.016 × (average document length); β=0.01; |
| RJST | Max_iteration:5000; |Z| = 5,10,15,20; α=0.1; γ=0.016 × (average document length); β=0.01; |
| TJST | Max_iteration:5000; |Z| = 5,10,15,20; α=0.1; γ=0.016 × (average document length); β=0.01; |
| TS | Max_iteration:5000; |Z| = 5,10,15,20; α=0.1; γ=0.016 × (average document length); β=0.01; |
| WJST | Max_iteration:5000; |Z| = 5,10,15,20; α=0.3; γ=0.016 × (average document length); β=0.01; μ=3; δ=9; E = [−5, +5]; Q = {1,2,3,4,5,6}; |
| WJST1 | Max_iteration:5000; |Z| = 5,10,15,20; α=0.3; γ=0.016 × (average document length); β=0.01; μ=3; δ=9; E = [−5, +5]; Q = {1,2,3,4,5,6}; |
A sentiment dictionary is employed for initializing sentiment labels. Sentiment dictionaries such as AFINN [64], IMDB [67], 8-K [67], and Bing Liu [68, 69] contain words and scores that specify positive and negative labels as well as their weights. In the present study, AFINN is used as the sentiment dictionary, which improves the model's accuracy. Sentiment detection at the document level, perplexity, and topic_coherency, three standard measures used in different papers [7, 70, 71–73], are used to compare the efficacy of the proposed models.
In the present study, Accuracy is computed as (TP + TN)/(TP + FP + TN + FN), where TP is the number of true positives, TN the number of true negatives, FP the number of false positives, and FN the number of false negatives.

The π distribution (equation (10)) determines how likely each comment is to be positive or negative: if the value of P(+) is greater than the value of P(−) for a comment, the comment is labeled positive. The TP, TN, FP, and FN values in the Accuracy formula are derived from these π-based labels; for example, if a comment is positive and is detected as positive by the proposed methods, one unit is added to TP.

So, sentiment analysis (sentiment detection) at the document level is realized using the π distribution (equation (10)), and the formula (TP + TN)/(TP + FP + TN + FN) computes the Accuracy.
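A minimal sketch of this accuracy computation follows; with two classes, the formula reduces to the fraction of documents whose π-based label matches the ground truth.

```python
def accuracy(true_labels, predicted_labels):
    """(TP + TN) / (TP + FP + TN + FN) for binary labels."""
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)

print(accuracy(["pos", "neg", "pos", "neg"],
               ["pos", "neg", "neg", "neg"]))  # 0.75
```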
The error can be calculated as (1 − Accuracy). Accuracy, perplexity, and topic_coherency are used for the evaluations in the present study; future work can investigate additional measures such as MSE, MAE, and RMSE.
Furthermore, better methods have lower perplexity and higher topic_coherency. Given a test dataset DTest, the perplexity is computed through
$$\mathrm{Perplexity}(D_{\mathrm{Test}}) = \exp\!\left(-\frac{\sum_{r=1}^{|D_{\mathrm{Test}}|}\log P(w_r)}{\sum_{r=1}^{|D_{\mathrm{Test}}|} N_r}\right) \qquad (17)$$
where wr are the words in document r, Nr is the length of document r, and P(wr) is the probability of the words in document r. A lower value of this formula over a held-out document demonstrates better generalization efficacy. The evaluation results are shown in Tables 6–14, and the proposed models demonstrate better results: the perplexity of the proposed methods is lower than that of the baseline models, and, as reported in Tables 9–12, perplexity decreases as the number of topics increases. Topic_coherency is calculated using
$$C(V(z_i)) = \sum_{m=2}^{M}\sum_{l=1}^{m-1} \log\frac{\mathrm{CODF}\big(v_m^{(z_i)}, v_l^{(z_i)}\big) + 1}{\mathrm{DF}\big(v_l^{(z_i)}\big)}, \qquad \mathrm{topic\_coherency} = \frac{1}{|Z|}\sum_{i=1}^{|Z|} C(V(z_i)) \qquad (18)$$
where V(zi)=(v1(zi),…, vM(zi)) is the list of the M most probable words in topic zi, C(V(zi)) is the topic_coherency of topic zi, Z is the set of all topics, |Z| is the number of distinct topics, DF is the document frequency, and CODF is the co-occurrence count of two words across documents. A smoothing count of 1 is included to avoid taking the logarithm of zero. In the present study, topic_coherency is computed through (18) as the average of the topic_coherency values over Z; a higher value reflects better quality of the detected topics. M is set to 10, and the results are demonstrated in Tables 6–14. Different numbers of topics (5, 10, 15, and 20) and different numbers of distinct windows (1, 2, 3, 4, 5, and 6) are applied to evaluate the models. The baseline methods are JST [8], RJST [10], TJST [8], and TS [9]. The Friedman test [74, 75], a nonparametric multiple comparison test, is used to examine the achievements of the compared methods; it examines the differences between algorithms by assigning the lowest rank to the best approach in minimization problems and the highest rank to the best approach in maximization problems.
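Minimal sketches of both measures, following equations (17) and (18), are given below; they assume the per-document log-likelihoods log P(wr) are already available from the fitted model, and that every top word occurs in at least one document.

```python
import math

def perplexity(log_probs, lengths):
    """Equation (17): exp(-(sum_r log P(w_r)) / (sum_r N_r))."""
    return math.exp(-sum(log_probs) / sum(lengths))

def topic_coherence(top_words, docs):
    """Equation (18) for one topic: top_words are the M most probable words,
    ordered from most to least probable; docs are token lists."""
    doc_sets = [set(d) for d in docs]
    df = {w: sum(w in d for d in doc_sets) for w in top_words}  # DF
    score = 0.0
    for m in range(1, len(top_words)):
        for l in range(m):
            codf = sum(top_words[m] in d and top_words[l] in d
                       for d in doc_sets)                       # CODF
            score += math.log((codf + 1) / df[top_words[l]])
    return score

docs = [["good", "battery"], ["good", "screen"], ["battery", "screen", "good"]]
print(topic_coherence(["good", "battery", "screen"], docs))
# topic_coherency of a model is the average of this score over all topics.
```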
Table 6.
Sentiment classification on Android, Automotive, Electronic, and Movie datasets.
| Android | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Metric\ model | RND | AFINN | RND + AFINN | JST | TJST | RJST | TS | WJST | WJST1 |
| Accuracy1 | 0.48 | 0.6975 | 0.58 | 0.625 | 0.765 | 0.5825 | 0.5425 | 0.795 | 0.865 |
| Accuracy2 | — | — | — | — | — | — | — | 0.7825 | 0.8525 |
| Perplexity | — | — | — | 17.4581 | 19.7185 | 17.4426 | 17.8706 | 14.396 | 14.7631 |
| Topic_coherency | — | — | — | −2.0645 | −0.8536 | −1.9914 | −2.373 | −0.5547 | −0.187 |
| Automotive | |||||||||
| Accuracy1 | 0.4925 | 0.625 | 0.535 | 0.6575 | 0.7675 | 0.615 | 0.5525 | 0.755 | 0.8 |
| Accuracy2 | — | — | — | — | — | — | — | 0.7475 | 0.795 |
| Perplexity | — | — | — | 22.6838 | 24.0385 | 21.8044 | 22.4878 | 18.4612 | 19.0627 |
| Topic_coherency | — | — | — | −1.0158 | −0.4712 | −1.4986 | −0.9008 | −0.9311 | −0.326 |
| Electronic | |||||||||
| Accuracy1 | 0.465 | 0.675 | 0.52 | 0.7025 | 0.76 | 0.5525 | 0.5475 | 0.8625 | 0.8475 |
| Accuracy2 | — | — | — | — | — | — | — | 0.875 | 0.855 |
| Perplexity | — | — | — | 23.3586 | 24.3024 | 23.471 | 24.0239 | 19.2999 | 20.2452 |
| Topic_coherency | — | — | — | −1.5892 | −1.0482 | −1.2996 | −1.2719 | −0.5322 | −1.1683 |
| Movie | |||||||||
| Accuracy1 | 0.525 | 0.595 | 0.555 | 0.7575 | 0.9475 | 0.62 | 0.5425 | 0.8475 | 0.97 |
| Accuracy2 | — | — | — | — | — | — | — | 0.8325 | 0.9675 |
| Perplexity | — | — | — | 25.2787 | 26.5813 | 25.1684 | 25.4488 | 21.0494 | 22.1082 |
| Topic_coherency | — | — | — | −0.4089 | −0.111 | −1.0947 | −1.0214 | −0.0602 | −0.1329 |
Table 7.
Sentiment classification on Magazine, Sport, MR, Amazon, IMDB, and Yelp datasets.
| Magazine | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Metric\ model | RND | AFINN | RND + AFINN | JST | TJST | RJST | TS | WJST | WJST1 |
| Accuracy1 | 0.515 | 0.6522 | 0.5822 | 0.6705 | 0.705 | 0.5411 | 0.5022 | 0.8355 | 0.81 |
| Accuracy2 | — | — | — | — | — | — | — | 0.8372 | 0.8083 |
| Perplexity | — | — | — | 21.8506 | 23.0349 | 21.4593 | 21.3828 | 19.9914 | 21.2095 |
| Topic_coherency | — | — | — | −0.0548 | −0.0348 | −0.0946 | −0.0561 | −0.132 | −0.0077 |
| Sport | |||||||||
| Accuracy1 | 0.5285 | 0.686 | 0.5725 | 0.653 | 0.709 | 0.5565 | 0.5155 | 0.798 | 0.802 |
| Accuracy2 | — | — | — | — | — | — | — | 0.782 | 0.795 |
| Perplexity | — | — | — | 22.874 | 23.1356 | 22.0264 | 22.3361 | 21.968 | 21.4821 |
| Topic_coherency | — | — | — | −0.2234 | −0.0876 | −0.1369 | −0.0544 | −0.1406 | −0.2242 |
| MR | |||||||||
| Accuracy1 | 0.4895 | 0.601 | 0.5455 | 0.613 | 0.62 | 0.51 | 0.5 | 0.821 | 0.8445 |
| Accuracy2 | — | — | — | — | — | — | — | 0.818 | 0.843 |
| Perplexity | — | — | — | 33.8663 | 35.0695 | 35.2359 | 34.6704 | 33.222 | 33.7698 |
| Topic_coherency | — | — | — | −0.021 | −0.0139 | −0.0012 | −0.0409 | −0.001 | −0.0106 |
| Amazon | |||||||||
| Accuracy1 | 0.491 | 0.731 | 0.574 | 0.611 | 0.645 | 0.609 | 0.54 | 0.779 | 0.796 |
| Accuracy2 | — | — | — | — | — | — | — | 0.829 | 0.798 |
| Perplexity | — | — | — | 12.6442 | 13.7316 | 13.349 | 14.2397 | 10.9211 | 12.9946 |
| Topic_coherency | — | — | — | −0.8318 | −4.3775 | −0.4224 | −0.5411 | −0.5874 | −0.2696 |
| IMDB | |||||||||
| Accuracy1 | 0.498 | 0.698 | 0.575 | 0.605 | 0.616 | 0.546 | 0.545 | 0.76 | 0.77 |
| Accuracy2 | — | — | — | — | — | — | — | 0.761 | 0.774 |
| Perplexity | — | — | — | 21.1053 | 20.4419 | 20.8543 | 19.7124 | 14.6719 | 18.9035 |
| Topic_coherency | — | — | — | −1.3334 | −1.4868 | −0.8853 | −1.1246 | −0.9666 | −0.9438 |
| Yelp | |||||||||
| Accuracy1 | 0.506 | 0.689 | 0.559 | 0.579 | 0.614 | 0.561 | 0.547 | 0.737 | 0.773 |
| Accuracy2 | — | — | — | — | — | — | — | 0.726 | 0.769 |
| Perplexity | — | — | — | 15.4565 | 16.7169 | 15.5965 | 15.3614 | 12.2145 | 13.3453 |
| Topic_coherency | — | — | — | −2.1865 | −2.5609 | −1.9632 | −1.1714 | −2.0815 | −2.3715 |
Table 8.
Sentiment classification on different datasets based on different situations (AFINN and NO_AFINN).
| Android | |||
|---|---|---|---|
| Model | Metric\Dic | AFINN | NO_AFINN |
| WJST | Accuracy1 | 0.7425 | 0.5725 |
| | Accuracy2 | 0.7375 | 0.58 |
| | Perplexity | 15.53 | 16.1551 |
| | Topic_Coh | −2.2654 | −1.7346 |
| WJST1 | Accuracy1 | 0.855 | 0.81 |
| | Accuracy2 | 0.8475 | 0.8075 |
| | Perplexity | 16.3399 | 16.0482 |
| | Topic_Coh | −2.2228 | −0.1295 |
| Automotive | | | |
| WJST | Accuracy1 | 0.7125 | 0.6025 |
| | Accuracy2 | 0.7025 | 0.6075 |
| | Perplexity | 20.4488 | 20.4065 |
| | Topic_Coh | −3.2282 | −1.6628 |
| WJST1 | Accuracy1 | 0.7925 | 0.7025 |
| | Accuracy2 | 0.79 | 0.7125 |
| | Perplexity | 20.5213 | 21.1296 |
| | Topic_Coh | −0.326 | −1.1809 |
| Electronic | | | |
| WJST | Accuracy1 | 0.8525 | 0.705 |
| | Accuracy2 | 0.8425 | 0.6825 |
| | Perplexity | 20.0579 | 20.2615 |
| | Topic_Coh | −0.5322 | −0.5926 |
| WJST1 | Accuracy1 | 0.8475 | 0.76 |
| | Accuracy2 | 0.855 | 0.765 |
| | Perplexity | 21.8195 | 21.5739 |
| | Topic_Coh | −1.5586 | −1.6968 |
| Movie | | | |
| WJST | Accuracy1 | 0.8475 | 0.71 |
| | Accuracy2 | 0.8325 | 0.715 |
| | Perplexity | 22.4588 | 22.6124 |
| | Topic_Coh | −0.9342 | −0.4637 |
| WJST1 | Accuracy1 | 0.9575 | 0.485 |
| | Accuracy2 | 0.945 | 0.4875 |
| | Perplexity | 23.5662 | 22.8134 |
| | Topic_Coh | −1.2359 | −1.3717 |
Table 9.
Sentiment classification on the Android dataset according to the different number of topics.
| Model | Metric\topic | 5 | 10 | 15 | 20 |
|---|---|---|---|---|---|
| RND | Accuracy | 0.48 | 0.48 | 0.48 | 0.48 |
| AFINN | Accuracy | 0.6975 | 0.6975 | 0.6975 | 0.6975 |
| AFINN + RND | Accuracy | 0.58 | 0.58 | 0.58 | 0.58 |
| Bing_Liu | Accuracy | 0.6975 | 0.6975 | 0.6975 | 0.6975 |
| Bing_Liu + RND | Accuracy | 0.5775 | 0.5775 | 0.5775 | 0.5775 |
| IMDB | Accuracy | 0.7025 | 0.7025 | 0.7025 | 0.7025 |
| IMDB + RND | Accuracy | 0.6125 | 0.6125 | 0.6125 | 0.6125 |
| 8K | Accuracy | 0.5425 | 0.5425 | 0.5425 | 0.5425 |
| 8K + RND | Accuracy | 0.515 | 0.515 | 0.515 | 0.515 |
| JST | Accuracy | 0.625 | 0.6175 | 0.6225 | 0.6125 |
| | Perplexity | 19.6726 | 19.5187 | 19.182 | 17.4581 |
| | Topic_Coh | −4.6026 | −2.5848 | −2.2753 | −2.0645 |
| TJST | Accuracy | 0.7575 | 0.7175 | 0.765 | 0.7475 |
| | Perplexity | 21.1487 | 20.3726 | 20.1516 | 19.7185 |
| | Topic_Coh | −0.8536 | −1.568 | −3.4285 | −2.8739 |
| RJST | Accuracy | 0.5825 | 0.54 | 0.555 | 0.5325 |
| | Perplexity | 19.9429 | 19.1915 | 18.137 | 17.4426 |
| | Topic_Coh | −3.3792 | −1.9914 | −3.403 | −3.4386 |
| TS | Accuracy | 0.5425 | 0.5275 | 0.53 | 0.5175 |
| | Perplexity | 20.4934 | 19.1762 | 18.3833 | 17.8706 |
| | Topic_Coh | −3.9618 | −3.2137 | −2.373 | −2.569 |
| WJST | Accuracy1 | 0.7925 | 0.7425 | 0.795 | 0.79 |
| | Accuracy2 | 0.7775 | 0.7375 | 0.775 | 0.7825 |
| | Perplexity | 16.7303 | 15.53 | 14.6351 | 14.396 |
| | Topic_Coh | −0.5547 | −2.2654 | −2.9122 | −2.6465 |
| WJST1 | Accuracy1 | 0.81 | 0.855 | 0.865 | 0.85 |
| | Accuracy2 | 0.7925 | 0.8475 | 0.8525 | 0.8375 |
| | Perplexity | 16.6787 | 16.3399 | 15.6662 | 14.7631 |
| | Topic_Coh | −0.187 | −2.2228 | −1.5703 | −2.0204 |
Table 10.
Sentiment classification on the Automotive dataset according to the different number of topics.
| Model | Metric\topic | 5 | 10 | 15 | 20 |
|---|---|---|---|---|---|
| RND | Accuracy | 0.4925 | 0.4925 | 0.4925 | 0.4925 |
| AFINN | Accuracy | 0.625 | 0.625 | 0.625 | 0.625 |
| AFINN + RND | Accuracy | 0.535 | 0.535 | 0.535 | 0.535 |
| Bing_Liu | Accuracy | 0.64 | 0.64 | 0.64 | 0.64 |
| Bing_Liu + RND | Accuracy | 0.535 | 0.535 | 0.535 | 0.535 |
| IMDB | Accuracy | 0.59 | 0.59 | 0.59 | 0.59 |
| IMDB + RND | Accuracy | 0.4825 | 0.4825 | 0.4825 | 0.4825 |
| 8K | Accuracy | 0.5025 | 0.5025 | 0.5025 | 0.5025 |
| 8K + RND | Accuracy | 0.48 | 0.48 | 0.48 | 0.48 |
| JST | Accuracy | 0.6275 | 0.6575 | 0.6325 | 0.5975 |
| | Perplexity | 24.7154 | 23.6961 | 23.27 | 22.6838 |
| | Topic_Coh | −1.486 | −2.3443 | −1.1354 | −1.0158 |
| TJST | Accuracy | 0.76 | 0.7375 | 0.7675 | 0.7275 |
| | Perplexity | 25.5637 | 25.0633 | 24.5749 | 24.0385 |
| | Topic_Coh | −0.59 | −0.4712 | −1.6252 | −1.9377 |
| RJST | Accuracy | 0.615 | 0.55 | 0.54 | 0.535 |
| | Perplexity | 25.2654 | 23.8316 | 22.3905 | 21.8044 |
| | Topic_Coh | −1.5779 | −1.4986 | −2.0833 | −2.9936 |
| TS | Accuracy | 0.5425 | 0.5375 | 0.5525 | 0.55 |
| | Perplexity | 25.0705 | 23.8504 | 22.9364 | 22.4878 |
| | Topic_Coh | −2.5941 | −0.9008 | −6.2652 | −4.6902 |
| WJST | Accuracy1 | 0.755 | 0.7125 | 0.74 | 0.745 |
| | Accuracy2 | 0.7475 | 0.7025 | 0.745 | 0.735 |
| | Perplexity | 21.2008 | 20.4488 | 18.7049 | 18.4612 |
| | Topic_Coh | −1.6542 | −3.2282 | −1.6783 | −0.9311 |
| WJST1 | Accuracy1 | 0.80 | 0.7925 | 0.7925 | 0.7925 |
| | Accuracy2 | 0.79 | 0.79 | 0.79 | 0.795 |
| | Perplexity | 21.4357 | 20.5213 | 20.0481 | 19.0627 |
| | Topic_Coh | −1.1883 | −0.326 | −1.084 | −0.8349 |
Table 11.
Sentiment classification on the Electronic dataset according to the different number of topics.
| Model | Metric\topic | 5 | 10 | 15 | 20 |
|---|---|---|---|---|---|
| RND | Accuracy | 0.465 | 0.465 | 0.465 | 0.465 |
| AFINN | Accuracy | 0.675 | 0.675 | 0.675 | 0.675 |
| AFINN + RND | Accuracy | 0.52 | 0.52 | 0.52 | 0.52 |
| Bing_Liu | Accuracy | 0.695 | 0.695 | 0.695 | 0.695 |
| Bing_Liu + RND | Accuracy | 0.535 | 0.535 | 0.535 | 0.535 |
| IMDB | Accuracy | 0.6375 | 0.6375 | 0.6375 | 0.6375 |
| IMDB + RND | Accuracy | 0.5475 | 0.5475 | 0.5475 | 0.5475 |
| 8K | Accuracy | 0.5075 | 0.5075 | 0.5075 | 0.5075 |
| 8K + RND | Accuracy | 0.5075 | 0.5075 | 0.5075 | 0.5075 |
| JST | Accuracy | 0.675 | 0.7025 | 0.6475 | 0.6275 |
| | Perplexity | 25.227 | 24.6719 | 24.6115 | 23.3586 |
| | Topic_Coh | −1.5892 | −4.1751 | −2.6007 | −2.1239 |
| TJST | Accuracy | 0.75 | 0.74 | 0.73 | 0.76 |
| | Perplexity | 25.2091 | 24.8028 | 24.4092 | 24.3024 |
| | Topic_Coh | −1.0482 | −1.8084 | −1.6694 | −1.8297 |
| RJST | Accuracy | 0.5525 | 0.55 | 0.5375 | 0.53 |
| | Perplexity | 25.3113 | 24.1779 | 23.8378 | 23.471 |
| | Topic_Coh | −1.2996 | −1.3991 | −4.472 | −5.4363 |
| TS | Accuracy | 0.54 | 0.5375 | 0.5475 | 0.515 |
| | Perplexity | 26.2295 | 24.885 | 24.5048 | 24.0239 |
| | Topic_Coh | −1.2719 | −1.6586 | −2.5403 | −3.2174 |
| WJST | Accuracy1 | 0.8625 | 0.8525 | 0.7675 | 0.675 |
| | Accuracy2 | 0.875 | 0.8425 | 0.755 | 0.665 |
| | Perplexity | 20.5489 | 20.0579 | 19.9009 | 19.2999 |
| | Topic_Coh | −2.0442 | −0.5322 | −1.3702 | −1.0887 |
| WJST1 | Accuracy1 | 0.79 | 0.8475 | 0.8075 | 0.8275 |
| | Accuracy2 | 0.80 | 0.855 | 0.8125 | 0.8225 |
| | Perplexity | 22.6012 | 21.8195 | 20.8551 | 20.2452 |
| | Topic_Coh | −1.1683 | −1.5586 | −1.6646 | −1.4681 |
Table 12.
Sentiment classification on the Movie dataset according to the different number of topics.
| Model | Metric\topic | 5 | 10 | 15 | 20 |
|---|---|---|---|---|---|
| RND | Accuracy | 0.525 | 0.525 | 0.525 | 0.525 |
| AFINN | Accuracy | 0.595 | 0.595 | 0.595 | 0.595 |
| AFINN + RND | Accuracy | 0.555 | 0.555 | 0.555 | 0.555 |
| Bing_Liu | Accuracy | 0.635 | 0.635 | 0.635 | 0.635 |
| Bing_Liu + RND | Accuracy | 0.565 | 0.565 | 0.565 | 0.565 |
| IMDB | Accuracy | 0.6425 | 0.6425 | 0.6425 | 0.6425 |
| IMDB + RND | Accuracy | 0.5975 | 0.5975 | 0.5975 | 0.5975 |
| 8K | Accuracy | 0.5025 | 0.5025 | 0.5025 | 0.5025 |
| 8K + RND | Accuracy | 0.51 | 0.51 | 0.51 | 0.51 |
| JST | Accuracy | 0.7575 | 0.6375 | 0.7175 | 0.6325 |
| | Perplexity | 27.3111 | 26.9145 | 26.2463 | 25.2787 |
| | Topic_Coh | −1.0123 | −0.4089 | −1.2434 | −1.226 |
| TJST | Accuracy | 0.915 | 0.9425 | 0.9475 | 0.9325 |
| | Perplexity | 27.791 | 27.4122 | 26.993 | 26.5813 |
| | Topic_Coh | −0.111 | −1.6632 | −0.199 | −0.8503 |
| RJST | Accuracy | 0.62 | 0.54 | 0.5175 | 0.5175 |
| | Perplexity | 27.6284 | 26.5172 | 26.1489 | 25.1684 |
| | Topic_Coh | −2.6713 | −1.0947 | −1.979 | −2.3544 |
| TS | Accuracy | 0.5425 | 0.515 | 0.5175 | 0.5175 |
| | Perplexity | 27.6707 | 26.4786 | 26.1138 | 25.4488 |
| | Topic_Coh | −1.5025 | −1.3799 | −1.0214 | −4.6598 |
| WJST | Accuracy1 | 0.7225 | 0.8475 | 0.7525 | 0.6975 |
| | Accuracy2 | 0.71 | 0.8325 | 0.7575 | 0.6875 |
| | Perplexity | 23.1145 | 22.4588 | 21.2982 | 21.0494 |
| | Topic_Coh | −0.0751 | −0.9342 | −0.0602 | −0.4873 |
| WJST1 | Accuracy1 | 0.97 | 0.9575 | 0.9675 | 0.9675 |
| | Accuracy2 | 0.9625 | 0.945 | 0.9675 | 0.96 |
| | Perplexity | 24.6302 | 23.5662 | 22.7180 | 22.1082 |
| | Topic_Coh | −0.1348 | −1.2359 | −0.1329 | −0.7102 |
Table 13.
Sentiment classification on different datasets according to the different number of distinct windows (before random selection).
| Android | |||||||
|---|---|---|---|---|---|---|---|
| Model | Metric\window | 1 | 2 | 3 | 4 | 5 | 6 |
| WJST | Accuracy1 | 0.8375 | 0.78 | 0.7925 | 0.7425 | 0.725 | 0.9075 |
| | Accuracy2 | 0.83 | 0.77 | 0.7775 | 0.735 | 0.725 | 0.9075 |
| | Perplexity | 18.3948 | 16.7266 | 16.7303 | 16.166 | 15.9499 | 15.1128 |
| | Topic_Coh | −1.4063 | −0.9279 | −0.5547 | −1.2929 | −1.7475 | −0.746 |
| WJST1 | Accuracy1 | 0.8975 | 0.8525 | 0.81 | 0.8675 | 0.755 | 0.8025 |
| | Accuracy2 | 0.8825 | 0.83 | 0.7925 | 0.8675 | 0.75 | 0.7875 |
| | Perplexity | 19.3734 | 18.0031 | 16.6787 | 16.8518 | 16.1015 | 16.5461 |
| | Topic_Coh | −2.2084 | −1.6272 | −0.1870 | −0.7753 | −1.328 | −1.7836 |
| Automotive |||||||
| WJST | Accuracy1 | 0.7775 | 0.74 | 0.755 | 0.735 | 0.7375 | 0.69 |
| | Accuracy2 | 0.7725 | 0.745 | 0.7475 | 0.745 | 0.725 | 0.685 |
| | Perplexity | 23.6365 | 21.7712 | 21.2008 | 20.9688 | 20.5095 | 19.6748 |
| | Topic_Coh | −1.1684 | −1.7095 | −1.6542 | −0.2824 | −1.7604 | −0.1311 |
| WJST1 | Accuracy1 | 0.805 | 0.8125 | 0.8 | 0.7575 | 0.78 | 0.7725 |
| | Accuracy2 | 0.805 | 0.7975 | 0.79 | 0.755 | 0.78 | 0.7725 |
| | Perplexity | 23.2684 | 22.2601 | 21.4357 | 20.8092 | 20.5091 | 20.2379 |
| | Topic_Coh | −1.7318 | −0.5912 | −1.1883 | −0.7637 | −0.62 | −0.5134 |
| Electronic |||||||
| WJST | Accuracy1 | 0.845 | 0.7675 | 0.8625 | 0.7325 | 0.7775 | 0.7325 |
| | Accuracy2 | 0.845 | 0.7575 | 0.875 | 0.7275 | 0.7825 | 0.7225 |
| | Perplexity | 22.561 | 22.0462 | 20.5489 | 20.7952 | 20.7495 | 20.0283 |
| | Topic_Coh | −0.8471 | −0.6207 | −2.0442 | −1.4345 | −0.786 | −0.9412 |
| WJST1 | Accuracy1 | 0.8025 | 0.875 | 0.79 | 0.8675 | 0.8625 | 0.835 |
| | Accuracy2 | 0.7975 | 0.87 | 0.8 | 0.865 | 0.8625 | 0.8375 |
| | Perplexity | 23.5903 | 23.546 | 22.6012 | 22.8559 | 21.8897 | 21.4464 |
| | Topic_Coh | −0.8251 | −0.9581 | −1.1683 | −1.2189 | −0.8492 | −0.3402 |
| Movie |||||||
| WJST | Accuracy1 | 0.855 | 0.815 | 0.7225 | 0.6575 | 0.7 | 0.765 |
| | Accuracy2 | 0.845 | 0.7975 | 0.71 | 0.6625 | 0.685 | 0.765 |
| | Perplexity | 24.5779 | 24.5153 | 23.1145 | 23.1209 | 23.2709 | 22.2328 |
| | Topic_Coh | −0.4063 | −0.0216 | −0.0751 | −2.5351 | −0.0888 | −0.0791 |
| WJST1 | Accuracy1 | 0.9725 | 0.9775 | 0.97 | 0.965 | 0.575 | 0.595 |
| | Accuracy2 | 0.96 | 0.965 | 0.9625 | 0.955 | 0.5875 | 0.5725 |
| | Perplexity | 26.117 | 25.09 | 24.6302 | 23.9778 | 22.7068 | 22.3572 |
| | Topic_Coh | −0.1348 | −0.1348 | −0.1348 | −0.0315 | −0.0378 | −0.0106 |
| Average section |||||||
| WJST | Accuracy1 | 0.8287 | 0.7756 | 0.7831 | 0.7168 | 0.735 | 0.7737 |
| | Accuracy2 | 0.8231 | 0.7675 | 0.7775 | 0.7175 | 0.7293 | 0.77 |
| | Perplexity | 22.2925 | 21.2648 | 20.3986 | 20.2627 | 20.1199 | 19.2621 |
| | Topic_Coh | −0.957 | −0.8199 | −1.082 | −1.3862 | −1.0956 | −0.4743 |
| WJST1 | Accuracy1 | 0.8693 | 0.8793 | 0.8425 | 0.8643 | 0.7431 | 0.7512 |
| | Accuracy2 | 0.8612 | 0.8656 | 0.8362 | 0.8606 | 0.745 | 0.7425 |
| | Perplexity | 23.0872 | 22.2248 | 21.3364 | 21.1236 | 20.3017 | 20.1469 |
| | Topic_Coh | −1.225 | −0.8278 | −0.6696 | −0.6973 | −0.7087 | −0.6619 |
Table 14.
Sentiment classification on different datasets according to the different number of distinct windows (after random selection).
| Android | |||||||
|---|---|---|---|---|---|---|---|
| Model | Metric\window | 1 | 2 | 3 | 4 | 5 | 6 |
| WJST | Accuracy1 | 0.71 | 0.775 | 0.7025 | 0.69 | 0.7175 | 0.675 |
| Accuracy2 | 0.7 | 0.765 | 0.6975 | 0.68 | 0.705 | 0.6675 | |
| Perplexity | 17.791 | 15.8544 | 16.1124 | 15.6932 | 13.8904 | 14.1876 | |
| Topic_Coh | −1.6 | −1.9691 | −1.4526 | −4.6888 | −2.6353 | −1.2267 | |
| WJST1 | Accuracy1 | 0.8025 | 0.85 | 0.845 | 0.8025 | 0.8675 | 0.76 |
| Accuracy2 | 0.7925 | 0.83 | 0.84 | 0.7875 | 0.865 | 0.7525 | |
| Perplexity | 18.9319 | 17.7849 | 17.6145 | 15.8017 | 15.4004 | 15.0917 | |
| Topic_Coh | −1.2666 | −1.6744 | −3.027 | −2.3428 | −1.1215 | −2.0183 | |
|
| |||||||
| Automotive | |||||||
| WJST | Accuracy1 | 0.7425 | 0.7375 | 0.68 | 0.6825 | 0.6725 | 0.64 |
| Accuracy2 | 0.74 | 0.73 | 0.6775 | 0.68 | 0.6625 | 0.6325 | |
| Perplexity | 20.8433 | 20.7212 | 19.6843 | 20.0801 | 19.1691 | 18.7129 | |
| Topic_Coh | −1.4406 | −2.0489 | −2.6701 | −2.2155 | −1.1019 | −1.1651 | |
| WJST1 | Accuracy1 | 0.7825 | 0.76 | 0.755 | 0.7725 | 0.7475 | 0.7675 |
| Accuracy2 | 0.785 | 0.7575 | 0.7475 | 0.775 | 0.7525 | 0.76 | |
| Perplexity | 22.1856 | 21.3601 | 20.6057 | 19.8915 | 19.402 | 19.0827 | |
| Topic_Coh | −1.5638 | −1.474 | −1.1274 | −1.7761 | −1.2042 | −1.1755 | |
|
| |||||||
| Electronic | |||||||
| WJST | Accuracy1 | 0.7975 | 0.72 | 0.76 | 0.6875 | 0.7675 | 0.6825 |
| Accuracy2 | 0.795 | 0.715 | 0.7475 | 0.66 | 0.765 | 0.665 | |
| Perplexity | 21.9171 | 21.0509 | 20.4858 | 20.26 | 19.4089 | 19.5549 | |
| Topic_Coh | −0.8282 | −1.0407 | −0.9593 | −0.9779 | −1.0413 | −0.6259 | |
| WJST1 | Accuracy1 | 0.7475 | 0.7425 | 0.7775 | 0.7425 | 0.7725 | 0.78 |
| Accuracy2 | 0.745 | 0.7475 | 0.78 | 0.74 | 0.7625 | 0.785 | |
| Perplexity | 23.2269 | 22.4573 | 21.7203 | 21.4598 | 20.8683 | 20.5084 | |
| Topic_Coh | −1.7905 | −1.5082 | −1.4854 | −0.9143 | −1.2881 | −1.3489 | |
|
| |||||||
| Movie | |||||||
| WJST | Accuracy1 | 0.7425 | 0.77 | 0.8075 | 0.615 | 0.7 | 0.595 |
| Accuracy2 | 0.7425 | 0.765 | 0.7875 | 0.59 | 0.69 | 0.595 | |
| Perplexity | 23.7529 | 23.322 | 22.1056 | 21.8573 | 20.8467 | 20.9653 | |
| Topic_Coh | −1.0908 | −0.9134 | −0.9145 | −0.6129 | −1.1099 | −0.716 | |
| WJST1 | Accuracy1 | 0.9725 | 0.965 | 0.96 | 0.9675 | 0.96 | 0.9625 |
| Accuracy2 | 0.975 | 0.96 | 0.955 | 0.955 | 0.9575 | 0.9575 | |
| Perplexity | 26.1797 | 25.3138 | 24.5149 | 24.1116 | 23.5025 | 23.3023 | |
| Topic_Coh | −0.0218 | −0.2225 | −1.9033 | −0.0897 | −0.2896 | −0.0569 | |
|
| |||||||
| Average section | |||||||
| WJST | Accuracy1 | 0.7481 | 0.7506 | 0.7375 | 0.6687 | 0.7143 | 0.6481 |
| Accuracy2 | 0.7443 | 0.7437 | 0.7275 | 0.6525 | 0.7056 | 0.64 | |
| Perplexity | 21.076 | 20.2371 | 19.597 | 19.4726 | 18.3287 | 18.3551 | |
| Topic_Coh | −1.2399 | −1.493 | −1.4991 | −2.1237 | −1.4721 | −0.9334 | |
| WJST1 | Accuracy1 | 0.8262 | 0.8293 | 0.8343 | 0.8212 | 0.8368 | 0.8175 |
| Accuracy2 | 0.8243 | 0.8237 | 0.8306 | 0.8143 | 0.8343 | 0.8137 | |
| Perplexity | 22.631 | 21.729 | 21.1138 | 20.3161 | 19.7933 | 19.4962 | |
| Topic_Coh | −1.1606 | −1.2197 | −1.8857 | −1.2807 | −0.9758 | −1.1499 | |
Several methods exist for validating classification and topic modeling approaches; the ones used in this study are the most common and appear in most of the related work. Other validation methods exist as well, and we intend to explore them in future studies. The reasons for choosing the validation methods used here are as follows:
We chose accuracy, perplexity, and coherence score as evaluation metrics because of their popularity in classification and topic modeling problems. Perplexity, computed from the normalized log-likelihood of held-out data, indicates how well a model generalizes to unseen documents. The coherence score measures the degree of semantic similarity between a topic's high-scoring words and helps assess the semantic interpretability of topics based on statistical inference.
The main question we want to answer is whether the proposed methods can improve the performance of text sentiment classification. This study compares the proposed methods with different baselines, including JST and recent representative approaches. Consider a confusion matrix for a classifier that predicts whether a comment carries positive sentiment. Accuracy, the proportion of correctly predicted cases among all cases, is the most popular measure for classification problems and is appropriate when all classes matter equally and true positives and true negatives are of primary interest; its complement, the error, equals 1 − accuracy. Accuracy gives an immediate sense of whether the model is adequately trained and how it behaves overall, and it is a good choice for sentiment classification when the classes in the dataset are almost evenly distributed. In future studies, we will also use metrics such as recall and precision to evaluate the proposed methods.
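For illustration, accuracy and its complement can be read directly off confusion-matrix counts; the counts below are invented for the example:

```python
# Accuracy from a confusion matrix; the counts here are hypothetical.
tp, tn, fp, fn = 160, 150, 40, 50           # true/false positives/negatives
accuracy = (tp + tn) / (tp + tn + fp + fn)  # correctly predicted / all cases
error = 1 - accuracy                        # the complement of accuracy
print(accuracy, error)                      # 0.775 0.225
```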
We use the Friedman test to verify the classification performance by comparing the results of the proposed methods and their competitors. The Friedman test is a nonparametric multiple-comparison test that explores the differences between algorithms, assigning the lowest rank to the best approach in minimization problems and the highest rank to the best approach in maximization problems.
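As a minimal sketch of how such a test can be run, the snippet below applies SciPy's Friedman test to hypothetical per-dataset accuracies of three methods; the numbers are illustrative, not the paper's measurements:

```python
from scipy.stats import friedmanchisquare

# Hypothetical accuracies of three methods, measured on the same five datasets.
jst   = [0.68, 0.76, 0.55, 0.63, 0.70]
wjst  = [0.86, 0.85, 0.77, 0.72, 0.80]
wjst1 = [0.79, 0.85, 0.81, 0.97, 0.83]

stat, p = friedmanchisquare(jst, wjst, wjst1)
print(f"chi2 = {stat:.3f}, p = {p:.4f}")
# A small p-value (e.g. p < 0.01) means the methods' results differ more
# than random variation across datasets would explain.
```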
Topic modeling is one of the most important NLP fields. It aims to explain a textual dataset by decomposing it into two distributions: topics and words. A topic modeling algorithm is a mathematical or statistical model used to infer which topics best represent the data. Review techniques based on human judgment can yield good results but are expensive, time-consuming, and not well defined.
In contrast, the appeal of quantitative metrics such as perplexity is the ability to standardize, automate, and scale the evaluation of topic models. In natural language processing, perplexity is a traditional metric for evaluating topic models: a lower value over held-out documents indicates better generalization.
One drawback of perplexity is its inability to capture context and the relationships between words within a topic or across topics within a document, yet semantic context is important for human understanding. Measures such as topic coherency were designed to tackle this problem by capturing the context between words in a topic. In most articles about topic modeling, topic_coherency is reported as a single number that summarizes the topics' interpretability and is used to assess their quality: the higher the topic_coherency value, the better the quality of the extracted topics.
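To make the perplexity definition concrete, the sketch below computes it from per-word log-likelihoods of a held-out document; the probabilities are hypothetical placeholders, not model output:

```python
import numpy as np

def perplexity(word_log_likelihoods, n_words):
    """exp of the negative, length-normalized held-out log-likelihood."""
    return np.exp(-np.sum(word_log_likelihoods) / n_words)

# Hypothetical per-word probabilities assigned by a trained topic model.
loglik = np.log([0.010, 0.002, 0.050, 0.008])
print(perplexity(loglik, n_words=len(loglik)))  # lower values are better
```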
4.1. Sentiment Scores for the Words in a Dataset
In this section, a dictionary is generated that includes words together with their sentiment scores (weights and sentiment labels). The scores are extracted from datasets based on P(w|s, e): for each word, the weight and sentiment with the highest probability are selected as its sentiment score. The extracted scores for some unigram words can be seen in Tables 15 and 16. ALGA [46] uses a genetic algorithm to generate a sentiment dictionary, whereas the proposed models use topic modeling to create this dictionary. In Tables 15 and 16, ten words from each dataset are selected and scored by the proposed models. For example, the word nice obtains a score of 4 in WJST and 5 in WJST1. The scores differ between the proposed methods; for example, the word serious obtains a score of 1 in WJST and −2 in WJST1. Table 15 relates to the Android, Automotive, Electronic, and Movie datasets, and Table 16 to the STS, Sanders, and SOMD datasets.
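A minimal sketch of this dictionary-extraction step is shown below, assuming an estimated P(w|s, e) stored as an array `phi` of shape (sentiments × weights × vocabulary); `phi`, the tiny vocabulary, and the signed-score convention are illustrative placeholders, not the models' actual implementation:

```python
import numpy as np

rng = np.random.default_rng(0)
vocab = ["nice", "wast", "great"]
# Stand-in for the estimated P(w | s, e): 2 sentiments x 5 weights x |V| words.
phi = rng.dirichlet(np.ones(len(vocab)), size=(2, 5))

sentiment_sign = [+1, -1]     # positive, negative
weight_value = [1, 2, 3, 4, 5]

dictionary = {}
for w, word in enumerate(vocab):
    # Pick the (sentiment, weight) pair with the highest probability for w.
    s, e = np.unravel_index(np.argmax(phi[:, :, w]), phi.shape[:2])
    dictionary[word] = sentiment_sign[s] * weight_value[e]  # signed score
print(dictionary)  # e.g. {'nice': 4, 'wast': -5, 'great': 2}
```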
Table 15.
Sentiment scores for some instance words related to Android, Automotive, Electronic, and Movie datasets.
| Dataset | Android | | Automotive | | Electronic | | Movie | |
|---|---|---|---|---|---|---|---|---|
| Model | Word | Score | Word | Score | Word | Score | Word | Score |
| WJST1 | Nice | 5 | Much | 5 | Satisfy | 5 | See | 5 |
| Cute | 4 | Use | 4 | Crew | 4 | Father | 4 | |
| Favorit | 3 | Long | 3 | Way | 3 | Pray | 3 | |
| Perfect | 2 | Expens | 2 | Fluid | 2 | Human | 2 | |
| Great | 1 | Stuff | 1 | Feel | 1 | Event | 1 | |
| Type | −1 | Fals | −1 | Pull | −1 | Terribl | −1 | |
| Wast | −2 | Serious | −2 | Side | −2 | Sens | −2 | |
| Everi | −3 | Extens | −3 | Nervous | −3 | Lost | −3 | |
| Unknown | −4 | Space | −4 | Even | −4 | Sure | −4 | |
| Everyth | −5 | Know | −5 | Extend | −5 | Injur | −5 | |
|
| ||||||||
| WJST | Nice | 4 | Much | 3 | Satisfy | −2 | See | −4 |
| Cute | 1 | Use | 2 | Crew | 2 | Father | 5 | |
| Favorit | 2 | Long | 4 | Way | 4 | Pray | −3 | |
| Perfect | 2 | Expens | −4 | Fluid | 1 | Human | 5 | |
| Great | 2 | Stuff | −5 | Feel | −3 | Event | 1 | |
| Type | 1 | Fals | −5 | Pull | −4 | Terribl | 4 | |
| Wast | −5 | Serious | 1 | Side | −1 | Sens | −4 | |
| Everi | 4 | Extens | 5 | Nervous | 3 | Lost | 3 | |
| Unknown | 5 | Space | −5 | Even | −2 | Sure | 2 | |
| Everyth | 2 | Know | 4 | Extend | −5 | Injur | −5 | |
Table 16.
Sentiment scores for some instance words related to STS, Sanders, and SOMD datasets.
| Model | Word | STS score | Sanders score | SOMD score |
|---|---|---|---|---|
| WJST1 | Much | −4 | 4 | 2 |
| Good | 5 | 4 | −5 | |
| Bad | −5 | −1 | −5 | |
| Nice | −3 | 5 | 5 | |
| Hate | −5 | −5 | 1 | |
| Love | 5 | 5 | 3 | |
|
| ||||
| WJST | Much | −5 | −5 | −2 |
| Good | −4 | −1 | 3 | |
| Bad | −4 | −1 | 3 | |
| Nice | 3 | 2 | 1 | |
| Hate | −5 | −4 | 2 | |
| Love | 4 | −2 | −3 | |
4.2. Topic Discovery
The topics are extracted from datasets based on P(w|z) in this section. In the proposed models, a topic is a multinomial distribution over words conditioned on topics, sentiments, weights, and window sizes, and the top words approximately reflect the meaning of a topic. Tables 17–19 show examples of topics extracted from the Movie, Android, and Electronic datasets by different models. Each row shows the top 10 words for the corresponding topic and sentiment label; these top 10 words were extracted from each topic and then used to compute topic_coherency. The listed words describe their topic, and the proposed methods achieve higher topic_coherency values than the baselines, indicating better quality of the extracted topics.
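The top-word extraction itself is straightforward; the sketch below sorts a topic-word distribution, assuming a matrix `phi_z` that approximates P(w|z) (both `phi_z` and the tiny vocabulary are invented for illustration):

```python
import numpy as np

vocab = np.array(["godzilla", "monster", "militari", "jesu", "love", "passion"])
rng = np.random.default_rng(1)
phi_z = rng.dirichlet(np.ones(len(vocab)), size=2)  # stand-in for P(w | z)

for z, row in enumerate(phi_z):
    top = vocab[np.argsort(row)[::-1][:3]]  # top-3 here; the paper lists 10
    print(f"topic {z}: {', '.join(top)}")
```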
Table 17.
Top 10 words extracted from the Movie dataset.
| Model | Sentiment | Top 10 words |
|---|---|---|
| WJST | + | jesu, film, God, love, mel, Christian, life, suffer, believ, roman |
| − | movi, godzilla, bad, dvd, origin, horror, buy, version, worst, actor | |
|
| ||
| WJST1 | + | Jesu, mel, passion, mother, stori, realli, great, everyon, God, like |
| − | godzilla, monster, go, time, star, know, kill, make, militari, American | |
|
| ||
| JST | + | mel, stori, mother, two, realli, becom, anoth, God, like, back |
| − | godzilla, look, monster, american, militari, like, worst, zellweg, emmerich, quit | |
Table 18.
Top 10 words extracted from Android dataset.
| Model | Sentiment | Top 10 words |
|---|---|---|
| WJST | + | app, game, sudoku, play, version, enjoy, option, want, hint, like |
| − | work, app, would, fire, live, station, tri, say, select, kindl, load, user | |
|
| ||
| WJST1 | + | sudoku, tri, love, game, time, easi, tablet, star, call, make |
| − | close, tablet, seem, get, year, download, much, station, time, android | |
|
| ||
| JST | + | station, want, even, peopl, work, avail, version, puzzl, custom, believ |
| − | use, app, find, review, great, got, total, new, night, fake | |
Table 19.
Top 10 words extracted from Electronic dataset.
| Model | Sentiment | Top 10 words |
|---|---|---|
| WJST | + | read, book, screen, touch, kindl, page, better, wifi, ebook, like |
| − | work, went, new, servic, need, bad, system, hous, number, mine | |
|
| ||
| WJST1 | + | googl, amazon, book, color, store, kindl, download, small, pdf |
| − | time, work, two, much, one, power, comput, phone, go, unit | |
|
| ||
| JST | + | book, touch, read, page, free, librari, touch, screen, much, pdf |
| − | plug, work, could, devic, comput, charger, router, cabl, item, design | |
4.3. Sentiment Classification at Document-Level
In this section, the number of distinct windows is three, and the models use the AFINN sentiment dictionary in the initialization section of the Gibbs sampling algorithm. A document is classified based on P(s|r), the probability of a sentiment given a document: a document is classified as negative if P(+|r) < P(−|r) and vice versa. The sentiment is determined using two formulas in this paper. In the first formula, P(s|r) = Ns,r/Nr, where Ns,r is the number of words with sentiment s in document r and Nr is the number of words in document r. In the second formula, P(s|r) = Fs,r/Fr, where Fs,r is the effect (weight) of words with sentiment s in document r and Fr is the sum of the effects of words with all sentiments in document r. In all evaluations, accuracy1 is calculated based on the first formula and accuracy2 based on the second. As shown in Figure 8 (and in the sketch that follows it), the document is negative according to the first formula but positive according to the second: although the negative words outnumber the positive ones, the weight of the positive words is larger, so positive words can dominate sentiment analysis at the document level.
Figure 8.

An example of calculating the sentiment of a document using two formulas.
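The sketch below reproduces the situation of Figure 8 with a hypothetical five-word document, showing how the two formulas can disagree:

```python
# Each word is a (sentiment, weight) pair; the document is hypothetical.
doc = [("+", 5), ("+", 4), ("-", 1), ("-", 1), ("-", 1)]  # 2 pos, 3 neg

# Formula 1: P(s|r) = N_{s,r} / N_r, based on word counts.
p_pos_counts = sum(1 for s, _ in doc if s == "+") / len(doc)   # 0.4

# Formula 2: P(s|r) = F_{s,r} / F_r, based on summed word effects (weights).
f_pos = sum(w for s, w in doc if s == "+")                     # 9
f_all = sum(w for _, w in doc)                                 # 12
p_pos_weights = f_pos / f_all                                  # 0.75

print(p_pos_counts, p_pos_weights)  # negative by counts, positive by weights
```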
In this section, the best values for each method (the highest accuracy, the lowest perplexity, and the highest topic_coherency) are selected from Tables 9–12 and listed in Tables 6 and 7. Table 6 compares the models on four datasets (Android, Automotive, Movie, and Electronic), and Table 7 compares them on six datasets (Magazine, Sports, MR, Amazon, IMDB, and Yelp); the results in both tables are evaluated on unigram words. The AFINN method classifies each document according to P(s|r) = Ns,r/Nr, where the word sentiment label is obtained directly from the AFINN sentiment lexicon. The RND method uses the same formula but assigns word sentiment labels randomly, and the AFINN + RND method combines both. The improvement over these methods reflects how much the proposed and baseline methods can learn from a dataset. Tables 6 and 7 show that the proposed models perform better than JST and improve significantly over AFINN and the baselines on all datasets; the accuracies obtained purely from the sentiment lexicon (the AFINN-based methods) are below 70% for most datasets. In this study, perplexity and topic_coherency are not calculated for the AFINN, RND, and AFINN + RND methods. TS and RJST have lower accuracy than the other methods on all datasets, whereas JST and TJST perform better. TJST outperforms JST on all datasets because, in JST, the distribution θ depends on the document, while in TJST, θ does not depend on the document and is estimated globally over all documents. According to Tables 6 and 7, WJST1 has the highest accuracy, outperforming WJST for the analogous reason: in WJST, the distributions θ, ξ, and ψ depend on the document, while in WJST1 they do not and are estimated globally over all documents. The perplexity value varies across datasets because the dataset sizes differ, as listed in Table 4.
The analysis of the Friedman test on the results of Tables 6 and 7 demonstrates a statistically significant difference between the performances of the algorithms in terms of accuracy with χ2(10) = 92.091 and p < 0.01 and in terms of perplexity with χ2(5) = 38.629 and p < 0.01, while the difference in terms of topic_coherency is not statistically significant (χ2(5) = 5.508, p > 0.1). The mean rank of the algorithms based on the Friedman test, shown in Figure 9, indicates that WJST1 ranks first among all the algorithms in terms of accuracy and topic_coherency. In Figure 9, if the experiment seeks a minimum value (perplexity), the Friedman test assigns the lowest rank to the best-performing algorithm; if it seeks a maximum value (accuracy and topic_coherency), it assigns the highest rank to the best-performing algorithm. In Figure 9, -1 denotes accuracy1 and -2 denotes accuracy2. As shown in Figure 10, the average values of accuracy, perplexity, and topic_coherency equal the column averages of Tables 6 and 7 for each method, calculated on the Android, Automotive, Electronic, Movie, Magazine, Sport, MR, Amazon, IMDB, and Yelp datasets. According to the results, WJST has a lower perplexity than the other methods, while WJST1 outperforms WJST and the baselines in terms of accuracy and topic_coherency. In Figure 10 as well, -1 denotes accuracy1 and -2 denotes accuracy2.
Figure 9.

According to the Friedman test, the mean rank of algorithms in Tables 6 and 7.
Figure 10.

Average of sentiment classification values calculated on Android, Automotive, Electronic, Movie, Magazine, Sport, MR, Amazon, IMDB, and Yelp datasets (based on Tables 6 and 7), in terms of accuracy (a), perplexity (b), and topic_coherency (c).
4.4. Evaluation Results According to the Different Situations, with AFINN and NO_AFINN States
In this section, the study examines the impact of using the AFINN dictionary in the initialization part of Gibbs sampling on the proposed models. The evaluation results are shown in Table 8; the number of distinct windows is three, and the number of topics is ten. The largest effect is visible for WJST1 on the Movie dataset, where the accuracy is 0.48 in the NO_AFINN state and 0.95 in the AFINN state. Prior sentiment information affects perplexity and topic_coherency less than accuracy. According to Table 8, using the AFINN dictionary is more effective than the NO_AFINN state, in which no prior sentiment information was incorporated into the models in the initialization section of the Gibbs sampling algorithm.
4.5. Evaluation Results According to the Different Sentiment Dictionaries
In this subsection, the study compares the different dictionaries produced by the proposed models. The output of WJST and WJST1 can be a weighted sentiment dictionary: the dictionary obtained by each method on one dataset is used to evaluate the other datasets, and each document is classified according to P(s|r), where the word sentiment label is obtained directly from the dictionary. Tables 20 and 21 present the impact of the different dictionaries produced by WJST and WJST1, respectively. The methods AFINN + w, Android + w, ELEC + w, Auto + w, and MOV + w classify each document according to P(s|r) = Fs,r/Fr, where the weight and sentiment label are obtained directly from the AFINN, Android, Electronic, Automotive, and Movie lexicons, and the window size is one for all words in all documents. The methods Bing_Liu, 8K, Android, Automotive, ELEC, MOV, and IMDB classify each document according to P(s|r) = Ns,r/Nr, where the word sentiment label is obtained directly from the Bing_Liu, 8K, Android, Automotive, Electronic, Movie, and IMDB lexicons, respectively. Each " + RND" variant (Bing_Liu + RND, IMDB + RND, 8K + RND, Android + RND, Auto + RND, ELEC + RND, and MOV + RND) combines the corresponding lexicon method with the RND method. According to Table 20, the AFINN method achieves the highest accuracy on one dataset, while the proposed methods achieve the highest accuracy on six datasets; in Table 21, the proposed methods achieve the highest accuracy on seven datasets. In the AFINN, Bing_Liu, IMDB, and 8K dictionaries, sentiment and score values are set manually for each word, whereas the proposed models generate dictionaries via topic modeling. Dictionaries such as MOV and ELEC perform best on the datasets from which they were created, so the dictionaries produced by the proposed models are dependent on the application domain.
Table 20.
Sentiment classification using different sentiment dictionaries achieved by WJST.
| Model\dataset | Android | Auto | ELEC | MOV | STS | Sanders | SOMD |
|---|---|---|---|---|---|---|---|
| RND | 0.48 | 0.4925 | 0.465 | 0.525 | 0.5346 | 0.4852 | 0.4847 |
| AFINN | 0.6975 | 0.625 | 0.675 | 0.595 | 0.734 | 0.674 | 0.3395 |
| AFINN + w | 0.685 | 0.58 | 0.6375 | 0.595 | — | — | — |
| AFINN + RND | 0.58 | 0.535 | 0.52 | 0.555 | 0.6038 | 0.5424 | 0.4475 |
| Bing_Liu | 0.6975 | 0.64 | 0.695 | 0.635 | 0.698 | 0.6813 | 0.322 |
| Bing_Liu + RND | 0.5775 | 0.535 | 0.535 | 0.565 | 0.6288 | 0.5416 | 0.4388 |
| IMDB | 0.7025 | 0.59 | 0.6375 | 0.6425 | 0.5512 | 0.5988 | 0.4617 |
| IMDB + RND | 0.6125 | 0.4825 | 0.5475 | 0.5975 | 0.5595 | 0.5196 | 0.4748 |
| 8K | 0.5425 | 0.5025 | 0.5075 | 0.5025 | 0.4986 | 0.5351 | 0.3995 |
| 8K + RND | 0.515 | 0.48 | 0.5075 | 0.51 | 0.5373 | 0.495 | 0.4814 |
| Android | 0.8525 | 0.62 | 0.645 | 0.6125 | 0.6023 | 0.6078 | 0.4712 |
| Android + w | 0.8375 | 0.59 | 0.63 | 0.62 | — | — | — |
| Android + RND | 0.8525 | 0.6275 | 0.6675 | 0.61 | 0.5995 | 0.5865 | 0.4832 |
| Auto | 0.57 | 0.705 | 0.6375 | 0.605 | 0.6023 | 0.5767 | 0.6011 |
| Auto + w | 0.5775 | 0.7 | 0.65 | 0.6075 | — | — | — |
| Auto + RND | 0.5625 | 0.705 | 0.6175 | 0.5875 | 0.6106 | 0.5522 | 0.588 |
| ELEC | 0.6525 | 0.5975 | 0.8425 | 0.4925 | 0.6023 | 0.6029 | 0.5323 |
| ELEC + w | 0.6625 | 0.605 | 0.8275 | 0.5025 | — | — | — |
| ELEC + RND | 0.645 | 0.6075 | 0.8425 | 0.4775 | 0.6023 | 0.6037 | 0.5585 |
| MOV | 0.66 | 0.5925 | 0.6375 | 0.81 | 0.6244 | 0.535 | 0.5127 |
| MOV + w | 0.6725 | 0.59 | 0.6475 | 0.8025 | — | — | — |
| MOV + RND | 0.6525 | 0.5875 | 0.62 | 0.81 | 0.6217 | 0.5416 | 0.5105 |
| STS | 0.51 | 0.58 | 0.575 | 0.615 | 0.624 | 0.4762 | 0.5312 |
| STS + RND | 0.5575 | 0.5625 | 0.5275 | 0.5775 | 0.624 | 0.5138 | 0.54 |
| Sanders | 0.59 | 0.6 | 0.565 | 0.5575 | 0.5607 | 0.7088 | 0.5105 |
| Sanders + RND | 0.5925 | 0.5925 | 0.5825 | 0.5425 | 0.5746 | 0.7088 | 0.5443 |
| SOMD | 0.5525 | 0.5275 | 0.47 | 0.4625 | 0.4859 | 0.5334 | 0.6093 |
| SOMD + RND | 0.5375 | 0.5 | 0.4875 | 0.4725 | 0.5441 | 0.5236 | 0.6093 |
Table 21.
Sentiment classification using different sentiment dictionaries achieved by WJST1.
| Model\dataset | Android | Auto | ELEC | MOV | STS | Sanders | SOMD |
|---|---|---|---|---|---|---|---|
| RND | 0.48 | 0.4925 | 0.465 | 0.525 | 0.5346 | 0.4852 | 0.4847 |
| AFINN | 0.6975 | 0.625 | 0.675 | 0.595 | 0.734 | 0.674 | 0.3395 |
| AFINN + w | 0.685 | 0.58 | 0.6375 | 0.595 | — | — | — |
| AFINN + RND | 0.58 | 0.535 | 0.52 | 0.555 | 0.6038 | 0.5424 | 0.4475 |
| Bing_Liu | 0.6975 | 0.64 | 0.695 | 0.635 | 0.698 | 0.6813 | 0.322 |
| Bing_Liu + RND | 0.5775 | 0.535 | 0.535 | 0.565 | 0.6288 | 0.5416 | 0.4388 |
| IMDB | 0.7025 | 0.59 | 0.6375 | 0.6425 | 0.5512 | 0.5988 | 0.4617 |
| IMDB + RND | 0.6125 | 0.4825 | 0.5475 | 0.5975 | 0.5595 | 0.5196 | 0.4748 |
| 8K | 0.5425 | 0.5025 | 0.5075 | 0.5025 | 0.4986 | 0.5351 | 0.3995 |
| 8K + RND | 0.515 | 0.48 | 0.5075 | 0.51 | 0.5373 | 0.495 | 0.4814 |
| Android | 0.855 | 0.6025 | 0.565 | 0.65 | 0.5857 | 0.5865 | 0.4985 |
| Android + w | 0.835 | 0.6 | 0.6025 | 0.6625 | — | — | — |
| Android + RND | 0.855 | 0.59 | 0.575 | 0.6125 | 0.5967 | 0.5579 | 0.505 |
| Auto | 0.6375 | 0.745 | 0.6175 | 0.63 | 0.594 | 0.6233 | 0.4766 |
| Auto + w | 0.6325 | 0.7375 | 0.65 | 0.635 | — | — | — |
| Auto + RND | 0.6375 | 0.745 | 0.62 | 0.6225 | 0.5829 | 0.6118 | 0.4875 |
| ELEC | 0.65 | 0.5475 | 0.7075 | 0.5825 | 0.594 | 0.6192 | 0.457 |
| ELEC + w | 0.63 | 0.5775 | 0.7125 | 0.63 | — | — | — |
| ELEC + RND | 0.6425 | 0.5575 | 0.7075 | 0.5775 | 0.6134 | 0.5914 | 0.4744 |
| MOV | 0.63 | 0.5525 | 0.5825 | 0.6375 | 0.5663 | 0.5767 | 0.4603 |
| MOV + w | 0.6425 | 0.5475 | 0.6425 | 0.94 | — | — | — |
| MOV + RND | 0.635 | 0.565 | 0.61 | 0.6375 | 0.5718 | 0.571 | 0.4493 |
| STS | 0.585 | 0.5475 | 0.595 | 0.5775 | 0.7431 | 0.5939 | 0.4231 |
| STS + RND | 0.5875 | 0.565 | 0.5975 | 0.5725 | 0.7431 | 0.5743 | 0.4395 |
| Sanders | 0.605 | 0.555 | 0.6 | 0.5375 | 0.6632 | 0.7538 | 0.421 |
| Sanders + RND | 0.6075 | 0.55 | 0.6025 | 0.5375 | 0.6771 | 0.7538 | 0.4559 |
| SOMD | 0.615 | 0.555 | 0.55 | 0.5125 | 0.5773 | 0.6078 | 0.594 |
| SOMD + RND | 0.61 | 0.5275 | 0.5425 | 0.5325 | 0.594 | 0.5767 | 0.594 |
The analysis of the Friedman test on the results of Table 20 demonstrates a statistically significant difference between the performances of the competitors in terms of accuracy with χ2(27) = 70.070 and p < 0.01; likewise, for the algorithms in Table 21, the difference in accuracy is statistically significant with χ2(27) = 79.740 and p < 0.01. The mean ranks of the algorithms are shown in Figure 11. As shown in Figure 12, Average1 equals the average of the values in each row for each method, calculated on the Android, Automotive, Electronic, Movie, STS, Sanders, and SOMD datasets from Tables 20 and 21, while Average2 equals the row averages calculated on the Android, Automotive, Electronic, and Movie datasets only. According to the results, Bing_Liu achieves the highest value in the Average1 column, and the Android, MOV, and Bing_Liu methods have higher accuracy than the other methods.
Figure 11.

The mean rank according to the Friedman test based on results of Tables 20 and 21.
Figure 12.

Average accuracy values are calculated using different sentiment dictionaries on Android, Automotive, Electronic, Movie, STS, Sanders, and SOMD datasets (based on Tables 20 and 21).
4.6. Evaluation Results According to the Different Number of Topics
In this subsection, the proposed models are examined for different numbers of topics (5, 10, 15, and 20). The AFINN dictionary is used, and the number of distinct windows is three. Evaluation results are shown in Tables 9–12; based on these results, the proposed methods outperform the baselines. The results show that increasing the number of topics decreases the perplexity value. WJST1 achieves the highest accuracy on the Movie dataset with 97 percent, while the highest accuracy of WJST on the Movie dataset is 84 percent. The results show that the proposed methods perform better across different topic number settings, especially WJST1 with 97% accuracy at |Z| = 5 on the Movie dataset. Based on the results, WJST has a lower perplexity than the other methods, and WJST1 outperforms WJST and the baselines in terms of accuracy and topic_coherency. TJST performs better than WJST in terms of accuracy, but WJST achieves higher accuracy than JST and the other baselines; this observation shows that modeling the weight and window parameters improves sentiment classification at the document level. According to (18), a lower topic_coherency value indicates extracted topics of worse quality than a higher one: in a coherent topic, the words accurately describe the topic and have a stronger association with one another.
4.7. Evaluation Results According to the Different Number of Distinct Windows
In this subsection, the proposed models are evaluated for different numbers of distinct windows (1, 2, 3, 4, 5, and 6), which are effective for improving the proposed models. In this experiment, the number of topics is five, and the models use the AFINN sentiment dictionary. Based on Table 13, the proposed methods are compared according to accuracy, perplexity, and topic_coherency. The results show that increasing the number of distinct windows decreases the perplexity value. As reported in Table 13, increasing the window size reduces the accuracy because it increases the number of words in the window, and each term may not affect all neighbors in its window.
For instance, as shown in Figure 13, the word terrible has a window of size 3. Table 13 assumes that each word affects all neighbors in its window, so it takes the unigram terrible to affect the unigrams film, last, season, sophie, best, and actress. As Figure 13 shows, however, the word terrible can plausibly affect the unigrams film, last, and season, but not sophie, best, and actress. Finding the words that can be involved in a window is thus a new challenge that we introduce in this study, and two methods are presented: the first assumes that each word affects all neighbors in its window, while the second assumes that each word affects a random subset of its neighbors (see the sketch after Figure 13). In this study, all evaluations are calculated based on the first method, and the second method is used for the evaluation in Table 14.
Figure 13.

An example showing the effect of each word on the neighbors in its window.
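A minimal sketch of the two neighbor-selection methods is given below; the helper, the 0.5 inclusion probability, and the sentence are ours for illustration, not the models' exact procedure:

```python
import random

def neighbors(tokens, i, k, randomized=False, seed=0):
    """Words that the token at position i affects within its size-k window."""
    window = tokens[max(0, i - k):i] + tokens[i + 1:i + 1 + k]
    if randomized:                      # method 2: a random subset of neighbors
        rng = random.Random(seed)
        return [w for w in window if rng.random() < 0.5]
    return window                       # method 1: all neighbors in the window

tokens = "film last season terrible sophie best actress".split()
print(neighbors(tokens, 3, 3))                   # all six neighbors
print(neighbors(tokens, 3, 3, randomized=True))  # a random subset of them
```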
As shown in Table 13, accuracy, perplexity, and topic_coherency values are calculated on the Android, Automotive, Electronic, and Movie datasets before random selection using the first method; Table 14 reports the same values after random selection using the second method. The average values of accuracy, perplexity, and topic_coherency in Tables 13 and 14 equal the column averages for each window size, calculated on the Android, Automotive, Electronic, and Movie datasets. According to the results, the second method is more stable in terms of accuracy and outperforms the first in terms of perplexity, while the first method performs better in terms of accuracy and topic_coherency.
The analysis of the Friedman test on the results of Table 13 demonstrates a statistically significant difference between the window sizes in terms of accuracy with χ2(5) = 27.608 and p < 0.01 and in terms of perplexity with χ2(5) = 35.143 and p < 0.01, while the difference in topic_coherency is not significant (χ2(5) = 6.232, p = 0.284). The mean ranks based on the Friedman test, shown in Figure 14, indicate that w = 1 outperforms the other windows in terms of accuracy but has worse perplexity and topic_coherency ranks, whereas w = 6 outperforms the other windows in perplexity and topic_coherency but has the lowest accuracy. w = 3 offers a middle ground, with accuracy, topic_coherency, and perplexity values between the extremes of the other window sizes (w = 1, 2, 4, 5, and 6); for this reason, all other evaluations in this study use w = 3. The mean ranks for Table 14, shown in Figure 15, indicate that w = 1 ranks first in terms of accuracy; here the Friedman test shows a statistically significant difference in accuracy with χ2(5) = 17.550 and p < 0.01 and in perplexity with χ2(5) = 37.857 and p < 0.01, while the difference in topic_coherency is not significant (χ2(5) = 5.857, p = 0.320).
Figure 14.

The mean rank of the algorithms in Table 13 according to the Friedman test.
Figure 15.

The mean rank of the algorithms in Table 14 according to the Friedman test.
4.8. Sentiment Classification Using Proposed Methods in Comparison to ALGA
In this subsection, WJST and WJST1 are compared to ALGA [46]. Three datasets have been selected for evaluating the methods, and the results are shown in Figure 16, which compares the models according to the accuracy, perplexity, and topic_coherency metrics. In ALGA [46], several sentiment lexicons are created for a dataset during the training stage using a genetic algorithm and are then employed during testing. Each dictionary contains words and scores, and each chromosome in the genetic algorithm is represented as a vector of sentiment words and their scores, spread between a sentiment word's minimum and maximum scores. The primary goal of ALGA is to create a dictionary that reduces errors on the training datasets. The sum of scores for the words of each instance Ti in dataset Dm using dictionary Lk is calculated by equation (19) and treated as a feature [46]:
$$\mathrm{ALGA}(T_i, L_k) = \sum_{W_j \in T_i} v_k(W_j) \qquad (19)$$
Figure 16.

Sentiment classification on SANDERS, SOMD, and STS datasets, in terms of accuracy (a), perplexity (b), and topic_coherency (c).
The ALGA feature for each instance is calculated by finding the values of its words in the dictionaries (chromosomes) and adding them together. In (19), Wj represents the words of Ti, and vk(Wj) is the score of Wj in Lk. As mentioned in [46], ALGA predicts a positive instance when the ALGA feature is positive and a negative instance when it is negative; ALGA's accuracy is calculated by dividing the number of correct predictions on a given dataset by the total number of cases. The proposed methods are compared with ALGA [46] because it can also generate a sentiment dictionary automatically, although ALGA does so with a genetic algorithm while the proposed methods use topic modeling. In the proposed models, each document is classified based on P(s|r), the probability of a sentiment label given a document; two labels (+, −) are considered, and a document is classified as negative if P(+|r) < P(−|r) and vice versa. Evaluation results can be seen in Figure 16. In this subsection, the number of distinct windows is three, the number of topics is five, and the models use the AFINN sentiment dictionary. In Figures 16(a)–16(c), each column compares the different methods on one dataset; the details of the datasets used in this section are given in Table 4. Only accuracy is considered for the evaluation of ALGA, and the ALGA-SW value is obtained by executing ALGA without taking stopwords into account. According to the results, WJST has higher accuracy than TJST on all datasets, and WJST1 outperforms WJST and TJST on all datasets. ALGA and ALGA-SW perform better than the other methods in terms of accuracy, but WJST1 achieves higher accuracy than ALGA and ALGA-SW on the STS and Sanders datasets. The RND method achieves the lowest accuracy on the Sanders and STS datasets; the AFINN + RND method has higher accuracy than RND but lower accuracy than AFINN on these datasets, while RND outperforms TJST, AFINN, and AFINN + RND on the SOMD dataset. In this study, perplexity and topic_coherency are not calculated for the ALGA, ALGA-SW, AFINN, RND, and AFINN + RND methods. In Figure 16, -1 denotes accuracy1 and -2 denotes accuracy2.
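A minimal sketch of the ALGA feature in equation (19) is shown below; the dictionary L_k and the helper name are hypothetical, not ALGA's implementation:

```python
# Hypothetical evolved dictionary L_k mapping sentiment words to scores.
L_k = {"love": 3, "great": 2, "bad": -3, "worst": -4}

def alga_feature(instance_words, lexicon):
    """Equation (19): sum the lexicon scores of the instance's words."""
    return sum(lexicon.get(w, 0) for w in instance_words)

T_i = ["a", "great", "movie", "love", "it"]  # a hypothetical instance
score = alga_feature(T_i, L_k)               # 2 + 3 = 5
print(score, "positive" if score > 0 else "negative")
```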
4.9. Sentiment Classification Using Proposed Methods on Multidomain Datasets
In this subsection, the performance of the proposed methods is compared with baseline methods on a multidomain dataset. In this experiment, the number of distinct windows and topics is three and five, respectively, and the models use the AFINN sentiment dictionary. The multidomain dataset contains reviews taken from multiple domains (product types). The details of the multidomain dataset used in this section are illustrated in Table 22.
Table 22.
Description of the multidomain dataset used in this section.
| # | Domains | Number of reviews | Vocabulary size | Number of words |
|---|---|---|---|---|
| 1 | Android, Automotive, Electronic, and Movie | 1600 | 11183 | 100113 |
As shown in Figure 17, accuracy, perplexity, and topic_coherency values are calculated on a multidomain dataset that contains the Android, Automotive, Electronic, and Movie domains. The WJST-dictionary and WJST1-dictionary methods classify each document according to P(s|r) = Ns,r/Nr, where the word sentiment label is obtained directly from the WJST and WJST1 lexicons produced on the multidomain dataset. In Figure 17, -1 denotes accuracy1 and -2 denotes accuracy2. Based on the results, WJST has a lower perplexity value than the other methods, and WJST1 outperforms WJST in terms of topic_coherency. WJST performs better than WJST1 in terms of accuracy because the distributions θ, ξ, and ψ in WJST depend on the document, while in WJST1 they do not. The dependency among documents of one domain is stronger than among documents of different domains: a pattern in the documents of one domain may not exist in the documents of another. Therefore, calculations on multidomain datasets should be local rather than cover all domains. For example, consider the distributions P(z|s) and P(z|s, r), where z is the topic, s is the sentiment, and r is the document. In the first case, P(z|s), the topic depends only on the sentiment, and the distribution covers all documents across domains; a topic might be positive in one domain and negative in another. It is therefore better to condition the topic on the documents of one domain rather than all domains, limiting the topic to the document (and domain) and eliminating contradictions between domains. Consequently, WJST is suitable for multidomain datasets, and WJST1 is a version of WJST suitable for single-domain datasets. Sentiment classification on multidomain datasets is a challenge; our solution in this study is WJST, whose distributions (θ, ξ, and ψ) depend on the document, and further studies can investigate this problem in future research.
Figure 17.

Sentiment classification values are calculated on a multidomain dataset in terms of accuracy (a), perplexity (b), and topic_coherency (c).
4.10. Comparison with Other Methods
In this subsection, the best performance of the proposed methods is compared with 57 competitors [8–10, 13, 46, 76, 82–88], as shown in Table 23. The details of the datasets used in this section are given in Table 4.
Table 23.
Sentiment classification on MR, Sanders, SOMD, STS, Amazon, IMDB, and Yelp datasets.
| Method/dataset | MR | Sanders | SOMD | STS | Amazon | IMDB | Yelp |
|---|---|---|---|---|---|---|---|
| ALGA [46] | — | 0.8067 | 0.8147 | 0.7668 | — | — | — |
| ALGA-SW [46] | — | 0.7868 | 0.7877 | 0.7886 | — | — | — |
| BPSO (Shang et al., 2016) | — | — | — | — | 0.7439 | 0.79 | 0.789 |
| BICA (Mirhosseini et al., 2017) | — | — | — | — | 0.793 | 0.745 | 0.763 |
| BABC (Schiezaro et al., 2013) | — | — | — | — | 0.7509 | 0.74 | 0.736 |
| MaxEnt (Saif et al., 2014) | — | 0.8362 | — | 0.7782 | — | — | — |
| NB (Saif et al., 2014) | — | 0.8266 | — | 0.8106 | — | — | — |
| LS-all [13] | — | 0.8199 | — | — | — | — | — |
| SVM-all [13] | — | 0.8214 | — | — | — | — | — |
| RMTL [13] | — | 0.827875 | — | — | — | — | — |
| MTL-graph [13] | — | 0.801725 | — | — | — | — | — |
| CMSC [13] | — | 0.846325 | — | — | — | — | — |
| LSTM-all [13] | — | 0.8063 | — | — | — | — | — |
| MTL-CNN [13] | — | 0.829825 | — | — | — | — | — |
| MTL-DNN [13] | — | 0.817 | — | — | — | — | — |
| ASP-MTL [13] | — | 0.85125 | — | — | — | — | — |
| NeuroSent [13] | — | 0.834575 | — | — | — | — | — |
| DAM [13] | — | 0.863225 | — | — | — | — | — |
| SVM-BoW (Da Silva et al., 2014) | — | 0.8243 | 0.7402 | — | — | — | — |
| SVM-BoW + lex (Da Silva et al., 2014) | — | 0.8398 | 0.7893 | — | — | — | — |
| RF-BoW (Da Silva et al., 2014) | — | 0.7924 | 0.7391 | — | — | — | — |
| RF-BoW + lex (Da Silva et al., 2014) | — | 0.8235 | 0.7936 | — | — | — | — |
| LR-BoW (Da Silva et al., 2014) | — | 0.7745 | 0.7238 | — | — | — | — |
| LR-BoW + lex (Da Silva et al., 2014) | — | 0.7949 | 0.7806 | — | — | — | — |
| MNB-BoW (Da Silva et al., 2014) | — | 0.7982 | 0.7543 | — | — | — | — |
| MNB-BoW + lex (Da Silva et al., 2014) | — | 0.8341 | 0.8013 | — | — | — | — |
| ENS(LR + RF + MNB)-BoW (Da Silva et al., 2014) | — | 0.8276 | 0.7555 | — | — | — | — |
| ENS(LR + RF + MNB)-BoW + lex (Da Silva et al., 2014) | — | 0.8489 | 0.8035 | — | — | — | — |
| SVM-FH (Da Silva et al., 2014) | — | 0.4975 | 0.5131 | — | — | — | — |
| SVM-FH + lex (Da Silva et al., 2014) | — | 0.7500 | 0.6299 | — | — | — | — |
| RF-FH (Da Silva et al., 2014) | — | 0.5564 | 0.6136 | — | — | — | — |
| RF-FH + lex (Da Silva et al., 2014) | — | 0.7163 | 0.7260 | — | — | — | — |
| LR-FH (Da Silva et al., 2014) | — | 0.5694 | 0.6529 | — | — | — | — |
| LR-FH + lex (Da Silva et al., 2014) | — | 0.7598 | 0.7303 | — | — | — | — |
| MNB-FH (Da Silva et al., 2014) | — | 0.5425 | 0.6070 | — | — | — | — |
| MNB-FH + lex (Da Silva et al., 2014) | — | 0.7508 | 0.7139 | — | — | — | — |
| ENS(LR + RF + MNB)-FH (Da Silva et al., 2014) | — | 0.5784 | 0.6517 | — | — | — | — |
| ENS(LR + RF + MNB)-FH + lex (Da Silva et al., 2014) | — | 0.7663 | 0.7456 | — | — | — | — |
| WS-TSWE' [76] | 0.841 | — | — | — | — | — | — |
| WS-TSWE [76] | 0.824 | — | — | — | — | — | — |
| TSWE-P [76] | 0.726 | — | — | — | — | — | — |
| TSWE + P [76] | 0.782 | — | — | — | — | — | — |
| JSTH [76] | 0.681 | — | — | — | — | — | — |
| HTSM [76] | 0.796 | — | — | — | — | — | — |
| SAE (Pagliardini et al., 2018) | 0.861 | — | — | — | — | — | — |
| ParagraphVec DBOW (Pagliardini et al., 2018) | 0.763 | — | — | — | — | — | — |
| ParagraphVec DM (Pagliardini et al., 2018) | 0.764 | — | — | — | — | — | — |
| IST (Pu et al., 2019) | 0.827 | — | — | — | — | — | — |
| UST (Pu et al., 2019) | 0.832 | — | — | — | — | — | — |
| UIST (Pu et al., 2019) | 0.845 | — | — | — | — | — | — |
| RND | 0.4895 | 0.4852 | 0.4847 | 0.5346 | 0.491 | 0.498 | 0.506 |
| AFINN | 0.601 | 0.674 | 0.3395 | 0.734 | 0.731 | 0.698 | 0.689 |
| AFINN + RND | 0.5455 | 0.5424 | 0.4475 | 0.6038 | 0.574 | 0.575 | 0.559 |
| JST [8] | 0.613 | 0.6176 | 0.4934 | 0.6398 | 0.611 | 0.605 | 0.579 |
| TJST [8] | 0.62 | 0.5898 | 0.4639 | 0.6814 | 0.645 | 0.616 | 0.614 |
| RJST [12] | 0.51 | 0.6086 | 0.4967 | 0.6232 | 0.609 | 0.546 | 0.561 |
| TS [9] | 0.5 | 0.5612 | 0.5251 | 0.565 | 0.54 | 0.545 | 0.547 |
| WJST-1 | 0.821 | 0.7268 | 0.75786 | 0.7126 | 0.779 | 0.76 | 0.737 |
| WJST-2 | 0.818 | 0.7203 | 0.75775 | 0.7181 | 0.829 | 0.761 | 0.726 |
| WJST1-1 | 0.8445 | 0.832 | 0.8 | 0.8317 | 0.796 | 0.77 | 0.773 |
| WJST1-2 | 0.843 | 0.8352 | 0.7859 | 0.8373 | 0.798 | 0.774 | 0.769 |
4.11. Comparison with Discriminative Models
In the following experiment, the proposed methods are compared to discriminative baselines such as logistic regression and SVM on four datasets. The multidomain dataset contains the Android, Automotive, Electronic, and Movie domains. As shown in Table 24, the accuracy value is calculated on the four datasets, and the results demonstrate that the proposed methods improve notably over AFINN and the baselines on all of them. In Table 24, the discriminative methods use two representations in the preprocessing phase: Bag of Words (BOW) and Term Frequency-Inverse Document Frequency (TF-IDF). In the BOW representation, a higher word frequency reflects greater importance of the word, whereas TF-IDF assumes that high-frequency words may not carry much information and gives rarer words more weight. According to the evaluation results, the TF-IDF representation outperforms the BOW representation.
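For concreteness, the sketch below builds BOW and TF-IDF logistic regression baselines with scikit-learn on a few invented reviews; it illustrates the two representations rather than reproducing the paper's exact experimental setup:

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

texts = ["great app, works well", "terrible, waste of money",
         "love this tablet", "bad screen, poor battery"]
labels = [1, 0, 1, 0]  # 1 = positive, 0 = negative (hypothetical reviews)

for name, vectorizer in [("BOW", CountVectorizer()),
                         ("TF-IDF", TfidfVectorizer())]:
    model = make_pipeline(vectorizer, LogisticRegression())
    model.fit(texts, labels)
    print(name, model.score(texts, labels))  # training accuracy, illustrative
```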
Table 24.
Sentiment classification in comparison with discriminative models on different datasets.
| Method\Dataset | Android | Automotive | Electronic | Movie | |
|---|---|---|---|---|---|
| RND | 0.48 | 0.4925 | 0.465 | 0.525 | |
| AFINN | 0.6975 | 0.625 | 0.675 | 0.595 | |
| RND + AFINN | 0.58 | 0.535 | 0.52 | 0.555 | |
| LOGISTIC REGRESSION (BOW) | 0.53 | 0.52 | 0.5338 | 0.57 | |
| RANDOMFOREST (BOW) | 0.69 | 0.65 | 0.60 | 0.6575 | |
| SVM (BOW) | 0.49 | 0.51 | 0.54 | 0.5675 | |
| DECISIONTREE (BOW) | 0.71 | 0.70 | 0.56 | 0.665 | |
| NAIVE_BAYES (BOW) | 0.51 | 0.55 | 0.4712 | 0.555 | |
| KNEIGHBORS (N = 3, BOW) | 0.55 | 0.57 | 0.55 | 0.5725 | |
| KNEIGHBORS (N = 4, BOW) | 0.56 | 0.575 | 0.5915 | 0.585 | |
| KNEIGHBORS (N = 5, BOW) | 0.57 | 0.5725 | 0.55 | 0.59 | |
| KNEIGHBORS (N = 6, BOW) | 0.57 | 0.5675 | 0.56 | 0.61 | |
| LOGISTIC REGRESSION (TF-IDF) | 0.61 | 0.63 | 0.83 | 0.59 | |
| RANDOMFOREST (TF-IDF) | 0.575 | 0.6 | 0.7243 | 0.5225 | |
| SVM (TF-IDF) | 0.5825 | 0.59 | 0.80 | 0.54 | |
| DECISIONTREE (TF-IDF) | 0.55 | 0.57 | 0.7043 | 0.55 | |
| NAIVE_BAYES (TF-IDF) | 0.58 | 0.60 | 0.7945 | 0.58 | |
| KNEIGHBORS (N = 3, TF-IDF) | 0.65 | 0.63 | 0.53 | 0.7125 | |
| KNEIGHBORS (N = 4, TF-IDF) | 0.65 | 0.63 | 0.56 | 0.7125 | |
| KNEIGHBORS (N = 5, TF-IDF) | 0.65 | 0.63 | 0.51 | 0.7125 | |
| KNEIGHBORS (N = 6, TF-IDF) | 0.65 | 0.5725 | 0.53 | 0.7125 | |
| JST | 0.625 | 0.6575 | 0.7025 | 0.7575 | |
| ASUM | 0.613 | 0.6322 | 0.71 | 0.772 | |
| TJST | 0.765 | 0.7675 | 0.76 | 0.9475 | |
| RJST | 0.5825 | 0.615 | 0.5525 | 0.62 | |
| TS | 0.5425 | 0.5525 | 0.5475 | 0.5425 | |
| WJST | Accuracy1 | 0.795 | 0.755 | 0.8625 | 0.8475 |
| Accuracy2 | 0.7825 | 0.7475 | 0.875 | 0.8325 | |
| WJST1 | Accuracy1 | 0.865 | 0.8 | 0.8475 | 0.97 |
| Accuracy2 | 0.8525 | 0.795 | 0.855 | 0.9675 | |
4.12. Comparison with JST According to Extended Features
To prepare prior sentiment information, the sentiment label of a unigram is identified as follows: if the unigram appears in the sentiment lexicon, its polarity is set to the subjectivity given by the lexicon. The sentiment label of a bigram is decided as follows: if both words of the bigram have the same polarity, the bigram takes that polarity; if only one of the words is in the lexicon, the bigram's polarity equals that word's lexicon subjectivity; and if the first word is 'not,' the bigram's polarity is the opposite of the second word's polarity. The sentiment label of a trigram is decided analogously: if the words of the trigram share a polarity, the trigram takes it; if one of them is in the lexicon, the trigram's polarity equals that lexicon subjectivity; and if the first or second word is 'not,' the trigram's polarity is the opposite of the polarity of the second or third word, respectively. In the following experiment, the proposed methods are compared with JST on four single-domain datasets using extended features (bigrams and trigrams).
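The sketch below implements the bigram rule with a toy lexicon; the rule ordering (negation checked first) and the lexicon entries are our assumptions, and the trigram rule follows the same pattern:

```python
lexicon = {"good": +1, "nice": +1, "bad": -1}  # toy polarity lexicon

def bigram_polarity(w1, w2):
    p1, p2 = lexicon.get(w1), lexicon.get(w2)
    if w1 == "not" and p2 is not None:   # 'not' flips the second word's polarity
        return -p2
    if p1 is not None and p1 == p2:      # both words share the same polarity
        return p1
    return p1 if p1 is not None else p2  # otherwise use whichever word is known

print(bigram_polarity("not", "good"))   # -1
print(bigram_polarity("very", "bad"))   # -1
print(bigram_polarity("good", "nice"))  # +1
```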
As shown in Table 25, the accuracy value is calculated on the four datasets, and the experiment extends the features to bigrams and trigrams. According to the results, WJST1 outperforms WJST, and the proposed models outperform JST because the additional parameters appropriately influence the process of producing words in a review. The perplexity value varies across datasets because the dataset sizes differ. As the number of grams increases, perplexity increases because, in each document, the bigrams (and then trigrams) are added to the unigram data, enlarging the dataset. Increasing the number of grams generally improves accuracy, although in some cases it gets worse because higher-order grams (bigrams or trigrams) are sometimes meaningless.
Table 25.
Sentiment classification according to extended features (bigrams and trigrams) on different datasets (single-domain).
| Model | Gram | Metric\dataset | Android | Automotive | Electronic | Movie |
|---|---|---|---|---|---|---|
| JST | U ∗ | Accuracy | 0.625 | 0.6275 | 0.675 | 0.7575 |
| Perplexity | 19.6726 | 24.7154 | 25.227 | 27.3111 | ||
| Topic_Coh | −4.6026 | −1.486 | −1.5892 | −1.0123 | ||
| U + B∗ | Accuracy | 0.655 | 0.7025 | 0.75 | 0.835 | |
| Perplexity | 66.1882 | 86.6897 | 81.3375 | 98.9201 | ||
| Topic_Coh | −2.4595 | −1.0417 | −0.9823 | −0.4165 | ||
| U + B + T∗ | Accuracy | 0.6775 | 0.5825 | 0.7525 | 0.7275 | |
| Perplexity | 114.8812 | 158.7113 | 156.1383 | 190.8403 | ||
| Topic_Coh | −1.5776 | −2.3915 | −0.7030 | −0.1288 | ||
|
| ||||||
| WJST | U | Accuracy1 | 0.7925 | 0.755 | 0.8625 | 0.7225 |
| Accuracy2 | 0.7775 | 0.7475 | 0.875 | 0.71 | ||
| Perplexity | 16.7303 | 21.2008 | 20.5489 | 23.1145 | ||
| Topic_Coh | −0.5547 | −1.6542 | −2.0442 | −0.0751 | ||
| U + B | Accuracy1 | 0.6825 | 0.63 | 0.67 | 0.65 | |
| Accuracy2 | 0.68 | 0.625 | 0.6675 | 0.6475 | ||
| Perplexity | 36.5505 | 49.8584 | 48.2008 | 58.0838 | ||
| Topic_Coh | −1.3976 | −4.0214 | −1.2513 | −0.0423 | ||
| U + B + T | Accuracy1 | 0.7325 | 0.6825 | 0.635 | 0.6475 | |
| Accuracy2 | 0.7325 | 0.6825 | 0.615 | 0.6325 | ||
| Perplexity | 52.9205 | 72.7629 | 76.2545 | 88.1860 | ||
| Topic_Coh | −1.3242 | −1.5525 | −1.2409 | −2.5170 | ||
|
| ||||||
| WJST1 | U | Accuracy1 | 0.81 | 0.80 | 0.79 | 0.97 |
| Accuracy2 | 0.7925 | 0.79 | 0.80 | 0.9625 | ||
| Perplexity | 16.6787 | 21.4357 | 22.6012 | 24.6302 | ||
| Topic_Coh | −0.187 | −1.1883 | −1.1683 | −0.1348 | ||
| U + B | Accuracy1 | 0.7 | 0.7625 | 0.8575 | 0.935 | |
| Accuracy2 | 0.7 | 0.7625 | 0.8525 | 0.935 | ||
| Perplexity | 38.5603 | 49.5167 | 54.7041 | 62.3459 | ||
| Topic_Coh | −2.0962 | −1.8607 | −0.9744 | −0.0617 | ||
| U + B + T | Accuracy1 | 0.83 | 0.7625 | 0.745 | 0.9475 | |
| Accuracy2 | 0.8275 | 0.7675 | 0.745 | 0.9475 | ||
| Perplexity | 55.5819 | 75.3624 | 77.6660 | 96.3675 | ||
| Topic_Coh | −1.3837 | −0.1840 | −0.9334 | −0.0333 | ||
U∗: unigram. B∗: bigram. T∗: trigram.
4.13. Discussions on the Limitations of the Proposed Methods
Although the analysis of the evaluation results demonstrates the strong performance of the proposed methods, they have some limitations, as follows:
The first limitation is the time complexity of the proposed methods (O(G·wALL·|S|·|Z|·|Q|·|E|)), which is higher than that of the baseline methods (O(G·wALL·|S|·|Z|)), according to Table 3 in Section 3.3.
The second limitation is the window size. As reported in Table 13, increasing the size of a window decreases the accuracy because it increases the number of words in the window, and each term may not affect all neighbors in its window. Therefore, finding the words that can be involved in a window is a new challenge that we introduce in this study, and two methods are presented in Section 4.7: the first assumes that each word affects all neighbors in its window, while the second assumes that each word affects a random subset of its neighbors. According to the results, the second method is more stable in terms of accuracy and outperforms the first in terms of perplexity, but the first method performs better in terms of accuracy and topic_coherency.
4.14. A Concise Description of the Proposed Solutions and the Results
The main problem in this study is to examine a user's opinion about a subject such as a product or a movie, that is, to identify whether the user holds a positive or negative view of it.
Two novel models based on topic modeling have been proposed to solve this problem. The proposed models extend and improve JST (as a topic model) through two new parameters and, in particular, consider the effect of words on each other. According to the evaluation results, the new parameters have an immense effect on model accuracy: the proposed models outperform JST because the additional parameters appropriately influence the process of producing words in a review, improving sentiment classification at the document level. The evaluation results also show that the proposed methods are more accurate than discriminative models such as SVM and logistic regression, and they are more flexible because additional information, such as the top 10 words of each topic, can be extracted directly from the data.
5. Conclusion
In this study, two new models called WJST and WJST1 have been presented that extend JST and improve the accuracy metrics. A review of the various articles on sentiment analysis indicates that the proposed models are innovative and lead to remarkable results compared to the baseline methods. The proposed models can generate a sentiment dictionary. According to the evaluation results, the proposed models consider the effect of words on each other through the extra parameters, which are important and influential, and their accuracy improves on baseline methods such as JST, RJST, TS, TJST, and ALGA. The results show that the proposed methods perform better across different topic number settings, and WJST1 outperforms the other methods in terms of accuracy, demonstrating its effectiveness. Prior sentiment information affects perplexity and topic_coherency less than accuracy.
According to the evaluation results, using the AFINN dictionary as prior sentiment information is more effective than the NO_AFINN setting. ALGA uses a genetic algorithm to generate a sentiment dictionary, whereas the proposed methods use topic modeling to generate this dictionary. The proposed models outperform JST because the additional parameters appropriately influence the process of producing words in a review, increasing emotion detection accuracy at the document level. The proposed methods are unsupervised, requiring no labeled data; they can automatically assess web comments and categorize reviews as positive or negative. They aim to increase accuracy with fewer parameters and greater simplicity than existing methods. The proposed methods both analyze emotions at the document level and create a sentiment dictionary; they are the first to create such a dictionary through a topic modeling technique in an automatic and accurate way, and the first to model the effect of words on each other in a dynamic, weighted way. They are also parametric.
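To make the dictionary-generation claim concrete, one simple way to read a lexicon out of a fitted model is to assign each word to the sentiment label under which it has the highest estimated probability. The sketch below assumes such a word-sentiment distribution; the weights are invented for illustration and are not the paper's learned parameters:

```python
def build_sentiment_dictionary(phi):
    """Assign each word to its argmax sentiment label.

    phi: dict mapping word -> dict(label -> estimated P(word | label));
    the numbers below are invented for illustration only.
    """
    return {word: max(dist, key=dist.get) for word, dist in phi.items()}

phi = {
    "excellent": {"positive": 0.031, "negative": 0.002},
    "terrible":  {"positive": 0.001, "negative": 0.027},
    "battery":   {"positive": 0.012, "negative": 0.011},
}
print(build_sentiment_dictionary(phi))
# {'excellent': 'positive', 'terrible': 'negative', 'battery': 'positive'}
```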
6. Future Work
The proposed models in the present study are parametric, and further studies will investigate nonparametric variants. Sentiment classification on multidomain datasets remains a challenge and can be investigated in future research. The proposed methods can also be evaluated on more datasets, and more parameters can be assessed. Twitter data has received significant attention in natural language processing studies but poses particular conditions, such as short text length; in future work, the proposed methods can be adapted to analyze emotions in Twitter data.
Data Availability
The datasets used during the current study are available from the corresponding author (a.osmani@qiau.ac.ir) upon reasonable request.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
References
- 1. Peikari N., Yaghobi A., Taheri R. Analysis of emotions on the Twitter social network with the text-mining technique. International Conference on Web Research; 2015; Iran.
- 2. Zhang L., Liu B. Sentiment analysis and opinion mining. In: Sammut C., Webb G. I., editors. Encyclopedia of Machine Learning and Data Mining. Chicago, IL: Springer; 2017.
- 3. Pang B., Lee L. A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics; July 2004; Barcelona, Spain. ACL; pp. 271–278.
- 4. Pang B., Lee L. Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval. 2008;2(1–2):1–135. doi: 10.1561/1500000011.
- 5. Medhat W., Hassan A., Korashy H. Sentiment analysis algorithms and applications: a survey. Ain Shams Engineering Journal. 2014;5(4):1093–1113. doi: 10.1016/j.asej.2014.04.011.
- 6. Chen Z., Liu B. Lifelong Machine Learning. San Rafael, CA, USA: Morgan & Claypool Publishers; 2018.
- 7. Griffiths T. L., Steyvers M. Finding scientific topics. Proceedings of the National Academy of Sciences. 2004;101(suppl. 1):5228–5235. doi: 10.1073/pnas.0307752101.
- 8. Lin C., He Y. Joint sentiment/topic model for sentiment analysis. Proceedings of the 18th ACM Conference on Information and Knowledge Management; November 2009; New York, NY, USA. ACM; pp. 375–384.
- 9. Dermouche M., Kouas L., Velcin J., Loudcher S. A joint model for topic-sentiment modeling from text. Proceedings of the 30th Annual ACM Symposium on Applied Computing; April 2015; New York, NY, USA. ACM; pp. 819–824.
- 10. Lin C., He Y., Everson R., Ruger S. Weakly supervised joint sentiment-topic detection from text. IEEE Transactions on Knowledge and Data Engineering. 2012;24:1134–1145. doi: 10.1109/tkde.2011.48.
- 11. Liu B. Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies. 2012;5(1):1–167. doi: 10.2200/s00416ed1v01y201204hlt016.
- 12. Liu B., Zhang L. A survey of opinion mining and sentiment analysis. In: Aggarwal C., Zhai C., editors. Mining Text Data. Boston, MA: Springer; 2012.
- 13. Yuan Z., Wu S., Wu F., Liu J., Huang Y. Domain attention model for multi-domain sentiment classification. Knowledge-Based Systems. 2018;155:1–10. doi: 10.1016/j.knosys.2018.05.004.
- 14. Blei D. M., Ng A. Y., Jordan M. I. Latent Dirichlet allocation. Journal of Machine Learning Research. 2003;3:993–1022.
- 15. Li X., Zhang A., Li C., Guo L., Wang W., Ouyang J. Relational biterm topic model: short-text topic modeling using word embeddings. The Computer Journal. 2018;62(3):359–372. doi: 10.1093/comjnl/bxy037.
- 16. Yang M., Chen X., Tu W., Lu Z., Zhu J., Qu Q. A topic drift model for authorship attribution. Neurocomputing. 2018;273:133–140. doi: 10.1016/j.neucom.2017.08.022.
- 17. Chien J.-T., Lee C.-H., Tan Z.-H. Latent Dirichlet mixture model. Neurocomputing. 2018;278:12–22. doi: 10.1016/j.neucom.2017.08.029.
- 18. Osmani A., Mohasefi J. B., Gharehchopogh F. S. Enriched latent Dirichlet allocation for sentiment analysis. Expert Systems. 2020;37:e12527. doi: 10.1111/exsy.12527.
- 19. Lin C., Ibeke E., Wyner A., Guerin F. Sentiment-topic modeling in text mining. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery. 2015;5(5):246–254. doi: 10.1002/widm.1161.
- 20. Jo Y., Oh A. H. Aspect and sentiment unification model for online review analysis. Proceedings of the Fourth ACM International Conference on Web Search and Data Mining; February 2011; Hong Kong, China. ACM; pp. 815–824.
- 21. Wahid J. A., Shi L., Gao Y., et al. Topic2features: a novel framework to classify noisy and sparse textual data using LDA topic distributions. PeerJ Computer Science. 2021;7:e677. doi: 10.7717/peerj-cs.677.
- 22. Zhou Y., Liao L., Gao Y., Wang R., Huang H. TopicBERT: a topic-enhanced neural language model fine-tuned for sentiment classification. IEEE Transactions on Neural Networks and Learning Systems. 2021.
- 23. Abdi S., Bagherzadeh J., Gholami G., Tajbakhsh M. S. Using an auxiliary dataset to improve emotion estimation in users' opinions. Journal of Intelligent Information Systems. 2021;56(3):581–603. doi: 10.1007/s10844-021-00643-y.
- 24. Alam M. H., Ryu W.-J., Lee S. Joint multi-grain topic sentiment: modeling semantic aspects for online reviews. Information Sciences. 2016;339:206–223. doi: 10.1016/j.ins.2016.01.013.
- 25. Hua T., Lu C.-T., Choo J., Reddy C. K. Probabilistic topic modeling for comparative analysis of document collections. ACM Transactions on Knowledge Discovery from Data. 2020;14(2):1–27. doi: 10.1145/3369873.
- 26. Rao Y., Li Q., Mao X., Wenyin L. Sentiment topic models for social emotion mining. Information Sciences. 2014;266:90–100. doi: 10.1016/j.ins.2013.12.059.
- 27. Mahadevan A., Arock M. Integrated topic modeling and sentiment analysis: a review rating prediction approach for recommender systems. Turkish Journal of Electrical Engineering and Computer Sciences. 2020;28(1):107–123. doi: 10.3906/elk-1905-114.
- 28. Xia Y., Tang G., Zhao H., Cambria E., Zheng T.-F. Using word sense as a latent variable in LDA can improve topic modeling. Proceedings of the 6th International Conference on Agents and Artificial Intelligence; January 2014; Loire Valley, France. Science and Technology Publications; pp. 532–537.
- 29. Meng Y., Speier W. F., Ong M., Arnold C. HCET: hierarchical clinical embedding with topic modeling on electronic health records for predicting depression. IEEE Journal of Biomedical and Health Informatics. 2020;25(4):1–8. doi: 10.1109/JBHI.2020.3004072.
- 30. Qiang J., Qian Z., Li Y., Yuan Y., Wu X. Short text topic modeling techniques, applications, and performance: a survey. IEEE Transactions on Knowledge and Data Engineering. 2020;34:1–19.
- 31. Yang Q., Rao Y., Xie H., Wang J., Wang F. L., Chan W. H. Segment-level joint topic-sentiment model for online review analysis. IEEE Intelligent Systems. 2019;34(1):43–50. doi: 10.1109/mis.2019.2899142.
- 32. Vayansky I., Kumar S. A. P. A review of topic modeling methods. Information Systems. 2020;94:1–32. doi: 10.1016/j.is.2020.101582.
- 33. Bigne E., Ruiz C., Cuenca A., Perez C., Garcia A. What drives the helpfulness of online reviews? A deep learning study of sentiment analysis, pictorial content and reviewer expertise for mature destinations. Journal of Destination Marketing & Management. 2021;20:100570. doi: 10.1016/j.jdmm.2021.100570.
- 34. Safder Q., Mahmood Z., Sarwar R., et al. Sentiment analysis for Urdu online reviews using deep learning models. Expert Systems. 2021. doi: 10.1111/exsy.12751.
- 35. Pathak A. R., Pandey M., Rautaray S. Topic-level sentiment analysis of social media data using deep learning. Applied Soft Computing. 2021;108:107440. doi: 10.1016/j.asoc.2021.107440.
- 36. Feng S., Zhou H., Dong H. Using deep neural network with small dataset to predict material defects. Materials & Design. 2019;162:300–310. doi: 10.1016/j.matdes.2018.11.060.
- 37. Khezri S., Faez K., Osmani A. Modified discrete binary PSO based sensor placement in WSN networks. Proceedings of the 2010 International Conference on Computational Intelligence and Communication Networks; November 2010; Bhopal, India. pp. 200–204.
- 38. Khezri S., Meybodi M. R., Osmani A. Fuzzy adaptive PBIL based sensor placement in wireless sensor networks. Proceedings of the 2011 International Symposium on Computer Networks and Distributed Systems (CNDS); February 2011; Tehran, Iran. pp. 216–221.
- 39. Khezri S., Faez K., Osmani A. An intelligent sensor placement method to reach a high coverage in wireless sensor networks. International Journal of Grid and High Performance Computing. 2011;3(3):54–68. doi: 10.4018/jghpc.2011070105.
- 40. Gharehchopogh F. S., Gholizadeh H. A comprehensive survey: whale optimization algorithm and its applications. Swarm and Evolutionary Computation. 2019;48:1–24. doi: 10.1016/j.swevo.2019.03.004.
- 41. Abdollahzadeh B., Gharehchopogh F. S., Mirjalili S. African vultures optimization algorithm: a new nature-inspired metaheuristic algorithm for global optimization problems. Computers & Industrial Engineering. 2021;158:107408. doi: 10.1016/j.cie.2021.107408.
- 42. Abdollahzadeh B., Soleimanian Gharehchopogh F., Mirjalili S. Artificial gorilla troops optimizer: a new nature-inspired metaheuristic algorithm for global optimization problems. International Journal of Intelligent Systems. 2021;36(10):5887–5958. doi: 10.1002/int.22535.
- 43. Asghari K., Masdari M., Gharehchopogh F. S., Saneifard R. Multi-swarm and chaotic whale-particle swarm optimization algorithm with a selection method based on roulette wheel. Expert Systems. 2021;38(8). doi: 10.1111/exsy.12779.
- 44. Asgarnezhad R., Monadjemi S. A., Soltanaghaei M. An application of MOGW optimization for feature selection in text classification. The Journal of Supercomputing. 2021;77(6):5806–5839. doi: 10.1007/s11227-020-03490-w.
- 45. Abasi A. K., Khader A. T., Al-Betar M. A., Naim S., Makhadmeh S. N., Alyasseri Z. A. A. An improved text feature selection for clustering using binary grey wolf optimizer. In: Zain Z., editor. Proceedings of the 11th National Technical Seminar on Unmanned System Technology 2019. Lecture Notes in Electrical Engineering, vol. 666. Singapore: Springer; 2021.
- 46. Osmani A., Mohasefi J. B., Gharehchopogh F. S. Sentiment classification using two effective optimization methods derived from the artificial bee colony optimization and imperialist competitive algorithm. The Computer Journal. 2022;65(1):18–66. doi: 10.1093/comjnl/bxz163.
- 47. Tang M., Jin J., Liu Y., Li C., Zhang W. Integrating topic, sentiment, and syntax for modeling online reviews: a topic model approach. Journal of Computing and Information Science in Engineering. 2018;19(1):1–12. doi: 10.1115/1.4041475.
- 48. Gong L., Wang H. When sentiment analysis meets social network: a holistic user behavior modeling in opinionated data. Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '18); July 2018; London, United Kingdom. ACM; pp. 1455–1464.
- 49. Cheng K., Li J., Tang J., Liu H. Unsupervised sentiment analysis with signed social networks. Proceedings of the AAAI Conference on Artificial Intelligence; 2017; San Francisco, CA, USA. AAAI Press; pp. 3429–3435.
- 50. Iqbal M., Karim A., Kamiran F. Balancing prediction errors for robust sentiment classification. ACM Transactions on Knowledge Discovery from Data. 2019;13(3):1–21. doi: 10.1145/3328795.
- 51. Ibrahim Hussein M., Kang X., Feng L., Ibrahim Z., Waheed Ahmed A., Guilin Q. A cross-lingual sentiment topic model evolution over time. Intelligent Data Analysis. 2020;24(2):253–266.
- 52. Kumar S., Singh R., Khan M. Z., Noorwali A. Design of adaptive ensemble classifier for online sentiment analysis and opinion mining. PeerJ Computer Science. 2021;7:e660. doi: 10.7717/peerj-cs.660.
- 53. Keshavarz H., Abadeh M. S. ALGA: adaptive lexicon learning using genetic algorithm for sentiment analysis of microblogs. Knowledge-Based Systems. 2017;122:1–16. doi: 10.1016/j.knosys.2017.01.028.
- 54. Kalarani P., Selva Brunda S. Sentiment analysis by POS and joint sentiment topic features using SVM and ANN. Soft Computing. 2019;23(16):7067–7079. doi: 10.1007/s00500-018-3349-9.
- 55. Farkhod A., Abdusalomov A., Makhmudov F., Cho Y. I. LDA-based topic modeling sentiment analysis using topic/document/sentence (TDS) model. Applied Sciences. 2021;11(23):11091. doi: 10.3390/app112311091.
- 56. Fatemi M., Safayani M. Joint sentiment/topic modeling on text data using a boosted restricted Boltzmann machine. Multimedia Tools and Applications. 2019;78(15):20637–20653. doi: 10.1007/s11042-019-7427-5.
- 57. Liang Q., Ranganathan S., Wang K., Deng X. JST-RR model: joint modeling of ratings and reviews in sentiment-topic prediction. 2021. https://arxiv.org/abs/2102.11048.
- 58. Pathik N., Shukla P. An efficient sentiment analysis using topic model based optimized recurrent neural network. International Journal on Smart Sensing and Intelligent Systems. 2021;14(1):1–12. doi: 10.21307/ijssis-2021-011.
- 59. Sengupta A., Paka W. S., Roy S., Ranjan G., Chakraborty T. An embedding-based joint sentiment-topic model for short texts. Proceedings of the International AAAI Conference on Web and Social Media; March 2021; Seattle, USA. pp. 633–643.
- 60. Chen L., Mankad S. A structural topic sentiment model for text analysis. 2022. https://ssrn.com/abstract.
- 61. Huang F., Li X., Yuan C., Zhang S., Zhang J., Qiao S. Attention-emotion-enhanced convolutional LSTM for sentiment analysis. IEEE Transactions on Neural Networks and Learning Systems. 2021. doi: 10.1109/tnnls.2021.3056664.
- 62. Özyurt B., Akcayol M. A. A new topic modeling based approach for aspect extraction in aspect based sentiment analysis: SS-LDA. Expert Systems with Applications. 2021;168:114231. doi: 10.1016/j.eswa.2020.114231.
- 63. Zhao W., Guan Z., Chen L., et al. Weakly-supervised deep embedding for product review sentiment analysis. IEEE Transactions on Knowledge and Data Engineering. 2017;30(1):185–197.
- 64. Rao G., Huang W., Feng Z., Cong Q. LSTM with sentence representations for document-level sentiment classification. Neurocomputing. 2018;308:49–57. doi: 10.1016/j.neucom.2018.04.045.
- 65. Naseem U., Razzak I., Musial K., Imran M. Transformer based deep intelligent contextual embedding for Twitter sentiment analysis. Future Generation Computer Systems. 2020;113:58–69. doi: 10.1016/j.future.2020.06.050.
- 66. Basiri M. E., Nemati S., Abdar M., Cambria E., Acharya U. R. ABCDM: an attention-based bidirectional CNN-RNN deep model for sentiment analysis. Future Generation Computer Systems. 2021;115:279–294. doi: 10.1016/j.future.2020.08.005.
- 67. Tang F., Fu L., Yao B., Xu W. Aspect based fine-grained sentiment analysis for online reviews. Information Sciences. 2019;488:190–204. doi: 10.1016/j.ins.2019.02.064.
- 68. Yang Z., Kotov A., Mohan A., Lu S. Parametric and non-parametric user-aware sentiment topic models. Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval; August 2015; Santiago, Chile. ACM; pp. 413–422.
- 69. Nielsen F. Å. A new ANEW: evaluation of a word list for sentiment analysis in microblogs. Proceedings of the ESWC2011 Workshop on "Making Sense of Microposts": Big Things Come in Small Packages; May 2011; Heraklion, Crete. pp. 93–98.
- 70. Blitzer J., Dredze M., Pereira F. Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics; June 2007; Prague, Czech Republic. pp. 440–447.
- 71. Kotzias D., Denil M., de Freitas N., Smyth P. From group to individual labels using deep features. Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; August 2015; Sydney, NSW, Australia. pp. 597–606.
- 72. Proellochs N., Feuerriegel S., Neumann D. Statistical inferences for polarity identification in natural language. PLoS One. 2018;13(12):e0209323. doi: 10.1371/journal.pone.0209323.
- 73. Hu M., Liu B. Mining and summarizing customer reviews. Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; August 2004; Seattle, WA, USA. ACM; pp. 168–177.
- 74. Liu B., Hu M., Cheng J. Opinion observer: analyzing and comparing opinions on the web. Proceedings of the 14th International Conference on World Wide Web; May 2005; Chiba, Japan. ACM; pp. 342–351.
- 75. Moghaddam S., Ester M. ILDA: interdependent LDA model for learning latent aspects and their ratings from online product reviews. Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval; 2011; Beijing, China. pp. 665–674.
- 76. Bao Y., Datta A. Simultaneously discovering and quantifying risk types from textual risk disclosures. Management Science. 2014;60(6):1371–1391. doi: 10.1287/mnsc.2014.1930.
- 77. Shams M., Baraani-Dastjerdi A. Enriched LDA (ELDA): combination of latent Dirichlet allocation with word co-occurrence analysis for aspect extraction. Expert Systems with Applications. 2017;80:136–146. doi: 10.1016/j.eswa.2017.02.038.
- 78. Mimno D., Wallach H. M., Talley E., Leenders M., McCallum A. Optimizing semantic coherence in topic models. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP); July 2011; Edinburgh, United Kingdom. pp. 262–272.
- 79. Friedman M. The use of ranks to avoid the assumption of normality implicit in the analysis of variance. Journal of the American Statistical Association. 1937;32:674–701. doi: 10.1080/01621459.1937.10503522.
- 80. Friedman M. A comparison of alternative tests of significance for the problem of m rankings. The Annals of Mathematical Statistics. 1940;11(1):86–92. doi: 10.1214/aoms/1177731944.
- 81. Fu X., Sun X., Wu H., Cui L., Huang J. Z. Weakly supervised topic sentiment joint model with word embeddings. Knowledge-Based Systems. 2018;147:43–54. doi: 10.1016/j.knosys.2018.02.012.
- 82. Pagliardini M., Gupta P., Jaggi M. Unsupervised learning of sentence embeddings using compositional n-gram features. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers); 2018; New Orleans, Louisiana. Association for Computational Linguistics; pp. 528–540.
- 83. Pu X., Wu G., Yuan C. User-aware topic modeling of online reviews. Multimedia Systems. 2019;25:59–69.
- 84. Da Silva N. F. F., Hruschka E. R., Hruschka E. R. Tweet sentiment analysis with classifier ensembles. Decision Support Systems. 2014;66:170–179.
- 85. Saif H., He Y., Fernandez M., Alani H. Semantic patterns for sentiment analysis of Twitter. In: The Semantic Web – ISWC 2014. Lecture Notes in Computer Science, vol. 8797. Cham, Switzerland: Springer; 2014.
- 86. Mirhosseini M., Nezamabadi-pour H. BICA: a binary imperialist competitive algorithm and its application in CBIR systems. International Journal of Machine Learning and Cybernetics. 2017;28(1–15).
- 87. Shang L., Zhou Z., Liu X. Particle swarm optimization-based feature selection in sentiment classification. Soft Computing. 2016;20:3821–3834.
- 88. Schiezaro M., Pedrini H. Data feature selection based on artificial bee colony algorithm. EURASIP Journal on Image and Video Processing. 2013;47(1–8).