SN Computer Science. 2022 Aug 6;3(6):422. doi: 10.1007/s42979-022-01305-8

Sentiment Thesaurus, Synset and Word2Vec Based Improvement in Bigram Model for Classifying Product Reviews

S. Poomagal, B. Malar, E. M. Ranganayaki, K. Deepika, G. Dheepak
PMCID: PMC9362630  PMID: 35965951

Abstract

Classifying product reviews is one of the tasks in Natural Language Processing by which the sentiment of the reviewer towards a product can be identified. This identification is useful for the growth of a business, since improving product quality increases the number of satisfied customers. Bigram models are popular for this classification since they consider the occurrence of two consecutive words in the reviews. Existing works on bigram models do not consider words semantically similar to the words present in the bigrams. As reviewers use different words with the same meaning to express their feelings, we propose improved bigram models in which words semantically similar to the words in the bigrams are also used for classifying the reviews. In the proposed models, a sentiment polarity thesaurus is constructed from sentiment words and their synonyms, and combinations of this thesaurus, Synset and Word2Vec are used to extract synonyms for the words in the reviews. The performance of the proposed models is compared with the traditional bigram model and state-of-the-art methods. The results show that our models achieve better performance than the traditional model and recent methods.

Keywords: Classification, Synset, Word2Vec, Bigram, Natural Language Processing, Unigram

Introduction

Sentiment analysis or opinion mining of product reviews understands the sentiment of users for a particular product. This sentiment can be positive or negative based on the quality and features of the product and each review can be assigned to one of these two classes according to the words present in it. This assignment task can be done with the help of classification algorithms and the bigram model.

When using classification algorithms, the features in the reviews are represented in a vector space model with a weight assigned to each feature, and each feature's occurrence is considered independently of the surrounding features. This limitation is overcome by the n-gram model, in which the classifier is built from the occurrence of n consecutive features in a review; among n-gram models, bigrams usually give better results for sentiment analysis. The bigram model can be further improved by utilizing features semantically similar to the existing ones, since different reviewers use a variety of similar words to express their sentiments; this variety can be captured by also including synonyms when classifying the reviews. For including synonyms in the bigrams, a thesaurus containing words and their synonyms can be constructed. Since constructing a thesaurus for the entire collection of English words is impractical, only positive and negative sentiment words are considered; to improve performance further, synonyms for non-sentiment words can also be included using Synset and Word2Vec.

This has motivated us to improve the bigram model by including semantically similar words using combinations of a thesaurus, Synset and Word2Vec. In the proposed thesaurus-based method, a thesaurus of positive and negative polarity words is formed and used to find similar words for the sentiment-related words in the reviews; for the remaining non-sentiment words, a combination of the Synset and Word2Vec dictionaries is used. In the proposed Synset- and Word2Vec-based models, semantically similar words are extracted for both sentiment and non-sentiment words from Synset and Word2Vec. Also, when finding the probability of a test review, instead of considering only the original bigram, the probabilities of the combinations of similar words represented by each word in the bigrams are taken, and the class is assigned based on the cumulative probability of those combinations.

The objectives of the proposed method are threefold:

  1. It proposes new improved bigram models for classifying reviews as positive or negative.

  2. It improves the classification performance by including synonyms of words in the reviews from sentiment thesaurus, NLTK’s Synset and Google’s Word2Vec.

  3. Experiments are conducted on different datasets and results are analyzed and reported.

This paper is organized as follows: “Related Work” discusses the existing works on the bigram model. “Problem Statement” states the problem. “Proposed Method” explains the steps in the proposed method. “Experimental Results” states the results produced and “Conclusion” concludes the paper.

Related Work

Product review classification finds the feelings of reviewers by assigning each review a class, positive or negative. The extracted information indicates the quality of the product and can be used to improve it. This analysis can also be used to automate assigning the polarity of future reviews and to suggest the same kind of products to users.

Much research is going on in the area of sentiment classification. Pang et al. [1] utilized the bag-of-words model for finding the sentiment class of text content; the class can be positive or negative based on the terms present in the text, and various machine learning algorithms and n-grams are used for the classification. Salvetti et al. [2] applied the machine learning algorithms NB and the Markov model using hypernyms from Wordnet and Part of Speech (PoS) tagging, and concluded that PoS gave better results than Wordnet.

The NB model is used for sentiment classification in [3]. In this work, pairs of derived features that can be combined are extracted, and additional derived features are added to improve the accuracy of the result. Sahu and Ahuja [4] proposed a technique with polarity-based feature selection; once the features are selected, various classification algorithms are applied, and they concluded that the random forest algorithm gave better results than the other classification algorithms.

Bodapati et al. [5] performed sentence-based sequence modeling and NN-based classification using LSTM. They converted the words into vectors using the Skipgram model of Word2Vec, compared their work with basic classification algorithms, and concluded that LSTM performed better than the other techniques. Tripathy et al. [6] compared various machine learning algorithms using unigram, bigram and trigram with various feature selection schemes such as term occurrence, binary term occurrence, term frequency and TF-IDF. In the case of NB, they concluded that bigram performs better than unigram and trigram.

Vinodhini and Chandrasekaran [7] proposed a system consisting of three models, Unigram, Unigram + Bigram and Unigram + Bigram + Trigram, and found that the third model outperformed the others when classified using a hybrid combination of PNN and PCA. Bakliwal et al. [8] proposed a model combining Unigram, Bigram and Trigram representations together with POS tagging, and found that the combination of all three representations along with the POS tag performed best when classified using MLP.

Acosta et al. [9] compared two training algorithms using CBOW (Continuous Bag of Words) and SG (SkipGram). The models were tested using both hierarchical SoftMax and negative sampling, and the SG-based training model outperformed the CBOW-based model. Bansal and Srivastava [10] converted every document to a weighted combination of word vectors using the TFIDF scheme and discovered that CBOW outperformed SG across various machine learning algorithms, especially Random Forest. Mohammed and Fatima [11] analyzed and compared Unigram, Unigram-POS, Bigram and SkipGram, and found that the combination Unigram-POS + SkipGram performed better in both MNB and SVM classification.

Awachate et al. [12] introduced a model combining features selected using Chi-square with the list of words formed by intersecting the words in the corpora with the sentiment word list. The proposed model showed better performance than the conventional representation when classified using Kernlab SVM. Hameed et al. [13] used a BoW representation in which feature selection is performed by information gain using the Chi-square method, and discovered that a combination of Unigram + Bigram with this representation gave better results.

Aspect-based sentiment analysis of movie reviews is done by Thet et al. [14]. They consider that a movie review includes independent clauses expressing different sentiments toward various aspects of a movie, and use a language-based approach to find the sentiment of the clauses from sentiment scores calculated for each word based on the grammatical dependency structure of the clause. They used sentiment scores of 32,000 words taken from SentiWordNet. Singh et al. [15] used the machine learning classifiers Naïve Bayes, J48, BFTree and OneR for the optimization of sentiment analysis.

The performance of SVM, Maximum Entropy and scoring methods for classifying movie reviews was compared by Tsutsumi et al. [16]. They considered 1400 reviews, half positive and half negative, and showed that SVM performs better than the other two techniques.

A Word2Vec Convolutional Neural Network (CNN) with random, Skipgram and CBOW models is used in [17] to classify news articles and tweets. The authors showed that the skipgram model performs better for tweets and the CBOW model works well for news articles.

Sasikala and Mary [18] proposed a technique to perform sentiment analysis of product reviews using a Deep Learning Modified Neural Network and an Improved Adaptive Neuro-Fuzzy Inference System. The first model classifies a review as positive or negative; the second predicts the demand for a product in the market.

SVM-based product review classification for Indonesian reviews is done with the Word2Vec model as features by Fauzi [19]. He presented results using binary TF, raw TF and TFIDF in terms of accuracy.

Poomagal et al. [20] proposed a new Tag_score model for tweet clustering using improved K-means. They used synonyms to reduce the number of words in the collection and the sentiment score of each word to compute the initial centroids for the K-means algorithm.

News text data classification is performed by Zhang [21] using a new intelligent model constructed for cross-domain text sentiment classification; a domain-invariant dictionary is constructed to combine two different methods.

Fei et al. [22] combined improved cross-entropy loss function with the CNN model and LSTM network for cross-domain sentiment classification. They analyzed the influence of each word in improving classification performance.

Souma et al. [23] classified news articles based on stock price returns. They utilized a deep learning algorithm for the classification, with word vectors created from Wikipedia and Gigaword by GloVe.

Chandra and Krishna [24] used a deep learning model for analyzing the sentiments of tweets during the rise of COVID-19 in India. They concluded that most of the tweets are optimistic. Miranda et al. [25] have analyzed people’s sentiment on the Indonesian general election using Sentiwordnet and Naïve Bayes algorithm.

Problem Statement

Given a set of labeled product reviews, the proposed models aim to assign a sentiment to each review with a high accuracy rate. This is performed by first constructing bigrams of semantically similar words for the words of each test review using the sentiment thesaurus, Synset and Word2Vec, and then classifying the reviews as positive or negative based on probability.

Proposed Method

The proposed models classify reviews as positive or negative by first finding, in the thesaurus, Synset and Word2Vec, words similar to those present in the test review. The steps of the proposed method are as follows.

  1. Preprocessing of reviews

  2. Bigram probability matrix formation

  3. Classifying a test review
     (i) Sentiment thesaurus formation
     (ii) Similar words extraction
     (iii) Bigram with similar words probability calculation
     (iv) Predicting class of a test review

Initially, the training reviews are preprocessed to retain only the words useful for classification. Then the words from positive and negative reviews are formed into bigram models by computing the probability of each bigram and constructing a probability matrix for each category separately. When classifying a test review, the thesaurus, Synset and Word2Vec are used. In the thesaurus-based models, for each word in the test review, semantically similar words are taken from the thesaurus if the word is a sentiment word; otherwise, similar words are retrieved from Synset or Word2Vec. In the Synset- and Word2Vec-based models, semantically similar words are extracted from Synset and Word2Vec for both sentiment and non-sentiment words. The extracted similar words are then compared with the list of words in the training reviews; those present in positive/negative training reviews are combined separately to form two sets for each word of the test review. Based on the probability of the newly formed test review bigrams built from these similar-word combinations in the training reviews, a class is assigned to the test review.

Preprocessing of Reviews

The reviews are preprocessed by removing stop words, punctuations, and other non-alphabetical characters and converting all the characters to lower case. For instance, the review “Exterior of the phone was good” is changed to “exterior phone good”.
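A minimal Python sketch of this step; the function and its use of NLTK's English stop-word list are illustrative assumptions, not taken verbatim from the paper:

```python
import re
from nltk.corpus import stopwords  # requires nltk.download('stopwords')

STOP = set(stopwords.words('english'))

def preprocess(review: str) -> list[str]:
    """Lower-case, keep alphabetic tokens only, drop stop words."""
    tokens = re.findall(r'[a-z]+', review.lower())
    return [t for t in tokens if t not in STOP]

# preprocess("Exterior of the phone was good") -> ['exterior', 'phone', 'good']
```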

Let $R = \{r_i\}_{i=1}^{n}$ be the set of training reviews, and let $PR = \{pr_i\}_{i=1}^{pn}$ and $NR = \{nr_i\}_{i=1}^{nn}$ represent the collections of positive and negative reviews in the training dataset after preprocessing. Let $pr_i = \{pw_{ij}\}_{j=1}^{wc_i}$ and $nr_i = \{nw_{ij}\}_{j=1}^{wnc_i}$ be the sets of words in the $i$th positive and negative review, respectively, where $wc_i$ and $wnc_i$ are the numbers of words in the $i$th positive and negative review.

For example, suppose the training dataset contains $n = 8$ reviews $r_1, r_2, \ldots, r_8$, of which $pn = 4$ are labeled positive and $nn = 4$ negative. The positive reviews are $\{pr_1, pr_2, pr_3, pr_4\}$ and the negative reviews are $\{nr_1, nr_2, nr_3, nr_4\}$. Suppose the word counts of the four positive reviews are $wc_1 = 3$, $wc_2 = 2$, $wc_3 = 4$, $wc_4 = 2$ and those of the four negative reviews are $wnc_1 = 2$, $wnc_2 = 2$, $wnc_3 = 3$, $wnc_4 = 2$. Each positive review $pr_i$ is then the word set $\{pw_{i1}, pw_{i2}, \ldots, pw_{i\,wc_i}\}$: $pr_1 = \{pw_{11}, pw_{12}, pw_{13}\}$, $pr_2 = \{pw_{21}, pw_{22}\}$, $pr_3 = \{pw_{31}, pw_{32}, pw_{33}, pw_{34}\}$ and $pr_4 = \{pw_{41}, pw_{42}\}$. The negative reviews are represented by their word sets in the same way.
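For concreteness, the same toy example as plain Python structures; the pw/nw tokens are notation placeholders, not real words:

```python
# Toy training data mirroring the worked example above.
positive_reviews = {
    1: ['pw11', 'pw12', 'pw13'],          # wc1 = 3
    2: ['pw21', 'pw22'],                  # wc2 = 2
    3: ['pw31', 'pw32', 'pw33', 'pw34'],  # wc3 = 4
    4: ['pw41', 'pw42'],                  # wc4 = 2
}
negative_reviews = {
    1: ['nw11', 'nw12'],                  # wnc1 = 2
    2: ['nw21', 'nw22'],                  # wnc2 = 2
    3: ['nw31', 'nw32', 'nw33'],          # wnc3 = 3
    4: ['nw41', 'nw42'],                  # wnc4 = 2
}
```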

Another task done during preprocessing is to extract all distinct words from the reviews and form two lists: one containing the distinct words from positive reviews and the other the distinct words from negative reviews. These lists are used while forming the lists of synonymous words during testing.

Let $plist$ and $nlist$ be two lists containing the distinct words from positive reviews and negative reviews, respectively, and let $pwc$ and $nwc$ be their respective word counts. The lists are defined in Eqs. (1) and (2):

$$plist = plist \cup \{pw_{ij}\} \quad \text{if } pw_{ij} \notin plist, \; 1 \le i \le pn, \; 1 \le j \le wc_i, \tag{1}$$

$$nlist = nlist \cup \{nw_{ij}\} \quad \text{if } nw_{ij} \notin nlist, \; 1 \le i \le nn, \; 1 \le j \le wnc_i. \tag{2}$$
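Continuing the toy structures above, a minimal sketch of building the two distinct-word lists:

```python
def distinct_words(reviews: dict[int, list[str]]) -> set[str]:
    """Collect the distinct words across a review collection (Eqs. 1 and 2)."""
    words: set[str] = set()
    for review in reviews.values():
        words.update(review)
    return words

plist = distinct_words(positive_reviews)   # distinct positive-review words
nlist = distinct_words(negative_reviews)   # distinct negative-review words
pwc, nwc = len(plist), len(nlist)
```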

Bigram Probability Matrix Formation

After preprocessing, the positive and negative training reviews are each represented as a matrix whose rows and columns are the words present in the reviews. Each entry gives the probability that the word of the corresponding column follows the word of the corresponding row in the positive/negative review collection. Let $posprob(w_i, w_j) = prob(w_j \mid w_i)$, $1 \le i, j \le pwc$, and $negprob(w_i, w_j) = prob(w_j \mid w_i)$, $1 \le i, j \le nwc$, denote the probability matrices of the positive and negative reviews, respectively.
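A sketch of this construction using maximum-likelihood estimates $count(w_i, w_j)/count(w_i)$; the paper does not state its estimation details, so the lack of smoothing here is an assumption:

```python
from collections import Counter

def bigram_probabilities(reviews: list[list[str]]) -> dict[tuple[str, str], float]:
    """Estimate prob(w_j | w_i) from consecutive word pairs across reviews."""
    pair_counts, left_counts = Counter(), Counter()
    for words in reviews:
        for wi, wj in zip(words, words[1:]):
            pair_counts[(wi, wj)] += 1
            left_counts[wi] += 1
    return {(wi, wj): c / left_counts[wi]
            for (wi, wj), c in pair_counts.items()}

posprob = bigram_probabilities([['exterior', 'phone', 'good']])
# posprob[('phone', 'good')] == 1.0
```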

Classifying a Test Review

Once the processing of the training dataset is over, a test review can be classified as positive or negative by first finding the similar context words of the words present in the test data and then finding the probability of all combinations of its bigrams in the training dataset. Finally, the class is predicted based on the logarithmic summation of the probabilities of the test review bigrams in the positive and negative training reviews.

Sentiment Thesaurus Formation

For forming the thesaurus of sentiment words, sets of positive and negative words are collected from http://ptrckprry.com/course/ssd/data/positive-words.txt and http://ptrckprry.com/course/ssd/data/negative-words.txt [26]. Synonyms for each word in the sets are extracted from thesaurus.com. Two major advantages of using a web-based thesaurus are its huge vocabulary and the many synonyms it includes for any given sentiment word; it returns synonyms for almost all sentiment words passed to it. The major reason for using this thesaurus in product review classification is that reviewers more often use strongly sentiment-laden words when presenting their view of a product.

The words collected are represented using $poslist$ and $neglist$ as in Eqs. (3) and (4). Equations (5) and (6) show the respective synonym sets extracted from the website, where $possyn_i$ and $negsyn_i$ are the synonym sets of the $i$th word in the positive and negative list, respectively:

$$poslist = \{posw_i\}_{i=1}^{pcount}, \tag{3}$$

$$neglist = \{negw_i\}_{i=1}^{ncount}, \tag{4}$$

$$possyn_i = \{possynw_{ij}\}_{j=1}^{psyncount_i}, \tag{5}$$

$$negsyn_i = \{negsynw_{ij}\}_{j=1}^{nsyncount_i}. \tag{6}$$
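A sketch of how the thesaurus can be materialized, assuming the two word lists have been downloaded as local files; the loader and the inline sample entry are illustrative, and fetching synonyms from thesaurus.com is left abstract:

```python
# Illustrative loader; lines starting with ';' are assumed to be comments,
# as in the distributed word-list files.
def load_words(path: str) -> list[str]:
    with open(path, encoding='latin-1') as f:
        return [ln.strip() for ln in f if ln.strip() and not ln.startswith(';')]

poslist = load_words('positive-words.txt')   # Eq. (3)
neglist = load_words('negative-words.txt')   # Eq. (4)

# The sentiment thesaurus maps each word to its synonym set (Eqs. 5-6);
# the 'beautiful' entry below is an abridged sample from Table 3.
sentiment_thesaurus = {
    'beautiful': {'alluring', 'appealing', 'charming', 'cute', 'lovely',
                  'pretty', 'gorgeous', 'elegant'},
    # ... one entry per word in poslist and neglist
}
```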

Similar Words Extraction

All unique words from the test review are collected after preprocessing. Each extracted word is compared with the constructed thesaurus; if it is a sentiment word present in the thesaurus, the respective set of similar words is collected. If the word is a non-sentiment word, it is sent to the Synset of Wordnet or Google's Word2Vec to find semantically similar words. In a further variant, Synset and Word2Vec are used for both sentiment and non-sentiment words.

Once the similar words are retrieved, they are compared with the positive and negative training words to construct two collections for each word: the first holds the similar words that also occur in positive training reviews, and the second holds those occurring in negative training reviews. This process is defined mathematically as follows.

Let $W = \{w_i\}_{i=1}^{m}$ represent the set of unique words from the test review. Let $psyn_{w_i}$ and $nsyn_{w_i}$ be the sets of similar words of the $i$th word taken from the positive thesaurus or Synset and the negative thesaurus or Synset, respectively. Let $pWord2Vec_{w_i}$ and $nWord2Vec_{w_i}$ be the sets of similar context words of the $i$th word taken from the positive thesaurus or Word2Vec and the negative thesaurus or Word2Vec, respectively. They are described in Eqs. (7)–(10):

$$psyn_{w_i} = \begin{cases} possyn_j & \text{if } posw_j = w_i \text{ and } 1 \le j \le pcount \\ synset_{w_i} & \text{otherwise,} \end{cases} \tag{7}$$

$$nsyn_{w_i} = \begin{cases} negsyn_j & \text{if } negw_j = w_i \text{ and } 1 \le j \le ncount \\ synset_{w_i} & \text{otherwise,} \end{cases} \tag{8}$$

$$pWord2Vec_{w_i} = \begin{cases} possyn_j & \text{if } posw_j = w_i \text{ and } 1 \le j \le pcount \\ Word2Vec\_sim_{w_i} & \text{otherwise,} \end{cases} \tag{9}$$

$$nWord2Vec_{w_i} = \begin{cases} negsyn_j & \text{if } negw_j = w_i \text{ and } 1 \le j \le ncount \\ Word2Vec\_sim_{w_i} & \text{otherwise.} \end{cases} \tag{10}$$

When synonyms of sentiment words are taken only from Synset or Word2Vec, the first cases of Eqs. (7)–(10) are replaced by $synset_{w_i}$ if $posw_j = w_i$ and $1 \le j \le pcount$; $synset_{w_i}$ if $negw_j = w_i$ and $1 \le j \le ncount$; $Word2Vec\_sim_{w_i}$ if $posw_j = w_i$ and $1 \le j \le pcount$; and $Word2Vec\_sim_{w_i}$ if $negw_j = w_i$ and $1 \le j \le ncount$, respectively.

Let $ptw_i$ and $ntw_i$ represent the words of $psyn_{w_i}$ and $nsyn_{w_i}$ that also occur in positive and negative training reviews, respectively. Initially, $ptw_i = ntw_i = \{w_i\}$ for $1 \le i \le m$, and the process is defined in Eqs. (11) and (12):

$$ptw_i = ptw_i \cup \{psynw_{ij}\} \quad \text{if } psynw_{ij} \in plist, \; 1 \le j \le pc_i, \tag{11}$$

$$ntw_i = ntw_i \cup \{nsynw_{ij}\} \quad \text{if } nsynw_{ij} \in nlist, \; 1 \le j \le nc_i. \tag{12}$$

In the same way, the words of $pWord2Vec_{w_i}$ and $nWord2Vec_{w_i}$ that also occur in positive and negative training reviews are denoted $pvtw_i$ and $nvtw_i$, respectively. Initially, $pvtw_i = nvtw_i = \{w_i\}$ for $1 \le i \le m$, and the process is described in Eqs. (13) and (14):

$$pvtw_i = pvtw_i \cup \{pWord2Vecw_{ij}\} \quad \text{if } pWord2Vecw_{ij} \in plist, \; 1 \le j \le pcv_i, \tag{13}$$

$$nvtw_i = nvtw_i \cup \{nWord2Vecw_{ij}\} \quad \text{if } nWord2Vecw_{ij} \in nlist, \; 1 \le j \le ncv_i, \tag{14}$$

where $pc_i$, $nc_i$, $pcv_i$ and $ncv_i$ represent the numbers of words in $psyn_{w_i}$, $nsyn_{w_i}$, $pWord2Vec_{w_i}$ and $nWord2Vec_{w_i}$, respectively.
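A sketch of this retrieval-and-restriction logic; `sentiment_thesaurus` is the structure built earlier, `synonym_source` stands for either of the Synset/Word2Vec lookups (shown after Table 4), and the restriction mirrors Eqs. (11)–(14):

```python
def similar_words(word, thesaurus, synonym_source):
    """Eqs. (7)-(10): thesaurus entry for a sentiment word, otherwise fall
    back to a synonym source (Synset or Word2Vec lookup)."""
    syns = thesaurus.get(word)
    return set(syns) if syns else set(synonym_source(word))

def restrict_to_training(word, sims, train_words):
    """Eqs. (11)-(14): the word itself plus those of its similar words that
    also occur in the training reviews (ptw_i / ntw_i / pvtw_i / nvtw_i)."""
    return {word} | (set(sims) & set(train_words))

# e.g. ptw = restrict_to_training('good',
#           similar_words('good', sentiment_thesaurus, synset_words), plist)
```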

Bigrams with Similar Words Probability Calculation

Once the sets are formed for each word in the test review, the combinations of the similar words in the positive/negative sets are used to form bigrams, and the probability of each formed bigram in the positive/negative training dataset is found. Finally, the summation of all positive/negative logarithmic probabilities is calculated and the respective class is assigned to the test review. Equations (15) and (16) describe this process for the Synset- and thesaurus-based model.

$$P(\hat{y}_1 \mid w_1 \ldots w_m) = \sum_{x=1}^{m-1} \sum_{w_i \in ptw_x} \sum_{w_j \in ptw_{x+1}} \log posprob(w_i, w_j), \tag{15}$$

$$P(\hat{y}_2 \mid w_1 \ldots w_m) = \sum_{x=1}^{m-1} \sum_{w_i \in ntw_x} \sum_{w_j \in ntw_{x+1}} \log negprob(w_i, w_j). \tag{16}$$

In the same way, the probability of the test review belonging to each category is calculated in the Word2Vec- and thesaurus-based model, as given in Eqs. (17) and (18):

$$P(\hat{y}_1 \mid w_1 \ldots w_m) = \sum_{x=1}^{m-1} \sum_{w_i \in pvtw_x} \sum_{w_j \in pvtw_{x+1}} \log posprob(w_i, w_j), \tag{17}$$

$$P(\hat{y}_2 \mid w_1 \ldots w_m) = \sum_{x=1}^{m-1} \sum_{w_i \in nvtw_x} \sum_{w_j \in nvtw_{x+1}} \log negprob(w_i, w_j). \tag{18}$$

Predicting Class of a Test Review

Test review sentiment is predicted based on the summation of logarithmic probabilities of test bigrams in positive/negative training reviews. This prediction process is given in Eq. (19):

$$\hat{y} = \arg\max_{k=1,2} P(\hat{y}_k \mid w_1 \ldots w_m). \tag{19}$$
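A Python sketch of Eqs. (15)–(19); the `eps` floor for unseen bigrams is our assumption, since the paper does not state how zero-probability bigrams are handled:

```python
import math

def log_score(word_sets, prob, eps=1e-12):
    """Eqs. (15)-(18): cumulative log-probability over all similar-word
    bigram combinations at consecutive test-review positions."""
    total = 0.0
    for left, right in zip(word_sets, word_sets[1:]):
        for wi in left:
            for wj in right:
                total += math.log(prob.get((wi, wj), 0.0) + eps)
    return total

def predict(pos_sets, neg_sets, posprob, negprob):
    """Eq. (19): pick the class with the larger cumulative log-probability."""
    return ('positive'
            if log_score(pos_sets, posprob) >= log_score(neg_sets, negprob)
            else 'negative')
```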

Experimental Results

To evaluate the performance of the proposed bigram improvement methods, 10 product review datasets from Amazon.com are considered. These datasets contain customer reviews of various products, expressing the customers' sentiment about the product. Each dataset has 1000 positive and 1000 negative reviews.

Experimental Setup

Experiments are conducted by predicting the class of test reviews based on probability values calculated under the scenarios presented in Table 1. Methods M2–M6 consider similar words from only one resource each, whereas methods M7–M13 consider similar words from combinations of the three resources.

Table 1.

Scenarios

M1 Traditional bigram model
Methods which use a single resource
 M2 Similar words extraction from only Synset
 M3 Similar words extraction from only Word2Vec
 M4 Similar words extraction from thesaurus only for sentiment words
 M5 Similar words extraction from Synset only for sentiment words
 M6 Similar words extraction from Word2Vec only for sentiment words
Methods which use various combinations of resources
 M7 Similar words extraction from thesaurus and Synset for sentiment words
 M8 Similar words extraction from thesaurus and Word2Vec for sentiment words
 M9 Similar words extraction from Synset and Word2Vec for sentiment words
 M10 Similar words extraction from thesaurus, Synset and Word2Vec for sentiment words
 M11 Similar words extraction from thesaurus for sentiment words and from Synset for non-sentiment words
 M12 Similar words extraction from thesaurus for sentiment words and from Word2Vec for non-sentiment words
 M13 Similar words extraction from thesaurus for sentiment words and Synset and Word2Vec for non-sentiment words

The proposed models are also compared with state-of-the-art methods: unigram with TFIDF [6], bigram with TFIDF [6], and Word2Vec-based CNN [17]. All these comparisons are performed using the measures accuracy, precision, recall, F-measure, completeness and false alarm rate.

For calculating these measures, the terms True Positive (TP), True Negative (TN), False Positive (FP) and False Negative (FN) are used. They are defined in Table 2.

Table 2.

Actual and predicted class

Predicted \ Actual	Positive class	Negative class
Positive class	TP	FP
Negative class	FN	TN

Accuracy describes the percentage of test reviews that are classified correctly. Precision is the number of true positive results divided by the total number of reviews predicted as positive. Recall is the number of true positive results divided by the number of all reviews that should have been identified as positive. F-measure is the harmonic mean of precision and recall. Completeness measures the fraction of actual positive reviews predicted as positive among all reviews. False alarm rate gives the fraction of actual positive reviews predicted as negative relative to the total number of actual negative reviews. The mathematical definitions of these metrics are presented in Eqs. (20)–(25):

$$\text{Accuracy} = \frac{TP + TN}{TP + FP + TN + FN} \times 100, \tag{20}$$

$$\text{Precision} = \frac{TP}{TP + FP}, \tag{21}$$

$$\text{Recall} = \frac{TP}{TP + FN}, \tag{22}$$

$$\text{F-measure} = \frac{2 \times \text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}}, \tag{23}$$

$$\text{Completeness} = \frac{TP}{TP + FP + TN + FN}, \tag{24}$$

$$\text{False alarm rate} = \frac{FN}{TN + FP}. \tag{25}$$
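These six measures translate directly into code; a small sketch, with denominators assumed nonzero:

```python
def metrics(tp: int, fp: int, tn: int, fn: int) -> dict[str, float]:
    """Eqs. (20)-(25), with TP/FP/TN/FN as defined in Table 2."""
    total = tp + fp + tn + fn
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return {
        'accuracy': 100.0 * (tp + tn) / total,              # Eq. (20)
        'precision': precision,                             # Eq. (21)
        'recall': recall,                                   # Eq. (22)
        'f_measure': 2 * precision * recall
                     / (precision + recall),                # Eq. (23)
        'completeness': tp / total,                         # Eq. (24)
        'false_alarm_rate': fn / (tn + fp),                 # Eq. (25)
    }
```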

Sample Words from Thesaurus, Synset and Word2Vec

Positive and negative sentiment words are downloaded [26], their respective synonyms are extracted, and the thesaurus is constructed. Table 3 shows a sample set of sentiment words along with their synonyms.

Table 3.

Sample sentiment words in thesaurus

Words Synonyms
Beautiful 'alluring', 'appealing', 'charming', 'cute', 'dazzling', 'delicate', 'delightful', 'elegant', 'exquisite', 'fascinating', 'fine', 'good-looking', 'gorgeous', 'graceful', 'grand', 'handsome', 'lovely', 'magnificent', 'marvelous', 'pleasing', 'pretty', 'splendid', 'stunning', 'superb', 'wonderful', 'admirable', 'angelic', 'beauteous', 'bewitching', 'classy', 'comely', 'divine', 'enticing', 'excellent', 'fair', 'foxy', 'ideal', 'nice', 'pulchritudinous', 'radiant', 'ravishing', 'refined', 'resplendent', 'shapely', 'sightly', 'statuesque', 'sublime', 'symmetrical', 'taking', 'well-formed'
Better 'exceptional', 'improved', 'superior', 'choice', 'exceeding', 'fitter', 'preferred', 'sophisticated', 'surpassing', 'bigger', 'finer', 'greater', 'larger', 'more desirable', 'more suitable', 'more valuable', 'preferable', 'prominent'
Disagree 'clash', 'contradict', 'differ', 'dissent', 'diverge', 'conflict', 'counter', 'depart', 'deviate', 'discord', 'disharmonize', 'vary', 'war', 'be dissimilar'
Fail 'decline', 'fall', 'abort', 'backslide', 'blunder', 'deteriorate', 'fizzle', 'flop', 'flounder', 'fold', 'founder', 'miscarry', 'miss', 'slip', 'be defeated', 'be found lacking', 'be ruined', 'come to nothing', 'fall short', 'go astray', 'go down swinging', 'go up in smoke', 'hit bottom', 'lose control', 'lose status', 'miss the boat', 'run aground'

The wordnet.synsets() function in the NLTK library and the model.wv.most_similar() function of the (gensim) Google Word2Vec model are used for extracting similar words from Synset and Word2Vec. Table 4 shows sample words from the reviews and the respective Synset words and Word2Vec words retrieved.

Table 4.

Semantically similar words retrieved using Synset and Word2Vec

Words Synset Word2Vec
Picture 'picture', 'image', 'impression', 'scene', 'movie', 'film', 'pic', 'video', 'photograph', 'photo', 'fancy', 'see', 'figure', 'show' 'pictures', 'photograph', 'photo', 'photos', 'images', 'image'
Guess 'guess', 'guessing', 'think', 'suppose', 'imagine', 'pretend' 'suppose', 'think', 'yeah', 'maybe', 'probably', 'anyway', 'know', 'hey'
Place 'place', 'position', 'shoes', 'home', 'post', 'office', 'situation', 'space', 'put', 'set', 'lay', 'rate', 'range', 'order', 'grade', 'locate', 'site', 'point', 'send' 'places', 'placed', 'finish'
Thing 'thing', 'matter' 'things', 'something', 'stuff', 'really', 'think', 'aspect', 'reason', 'kind'
Sleep 'sleep', 'slumber', 'sopor', 'nap', 'rest', 'eternal_rest', 'eternal_sleep', 'quietus', 'kip' 'sleeping', 'restful_sleep', 'restorative_sleep', 'slept', 'nap', 'wakings', 'naps', 'fitful_sleep', 'Sleep', 'doze'
Happy 'happy', 'felicitous', 'glad', 'well-chosen' 'glad', 'pleased', 'ecstatic', 'overjoyed', 'thrilled', 'satisfied', 'delighted', 'disappointed', 'excited'
Sad 'sad', 'deplorable', 'distressing', 'lamentable', 'pitiful', 'sorry' 'saddening', 'Sad', 'saddened', 'heartbreaking', 'disheartening', 'saddens_me', 'distressing', 'reminders_bobbing'
Good 'good', 'goodness', 'commodity', 'trade_good', 'full', 'estimable', 'honorable', 'respectable', 'beneficial', 'just', 'upright', 'adept', 'expert', 'practiced', 'proficient', 'skillful', 'skilful', 'dear', 'near', 'dependable', 'safe', 'secure', 'right', 'ripe', 'well', 'effective', 'in_effect', 'in_force', 'serious', 'sound', 'salutary', 'honest', 'undecomposed', 'unspoiled', 'unspoilt', 'thoroughly', 'soundly' 'great', 'terrific', 'decent', 'nice', 'excellent', 'fantastic', 'better', 'solid', 'lousy'
Bad 'bad', 'badness', 'big', 'tough', 'spoiled', 'spoilt', 'regretful', 'sorry', 'uncollectible', 'risky', 'high-risk', 'speculative', 'unfit', 'unsound', 'forged', 'defective', 'badly' 'good', 'terrible', 'horrible', 'Bad', 'lousy', 'crummy', 'horrid', 'awful', 'dreadful', 'horrendous'
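The two library calls named above can be wrapped as follows; this is a sketch, and the path to Google's pretrained vectors is an assumed local file:

```python
from nltk.corpus import wordnet            # requires nltk.download('wordnet')
from gensim.models import KeyedVectors

model = KeyedVectors.load_word2vec_format(
    'GoogleNews-vectors-negative300.bin', binary=True)

def synset_words(word: str) -> set[str]:
    """All lemma names across the WordNet synsets of `word` (synset_wi)."""
    return {lemma for s in wordnet.synsets(word) for lemma in s.lemma_names()}

def word2vec_words(word: str, topn: int = 10) -> set[str]:
    """Most similar context words from Word2Vec (Word2Vec_sim_wi)."""
    return ({w for w, _ in model.most_similar(word, topn=topn)}
            if word in model else set())

print(synset_words('picture'))     # cf. the 'Picture' row of Table 4
print(word2vec_words('picture'))
```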

Once the semantically similar words are extracted, they can be used to enumerate the combinations that extend the set of test review bigrams. Table 5 shows sample bigrams and the respective combinations formed using one or both of their similar words.

Table 5.

Sample bigram combinations using similar words

Existing bigram: Sample new bigrams

('phone', 'good'): ('phone', 'beneficial'), ('phone', 'sound'), ('phone', 'effective'), ('telephone', 'beneficial'), ('telephone', 'sound'), ('telephone', 'effective'), ('telephone', 'good'), ('headphone', 'beneficial'), ('headphone', 'sound'), ('headphone', 'effective'), ('earphone', 'beneficial'), ('earphone', 'sound'), ('earphone', 'effective')

('good', 'price'): ('beneficial', 'cost'), ('beneficial', 'price'), ('sound', 'cost'), ('sound', 'price'), ('effective', 'cost'), ('effective', 'price')

('price', 'paid'): ('cost', 'paid'), ('cost', 'give'), ('cost', 'pay'), ('cost', 'devote'), ('cost', 'bear'), ('price', 'give'), ('price', 'pay'), ('price', 'devote'), ('price', 'bear')

('battery', 'lasts'): ('battery', 'survive'), ('battery', 'live'), ('barrage', 'survive'), ('barrage', 'lasts'), ('barrage', 'live')

('camera', 'decent'): ('camera', 'nice'), ('camera', 'adequate'), ('camera', 'enough'), ('camera', 'properly'), ('camera', 'right')
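The Table 5 combinations are simply the Cartesian product of the two similar-word sets; a brief sketch:

```python
from itertools import product

def bigram_combinations(left_sims, right_sims):
    """All candidate bigrams built from the similar-word sets of the two
    members of an original bigram."""
    return list(product(sorted(left_sims), sorted(right_sims)))

pairs = bigram_combinations({'phone', 'telephone', 'headphone', 'earphone'},
                            {'good', 'beneficial', 'sound', 'effective'})
# 16 candidate bigrams, e.g. ('phone', 'good'), ('telephone', 'sound'), ...
```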

Figure 1 shows the number of sentiment words for which similar words are extracted from the thesaurus and the number of non-sentiment words for which similar words are extracted from Synset or Word2Vec.

Fig. 1 Number of words for which similar words are returned by thesaurus, Synset and Word2Vec

For a few non-sentiment words in the test review, both Synset and Word2Vec returned an empty list of similar words. Table 6 presents, for each dataset, the total number of words in the original test dataset, the number of words for which thesaurus and Synset returned a nonempty similar-word set, and the number for which thesaurus and Word2Vec returned a nonempty similar-word set.

Table 6.

Number of words for which similar words are retrieved

Dataset Total number of words Thesaurus + Synset Thesaurus + Word2Vec
Apparel 5924 1306 1485
Books 14,576 2993 4666
DVD 17,451 3180 5136
Electronics 11,628 2070 3146
Health 8274 1574 2548
Kitchen 8199 1703 2525
Music 12,067 2185 3745
Sports 9818 1834 3003
Toys 9416 1692 2752
Video 13,970 2593 4456

As the data in Table 6 show, Google's Word2Vec has the larger vocabulary, so it returns similar words for more of the words present in the reviews. Nevertheless, the result comparison shows that Synset is better than Word2Vec, since Word2Vec's huge vocabulary also yields many useless words.

Comparative Study

For comparing the performance of the proposed models with the traditional bigram model, test reviews are classified and the metrics are calculated and presented. Table 8 shows the accuracy produced by M1–M6 for all 10 datasets. The methods that include similar words (M2–M6) produce better results than the traditional bigram model (M1). Among M2–M6, the methods which use Synset (M2, M4 and M5) perform better in 80% of the cases. Though Word2Vec returned similar words for a larger number of words, its performance is weaker than Synset's: because its vocabulary is so large, it returns many meaningless synonyms. Table 7 lists words returned by Word2Vec which are not synonyms of the input word.

Table 8.

Accuracy of M1–M6

Dataset Accuracy Winner
M1 M2 M3 M4 M5 M6
Apparel 70.5 75 74 71 72.5 73 M2 (75%)
Books 70 74.5 75.5 76 75.5 73 M4 (76%)
DVD 66 74 68.5 70.5 74.5 67 M5 (74.5%)
Electronics 75 79 74 76 78.5 73 M2 (79%)
Health 74 79 77 77 77.5 75.5 M2 (79%)
Kitchen 72 77.5 76 73.5 74 71.5 M2 (77.5%)
Music 65.5 72 70 68.5 71.5 67 M2 (72%)
Sports 68 73.5 74 73 72.5 67 M3 (74%)
Toys 68 67 71.5 70.5 69 70.5 M3 (71.5%)
Video 70.5 78.5 73.5 74 75.5 71 M2 (78.5%)

Table 7.

Words and sample meaningless synonyms returned by Word2Vec

Word Useless words returned by Word2Vec
Guess “hey”, “know”
Thing “really”, “think”, “kind”
See “expect”, “imagine”
Came “got”, “gave”, “ran”
Supposed “going”, “trying”, ”wanted”, “not”, “want”

The combinations of all three resources were also tried, and the results are presented in Table 9. Using the thesaurus for sentiment words and Synset for non-sentiment words (M11) gave better results in 70% of the cases compared to the other combinations. When comparing all the methods M1–M13, methods which utilize Synset produced the highest accuracy for 90% of the datasets.

Table 9.

Accuracy of M7–M13

Dataset Accuracy Winner
M7 M8 M9 M10 M11 M12 M13
Apparel 69 69 68 69 79 77 70.5 M11 (79%)
Books 78.5 78 79.5 80 76 74 74 M10 (80%)
DVD 75 77.5 76.5 76 74.5 70 70 M8 (77.5%)
Electronics 75 75.5 72.5 72 79 76 67.5 M11 (79%)
Health 76.5 79 77 76 80 76.5 70.5 M11 (80%)
Kitchen 74 72.5 77 73.5 80.5 78.5 67.5 M11 (80.5%)
Music 65.5 66 68 64.5 73.5 71 69 M11 (73.5%)
Sports 69.5 69.5 71 69.5 74 72 71.5 M11 (74%)
Toys 74 73 72 72.5 69.5 73.5 69 M7 (74%)
Video 73.5 74.5 75.5 74.5 78 73.5 73 M11 (78%)

Overall, using the thesaurus for sentiment words and Synset for non-sentiment words improves classification performance significantly in terms of accuracy (Table 10).

Table 10.

Analysis of Tables 8 and 9

Dataset Winner from Table 8 Winner from Table 9 Overall winner
Apparel M2 (75%) M11 (79%) M11
Books M4 (76%) M10 (80%) M10
DVD M5 (74.5%) M8 (77.5%) M8
Electronics M2 (79%) M11 (79%) M2 and M11
Health M2 (79%) M11 (80%) M11
Kitchen M2 (77.5%) M11 (80.5%) M11
Music M2 (72%) M11 (73.5%) M11
Sports M3 (74%) M11 (74%) M3 and M11
Toys M3 (71.5%) M7 (74%) M7
Video M2 (78.5%) M11 (78%) M2

Precision, recall and F-measure produced by M1–M13 and the analysis of the results are presented in Tables 11, 12, 13, 14, 15, 16 and 17. When comparing the F-measure of M1–M6, the Synset-based methods (M2, M4 and M5) outperform the other methods in 90% of the cases, showing that Synset is better than the Word2Vec-based methods. The comparison of M7–M13 is presented in Table 16, which shows that Synset-based methods are better than the other methods in 90% of the cases.

Table 11.

Precision of M1–M6

Dataset Precision Winner
M1 M2 M3 M4 M5 M6
Apparel 0.695238 0.727273 0.718182 0.669355 0.68595 0.7053571 M2 (0.727273)
Books 0.75 0.752577 0.831169 0.788889 0.78022 0.75 M3 (0.831169)
DVD 0.728571 0.772727 0.717647 0.766234 0.775281 0.765625 M5 (0.775281)
Electronics 0.797619 0.795918 0.76087 0.765306 0.776699 0.7613636 M1 (0.797619)
Health 0.815789 0.808511 0.793478 0.787234 0.772277 0.8 M1 (0.815789)
Kitchen 0.72 0.752294 0.745283 0.715596 0.706897 0.7087379 M2 (0.752294)
Music 0.653465 0.683333 0.675439 0.67619 0.674797 0.6770833 M2 (0.683333)
Sports 0.676471 0.697479 0.714286 0.694915 0.669173 0.6416667 M3 (0.714286)
Toys 0.691489 0.67 0.708738 0.699029 0.669643 0.7070707 M3 (0.708738)
Video 0.71134 0.761468 0.72381 0.726415 0.721739 0.71875 M2 (0.761468)

Table 12.

Precision of M7–M13

Dataset Precision Winner
M7 M8 M9 M10 M11 M12 M13
Apparel 0.6376812 0.6376812 0.6343284 0.6357143 0.768518 0.79347826 0.6752137 M12 (0.79347826)
Books 0.728 0.7372881 0.7398374 0.734375 0.795454 0.84285714 0.7926829 M12 (0.84285714)
DVD 0.7083333 0.7477477 0.7226891 0.7063492 0.810127 0.8125 0.7272727 M12 (0.8125)
Electronics 0.6984127 0.7107438 0.6771654 0.6641791 0.822222 0.825 0.7058824 M12 (0.825)
Health 0.7190083 0.7457627 0.725 0.7063492 0.840909 0.8630137 0.7303371 M12 (0.8630137)
Kitchen 0.673913 0.6642336 0.7045455 0.6666667 0.808080 0.82022472 0.6767677 M12 (0.82022472)
Music 0.602649 0.6111111 0.6267606 0.5947712 0.707965 0.73333333 0.6727273 M12 (0.73333333)
Sports 0.6444444 0.6444444 0.6544118 0.6402878 0.706897 0.75581395 0.6806723 M12 (0.75581395)
Toys 0.6904762 0.6916667 0.6692308 0.6717557 0.696970 0.79012346 0.6862745 M12 (0.79012346)
Video 0.6850394 0.699187 0.704 0.6899225 0.769231 0.79012346 0.7053571 M12 (0.79012346)

Table 13.

Recall of M1–M6

Dataset Recall Winner
M1 M2 M3 M4 M5 M6
Apparel 0.73 0.8 0.79 0.83 0.83 0.79 M4 and M5 (0.83)
Books 0.6 0.73 0.64 0.71 0.71 0.6 M2 (0.73)
DVD 0.51 0.68 0.61 0.59 0.69 0.49 M5 (0.69)
Electronics 0.67 0.78 0.7 0.75 0.8 0.67 M5 (0.8)
Health 0.62 0.76 0.73 0.74 0.78 0.68 M5 (0.78)
Kitchen 0.72 0.82 0.79 0.78 0.82 0.73 M2 and M5 (0.82)
Music 0.66 0.82 0.77 0.71 0.83 0.65 M5 (0.83)
Sports 0.69 0.83 0.8 0.82 0.89 0.77 M5 (0.89)
Toys 0.65 0.67 0.73 0.72 0.75 0.7 M5 (0.75)
Video 0.69 0.83 0.76 0.77 0.83 0.69 M2 and M5 (0.83)

Table 14.

Recall of M7–M13

Dataset Recall Winner
M7 M8 M9 M10 M11 M12 M13
Apparel 0.88 0.88 0.85 0.89 0.83 0.73 0.79 M10 (0.89)
Books 0.91 0.87 0.91 0.94 0.7 0.59 0.65 M10 (0.94)
DVD 0.85 0.83 0.86 0.89 0.64 0.52 0.64 M10 (0.89)
Electronics 0.88 0.86 0.86 0.89 0.74 0.66 0.6 M10 (0.89)
Health 0.87 0.88 0.87 0.89 0.74 0.63 0.65 M10 (0.89)
Kitchen 0.93 0.91 0.93 0.94 0.8 0.73 0.67 M10 (0.94)
Music 0.91 0.88 0.89 0.91 0.8 0.66 0.74 M7 and M10 (0.91)
Sports 0.87 0.87 0.89 0.89 0.82 0.65 0.81 M9 and M10 (0.89)
Toys 0.87 0.83 0.87 0.88 0.69 0.64 0.7 M10 (0.88)
Video 0.87 0.86 0.88 0.89 0.8 0.64 0.79 M10 (0.89)

Table 15.

F-measure of M1–M6

Dataset F-measure Winner
M1 M2 M3 M4 M5 M6
Apparel 0.712195122 0.761904762 0.752380952 0.741071429 0.751131222 0.74528302 M2 (0.761904762)
Books 0.666666667 0.741116751 0.723163842 0.747368421 0.743455497 0.66666667 M4 (0.747368421)
DVD 0.6 0.723404255 0.659459459 0.666666667 0.73015873 0.59756098 M5 (0.73015873)
Electronics 0.72826087 0.787878788 0.729166667 0.757575758 0.78817734 0.71276596 M5 (0.78817734)
Health 0.704545455 0.783505155 0.760416667 0.762886598 0.776119403 0.73513514 M2 (0.783505155)
Kitchen 0.72 0.784688995 0.766990291 0.746411483 0.759259259 0.71921182 M2 (0.784688995)
Music 0.656716418 0.745454545 0.719626168 0.692682927 0.744394619 0.66326531 M2 (0.745454545)
Sports 0.683168317 0.757990868 0.754716981 0.752293578 0.763948498 0.7 M2 (0.757990868)
Toys 0.670103093 0.67 0.719211823 0.709359606 0.70754717 0.70351759 M3 (0.719211823)
Video 0.700507614 0.794258373 0.741463415 0.747572816 0.772093023 0.70408163 M2 (0.794258373)

Table 16.

F-measure of M7–M13

Dataset F-measure Winner
M7 M8 M9 M10 M11 M12 M13
Apparel 0.7394958 0.7394958 0.72649573 0.74166667 0.798076923 0.760416667 0.7281106 M11 (0.798076923)
Books 0.80888889 0.79816514 0.8161435 0.8245614 0.744680851 0.694117647 0.71428571 M10 (0.8245614)
DVD 0.77272727 0.78672986 0.78538813 0.78761062 0.715083799 0.634146341 0.68085106 M10 (0.78761062)
Electronics 0.77876106 0.77828054 0.75770925 0.76068376 0.778947368 0.733333333 0.64864865 M11 (0.778947368)
Health 0.78733032 0.80733945 0.79090909 0.78761062 0.787234043 0.728323699 0.68783069 M8 (0.80733945)
Kitchen 0.78151261 0.76793249 0.80172414 0.78008299 0.804020101 0.772486772 0.67336683 M11 (0.804020101)
Music 0.7250996 0.72131148 0.73553719 0.71936759 0.751173709 0.694736842 0.7047619 M11 (0.751173709)
Sports 0.74042553 0.74042553 0.75423729 0.74476987 0.759259259 0.698924731 0.73972603 M11 (0.759259259)
Toys 0.7699115 0.75454545 0.75652174 0.76190476 0.693467337 0.70718232 0.69306931 M7 (0.7699115)
Video 0.76651982 0.77130045 0.78222222 0.77729258 0.784313725 0.70718232 0.74528302 M11 (0.784313725)

Table 17.

Analysis of Tables 15 and 16

Dataset Winner from Table 15 Winner from Table 16 Overall winner
Apparel M2 (0.761904762) M11 (0.798076923) M11
Books M4 (0.747368421) M10 (0.8245614) M10
DVD M5 (0.73015873) M10 (0.78761062) M10
Electronics M5 (0.78817734) M11 (0.778947368) M5
Health M2 (0.783505155) M8 (0.80733945) M8
Kitchen M2 (0.784688995) M11 (0.804020101) M11
Music M2 (0.745454545) M11 (0.751173709) M11
Sports M2 (0.757990868) M11 (0.759259259) M11
Toys M3 (0.719211823) M7 (0.7699115) M7
Video M2 (0.794258373) M11 (0.784313725) M2

As a consolidation, proposed semantic similarity inclusion methods outperform the traditional bigram model. The reason for this improvement is that the reviewers use different words with the same meaning for presenting their feeling over a product. In this work, all the words and the combinations of semantically similar words of the words are taken for predicting the class of the given test review. This covers almost all similar combinations for bigrams in the test reviews.

Tables 18, 19 and 20 show the completeness scores produced by all the methods. From the results, it is observed that for all the datasets, Synset-based methods are superior to the other methods. Tables 21, 22 and 23 show the false alarm rate, which should be as low as possible since its numerator counts wrongly classified reviews. As the results show, Synset-based methods (M2, M4 and M5) are better than both the traditional bigram model (M1) and the Word2Vec-based methods (M3 and M6) for all datasets. The performance comparison of methods M7–M13 using completeness and false alarm rate is presented in Tables 19 and 22. In terms of both measures, Synset-based methods produce better results.

Table 18.

Completeness of M1–M6

Dataset Completeness Winner
M1 M2 M3 M4 M5 M6
Apparel 0.365 0.4 0.395 0.415 0.415 0.395 M4 and M5 (0.415)
Books 0.3 0.365 0.32 0.355 0.355 0.3 M2 (0.365)
DVD 0.255 0.34 0.305 0.295 0.345 0.245 M5 (0.345)
Electronics 0.335 0.39 0.35 0.375 0.4 0.335 M5 (0.4)
Health 0.31 0.38 0.365 0.37 0.39 0.34 M5 (0.39)
Kitchen 0.36 0.41 0.395 0.39 0.41 0.365 M2 and M5 (0.41)
Music 0.33 0.41 0.385 0.355 0.415 0.325 M5 (0.415)
Sports 0.345 0.415 0.4 0.41 0.445 0.385 M5 (0.445)
Toys 0.325 0.335 0.365 0.36 0.375 0.35 M5 (0.375)
Video 0.345 0.415 0.38 0.385 0.415 0.345 M2 and M5 (0.415)

Table 19.

Completeness of M7–M13

Dataset Completeness Winner
M7 M8 M9 M10 M11 M12 M13
Apparel 0.44 0.44 0.425 0.445 0.415 0.365 0.395 M10 (0.445)
Books 0.455 0.435 0.455 0.47 0.35 0.295 0.325 M10 (0.47)
DVD 0.425 0.415 0.43 0.445 0.32 0.26 0.32 M10 (0.445)
Electronics 0.44 0.43 0.43 0.445 0.32 0.33 0.3 M10 (0.445)
Health 0.435 0.44 0.435 0.445 0.37 0.315 0.325 M10 (0.445)
Kitchen 0.465 0.455 0.465 0.47 0.37 0.365 0.335 M10 (0.47)
Music 0.455 0.44 0.445 0.455 0.4 0.33 0.37 M7 and M10 (0.455)
Sports 0.435 0.435 0.445 0.445 0.41 0.325 0.405 M9 and M10 (0.445)
Toys 0.435 0.415 0.435 0.44 0.345 0.32 0.35 M10 (0.44)
Video 0.435 0.43 0.44 0.445 0.4 0.32 0.395 M10 (0.445)

Table 20.

Analysis of Tables 18 and 19

Dataset Winner from Table 18 Winner from Table 19 Overall winner
Apparel M4 and M5 (0.415) M10 (0.445) M10
Books M2 (0.365) M10 (0.47) M10
DVD M5 (0.345) M10 (0.445) M10
Electronics M5 (0.4) M10 (0.445) M10
Health M5 (0.39) M10 (0.445) M10
Kitchen M2 and M5 (0.41) M10 (0.47) M10
Music M5 (0.415) M7 and M10 (0.455) M7 and M10
Sports M5 (0.445) M9 and M10 (0.445) M5, M9 and M10
Toys M5 (0.375) M10 (0.44) M10
Video M2 and M5 (0.415) M10 (0.445) M10

Table 21.

False alarm rate of M1–M6

Dataset False alarm rate Winner
M1 M2 M3 M4 M5 M6
Apparel 0.27 0.2 0.21 0.17 0.17 0.21 M4 and M5 (0.17)
Books 0.4 0.27 0.36 0.29 0.29 0.4 M2 (0.27)
DVD 0.49 0.32 0.39 0.41 0.31 0.51 M5 (0.31)
Electronics 0.33 0.22 0.3 0.25 0.2 0.33 M5 (0.2)
Health 0.38 0.24 0.27 0.26 0.22 0.32 M5 (0.22)
Kitchen 0.28 0.18 0.21 0.22 0.18 0.27 M2 and M5 (0.18)
Music 0.34 0.18 0.23 0.29 0.17 0.35 M5 (0.17)
Sports 0.31 0.17 0.2 0.18 0.11 0.23 M5 (0.11)
Toys 0.35 0.33 0.27 0.28 0.25 0.3 M5 (0.25)
Video 0.31 0.17 0.24 0.23 0.17 0.31 M2 and M5 (0.17)

Table 22.

False alarm rate of M7–M13

Dataset False alarm rate Winner
M7 M8 M9 M10 M11 M12 M13
Apparel 0.12 0.12 0.15 0.11 0.17 0.27 0.21 M10 (0.11)
Books 0.09 0.13 0.09 0.06 0.3 0.41 0.35 M10 (0.06)
DVD 0.15 0.17 0.14 0.11 0.36 0.48 0.36 M10 (0.11)
Electronics 0.12 0.14 0.14 0.11 0.26 0.34 0.4 M10 (0.11)
Health 0.13 0.12 0.13 0.11 0.26 0.37 0.35 M10 (0.11)
Kitchen 0.07 0.09 0.07 0.06 0.2 0.27 0.33 M10 (0.06)
Music 0.09 0.12 0.11 0.09 0.2 0.34 0.26 M7 and M10 (0.09)
Sports 0.13 0.13 0.11 0.11 0.18 0.35 0.19 M9 and M10 (0.11)
Toys 0.13 0.17 0.13 0.12 0.31 0.36 0.3 M10 (0.12)
Video 0.13 0.14 0.12 0.11 0.2 0.36 0.21 M10 (0.11)

Table 23.

Analysis of Tables 21 and 22

Dataset Winner from Table 21 Winner from Table 22 Overall winner
Apparel M4 and M5 (0.17) M10 (0.11) M10
Books M2 (0.27) M10 (0.06) M10
DVD M5 (0.31) M10 (0.11) M10
Electronics M5 (0.2) M10 (0.11) M10
Health M5 (0.22) M10 (0.11) M10
Kitchen M2 and M5 (0.18) M10 (0.06) M10
Music M5 (0.17) M7 and M10 (0.09) M7 and M10
Sports M5 (0.11) M9 and M10 (0.11) M5, M9 and M10
Toys M5 (0.25) M10 (0.12) M10
Video M2 and M5 (0.17) M10 (0.11) M10

When we consolidate the results presented above, Synset-based methods perform well in most of the cases. Word2Vec-based methods perform on par with a few Synset-based methods but lag behind in some of the values produced. The reason is that although Word2Vec has a large vocabulary, it returns fewer truly similar words for many of the words sent to it, whereas Synset, even though its vocabulary is smaller, returns more accurate similar words for the given review words. Overall, using Synset with and without the thesaurus outperforms the conventional bigram model (M1) and Word2Vec with and without the thesaurus.

Comparison with State of the Art Methods

The effectiveness of the proposed Synset and Word2Vec models, with and without the thesaurus, is also compared with unigram using TFIDF [6], bigram using TFIDF [6], and Word2Vec CNN with random vector generation [17], Skipgram [17] and CBOW [17]. In [6], the Naïve Bayes (NB) classification algorithm is applied after computing TFIDF values for unigrams and bigrams. We implemented the same technique and executed it on the 10 datasets considered in this work.

In Word2Vec CNN [17], the CNN algorithm is used for classifying news articles and tweets, with vectors generated using a random method, the skipgram method and the CBOW method. In this paper, CNN is executed with all three vector formation techniques for product reviews and the results are presented. Figures 2, 3, 4, 5, 6 and 7 compare Synset and Word2Vec with and without the thesaurus (M2–M13) against NB [6] using the various measures.

Fig. 2 Comparison of proposed models with NB using accuracy

Fig. 3 Comparison of proposed models with NB using precision

Fig. 4 Comparison of proposed models with NB using recall

Fig. 5 Comparison of proposed models with NB using F-measure

Fig. 6 Comparison of proposed models with NB using completeness

Fig. 7 Comparison of proposed models with NB using false alarm rate

The comparison charts show that in all cases, the bigram model with TFIDF and NB is better than the unigram model with TFIDF and NB. When comparing the proposed Synset and Word2Vec models with and without the thesaurus (M2–M13) with this state-of-the-art method, Synset-based models are better in 90% of the cases.

For comparing the proposed models with CNN [17], the CNN algorithm is executed 5 times and the result of the best run is compared with the proposed techniques. Figure 8 shows the accuracy of CNN using random vector generation, skipgram and CBOW over the 5 executions.

Fig. 8 Accuracy of 5 runs of CNN for 10 datasets

Performance comparison of the proposed models with CNN is presented in Figs. 9, 10, 11, 12, 13 and 14. The results show that the proposed models produced more accurate results than CNN for 9 out of 10 datasets. The methods which generated the most accurate results are based on the combination of Synset and the thesaurus. The major reason for the improvement is the inclusion of synonyms, which provides various combinations of words for the test review bigrams; these combinations represent the same sentiments through different words. Also, the Synset-based model outperforms all the other methods: though the vocabulary of Synset is smaller than that of Word2Vec, it gives a significant improvement in the results as it contains an accurate set of synonyms for the words passed to it.

Fig. 9 Comparison of proposed models with CNN using accuracy

Fig. 10 Comparison of proposed models with CNN using precision

Fig. 11 Comparison of proposed models with CNN using recall

Fig. 12 Comparison of proposed models with CNN using F-measure

Fig. 13 Comparison of proposed models with CNN using completeness

Fig. 14 Comparison of proposed models with CNN using false alarm rate

Discussion

Experimental results show that the proposed models perform better than their predecessors in 90% of the cases. The proposed models improve the classification of product reviews through synonyms: they retrieve semantically similar words from the thesaurus for sentiment-oriented words and from Synset and Word2Vec for both sentiment and non-sentiment words. As sentiment words carry more weight in finding the sentiment of a review, we have also used Synset and Word2Vec only for sentiment words. The objective of this improvement is to produce more accurate results, and the results confirm an improvement when the proposed models are used to extract synonyms for the words present in the reviews. Across all the measures presented in the paper, Synset-based methods (M2, M5, M7, M10 and M11) work well. When considering accuracy, M11 produces better results than M2, M5, M7 and M10; when F-measure is considered, the performance of M11 and the other Synset-based methods is the same. In all the other measures, M10 works well, meaning that more positive reviews are correctly classified using M10; but when considering both positive and negative reviews, M11 works well.

The improvements in the results are due to the fact that the thesaurus provides good synonyms for sentiment words and Synset provides better semantically similar words for both sentiment and non-sentiment words. Also, different reviewers use different words with similar meanings to express their sentiment. To accommodate this, bigrams are formed not only from the words in the test reviews but also from their synonyms.

Findings

In summary, there are three resources from which synonyms of words are extracted: the sentiment thesaurus, Synset and Word2Vec. The conclusion from the results is that Synset is ideal for the improved bigram model. When dividing words into sentiment and non-sentiment words, the combination of the thesaurus for sentiment words with Synset for non-sentiment words, and Synset alone for sentiment words, are the ideal choices. As the thesaurus contains synonyms only for sentiment words, the thesaurus-only method does not work well on its own, but it yields a good classification model when combined with Synset.

When we analyze the performance of the Word2Vec models, performance is better when Word2Vec is combined with the thesaurus under the sentiment/non-sentiment word segregation. As the accuracy measure weighs positive and negative class reviews equally, the thesaurus and Synset combination is the best choice when the application treats both classes equally. Since the other measures give more weight to the positive class, and since the Synset-only model and the model which utilizes the thesaurus, Synset and Word2Vec for sentiment words produced better results on those measures, they can be used in applications where positive reviews are more important than negative reviews.

Conclusion

Product review classification assigns a category to each review based on the text content present in it. This task is necessary in the current era since it reveals the attitude of users towards a particular product. In this paper, we proposed different models for classifying reviews that include semantically similar words for the words in the test bigrams using a thesaurus, Synset and Word2Vec. The performance of the proposed models is compared with the traditional bigram model and the results are presented. From the results, it is shown that the proposed Synset models, with and without the thesaurus, are superior to the traditional bigram model and to Word2Vec with and without the thesaurus. We have also compared our models with other recent works on sentiment analysis using unigram and bigram with NB and with CNN, and observed that the proposed models perform much better than those methods. In future work, the SentiWordnet score can be combined with the bigram model to perform classification, synonym inclusion can be tried with the n-gram model instead of the bigram model, and PoS tagging can be utilized to identify bigrams having the same tags.

List of Symbols

$R$: set of training reviews
$r_i$: $i$th review
$n$: number of reviews
$PR$: set of positive training reviews
$pr_i$: $i$th positive training review
$pn$: number of positive training reviews
$NR$: set of negative training reviews
$nr_i$: $i$th negative training review
$nn$: number of negative training reviews
$pw_{ij}$: $j$th word in $i$th positive training review
$wc_i$: number of words in $i$th positive training review
$nw_{ij}$: $j$th word in $i$th negative training review
$wnc_i$: number of words in $i$th negative training review
$plist$: list of words from positive training reviews
$nlist$: list of words from negative training reviews
$pwc$: number of words in $plist$
$nwc$: number of words in $nlist$
$posprob(w_i, w_j)$: probability of $w_i$ followed by $w_j$ in positive training reviews
$negprob(w_i, w_j)$: probability of $w_i$ followed by $w_j$ in negative training reviews
$poslist$: set of positive words collected from http://ptrckprry.com/course/ssd/data/positive-words.txt
$posw_i$: $i$th word in $poslist$
$pcount$: number of words in $poslist$
$neglist$: set of negative words collected from http://ptrckprry.com/course/ssd/data/negative-words.txt
$negw_i$: $i$th word in $neglist$
$ncount$: number of words in $neglist$
$possyn_i$: synonym set of $i$th word in $poslist$
$possynw_{ij}$: $j$th word in $i$th positive synonym set
$psyncount_i$: number of words in $i$th positive synonym set
$negsyn_i$: synonym set of $i$th word in $neglist$
$negsynw_{ij}$: $j$th word in $i$th negative synonym set
$nsyncount_i$: number of words in $i$th negative synonym set
$W$: set of unique words from the test review
$w_i$: $i$th word in $W$
$m$: number of unique words in the test review
$psyn_{w_i}$: set of similar words of $i$th word in positive thesaurus or Synset
$nsyn_{w_i}$: set of similar words of $i$th word in negative thesaurus or Synset
$pc_i$: number of words in $psyn_{w_i}$
$nc_i$: number of words in $nsyn_{w_i}$
$pWord2Vec_{w_i}$: set of similar context words of $i$th word in positive thesaurus or Word2Vec
$nWord2Vec_{w_i}$: set of similar context words of $i$th word in negative thesaurus or Word2Vec
$pcv_i$: number of words in $pWord2Vec_{w_i}$
$ncv_i$: number of words in $nWord2Vec_{w_i}$
$synset_{w_i}$: synonym set of $i$th word from Wordnet
$Word2Vec\_sim_{w_i}$: synonym set of $i$th word using Word2Vec
$ptw_i$: words present in both $psyn_{w_i}$ and positive training reviews
$ntw_i$: words present in both $nsyn_{w_i}$ and negative training reviews
$pvtw_i$: words present in both $pWord2Vec_{w_i}$ and positive training reviews
$nvtw_i$: words present in both $nWord2Vec_{w_i}$ and negative training reviews
$P(\hat{y}_1 \mid w_1 \ldots w_m)$: probability of the test review belonging to the positive class
$P(\hat{y}_2 \mid w_1 \ldots w_m)$: probability of the test review belonging to the negative class

Funding

Not applicable.

Availability of data and material

Not applicable.

Code availability

Not applicable.

Declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Informed consent

Not applicable.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

S. Poomagal, Email: poomagal.swamy@gmail.com

B. Malar, Email: rsakthimalar@gmail.com

E. M. Ranganayaki, Email: ranguem@gmail.com

K. Deepika, Email: deepikrish24@gmail.com

G. Dheepak, Email: dheepak39@gmail.com

References

  • 1.Pang B, Lee L, Vaithyanathan S. Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of Emnlp, p. 79–86, 2002.
  • 2.Salvetti F, Reichenbach C, Lewis S. Opinion polarity identification of movie reviews. In: Computing attitude and affect in text: theory and applications. Chapter 23, p. 303–16.
  • 3.Beineke P, Hastie T, Manning C, Vaithyanathan S. Exploring sentiment summarization. In: AAAI Spring Symposium on Exploring Attitude and Affect in Text: Theories and Applications, 2004.
  • 4.Sahu T, Ahuja S. Sentiment analysis of movie reviews: a study on feature selection and classification algorithms. In: International Conference on Microelectronics, Computing and Communications (MicroCom), p. 1–6, 2016.
  • 5.Bodapati J, Veeranjaneyulu N, Shaik S. Sentiment analysis of movie reviews using LSTMs. Ingénierie des systèmes d information. 2019;24(1):125–129. doi: 10.18280/isi.240119. [DOI] [Google Scholar]
  • 6.Tripathy A, Agarwal A, Rath SK. Classification of sentiments using n-gram machine learning approach. Expert Syst Appl. 2016;57:117–126. doi: 10.1016/j.eswa.2016.03.028. [DOI] [Google Scholar]
  • 7.Vinodhini G, Chandrasekaran RM. A comparative performance evaluation of neural network based approach for sentiment classification of online reviews. J King Saud Univ Comput Inf Sci. 2016;28(1):2–12. [Google Scholar]
  • 8.Bakliwal A, Patil A, Arora P, Varma V. Towards enhanced opinion classification using NLP techniques. In Proceedings of the Workshop on Sentiment Analysis where AI meets Psychology (SAAIP 2011), 101–7, Chiang Mai, Thailand, 2011.
  • 9.Acosta J, Lamaute N, Luo MX, Finkelstein E, Cotoranu A. Sentiment analysis of Twitter messages using Word2Vec. Student-Faculty Research Day, CCIS, Pace University, c8-1–c8-7, 2017.
  • 10.Bansal B, Srivastava S. Sentiment classification of online consumer reviews using word vector representations. In: International Conference on Computational Intelligence and Data Science (ICCIDS 2018).
  • 11.Mohammed B, Fatima SS. Using skip gram, n gram, and Part of Speech features for sentiment classification of Twitter messages. In: ICON, 2015.
  • 12.Awachate P, Vivek B, Kshirsagar P. Improved Twitter sentiment analysis using N gram feature selection and combinations. Int J Adv Res Comput Commun Eng. 2016;5(9):154–7, ISO 3297:2007 Certified.
  • 13.Hameed MA, Hussain AR, Sayeedunnissa SF. Sentiment analysis using Naive Bayes with Bigrams. Int J Adv Comput Sci Appl IJCSIA. 2014;4(2):84–7.
  • 14.Thet TT, Na JC, Khoo CSG. Aspect based sentiment analysis of movie reviews on discussion boards. J Inf Sci. 2010;36(6):823–48.
  • 15.Singh J, Singh G, Singh R. Optimization of sentiment analysis using machine learning classifiers. Hum Cent Comput Inf Sci. 2017;7:32. doi: 10.1186/s13673-017-0116-3. [DOI] [Google Scholar]
  • 16.Tsutsumi K, Shimada K, Endo T. Movie review classification on a multiple classifier. In: PACLIC, 2007.
  • 17.Jang B, Kim I, Kim JW. Word2Vec convolutional neural network for classification of news articles and tweets. PLoS ONE. 2019;14(8):e0220976. [DOI] [PMC free article] [PubMed]
  • 18.Sasikala P, Mary Immaculate Sheela L. Sentiment analysis of online product reviews using DLMNN and future prediction of online product using IANFIS. J Big Data. 2020;7:33. doi: 10.1186/s40537-020-00308-7. [DOI] [Google Scholar]
  • 19.Fauzi MA. Word2Vec model for sentiment analysis of product reviews in Indonesian language. Int J Electr Comput Eng (IJECE). 2019;9(1):525–30.
  • 20.Poomagal S, Malar B, Inamul Hassan, Kishor R. A novel Tag_Score (T_S) model with improved K-means for clustering tweets. Sadhana Indian Acad Sci. 2020;45:1–13. (Article ID : 0125).
  • 21.Zhang S. Sentiment classification of news text data using intelligent model. Front Psychol. 2021;12:1–9. doi: 10.3389/fpsyg.2021.758967. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Fei R, Yao Q, Zhu Y, Xu Q, Hu B. Deep learning structure for cross-domain sentiment classification based on improved cross entropy and weight. Sci Program. 2020;2020. (Article ID : 3810261).
  • 23.Souma W, Vodenska I, Aoyama H. Enhanced news sentiment analysis using sentiment methods. J Comput Soc Sci. 2019;2:33–46. doi: 10.1007/s42001-019-00035-x. [DOI] [Google Scholar]
  • 24.Chandra, R. and Krishna, A. COVID-19 sentiment analysis via deep learning during the rise of novel cases. PLoS ONE. 2021;16(8):e0255615. [DOI] [PMC free article] [PubMed]
  • 25.Miranda E, Aryuni M, Hariyanto R, Surya ES. Sentiment analysis using Sentiwordnet and machine learning approach (Indonesian general election opinion from Twitter content). In: International Conference on Information Management and Technology, 2019.
  • 26.Liu B, Hu M, Cheng J. Opinion observer: analyzing and comparing opinions on the web. In: Proceedings of the 14th International World Wide Web Conference (WWW-2005), May, 10–14, 2005, Chiba, Japan.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Not applicable.


