TY - JOUR
T1 - Sentiment strength detection with a context-dependent lexicon-based convolutional neural network
AU - Huang, Minghui
AU - Xie, Haoran
AU - Rao, Yanghui
AU - Feng, Jingrong
AU - Wang, Fu Lee
N1 - Publisher Copyright:
© 2020 Elsevier Inc.
PY - 2020/5
Y1 - 2020/5
N2 - Sentiment strength detection is an essential task in sentiment analysis, wherein the sentiment strength of subjective text is automatically determined. Sentiment analysis has numerous applications in different sectors, including business and social domains. In this study, we present a model to effectively extract the features and strength of sentiment from words and text using a context-dependent, lexicon-based convolutional neural network. To build this convolutional neural network, the model is trained using the sentiment polarity for each word from a co-occurrence pattern of words and labels. Then, a context-dependent lexicon is generated from the corpus, which is used to generate positive and negative sentiment word embeddings. Positive sentiment word embeddings, negative sentiment word embeddings, and the pre-trained word embeddings are input to a 3-channel convolutional neural network (CNN) to predict the strength of the sentiments. Moreover, with the trained convolutional neural network model, we can obtain a learned sentiment strength-specific word embedding, which generates a sentiment strength-specific lexicon (SSS-Lex) that contains word associations and sentiment intensity scores. To validate the effectiveness of sentiment strength detection in the proposed model, we evaluate the model using six real-world datasets. Furthermore, to evaluate the sentiment strength-specific lexicon, we compare it with seven existing lexicons in three evaluation tasks from the SemEval-2015 and SemEval-2016 competitions. Experimental results indicate that the proposed model can predict the sentiment strength of documents more effectively than the baseline methods, and that the SSS-Lex is of higher quality than the existing lexicons.
AB - Sentiment strength detection is an essential task in sentiment analysis, wherein the sentiment strength of subjective text is automatically determined. Sentiment analysis has numerous applications in different sectors, including business and social domains. In this study, we present a model to effectively extract the features and strength of sentiment from words and text using a context-dependent, lexicon-based convolutional neural network. To build this convolutional neural network, the model is trained using the sentiment polarity for each word from a co-occurrence pattern of words and labels. Then, a context-dependent lexicon is generated from the corpus, which is used to generate positive and negative sentiment word embeddings. Positive sentiment word embeddings, negative sentiment word embeddings, and the pre-trained word embeddings are input to a 3-channel convolutional neural network (CNN) to predict the strength of the sentiments. Moreover, with the trained convolutional neural network model, we can obtain a learned sentiment strength-specific word embedding, which generates a sentiment strength-specific lexicon (SSS-Lex) that contains word associations and sentiment intensity scores. To validate the effectiveness of sentiment strength detection in the proposed model, we evaluate the model using six real-world datasets. Furthermore, to evaluate the sentiment strength-specific lexicon, we compare it with seven existing lexicons in three evaluation tasks from the SemEval-2015 and SemEval-2016 competitions. Experimental results indicate that the proposed model can predict the sentiment strength of documents more effectively than the baseline methods, and that the SSS-Lex is of higher quality than the existing lexicons.
KW - Convolutional neural network
KW - Sentiment analysis
KW - Sentiment strength detection
KW - Sentiment strength-specific lexicon
KW - Text mining
UR - http://www.scopus.com/inward/record.url?scp=85079536399&partnerID=8YFLogxK
U2 - 10.1016/j.ins.2020.02.026
DO - 10.1016/j.ins.2020.02.026
M3 - Article
AN - SCOPUS:85079536399
SN - 0020-0255
VL - 520
SP - 389
EP - 399
JO - Information Sciences
JF - Information Sciences
ER -