Effect of Using Regression in Sentiment Analysis

Onal I., Ertugrul A. M.

22nd IEEE Signal Processing and Communications Applications Conference (SIU), Trabzon, Türkiye, 23 - 25 Nisan 2014, ss.1822-1825, (Tam Metin Bildiri)

Yayın Türü: Bildiri / Tam Metin Bildiri
Basıldığı Şehir: Trabzon
Basıldığı Ülke: Türkiye
Sayfa Sayıları: ss.1822-1825
Anahtar Kelimeler: Twitter, sentiment analysis, regression, confidence scores
Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

In this study, the effect of using regression on sentiment classification of Twitter data was analyzed. In other words, whether the strength of sentiment better discriminates the classes or not. Since our dataset includes class confidence scores rather than discrete class labels, regression analysis was employed on each class separately. Then, each tweet was assigned the class whose estimated confidence score is maximum among others after regression. The feature set used includes unigrams, POS tags, emoticons, sentiments of words and POS tags of sentiments. The results of experiments indicate that using classification on discrete class labels perform much better than using regression on continuous confidence scores.