Effect of Using Regression in Sentiment Analysis

Onal I., Ertugrul A. M.

22nd IEEE Signal Processing and Communications Applications Conference (SIU), Trabzon, Turkey, 23 - 25 April 2014, pp.1822-1825 identifier

  • Publication Type: Conference Paper / Full Text
  • City: Trabzon
  • Country: Turkey
  • Page Numbers: pp.1822-1825
  • Keywords: Twitter, sentiment analysis, regression, confidence scores
  • Middle East Technical University Affiliated: Yes


In this study, the effect of using regression on sentiment classification of Twitter data was analyzed. In other words, whether the strength of sentiment better discriminates the classes or not. Since our dataset includes class confidence scores rather than discrete class labels, regression analysis was employed on each class separately. Then, each tweet was assigned the class whose estimated confidence score is maximum among others after regression. The feature set used includes unigrams, POS tags, emoticons, sentiments of words and POS tags of sentiments. The results of experiments indicate that using classification on discrete class labels perform much better than using regression on continuous confidence scores.