Classifying political tweets using naïve bayes and support vector machines

Ahmed Al Hamoud, Ali Alwehaibi, Kaushik Roy, Marwan Bikdash

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Twitter, which is one of the most popular microblogging platforms and contains a huge amount of meaningful information, can be used in opinion mining and sentiment analysis. Twitter data contains text communication of more than 330 million active users monthly. This research effort applies the machine learning techniques to determine whether the contents of tweets are political or apolitical. Preprocessing involves cleaning-up the texts to obtain meaningful information and accurate opinions. Bag-of-Words (BOW), Term Frequency (TF) and Term Frequency-Inverse Document Frequency (TF-IDF) were used to extract the features from twitter data. We then used Chi-Square technique to select the salient features from a high dimensional feature set. Finally, Support Vector Machines (SVMs) and Naive Bayes (NB) were applied to classify the twitter data. The results suggest that SVMs with BOW provide the highest accuracy and F-measure.

Original languageEnglish
Title of host publicationRecent Trends and Future Technology in Applied Intelligence - 31st International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2018, Proceedings
EditorsOtmane Ait Mohamed, Malek Mouhoub, Samira Sadaoui, Moonis Ali
PublisherSpringer Verlag
Pages736-744
Number of pages9
ISBN (Print)9783319920573
DOIs
StatePublished - 2018
Externally publishedYes
Event31st International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems IEA/AIE 2018 - Montreal, Canada
Duration: Jun 25 2018Jun 28 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10868 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference31st International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems IEA/AIE 2018
Country/TerritoryCanada
CityMontreal
Period06/25/1806/28/18

Keywords

  • Feature selection
  • Natural language processing
  • Opinion mining
  • Sentiment analysis

Fingerprint

Dive into the research topics of 'Classifying political tweets using naïve bayes and support vector machines'. Together they form a unique fingerprint.

Cite this