Language Independent Sentiment Analysis of theShukran Social Network Using Apache Spark

01/25/2021
by   Walid Iguider, et al.
0

This paper describes theShukran Sentiment Analysis system. theShukran is a social network micro-blogging service that allows users posting photos or videos and descriptions of their daily life activities. This social network rapidly gained a large amount of users. It provides people from different cultures and countries the possibility to share in different languages their stories, ideas, opinions, and news from their real life, and makes the cultural diversity the center of relationships between its users. Sentiment analysis aims to extract the opinion of the public about some topic by processing text data. One of its several tasks, the polarity detection, aims at categorizing the elements in a dataset (sentences, posts, etc.) into classes such as positive, negative and neutral. In the system we propose, and that represents the sentiment analysis core engine of theShukran social network, we will detect the original language of users posts, translate them into English and evaluate their sentiment (whether positive or negative). We propose the use of a Naive Bayes classifier and SentiWordNet and SenticNet for the sentiment evaluation. The language detection and translation are performed using TextBlob, a Python library for processing textual data.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset