In recent years, the classification of the social networks' data has witnessed an increasing interest. It aims at extracting opinions, emotions and attitudes from social networks' data such as Facebook… Click to show full abstract
In recent years, the classification of the social networks' data has witnessed an increasing interest. It aims at extracting opinions, emotions and attitudes from social networks' data such as Facebook comments or tweets. This new scientific research area is called sentiment analysis. (It is sometimes called opinion mining.) In this article, we propose a new method to classify tweets into three classes: positive, negative or neutral. The proposed method is a new hybrid approach based on the fuzzy logic with its three important steps (fuzzification, Rule Inference/aggregation and defuzzification) and the concepts of information retrieval system (IRS) by calculating the semantic similarity between a tweet to classify and two opinion documents (one for the positive opinion words and another one for the negative opinion words) using the WordNet dictionary. To remedy the calculation time’s problem—if we have a huge dataset of tweets—we decide to parallelize our work using the Hadoop framework with its distributed file system (HDFS) and the MapReduce programming model. The experimental results show that our approach outperforms some other methods from the literature as well as by using the fuzzy logic, we improve the results of the classification.
               
Click one of the above tabs to view related content.