Global Analytics on Tweets Sentiment

(GATS)

Under the Guidance of

Dr. Sunanda Gupta

Prateek Sharma

2011ECS19

Rajesh Kumar Pathak

2011ECS42

Sunny Kumar

2011ECS43

Introduction- GATS

  • Acquiring Data
  • Storage of Data
  • Cleaning Data
  • Sentiment Analysis
  • Visualization
https://github.com/sunnykrGupta/Glob_Analytics

First Phase - Acquiring Data

https://github.com/sunnykrGupta/Glob_Analytics

Fig. Tweet Data from Twitter

Second Phase - Understanding and Cleaning tweets

https://github.com/sunnykrGupta/Glob_Analytics

Fig. Tweet After Filter

https://github.com/sunnykrGupta/Glob_Analytics

Third Phase - Acquiring Geo-location Details

https://github.com/sunnykrGupta/Glob_Analytics

Fig. Tweet with Location Details

https://github.com/sunnykrGupta/Glob_Analytics

Fourth Phase - Prepare Data for processing

https://github.com/sunnykrGupta/Glob_Analytics

Fifth Phase - Sentiment Analysis of Tweet

  • Each tweet is given a polarity by classification engine ranging values from ­[-1.0 to 1.0]. 
  • We have given a positive label to those tweets whose polarity value is ​greater than or equal to [0.2], ​ negative label to those tweets whose value is ​equal to or below ­[-0.10] ​, while values between [-0.10 : 0.2] ​ is given neutral label. 
  • These are tuning parameter. It can be adjusted accordingly but will produce different results and Visualization will be changed accordingly. 
  • On the basis of results obtained we gave scoring to each country.

 
           

https://github.com/sunnykrGupta/Glob_Analytics

Sixth Phase - Visualization (Analytics)

- Plotting each tweet

- Density Plotting of Tweets

- Choropleth Concept

- Graphical Study

https://github.com/sunnykrGupta/Glob_Analytics

Tools and Technologies

  1.  Twitter Streaming API as a source of data.
  2.  Python for manipulating the data. 
  3.  MongoDB as the database.
  4.  TextBlob library for processing textual data. It provides a simple API for diving into common natural language processing (NLP) tasks.
  5. for visualizing the data as dots on map, choropleth and through graphs.
https://github.com/sunnykrGupta/Glob_Analytics

In this exploratory project, we have conducted some experiments on the Twitter data and  presented results for sentiment analysis on Twitter. Sentiment analysis is a very wide branch for  research. We plan ahead to improve our technique used for determining the sentiment value. 
Sarcastic comments are the ones which are very difficult to identify. Tweets containing sarcastic comments give exactly opposite results owing to the mindset of the author. These are almost impossible to track. So it’s important to relate the interpretation with the context of the tweets. Also the use of native language combined with English usage is difficult to interpret. 
 
Time efficiency is an important aspect where our project lags. Resolving tweets is most time consuming process because of api request limits. Exceeding certain parameter leads to blacklisted by geocoder service providers. We need a real time location resolving system to develop analytics more faster. 

Conclusion

https://github.com/sunnykrGupta/Glob_Analytics

References

https://github.com/sunnykrGupta/Glob_Analytics

Global Analytics on Tweets Sentiments

By Sunny Gupta

Global Analytics on Tweets Sentiments

Global Analytics on Tweets Sentiments

  • 1,795