Connecting Social Media Data with Observed Hybrid Data for Environment Monitoring

Environmental monitoring has been regarded as one of effective solutions to protect our living places from potential risks. Traditional methods rely on periodically recording assessments of observed objects, which results in large amount of hybrid data sets. Additionally public opinions regarding certain topics can be extracted from social media and used as another source of descriptive data. In this work, we investigate how to connect and process the public opinions from social media with hybrid observation records. Particularly, we study Twitter posts from designated region with respect to specific topics, such as marine environmental activities. Sentiment analysis on tweets is performed to reflect public opinions on the environmental topics. Additionally two hybrid data sets have been considered. To process these data we use Hadoop cluster and utilize NoSql and relational databases to store data distributed across nodes in share nothing architecture. We compare the public sentiments in social media with scientific observations in real time and show that the “citizen science” enhanced with real time analytics can provide avenue to nominatively monitor natural environments. The approach presented in this paper provides an innovative method to monitor environment with the power of social media analysis and distributed computing.

[1]  Fabrício Benevenuto,et al.  A Benchmark Comparison of State-of-the-Practice Sentiment Analysis Methods , 2015, ArXiv.

[2]  Jürgen Broß,et al.  Aspect-Oriented Sentiment Analysis of Customer Reviews Using Distant Supervision Techniques , 2013 .

[3]  Giuseppe M. L. Sarnè,et al.  Combining trust and skills evaluation to form e-Learning classes in online social networks , 2017, Inf. Sci..

[4]  Jaroslav Pokorný,et al.  Opportunities in Big Data Management and Processing , 2014, DB&IS.

[5]  Eric Gilbert,et al.  VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text , 2014, ICWSM.

[6]  Xiaojun Li,et al.  A sentiment analysis model for hotel reviews based on supervised learning , 2011, 2011 International Conference on Machine Learning and Cybernetics.

[7]  William Claster,et al.  Naïve Bayes and unsupervised artificial neural nets for Cancun tourism social media data analysis , 2010, 2010 Second World Congress on Nature and Biologically Inspired Computing (NaBIC).

[8]  William B. Claster,et al.  Thailand -- Tourism and Conflict: Modeling Sentiment from Twitter Tweets Using Naïve Bayes and Unsupervised Artificial Neural Nets , 2010, 2010 Second International Conference on Computational Intelligence, Modelling and Simulation.

[9]  Zoran Milosevic,et al.  Influence of Parallelism Property of Streaming Engines on Their Performance , 2016, ADBIS.

[10]  Kjetil Nørvåg,et al.  A study of opinion mining and visualization of hotel reviews , 2012, IIWAS '12.