Harnessing Multi-Source Data about Public Sentiments and Activities for Informed Design

The intelligence of Smart Cities (SC) is represented by its ability in collecting, managing, integrating, analyzing, and mining multi-source data for valuable insights. In order to harness multi-source data for an informed place design, this paper presents “Public Sentiments and Activities in Places” multi-source data analysis flow (PSAP) in an Informed Design Platform (IDP). In terms of key contributions, PSAP implements 1) an Interconnected Data Model (IDM) to manage multi-source data independently and integrally, 2) an efficient and effective data mining mechanism based on multi-dimension and multi-measure queries (MMQs), and 3) concurrent data processing cascades with Sentiments in Places Analysis Mechanism (SPAM) and Activities in Places Analysis Mechanism (APAM), to fuse social network data with other data on public sentiment and activity comprehensively. As proved by a holistic evaluation, both SPAM and APAM outperform compared methods. Specifically, SPAM improves its classification accuracy gradually and significantly from 72.37 to about 85 percent within nine crowd-calibration cycles, and APAM with an ensemble classifier achieves the highest precision of 92.13 percent, which is approximately 13 percent higher than the second best method. Finally, by applying MMQs on “Sentiment&Activity Linked Data”, various place design insights of our testbed are mined to improve its livability.

[1]  Ralf Tönjes,et al.  CityPulse: Large Scale Data Analytics Framework for Smart Cities , 2016, IEEE Access.

[2]  Yu Zheng,et al.  Methodologies for Cross-Domain Data Fusion: An Overview , 2015, IEEE Transactions on Big Data.

[3]  Rajiv Ranjan,et al.  Migrating Smart City Applications to the Cloud , 2016, IEEE Cloud Computing.

[4]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[5]  Jeremy Ellman,et al.  Simple Approaches of Sentiment Analysis via Ensemble Learning , 2015 .

[6]  Tianyi Ma,et al.  CITY FEED , 2016, ACM Trans. Intell. Syst. Technol..

[7]  Bin Cheng,et al.  Building a Big Data Platform for Smart Cities: Experience and Lessons from Santander , 2015, 2015 IEEE International Congress on Big Data.

[8]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[9]  Thomas Hofmann,et al.  Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification , 2017, WWW.

[10]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[11]  Alberto Abelló,et al.  Incremental Consolidation of Data-Intensive Multi-Flows , 2016, IEEE Transactions on Knowledge and Data Engineering.

[12]  Joaquim Melendez,et al.  Identifying services for short-term load forecasting using data driven models in a Smart City platform , 2017 .

[13]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[14]  Enrico Capobianco,et al.  Smart Cities, Big Data, and Communities: Reasoning From the Viewpoint of Attractors , 2016, IEEE Access.

[15]  Raphaël Troncy,et al.  A Replication Study of the Top Performing Systems in SemEval Twitter Sentiment Analysis , 2016, International Semantic Web Conference.

[16]  Alessandro Moschitti,et al.  Twitter Sentiment Analysis with Deep Convolutional Neural Networks , 2015, SIGIR.

[17]  Saif Mohammad,et al.  CROWDSOURCING A WORD–EMOTION ASSOCIATION LEXICON , 2013, Comput. Intell..

[18]  Ria Mae Borromeo,et al.  Automatic vs. Crowdsourced Sentiment Analysis , 2015, IDEAS.

[19]  V. Sharmila,et al.  Using Hashtags to Capture Fine Emotion Categories from Tweets , 2019 .

[20]  Brendan T. O'Connor,et al.  Improved Part-of-Speech Tagging for Online Conversational Text with Word Clusters , 2013, NAACL.

[21]  Prem Prakash Jayaraman,et al.  OpenIoT: Open Source Internet-of-Things in the Cloud , 2014, OpenIoT@SoftCOM.

[22]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.