Confidence-aware truth estimation in social sensing applications

This paper presents a confidence-aware maximum likelihood estimation framework to solve the truth estimation problem in social sensing applications. Social sensing has emerged as a new paradigm of data collection, where a group of individuals volunteer (or are recruited) to share certain observations or measurements about the physical world. A key challenge in social sensing applications lies in ascertaining the correctness of reported observations from unvetted data sources with unknown reliability. We refer to this problem as truth estimation. The prior works have made significant efforts to solve this problem by developing various truth estimation algorithms. However, an important limitation exists: they assumed a data source makes all her/his observations with the same degree of confidence, which may not hold in many real-world social sensing applications. In this paper, we develop a new confidence-aware truth estimation scheme that removes this limitation by explicitly considering different degrees of confidence that sources express on the reported data. The new truth estimation scheme solves a maximum likelihood estimation problem to determine both the correctness of collected data and the reliability of data sources. We compare our confidence-aware scheme with the state-of-the-art baselines through both an extensive simulation study and three real world case studies based on Twitter. The evaluation shows that our new scheme outperforms all compared baselines and significantly improves the accuracy of the truth estimation results in social sensing applications.

[1]  Charu C. Aggarwal,et al.  On scalability and robustness limitations of real and asymptotic confidence bounds in social sensing , 2012, 2012 9th Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks (SECON).

[2]  Dong Wang,et al.  Social Sensing: Building Reliable Systems on Unreliable Data , 2015 .

[3]  Jiawei Han,et al.  Evaluating Event Credibility on Twitter , 2012, SDM.

[4]  Charu C. Aggarwal,et al.  Using humans as sensors: An estimation-theoretic perspective , 2014, IPSN-14 Proceedings of the 13th International Symposium on Information Processing in Sensor Networks.

[5]  Wen Hu,et al.  Efficient Computation of Robust Average of Compressive Sensing Data in Wireless Sensor Networks in the Presence of Sensor Faults , 2013, IEEE Transactions on Parallel and Distributed Systems.

[6]  Shivakant Mishra,et al.  CenWits: a sensor-based loosely coupled search and rescue system using witnesses , 2005, SenSys '05.

[7]  Gregory Dudek,et al.  Context Dependent Movie Recommendations Using a Hierarchical Bayesian Model , 2009, Canadian Conference on AI.

[8]  Suman Nath,et al.  ACE: Exploiting Correlation for Energy-Efficient and Continuous Context Sensing , 2012, IEEE Transactions on Mobile Computing.

[9]  Dan Roth,et al.  Generalized fact-finding , 2011, WWW.

[10]  Jiawei Han,et al.  Heterogeneous network-based trust analysis: a survey , 2011, SKDD.

[11]  Gerhard Friedrich,et al.  Recommender Systems - An Introduction , 2010 .

[12]  Haiying Shen,et al.  Leveraging Social Networks to Combat Collusion in Reputation Systems for Peer-to-Peer Networks , 2013, 2011 IEEE International Parallel & Distributed Processing Symposium.

[13]  Zhao Yuping A Novel Anti-Collision Protocol in Multiple Readers RFID Sensor Networks , 2008 .

[14]  Mirco Musolesi,et al.  Urban sensing systems: opportunistic or participatory? , 2008, HotMobile '08.

[15]  K. Yıldırım CLOCK SYNCHRONIZATION IN WIRELESS SENSOR NETWORKS , 2012 .

[16]  Yang Zhang,et al.  CarTel: a distributed mobile sensor computing system , 2006, SenSys '06.

[17]  Charu C. Aggarwal,et al.  On Quantifying the Accuracy of Maximum Likelihood Estimation of Participant Reliability in Social Sensing , 2011 .

[18]  Taylor Cassidy,et al.  The Wisdom of Minority: Unsupervised Slot Filling Validation based on Multi-dimensional Truth-Finding , 2014, COLING.

[19]  M. Hansen,et al.  Participatory Sensing , 2019, Internet of Things.

[20]  Wang Li-cun Design of DBA algorithm in EPON upstream channel in support of SLA , 2005 .

[21]  Philip S. Yu,et al.  Truth Discovery with Multiple Conflicting Information Providers on the Web , 2007, IEEE Transactions on Knowledge and Data Engineering.

[22]  Ming Zhou,et al.  Coooolll: A Deep Learning System for Twitter Sentiment Classification , 2014, *SEMEVAL.

[23]  Stephen P. Boyd,et al.  A scheme for robust distributed sensor fusion based on average consensus , 2005, IPSN 2005. Fourth International Symposium on Information Processing in Sensor Networks, 2005..

[24]  Hengchang Liu,et al.  Exploitation of Physical Constraints for Reliable Social Sensing , 2013, 2013 IEEE 34th Real-Time Systems Symposium.

[25]  Mirco Musolesi,et al.  Sensing meets mobile social networks: the design, implementation and evaluation of the CenceMe application , 2008, SenSys '08.

[26]  Emiliano Miluzzo,et al.  The BikeNet mobile sensing system for cyclist experience mapping , 2007, SenSys '07.

[27]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[28]  Charu C. Aggarwal,et al.  Recursive Fact-Finding: A Streaming Approach to Truth Estimation in Crowdsourcing Applications , 2013, 2013 IEEE 33rd International Conference on Distributed Computing Systems.

[29]  Charu C. Aggarwal,et al.  Optimizing quality-of-information in cost-sensitive sensor data fusion , 2011, 2011 International Conference on Distributed Computing in Sensor Systems and Workshops (DCOSS).

[30]  Audun Jøsang,et al.  A survey of trust and reputation systems for online service provision , 2007, Decis. Support Syst..

[31]  Xiaoxin Yin,et al.  Semi-supervised truth discovery , 2011, WWW.

[32]  Mani Srivastava,et al.  Human-centric sensing , 2012, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[33]  Vana Kalogeraki,et al.  Privacy preservation for participatory sensing data , 2013, 2013 IEEE International Conference on Pervasive Computing and Communications (PerCom).

[34]  Chong Wang,et al.  Collaborative topic modeling for recommending scientific articles , 2011, KDD.

[35]  Dong Wang,et al.  On quantifying the quality of information in social sensing , 2012 .

[36]  Minyue Fu,et al.  Target Tracking in Wireless Sensor Networks Based on the Combination of KF and MLE Using Distance Measurements , 2012, IEEE Transactions on Mobile Computing.

[37]  Dan Roth,et al.  Latent credibility analysis , 2013, WWW.

[38]  Deborah Estrin,et al.  Biketastic: sensing and mapping for better biking , 2010, CHI.

[39]  Dong Wang,et al.  Analytic Challenges in Social Sensing , 2014 .

[40]  Tarek F. Abdelzaher,et al.  Maximum likelihood analysis of conflicting observations in social sensing , 2014, TOSN.

[41]  Charu C. Aggarwal,et al.  On Credibility Estimation Tradeoffs in Assured Social Sensing , 2013, IEEE Journal on Selected Areas in Communications.

[42]  Dan Roth,et al.  Knowing What to Believe (when you already know something) , 2010, COLING.

[43]  Lance Kaplan,et al.  On truth discovery in social sensing: A maximum likelihood estimation approach , 2012, 2012 ACM/IEEE 11th International Conference on Information Processing in Sensor Networks (IPSN).

[44]  Lin Zhong,et al.  Human as Real-Time Sensors of Social and Physical Events: A Case Study of Twitter and Sports Games , 2011, ArXiv.

[45]  Wilfred Ng,et al.  Truth Discovery in Data Streams: A Single-Pass Probabilistic Approach , 2014, CIKM.

[46]  Yu Hen Hu,et al.  Maximum likelihood multiple-source localization using acoustic energy measurements with wireless sensor networks , 2005, IEEE Transactions on Signal Processing.

[47]  New York Dover,et al.  ON THE CONVERGENCE PROPERTIES OF THE EM ALGORITHM , 1983 .

[48]  Divesh Srivastava,et al.  Global detection of complex copying relationships between sources , 2010, Proc. VLDB Endow..

[49]  Cecilia Mascolo,et al.  SociableSense: exploring the trade-offs of adaptive sampling and computation offloading for social sensing , 2011, MobiCom.

[50]  Charu C. Aggarwal,et al.  Social Sensing , 2013, Managing and Mining Sensor Data.

[51]  Joel J. P. C. Rodrigues,et al.  Wireless Sensor Networks: a Survey on Environmental Monitoring , 2011, J. Commun..