The Information Funnel: Exploiting Named Data for Information-Maximizing Data Collection

This paper describes the exploitation of hierarchical data names to achieve information-utility maximizing data collection in social sensing applications. We describe a novel transport abstraction, called the information funnel. It encapsulates a data collection protocol for social sensing that maximizes a measure of delivered information utility, that is the minimized data redundancy, by diversifying the data objects to be collected. The abstraction leverages named-data networking, a communication paradigm where data objects are named instead of hosts. We argue that this paradigm is especially suited for utility-maximizing transport in resource constrained environments, because hierarchical data names give rise to a notion of distance between named objects that is a function of only the topology of the name tree. This distance, in turn, can expose similarities between named objects that can be leveraged for minimizing redundancy among objects transmitted over bottlenecks, thereby maximizing their aggregate utility. With a proper hierarchical name space design, our protocol prioritizes transmission of data objects over bottlenecks to maximize information utility, with very weak assumptions on the utility function. This prioritization is achieved merely by comparing data name prefixes, without knowing application-level name semantics, which makes it generalizable across a wide range of applications. Evaluation results show that the information funnel improves the utility of the collected data objects compared to other lossy protocols.

[1]  Marco Conti,et al.  Design and performance evaluation of ContentPlace, a social-aware data dissemination system for opportunistic networks , 2010, Comput. Networks.

[2]  Martin Vetterli,et al.  Network correlated data gathering with explicit communication: NP-completeness and algorithms , 2006 .

[3]  Martin Vetterli,et al.  Network correlated data gathering with explicit communication: NP-completeness and algorithms , 2006, IEEE/ACM Transactions on Networking.

[4]  Xing Xie,et al.  T-drive: driving directions based on taxi trajectories , 2010, GIS '10.

[5]  Yan Gao,et al.  Optimizing File Retrieval in Delay-Tolerant Content Distribution Community , 2009, 2009 29th IEEE International Conference on Distributed Computing Systems.

[6]  Marco Conti,et al.  ContentPlace: social-aware data dissemination in opportunistic networks , 2008, MSWiM '08.

[7]  Mani B. Srivastava,et al.  Context-aware sensor data dissemination for mobile users in remote areas , 2012, 2012 Proceedings IEEE INFOCOM.

[8]  Baochun Li,et al.  A Distributed Framework for Correlated Data Gathering in Sensor Networks , 2008, IEEE Transactions on Vehicular Technology.

[9]  Lixia Zhang,et al.  DMND: Collecting data from mobiles using Named Data , 2010, 2010 IEEE Vehicular Networking Conference.

[10]  Van Jacobson,et al.  Networking named content , 2009, CoNEXT '09.

[11]  Ramesh Govindan,et al.  The impact of spatial correlation on routing with compression in wireless sensor networks , 2008, TOSN.

[12]  Vishnu Navda,et al.  Efficient gathering of correlated data in sensor networks , 2008, TOSN.

[13]  Guohong Cao,et al.  User-centric data dissemination in disruption tolerant networks , 2011, 2011 Proceedings IEEE INFOCOM.

[14]  Ji Luo,et al.  Delay Tolerant Event Collection in Sensor Networks with Mobile Sink , 2010, 2010 Proceedings IEEE INFOCOM.

[15]  Moustafa Ghanem,et al.  Distributed Clustering-Based Aggregation Algorithm for Spatial Correlated Sensor Networks , 2011, IEEE Sensors Journal.

[16]  Özgür B. Akan,et al.  Spatio-temporal correlation: theory and applications for wireless sensor networks , 2004, Comput. Networks.

[17]  Swarun Kumar,et al.  CarSpeak: a content-centric network for autonomous driving , 2012, SIGCOMM '12.

[18]  Gunnar Karlsson,et al.  Wireless Ad Hoc Podcasting , 2007, 2007 4th Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks.

[19]  Ramesh Govindan,et al.  MediaScope: Selective on-demand media retrieval from mobile devices , 2013, 2013 ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN).

[20]  Guohong Cao,et al.  Supporting cooperative caching in ad hoc networks , 2004, IEEE Transactions on Mobile Computing.

[21]  Ramesh Govindan,et al.  Information-maximizing caching in Ad Hoc networks with named data networking , 2013, 2013 IEEE 2nd Network Science Workshop (NSW).

[22]  Alexander Afanasyev,et al.  Let's ChronoSync: Decentralized dataset state synchronization in Named Data Networking , 2013, 2013 21st IEEE International Conference on Network Protocols (ICNP).

[23]  Jörg Ott,et al.  Redundancy and distributed caching in mobile DTNs , 2007, MobiArch '07.

[24]  Srinivasan Seshan,et al.  CARE: content aware redundancy elimination for challenged networks , 2012, HotNets-XI.

[25]  Md. Yusuf Sarwar Uddin,et al.  MINERVA: Information-Centric Programming for Social Sensing , 2013, 2013 22nd International Conference on Computer Communication and Networks (ICCCN).

[26]  Jie Wu,et al.  MOPS: Providing Content-Based Service in Disruption-Tolerant Networks , 2009, 2009 29th IEEE International Conference on Distributed Computing Systems.

[27]  Md. Yusuf Sarwar Uddin,et al.  PhotoNet: A Similarity-Aware Picture Delivery Service for Situation Awareness , 2011, 2011 IEEE 32nd Real-Time Systems Symposium.

[28]  Jian Pei,et al.  An Energy-Efficient Data Collection Framework for Wireless Sensor Networks by Exploiting Spatiotemporal Correlation , 2007, IEEE Transactions on Parallel and Distributed Systems.

[29]  Pan Hui,et al.  A socio-aware overlay for publish/subscribe communication in delay tolerant networks , 2007, MSWiM '07.

[30]  Donald F. Towsley,et al.  On optimal communication cost for gathering correlated data through wireless sensor networks , 2006, MobiCom '06.

[31]  Shaojie Tang,et al.  COUPON: Cooperatively Building Sensing Maps in Mobile Opportunistic Networks , 2013, 2013 IEEE 10th International Conference on Mobile Ad-Hoc and Sensor Systems.