Tracking semantic relationships for effective data management in home networks

The amount of data that home users generate, store, and peruse has grown significantly in the past few years. Increasingly, organizing this huge amount of data - in order to make it easy to browse, query and access - is becoming challenging. Many recent proposals have emphasized the importance of data management in home networks and proposed mechanisms for managing replicas across devices to increase availability. Essentially, they capture the relationship "is copy of" between files across devices. However, files can be semantically related. Users are often interested in finding data that has such semantic relationships; tracking these relationships helps users to effectively search based on content or human-understandable context, organize data and manage the limited storage while ensuring availability of information. However, inferring semantic relationships just based on user-defined tags and file names can be challenging, since users may not follow any standard or unique naming conventions. We argue that such semantic relationships should be derived on the basis of content itself, and propose to leverage recent developments in multimedia processing literature, with minimal user involvement. The decentralized, heterogeneous and dynamic operational environment of home networks present interesting systems and network challenges. In this paper, we have highlighted several candidate designs and system-optimizations that can help build an effective semantic-aware data management for home networks. As ongoing work, we are working on a prototype implementation of a decentralized data management system.

[1]  Catherine C. Marshall,et al.  Cimbiosys: a platform for content-based partial replication , 2009, NSDI 2009.

[2]  Jason Flinn,et al.  quFiles: The right file at the right time , 2010, TOS.

[3]  Lei Gao,et al.  PRACTI Replication , 2006, NSDI.

[4]  Ted Wobber,et al.  Fidelity-Aware Replication for Mobile Devices , 2010, IEEE Trans. Mob. Comput..

[5]  Lorrie Faith Cranor,et al.  Perspective: Semantic Data Management for the Home , 2009, FAST.

[6]  Petr Kuznetsov,et al.  PodBase: transparent storage management for personal devices , 2008, IPTPS.

[7]  David Salesin,et al.  Fast multiresolution image querying , 1995, SIGGRAPH.

[8]  Alec Wolman,et al.  MAUI: making smartphones last longer with code offload , 2010, MobiSys '10.

[9]  Peter L. Reiher,et al.  Roam: a scalable replication system for mobile computing , 1999, Proceedings. Tenth International Workshop on Database and Expert Systems Applications. DEXA 99.

[10]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[11]  Marvin Theimer,et al.  Flexible update propagation for weakly consistent replication , 1997, SOSP.

[12]  Byung-Gon Chun,et al.  Augmented Smartphone Applications Through Clone Cloud Execution , 2009, HotOS.

[13]  Mahadev Satyanarayanan,et al.  Tactics-based remote execution for mobile computing , 2003, MobiSys '03.

[14]  Catherine C. Marshall,et al.  A Platform for Content-based Partial Replication , 2009, NSDI.

[15]  Kai Li,et al.  Image similarity search with compact data structures , 2004, CIKM '04.

[16]  Kun Li,et al.  iScope: personalized multi-modality image search for mobile devices , 2009, MobiSys '09.

[17]  Paramvir Bahl,et al.  The Case for VM-Based Cloudlets in Mobile Computing , 2009, IEEE Pervasive Computing.

[18]  Avideh Zakhor,et al.  Estimation of Web video multiplicity , 1999, Electronic Imaging.

[19]  Antonio Torralba,et al.  Building the gist of a scene: the role of global image features in recognition. , 2006, Progress in brain research.

[20]  Brandon Salmon,et al.  Learning to Share: A Study of Sharing Among Home Storage Devices (CMU-PDL-07-107) , 2007 .

[21]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.