Efficient Storage of Big-Data for Real-Time GPS Applications

GPS applications need real-time responsiveness and are location-sensitive. GPS data is time-variant, dynamic and large. Current methods of centralized or distributed storage with static data impose constraints on addressing the real-time requirement of such applications. In this project we explore the need for real-timeliness of location based applications and evolve a methodology of storage mechanism for the GPS application's data. So far, the data is distributed based on zones and it also has limited redundancy leading to non-availability in case of failures. In our approach, data is partitioned into cells giving priority to Geo-spatial location. The geography of an area like a district, state, country or for that matter the whole world is divided into data cells. The size of the data cells is decided based on the previously observed location specific queries on the area. The cell size is so selected that a majority of the queries are addressed within the cell itself. This enables computation to happen closer to data location. As a result, data communication overheads are eliminated. We also build some data redundancy, which is used not only to enable failover mechanisms but also to target performance. This is done by nine-cell approach wherein each cell stores data of eight of its neighbours along with its own data. Cells that have an overload of queries, can easily pass-off some of their workload to their near neighbours and ensure timeliness in response. Further, effective load balancing of data ensures better utilization of resources. Experimental results show that our approach improves query response times, yields better throughput and reduces average query waiting time apart from enabling real-time updates on data.

[1]  Fabián E. Bustamante,et al.  Distributed or Centralized Traffic Advisory Systems-The Application's Take , 2009, 2009 6th Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks.

[2]  Minglu Li,et al.  HERO: Online Real-Time Vehicle Tracking in Shanghai , 2008, IEEE INFOCOM 2008 - The 27th Conference on Computer Communications.

[3]  Shashi Shekhar,et al.  Spatial big-data challenges intersecting mobility and cloud computing , 2012, MobiDE '12.

[4]  Haiyan Wu Research on the Data Storage and Access Model in Distributed Computing Environment , 2008, 2008 Third International Conference on Convergence and Hybrid Information Technology.

[5]  Mahbub Hassan,et al.  How much of dsrc is available for non-safety use? , 2008, VANET '08.

[6]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[7]  Yang Zhang,et al.  CarTel: a distributed mobile sensor computing system , 2006, SenSys '06.

[8]  Raja Sengupta,et al.  Empirical determination of channel characteristics for DSRC vehicle-to-vehicle communication , 2004, VANET '04.

[9]  T. Logenthiran,et al.  Intelligent management of distributed storage elements in a smart grid , 2011, 2011 IEEE Ninth International Conference on Power Electronics and Drive Systems.

[10]  Alexandre M. Bayen,et al.  Virtual trip lines for distributed privacy-preserving traffic monitoring , 2008, MobiSys '08.