Using cloud computing to process intensive floating car data for urban traffic surveillance

The advances in data collection techniques and geosimulation models have contributed to the abundance of geospatial data in urban systems. The surveillance of urban traffic systems relies on the effective handling of near real-time traffic observation data, which is usually data-intensive in nature. We investigated the processing of massive floating car data (FCD) for traffic surveillance in cloud-computing environments, with the goal of exploring the use of emerging cloud-computing technologies to solve data-intensive geospatial problems in urban traffic systems. The experimental results indicated that cloud-computing technologies such as Bigtable and MapReduce can provide substantial utility for data-intensive geospatial computing, as both scalability and near real-time computational performance can be adequately enhanced through our proposed data storage, management, and parallel processing models. The applicability and utility of cloud computing was evaluated for three typical geospatial computing tasks for urban traffic monitoring, namely, FCD query, FCD map matching, and speed computation for road links.

[1]  Shaowen Wang A CyberGIS Framework for the Synthesis of Cyberinfrastructure, GIS, and Spatial Analysis , 2010 .

[2]  David A. Bader,et al.  An Experimental Comparison of Parallel Algorithms for Ear Decomposition of Graphs using Two Leading Paradigms , 2000 .

[3]  Geoffrey C. Fox,et al.  MapReduce for Data Intensive Scientific Analyses , 2008, 2008 IEEE Fourth International Conference on eScience.

[4]  Shaowen Wang,et al.  Grid computing of spatial statistics: using the TeraGrid for G  i* (d) analysis , 2008, Concurr. Comput. Pract. Exp..

[5]  Shaowen Wang,et al.  A quadtree approach to domain decomposition for spatial interpolation in Grid computing environments , 2003, Parallel Comput..

[6]  Daniel Atkins,et al.  Revolutionizing Science and Engineering Through Cyberinfrastructure: Report of the National Science Foundation Blue-Ribbon Advisory Panel on Cyberinfrastructure , 2003 .

[7]  J. Greenfeld MATCHING GPS OBSERVATIONS TO LOCATIONS ON A DIGITAL MAP , 2002 .

[8]  Wilson C. Hsieh,et al.  Bigtable: A Distributed Storage System for Structured Data , 2006, TOCS.

[9]  Copyright © Intel Corporation 2008 * Other names and brands may be claimed as the property of others , 2004 .

[10]  Won Kim,et al.  Cloud Computing: Today and Tomorrow , 2009, J. Object Technol..

[11]  Ian T. Foster,et al.  The anatomy of the grid: enabling scalable virtual organizations , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[12]  Xing Xie,et al.  An Interactive-Voting Based Map Matching Algorithm , 2010, 2010 Eleventh International Conference on Mobile Data Management.

[13]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[14]  Qunying Huang,et al.  Using spatial principles to optimize distributed computing for enabling the physical science discoveries , 2011, Proceedings of the National Academy of Sciences.

[15]  Zhipeng Li,et al.  An Approach to Urban Traffic State Estimation by Fusing Multisource Information , 2009, IEEE Transactions on Intelligent Transportation Systems.

[16]  Alexander S. Szalay,et al.  Data-Intensive Computing in the 21st Century , 2008, Computer.

[17]  Robert B. Noland,et al.  Current map-matching algorithms for transport applications: State-of-the art and future research directions , 2007 .

[18]  Sanjay Ghemawat,et al.  MapReduce: a flexible data processing tool , 2010, CACM.

[19]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[20]  Chandra Krintz,et al.  Evaluating the Performance Impact of Xen on MPI and Process Execution For HPC Systems , 2006, First International Workshop on Virtualization Technology in Distributed Computing (VTDC 2006).

[21]  Yanbing Liu,et al.  Real-time urban traffic monitoring with global positioning system-equipped vehicles , 2010 .

[22]  Shaowen Wang,et al.  Grid computing of spatial statistics: using the TeraGrid for G i * (d) analysis , 2008 .

[23]  Geoffrey C. Fox,et al.  IEEE TRANSACTIONS ON JOURNAL NAME, MANUSCRIPT ID 1 Cloud Technologies for Bioinformatics Applications , 2022 .

[24]  Ming-Hsiang Tsou,et al.  Please Scroll down for Article International Journal of Geographical Information Science Developing a Grid-enabled Spatial Web Portal for Internet Giservices and Geospatial Cyberinfrastructure Developing a Grid-enabled Spatial Web Portal for Internet Giservices and Geospatial Cyberinfrastructure , 2022 .

[25]  C. Yang,et al.  Introduction to distributed geographic information processing research , 2009, Int. J. Geogr. Inf. Sci..

[26]  Bin Zhou,et al.  Distributed geospatial information processing: sharing distributed geospatial resources to support Digital Earth , 2008, Int. J. Digit. Earth.

[27]  Peter Scull,et al.  A Top-Down Approach to the State Factor Paradigm for Use in Macroscale Soil Analysis , 2010 .

[28]  Mark Gahegan,et al.  Geospatial Cyberinfrastructure: Past, present and future , 2010, Comput. Environ. Urban Syst..

[29]  Chengyang Zhang,et al.  Map-matching for low-sampling-rate GPS trajectories , 2009, GIS.

[30]  Edward Walker,et al.  Benchmarking Amazon EC2 for High-Performance Scientific Computing , 2008, login Usenix Mag..

[31]  Jon Louis Bentley,et al.  Quad trees a data structure for retrieval on composite keys , 1974, Acta Informatica.

[32]  Sanjay Ghemawat,et al.  MapReduce: simplified data processing on large clusters , 2008, CACM.