Processing of crowd-sourced data from an internet of floating things

Sensors incorporated into mobile devices provide unique opportunities to capture detailed environmental information that cannot be readily collected in other ways. We show here how data from networked navigational sensors on leisure vessels can be used to construct unique new datasets, using the example of underwater topography (bathymetry) to demonstrate the approach. Specifically, we describe an end-to-end workflow that involves the collection of large numbers of timestamped (position, depth) measurements from "internet of floating things" devices on leisure vessels; the communication of data to cloud resources, via a specialized protocol capable of dealing with delayed, intermittent, or even disconnected networks; the integration of measurement data into cloud storage; the efficient correction and interpolation of measurements on a cloud computing platform; and the creation of a continuously updated bathymetric database. Our prototype implementation of this workflow leverages the FACE-IT Galaxy workflow engine to integrate network communication and database components with a CUDA-enabled algorithm running in a virtualized cloud environment.

[1]  Carlos Reaño,et al.  Reducing the performance gap of remote GPU virtualization with InfiniBand Connect-IB , 2016, 2016 IEEE Symposium on Computers and Communication (ISCC).

[2]  Cheol-Ho Hong,et al.  On the Virtualization of CUDA Based GPU Remoting on ARM and X86 Machines in the GVirtuS Framework , 2017, International Journal of Parallel Programming.

[3]  Ladislav Huraj,et al.  Design and performance evaluation of snow cover computing on GPUs , 2010 .

[4]  Francisco J. Jiménez-Hornero,et al.  Using general-purpose computing on graphics processing units (GPGPU) to accelerate the ordinary kriging algorithm , 2014, Comput. Geosci..

[5]  Ian T. Paulsen,et al.  The Common Oceanographer: Crowdsourcing the Collection of Oceanographic Data , 2014, PLoS biology.

[6]  Manish Parashar,et al.  Architecting the cyberinfrastructure for National Science Foundation Ocean Observatories Initiative (OOI) , 2016 .

[7]  Christophe Maisondieu,et al.  Wind-induced drift of objects at sea: The leeway field method , 2011 .

[8]  Seungmin Rho,et al.  Geo-Conquesting Based on Graph Analysis for Crowdsourced Metatrails from Mobile Sensing , 2017, IEEE Communications Magazine.

[9]  Salvatore Cuomo,et al.  Piecewise Hermite interpolation via barycentric coordinates , 2015 .

[10]  Antonio Pescapè,et al.  On the Integration of Cloud Computing and Internet of Things , 2014, 2014 International Conference on Future Internet of Things and Cloud.

[11]  Antonio Novellino,et al.  The European Marine Data and Observation Network (EMODnet): Your Gateway to European Marine and Coastal Data , 2016 .

[12]  Gang Mei,et al.  Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation , 2016, SpringerPlus.

[13]  Raffaele Montella,et al.  The High Performance Internet of Things: Using GVirtuS to Share High-End GPUs with ARM Based Cluster Computing Nodes , 2013, PPAM.

[14]  Hui Zang,et al.  Anonymization of location data does not work: a large-scale measurement study , 2011, MobiCom.

[15]  Raffaele Montella,et al.  Virtualizing General Purpose GPUs for High Performance Cloud Computing: An Application to a Fluid Simulator , 2012, 2012 IEEE 10th International Symposium on Parallel and Distributed Processing with Applications.

[16]  Raffaele Montella,et al.  A globus toolkit 4 based instrument service for environmental data acquisition and distribution , 2008, UPGRADE '08.

[17]  Vikram K. Narayana,et al.  GPU Resource Sharing and Virtualization on High Performance Computing Systems , 2011, 2011 International Conference on Parallel Processing.

[18]  Carlos Reaño,et al.  A Performance Comparison of CUDA Remote GPU Virtualization Frameworks , 2015, 2015 IEEE International Conference on Cluster Computing.

[19]  Christian Heipke,et al.  Crowdsourcing geospatial data , 2010 .

[20]  Ian T. Foster,et al.  WaComM: A Parallel Water Quality Community Model for Pollutant Transport and Dispersion Operational Predictions , 2016, 2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS).

[21]  Reano Carlos,et al.  Reducing the performance gap of remote GPU virtualization with InfiniBand Connect-IB , 2016 .

[22]  Kevin R. Fall,et al.  A delay-tolerant network architecture for challenged internets , 2003, SIGCOMM '03.

[23]  Katia Obraczka,et al.  A survey on congestion control for delay and disruption tolerant networks , 2015, Ad Hoc Networks.

[24]  Charles E. Catlett,et al.  Array of things: a scientific research instrument in the public way: platform design and early lessons learned , 2017, SCOPE@CPSWeek.

[25]  D. Shepard A two-dimensional interpolation function for irregularly-spaced data , 1968, ACM National Conference.

[26]  A. Nekrutenko,et al.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences , 2010, Genome Biology.

[27]  Raffaele Montella,et al.  SIaaS - Sensing Instrument as a Service Using Cloud Computing to Turn Physical Instrument into Ubiquitous Service , 2012, 2012 IEEE 10th International Symposium on Parallel and Distributed Processing with Applications.

[28]  Mark Gahegan,et al.  A Predictive Model for Frequently Viewed Tiles in a Web Map , 2010, Trans. GIS.

[29]  Giulio Giunta,et al.  Virtualizing CUDA Enabled GPGPUs on ARM Clusters , 2015, PPAM.

[30]  B. Calder Automatic Statistical Processing of Multibeam Echosounder Data , 2003 .

[31]  Xiaoping Ma,et al.  Performance evaluation of MQTT and CoAP via a common middleware , 2014, 2014 IEEE Ninth International Conference on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP).

[32]  Salvatore Cuomo,et al.  A class of piecewise interpolating functions based on barycentric coordinates , 2014 .

[33]  Ian T. Foster,et al.  SOLE: Linking Research Papers with Science Objects , 2012, IPAW.

[34]  Buddhadeb Halder,et al.  Evolution of crowdsourcing: potential data protection, privacy and security concerns under the new media age , 2014 .

[35]  Giulio Giunta,et al.  Using grid computing based components in on demand environmental data delivery , 2007, UPGRADE '07.

[36]  Salil S. Kanhere,et al.  Participatory Sensing: Crowdsourcing Data from Mobile Smartphones in Urban Spaces , 2011, 2011 IEEE 12th International Conference on Mobile Data Management.

[37]  Xuan Shi,et al.  Kriging interpolation over heterogeneous computer architectures and systems , 2013 .

[38]  Matthew Lease,et al.  On Quality Control and Machine Learning in Crowdsourcing , 2011, Human Computation.

[39]  Giulio Giunta,et al.  Virtualizing high-end GPGPUs on ARM clusters for the next generation of high performance cloud computing , 2013, Cluster Computing.

[40]  Peter M. Chen,et al.  Execution replay of multiprocessor virtual machines , 2008, VEE '08.

[41]  Alex Rodriguez,et al.  Experiences building Globus Genomics: a next‐generation sequencing analysis service using Galaxy, Globus, and Amazon Web Services , 2014, Concurr. Comput. Pract. Exp..

[42]  François Armand,et al.  Shared device driver model for virtualized mobile handsets , 2008, MobiVirt '08.

[43]  Salvatore Cuomo,et al.  A parallel PDE-based numerical algorithm for computing the Optical Flow in hybrid systems , 2017, J. Comput. Sci..

[44]  R. Webster,et al.  Kriging: a method of interpolation for geographical information systems , 1990, Int. J. Geogr. Inf. Sci..

[45]  Ilaria Iermano,et al.  Integration of numerical modeling and observations for the Gulf of Naples monitoring network , 2012 .

[46]  Andrew Danner,et al.  Hybrid MPI/GPU interpolation for grid DEM construction , 2012, SIGSPATIAL/GIS.

[47]  Guido Benassai,et al.  Implementation of WWIII wave model for the study of risk inundation on the coastlines of Campania, Italy , 2006 .

[48]  Raimon Tolosana-Delgado,et al.  Interpolation algorithm ranking using cross-validation and the role of smoothing effect. A coal zone example , 2010, Comput. Geosci..

[49]  Ian T. Foster,et al.  Globus Online: Accelerating and Democratizing Science through Cloud-Based Services , 2011, IEEE Internet Computing.

[50]  E. Lynn Usery,et al.  Implications of Web Mercator and Its Use in Online Mapping , 2014, Cartogr. Int. J. Geogr. Inf. Geovisualization.

[51]  Sergio Iserte,et al.  Remote GPU Virtualization: Is It Useful? , 2016, 2016 2nd IEEE International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era (HiPINEB).

[52]  Salvatore Cuomo,et al.  A Novel Triangle-based Method for Scattered Data Interpolation , 2014 .

[53]  Ian T. Foster,et al.  Numerical and Implementation Issues in Food Quality Modeling for Human Diseases Prevention , 2017, HEALTHINF.