Workflow-based automatic processing for Internet of Floating Things crowdsourced data

Abstract Data from sensors incorporated into mobile devices, such as networked navigational sensors, can be used to capture detailed environmental information. We describe here a workflow and framework for using sensors on boats to construct unique new datasets of underwater topography (bathymetry). Starting with a large number of measurements of position, depth, etc., obtained from such an Internet of Floating Things, we illustrate how, with a specialized protocol, data can be communicated to cloud resources, even when using delayed, intermittent, or disconnected networks. We then propose a method for automatic sensor calibration based on a novel reputation approach. Sampled depth data are interpolated efficiently on a cloud computing platform in order to provide a continuously updated bathymetric database. Our prototype implementation uses the FACE-IT Galaxy workflow engine to manage network communication and exploits the computational power of GPGPUs in a virtualized cloud environment, working with a CUDA-parallel algorithm, for efficient data processing. We report on an initial evaluation involving data from a sailing vessel in Italian coastal waters.

[1]  Francisco J. Jiménez-Hornero,et al.  Using general-purpose computing on graphics processing units (GPGPU) to accelerate the ordinary kriging algorithm , 2014, Comput. Geosci..

[2]  José Luis Hernández-Stefanoni,et al.  Comparing the performance of two spatial interpolation methods for creating a digital bathymetric model of the Yucatan submerged platform , 2007 .

[3]  Yonggang Wang,et al.  Research and implementation on spatial data storage and operation based on Hadoop platform , 2010, 2010 Second IITA International Conference on Geoscience and Remote Sensing.

[4]  Paul Van Dooren,et al.  Reputation Systems and Optimization , 2008 .

[5]  Raffaele Montella,et al.  Rip current evidence by hydrodynamic simulations, bathymetric surveys and UAV observation , 2017 .

[6]  Cheol-Ho Hong,et al.  On the Virtualization of CUDA Based GPU Remoting on ARM and X86 Machines in the GVirtuS Framework , 2017, International Journal of Parallel Programming.

[7]  Carlos Reaño,et al.  Reducing the performance gap of remote GPU virtualization with InfiniBand Connect-IB , 2016, 2016 IEEE Symposium on Computers and Communication (ISCC).

[8]  Ilaria Iermano,et al.  Integration of numerical modeling and observations for the Gulf of Naples monitoring network , 2012 .

[9]  Michael Edward Hohn,et al.  An Introduction to Applied Geostatistics: by Edward H. Isaaks and R. Mohan Srivastava, 1989, Oxford University Press, New York, 561 p., ISBN 0-19-505012-6, ISBN 0-19-505013-4 (paperback), $55.00 cloth, $35.00 paper (US) , 1991 .

[10]  Andrew Danner,et al.  Hybrid MPI/GPU interpolation for grid DEM construction , 2012, SIGSPATIAL/GIS.

[11]  Ladislav Huraj,et al.  Design and performance evaluation of snow cover computing on GPUs , 2010 .

[12]  Giulio Giunta,et al.  Enabling Android-Based Devices to High-End GPGPUs , 2016, ICA3PP.

[13]  Antonio Novellino,et al.  The European Marine Data and Observation Network (EMODnet): Your Gateway to European Marine and Coastal Data , 2016 .

[14]  Vikram K. Narayana,et al.  GPU Resource Sharing and Virtualization on High Performance Computing Systems , 2011, 2011 International Conference on Parallel Processing.

[15]  Matthew Lease,et al.  On Quality Control and Machine Learning in Crowdsourcing , 2011, Human Computation.

[16]  Gang Mei,et al.  Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation , 2016, SpringerPlus.

[17]  D. Shepard A two-dimensional interpolation function for irregularly-spaced data , 1968, ACM National Conference.

[18]  Giulio Giunta,et al.  Virtualizing high-end GPGPUs on ARM clusters for the next generation of high performance cloud computing , 2013, Cluster Computing.

[19]  A. Nekrutenko,et al.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences , 2010, Genome Biology.

[20]  Raffaele Montella,et al.  SIaaS - Sensing Instrument as a Service Using Cloud Computing to Turn Physical Instrument into Ubiquitous Service , 2012, 2012 IEEE 10th International Symposium on Parallel and Distributed Processing with Applications.

[21]  Ian T. Foster,et al.  SOLE: Linking Research Papers with Science Objects , 2012, IPAW.

[22]  Salvatore Cuomo,et al.  A Novel Triangle-based Method for Scattered Data Interpolation , 2014 .

[23]  Raffaele Montella,et al.  The High Performance Internet of Things: Using GVirtuS to Share High-End GPUs with ARM Based Cluster Computing Nodes , 2013, PPAM.

[24]  Ian T. Foster,et al.  Numerical and Implementation Issues in Food Quality Modeling for Human Diseases Prevention , 2017, HEALTHINF.

[25]  Xicheng Tan,et al.  OpenCL Implementation of a Parallel Universal Kriging Algorithm for Massive Spatial Data Interpolation on Heterogeneous Systems , 2016, ISPRS Int. J. Geo Inf..

[26]  Rashid Mehmood,et al.  Data Fusion and IoT for Smart Ubiquitous Environments: A Survey , 2017, IEEE Access.

[27]  Raffaele Montella,et al.  Virtualizing General Purpose GPUs for High Performance Cloud Computing: An Application to a Fluid Simulator , 2012, 2012 IEEE 10th International Symposium on Parallel and Distributed Processing with Applications.

[28]  Ian T. Foster,et al.  Klimatic: A Virtual Data Lake for Harvesting and Distribution of Geospatial Data , 2016, 2016 1st Joint International Workshop on Parallel Data Storage and data Intensive Scalable Computing Systems (PDSW-DISCS).

[29]  Sokol Kosta,et al.  A fast, secure, reliable, and resilient data transfer framework for pervasive IoT applications , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[30]  Katia Obraczka,et al.  A survey on congestion control for delay and disruption tolerant networks , 2015, Ad Hoc Networks.

[31]  Lanfang Dong,et al.  Research on CUDA-based Kriging Interpolation Algorithm , 2011 .

[32]  B. Calder Automatic Statistical Processing of Multibeam Echosounder Data , 2003 .

[33]  N. Cressie The origins of kriging , 1990 .

[34]  Peter M. Chen,et al.  Execution replay of multiprocessor virtual machines , 2008, VEE '08.

[35]  François Armand,et al.  Shared device driver model for virtualized mobile handsets , 2008, MobiVirt '08.

[36]  Hui Zang,et al.  Anonymization of location data does not work: a large-scale measurement study , 2011, MobiCom.

[37]  Sokol Kosta,et al.  Using GPGPU Accelerated Interpolation Algorithms for Marine Bathymetry Processing with On-Premises and Cloud Based Computational Resources , 2017, PPAM.

[38]  Christian Heipke,et al.  Crowdsourcing geospatial data , 2010 .

[39]  Raffaele Montella,et al.  A globus toolkit 4 based instrument service for environmental data acquisition and distribution , 2008, UPGRADE '08.

[40]  Ian T. Foster,et al.  WaComM: A Parallel Water Quality Community Model for Pollutant Transport and Dispersion Operational Predictions , 2016, 2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS).

[41]  Fabio Matano,et al.  High-resolution morpho-bathymetry of Pozzuoli Bay, southern Italy , 2016 .

[42]  Xiaoping Ma,et al.  Performance evaluation of MQTT and CoAP via a common middleware , 2014, 2014 IEEE Ninth International Conference on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP).

[43]  Salvatore Cuomo,et al.  A class of piecewise interpolating functions based on barycentric coordinates , 2014 .

[44]  Raffaele Montella,et al.  Monitoring and Modelling Coastal Vulnerability and Mitigation Proposal for an Archaeological Site (Kaulonia, Southern Italy) , 2018, Sustainability.

[45]  Livia Marcellino,et al.  Some remarks about a community open source Lagrangian pollutant transport and dispersion model , 2017, EUSPN/ICTH.

[46]  Antonio Novellino,et al.  European marine observation data network — EMODnet physics , 2015, OCEANS 2015 - Genova.

[47]  Salvatore Cuomo,et al.  A parallel PDE-based numerical algorithm for computing the Optical Flow in hybrid systems , 2017, J. Comput. Sci..

[48]  Salvatore Cuomo,et al.  IoT-based collaborative reputation system for associating visitors and artworks in a cultural scenario , 2017, Expert Syst. Appl..

[49]  Buddhadeb Halder,et al.  Evolution of crowdsourcing: potential data protection, privacy and security concerns under the new media age , 2014 .

[50]  Giulio Giunta,et al.  Using grid computing based components in on demand environmental data delivery , 2007, UPGRADE '07.

[51]  Salil S. Kanhere,et al.  Participatory Sensing: Crowdsourcing Data from Mobile Smartphones in Urban Spaces , 2011, 2011 IEEE 12th International Conference on Mobile Data Management.

[52]  Xuan Shi,et al.  Kriging interpolation over heterogeneous computer architectures and systems , 2013 .

[53]  Raffaele Montella,et al.  Wave run-up prediction and observation in a micro-tidal beach , 2017, Natural Hazards and Earth System Sciences.

[54]  Mathias Payer,et al.  Impact of GC design on power and performance for Android , 2015, SYSTOR.

[55]  Seungmin Rho,et al.  Geo-Conquesting Based on Graph Analysis for Crowdsourced Metatrails from Mobile Sensing , 2017, IEEE Communications Magazine.

[56]  Salvatore Cuomo,et al.  Piecewise Hermite interpolation via barycentric coordinates , 2015 .

[57]  Antonio Pescapè,et al.  On the Integration of Cloud Computing and Internet of Things , 2014, 2014 International Conference on Future Internet of Things and Cloud.

[58]  Awais Ahmad,et al.  A generic methodology for geo‐related data semantic annotation , 2018, Concurr. Comput. Pract. Exp..

[59]  Arun Kumar Sangaiah,et al.  Reproducing dynamics related to an Internet of Things framework: A numerical and statistical approach , 2017, J. Parallel Distributed Comput..

[60]  Giulio Giunta,et al.  Virtualizing CUDA Enabled GPGPUs on ARM Clusters , 2015, PPAM.

[61]  Ian T. Foster,et al.  Processing of crowd-sourced data from an internet of floating things , 2017, WORKS@SC.

[62]  Alex Rodriguez,et al.  Experiences building Globus Genomics: a next‐generation sequencing analysis service using Galaxy, Globus, and Amazon Web Services , 2014, Concurr. Comput. Pract. Exp..

[63]  Dana Petcu Consuming Resources and Services from Multiple Clouds , 2013, Journal of Grid Computing.

[64]  Carlos Reaño,et al.  A Performance Comparison of CUDA Remote GPU Virtualization Frameworks , 2015, 2015 IEEE International Conference on Cluster Computing.

[65]  Mani B. Srivastava,et al.  Reputation-based framework for high integrity sensor networks , 2008, TOSN.

[66]  Øyvind Breivik,et al.  Forecasting the Drift of Objects and Substances in the Ocean , 2006 .

[67]  Kevin R. Fall,et al.  A delay-tolerant network architecture for challenged internets , 2003, SIGCOMM '03.

[68]  Giulio Giunta,et al.  Accelerating Linux and Android applications on low‐power devices through remote GPGPU offloading , 2017, Concurr. Comput. Pract. Exp..

[69]  Mark Gahegan,et al.  A Predictive Model for Frequently Viewed Tiles in a Web Map , 2010, Trans. GIS.

[70]  Ian T. Foster,et al.  Globus Online: Accelerating and Democratizing Science through Cloud-Based Services , 2011, IEEE Internet Computing.

[71]  Giulio Giunta,et al.  Marine bathymetry processing through GPGPU virtualization in high performance cloud computing , 2018, Concurr. Comput. Pract. Exp..

[72]  E. Lynn Usery,et al.  Implications of Web Mercator and Its Use in Online Mapping , 2014, Cartogr. Int. J. Geogr. Inf. Geovisualization.

[73]  Ian T. Foster,et al.  Performance, Resilience, and Security in Moving Data from the Fog to the Cloud: The DYNAMO Transfer Framework Approach , 2018, IDCS.

[74]  Sergio Iserte,et al.  Remote GPU Virtualization: Is It Useful? , 2016, 2016 2nd IEEE International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era (HiPINEB).

[75]  Giulio Giunta,et al.  A mathematical model of collaborative reputation systems , 2012, Int. J. Comput. Math..

[76]  Jack P. C. Kleijnen,et al.  Kriging Metamodeling in Simulation: A Review , 2007, Eur. J. Oper. Res..

[77]  Paul Van Dooren,et al.  Iterative Filtering in Reputation Systems , 2010, SIAM J. Matrix Anal. Appl..

[78]  Guido Benassai,et al.  Implementation of WWIII wave model for the study of risk inundation on the coastlines of Campania, Italy , 2006 .

[79]  Raimon Tolosana-Delgado,et al.  Interpolation algorithm ranking using cross-validation and the role of smoothing effect. A coal zone example , 2010, Comput. Geosci..

[80]  Ian T. Foster,et al.  DYNAMO: Distributed Leisure Yacht-Carried Sensor-Network for Atmosphere and Marine Data Crowdsourcing Applications , 2018, 2018 IEEE International Conference on Cloud Engineering (IC2E).

[81]  Ian T. Foster,et al.  Using the FACE-IT portal and workflow engine for operational food quality prediction and assessment: An application to mussel farms monitoring in the Bay of Napoli, Italy , 2020, Future Gener. Comput. Syst..