Review of the Complexity of Managing Big Data of the Internet of Things

There is a growing awareness that the complexity of managing Big Data is one of the main challenges in the developing field of the Internet of Things (IoT). Complexity arises from several aspects of the Big Data life cycle, such as gathering data, storing them onto cloud servers, cleaning and integrating the data, a process involving the last advances in ontologies, such as Extensible Markup Language (XML) and Resource Description Framework (RDF), and the application of machine learning methods to carry out classifications, predictions, and visualizations. In this review, the state of the art of all the aforementioned aspects of Big Data in the context of the Internet of Things is exposed. The most novel technologies in machine learning, deep learning, and data mining on Big Data are discussed as well. Finally, we also point the reader to the state-of-the-art literature for further in-depth studies, and we present the major trends for the future.

[1]  Higinio Mora,et al.  Improving Urban Accessibility: A Methodology For Urban Dynamics Analysis In Smart, Sustainable And Inclusive Cities , 2017 .

[2]  Karl Aberer,et al.  Semantic Sensor Data Search in a Large-Scale Federated Sensor Network , 2011, SSN.

[3]  Bibhudatta Sahoo,et al.  Sensing and Actuation as a Service Delivery Model in Cloud Edge centric Internet of Things , 2018, Future Gener. Comput. Syst..

[4]  Wei Fan,et al.  Mining big data: current status, and forecast to the future , 2013, SKDD.

[5]  Mubashir Husain Rehmani,et al.  Mobile Edge Computing: Opportunities, solutions, and challenges , 2017, Future Gener. Comput. Syst..

[6]  Ray Y. Zhong,et al.  Intelligent Manufacturing in the Context of Industry 4.0: A Review , 2017 .

[7]  Roy Goldman,et al.  From Semistructured Data to XML: Migrating the Lore Data Model and Query Language , 1999, Markup Lang..

[8]  Julian Szymanski,et al.  An IoT-Based Computational Framework for Healthcare Monitoring in Mobile Environments , 2017, Sensors.

[9]  Athanasios V. Vasilakos,et al.  Data Mining for the Internet of Things: Literature Review and Challenges , 2015, Int. J. Distributed Sens. Networks.

[10]  Vinod Vokkarane,et al.  A New Deep Learning-Based Food Recognition System for Dietary Assessment on An Edge Computing Service Infrastructure , 2018, IEEE Transactions on Services Computing.

[11]  Christian S. Jensen,et al.  Google fusion tables: web-centered data management and collaboration , 2010, SIGMOD Conference.

[12]  David Parmenter,et al.  Key Performance Indicators: Developing, Implementing,and Using Winning KPIs , 2007 .

[13]  Ujjwal Maulik,et al.  A Survey of Multiobjective Evolutionary Algorithms for Data Mining: Part I , 2014, IEEE Transactions on Evolutionary Computation.

[14]  David Gil,et al.  Internet of Things: A Review of Surveys Based on Context Aware Intelligent Services , 2016, Sensors.

[15]  Xin Liu,et al.  Mining Outlier Data in Mobile Internet-Based Large Real-Time Databases , 2018, Complex..

[16]  Amit P. Sheth,et al.  Toward Practical Privacy-Preserving Analytics for IoT and Cloud-Based Healthcare Systems , 2018, IEEE Internet Computing.

[17]  Mahadev Satyanarayanan,et al.  A Scalable and Privacy-Aware IoT Service for Live Video Analytics , 2017, MMSys.

[18]  Ping Wang,et al.  An open, flexible and multilevel data storing and processing platform for very large scale sensor network , 2012, 2012 14th International Conference on Advanced Communication Technology (ICACT).

[19]  Kwan-Liu Ma,et al.  Big-Data Visualization , 2013, IEEE Computer Graphics and Applications.

[20]  Alon Y. Halevy,et al.  Crowdsourcing systems on the World-Wide Web , 2011, Commun. ACM.

[21]  Suman Jana,et al.  DeepTest: Automated Testing of Deep-Neural-Network-Driven Autonomous Cars , 2017, 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE).

[22]  Xindong Wu,et al.  Data mining with big data , 2014, IEEE Transactions on Knowledge and Data Engineering.

[23]  Jason Jianjun Gu,et al.  Deep Neural Networks for wireless localization in indoor and outdoor environments , 2016, Neurocomputing.

[24]  Mahadev Satyanarayanan,et al.  The Emergence of Edge Computing , 2017, Computer.

[25]  Botond Kádár,et al.  Orchestrated Platform for Cyber-Physical Systems , 2018, Complex..

[26]  David Gil,et al.  A Computational Architecture Based on RFID Sensors for Traceability in Smart Cities , 2015, Sensors.

[27]  Subhas Chandra Mukhopadhyay,et al.  Smart Sensors and Internet of Things: A Postgraduate Paper , 2017, IEEE Sensors Journal.

[28]  Mianxiong Dong,et al.  When Weather Matters: IoT-Based Electrical Load Forecasting for Smart Grid , 2017, IEEE Communications Magazine.

[29]  Jiangchuan Liu,et al.  When deep learning meets edge computing , 2017, 2017 IEEE 25th International Conference on Network Protocols (ICNP).

[30]  Gustavo Alonso,et al.  Perm: Processing Provenance and Data on the Same Data Model through Query Rewriting , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[31]  Taghi M. Khoshgoftaar,et al.  Deep learning applications and challenges in big data analytics , 2015, Journal of Big Data.

[32]  Pedro M. Domingos,et al.  Learning to map between ontologies on the semantic web , 2002, WWW '02.

[33]  Antonio Iera,et al.  The Internet of Things: A survey , 2010, Comput. Networks.

[34]  Nicholas J. Horton,et al.  Data Science in Statistics Curricula: Preparing Students to “Think with Data” , 2014, 1410.3127.

[35]  Lida Xu,et al.  The internet of things: a survey , 2014, Information Systems Frontiers.

[36]  Shui Yu,et al.  Efficient Performance Monitoring for Ubiquitous Virtual Networks Based on Matrix Completion , 2018, IEEE Access.

[37]  Ching-Han Chen,et al.  High-level modeling and synthesis of smart sensor networks for Industrial Internet of Things , 2017, Comput. Electr. Eng..

[38]  Erik Cambria,et al.  Recent Trends in Deep Learning Based Natural Language Processing , 2017, IEEE Comput. Intell. Mag..

[39]  Vasant Dhar,et al.  Data science and prediction , 2012, CACM.

[40]  Li-Der Chou,et al.  Task Scheduling for Edge Computing with Agile VNFs On-Demand Service Model toward 5G and Beyond , 2018, Wirel. Commun. Mob. Comput..

[41]  Seref Sagiroglu,et al.  Big data: A review , 2013, 2013 International Conference on Collaboration Technologies and Systems (CTS).

[42]  Dong Yu,et al.  Deep Learning and Its Applications to Signal and Information Processing [Exploratory DSP] , 2011, IEEE Signal Processing Magazine.

[43]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[44]  Paramvir Bahl,et al.  The Case for VM-Based Cloudlets in Mobile Computing , 2009, IEEE Pervasive Computing.

[45]  Raquel Pérez-delHoyo,et al.  A Comprehensive System for Monitoring Urban Accessibility in Smart Cities , 2017, Sensors.

[46]  Kim-Kwang Raymond Choo,et al.  A multi-objective software defined network traffic measurement , 2017 .

[47]  Gonzalo Mateos,et al.  Health Monitoring and Management Using Internet-of-Things (IoT) Sensing with Cloud-Based Processing: Opportunities and Challenges , 2015, 2015 IEEE International Conference on Services Computing.

[48]  George Strawn,et al.  Data Scientist , 2016, IT Professional.

[49]  Abdulsalam Yassine,et al.  Software defined network traffic measurement: Current trends and challenges , 2015, IEEE Instrumentation & Measurement Magazine.

[50]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[51]  Mohammad Mehedi Hassan,et al.  Tradeoff between User Quality-Of-Experience and Service Provider Profit in 5G Cloud Radio Access Network , 2017 .

[52]  Alon Y. Halevy,et al.  MiniCon: A scalable algorithm for answering queries using views , 2000, The VLDB Journal.

[53]  Marimuthu Palaniswami,et al.  Internet of Things (IoT): A vision, architectural elements, and future directions , 2012, Future Gener. Comput. Syst..

[54]  Dong Yu,et al.  Deep Learning and Its Applications to Signal and Information Processing , 2011 .

[55]  Hug March,et al.  The Smart City and other ICT-led techno-imaginaries: Any room for dialogue with Degrowth? , 2016, Journal of Cleaner Production.

[56]  Tingting Wang,et al.  Research on Workshop-Based Positioning Technology Based on Internet of Things in Big Data Background , 2018, Complex..

[57]  Yiqiang Chen,et al.  Semi-supervised deep extreme learning machine for Wi-Fi based localization , 2015, Neurocomputing.

[58]  Francisco Herrera,et al.  On the use of MapReduce for imbalanced big data using Random Forest , 2014, Inf. Sci..

[59]  Naveen K. Chilamkurti,et al.  Deep Learning: The Frontier for Distributed Attack Detection in Fog-to-Things Computing , 2018, IEEE Communications Magazine.

[60]  João Alberto Rubim Sarate,et al.  Smart city and quality of life: Citizens’ perception in a Brazilian case study , 2018 .

[61]  Roy Goldman,et al.  From semistructured data to XML , 2000 .

[62]  Natalya F. Noy,et al.  Semantic integration: a survey of ontology-based approaches , 2004, SGMD.

[63]  Tomas Blomquist,et al.  The Internet of Things: Are you ready for what’s coming? , 2017 .

[64]  Chikio Hayashi,et al.  What is Data Science ? Fundamental Concepts and a Heuristic Example , 1998 .

[65]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[66]  Todd J. Green Containment of conjunctive queries on annotated relations , 2009, ICDT.

[67]  Divesh Srivastava,et al.  Global detection of complex copying relationships between sources , 2010, Proc. VLDB Endow..

[68]  N. B. Anuar,et al.  The rise of "big data" on cloud computing: Review and open research issues , 2015, Inf. Syst..

[69]  Mianxiong Dong,et al.  Learning IoT in Edge: Deep Learning for the Internet of Things with Edge Computing , 2018, IEEE Network.

[70]  Murtaza Haider,et al.  Beyond the hype: Big data concepts, methods, and analytics , 2015, Int. J. Inf. Manag..

[71]  Madoka Yuriyama,et al.  Sensor-Cloud Infrastructure - Physical Sensor Management with Virtualized Sensors on Cloud Computing , 2010, 2010 13th International Conference on Network-Based Information Systems.

[72]  Guang-Zhong Yang,et al.  A Deep Learning Approach to on-Node Sensor Data Analytics for Mobile or Wearable Devices , 2017, IEEE Journal of Biomedical and Health Informatics.

[73]  Zhile Yang,et al.  Complex Power System Status Monitoring and Evaluation Using Big Data Platform and Machine Learning Algorithms: A Review and a Case Study , 2018, Complex..

[74]  Marcelo Bissi Pires,et al.  Sensor Fusion and Smart Sensor in Sports and Biomedical Applications , 2016, Sensors.

[75]  Wai Lee,et al.  Smart sensing for IoT applications , 2016, 2016 13th IEEE International Conference on Solid-State and Integrated Circuit Technology (ICSICT).

[76]  Phil Simon,et al.  The Visual Organization: Data Visualization, Big Data, and the Quest for Better Decisions , 2014 .

[77]  Michel L. Ehrenhard,et al.  The shift to Cloud Computing: The impact of disruptive technology on the enterprise software business ecosystem , 2017 .

[78]  Weisong Shi,et al.  Edge Computing: Vision and Challenges , 2016, IEEE Internet of Things Journal.

[79]  Irene C. L. Ng,et al.  The Internet-of-Things: Review and research directions , 2017 .

[80]  Cheryl Ann Alexander,et al.  Big Data and Visualization: Methods, Challenges and Technology Progress , 2015 .

[81]  Marcus Fontoura,et al.  Querying XML streams , 2005, The VLDB Journal.

[82]  Jianli Pan,et al.  Future Edge Cloud and Edge Computing for Internet of Things Applications , 2018, IEEE Internet of Things Journal.

[83]  Ahmed M. Shahat Osman A novel big data analytics framework for smart cities , 2019, Future Gener. Comput. Syst..

[84]  C. L. Philip Chen,et al.  Data-intensive applications, challenges, techniques and technologies: A survey on Big Data , 2014, Inf. Sci..

[85]  Antonio Puliafito,et al.  Exploring Container Virtualization in IoT Clouds , 2016, 2016 IEEE International Conference on Smart Computing (SMARTCOMP).

[86]  Jared Dean,et al.  Big Data, Data Mining, and Machine Learning: Value Creation for Business Leaders and Practitioners , 2014 .

[87]  Jinxing Hu,et al.  Government affairs service platform for smart city , 2018, Future Gener. Comput. Syst..

[88]  Magnus Johnsson,et al.  Collaborative Working Architecture for IoT-Based Applications† , 2018, Sensors.

[89]  Maurizio Giacobbe,et al.  Evaluating Information Quality in Delivering IoT-as-a-Service , 2018, 2018 IEEE International Conference on Smart Computing (SMARTCOMP).

[90]  Arkady B. Zaslavsky,et al.  Sensing as a Service and Big Data , 2013, ArXiv.

[91]  Bruno Volckaert,et al.  A Secure Multi-Tier Mobile Edge Computing Model for Data Processing Offloading Based on Degree of Trust , 2018, Sensors.

[92]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[93]  Yunhao Liu,et al.  Big Data: A Survey , 2014, Mob. Networks Appl..

[94]  Bruno Volckaert,et al.  Fog Computing: Enabling the Management and Orchestration of Smart City Applications in 5G Networks , 2017, Entropy.

[95]  Lida Xu,et al.  Compressed Sensing Signal and Data Acquisition in Wireless Sensor Networks and Internet of Things , 2013, IEEE Transactions on Industrial Informatics.

[96]  Juan Trujillo,et al.  A Novel Multidimensional Approach to Integrate Big Data in Business Intelligence , 2015, J. Database Manag..

[97]  Xiangru Meng,et al.  Dynamic Virtual Network Reconfiguration Method for Hybrid Multiple Failures Based on Weighted Relative Entropy , 2018, Entropy.

[98]  Tie Qiu,et al.  Survey on fog computing: architecture, key technologies, applications and open issues , 2017, J. Netw. Comput. Appl..

[99]  Gian Marco Revel,et al.  A Smart Sensing Architecture for Domestic Monitoring: Methodological Approach and Experimental Validation , 2018, Sensors.