An Efficient Multidimensional Big Data Fusion Approach in Machine-to-Machine Communication

Machine-to-Machine (M2M) communication is rapidly becoming a worldwide network of interconnected, uniquely addressable devices that communicate via standard protocols. The prevalence of M2M is bound to generate a massive volume of heterogeneous, multisource, dynamic, and sparse data, which poses major computational challenges in analysis, aggregation, and storage. A critical problem is how to extract useful information efficiently from this volume of data. To ensure adequate analysis quality, diverse and voluminous data must be aggregated and fused, so it is imperative to improve the computational efficiency of fusing and analyzing such data. To address these issues, this article proposes an efficient, multidimensional, big data analytical architecture based on a fusion model. The basic concept is the division of magnitudes (attributes): big datasets with complex magnitudes are reduced to smaller data subsets through the five levels of the fusion model, and these subsets can be easily processed by the Hadoop Processing Server. This formalizes the problem of feature extraction for applications such as earth observatory systems, social networking, or networking applications. A four-layered network architecture is also proposed that fulfills the basic requirements of the analytical architecture. The feasibility and efficiency of the algorithms used in the fusion model are evaluated on a single-node Hadoop setup running Ubuntu 14.04 LTS on a Core i5 machine with a 3.2 GHz processor and 4 GB of memory. The results show that the proposed system architecture efficiently extracts various features (such as land and sea) from the massive volume of satellite data.
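
The division of attributes described above can be illustrated with a minimal Python sketch. It is not the article's five-level fusion model or its Hadoop implementation: the function names (`split_attributes`, `summarize`, `fuse`), the band names, the sample values, and the use of a per-attribute mean as the per-subset computation are all assumptions made purely for illustration. The sketch only shows the general idea of splitting a wide dataset into smaller attribute subsets, processing each subset independently, and merging the partial results.

```python
from statistics import mean


def split_attributes(records, group_size):
    """Split a list of records into column subsets of at most group_size attributes."""
    names = sorted(records[0].keys())
    groups = [names[i:i + group_size] for i in range(0, len(names), group_size)]
    # One smaller dataset per attribute group; each keeps every record (row).
    return [[{k: r[k] for k in group} for r in records] for group in groups]


def summarize(subset):
    """Hypothetical per-subset processing step: the mean of each attribute."""
    return {k: mean(r[k] for r in subset) for k in subset[0]}


def fuse(summaries):
    """Merge the partial results from all subsets into one result."""
    merged = {}
    for summary in summaries:
        merged.update(summary)
    return merged


# Illustrative records with four attributes (values are made up).
records = [
    {"band1": 0.2, "band2": 0.4, "band3": 0.1, "band4": 0.9},
    {"band1": 0.4, "band2": 0.6, "band3": 0.3, "band4": 0.7},
]

subsets = split_attributes(records, group_size=2)
fused = fuse(summarize(s) for s in subsets)
```

Because each subset is summarized independently, the per-subset step is the part that could be distributed across workers in a setting like the one the article describes; here it simply runs sequentially.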
