Blending Big Data Analytics: Review on Challenges and a Recent Study

With massive amounts of data collected every day, big data analytics has emerged as an important trend for many organizations. The collected data can contain information that may be key to solving problems in wide-ranging areas, such as cybersecurity, marketing, healthcare, and fraud. Large companies, such as Facebook and Google, adopt analytics to analyze their large volumes of data and to support business decisions. Such analyses and decisions impact existing and future technology. In this paper, we explore how big data analytics is used to solve problems involving complex and unstructured data, using technologies such as Hadoop, Spark, and MapReduce. We also discuss the data challenges that big data introduces according to the literature, including its six V’s. Moreover, we investigate case studies covering several types of big data analytics, namely text, voice, video, and network analytics. We conclude that big data analytics can bring positive changes in the future to many fields, such as education, military, healthcare, politics, business, agriculture, banking, and marketing.
