Big Data Analytics Concepts, Technologies Challenges, and Opportunities

The rapid growth in Internet use has produced huge amounts of data. Traditional data technologies, techniques, and even applications cannot cope with the volume, structure, and variety of this new data. Big data concepts emerged to absorb this non-stop flood. Big data analysis is used to extract the useful data and discard the rest, which yields better results with minimal resource utilization, time, and cost. Feature selection is a traditional dimensionality reduction technique; big data analytics provides modern technologies and frameworks with which feature selection can be integrated, improving the performance of feature selection itself and helping with the preprocessing of big data. The main objective of this paper is to survey the most recent research challenges in big data analysis and preprocessing. The analysis proceeds by acquiring data from its sources, storing it, filtering it to keep the useful data and dismiss the unwanted data, and then extracting information. Before data is analyzed, it must be prepared: noise is removed, incomplete data is fixed, and the data is put into a suitable form. This is done in the preprocessing step through various techniques such as data reduction, cleaning, normalization, preparation, integration, and transformation.
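As a rough illustration of the preprocessing step described above, the following minimal sketch chains three of the named tasks: cleaning (filling in incomplete values), normalization, and reduction through a simple variance-based feature selection. It is not the method of any surveyed work. The function names, the mean-imputation rule, the min-max scaling, the variance threshold, and the sample data are all assumptions chosen for illustration, and the code runs on a small in-memory table rather than on a distributed big data framework.

```python
from statistics import mean, pvariance


def impute_missing(rows):
    """Cleaning: replace each None with the mean of its column (assumed rule)."""
    cols = list(zip(*rows))
    means = [mean(v for v in col if v is not None) for col in cols]
    return [[m if v is None else v for v, m in zip(row, means)] for row in rows]


def min_max_normalize(rows):
    """Normalization: rescale each column to [0, 1]; constant columns map to 0."""
    cols = list(zip(*rows))
    lows = [min(col) for col in cols]
    spans = [max(col) - min(col) for col in cols]
    return [
        [(v - lo) / span if span else 0.0 for v, lo, span in zip(row, lows, spans)]
        for row in rows
    ]


def select_by_variance(rows, threshold):
    """Reduction: keep only columns whose population variance exceeds threshold."""
    cols = list(zip(*rows))
    keep = [i for i, col in enumerate(cols) if pvariance(col) > threshold]
    return keep, [[row[i] for i in keep] for row in rows]


def preprocess(rows, threshold=0.01):
    """Run cleaning, normalization, and feature selection in sequence."""
    cleaned = impute_missing(rows)
    scaled = min_max_normalize(cleaned)
    return select_by_variance(scaled, threshold)


# Hypothetical sample: column 1 is constant, column 2 has a missing value.
raw = [
    [1.0, 5.0, 10.0],
    [2.0, 5.0, None],
    [3.0, 5.0, 30.0],
]
kept, reduced = preprocess(raw)
print(kept)     # [0, 2]  (the constant column is dropped)
print(reduced)  # [[0.0, 0.0], [0.5, 0.5], [1.0, 1.0]]
```

In this sketch the constant column carries no information and is removed, while the missing value is filled before scaling so that it does not distort the column range. On real big data workloads the same sequence would typically be expressed in a distributed framework rather than in plain lists.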
