A systematic literature review on features of deep learning in big data analytics

The aims of this study are to identify the existing features of DL approaches for using in BDA and identify the key features that affect the effectiveness of DL approaches. Method: A Systematic Literature Review (SLR) was carried out and reported based on the preferred reporting items for systematic reviews. 4065 papers were retrieved by manual search in four databases which are Google Scholar, Taylor & Francis, Springer Link and Science Direct. 34 primary studies were finally included. Result: From these studies, 70% were journal articles, 25% were conference papers and 5% were contributions from the studies consisted of book chapters. Five features of DL were identified and analyzed. The features are (1) hierarchical layer, (2) high-level abstraction, (3) process high volume of data, (4) universal model and (5) does not over fit the training data. Conclusion: This review delivers the evidence that DL in BDA is an active research area. The review provides researchers with some guidelines for future research on this topic. It also provides broad information on DL in BDA which could be useful for practitioners.

[1]  I. Song,et al.  Analytics over large-scale multidimensional data: the big data revolution! , 2011, DOLAP '11.

[2]  Rajkumar Buyya,et al.  Big Data Analytics = Machine Learning + Cloud Computing , 2016, ArXiv.

[3]  Lei Guo,et al.  Networking Big Data: Definition, Key Technologies and Challenging Issues of Transmission , 2015, BigCom.

[4]  D. Maltby Big Data Analytics , 2014 .

[5]  Khin Mi Mi Aung,et al.  Building a large-scale object-based active storage platform for data analytics in the internet of things , 2016, The Journal of Supercomputing.

[6]  J. Manyika Big data: The next frontier for innovation, competition, and productivity , 2011 .

[7]  Rodina Binti Ahmad,et al.  A systematic literature review on Enterprise Architecture Implementation Methodologies , 2015, Inf. Softw. Technol..

[8]  Shein-Chung Chow,et al.  On Big-Data Analytics in Biomedical Research , 2015 .

[9]  P. Cronin,et al.  Undertaking a literature review: a step-by-step approach. , 2008, British journal of nursing.

[10]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[11]  Zohra Bellahsene,et al.  A Constraint Optimization Method for Large-Scale Distributed View Selection , 2016, Trans. Large Scale Data Knowl. Centered Syst..

[12]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[13]  Allan Pinkus,et al.  Multilayer Feedforward Networks with a Non-Polynomial Activation Function Can Approximate Any Function , 1991, Neural Networks.

[14]  Pearl Brereton,et al.  Performing systematic literature reviews in software engineering , 2006, ICSE.

[15]  Ivo D. Dinov,et al.  Methodological challenges and analytic opportunities for modeling and interpreting Big Healthcare Data , 2016, GigaScience.

[16]  Lian Duan,et al.  Big data analytics and business analytics , 2015 .

[17]  Derek C. Rose,et al.  Deep Machine Learning - A New Frontier in Artificial Intelligence Research [Research Frontier] , 2010, IEEE Computational Intelligence Magazine.

[18]  Swarup Roy,et al.  Big Data Analytics in Bioinformatics: A Machine Learning Perspective , 2015, ArXiv.

[19]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[20]  Sachchidanand Singh,et al.  Big Data analytics , 2012 .

[21]  Bineng Zhong,et al.  CNNTracker: Online discriminative object tracking via deep convolutional neural network , 2016, Appl. Soft Comput..

[22]  Zhi-Quan Luo,et al.  A Unified Algorithmic Framework for Block-Structured Optimization Involving Big Data: With applications in machine learning and signal processing , 2015, IEEE Signal Processing Magazine.

[23]  Paul Mineiro,et al.  Machine learning for big data , 2013, SIGMOD '13.

[24]  Shan Suthaharan,et al.  Big data classification: problems and challenges in network intrusion prediction with machine learning , 2014, PERV.