An Analysis on Deep Learning Approach Performance in Classifying Big Data Set

Big data sets are mainly derived from social media as well as stock market exchange. It is commonly described according to its main characteristics the 3Vs, which refers to Volume, Velocity and Variety. Big data sets often contributed to difficulties faced by the back end groups such as data analyst, system developer, programmer, and network analyst due to its complexity issue. To overcome this issue, many researchers and professionals have proposed and initiated various solutions, for instance; algorithm, software, hardware and framework related to big data. One beneficial and popularly known approach in dealing with big data is deep learning. It is an extension of neural network that is able to analyze huge data sets without assistance from any parameterization methods. To make use of this advantage, this paper aimed to evaluate the capability of deep learning in analyzing big data sets. Several data sets were selected and support vector machine (SVM) was chosen as a benchmark method for the experimental work. The results obtained revealed that deep learning has outperformed SVM in classifying big data set. As a conclusion, deep learning can be categorized as one of the best machine learning approaches to be used in decision analysis process. It can also be used as an alternative approach to other traditional approaches such as Naive Bayes or SVM which require more data processing phases.

[1]  Mahdi Fahmideh,et al.  Big data analytics architecture design - An application in manufacturing systems , 2019, Comput. Ind. Eng..

[2]  Laura Palagi,et al.  Neural networks for small scale ORC optimization , 2017 .

[3]  Rob Deardon,et al.  Deep learning for supervised classification of spatial epidemics. , 2019, Spatial and spatio-temporal epidemiology.

[4]  El-Sayed M. El-Horbaty,et al.  Classification using deep learning neural networks for brain tumors , 2017, Future Computing and Informatics Journal.

[5]  Mohamed Elhoseny,et al.  Feature selection based on artificial bee colony and gradient boosting decision tree , 2019, Appl. Soft Comput..

[6]  Ali Selamat,et al.  A New Soft Rough Set Parameter Reduction Method for an Effective Decision-Making , 2017, New Trends in Software Methodologies, Tools and Techniques.

[7]  Timothée Masquelier,et al.  Deep Learning in Spiking Neural Networks , 2018, Neural Networks.

[8]  Christopher Leckie,et al.  High-dimensional and large-scale anomaly detection using a linear one-class SVM with deep learning , 2016, Pattern Recognit..

[9]  Amir Abbas Najafi,et al.  Support vector machine with Dirichlet feature mapping , 2018, Neural Networks.

[10]  Ruo-Ping Han,et al.  Disease prediction with different types of neural network classifiers , 2016, Telematics Informatics.

[11]  Ayoub Ait Lahcen,et al.  APRA: An approximate parallel recommendation algorithm for Big Data , 2018, Knowl. Based Syst..

[12]  Md. Zakirul Alam Bhuiyan,et al.  A Survey on Deep Learning in Big Data , 2017, 22017 IEEE International Conference on Computational Science and Engineering (CSE) and IEEE International Conference on Embedded and Ubiquitous Computing (EUC).

[13]  Ritika Wason,et al.  Deep learning: Evolution and expansion , 2018, Cognitive Systems Research.