A Survey: Classification of Big Data

In the current decades large data sets are mostly available from the source, extraction and analysis of data is an interesting and challenging task. Big Data relate to expansive bulk size, developing datasets that are intricate and have numerous self-ruling spring. Prior advances were not ready to deal with capacity and handling of enormous dataset in this manner Big Data idea appears. This is a monotonous employment for clients to distinguish precise data from enormous unstructured data. Along these lines, there ought to be some system which characterize unstructured data into sorted out shape which causes client to effectively get to required data. Arrangement systems over big value-based database give expected dataset to the clients from huge datasets further straightforward way. There are two primary arrangement procedures, administered and unsupervised. In this paper we concentrated on to investigation of various administered characterization methods. Encourage this paper demonstrates use of every system and their points of interest and confinements.

[1]  Xiao Liu,et al.  A DT-SVM Strategy for Stock Futures Prediction with Big Data , 2013, 2013 IEEE 16th International Conference on Computational Science and Engineering.

[2]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[3]  Cheng Hao Jin,et al.  Ensemble method for classification of high-dimensional data , 2014, 2014 International Conference on Big Data and Smart Computing (BIGCOMP).

[4]  Wei Dai,et al.  A MapReduce Implementation of C4.5 Decision Tree Algorithm , 2014 .

[5]  Aurobinda Routray,et al.  Evolutionary algorithm based optimization for power quality disturbances classification using support vector machines , 2010 .

[6]  Sotiris B. Kotsiantis,et al.  Supervised Machine Learning: A Review of Classification Techniques , 2007, Informatica.

[7]  K Krishnaiah,et al.  Review on “Data Mining with Big Data , 2018 .

[8]  S. Sukumaran,et al.  A study on classification techniques in data mining , 2013, 2013 Fourth International Conference on Computing, Communications and Networking Technologies (ICCCNT).

[9]  Guoyin Li,et al.  Support vector machine classifiers with uncertain knowledge sets via robust optimization , 2014 .

[10]  Shan Suthaharan,et al.  Big data classification: problems and challenges in network intrusion prediction with machine learning , 2014, PERV.

[11]  Jiawei Han,et al.  Classifying large data sets using SVMs with hierarchical clusters , 2003, KDD '03.

[12]  Samer Samarah,et al.  The application of semantic-based classification on big data , 2014, 2014 5th International Conference on Information and Communication Systems (ICICS).

[13]  Howard Gobioff,et al.  The Google file system , 2003, SOSP '03.

[14]  Xindong Wu,et al.  Data mining with big data , 2014, IEEE Transactions on Knowledge and Data Engineering.