A novel network framework using similar-to-different learning strategy

Most of the existing classification techniques concentrate on learning the datasets as a single similar unit, in spite of so many differentiating attributes and complexities involved. However, traditional classification techniques are required to analyze the datasets prior to learning, and if not doing so, they loss their performance in terms of accuracy and AUC. To this end, many of the machine learning problems can be very easily solved just by carefully observing human learning and training nature and then mimicking the same in the machine learning. In response to these issues, we present a comprehensive suite of experiments carefully designed to provide conclusive, reliable, and significant results to the problem of efficient learning. This paper proposes a novel, simple, and effective machine learning paradigm that explicitly exploits this important similar-to-different (S2D) human learning strategy and implements it based on two algorithms (C4.5 and CART) efficiently. The framework not only analyzes the data sets prior to implementation, but also carefully allows classifier to have a systematic study so as to mimic the human training technique designed for efficient learning. Experimental results show that the method outperforms the state-of-the-art methods in terms of learning capability and breaks through the gap between human and machine learning. In fact, the proposed similar-to-different (S2D) strategy may also be useful in efficient learning of real-world complex and high-dimensional data sets, especially which are very typical to learn with traditional classifiers.

[1]  Ian Witten,et al.  Data Mining , 2000 .

[2]  Sanjay Jain,et al.  Some natural conditions on incremental learning , 2007, Inf. Comput..

[3]  Manoj Kumar Tiwari,et al.  Soft decision trees: A genetically optimized cluster oriented approach , 2009, Expert Syst. Appl..

[4]  Brian A. Malloy,et al.  The IELR(1) algorithm for generating minimal LR(1) parser tables for non-LR(1) grammars with conflict resolution , 2010, Sci. Comput. Program..

[5]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[6]  Gwénolé Quellec,et al.  A multiple-instance learning framework for diabetic retinopathy screening , 2012, Medical Image Anal..

[7]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[8]  Rita Cucchiara,et al.  Optimized Block-Based Connected Components Labeling With Decision Trees , 2010, IEEE Transactions on Image Processing.

[9]  Ali Mirza Mahmood,et al.  A novel pruning approach using expert knowledge for data-specific pruning , 2011, Engineering with Computers.

[10]  Kevin Duh,et al.  Flexible sample selection strategies for transfer learning in ranking , 2012, Inf. Process. Manag..

[11]  Ching-Hung Lee,et al.  Nonlinear systems design by a novel fuzzy neural system via hybridization of electromagnetism-like mechanism and particle swarm optimisation algorithms , 2012, Inf. Sci..

[12]  Costantino Grana,et al.  Optimal decision trees for local image processing algorithms , 2012, Pattern Recognit. Lett..

[13]  Rita Cucchiara,et al.  Optimal Decision Trees Generation from OR-Decision Tables , 2011, ICIAP.

[14]  P. N. Suganthan,et al.  A dynamic neighborhood learning based particle swarm optimizer for global numerical optimization , 2012, Inf. Sci..

[15]  Edwin Lughofer,et al.  Hybrid active learning for reducing the annotation effort of operators in classification systems , 2012, Pattern Recognit..

[16]  Richard Gil,et al.  A novel integrated knowledge support system based on ontology learning: Model specification and a case study , 2012, Knowl. Based Syst..

[17]  D. J. Newman,et al.  UCI Repository of Machine Learning Database , 1998 .

[18]  Serkan Çelik,et al.  Vocabulary learning strategy use of Turkish EFL learners , 2010 .

[19]  Yongheng Jiang,et al.  Iterative learning belief rule-base inference methodology using evidential reasoning for delayed coking unit , 2011, 2011 International Symposium on Advanced Control of Industrial Processes (ADCONIP).

[20]  Hanqing Lu,et al.  Asymmetric propagation based batch mode active learning for image retrieval , 2013, Signal Process..

[21]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[22]  Ali Mirza Mahmood,et al.  An Improved CART Decision Tree for Datasets with Irrelevant Feature , 2011, SEMCCO.

[23]  Charles X. Ling,et al.  Supervised Learning with Minimal Effort , 2010, PAKDD.

[24]  Ali Mirza Mahmood,et al.  Early Detection of Clinical Parameters in Heart Disease by Improved Decision Tree Algorithm , 2010, 2010 Second Vaagdevi International Conference on Information Technology for Real World Problems.

[25]  Kenneth C. Sevcik,et al.  The synthetic approach to decision table conversion , 1976, CACM.

[26]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[27]  Guodong Zhou,et al.  Hierarchical learning strategy in semantic relation extraction , 2008, Inf. Process. Manag..

[28]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .