MBFerns: classification and extraction of actionable knowledge using Multi-Branch Ferns-based Naive Bayesian classifier

Classification is one of the tasks that are most frequently carried out in real world applications. A large number of techniques have been developed based on statistics and machine learning methods. These classification techniques usually suffer from various limitations, and there is no single technique that works best for all classification problems. Two major drawbacks in existing techniques are accuracy and lack of actionable knowledge from results. To overcome these problems, a novel algorithm called Multi-Branch Ferns (MBFerns), and R-package has been developed to build multi-branch ferns (multi-branch decision tree) and to generate key features from training dataset employing Naive Bayesian probabilistic model as classifier. The proposed algorithm performs well for general classification problems and extracting actionable knowledge from training data. The proposed method has been evaluated with best existing classification methods namely, Random Forest (RF), Support Vector Machine (SVM) and Artificial Neural Network (ANN) on medical benchmark data, available at https://archive.ics.uci.edu/ml/datasets/ such as Breast Cancer, Cryotherapy, Cardiotocography, Dermatology, Echocardiogram, EEG Eye State, Fertility, Haberman's Survival, Hepatitis, Indian Liver Patient, Mammographic Mass, Parkinsons, etc. Detailed investigation on proposed Multi-Branch Ferns (MBFerns) with respect to accuracy, time, space complexity and knowledge discovery has also been presented.

[1]  Shumpei Niida,et al.  Estrogen Regulates the Production of VEGF for Osteoclast Formation and Activity in op/op Mice , 2003, Journal of bone and mineral research : the official journal of the American Society for Bone and Mineral Research.

[2]  William Marsh,et al.  Not just data: A method for improving prediction with knowledge , 2014, J. Biomed. Informatics.

[3]  Rakhi Batra,et al.  Actionable Knowledge Discovery for Increasing Enterprise Profit, Using Domain Driven-Data Mining , 2019, IEEE Access.

[4]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[5]  Rashedur M. Rahman,et al.  Decision Tree and Naïve Bayes Algorithm for Classification and Generation of Actionable Knowledge for Direct Marketing , 2013 .

[6]  Miron B. Kursa,et al.  rFerns: An Implementation of the Random Ferns Method for General-Purpose Machine Learning , 2012, 1202.1121.

[7]  G. Tang,et al.  Indian Hedgehog: A Mechanotransduction Mediator in Condylar Cartilage , 2004, Journal of dental research.

[8]  Andrew Zisserman,et al.  Image Classification using Random Forests and Ferns , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[9]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[10]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[11]  Chengqi Zhang,et al.  Flexible Frameworks for Actionable Knowledge Discovery , 2010, IEEE Transactions on Knowledge and Data Engineering.

[12]  Amit P. Sheth,et al.  From Data to Actionable Knowledge: Big Data Challenges in the Web of Things , 2013, IEEE Intell. Syst..

[13]  Andrew Gilbert,et al.  Action recognition using Randomised Ferns , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[14]  Vincent Lepetit,et al.  Fast Keypoint Recognition in Ten Lines of Code , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Nasrin Kalanat,et al.  Extracting actionable knowledge from social networks with node attributes , 2019, Expert Syst. Appl. X.

[16]  W. Pitts,et al.  A Logical Calculus of the Ideas Immanent in Nervous Activity (1943) , 2021, Ideas That Created the Future.

[17]  Renato A. Krohling,et al.  Information Technology and Quantitative Management ( ITQM 2015 ) A-TOPSIS – An approach Based on TOPSIS for Ranking Evolutionary Algorithms , 2015 .