Integrated Learning via Randomized Forests and Localized Regression With Application to Medical Diagnosis

The tree-based machine learning functions on the divide-and-conquer principle and is known to perform well in certain applications. In this paper, we first give a new data partitioning rule using the mean of the data columns to grow the tree till the child nodes are small in size. Then, the local regression is applied to leave nodes to enhance the resolution of the node outputs. Randomization is introduced at tree growth and forest creation. The local prediction accuracies on the leaves are used to select a subset of the test data for actual predictions. The case study on the diagnosis of autistic spectrum disorder shows that the proposed method achieves the prediction accuracy of the ensemble at above 96% with reduced variance, which is much better than those reported in the literature.

[1]  Rajeshwar Singh,et al.  Use of soft computing techniques in medical decision making: A survey , 2014, 2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[2]  Shawn K. McGuire,et al.  Automated microscopy and machine learning for expert-level malaria field diagnosis , 2015, 2015 IEEE Global Humanitarian Technology Conference (GHTC).

[3]  Sudan Basnet,et al.  A novel fuzzy-logic controller for an artificial heart , 2009, 2009 IEEE Control Applications, (CCA) & Intelligent Control, (ISIC).

[4]  Koushik Maharatna,et al.  Classification of autism spectrum disorder using supervised learning of brain connectivity measures extracted from synchrostates , 2014, Journal of neural engineering.

[5]  Rashidah Funke Olanrewaju,et al.  Application of neural networks in early detection and diagnosis of Parkinson's disease , 2014, 2014 International Conference on Cyber and IT Service Management (CITSM).

[6]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[7]  B. Spyropoulos,et al.  A Neuro-Fuzzy Controller for the estimation of Tidal Volume and Respiration Frequency ventilator settings for COPD patients ventilated in control mode , 2007, 2007 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[8]  N.K. Swain,et al.  A Survey of Application of Fuzzy Logic in Intelligent Transportation Systems (ITS) and Rural ITS , 2006, Proceedings of the IEEE SoutheastCon 2006.

[9]  Fangbing Yan,et al.  Application of Logistic Regression Model in an Epidemiological Study , 2015 .

[10]  Meng Joo Er,et al.  A Survey of Adaptive Fuzzy Controllers: Nonlinearities and Classifications , 2016, IEEE Transactions on Fuzzy Systems.

[11]  Mei-Hui Wang,et al.  A Fuzzy Expert System for Diabetes Decision Support Application , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[12]  Rui Wang,et al.  Approximate Principal Direction Trees , 2012, ICML.

[13]  Abdulkader Helwan,et al.  Neural network diagnosis of heart disease , 2015, 2015 International Conference on Advances in Biomedical Engineering (ICABME).

[14]  Wayan Suparta,et al.  Adaptive Neuro-Fuzzy Interference System , 2016 .

[15]  Ritu Tiwari,et al.  Expert system for breast cancer diagnosis: A survey , 2016, 2016 International Conference on Computer Communication and Informatics (ICCCI).

[16]  Lixun Zhang,et al.  Experiment study of fuzzy impedance control on horizontal lower limbs rehabilitation robot , 2011, 2011 International Conference on Electronics, Communications and Control (ICECC).

[17]  Bram van den Bekerom Using Machine Learning for Detection of Autism Spectrum Disorder , 2017 .

[18]  Sonali Agarwal,et al.  Clinical decision support system for diabetes disease diagnosis using optimized neural network , 2014, 2014 Students Conference on Engineering and Systems.

[19]  Matti Pietikäinen,et al.  Thorax disease diagnosis using deep convolutional neural network , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[20]  Kemal Polat,et al.  Detection of ECG Arrhythmia using a differential expert system approach based on principal component analysis and least square support vector machine , 2007, Appl. Math. Comput..

[21]  A. Volein,et al.  Prediction of Autism at 3 Years from Behavioural and Developmental Measures in High-Risk Infants: A Longitudinal Cross-Domain Classifier Analysis , 2018, Journal of autism and developmental disorders.

[22]  Xinpei Wang,et al.  Classification of heart sound signals with BP neural network and logistic regression , 2017, 2017 Chinese Automation Congress (CAC).

[23]  Jun Shi,et al.  Fused Hierarchical Neural Networks for Cardiovascular Disease Diagnosis , 2012, IEEE Sensors Journal.

[24]  Vladik Kreinovich,et al.  Fuzzy logic and its applications in medicine , 2001, Int. J. Medical Informatics.

[25]  Samuel Oluwarotimi Williams,et al.  Hybrid Intelligent System for the Diagnosis of Typhoid Fever , 2013 .

[26]  Leonardo Yunda,et al.  A Web-based Fuzzy Inference System Based Tool for Cardiovascular Disease Risk Assessment , 2015 .

[27]  Dimitrios I. Fotiadis,et al.  A Framework for Fuzzy Expert System Creation—Application to Cardiovascular Diseases , 2007, IEEE Transactions on Biomedical Engineering.

[28]  Sanjoy Dasgupta,et al.  Which Spatial Partition Trees are Adaptive to Intrinsic Dimension? , 2009, UAI.