Comparative Performance of Random Forest Algorithms and Discriminant Analysis in Classifications

The use of computer-based systems plays a leading role today as an analytical technique in diagnosing diseases, including predicting autism disorders. This research aimed to predict the class or target by using related variables accurately. The prediction process involved the number of attributes and records from the dataset. The dataset used in this study was the autism disorder dataset taken from the UCI repository machine learning; the data was already in number (binary) thus, no dummy variable was necessary is not needed. In this study, we used the Random Forest algorithm and discriminant analysis. The purpose of this study was to predict the type of autism disorder in children according to their prescribed symptoms. Two types of disorders used as target variable; communication disorders and behavioral disorders. Meanwhile, the independent variables are symptom 1, symptom 2, symptom 3, symptom 4, and symptom 5. The results from these are later processed using the SPSS software. The results from the Random Forest method indicated that 74.2% of forming a classification and a sensitivity value of 26.3%. Meanwhile, the percentage in the discriminant analysis was 70.1%. This value indicated that the decision tree was more reliable in classifying target types

[1]  M. A. Jabbar,et al.  Random Forest Modeling for Network Intrusion Detection System , 2016 .

[2]  Aniati Murni Arymurthy,et al.  Random Subspace Method for Sleep Stage Classification of Autism Patients , 2018, 2018 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI).

[3]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[4]  Nabila Shahnaz Khan,et al.  A Machine Learning Approach to Predict Autism Spectrum Disorder , 2019, 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE).

[5]  Liang Liang,et al.  An improved random decision trees algorithm with application to land cover classification , 2010, 2010 18th International Conference on Geoinformatics.