A novel permission-based Android malware detection system using feature selection based on linear regression

With the developments in mobile and wireless technology, mobile devices have become an important part of our lives. While Android is the leading operating system in market share, it is the platform most targeted by attackers. Although many solutions have been proposed in the literature for the detection of Android malware, there is still a need for attribute selection methods to be used in Android malware detection systems. In this study, a machine learning-based malware detection system is proposed to distinguish Android malware from benign applications. At the feature selection stage of the proposed malware detection system, it is aimed to remove unnecessary features by using a linear regression-based feature selection approach. In this way, the dimension of the feature vector is reduced, the training time is decreased, and the classification model can be used in real-time malware detection systems. When the results of the study are examined, the highest 0.961 is obtained according to the F-measure metric by using at least 27 features.

[1]  Patrick D. McDaniel,et al.  Understanding Android Security , 2009, IEEE Security & Privacy Magazine.

[2]  Elizabeth A. Peck,et al.  Introduction to Linear Regression Analysis , 2001 .

[3]  Sankardas Roy,et al.  Deep Ground Truth Analysis of Current Android Malware , 2017, DIMVA.

[4]  Hahn-Ming Lee,et al.  DroidMat: Android Malware Detection through Manifest and API Calls Tracing , 2012, 2012 Seventh Asia Joint Conference on Information Security.

[5]  Ainuddin Wahid Abdul Wahab,et al.  A review on feature selection in mobile malware detection , 2015, Digit. Investig..

[6]  Azhar Imran,et al.  Implementation and Use of Disease Diagnosis Systems for Electronic Medical Records Based on Machine Learning: A Complete Review , 2020, IEEE Access.

[7]  Kamlesh Dutta,et al.  A Survey on Various Threats and Current State of Security in Android Platform , 2019, ACM Comput. Surv..

[8]  Sakir Sezer,et al.  A New Android Malware Detection Approach Using Bayesian Classification , 2013, 2013 IEEE 27th International Conference on Advanced Information Networking and Applications (AINA).

[9]  Yong Fan,et al.  A Systematic Literature Review of Android Malware Detection Using Static Analysis , 2020, IEEE Access.

[10]  Oktay Yildiz,et al.  Permission-based Android Malware Detection System Using Feature Selection with Genetic Algorithm , 2019, Int. J. Softw. Eng. Knowl. Eng..

[11]  Jayashree Padmanabhan,et al.  Machine Learning in Automatic Speech Recognition: A Survey , 2015 .

[12]  Praveen Kumar Reddy Maddikunta,et al.  An ensemble machine learning approach through effective feature extraction to classify fake news , 2021, Future Gener. Comput. Syst..

[13]  Juan E. Tapiador,et al.  Dendroid: A text mining approach to analyzing and classifying code structures in Android malware families , 2014, Expert Syst. Appl..

[14]  Malay Kishore Dutta,et al.  Android Malware Detection Using Genetic Algorithm based Optimized Feature Selection and Machine Learning , 2019, 2019 42nd International Conference on Telecommunications and Signal Processing (TSP).

[15]  Mamoun Alazab,et al.  Intelligent mobile malware detection using permission requests and API calls , 2020, Future Gener. Comput. Syst..

[16]  Fakhri Alam Khan,et al.  Static malware detection and attribution in android byte-code through an end-to-end deep system , 2020, Future Gener. Comput. Syst..

[17]  Miao Zhang,et al.  A Review of Android Malware Detection Approaches Based on Machine Learning , 2020, IEEE Access.

[18]  Muttukrishnan Rajarajan,et al.  Android Security: A Survey of Issues, Malware Penetration, and Defenses , 2015, IEEE Communications Surveys & Tutorials.

[19]  Gonzalo Álvarez,et al.  PUMA: Permission Usage to Detect Malware in Android , 2012, CISIS/ICEUTE/SOCO Special Sessions.

[20]  José Manuel Benítez,et al.  Empirical study of feature selection methods based on individual feature evaluation for classification problems , 2011, Expert Syst. Appl..

[21]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[22]  Douglas Fisher,et al.  Machine Learning Approaches to Estimating Software Development Effort , 1995, IEEE Trans. Software Eng..

[23]  Mauro Conti,et al.  ANASTASIA: ANdroid mAlware detection using STatic analySIs of Applications , 2016, 2016 8th IFIP International Conference on New Technologies, Mobility and Security (NTMS).

[24]  Wenjuan Sun,et al.  A detection method for android application security based on TF-IDF and machine learning , 2020, PloS one.

[25]  Konrad Rieck,et al.  DREBIN: Effective and Explainable Detection of Android Malware in Your Pocket , 2014, NDSS.

[26]  Xiaohui Wei,et al.  Exploring the financial indicators to improve the pattern recognition of economic data based on machine learning , 2020, Neural Computing and Applications.

[27]  Sakir Sezer,et al.  Analysis of Bayesian classification-based approaches for Android malware detection , 2016, IET Inf. Secur..

[28]  Nor Badrul Anuar,et al.  ABC: Android Botnet Classification using feature selection and classification algorithms , 2017 .

[29]  Simin Nadjm-Tehrani,et al.  Crowdroid: behavior-based malware detection system for Android , 2011, SPSM '11.

[30]  Praveen Kumar Reddy Maddikunta,et al.  An effective feature engineering for DNN using hybrid PCA-GWO for intrusion detection in IoMT architecture , 2020, Comput. Commun..

[31]  R. T. Goswami,et al.  A feature selection technique based on rough set and improvised PSO algorithm (PSORS-FS) for permission based detection of Android malwares , 2018, Int. J. Mach. Learn. Cybern..

[32]  Cengiz Acartürk,et al.  The analysis of feature selection methods and classification algorithms in permission based Android malware detection , 2014, 2014 IEEE Symposium on Computational Intelligence in Cyber Security (CICS).

[33]  Ahmad Salah,et al.  A Lightweight Android Malware Classifier Using Novel Feature Selection Methods , 2020, Symmetry.

[34]  Xiangliang Zhang,et al.  Constructing Features for Detecting Android Malicious Applications: Issues, Taxonomy and Directions , 2019, IEEE Access.

[35]  Victor Sreeram,et al.  A low complexity linear regression approach to time synchronization in underwater networks , 2011, 2011 8th International Conference on Information, Communications & Signal Processing.

[36]  Altyeb Altaher,et al.  Classification of Android Malware Applications using Feature Selection and Classification Algorithms , 2016 .

[37]  Qing Ye,et al.  FAMD: A Fast Multifeature Android Malware Detection Framework, Design, and Implementation , 2020, IEEE Access.