论文信息 - Identifying Android Malware Using Network-Based Approaches

Identifying Android Malware Using Network-Based Approaches

The proliferation of Android applications has resulted in many malicious apps entering the market and causing significant damage. Robust techniques that determine if an app is malicious are greatly needed. We propose the use of network-based approaches to effectively separate malicious from benign apps, based on a small labeled dataset. The apps in our dataset come from the Google Play Store and have been scanned for malicious behavior using VirusTotal to produce a ground truth dataset with labels malicious or benign. The apps in the resulting dataset have been represented in the form of binary feature vectors (where the features represent permissions, intent actions, discriminative APIs, obfuscation signatures, and native code signatures). We have used these vectors to build a weighted network that captures the “closeness” between apps. We propagate labels from the labeled apps to unlabeled apps, and evaluate the effectiveness of the approaches studied using the Fl-measure. We have conducted experiments to compare three variants of the label propagation approaches on datasets that consist of increasingly larger amounts of labeled data.

[1] Zoubin Ghahramani,et al. Learning from labeled and unlabeled data with label propagation , 2002 .

[2] Doina Caragea,et al. Android malware detection with weak ground truth data , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[3] Chih-Jen Lin,et al. A Practical Guide to Support Vector Classication , 2008 .

[4] Yanfang Ye,et al. FindMal: A file-to-file social network based malware detection framework , 2016, Knowl. Based Syst..

[5] Gaël Varoquaux,et al. Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[6] Alexander Zien,et al. Label Propagation and Quadratic Criterion , 2006 .

[7] Xiangliang Zhang,et al. Content-Agnostic Malware Detection in Heterogeneous Malicious Distribution Graph , 2016, CIKM.

[8] Yanfang Ye,et al. Analyzing File-to-File Relation Network in Malware Detection , 2015, WISE.

[9] Tao Li,et al. File Relation Graph Based Malware Detection Using Label Propagation , 2015, WISE.

[10] Bernhard Schölkopf,et al. Learning with Local and Global Consistency , 2003, NIPS.

[11] Guanhua Yan,et al. Transductive malware label propagation: Find your lineage from your neighbors , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[12] Nic Herndon,et al. Experimental Study with Real-world Data for Android App Security Analysis using Machine Learning , 2015, ACSAC.