Malicious web content detection using machine leaning

Naive users using a browser have no idea about the back-end of the page. The users might be tricked into giving away their credentials or downloading malicious data. Our aim is to create an extension for Chrome which will act as middleware between the users and the malicious websites, and mitigate the risk of users succumbing to such websites. Further, all harmful content cannot be exhaustively collected as even that is bound to continuous development. To counter this we are using machine learning-to train the tool and categorize the new content it sees every time into the particular categories so that corresponding action can be taken.

[1]  Monther Aldwairi,et al.  MALURLS: A Lightweight Malicious Website Classification Based on URL Features , 2012 .

[2]  Ganesh Venkataraman,et al.  Adaptive Semantic Search: Re-Ranking of Search Results Based on Webpage Feature Extraction and Implicitly Learned Knowledge of User Interests , 2014, 2014 10th International Conference on Semantics, Knowledge and Grids.

[3]  Benjamin Livshits,et al.  ZOZZLE: Fast and Precise In-Browser JavaScript Malware Detection , 2011, USENIX Security Symposium.

[4]  Tsuhan Chen,et al.  Malicious web content detection by machine learning , 2010, Expert Syst. Appl..

[5]  Saumya K. Debray,et al.  Automatic Simplification of Obfuscated JavaScript Code: A Semantics-Based Approach , 2012, 2012 IEEE Sixth International Conference on Software Security and Reliability.

[6]  J. B. Patil,et al.  Survey on Malicious Web Pages Detection Techniques , 2015 .

[7]  Suku Nair,et al.  A comparison of machine learning techniques for phishing detection , 2007, eCrime '07.

[8]  Heejo Lee,et al.  Detecting Malicious Web Links and Identifying Their Attack Types , 2011, WebApps.

[9]  Komminist Weldemariam,et al.  Malicious Website Detection: Effectiveness and Efficiency Issues , 2011, 2011 First SysSec Workshop.

[10]  Nazife Baykal,et al.  Feature extraction and classification phishing websites based on URL , 2015, 2015 IEEE Conference on Communications and Network Security (CNS).

[11]  Gary Warner,et al.  Identifying vulnerable websites by analysis of common strings in phishing URLs , 2009, 2009 eCrime Researchers Summit.

[12]  Jianhua Sun,et al.  Malicious Websites Detection and Search Engine Protection , 2013 .

[13]  Anazida Zainal,et al.  Feature extraction process: A phishing detection approach , 2013, 2013 13th International Conference on Intellient Systems Design and Applications.

[14]  Xiaotie Deng,et al.  Phishing Web page detection , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[15]  Lawrence K. Saul,et al.  Identifying suspicious URLs: an application of large-scale online learning , 2009, ICML '09.

[16]  Giovanni Vigna,et al.  Prophiler: a fast filter for the large-scale detection of malicious web pages , 2011, WWW.

[17]  Lei Li,et al.  Naive Bayes classification algorithm based on small sample set , 2011, 2011 IEEE International Conference on Cloud Computing and Intelligence Systems.

[18]  Lawrence K. Saul,et al.  Beyond blacklists: learning to detect malicious web sites from suspicious URLs , 2009, KDD.

[19]  Carolyn Penstein Rosé,et al.  CANTINA+: A Feature-Rich Machine Learning Framework for Detecting Phishing Web Sites , 2011, TSEC.

[20]  Joby James,et al.  Detection of phishing URLs using machine learning techniques , 2013, 2013 International Conference on Control Communication and Computing (ICCC).