RERS BASED ON BEHAVIOR ANALYSIS

The growing popularity of cyberlocker service has led to significant impact on the Internet that it is considered as one of the biggest contributors to the global Internet traffic estimated to be illegally traded content. Due to the anonymity property of cyberlocker, it is difficult for investigators to track user identity directly on cyberlocker site. In order to find the potential relationships between cyberlocker users, we propose a framework to collect cyberlocker related data from public forums where cyberlocker users usually distribute cyberlocker links for others to download and identity information can be gathered easily. Different kinds of sharing behaviors of forum user are extracted to build the profile, which is then analyzed with statistical techniques. The experiment results demonstrate that the framework can effectively detect profiles with similar behaviors for identity tracking and produce a taxonomy of forum users to provide insights for investigating cyberlocker-based piracy.

[1]  K. P. Chow,et al.  Profiling and Tracking a Cyberlocker Link Sharer in a Public Web Forum , 2015, IFIP Int. Conf. Digital Forensics.

[2]  Pramod Patil,et al.  Web Forum Crawling Techniques , 2014 .

[3]  Fionn Murtagh,et al.  Ward’s Hierarchical Agglomerative Clustering Method: Which Algorithms Implement Ward’s Criterion? , 2011, Journal of Classification.

[4]  Sanjeev R. Kulkarni,et al.  Measurement and understanding of Cyberlocker URL-sharing sites: Focus on movie files , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[5]  E. Hickey,et al.  Serial Murderers and their Victims , 1991 .

[6]  Patrick Bellot,et al.  Characteristics analysis for the cyberlockers files study on Rapidgator Speedyshare and 1Fichier , 2013, 8th International Conference for Internet Technology and Secured Transactions (ICITST-2013).

[7]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[8]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .