论文信息 - Measuring and Understanding Crowdturfing in the App Store

Measuring and Understanding Crowdturfing in the App Store

Application marketplaces collect ratings and reviews from users to provide references for other consumers. Many crowdturfing activities abuse user reviews to manipulate the reputation of an app and mislead other consumers. To understand and improve the ecosystem of reviews in the app market, we investigate the existence of crowdturfing based on the App Store. This paper reports a measurement study of crowdturfing and its reviews in the App Store. We use a sliding window to obtain the relationship graph between users and the community detection method to binary classify the detected communities. Then, we measure and analyze the crowdturfing obtained from the classification and compare them with genuine users. We analyze several features of crowdturfing, such as ratings, sentiment scores, text similarity, and common words. We also investigate which apps crowdturfing often appears in and reveal their role in app ranking. These insights are used as features in machine learning models, and the results show that they can effectively train classifiers and detect crowdturfing reviews with an accuracy of up to 98.13%. This study reveals malicious crowdfunding practices in the App Store and helps to strengthen the review security of app marketplaces.

Xiaomei Zhang | Shilin Wang | Fangqi Li | Qi-Qi Hu | Zhushou Tang

[1] D. Ursino,et al. Extraction and analysis of text patterns from NSFW adult content in Reddit , 2022, Data Knowl. Eng..

[2] Marimuthu Palaniswami,et al. Identifying Groups of Fake Reviewers Using a Semisupervised Approach , 2021, IEEE Transactions on Computational Social Systems.

[3] Antonino Nocera,et al. Investigating negative reviews and detecting negative influencers in Yelp through a multi-dimensional social network based model , 2021, Int. J. Inf. Manag..

[4] Keping Yu,et al. Deep Graph neural network-based spammer detection under the perspective of heterogeneous cyberspace , 2021, Future Gener. Comput. Syst..

[5] Antonino Nocera,et al. Defining and detecting k-bridges in a social network: The Yelp case, and more , 2020, Knowl. Based Syst..

[6] Bo Liu,et al. Co-Detection of crowdturfing microblogs and spammers in online social networks , 2019, World Wide Web.

[7] Philip S. Yu,et al. Uncovering Download Fraud Activities in Mobile App Markets , 2019, 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[8] Muhammad Rifki Shihab,et al. Negative online reviews of popular products: understanding the effects of review proportion and quality on consumers’ attitude and intention to buy , 2019, Electron. Commer. Res..

[9] Yiqun Liu,et al. Detecting Crowdturfing "Add to Favorites" Activities in Online Shopping , 2018, WWW.

[10] Minhui Xue,et al. Fake reviews tell no tales? dissecting click farming in content-generated social networks , 2018, China Communications.

[11] Anna Cinzia Squicciarini,et al. Combating Crowdsourced Review Manipulators: A Neighborhood-Based Approach , 2018, WSDM.

[12] Ben Y. Zhao,et al. Automated Crowdturfing Attacks and Defenses in Online Review Systems , 2017, CCS.

[13] Wei Niu,et al. Crowdsourced App Review Manipulation , 2017, SIGIR.

[14] Zhuo Wang,et al. Detecting Review Spammer Groups via Bipartite Graph Projection , 2016, Comput. J..

[15] Jong Kim,et al. CrowdTarget: Target-based Detection of Crowdturfing in Online Social Networks , 2015, CCS.

[16] Sencun Zhu,et al. AppWatcher: unveiling the underground market of trading mobile app reviews , 2015, WISEC.

[17] Kyumin Lee,et al. Characterizing and automatically detecting crowdturfing in Fiverr and Twitter , 2015, Social Network Analysis and Mining.

[18] Venkatesan Guruswami,et al. CopyCatch: stopping group attacks by spotting lockstep behavior in social networks , 2013, WWW.

[19] Gang Wang,et al. Serf and turf: crowdturfing for fun and profit , 2011, WWW.

[20] Jean-Loup Guillaume,et al. Fast unfolding of communities in large networks , 2008, 0803.0476.

[21] L. Breiman. Random Forests , 2001, Machine Learning.