I call BS: Fraud Detection in Crowdfunding Campaigns

Donations to charity-based crowdfunding platforms have been on the rise in the last few years. Unsurprisingly, deception and fraud on such platforms have also increased, but they have not been studied thoroughly enough to understand which characteristics expose such behavior and allow it to be detected and blocked automatically. Typically, the crowdfunding platforms themselves are the only parties overseeing the campaigns launched on their services. However, they are not properly incentivized to combat fraud among users and the campaigns they launch: on the one hand, a platform's revenue is directly proportional to the number of transactions performed (since the platform charges a fixed amount per donation); on the other hand, a platform that is transparent about how much fraud it hosts may discourage potential donors from participating. In this paper, we take the first step in studying fraud in crowdfunding campaigns. We analyze data collected from different crowdfunding platforms and annotate 700 campaigns as fraudulent or not. We compute various textual and image-based features and study their distributions and how they are associated with campaign fraud. Using these features, we build machine learning classifiers and show that fraudulent campaigns can be classified automatically with up to 90.14% accuracy and 96.01% AUC, using only features available from the campaign's description at the moment of publication (i.e., with no user or money activity), which makes our method applicable to real-time operation in a user's browser.
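
To make the two reported metrics concrete, the following is a minimal sketch of how accuracy and AUC can be computed from a classifier's output. It is not the paper's evaluation code: the function names, the 0.5 decision threshold, and the toy labels and scores are assumptions chosen for illustration only. Labels use 1 for a fraudulent campaign and 0 for a legitimate one.

```python
from typing import Sequence


def accuracy(labels: Sequence[int], scores: Sequence[float],
             threshold: float = 0.5) -> float:
    """Fraction of campaigns whose thresholded score matches the label.

    The 0.5 threshold is an illustrative assumption, not a value from the paper.
    """
    predictions = [1 if s >= threshold else 0 for s in scores]
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen fraudulent campaign
    receives a higher score than a randomly chosen legitimate one,
    with ties counted as half a win.
    """
    fraud = [s for s, y in zip(scores, labels) if y == 1]
    legit = [s for s, y in zip(scores, labels) if y == 0]
    if not fraud or not legit:
        raise ValueError("AUC needs at least one campaign of each class")
    wins = 0.0
    for f in fraud:
        for g in legit:
            if f > g:
                wins += 1.0
            elif f == g:
                wins += 0.5
    return wins / (len(fraud) * len(legit))


# Hypothetical toy data: six campaigns with made-up classifier scores.
labels = [1, 0, 1, 0, 0, 1]
scores = [0.9, 0.2, 0.4, 0.6, 0.1, 0.8]

print(f"accuracy = {accuracy(labels, scores):.4f}")
print(f"AUC      = {roc_auc(labels, scores):.4f}")
```

Accuracy depends on the chosen threshold, whereas AUC summarizes ranking quality across all thresholds, which is why the two figures can differ substantially for the same classifier.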
