The determinants of crowdfunding success: A semantic text analytics approach

In the era of the Social Web, crowdfunding has become an increasingly more important channel for entrepreneurs to raise funds from the crowd to support their startup projects. Previous studies examined various factors such as project goals, project durations, and categories of projects that might influence the outcomes of the fund raising campaigns. However, textual information of projects has rarely been studied for analyzing crowdfunding successes. The main contribution of our research work is the design of a novel text analytics-based framework that can extract latent semantics from the textual descriptions of projects to predict the fund raising outcomes of these projects. More specifically, we develop the Domain-Constraint Latent Dirichlet Allocation (DC-LDA) topic model for effective extraction of topical features from texts. Based on two real-world crowdfunding datasets, our experimental results reveal that the proposed framework outperforms a classical LDA-based method in predicting fund raising success by an average of 11% in terms of F1 score. The managerial implication of our research is that entrepreneurs can apply the proposed methodology to identify the most influential topical features embedded in project descriptions, and hence to better promote their projects and improving the chance of raising sufficient funds for their projects. A text analytics methodology is proposed for analyzing and predicting crowdfunding success.Domain-constraint LDA is developed to extract semantic features from texts.State-of-the-art feature selection and data mining methods are explored.Our research contributes to advance the computational method for analyzing crowdfunding success.

[1]  Pilsung Kang,et al.  Late payment prediction models for fair allocation of customer contact lists to call center agents , 2016, Decis. Support Syst..

[2]  Ethan Mollick The Dynamics of Crowdfunding: An Exploratory Study , 2014 .

[3]  Devashish Mitra,et al.  The role of crowdfunding in entrepreneurial finance , 2012 .

[4]  Paul Belleflamme,et al.  Crowdfunding: Tapping the Right Crowd , 2013, SSRN Electronic Journal.

[5]  Max Welling,et al.  Fast collapsed gibbs sampling for latent dirichlet allocation , 2008, KDD.

[6]  Rick Wash,et al.  The Value of Completing Crowdfunding Projects , 2013, ICWSM.

[7]  Raymond Y. K. Lau,et al.  Product aspect extraction supervised with online domain knowledge , 2014, Knowl. Based Syst..

[8]  Tim Kappel,et al.  Ex Ante Crowdfunding and the Recording Industry: A Model for the U.S. , 2009 .

[9]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[10]  Hua Xu,et al.  Constrained LDA for Grouping Product Features in Opinion Mining , 2011, PAKDD.

[11]  José Alberto García-Avilés,et al.  CROWDFUNDING AND NON-PROFIT MEDIA , 2012 .

[12]  Elizabeth Gerber,et al.  Crowdfunding support tools: predicting success & failure , 2013, CHI Extended Abstracts.

[13]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[14]  Raymond Y. K. Lau,et al.  A Probabilistic Generative Model for Mining Cybercriminal Networks from Online Social Media , 2014, IEEE Computational Intelligence Magazine.

[15]  Bing Liu,et al.  Topic Modeling using Topics from Many Domains, Lifelong Learning and Big Data , 2014, ICML.

[16]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[17]  Li-Te Cheng,et al.  Crowdfunding inside the enterprise: employee-initiatives for innovation and collaboration , 2013, CHI.

[18]  Elizabeth Gerber,et al.  Crowdfunding , 2013, ACM Trans. Comput. Hum. Interact..

[19]  Ee-Peng Lim,et al.  On strategies for imbalanced text classification using SVM: A comparative study , 2009, Decis. Support Syst..

[20]  Yue Lu,et al.  Opinion integration through semi-supervised topic modeling , 2008, WWW.

[21]  Bing Liu,et al.  Mining topics in documents: standing on the shoulders of big data , 2014, KDD.

[22]  Anindya Ghose,et al.  An Empirical Examination of Users' Information Hiding in a Crowdfunding Context , 2013, ICIS.

[23]  Hang Li,et al.  Named entity mining from click-through data using weakly supervised latent dirichlet allocation , 2009, KDD.

[24]  M. A. H. Farquad,et al.  Preprocessing unbalanced data using support vector machine , 2012, Decis. Support Syst..

[25]  Paul Belleflamme,et al.  Individual crowdfunding practices , 2013, Venture Capital.

[26]  Ivan Titov,et al.  Modeling online reviews with multi-grain topic models , 2008, WWW.

[27]  Chih-Fong Tsai,et al.  Combining multiple feature selection methods for stock prediction: Union, intersection, and multi-intersection approaches , 2010, Decis. Support Syst..

[28]  A. Ghose,et al.  Cultural Differences and Geography as Determinants of Online Prosocial Lending , 2014, MIS Q..

[29]  Gordon Burtch,et al.  Reducing Medical Bankruptcy Through Crowdfunding: Evidence from GiveForward , 2014, ICIS.

[30]  Padhraic Smyth,et al.  Statistical entity-topic models , 2006, KDD '06.

[31]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[32]  Xiaojin Zhu,et al.  Incorporating domain knowledge into topic modeling via Dirichlet Forest priors , 2009, ICML '09.

[33]  Hongnian Yu,et al.  Mutual information based input feature selection for classification problems , 2012, Decis. Support Syst..

[34]  Vadlamani Ravi,et al.  Detection of financial statement fraud and feature selection using data mining techniques , 2011, Decis. Support Syst..

[35]  Elizabeth Gerber,et al.  Understanding the role of community in crowdfunding work , 2014, CSCW.

[36]  Siddhartha Bhattacharyya,et al.  Data mining for credit card fraud: A comparative study , 2011, Decis. Support Syst..

[37]  Tanja Aitamurto THE IMPACT OF CROWDFUNDING ON JOURNALISM , 2011 .

[38]  Eric Gilbert,et al.  The language that gets people to give: phrases that predict success on kickstarter , 2014, CSCW.

[39]  Frank Kleemann,et al.  Un(der)paid innovators: the commercial utilization of consumer work through crowdsourcing , 2008 .

[40]  David E. Goldberg,et al.  Genetic algorithms and Machine Learning , 1988, Machine Learning.

[41]  A. Parasuraman,et al.  1 CROWDFUNDING : TRANSFORMING CUSTOMERS INTO INVESTORS THROUGH INNOVATIVE , 2011 .

[42]  Marcos André Gonçalves,et al.  BROOF: Exploiting Out-of-Bag Errors, Boosting and Random Forests for Effective Automated Classification , 2015, SIGIR.

[43]  K. Selvakuberan,et al.  Combined Feature Selection and classification – A novel approach for the categorization of web pages , 2008 .

[44]  Elizabeth Gerber,et al.  Design principles: crowdfunding as a creativity support tool , 2012, CHI Extended Abstracts.

[45]  Philip S. Yu,et al.  Inferring the impacts of social media on crowdfunding , 2014, WSDM.

[46]  Vincent Etter,et al.  Launch hard or go home!: predicting the success of kickstarter campaigns , 2013, COSN '13.

[47]  A. Stemler The JOBS Act and Crowdfunding: Harnessing the Power – and Money – of the Masses , 2013 .

[48]  Anindya Ghose,et al.  An Empirical Examination of Peer Referrals in Online Crowdfunding , 2014, ICIS.