Identifying financially successful start-up profiles with data mining

Research highlights? Start-ups are crucial in the modern economy as they provide dynamism and growth. ? We identify configurations of initial resource bundles, strategy and environment that lead to superior performance in start-ups. ? We rely on data mining for the analysis because it accounts for the premises of configurational theory. ? The results confirm the existence of multivariate ideal configurations and equifinality. Start-ups are crucial in the modern economy as they provide dynamism and growth. Research on the performance of new ventures increasingly investigates initial resources as determinants of success. Initial resources are said to be important because they imprint the firm at start-up, limit its strategic choices, and continue to impact its performance in the long run. The purpose of this paper is to identify configurations of initial resource bundles, strategy and environment that lead to superior performance in start-ups. To date, interdependencies between resources on the one hand and between resources, strategy and environment on the other hand have been neglected in empirical research. We rely on data mining for the analysis because it accounts for premises of configurational theory, including reversed causality, intradimensional interactions, multidimensional dependencies, and equifinality. We apply advanced data mining techniques, in the form of rule extraction from non-linear support vector machines, to induce accurate and comprehensible configurations of resource bundles, strategy and environment. We base our analysis on an extensive survey among 218 Flemish start-ups. Our experiments indicate the good performance of rule extraction technique ALBA. Finally, for comprehensibility, intuitiveness and implementation reasons, the tree is transformed into a decision table.

[1]  Jeff W. Trailer,et al.  Measuring performance in entrepreneurship research , 1996 .

[2]  Nick Bontis,et al.  The knowledge-based view of the firm and its theoretical precursor , 2006 .

[3]  Bart Baesens,et al.  An empirical evaluation of the comprehensibility of decision table, tree and rule based predictive models , 2011, Decis. Support Syst..

[4]  Barbara von Halle,et al.  Business Rules Applied: Building Better Systems Using the Business Rules Approach , 2001 .

[5]  J. Barney,et al.  Resource-Based Theory: Creating and Sustaining Competitive Advantage , 2007 .

[6]  Jan Vanthienen,et al.  An Illustration of Verification and Validation in the Modelling Phase of KBS Development , 1998, Data Knowl. Eng..

[7]  Bart Baesens,et al.  Including Domain Knowledge in Customer Churn Prediction Using AntMiner+ , 2009, Industrial Conference on Data Mining - Workshop DMM.

[8]  David A. Cohn,et al.  Improving generalization with active learning , 1994, Machine Learning.

[9]  Warren Boeker,et al.  Strategic Change: The Effects Of Founding And History , 1989 .

[10]  David L. Deeds,et al.  The impact of stocks and flows of organizational knowledge on firm performance: an empirical investigation of the biotechnology industry , 1999 .

[11]  P. Pompe,et al.  The prediction of bankruptcy of small- and medium-sized industrial firms , 2005 .

[12]  Bart Baesens,et al.  Ant-Based Approach to the Knowledge Fusion Problem , 2006, ANTS Workshop.

[13]  J. Barney Firm Resources and Sustained Competitive Advantage , 1991 .

[14]  Steven H. Hanks,et al.  An examination of the substitutability of founders human and financial capital in emerging business ventures , 1998 .

[15]  Ganesh D. Bhatt,et al.  Knowledge management in organizations: examining the interaction between technologies, techniques, and people , 2001, J. Knowl. Manag..

[16]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[17]  Bart Baesens,et al.  Building comprehensible customer churn prediction models with advanced rule induction techniques , 2011, Expert Syst. Appl..

[18]  Vipin Kumar,et al.  Introduction to Data Mining, (First Edition) , 2005 .

[19]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[20]  T. Douglas,et al.  The temporal nature of growth determinants in new bank foundings: implications for new venture research design , 2004 .

[21]  Carolyn Y. Woo,et al.  Initial Human and Financial Capital as Predictors of New Venture Performance , 1994 .

[22]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[23]  Bart Baesens,et al.  Predicting going concern opinion with data mining , 2008, Decis. Support Syst..

[24]  Bart Baesens,et al.  Benchmarking Classification Models for Software Defect Prediction: A Proposed Framework and Novel Findings , 2008, IEEE Transactions on Software Engineering.

[25]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[26]  Johan A. K. Suykens,et al.  Benchmarking Least Squares Support Vector Machine Classifiers , 2004, Machine Learning.

[27]  Johan A. K. Suykens,et al.  Benchmarking state-of-the-art classification algorithms for credit scoring , 2003, J. Oper. Res. Soc..

[28]  B. Wernerfelt,et al.  A Resource-Based View of the Firm , 1984 .

[29]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[30]  Bart Baesens,et al.  Decompositional Rule Extraction from Support Vector Machines by Active Learning , 2009, IEEE Transactions on Knowledge and Data Engineering.

[31]  Bart Baesens,et al.  Comprehensible Credit Scoring Models Using Rule Extraction from Support Vector Machines , 2007, Eur. J. Oper. Res..

[32]  Monique Snoeck,et al.  Classification With Ant Colony Optimization , 2007, IEEE Transactions on Evolutionary Computation.

[33]  K. Boal,et al.  Strategic resources: Traits, configurations and paths to sustainable competitive advantage , 2007 .

[34]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.

[35]  Per Davidsson,et al.  Smallness, Newness and Regional Development , 1998 .

[36]  P. P. McDougall,et al.  An examination of the impact of initial founding conditions and decisions upon the performance of new bank start-ups , 2000 .