A Supervised Approach to Predict Company Acquisition with Factual and Topic Features Using Profiles and News Articles on TechCrunch

Merger and Acquisition (M&A) prediction has been an interesting and challenging research topic in the past a few decades. However, past work has only adopted numerical features in building models, and yet the valuable textual information from the great variety of social media sites has not been touched at all. To fully explore this information, we used the profiles and news articles for companies and people on TechCrunch, the leading and largest public database for the tech world, which anybody can edit. Specifically, we explored topic features via topic modeling techniques, as well as a set of other novel features of our design within a machine learning framework. We conducted experiments of the largest scale in the literature, and achieved a high true positive rate (TP) between 60% to 79.8% with a false positive rate (FP) mostly between 0% and 8.3% over company categories with a small number of missing attributes in the CrunchBase profiles.

[1]  Edward I. Altman,et al.  FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND THE PREDICTION OF CORPORATE BANKRUPTCY , 1968 .

[2]  Fotios Pasiouras,et al.  Financial characteristics of banks involved in acquisitions: evidence from Asia , 2007 .

[3]  Sungbin Cho,et al.  A hybrid approach based on the combination of variable selection using decision trees and case-based reasoning using the Mahalanobis distance: For bankruptcy prediction , 2010, Expert Syst. Appl..

[4]  Constantin Zopounidis,et al.  Prediction of company acquisition in Greece by means of the rough set approach , 1997, Eur. J. Oper. Res..

[5]  E. Deakin Discriminant Analysis Of Predictors Of Business Failure , 1972 .

[6]  Mehryar Mohri,et al.  AUC Optimization vs. Error Rate Minimization , 2003, NIPS.

[7]  Chih-Ping Wei,et al.  Patent Analysis for Supporting Merger and Acquisition (M&A) Prediction: A Data Mining Approach , 2008, WEB.

[8]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[9]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[10]  Jyrki Ali-Yrkkö,et al.  Does patenting increase the probability of being acquired? Evidence from cross-border and domestic acquisitions , 2005 .

[11]  Brendan T. O'Connor,et al.  From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series , 2010, ICWSM.

[12]  W. Beaver Financial Ratios As Predictors Of Failure , 1966 .

[13]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[14]  Gordon V. Karels,et al.  Multivariate Normality and Forecasting of Business Bankruptcy , 1987 .

[15]  Kai A. Konrad,et al.  Merger Target Selection and Financial Structure , 2002 .

[16]  Kyung-shik Shin,et al.  An application of support vector machines in bankruptcy prediction model , 2005, Expert Syst. Appl..

[17]  Ramesh Sharda,et al.  Bankruptcy prediction using neural networks , 1994, Decis. Support Syst..

[18]  David L. Olson,et al.  Comparative analysis of data mining methods for bankruptcy prediction , 2012, Decis. Support Syst..

[19]  Michele Banko,et al.  Mitigating the Paucity-of-Data Problem: Exploring the Effect of Training Corpus Size on Classifier Performance for Natural Language Processing , 2001, HLT.

[20]  Susan T. Dumais,et al.  Characterizing Microblogs with Topic Models , 2010, ICWSM.

[21]  Bijayananda Naik,et al.  Predicting Corporate Acquisitions: An Application of Uncertain Reasoning Using Rule Induction , 2003, Inf. Syst. Frontiers.

[22]  L. Gayle Rayburn,et al.  DEVELOPMENT OF PREDICTION MODELS FOR HORIZONTAL AND VERTICAL MERGERS , 1997 .