Corporate Communication Network and Stock Price Movements: Insights From Data Mining

Grounded on communication theories, we propose to use a data-mining algorithm to detect communication patterns within a company to determine if such patterns may reveal the performance of the company. Specifically, we would like to find out whether or not there exist any association relationships between the frequency of e-mail exchange of the key employees in a company and the performance of the company as reflected in its stock prices. If such relationships do exist, we would also like to know whether or not the company’s stock price could be accurately predicted based on the detected relationships. To detect the association relationships, a data-mining algorithm is proposed here to mine e-mail communication records and historical stock prices so that based on the detected relationship, rules that can predict changes in stock prices can be constructed. Using the data-mining algorithm and a set of publicly available Enron e-mail corpus and Enron’s stock prices recorded during the same period, we discovered the existence of interesting, statistically significant, association relationships in the data. In addition, we also discovered that these relationships can predict stock price movements with an average accuracy of around 80%. The results confirm the belief that corporate communication has identifiable patterns and such patterns can reveal meaningful information of corporate performance as reflected by such indicators as stock market performance. Given the increasing popularity of social networks, the mining of interesting communication patterns could provide insights into the development of many useful applications in many areas.

[1]  Muh-Cherng Wu,et al.  An effective application of decision tree to stock trading , 2006, Expert Syst. Appl..

[2]  S. Haberman The Analysis of Residuals in Cross-Classified Tables , 1973 .

[3]  Ralph Katz,et al.  Communication Patterns, Project Performance and Task Characteristics: An Empirical Evaluation and Integration in An R&D Setting , 2017 .

[4]  R. Katz The Effects of Group Longevity on Project Communication and Performance. , 1982 .

[5]  William W. Cohen,et al.  Recommending Recipients in the Enron Email Corpus , 1972 .

[6]  P. M. Chawan,et al.  Study of Data Mining Techniques used for Financial Data Analysis , 2013 .

[7]  Li Bing,et al.  Public Sentiment Analysis in Twitter Data for Prediction of a Company's Stock Price Movements , 2014, 2014 IEEE 11th International Conference on e-Business Engineering.

[8]  Andrew McCallum,et al.  Topic and Role Discovery in Social Networks with Experiments on Enron and Academic Email , 2007, J. Artif. Intell. Res..

[9]  Terrill L. Frantz,et al.  Communication Networks from the Enron Email Corpus “It's Always About the People. Enron is no Different” , 2005, Comput. Math. Organ. Theory.

[10]  Yang Wang,et al.  From Association to Classification: Inference Using Weight of Evidence , 2003, IEEE Trans. Knowl. Data Eng..

[11]  B. Wernerfelt,et al.  DETERMINANTS OF FIRM PERFORMANCE: THE RELATIVE IMPORTANCE OF ECONOMIC AND ORGANIZATIONAL FACTORS , 1989 .

[12]  Phichhang Ou,et al.  Prediction of Stock Market Index Movement by Ten Data Mining Techniques , 2009 .

[13]  Bin Gu,et al.  Do online reviews matter? - An empirical investigation of panel data , 2008, Decis. Support Syst..

[14]  R. Dolphin The strategic role of investor relations , 2004 .

[15]  Manisha Gahirwal,et al.  Inter Time Series Sales Forecasting , 2013, 1303.0117.

[16]  B. Fingleton Models of Category Counts , 1984 .

[17]  Ronaldo Menezes,et al.  Assessing organizational stability via network analysis , 2009, 2009 IEEE Symposium on Computational Intelligence for Financial Engineering.

[18]  Binoy B. Nair,et al.  A GA-artificial neural network hybrid system for financial time series forecasting , 2011 .

[19]  W. Burke,et al.  A Causal Model of Organizational Performance and Change , 1992 .

[20]  David J. T. Sumpter,et al.  A Dynamical Approach to Stock Market fluctuations , 2011, Int. J. Bifurc. Chaos.

[21]  Deborah J. Barrett Change communication: using strategic employee communication to facilitate major change , 2002 .

[22]  Andrew K. C. Wong,et al.  DECA: A Discrete-Valued Data Clustering Algorithm , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Yiming Yang,et al.  The Enron Corpus: A New Dataset for Email Classi(cid:12)cation Research , 2004 .

[24]  Carol Anne Hargreaves,et al.  Prediction of Stock Performance Using Analytical Techniques , 2013 .

[25]  Jaideep Srivastava,et al.  Who Thinks Who Knows Who? Socio-cognitive Analysis of Email Networks , 2006, Sixth International Conference on Data Mining (ICDM'06).

[26]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[27]  Andrew K. C. Wong,et al.  Learning sequential patterns for probabilistic inductive prediction , 1994 .

[28]  G. Miller Sociology. Social scientists wade into the tweet stream. , 2011, Science.

[29]  Enireddy. Vamsidhar,et al.  Prediction of Rainfall Using Backpropagation Neural Network Model , 2010 .

[30]  Ramandeep S. Randhawa,et al.  The Auditor's Slippery Slope: An Analysis of Reputational Incentives , 2010, Manag. Sci..

[31]  Douglas W. Oard,et al.  Making sense of archived e-mail: Exploring the Enron collection with NetLens , 2010 .

[32]  France T́elécom,et al.  Optimal Bin Number for Equal Frequency Discretizations in Supervized Learning , 2007 .

[33]  Richard B. Higgins,et al.  How corporate communication of strategy affects share price , 1992 .

[34]  Binoy B. Nair,et al.  A Stock Market Trend Prediction System Using a Hybrid Decision Tree-Neuro-Fuzzy System , 2010, 2010 International Conference on Advances in Recent Technologies in Communication and Computing.

[35]  Wei Zhang,et al.  Detect community structure from the Enron Email Corpus Based on Link Mining , 2006, Sixth International Conference on Intelligent Systems Design and Applications.

[36]  C. Marston,et al.  Investor relations: a European survey , 2001 .

[37]  Jafar Adibi,et al.  The Enron Email Dataset Database Schema and Brief Statistical Report , 2004 .

[38]  Eric Gilbert,et al.  Widespread Worry and the Stock Market , 2010, ICWSM.

[39]  A. Parasuraman,et al.  Communication and Control Processes in the Delivery of Service Quality , 1988 .

[40]  V. P. Mohandas,et al.  Predicting the BSE Sensex: Performance comparison of adaptive linear element, feed forward and time delay neural networks , 2012, 2012 International Conference on Power, Signals, Controls and Computation.

[41]  Rajashree Dash,et al.  Comparative Analysis of Supervised and Unsupervised Discretization Techniques , 2011 .

[42]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[43]  Munmun De Choudhury,et al.  Can blog communication dynamics be correlated with stock market activity? , 2008, Hypertext.

[44]  Phillip G. Clampitt,et al.  Employee Perceptions of the Relationship Between Communication and Productivity: A Field Study , 1993 .