Understanding Financial News with Multi-layer Network Analysis

What is in the news? We address this question by constructing and comparing multi-layer networks from different sources. The layers consist of the same nodes (hence multiplex networks), but links are constructed from textual news on one hand, and empirical data on the other hand. Nodes represent entities of interest, recognized in the news. From the news, links are extracted from significant co-occurrences of entities, and from strong positive and negative sentiment associated with the co-occurrences. In a case study, the observed entities are 50 countries, extracted from more than 1.3 million financial news acquired over a period of 2 years. The empirical network layers are constructed from the geographical proximity, the trade connections, and from correlations between financial indicators of the same countries. Different network comparison metrics are used to explore the similarity between the news and the empirical networks. We examine the overlap of the most important links in the constructed networks, and compare their structural similarity by node centrality and main k-cores. The comparative analysis reveals that the co-occurrences of countries in the news most closely match their geographical proximity, while positive sentiment links most closely match the trade connections between the countries. Correlations between financial indicators have the lowest similarity to financial news.

[1]  K. Pearson VII. Note on regression and inheritance in the case of two parents , 1895, Proceedings of the Royal Society of London.

[2]  Stephen B. Seidman,et al.  Network structure and minimum degree , 1983 .

[3]  Guido Caldarelli,et al.  Hypergraph topological quantities for tagged social networks , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Vinko Zlatic,et al.  Extraction of Temporal Networks from Term Co-Occurrences in Online Textual Sources , 2014, PloS one.

[5]  Petra Kralj Novak,et al.  Analysis of Financial News with NewsStream , 2015 .

[6]  Roded Sharan,et al.  The large-scale organization of the bacterial network of ecological co-occurrence interactions , 2010, Nucleic acids research.

[7]  Pei-Chun Lee,et al.  Mapping knowledge structure by keyword co-occurrence: a first look at journal papers in Technology Foresight , 2010, Scientometrics.

[8]  J. Aizenman,et al.  What is the Risk of European Sovereign Debt Defaults? Fiscal Space, CDS Spreads and Market Pricing of Risk , 2011 .

[9]  Guido Caldarelli,et al.  Scale-Free Networks , 2007 .

[10]  K. Singleton,et al.  Default and Recovery Implicit in the Term Structure of Sovereign CDS Spreads , 2008 .

[11]  Paul C. Tetlock Giving Content to Investor Sentiment: The Role of Media in the Stock Market , 2005, The Journal of Finance.

[12]  Petra Kralj Novak,et al.  Analysis of Financial News with NewsStream, Technical Report IJS-DP-11965 , 2015, ArXiv.

[13]  Vijay V. Raghavan,et al.  A critical investigation of recall and precision as measures of retrieval system performance , 1989, TOIS.

[14]  Sofus A. Macskassy,et al.  More than Words: Quantifying Language to Measure Firms' Fundamentals the Authors Are Grateful for Assiduous Research Assistance from Jie Cao and Shuming Liu. We Appreciate Helpful Comments From , 2007 .

[15]  Kent A. Spackman,et al.  Using co-occurrence network structure to extract synonymous gene and protein names from MEDLINE abstracts , 2005, BMC Bioinformatics.

[16]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[17]  K. Börner,et al.  Mapping topics and topic bursts in PNAS , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[18]  P. Bonacich Factoring and weighting approaches to status scores and clique identification , 1972 .

[19]  Philip Edmonds,et al.  Choosing the Word Most Typical in Context Using a Lexical Co-occurrence Network , 1997, ACL.

[20]  Yitzhak Pilpel,et al.  Global and Local Architecture of the Mammalian microRNA–Transcription Factor Regulatory Network , 2007, PLoS Comput. Biol..

[21]  Guido Caldarelli,et al.  Random hypergraphs and their applications , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[22]  Vittorio Loreto,et al.  Network properties of folksonomies , 2007, AI Commun..

[23]  Dennis M. Wilkinson,et al.  A method for finding communities of related genes , 2004, Proceedings of the National Academy of Sciences of the United States of America.