An influence assessment method based on co-occurrence for topologically reduced big data sets

The extraction of meaningful, accurate, and relevant information is at the core of Big Data research. Furthermore, the ability to obtain an insight is essential in any decision-making process, even though the diverse and complex nature of big data sets raises a multitude of challenges. In this paper, we propose a novel method to address the automated assessment of influence among concepts in big data sets. This is carried out by investigating their mutual co-occurrence, which is determined via topologically reducing the corresponding network. The main motivation is to provide a toolbox to classify and analyse influence properties, which can be used to investigate their dynamical and statistical behaviour, potentially leading to a better understanding and prediction of the properties of the system(s) they model. An evaluation was carried out on two real-world data sets, which were analysed to test the capabilities of our system. The results show the potential of our approach, indicating both accuracy and efficiency.

[1]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[2]  Marcello Trovati,et al.  Reduced Topologically Real-World Networks: A Big-Data Approach , 2015, Int. J. Distributed Syst. Technol..

[3]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[4]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[5]  K. Menger Zur allgemeinen Kurventheorie , 1927 .

[6]  Qi He,et al.  TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.

[7]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[8]  Dan I. Moldovan,et al.  Causal Relation Extraction , 2008, LREC.

[9]  Béla Bollobás,et al.  Modern Graph Theory , 2002, Graduate Texts in Mathematics.

[10]  Ali Sheikhani,et al.  Localization of premature ventricular contraction foci in normal individuals based on multichannel electrocardiogram signals processing , 2013, SpringerPlus.

[11]  M. Newman,et al.  Why social networks are different from other types of networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  Nik Bessis,et al.  Extraction, Identification, and Ranking of Network Structures from Data Sets , 2014, 2014 Eighth International Conference on Complex, Intelligent and Software Intensive Systems.

[13]  Massimo Poesio,et al.  Acquiring Bayesian Networks from Text , 2004, LREC.

[14]  Natalia Kryvinska,et al.  Strategic Demands on Information Services in Uncertain Businesses: A Layer-Based Framework from a Value Network Perspective , 2011, 2011 International Conference on Emerging Intelligent Data and Web Technologies.

[15]  Nik Bessis,et al.  Interconnectedness of Complex Systems of Internet of Things through Social Network Analysis for Disaster Management , 2012, 2012 Fourth International Conference on Intelligent Networking and Collaborative Systems.

[16]  Lise Getoor,et al.  Relationship Identification for Social Network Discovery , 2007, AAAI.

[17]  Gerhard Hindricks,et al.  Relevant ventricular septal defect caused by steam pop during ablation of premature ventricular contraction. , 2013, Circulation.

[18]  Michael Schwind,et al.  Scale-free networks , 2006, Wirtschaftsinf..

[19]  Nik Bessis,et al.  An Analytical Tool to Map Big Data to Networks with Reduced Topologies , 2014, 2014 International Conference on Intelligent Networking and Collaborative Systems.

[20]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[21]  Mukesh K. Mohania,et al.  Cloud Computing and Big Data Analytics: What Is New from Databases Perspective? , 2012, BDA.

[22]  Jonathan Schaffer,et al.  Causation, influence, and effluence , 2001 .

[23]  Aboul Ella Hassanien,et al.  Dimensionality reduction of medical big data using neural-fuzzy classifier , 2014, Soft Computing.

[24]  S. Bornholdt,et al.  Scale-free topology of e-mail networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  Jonathan D. Wren Using fuzzy set theory and scale-free network properties to relate MEDLINE terms , 2006, Soft Comput..

[26]  Marcello Trovati,et al.  Influence Discovery in Semantic Networks: An Initial Approach , 2014, 2014 UKSim-AMSS 16th International Conference on Computer Modelling and Simulation.