The Rise of Big Data Science: A Survey of Techniques, Methods and Approaches in the Field of Natural Language Processing and Network Theory

The continuous creation of data poses new research challenges because of its complexity, diversity and volume. Consequently, Big Data has increasingly come to be recognised as a scientific field in its own right. This article provides an overview of current research efforts in Big Data science, with particular emphasis on its applications as well as its theoretical foundations.
