Big Data Processing Technologies in Distributed Information Systems

Abstract The analysis of Big data technologies was provided. An example of MapReduce paradigm application, uploading of big volumes of data, processing and analyzing of unstructured information and its distribution into the clustered database was provided. The article summarizes the concept of "big data". Examples of methods for working with arrays of unstructured data are given. The parallel system Resilient Distributed Datasets (RDD) is organized. The class of basic database operations was realized: database con-nection, table creation, getting in line id, returning all elements of the database, update, delete and create the line.

[1]  Nataliia Melnykova,et al.  The New Approaches of Heterogeneous Data Consolidation , 2018, 2018 IEEE 13th International Scientific and Technical Conference on Computer Sciences and Information Technologies (CSIT).

[2]  Andrea De Mauro,et al.  A formal definition of Big Data based on its essential features , 2016 .

[3]  Jianfeng Tang,et al.  The NoSQL Principles and Basic Application of Cassandra Model , 2012, 2012 International Conference on Computer Science and Service System.

[4]  Natalia Kryvinska,et al.  Web intelligence in practice , 2014, J. Serv. Sci. Res..

[5]  Natalia Kryvinska,et al.  Building consistent formal specification for the service enterprise agility foundation , 2012, J. Serv. Sci. Res..

[6]  Reynold Xin,et al.  Apache Spark , 2016 .

[7]  Piet Daas,et al.  Big Data as a Source for Official Statistics , 2015 .

[8]  Siddharth Swarup Rautaray,et al.  Big Data Analytics for Medical Applications , 2018 .

[9]  Pete Wyckoff,et al.  Hive - A Warehousing Solution Over a Map-Reduce Framework , 2009, Proc. VLDB Endow..

[10]  Yuriy Syerov,et al.  Verifying the Medical Specialty from User Profile of Online Community for Health-Related Advices , 2018, IDDM.

[11]  Henry Markram,et al.  Real-Time Computing Without Stable States: A New Framework for Neural Computation Based on Perturbations , 2002, Neural Computation.

[12]  Cong Wang,et al.  Toward publicly auditable secure cloud data storage services , 2010, IEEE Network.

[13]  M. Janssen,et al.  Factors influencing big data decision-making quality , 2017 .