Analysis and Solution Model of Distributed Computing in Scientific Calculations

Processing huge amounts of data is currently of concern in various fields of science and commercial data processing, such as pharmaceutical drug development, astronomical probe data processing, security analysis of large amounts of communication data, etc. Generally, centrally administered methods are used, but their employment and operation are very expensive. The aim of this paper is to present a model of high-capacity data processing that is based on the technology of Apache Hadoop with emphasis on use of volunteer host devices with the service distribution via the Internet.

[1]  Hitesh S. Nalamwar,et al.  Efficient data management tools for the heterogeneous big data warehouse , 2016 .

[2]  Feng Xia,et al.  MapReduce: Review and open challenges , 2016, Scientometrics.

[3]  Ondrej Krejcar,et al.  Use of cloud computing in biomedicine , 2016, Journal of biomolecular structure & dynamics.

[5]  Nong Xiao,et al.  A Configuration Management Study to Fast Massive Writing for Distributed NoSQL System , 2016, IEICE Trans. Inf. Syst..

[6]  Victor I. Chang,et al.  Composable architecture for rack scale big data computing , 2017, Future Gener. Comput. Syst..

[7]  Josef Horalek,et al.  The deployment of Security Information and Event Management in cloud infrastructure , 2015, 2015 25th International Conference Radioelektronika (RADIOELEKTRONIKA).

[8]  Kinshuk,et al.  A framework for scheduling and managing big data applications in a distributed infrastructure , 2015, 2015 Seventh International Conference on Advanced Computing (ICoAC).

[9]  Weishan Zhang,et al.  A genetic algorithm-based job scheduling model for big data analytics , 2016, EURASIP Journal on Wireless Communications and Networking.

[10]  Vladimir Sobeslav,et al.  OpenSource Automation in Cloud Computing , 2015 .

[11]  Kiyoshi Kobayashi,et al.  Big data-based deterioration prediction models and infrastructure management: towards assetmetrics , 2017 .

[12]  Nada Lavrac,et al.  ClowdFlows: Online workflows for distributed big data mining , 2017, Future Gener. Comput. Syst..

[13]  Yasuhiro Nakamura,et al.  Classification of HTTP automated software communication behaviour using NoSql database , 2016, 2016 International Conference on Electronics, Information, and Communications (ICEIC).