Join query optimization in the distributed database system using an artificial bee colony algorithm and genetic operators

As the main factor in the distributed database systems, query optimization is aimed at finding an optimal execution plan to reduce the runtime. In such systems, because of the repeated relations on various sites, the query optimization is very challenging. Moreover, the query optimization issue with large‐scale distributed databases is an NP‐hard problem. Therefore, in this paper, an Artificial Bee Colony Algorithm based on Genetic Operators (ABC‐GO) is proposed to find a solution to join the query optimization problems in the distributed database systems. The ABC algorithm has the global–local search capabilities and genetic operators to create new candidate solutions for improving the performance of the ABC algorithm. The obtained results have shown that the cost of the query evaluation is minimized and the quality of Top‐K query plans is improved for a given distributed query. Moreover, this method decreases the overhead. However, it needs a longer execution time.

[1]  Nima Jafari Navimipour,et al.  A new method for trust and reputation evaluation in the cloud environments using the recommendations of opinion leaders' entities and removing the effect of troll entities , 2016, Comput. Hum. Behav..

[2]  Vikram Singh,et al.  Distributed Query Plan generation using Aggregation based Multi-Objective Genetic Algorithm , 2014, ICTCS '14.

[3]  Wan-li Xiang,et al.  An efficient and robust artificial bee colony algorithm for numerical optimization , 2013, Comput. Oper. Res..

[4]  M. Tuba Artificial Bee Colony ( ABC ) Algorithm with Crossover and Mutation , 2012 .

[5]  Dennis McLeod,et al.  A Probe-Based Technique to Optimize Join Queries in Distributed Internet Databases , 2000, Knowledge and Information Systems.

[6]  Nima Jafari Navimipour,et al.  Nature inspired meta‐heuristic algorithms for solving the service composition problem in the cloud environments , 2018, Int. J. Commun. Syst..

[7]  Dr. P. K. Butey,et al.  Query Optimization by Genetic Algorithm , 2012 .

[8]  Zainudin Zukhri,et al.  A Hybrid Optimization Algorithm based on Genetic Algorithm and Ant Colony Optimization , 2013 .

[9]  Ridhi Kapoor Selectivity & Cost Estimates in Query Optimization in Distributed Databases , 2013 .

[10]  Frederico G. Guimarães,et al.  Query join ordering optimization with evolutionary multi-agent systems , 2014, Expert Syst. Appl..

[11]  Masatoshi Yoshikawa,et al.  Query processing for distributed databases using generalized semi-joins , 1982, SIGMOD '82.

[12]  Ataollah Ebrahimzadeh,et al.  Brain tissue segmentation using an unsupervised clustering technique based on PSO algorithm , 2010, 2010 17th Iranian Conference of Biomedical Engineering (ICBME).

[13]  Maciej Matysiak Efficient Optimization of Large Join Queries Using Tabu Search , 1995, Inf. Sci..

[14]  Noradin Ghadimi,et al.  Robust placement and tuning of UPFC via a new multiobjective scheme-based fuzzy theory , 2015, Complex..

[15]  Ming-Syan Chen,et al.  Using join operations as reducers in distributed query processing , 1990, DPDS '90.

[16]  Ayoub Bagheri,et al.  Finding shortest path with learning algorithms , 2008 .

[17]  Nima Jafari Navimipour,et al.  An improved genetic algorithm for task scheduling in the cloud environments using the priority queues: Formal verification, simulation, and statistical testing , 2017, J. Syst. Softw..

[18]  Dervis Karaboga,et al.  A novel binary artificial bee colony algorithm based on genetic operators , 2015, Inf. Sci..

[19]  Sherif Sakr,et al.  Big Data 2.0 Processing Systems: Taxonomy and Open Challenges , 2016, Journal of Grid Computing.

[20]  Noradin Ghadimi,et al.  Fuzzy stochastic long-term model with consideration of uncertainties for deployment of distributed energy resources using interactive honey bee mating optimization , 2014 .

[21]  Hamid Haj Seyyed Javadi,et al.  Application of Bees Algorithm in Multi-Join Query Optimization , 2012 .

[22]  Ahmad Habibizad Navin,et al.  Job scheduling in the Expert Cloud based on genetic algorithms , 2014, Kybernetes.

[23]  T. V. Vijay Kumar,et al.  Distributed query plan generation using BCO , 2015 .

[24]  Shyam Padia,et al.  Query Optimization Strategies in Distributed Databases , 2015 .

[25]  Nima Jafari Navimipour,et al.  Service allocation in the cloud environments using multi-objective particle swarm optimization algorithm based on crowding distance , 2017, Swarm Evol. Comput..

[26]  Noradin Ghadimi,et al.  Solving a novel multiobjective placement problem of recloser and distributed generation sources in simultaneous mode by improved harmony search algorithm , 2015, Complex..

[27]  S. Kandeepan,et al.  Database Optimization Using Genetic Algorithms for Distributed Databases , 2017 .

[28]  Nima Jafari Navimipour,et al.  MapReduce and Its Applications, Challenges, and Architecture: a Comprehensive Review and Directions for Future Research , 2017, Journal of Grid Computing.

[29]  Koji Zettsu,et al.  Constrained region selection method based on configuration space for visualization in scientific dataset search , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[30]  Sambit Kumar Mishra,et al.  Evaluating Query Execution Plans by Implementing Join Operators using Particle Swarm Optimization , 2014 .

[31]  S. Venkata Lakshmi,et al.  Query optimization using clustering and Genetic Algorithm for Distributed Databases , 2016, 2016 International Conference on Computer Communication and Informatics (ICCCI).

[32]  Noradin Ghadimi,et al.  An adaptive neuro-fuzzy inference system for islanding detection in wind turbine as distributed generation , 2015, Complex..

[33]  Vikram Singh,et al.  Distributed Query Processing Plans Generationusing Genetic Algorithm , 2011 .

[34]  Helen X. Xiang,et al.  Query optimization over a heterogeneously distributed scientific database , 2013, 2013 IEEE International Conference on Big Data.

[35]  Ismail Hakki Toroslu,et al.  Dynamic programming solution for multiple query optimization problem , 2004, Inf. Process. Lett..

[36]  Sangkyu Rho,et al.  A nested genetic algorithm for distributed database design , 1994, 1994 Proceedings of the Twenty-Seventh Hawaii International Conference on System Sciences.

[37]  Rajesh Kumar,et al.  Distributed query processing plan generation using iterative improvement and simulated annealing , 2013, 2013 3rd IEEE International Advance Computing Conference (IACC).

[38]  Gurdev Singh,et al.  Stochastic Analysis of DSS Queries for a Distributed Database Design , 2013 .

[39]  Mehdi Hosseinzadeh,et al.  A framework to expedite joint energy-reserve payment cost minimization using a custom-designed method based on Mixed Integer Genetic Algorithm , 2018, Eng. Appl. Artif. Intell..

[40]  Noradin Ghadimi,et al.  Optimal Placement of Distributed Generations in Radial Distribution Systems Using Various PSO and DE Algorithms , 2013 .

[41]  Derya Birant,et al.  An ant colony optimisation approach for optimising SPARQL queries by reordering triple patterns , 2015, Inf. Syst..

[42]  Nima Jafari Navimipour,et al.  Nature‐inspired meta‐heuristic algorithms for solving the load balancing problem in the software‐defined network , 2019, Int. J. Commun. Syst..

[43]  T. Dokeroglu,et al.  Particle Swarm Intelligence as a new heuristic for the optimization of distributed database queries , 2012, 2012 6th International Conference on Application of Information and Communication Technologies (AICT).

[44]  Yunjian Peng,et al.  An Optimization for Distributed Database Multi-join Query Based on Improved Genetic Algorithm , 2018 .

[45]  Wanli Zuo,et al.  Optimizing Large Query by Simulated Annealing Algorithm Based On Graph-Based Approach , 2011, J. Softw..

[46]  Haiyan Zhao,et al.  A Hybrid Swarm Intelligent Method Based on Genetic Algorithm and Artificial Bee Colony , 2010, ICSI.

[47]  Nima Jafari Navimipour,et al.  Intrusion detection for cloud computing using neural networks and artificial bee colony optimization algorithm , 2019, ICT Express.

[48]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .

[49]  Vikram Singh,et al.  Generating Optimal Query Plans for Distributed Query Processing using Teacher-Learner Based Optimization , 2015 .

[50]  Hamid Haj Seyyed Javadi,et al.  Multi-Join Query Optimization Using the Bees Algorithm , 2010, DCAI.

[51]  Philip S. Yu,et al.  Using join operations as reducers in distributed query processing , 1990, [1990] Proceedings. Second International Symposium on Databases in Parallel and Distributed Systems.

[52]  Carlos Juiz,et al.  Genetic Algorithm for Multi-Objective Optimization of Container Allocation in Cloud Architecture , 2017, Journal of Grid Computing.

[53]  Gurvinder Singh,et al.  Design and analysis of stochastic DSS query optimizers in a distributed database system , 2016 .

[54]  Noradin Ghadimi,et al.  A PSO-Based Fuzzy Long-Term Multi-Objective Optimization Approach for Placement and Parameter Setting of UPFC , 2014 .

[55]  Hasan Asil,et al.  Presenting a New Method for Optimizing Join Queries Processing in Heterogeneous Distributed Databases , 2010, 2010 Third International Conference on Knowledge Discovery and Data Mining.

[56]  Sanyang Liu,et al.  An Improved Artificial Bee Colony Algorithm and Its Application , 2013 .

[57]  Shiwen Li,et al.  Query Optimization of Distributed Database Based on Parallel Genetic Algorithm and Max-Min Ant System , 2015, 2015 8th International Symposium on Computational Intelligence and Design (ISCID).

[58]  Vikram Singh,et al.  Distributed Query Processing Plans generation using Teacher Learner Based Optimization , 2016, ArXiv.

[59]  Daniel J. Abadi,et al.  VLL: a lock manager redesign for main memory database systems , 2014, The VLDB Journal.

[60]  Rahul Singh,et al.  Distributed Query Plan Generation using Ant Colony Optimization , 2015, Int. J. Appl. Metaheuristic Comput..

[61]  Seyed Mohammad Taghi Rouhani Rankoohi,et al.  A multi-colony ant algorithm for optimizing join queries in distributed database systems , 2012, Knowledge and Information Systems.

[62]  Noradin Ghadimi,et al.  PSO Based Fuzzy Stochastic Long-Term Model for Deployment of Distributed Energy Resources in Distribution Systems With Several Objectives , 2013, IEEE Systems Journal.