A Multi-Objective Approach to Big Data View Materialization

Big data comprises voluminous and heterogeneous data that has a limited level of trustworthiness. This data is used to generate valuable information that can be used for decision making. However, decision making queries on Big data consume a lot of time for processing resulting in higher response times. For effective and efficient decision making, this response time needs to be reduced. View materialization has been used successfully to reduce the query response time in the context of a data warehouse. Selection of such views is a complex problem vis-à-vis Big data and is the focus of this paper. In this paper, the Big data view selection problem is formulated as a bi-objective optimization problem with the two objectives being the minimization of the query evaluation cost and the minimization of the update processing cost. Accordingly, a Big data view selection algorithm that selects Big data views for a given query workload, using the vector evaluated genetic algorithm, is proposed. The proposed algorithm aims to generate views that are able to reduce the response time of decision-making queries.

[1]  T. V. Vijay Kumar,et al.  Materialized View Selection using Improvement based Bee Colony Optimization , 2015, Int. J. Softw. Sci. Comput. Intell..

[2]  T. V. Vijay Kumar,et al.  Materialized View Selection Using Bumble Bee Mating Optimization , 2017, Int. J. Decis. Support Syst. Technol..

[3]  T. V. Vijay Kumar,et al.  Materialized View Selection using Marriage in Honey Bees Optimization , 2015, Int. J. Nat. Comput. Res..

[4]  Sanjay Ghemawat,et al.  MapReduce: a flexible data processing tool , 2010, CACM.

[5]  Zakarya A. Alzamil,et al.  Big Data Contextual Analytics Study on Arabic Tweets Summarization , 2019, Int. J. Knowl. Syst. Sci..

[6]  Dhruba Kumar Bhattacharyya,et al.  Materialized view selection using evolutionary algorithm for speeding up big data query processing , 2017, Journal of Intelligent Information Systems.

[7]  Jay Prakash,et al.  Multi-Objective Materialized View Selection Using Improved Strength Pareto Evolutionary Algorithm , 2019, International Journal of Artificial Intelligence and Machine Learning.

[8]  T. V. Vijay Kumar,et al.  Materialised view selection using randomised algorithms , 2015, Int. J. Bus. Inf. Syst..

[9]  Alberto Abelló,et al.  MapReduce Performance Models for Hadoop 2.x , 2017, EDBT/ICDT Workshops.

[10]  Serge Abiteboul,et al.  On views and XML , 1999, PODS '99.

[11]  Nick Roussopoulos,et al.  Materialized views and data warehouses , 1998, SGMD.

[12]  Adam Jacobs,et al.  The pathologies of big data , 2009, Commun. ACM.

[13]  Jay Prakash,et al.  Multi-objective materialized view selection using MOGA , 2020, International Journal of System Assurance Engineering and Management.

[14]  T. V. Vijay Kumar,et al.  Query answering-based view selection , 2015, Int. J. Bus. Inf. Syst..

[15]  T. V. Vijay Kumar,et al.  Materialized View Selection using Artificial Bee Colony Optimization , 2017, Int. J. Intell. Inf. Technol..

[16]  Murtaza Haider,et al.  Beyond the hype: Big data concepts, methods, and analytics , 2015, Int. J. Inf. Manag..

[17]  Jay Prakash,et al.  A Multi-Objective Approach for Materialized View Selection , 2019, Int. J. Oper. Res. Inf. Syst..

[18]  T. V. Vijay Kumar,et al.  Query Frequency based View Selection , 2017 .

[19]  Santosh Kumar,et al.  Materialised view selection using differential evolution , 2014 .

[20]  T. V. Vijay Kumar,et al.  Materialised view selection using BCO , 2016, Int. J. Bus. Inf. Syst..

[21]  Cecilia Zanni-Merk,et al.  Ontologies to Lead Knowledge Intensive Evolutionary Algorithms: Principles and Case Study , 2016, Int. J. Knowl. Syst. Sci..

[22]  Shouyang Wang,et al.  A New Back-Propagation Neural Network Algorithm for a Big Data Environment Based on Punishing Characterized Active Learning Strategy , 2013, Int. J. Knowl. Syst. Sci..

[23]  Akshay Kumar,et al.  Big data and analytics: issues, challenges, and opportunities , 2015 .