Parallel approaches for processing data tabulation: A case study of population census 2010

Parallel approaches have been proposed in wide areas for improving the system performance. The proposal introduces three approaches to process statistical data tabulation. First, task parallelism to exploit the use of multi-cores in shared memory architecture is developed according to a bushy tree query plan for decomposing tasks. Each decomposed task is allocated to a processor core such that the workloads among processor cores are nearly balanced. Secondly, data parallelism to utilize available interconnected PCs is developed according to master-slave paradigm. Data distribution adopts a simple Round Robin approach to achieve workload balance and no data dependency among PCs. Lastly, hybrid parallelism is proposed to combine the former approaches for optimizing the use of available processor cores in interconnected PCs. The experimental results shows considerably good performance in terms of parallel time execution, speed up and efficiency.

[1]  Yannis E. Ioannidis,et al.  Query optimization , 1996, CSUR.

[2]  Kyuseok Shim,et al.  Including Group-By in Query Optimization , 1994, VLDB.

[3]  Gene Wagenbreth,et al.  High Performance Computing: Programming and Applications , 2010 .

[4]  Goetz Graefe,et al.  Query evaluation techniques for large databases , 1993, CSUR.

[5]  Leonie Kohl,et al.  Parallel Programming In C With Mpi And Open Mp , 2016 .

[6]  Laura Schweitzer,et al.  Database Systems A Practical Approach To Design Implementation And Management , 2016 .

[7]  Carolyn E. Begg,et al.  Database Systems: A Practical Approach to Design, Implementation and Management , 1998 .

[8]  Mostafa Bamha,et al.  Parallel processing of "group-by join" queries on shared nothing machines , 2006, ICSOFT.

[9]  Waqar Hasan,et al.  Optimization of SQL Queries for Parallel Machines , 1996, Lecture Notes in Computer Science.

[10]  Rajkumar Buyya,et al.  Parallel Programming Models and Paradigms , 1998 .

[11]  Hugh Garraway Parallel Computer Architecture: A Hardware/Software Approach , 1999, IEEE Concurrency.

[12]  Frank Huber,et al.  Query Processing on Multi-Core Architectures , 2009, Grundlagen von Datenbanken.

[13]  Georg Hager,et al.  Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes , 2009, 2009 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing.

[14]  Emery D. Berger,et al.  Grace: safe multithreaded programming for C/C++ , 2009, OOPSLA 2009.

[15]  David J. DeWitt,et al.  Parallel Database Systems: The Future of High Performance Database Processing 1 , 1992 .

[16]  Wei Hong Parallel Query Processing Using Shared Memory Multiprocessors and Disk Arrays , 1992 .