GPU-Accelerated Block-Max Query Processing

In this paper, we propose a method for parallel top-k query processing on GPU(s). We employ a novel partitioning strategy which splits the posting lists according to document ID numbers. Individual GPU threads simultaneously perform top-k query processing within their allocated subsets of posting lists, the results of the query are merged to give the final top-k results. We further design a CPU-GPU cooperative query processing method, where a majority of queries involving shorter posting lists are processed on the GPU side. We experiment with AND, OR, WAND, and Block-Max WAND (BMW) queries, with experimental results showing a promising improvement in query throughput, particularly in the case of BMW queries.

[1]  Ellen M. Voorhees,et al.  Overview of TREC 2003. , 2003 .

[2]  Gang Wang,et al.  Fast lists intersection with Bloom filter using graphics processing units , 2011, SAC '11.

[3]  Gang Wang,et al.  Efficient lists intersection by CPU-GPU cooperative computing , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW).

[4]  Rong Zheng,et al.  Asynchronous stochastic gradient descent for DNN training , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[5]  Torsten Suel,et al.  Performance of compressed inverted list caching in search engines , 2008, WWW.

[6]  Torsten Suel,et al.  Inverted index compression and query processing with optimized document ordering , 2009, WWW '09.

[7]  Jun Pang,et al.  Rhythm: harnessing data parallel hardware for server workloads , 2014, ASPLOS.

[8]  Shirish Tatikonda,et al.  Posting list intersection on multicore architectures , 2011, SIGIR.

[9]  Stephen E. Robertson,et al.  Okapi at TREC-3 , 1994, TREC.

[10]  Torsten Suel,et al.  Faster top-k document retrieval using block-max indexes , 2011, SIGIR.

[11]  Stephen E. Robertson,et al.  GatfordCentre for Interactive Systems ResearchDepartment of Information , 1996 .

[12]  Mauricio Marín,et al.  Efficient Parallel Block-Max WAND Algorithm , 2013, Euro-Par.

[13]  Gang Wang,et al.  Efficient Parallel Lists Intersection and Index Compression Algorithms using Graphics Processing Units , 2011, Proc. VLDB Endow..

[14]  Fabrizio Silvestri,et al.  Sorting Out the Document Identifier Assignment Problem , 2007, ECIR.

[15]  Torsten Suel,et al.  Using graphics processors for high-performance IR query processing , 2008, WWW.

[16]  Torsten Suel,et al.  Batch query processing for web search engines , 2011, WSDM '11.

[17]  Andrei Z. Broder,et al.  Efficient query evaluation using a two-level retrieval process , 2003, CIKM '03.

[18]  Bingsheng He,et al.  GPUQP: query co-processing using graphics processors , 2007, SIGMOD '07.