Evaluation of Triple Indices in Retrieving Web Documents

Indexing is one of the important technique used to make searching and retrieval process more accurate and faster. In this paper, the indexing features of triple stores are investigated to see the pattern of semantic indexing that will improve the current information retrieval indexing. To do the evaluation, we choose the best triple stores, Allegro Graph. Allegro Graph require more than one triple indices pattern to be matched with the query. Therefore we do the evaluation of triple indices to determine which triple indices pattern gives for fast retrieval. During experiment, we use RDF data. Then we matched the triple indices pattern with the SPARQL query to retrieve the relevant data. As the result, it would take longer if the system contains unneeded triple indices which will waste CPU times and disk space. Removing unneeded triple indices pattern will provide for fast retrieval.

[1]  Sherif Sakr,et al.  AdaptRDF: adaptive storage management for RDF databases , 2012, Int. J. Web Inf. Syst..

[2]  Lei Zou,et al.  gStore: Answering SPARQL Queries via Subgraph Matching , 2011, Proc. VLDB Endow..

[3]  Adina Crainiceanu,et al.  Rya: a scalable RDF triple store for the clouds , 2012, Cloud-I '12.

[4]  Atanas Kiryakov,et al.  Semantic Annotation, Indexing, and Retrieval , 2003, SEMWEB.

[5]  Catherine Roussey,et al.  Semantic Indexing of Technical Documentation , 2009 .

[6]  Rada Mihalcea,et al.  Semantic Indexing using WordNet Senses , 2000 .

[7]  Gerald Reif,et al.  Updating relational data via SPARQL/update , 2010, EDBT '10.

[8]  Jayanta Banerjee,et al.  Making Unstructured Data SPARQL Using Semantic Indexing in Oracle Database , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[9]  Ramez Elmasri,et al.  Fundamentals of Database Systems , 1989 .

[10]  Xin Wang,et al.  Storing and Indexing RDF Data in a Column-Oriented DBMS , 2010, 2010 2nd International Workshop on Database Technology and Applications.

[11]  Thanh Nguyen,et al.  The effect of semantic index in information retrieval development , 2008, iiWAS.

[12]  Oktie Hassanzadeh,et al.  Data Management Issues on the Semantic Web , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[13]  Sherif Sakr,et al.  Relational processing of RDF queries: a survey , 2010, SGMD.

[14]  Gerhard Weikum,et al.  The RDF-3X engine for scalable management of RDF data , 2010, The VLDB Journal.

[15]  David Austerberry Cataloging and Indexing , 2007 .

[16]  Alistair Moffat,et al.  An Efficient Indexing Technique for Full Text Databases , 1992, Very Large Data Bases Conference.

[17]  Bernhard Haslhofer,et al.  Europeana RDF Store Report , 2011 .

[18]  Gerhard Weikum,et al.  x-RDF-3X , 2010, Proc. VLDB Endow..

[19]  S. Srinivasan,et al.  A Survey of Text Mining : Retrieval , Extraction and Indexing Techniques , 2012 .

[20]  Kathryn S. McKinley,et al.  The Effect of Collection Organization and Query Locality on Information Retrieval System Performance , 2002 .

[21]  Daniel J. Abadi,et al.  SW-Store: a vertically partitioned DBMS for Semantic Web data management , 2009, The VLDB Journal.

[22]  Ron Sacks-Davis,et al.  An e cient indexing technique for full-text database systems , 1992, VLDB 1992.