Efficient Access Control of Large Scale RDF Data Using Prefix-Based Labeling

A massive amount of resource description framework (RDF) data are available on the web. An RDF data publisher may want to prevent a few users from accessing a certain part of the RDF data. Various approaches have been proposed to reject a given SPARQL query that is intended to access instances or classes that are required to be protected. The problem of such access control management can be cast to processing ancestor/descendant relationship query over class hierarchy. The prefix-based labeling scheme has been applied to the fast processing of ancestor/descendant relationship queries. However, we observed that the existing approaches are ineffective in dealing with massive amounts of RDF data because the adopted labeling schemes produce labels of large sizes. Hence, we adopted the state-of-the art MapReduce-based algorithm for prefix-based labeling to reduce the label size based on the structural information of RDF data. Experiments with real-world RDF datasets showed that the proposed approach is more efficient than the conventional methods.

[1]  Ioannis Xenarios,et al.  The UniProtKB/Swiss-Prot Tox-Prot program: A central hub of integrated venom protein data. , 2012, Toxicon : official journal of the International Society on Toxinology.

[2]  Edith Cohen,et al.  Reachability and distance queries via 2-hop labels , 2002, SODA '02.

[3]  Dong-Hyuk Im,et al.  Similarity-based Change Detection for RDF in MapReduce , 2016 .

[4]  Jeffrey Xu Yu,et al.  Reachability querying: an independent permutation labeling approach , 2014, The VLDB Journal.

[5]  Giovanni Tummarello,et al.  A Node Indexing Scheme for Web Entity Retrieval , 2010, ESWC.

[6]  Sang-Won Lee,et al.  Backward inference and pruning for RDF change detection using RDBMS , 2013, J. Inf. Sci..

[7]  Shankar Pal,et al.  XQuery Implementation in a Relational Database System , 2005, VLDB.

[8]  Dong-Hyuk Im,et al.  Efficiently Answering Reachability Queries for Tree-Structured Data in Repetitive Prime Number Labeling Schemes , 2018 .

[9]  Stefan Decker,et al.  Secure Manipulation of Linked Data , 2013, SEMWEB.

[10]  Ya-Hui Chang,et al.  A Compact and Efficient Labeling Scheme for XML Documents , 2013, DASFAA.

[11]  Vassilis Christophides,et al.  Optimizing taxonomic semantic web queries using labeling schemes , 2004, J. Web Semant..

[12]  Tok Wang Ling,et al.  DDE: from dewey to a fully dynamic XML labeling scheme , 2009, SIGMOD Conference.

[13]  Dirk Ahlers,et al.  Assessment of the accuracy of GeoNames gazetteer data , 2013, GIR '13.

[14]  X. Wu,et al.  A prime number labeling scheme for dynamic ordered XML trees , 2004, Proceedings. 20th International Conference on Data Engineering.

[15]  Tok Wang Ling,et al.  QED: a novel quaternary encoding to completely avoid re-labeling in XML updates , 2005, CIKM '05.

[16]  Kyong-Ha Lee,et al.  Parallel labeling of massive XML data with MapReduce , 2014, The Journal of Supercomputing.

[17]  Seog Park,et al.  Efficient authorization conflict detection using graph labeling in controlling access to RDF , 2015, Comput. Stand. Interfaces.

[18]  Jens Lehmann,et al.  DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia , 2015, Semantic Web.

[19]  Hong-Gee Kim,et al.  A MapReduce-Based Approach for Prefix-Based Labeling of Large XML Data , 2016, JIST.

[20]  Hong-Gee Kim,et al.  G-Diff: A Grouping Algorithm for RDF Change Detection on MapReduce , 2014, JIST.

[21]  Amit Jain,et al.  Secure resource description framework: an access control model , 2006, SACMAT '06.

[22]  Hossein Saiedian,et al.  Reusable Prime Number Labeling Scheme for Hierarchical Data Representation in Relational Databases , 2014, J. Comput. Inf. Technol..

[23]  Gang Wu,et al.  Adapting Prime Number Labeling Scheme for Directed Acyclic Graphs , 2006, DASFAA.

[24]  Timothy W. Finin,et al.  Policy-Based Access Control for an RDF Store , 2005, IJCAI 2007.

[25]  Chun Zhang,et al.  Storing and querying ordered XML using a relational database system , 2002, SIGMOD '02.

[26]  Tok Wang Ling,et al.  A Dynamic Labeling Scheme Using Vectors , 2007, DEXA.

[27]  Tok Wang Ling,et al.  Indexing and querying XML using extended Dewey labeling scheme , 2011, Data Knowl. Eng..

[28]  Sang-Won Lee,et al.  A Version Management Framework for RDF Triple Stores , 2012, Int. J. Softw. Eng. Knowl. Eng..

[29]  Lu Joan,et al.  Investigation into Indexing XML Data Techniques , 2014 .

[30]  Fabian M. Suchanek,et al.  YAGO3: A Knowledge Base from Multilingual Wikipedias , 2015, CIDR.

[31]  Patrick E. O'Neil,et al.  ORDPATHs: insert-friendly XML node labels , 2004, SIGMOD '04.

[32]  Li Qin,et al.  Concept-level access control for the Semantic Web , 2003, XMLSEC '03.