Mining Link Patterns in Linked Data

As the explosive growth of online linked data, an emerging problem is what and how we can learn from these data. An important knowledge we can obtain is the link patterns among objects, which are helpful for characterizing, analyzing and understanding of linked data. In this paper, we present a novel approach of mining link patterns. A Typed Object Graph is proposed as the data model, and a gSpan-based algorithm is proposed for pattern mining. A type determination policy is introduced in cases of multi-types and a data clustering algorithm is proposed to improve scalability. Time performance and mining results are discussed by experiments.

[1]  Bamshad Mobasher,et al.  Integrating Semantic Knowledge with Web Usage Mining for Personalization , 2009 .

[2]  Isabelle Mirbel,et al.  DFS-based frequent graph pattern extraction to characterize the content of RDF Triple Stores , 2010 .

[3]  Alan L. Rector,et al.  Web ontology segmentation: analysis, classification and use , 2006, WWW '06.

[4]  Jiawei Han,et al.  CloseGraph: mining closed frequent graph patterns , 2003, KDD '03.

[5]  Samir Khuller,et al.  Link Prediction for Annotation Graphs Using Graph Summarization , 2011, SEMWEB.

[6]  Yugyung Lee,et al.  OntoKhoj: a semantic web portal for ontology searching, ranking and classification , 2003, WIDM '03.

[7]  Yuzhong Qu,et al.  Integrating Lightweight Reasoning into Class-Based Query Refinement for Object Search , 2008, ASWC.

[8]  Lora Aroyo,et al.  The Semantic Web: Research and Applications , 2009, Lecture Notes in Computer Science.

[9]  Amit P. Sheth,et al.  Semantic Association Identification and Knowledge Discovery for National Security Applications , 2005, J. Database Manag..

[10]  Alexander Maedche,et al.  Clustering Ontology-Based Metadata in the Semantic Web , 2002, PKDD.

[11]  Haofen Wang,et al.  Snippet Generation for Semantic Web Search Engines , 2008, ASWC.

[12]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[13]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[14]  Takashi Washio,et al.  An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data , 2000, PKDD.

[15]  Anthony K. H. Tung,et al.  Semantic Mining and Analysis of Gene Expression Data , 2004, VLDB.

[16]  Lora Aroyo,et al.  The Semantic Web - ISWC 2011 - 10th International Semantic Web Conference, Bonn, Germany, October 23-27, 2011, Proceedings, Part I , 2011, SEMWEB.

[17]  Alun D. Preece,et al.  Instance Based Clustering of Semantic Web Resources , 2008, ESWC.

[18]  Jan Komorowski,et al.  Principles of Data Mining and Knowledge Discovery , 2001, Lecture Notes in Computer Science.