Towards Knowledge Discovery in the Semantic Web

In the past, data mining and machine learning research has developed various techniques to learn on data and to extract patterns from data to support decision makers in various tasks, such as customer profiling, targeted marketing, store layout, and fraud detection (Tan et al., 2005, p.1). In addition, the World Wide Web increasingly offers distributed information that can be useful for strategic, tactical or operational decisions, including news, events, financial information, information about competitors as well as information about the social networks of customers and employees etc. The Web thus has the potential for a high impact on competitive actions and competitive dynamics of enterprises that should utilize this information. However, the growing amount of these distributed information resources leads to a dilemma:”... the more distributed and independently managed that resources on the Web become, the greater is their potential value, but the harder it is to extract value...” (Singh and Huhns, 2005, p.7). On the one hand the human ability for information processing is limited (Edelmann, 2000, p.168), whilst otherwise the amount of available information of the Web increases exponentially, which leads to increasing information saturation (Krcmar, 2004, p.52). In this context, it becomes more and more important to detect useful patterns in the Web, thus use it as a rich source for data mining (Berendt et al., 2002; Han and Kamber, 2006, p.628) in addition to company internal databases. The extraction of information and interesting patterns out of the Web is a complex task, because the current Web is mainly utilized for human consumption. This means that the available information is represented by mark-up languages such as XHTML1 and its predecessors that describe only a visual presentation. Unfortunately, these languages are not sufficient to let software agents ”under-

[1]  Usama M. Fayyad From mining the web to inventing the new sciences underlying the internet , 2007, KDD '07.

[2]  Andreas Hotho,et al.  Towards Semantic Web Mining , 2002, SEMWEB.

[3]  Jian Pei,et al.  Data Mining: Concepts and Techniques, 3rd edition , 2006 .

[4]  Jennifer E. Rowley,et al.  The wisdom hierarchy: representations of the DIKW hierarchy , 2007, J. Inf. Sci..

[5]  Achim Rettinger,et al.  Towards Machine Learning on the Semantic Web , 2008, URSW.

[6]  Alun D. Preece,et al.  Instance Based Clustering of Semantic Web Resources , 2008, ESWC.

[7]  Andreas Hotho,et al.  Semantic Web Mining: State of the art and future directions , 2006, J. Web Semant..

[8]  S. Džeroski,et al.  Relational Data Mining , 2001, Springer Berlin Heidelberg.

[9]  Heiner Müller-Merbach Knowledge is more than information , 2004 .

[10]  Saso Dzeroski,et al.  Multi-relational data mining: an introduction , 2003, SKDD.

[11]  Luc De Raedt,et al.  Logical and Relational Learning: From ILP to MRDM (Cognitive Technologies) , 2008 .

[12]  Theo Tryfonas,et al.  Frontiers in Artificial Intelligence and Applications , 2009 .

[13]  Deborah L. McGuinness,et al.  Owl web ontology language guide , 2003 .

[14]  Munindar P. Singh,et al.  Service-Oriented Computing: Semantics, Processes, Agents , 2010 .

[15]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[16]  Diego Calvanese,et al.  The Description Logic Handbook: Theory, Implementation, and Applications , 2003, Description Logic Handbook.

[17]  Dunja Mladenic,et al.  A Roadmap for Web Mining: From Web to Semantic Web , 2003, EWMF.

[18]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[19]  Arno J. Knobbe,et al.  Multi-Relational Data Mining ( paper id : 294 ) , 1999 .

[20]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[21]  Enrico Motta,et al.  Toward a New Generation of Semantic Web Applications , 2008, IEEE Intelligent Systems.

[22]  Boris Motik,et al.  Managing multiple and distributed ontologies on the Semantic Web , 2003, The VLDB Journal.

[23]  Travis D. Breaux,et al.  Using Ontology in Hierarchical Information Clustering , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[24]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[25]  Nicola Fanizzi,et al.  Conceptual Clustering and Its Application to Concept Drift and Novelty Detection , 2008, ESWC.

[26]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[27]  Jiawei Han,et al.  Data Mining: Concepts and Techniques, Second Edition , 2006, The Morgan Kaufmann series in data management systems.

[28]  Alexander Maedche,et al.  Clustering Ontology-Based Metadata in the Semantic Web , 2002, PKDD.

[29]  Vasant G Honavar,et al.  Learning Relational Bayesian Classifiers on the Semantic Web , 2006 .

[30]  Jan Komorowski,et al.  Principles of Data Mining and Knowledge Discovery , 2001, Lecture Notes in Computer Science.

[31]  Dan Brickley,et al.  Rdf vocabulary description language 1.0 : Rdf schema , 2004 .