Semantic Web Search Engines

The last couple of years have seen an increasing growth in the amount of Semantic Web data made available, and exploitable, on the Web. Compared to the Web, one unique feature of the Semantic Web is its friendly interface with software programs. In order to better serve human users with software programs, supporting infrastructures for finding and selecting the distributed online Semantic Web data are needed. A number of Semantic Web search engines have emerged recently. These systems are based on different design principles and provide different levels of support for users and/or applications. In this chapter, a survey of these Semantic Web search engines is presented, together with the detailed description of the design of two prominent systems: Swoogle and Watson. The way these systems are used to enable domain applications and support cutting-edge research on Semantic Web technologies is also discussed. In particular, this chapter includes examples of a new generation of semantic applications that, thanks to Semantic Web search engines, exploit online knowledge at runtime, without the need for laborious acquisition in specific domains. In addition, through collecting large amounts of semantic content online, Semantic Web search engines such as Watson and Swoogle allow researchers to better understand how knowledge is formally published online and how Semantic Web technologies are used. In other terms, by mining the collected semantic documents, it becomes possible to get an overview and explore the Semantic Web landscape today. The first section below (Sect. 16.1) presents a general overview of the area, including the main challenges, related systems, as well as an abstract specification of what is called Semantic Web search engines. It also includes a detailed overview of the two systems more specifically considered as case studies, Swoogle (Sect. 16.1.4) and Watson (Sect. 16.1.5). Section 16.2 shows how these systems are currently being used and applied, both as development platforms to make possible the realization of applications exploiting Semantic Web content (Sect. 16.2.1), and as research platforms, allowing one to better understand the content of the Semantic Web, how knowledge is published online and how it is structured. Finally, Sect. 16.3 briefly introduces other resources to be considered in the area of Semantic Web search engines, and Sect. 16.4 concludes the chapter.

[1]  Mathieu d'Aquin,et al.  Formally measuring agreement and disagreement in ontologies , 2009, K-CAP '09.

[2]  Patricia Ordóñez de Pablos,et al.  Mailing Lists and Social Semantic Web , 2010 .

[3]  Enrico Motta,et al.  Toward a New Generation of Semantic Web Applications , 2008, IEEE Intelligent Systems.

[4]  Eyal Oren,et al.  Sindice.com: Weaving the Open Linked Data , 2007, ISWC/ASWC.

[5]  Andrei Z. Broder,et al.  A Technique for Measuring the Relative Size and Overlap of Public Web Search Engines , 1998, Comput. Networks.

[6]  Yun Peng,et al.  Finding and Ranking Knowledge on the Semantic Web , 2005, SEMWEB.

[7]  Enrico Motta,et al.  Detecting Different Versions of Ontologies in Large Ontology Repositories , 2009, IWOD@ISWC.

[8]  Ljiljana Stojanovic,et al.  Consistent Evolution of OWL Ontologies , 2005, ESWC.

[9]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[10]  Fouad Zablith,et al.  Dynamic ontology evolution , 2008 .

[11]  Enrico Motta,et al.  Characterizing Knowledge on the Semantic Web with Watson , 2007, EON.

[12]  Enrico Motta,et al.  Semantically enriching folksonomies with FLOR , 2008 .

[13]  Timothy W. Finin,et al.  Swoogle: a search and metadata engine for the semantic web , 2004, CIKM '04.

[14]  Enrico Motta,et al.  What Can be Done with the Semantic Web? An Overview Watson-based Applications , 2008, SWAP.

[15]  Timothy W. Finin,et al.  Finding Data, Knowledge, and Answers on the Semantic Web , 2007, FLAIRS.

[16]  Antonio Gulli,et al.  The indexable web is more than 11.5 billion pages , 2005, WWW '05.

[17]  Harry Chen,et al.  SOUPA: standard ontology for ubiquitous and pervasive applications , 2004, The First Annual International Conference on Mobile and Ubiquitous Systems: Networking and Services, 2004. MOBIQUITOUS 2004..

[18]  Peter Thanisch,et al.  Natural language interfaces to databases – an introduction , 1995, Natural Language Engineering.

[19]  Yuzhong Qu,et al.  Falcons: searching and browsing entities on the semantic web , 2008, WWW.

[20]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[21]  Li Ding,et al.  Characterizing the Semantic Web on the Web , 2006, SEMWEB.

[22]  Chutiporn Anutariya,et al.  SQORE-Based Ontology Retrieval System , 2007, DEXA.

[23]  Peter W. Eklund,et al.  Conceptual structures: current research and practice , 1992 .

[24]  Enrico Motta,et al.  Identifying Key Concepts in an Ontology, through the Integration of Cognitive Principles with Statistical and Topological Measures , 2008, ASWC.

[25]  E. Prud hommeaux,et al.  SPARQL query language for RDF , 2011 .

[26]  C. Lee Giles,et al.  Accessibility of information on the web , 1999, Nature.

[27]  Stefan Decker,et al.  Sig.ma: live views on the web of data , 2010, WWW '10.

[28]  Jürgen Umbrich,et al.  SWSE: Answers Before Links! , 2007, Semantic Web Challenge.

[29]  Enrico Motta,et al.  Exploring the Semantic Web as Background Knowledge for Ontology Matching , 2008, J. Data Semant..

[30]  John F. Sowa,et al.  Conceptual graphs summary , 1992 .

[31]  Enrico Motta,et al.  A Platform for Semantic Web Studies , 2010 .

[32]  Enrico Motta,et al.  PowerMap: Mapping the Real Semantic Web on the Fly , 2006, SEMWEB.

[33]  Mathieu d'Aquin,et al.  Sharing and reusing aligned ontologies with cupboard , 2009, K-CAP '09.

[34]  Enrico Motta,et al.  Bridging the gap between folksonomies and the semantic web: an experience report , 2007 .

[35]  Harith Alani,et al.  Ranking Ontologies with AKTiveRank , 2006, SEMWEB.

[36]  James A. Hendler,et al.  A Survey of the Web Ontology Landscape , 2006, SEMWEB.

[37]  Christian Bizer,et al.  The Emerging Web of Linked Data , 2009, IEEE Intelligent Systems.

[38]  Enrico Motta,et al.  Using background knowledge for ontology evolution , 2008 .