OntoPortal: An ontology-supported portal architecture with linguistically enhanced and focused crawler technologies

Abstract This paper proposed the techniques of ontology and linguistics to develop a fully-automatic annotation technique, coupling with an automatic ontology construction method, could play a key role in the development of Semantic Portals. An ontology-supported portal architecture: OntoPortal was proposed according to this technique, in which three internal components Portal Interface, Semantic Portal, and OntoCrawler was integrated to rapidly and precisely collect information on Internet and capture true user’s intention and accordingly provide high-quality query answers to meet the user requests. This paper also demonstrated the OntoPortal prototype which defined how a semantic portal is interacting with the user by providing five different types of interaction patterns such as including keyword search, synonym search, POS (Part-of-Speech)-constrained keyword search, natural language query, and semantic index search. The preliminary experiment outcomes proved the technology proposed in this paper to be able to really up-rise the precision and recall rates of webpage searching and accordingly showed that it can indeed retrieve better semantic-directed information to meet user requests.

[1]  Sheng-Yuan Yang An Ontology-Supported and Fully-Automatic Annotation Technology for Semantic Portals , 2007, IEA/AIE.

[2]  Henrik Eriksson,et al.  Knowledge modeling at the millennium : The design and evolution of Protégé-2000 , 1999 .

[3]  Michael Uschold,et al.  Ontologies: principles, methods and applications , 1996, The Knowledge Engineering Review.

[4]  Thomas Strang,et al.  Integrating Agents, Ontologies, and Semantic Web Services for Collaboration on the Semantic Web , 2005, AAAI Fall Symposium: Agents and the Semantic Web.

[5]  Muzammil Khan,et al.  An MDA-based Approach for Specifying Semantic Portals , 2007, IEEE International Conference on Web Services (ICWS 2007).

[6]  Chabane Djeraba Dominos: A New Web Crawler's Design , 2004 .

[7]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[8]  Asunción Gómez-Pérez,et al.  Six challenges for the Semantic Web , 2002, KR 2002.

[9]  Sheng-Yuan Yang,et al.  Ontology-Supported User Models for Interface Agents , 1999 .

[10]  Stefan Decker,et al.  Creating Semantic Web Contents with Protégé-2000 , 2001, IEEE Intell. Syst..

[11]  James A. Hendler,et al.  The Semantic Web 10 , 2011 .

[12]  G. Aghila,et al.  Ontology-based Web crawler , 2004, International Conference on Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004..

[13]  Kristina Lerman,et al.  Using the structure of Web sites for automatic segmentation of tables , 2004, SIGMOD '04.

[14]  W. A. Pinheiro,et al.  An ontology based-approach for semantic search in portals , 2004 .

[15]  James A. Hendler,et al.  Speinning the Semantic Web , 2003 .

[16]  Enrico Motta,et al.  Semi-automatic annotation of contested knowledge on the world wide web , 2004, WWW Alt. '04.

[17]  Antonio Calabrese,et al.  Web-pages annotation and adaptability. A semantic portal on the International Space Station , 2005, SWAP.

[18]  John Mylopoulos,et al.  Semi-Automatic Semantic Annotations for Web Documents , 2005, SWAP.

[19]  Jesualdo Tomás Fernández-Breis,et al.  An ontology-based intelligent system for recruitment , 2006, Expert Syst. Appl..

[20]  Steffen Staab,et al.  S-CREAM: Semiautomatic CREAtion of Metadata , 2002, SAAKM@ECAI.

[21]  Sebastiano Vigna,et al.  UbiCrawler: a scalable fully distributed Web crawler , 2004, Softw. Pract. Exp..

[22]  Von-Wun Soo,et al.  The conflict detection and resolution in knowledge merging for image annotation , 2006, Inf. Process. Manag..

[23]  Antonio J. Serrano,et al.  Web mining based on Growing Hierarchical Self-Organizing Maps: Analysis of a real citizen web portal , 2008, Expert Syst. Appl..

[24]  Dan Tufis,et al.  Tagging romanian texts: a case study for QTAG, a language independent probabilistic tagger , 1998 .

[25]  Eero Hyvönen,et al.  ONTODELLA -- A Projection and Linking Service for Semantic Web Applications , 2006, 17th International Workshop on Database and Expert Systems Applications (DEXA'06).

[26]  York Sure-Vetter,et al.  An infrastructure for scalable, reliable semantic portals , 2004, IEEE Intelligent Systems.