Filtering Method for the Annotated and Non-Annotated Web Pages

With the great mass of the pages managed through the world, and especially with the advent of the Web, it has become more difficult to find the relevant pages after an interrogation. Furthermore, the manual filtering of the indexed Web pages is a laborious task. A new filtering method of the annotated Web pages (by our semantic annotation process) and the non-annotated Web pages (retrieved from search engine “Google”) is then necessary to group the relevant Web pages for the user. In this paper, the authors will first synthesize their previous work of the semantic annotation of Web pages. Then, they will define a new filtering method based on three activities. The authors will also present their querying and filtering component of Web pages; their purpose is to demonstrate the feasibility of the filtering method. Finally, the authors will present an evaluation of this component, which has proved its performance for multiple domains. KeyWoRdS Domain Ontologies, Filtering Method, Relevant Web Pages, Semantic Annotation, Semantic Web Environment

[1]  Philippe Blache,et al.  A semantic vector space and features-based approach for automatic information filtering , 2004, Expert Syst. Appl..

[2]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[3]  Jon Corson-Rikert,et al.  The VIVO Ontology: Enabling Networking of Scientists , 2011 .

[4]  Alistair Moffat,et al.  Efficient Extended Boolean Retrieval , 2012, IEEE Transactions on Knowledge and Data Engineering.

[5]  Edward A. Fox,et al.  Research Contributions , 2014 .

[6]  Matthias Samwald,et al.  The bio-zen plus ontology , 2008, Appl. Ontology.

[7]  Rafik Bouaziz,et al.  Automation and evaluation of the semantic annotation of Web resources , 2013, 8th International Conference for Internet Technology and Secured Transactions (ICITST-2013).

[8]  G. N. Lance,et al.  A General Theory of Classificatory Sorting Strategies: 1. Hierarchical Systems , 1967, Comput. J..

[9]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[10]  Vassilios Peristeras,et al.  Interlinking the Social Web with Semantics , 2008, IEEE Intelligent Systems.

[11]  Amir Masoud Rahmani,et al.  Link Processing for Fuzzy Web Pages Clustering and Classification , 2009 .

[12]  Edward Fox,et al.  Extending the boolean and vector space models of information retrieval with p-norm queries and multiple concept types , 1983 .

[13]  Ramachandra V. Pujeri,et al.  DISTRIBUTED APPROACH to WEB PAGE CATEGORIZATION USING MAP- REDUCE PROGRAMMING MODEL , 2012 .

[14]  Rafik Bouaziz,et al.  Fuzzy semantic annotation of Web resources , 2014, 2014 World Symposium on Computer Applications & Research (WSCAR).

[15]  Steffen Lohmann,et al.  Adding Semantics to Social Software Engineering: (Re-)Using Ontologies in a Community-oriented Requirements Engineering Environment , 2010, Software Engineering.

[16]  Rafik Bouaziz,et al.  Automation of the semantic annotation of web resources , 2014 .

[17]  Falk Scholer,et al.  The challenge of high recall in biomedical systematic search , 2009, DTMBIO.