KnowledgeSeeker — an ontological agent-based system for retrieving and analyzing Chinese web articles

In this paper, we present the KnowledgeSeeker, an ontological agent-based system that is designed to help users find, retrieve, and analyze news article from the Internet and then present the content in a semantic web. We present the benefits of using ontologies to analyze the semantics of Chinese text, and also the advantages of using a semantic web to organize information semantically. KnowledgeSeeker also demonstrates the advantages of using ontologies to identify topics. We use a Chinese document corpus to evaluate KnowledgeSeeker and the testing result was compared to other approaches. KnowledgeSeeker is able to identify the topics of Chinese web articles with an accuracy of nearly 87% and has a processing speed of less than one second per article. It is also able to organize content flexibly and understands knowledge more accurately than methods that use ontology definition.

[1]  Michael Wooldridge,et al.  Agent technology: foundations, applications, and markets , 1998 .

[2]  Jonathan Barnes What is ontology , 1994 .

[3]  James Nga-Kwok Liu,et al.  iJADE eMiner - A Web-Based Mining Agent Based on Intelligent Java Agent Development Environment (iJADE) on Internet Shopping , 2001, PAKDD.

[4]  John Yen,et al.  Relevant data expansion for learning concept drift from sparsely labeled data , 2005, IEEE Transactions on Knowledge and Data Engineering.

[5]  Natenapa Sriharee Semantic Web Services Discovery Using Ontology-Based Rating Model , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[6]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[7]  Xiaolong Wang,et al.  Quantifying semantic similarity of Chinese words from HowNet , 2002, Proceedings. International Conference on Machine Learning and Cybernetics.

[8]  Bob J. Wielinga,et al.  Ontology-Based Photo Annotation , 2001, IEEE Intell. Syst..

[9]  Olga Kosheleva,et al.  IEEE International Conference on Fuzzy Systems , 1996 .

[10]  Raymond S. T. Lee Fuzzy-neuro approach to agent applications : from the AI perspective to modern ontology , 2006 .

[11]  Philip S. Yu,et al.  On using partial supervision for text categorization , 2004, IEEE Transactions on Knowledge and Data Engineering.

[12]  Yuefeng Li,et al.  Capturing Evolving Patterns for Ontology-based Web Mining , 2004, IEEE/WIC/ACM International Conference on Web Intelligence (WI'04).

[13]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[14]  Steffen Staab,et al.  Ontology Learning for the Semantic Web , 2002, IEEE Intell. Syst..

[15]  Siegfried Handschuh,et al.  Adding Provenance and Evolution Information to Modularized Argumentation Models , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[16]  Kok Wee Gan,et al.  Annotating Information Structures in Chinese Texts Using HowNet , 2000, ACL 2000.

[17]  Pablo Castells,et al.  A Multi-Purpose Ontology-Based Approach for Personalized Content Filtering and Retrieval , 2006, 2006 First International Workshop on Semantic Media Adaptation and Personalization (SMAP'06).

[18]  Manjula Patel,et al.  Knowledge Discovery in an Agents Environment , 2004, ESWS.

[19]  Steffen Staab,et al.  CREAM: CREAting Metadata for the Semantic Web , 2003, Comput. Networks.