An intelligent agent-based system for multilingual financial news digest

Online financial news from many sources is widely available on the internet. Systems exist to help investors extract and analyse financial news from these sources, but many of them present news articles without categorisation and do not offer enough query options to search for news both accurately and comprehensively. In this paper, we extend our previous work to develop an intelligent agent-based system for multilingual news extraction. We adopt a document categorisation approach based on fuzzy keyword classification. The system applies fuzzy clustering to classify keywords by the concepts of each category. A category profile is then developed and used as a search interface for document browsing. Experimental results show that the proposed news categorisation agent can categorise news documents with a reasonable rate of accuracy, and that the news grouping agent can assemble groups of news items with similar content to facilitate information retrieval.
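
The fuzzy clustering step can be illustrated with a minimal sketch. It assumes fuzzy c-means, a standard fuzzy clustering algorithm; the abstract does not say which algorithm the system actually uses. The keywords, their two-dimensional feature vectors, the two concepts, and the function names are all hypothetical and chosen only for illustration.

```python
import math


def memberships_for(points, centers, m=2.0, eps=1e-9):
    """Return u[i][j], the degree to which point i belongs to cluster j."""
    exponent = 2.0 / (m - 1.0)
    table = []
    for p in points:
        # Clamp distances so a point sitting on a center does not divide by zero.
        dists = [max(math.dist(p, c), eps) for c in centers]
        row = [1.0 / sum((dj / dk) ** exponent for dk in dists) for dj in dists]
        table.append(row)
    return table


def fuzzy_c_means(points, centers, m=2.0, iterations=50):
    """Alternate membership and center updates for a fixed number of rounds.

    m is the fuzzifier (> 1); larger values give softer memberships.
    """
    dim = len(points[0])
    for _ in range(iterations):
        u = memberships_for(points, centers, m)
        new_centers = []
        for j in range(len(centers)):
            # Each center is the mean of all points, weighted by u^m.
            weights = [row[j] ** m for row in u]
            total = sum(weights)
            new_centers.append(
                [sum(w * p[d] for w, p in zip(weights, points)) / total
                 for d in range(dim)]
            )
        centers = new_centers
    return centers, memberships_for(points, centers, m)


# Hypothetical keyword features: affinity to an "equities" concept and
# to a "currency" concept (invented values, not taken from the paper).
keywords = ["dividend", "equity", "forex", "peg", "rate"]
features = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8], [0.5, 0.5]]
initial_centers = [[1.0, 0.0], [0.0, 1.0]]

centers, memberships = fuzzy_c_means(features, initial_centers)
for word, row in zip(keywords, memberships):
    print(word, [round(value, 3) for value in row])
```

Unlike a hard classifier, each keyword receives a degree of membership in every concept, so an ambiguous term such as "rate" can contribute to more than one category profile.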
