Paper Information - Improved Reinforcement-based Profile Learning for Document Filtering

Summary A personalized information filtering system tailors user queries to the user's current interests and adapts the information as those interests change over time. The system monitors a stream of incoming documents to learn user information needs in the form of profiles, and it delivers only those documents that match the user profiles. To learn a profile, the significance of the query terms is assessed and a weight is assigned to each term in the profile. This article proposes a purity term-weighting method for profile learning in a personalized information filtering system. The main idea is to weight terms based on their pure frequencies, in addition to the number of pure relevant documents that contain them. Profiles are discriminated based on the top-weighted terms that represent them, and each profile is updated with every selected relevant document in order to keep matching the user's interests. The efficiency of the proposed method is measured using linear utility accuracy on the TREC 2002 filtering track. The experimental results show improved term selection and profile-building accuracy compared with Rocchio's algorithm, Okapi/BSS (Basic Search System), and the incremental profile learning approach.
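To make the evaluation measure concrete, the following is a minimal sketch of threshold-based filtering scored with a linear utility. It assumes the linear utility form used in the TREC 2002 filtering track (a credit of 2 for each relevant delivered document and a penalty of 1 for each non-relevant one). The names `filter_stream` and `linear_utility`, the toy profile weights, and the summed-weight scoring are illustrative assumptions; they are not the purity weighting proposed in the article, whose formula the summary does not give.

```python
def filter_stream(profile, documents, threshold):
    """Deliver documents whose summed profile-term weights reach the threshold.

    `profile` maps terms to weights. The summed-weight score is a simple
    stand-in (an assumption), not the article's purity-based weighting.
    """
    delivered = []
    for doc_id, text in documents:
        tokens = text.lower().split()
        score = sum(profile.get(tok, 0.0) for tok in tokens)
        if score >= threshold:
            delivered.append(doc_id)
    return delivered


def linear_utility(delivered, relevant, credit=2.0, penalty=1.0):
    """Linear utility: credit * relevant delivered - penalty * non-relevant delivered.

    The defaults (2 and 1) are assumed to follow the TREC 2002 filtering track.
    """
    delivered = set(delivered)
    relevant = set(relevant)
    hits = len(delivered & relevant)
    false_alarms = len(delivered - relevant)
    return credit * hits - penalty * false_alarms


if __name__ == "__main__":
    # Hypothetical profile and document stream, for illustration only.
    profile = {"filtering": 1.0, "profile": 0.5}
    documents = [
        ("d1", "adaptive filtering of a user profile"),
        ("d2", "stock market news"),
        ("d3", "profile picture upload"),
    ]
    delivered = filter_stream(profile, documents, threshold=0.5)
    print(delivered)
    print(linear_utility(delivered, relevant={"d1"}))
```

In this toy stream, one relevant and one non-relevant document are delivered, so the credit for the hit is partly offset by the penalty for the false alarm. A filter that delivers nothing scores zero, which is why utility-based evaluation rewards a well-chosen threshold as well as good term weights.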

Aida Mustapha | Selangor Darul Ehsan | Zaiton Muda
