Transferring and Retraining Learned Information Filters

Any system that learns how to filter documents will suffer poor performance during an initial training phase. One way of addressing this problem is to exploit filters learned by other users in a collaborative fashion. We investigate "direct transfer" of learned filters in this setting--a limiting case for any collaborative learning system. We evaluate the stability of several different learning methods under direct transfer, and conclude that symbolic learning methods that use negatively correlated features of the data perform poorly in transfer, even when they perform well in more conventional evaluation settings. This effect is robust: it holds for several learning methods, when a diverse set of users is used in training the classifier, and even when the learned classifiers can be adapted to the new user's distribution. Our experiments give rise to several concrete proposals for improving generalization performance in a collaborative setting, including a beneficial variation on a feature selection method that has been widely used in text categorization.

[1]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[2]  Yoram Singer,et al.  Context-sensitive learning methods for text categorization , 1996, SIGIR '96.

[3]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[4]  Johannes Fürnkranz,et al.  Incremental Reduced Error Pruning , 1994, ICML.

[5]  Rich Caruana,et al.  Algorithms and Applications for Multitask Learning , 1996, ICML.

[6]  William W. Cohen Learning Rules that Classify E-Mail , 1996 .

[7]  Sebastian Thrun,et al.  Is Learning The n-th Thing Any Easier Than Learning The First? , 1995, NIPS.

[8]  Pattie Maes,et al.  Agents that reduce work and information overload , 1994, CACM.

[9]  R. Mike Cameron-Jones,et al.  FOIL: A Midterm Report , 1993, ECML.

[10]  David D. Lewis,et al.  A comparison of two learning algorithms for text categorization , 1994 .

[11]  Sholom M. Weiss,et al.  Automated learning of decision rules for text categorization , 1994, TOIS.

[12]  Tom M. Mitchell,et al.  Experience with a learning personal assistant , 1994, CACM.

[13]  Michael J. Pazzani,et al.  Syskill & Webert: Identifying Interesting Web Sites , 1996, AAAI/IAAI, Vol. 1.

[14]  William W. Cohen Fast Eeective Rule Induction , 1995 .

[15]  Stuart J. Russell,et al.  Machine Learning, Proceedings of the Twelfth International Conference on Machine Learning, Tahoe City, California, USA, July 9-12, 1995 , 1995, ICML.

[16]  Lorien Y. Pratt,et al.  Discriminability-Based Transfer between Neural Networks , 1992, NIPS.

[17]  SingerYoram,et al.  Context-sensitive learning methods for text categorization , 1999 .

[18]  Ken Lang,et al.  NewsWeeder: Learning to Filter Netnews , 1995, ICML.