Personalized information delivery: an analysis of information filtering methods

10345 Abstract 10346 Abstract 10347 Abstract 10348 :iltered Abstracts Abstract 10349 Abstract 1040210349 Abstract 10402 Highly rated abstracts added to document profile each month T h e score for e ach new T M was t he cos ine b e t w e e n t he T M vec tor a n d the nearest i n t e r e s t vector . T h e new T M s were t h e n r a n k e d based o n the i r m a x i m u m cosine score. T h u s , T M s o c c u r r i n g close to any p o i n t o f i n t e r e s t fo r a p a r t i c u l a r e m p l o y e e would be r a n k e d t he h ighes t . T h e same type o f c o m p a r i s o n was d o n e fo r the d o c u m e n t prof i les . Each docu m e n t in t he d o c u m e n t p rof i l e was r e p r e s e n t e d as a s epa ra t e vec tor a n d c o m p a r e d to all new T M s . T h e new T M s were t h e n r a n k e d based o n the m a x i m u m cosine for e ach T M to any d o c u m e n t in t he d o c u m e n t prof i le . T h e s e c o m p a r i s o n s r e su l t ed in f o u r r a n k o r d e r e d lists o f abstracts , o n e for e ach f i l t e r ing m e t h o d . T h e top abs t rac t s f r o m each m e t h o d were t h e n sen t to each emp loyee , w h o r a t e d t h e m fo r r e l evance to his o r h e r t echn ica l in teres ts . Fo r each m e t h o d , the t op 25% o f t he abs t rac t s r a t e d 4 or h i g h e r were t h e n i n c o r p o r a t e d in to t he e m p l o y e e s ' d o c u m e n t p ro files. 3 I n the f i rs t m o n t h o f s tudy, on ly the w o r d prof i l e was u sed s ince e m p l o y e e s h a d no t p rev ious ly indica ted which abs t rac t s to i nc lude in t h e i r d o c u m e n t prof i le . I n the subseFlOure 11. The f i l t e r i n g p r o c e s s Table 1. The four f i l ter ing methods

[1]  Edward M. Housman,et al.  State of the Art in Selective Dissemination of Information , 1970, IEEE Transactions on Engineering Writing and Speech.

[2]  Dagobert Soergel,et al.  The importance of SDI for current awareness in fields with severe scatter of information , 1979, J. Am. Soc. Inf. Sci..

[3]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[4]  Susan T. Dumais,et al.  Statistical semantics: analysis of the potential performance of keyword information systems , 1984 .

[5]  M. E. Maron,et al.  An evaluation of retrieval effectiveness for a full-text document-retrieval system , 1985, CACM.

[6]  Marcia J. Bates,et al.  Subject access in online catalogs: A design model , 1986 .

[7]  K. R. Grant,et al.  Semistructured messages are surprisingly useful for computer-supported coordination , 1987, TOIS.

[8]  Nicholas J. Belkin,et al.  Knowledge Elicitation Using Discourse Analysis , 1987, Int. J. Man Mach. Stud..

[9]  Paul B. Kantor,et al.  A Study of Information Seeking and Retrieving. III. Searchers, Searches, and Overlap* , 1988 .

[10]  Richard A. Harshman,et al.  Information retrieval using a singular value decomposition model of latent semantic structure , 1988, SIGIR '88.

[11]  Kevin Crowston,et al.  How do experienced information lens users use rules? , 1989, CHI '89.

[12]  Robert B. Allen,et al.  User Models: Theory, Method, and Practice , 1990, Int. J. Man Mach. Stud..

[13]  Gerard Salton,et al.  Improving retrieval performance by relevance feedback , 1997, J. Am. Soc. Inf. Sci..

[14]  Peter W. Foltz Using latent semantic indexing for information filtering , 1990 .

[15]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[16]  Susan T. Dumais,et al.  Improving the retrieval of information from external sources , 1991 .

[17]  W. Bruce Croft,et al.  Evaluation of an inference network-based retrieval model , 1991, TOIS.

[18]  Nicholas J. Belkin,et al.  Information filtering and information retrieval: two sides of the same coin? , 1992, CACM.