E-Newspaper classification and distribution based on user profiles and thesaurus.

Electronic mail offers the promise of rapid and primary communication of essential information. It becomes important to correctly use this technology as tool for sending an electronic version of the daily newspaper to the subscribers taking in our account their profiles. In this paper, we describe how a thesaurus-based system can simplify this task. This system provides the functions to index and retrieve a collection of E-mail messages based on thesaurus to create user profiles. The thesaurus is used not only for indexing and retrieving messages, but also for classifying E-Newspaper articles. By automatically indexing the daily newspaper articles using a thesaurus, the system can easily selects relevant E-Newspaper articles according to user profile and send an E-Mail message to the readers containing them. I. INTRODUCTION ith the advent of the World Wide Web, access to newspapers throughout the world has been revolutionized. For instance, international newspapers that may take weeks to arrive at the university, office or home can now be read over the Web the same day they are published. In addition, a far greater variety of newspapers are available over the Web than can be offered by one library. Bear in mind, however, that electronic newspapers accessible over the web have limitations: the entire print issue is rarely available, and advertisements, the classified pages, and items of local interest are often omitted; usually only the most recent issue (or the most recent week) can be electronically accessed, previous weeks, months, and years of the paper are usually not retained on the web; and the trend seems to be that more and more digital newspapers can be accessed only through a paid subscription, similar to that of a print newspaper. That we are in the age of information is evident quite clearly in newspapers as an information source. Nearly everywhere you can have newspapers delivered to your doorstep, each with hundreds of new articles each day. Nearly everywhere in

[1]  Hans-Peter Frei,et al.  Concept based query expansion , 1993, SIGIR.

[2]  Gregory Grefenstette,et al.  Use of syntactic context to produce term association lists for text retrieval , 1992, SIGIR '92.

[3]  Tomonari Kamba,et al.  ANATAGONOMY: a personalized newspaper on the World Wide Web , 1997, Int. J. Hum. Comput. Stud..

[4]  Panos Constantopoulos,et al.  A method for monolingual thesauri merging , 1997, SIGIR '97.

[5]  Carolyn J. Crouch,et al.  An approach to the automatic construction of global thesauri , 1990, Inf. Process. Manag..

[6]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[7]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[8]  David D. Lewis,et al.  An evaluation of phrasal and clustered representations on a text categorization task , 1992, SIGIR '92.

[9]  Qi Lu,et al.  Efficient Profile Matching for Large Scale Webcasting , 1998, Comput. Networks.

[10]  Gerda Ruge,et al.  Experiments on Linguistically-Based Term Associations , 1992, Inf. Process. Manag..

[11]  Thierry Lafouge,et al.  Improving Information Retrieval by Combining User Profile and Document Segmentation , 1996, Inf. Process. Manag..

[12]  Yoram Singer,et al.  Context-sensitive learning methods for text categorization , 1996, SIGIR '96.

[13]  Gerard Salton,et al.  Automatic Information Organization And Retrieval , 1968 .

[14]  Michael J. Pazzani,et al.  A hybrid user model for news story classification , 1999 .

[15]  Gregory Grefenstette,et al.  Explorations in automatic thesaurus discovery , 1994 .

[16]  Mark Claypool,et al.  Combining Content-Based and Collaborative Filters in an Online Newspaper , 1999, SIGIR 1999.

[17]  Bruce Krulwich,et al.  LIFESTYLE FINDER: Intelligent User Profiling Using Large-Scale Demographic Data , 1997, AI Mag..

[18]  Philip K. Chan,et al.  A Non-Invasive Learning Approach to Building Web User Profiles , 1999 .

[19]  Dagobert Soergel,et al.  Multilingual Thesauri in Cross-Language Text and Speech Retrieval , 1997 .

[20]  W. Bruce Croft,et al.  An Association Thesaurus for Information Retrieval , 1994, RIAO.

[21]  Roy Rada,et al.  Merging Thesauri: Principles and Evaluation , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Dirk Vervenne,et al.  Advanced document management through thesaurus-based indexing: the IKEM platform , 1999 .

[23]  Roy Rada,et al.  Augmenting thesauri for information systems , 1987, TOIS.

[24]  Ellen M. Voorhees,et al.  Using WordNet to disambiguate word senses for text retrieval , 1993, SIGIR.

[25]  Wolfgang Effelsberg,et al.  A rule-based approach for merging generalization hierarchies , 1988, Inf. Syst..