The parallel evolution of search engines and digital libraries: their convergence to the Mega-Portal

Many and varied search engines (SEs) exist on the Internet, yet it is still hard to locate and concentrate only on the materials relevant to a specific task. Digital libraries (DLs) could provide such services on the Web more effectively. There is a real need for a methodology for understanding both of these types of Web data repository. We classify SEs and DLs using similar criteria, on both a functional scale and a generational timeline. In particular, we introduce the idea of the third-generation Harvested DL and its resulting DL harvesting model. By comparing SEs and DLs and analyzing their characteristics, we find that they have much in common. Finally, we note the expected incorporation of intelligent techniques and knowledge management in fourth-generation SEs and DLs, and the expected convergence of their interfaces and structures in the fifth generation: the Mega-Portal.
