Landscaping the information space of large multi-database networks

Abstract The promises of network-accessible information are increasingly difficult to achieve. These difficulties are due to a variety of causes, such as, the rapid growth in the volume of network-available information and the increasing complexity, diversity and terminological fluctuations of the different information sources available. This paper presents a conceptual architecture for the organization information space across collections of component systems in a multi-database network that provides serendipity, exploration and contextualisation support so that users can achieve logical connections between concepts they are familiar with and schema terms employed in multi-database systems. Large-scale searching for multi-database schema information is guided by a combination of lexical, structural and semantic aspects of schema terms in order to reveal more meaning both about the contents of an information term and about its placement within the distributed information space.

[1]  H. Clements,et al.  Social Science Information Gateway (SOSIG) , 2001 .

[2]  Thomas R. Gruber,et al.  Ontolingua: a mechanism to support portable ontologies , 1991 .

[3]  Fabio Crestani,et al.  Towards data modelling in information retrieval , 1989, J. Inf. Sci..

[4]  Nicola Guarino,et al.  Formal ontology, conceptual analysis and knowledge representation , 1995, Int. J. Hum. Comput. Stud..

[5]  Maristella Agosti,et al.  A two-level hypertext retrieval model for legal data , 1991, SIGIR '91.

[6]  Craig A. Knoblock,et al.  Retrieving and Integrating Data from Multiple Information Sources , 1993, Int. J. Cooperative Inf. Syst..

[7]  Hans Weigand,et al.  Linguistic tool based information elicitation in large heterogeneous database networks , 1996 .

[8]  Tom M. Mitchell,et al.  Learning to Extract Symbolic Knowledge from the World Wide Web , 1998, AAAI/IAAI.

[9]  Norbert Fuhr,et al.  Probabilistic Models in Information Retrieval , 1992, Comput. J..

[10]  Maristella Agosti,et al.  New prospectives in information retrieval techniques: a hypertext prototype in environmental law , 1989 .

[11]  Mike P. Papazoglou Unraveling the semantics of conceptual schemas , 1995, CACM.

[12]  Peter Bruza,et al.  Two Level Hypermedia An Improved Architecture for Hypertext , 1990, DEXA.

[13]  Brian Everitt,et al.  Cluster analysis , 1974 .

[14]  Kristian J. Hammond,et al.  Combining Databases and Knowledgebases for Assisted Browsing , 1995 .

[15]  Peter B. Danzig,et al.  Harvest: A Scalable, Customizable Discovery and Access System , 1994 .

[16]  Arthur H. M. ter Hofstede,et al.  Query Formulation as an Information Retrieval Problem , 1996, Comput. J..

[17]  John F. Sowa,et al.  Conceptual Structures: Information Processing in Mind and Machine , 1983 .

[18]  Roy Rada,et al.  Ranking documents with a thesaurus , 1989, JASIS.

[19]  Ali R. Hurson,et al.  Automated resolution of semantic heterogeneity in multidatabases , 1994, TODS.

[20]  B. Pinkerton,et al.  Finding What People Want : Experiences with the WebCrawler , 1994, WWW Spring 1994.

[21]  Theo P. van der Weide,et al.  A Feedback Mechanism for Query by Navigation , 1995, Australasian Database Conference.

[22]  Paolo Merialdo,et al.  To Weave the Web , 1997, VLDB.

[23]  Jeff Heflin,et al.  Coping with Changing Ontologies in a Distributed Environment , 1999 .

[24]  Wanda Pratt,et al.  Network-Based Information Brokers , 1995 .

[25]  Yiyu Yao,et al.  A probabilistic inference model for information retrieval , 1991, Inf. Syst..

[26]  Maristella Agosti,et al.  A Hypertext Environment for Interacting with Large Textual Databases , 1992, Inf. Process. Manag..

[27]  Dennis McLeod,et al.  The design and experimental evaluation of an information discovery mechanism for networks of autonomous database systems , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[28]  Timothy C. Craven Linked phrase indexing , 1978, Inf. Process. Manag..

[29]  John Kirriemuir,et al.  Cross-Searching Subject Gateways. D-Lib Magazine , 1998 .

[30]  Dan Brickley,et al.  Cross-Searching Subject Gateways: The Query Routing and Forward Knowledge Approach , 1998, D Lib Mag..

[31]  Mike P. Papazoglou,et al.  Pro-active Information Elicitation in Wide-Area Information Networks , 1996, CODAS.

[32]  Mike P. Papazoglou,et al.  A Scalable Architecture for Autonomous Heterogeneous Database Interactions , 1995, VLDB.

[33]  Silvana Castano,et al.  Semantic dictionary design for database interoperability , 1997, Proceedings 13th International Conference on Data Engineering.

[34]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[35]  Daniel Kuokka,et al.  Supporting Information Retrieval via Matchmaking , 1995 .

[36]  Victor R. Lesser,et al.  Cooperative information-gathering: a distributed problem-solving approach , 1997, IEE Proc. Softw. Eng..

[37]  R GruberThomas Toward principles for the design of ontologies used for knowledge sharing , 1995 .

[38]  Mark A. Sheldon Content routing: a scalable architecture for network-based information discovery , 1995 .

[39]  Vipul Kashyap,et al.  Semantic heterogeneity in global information systems: The role of metadata , 1996 .

[40]  Joann J. Ordille,et al.  Querying Heterogeneous Information Sources Using Source Descriptions , 1996, VLDB.

[41]  Peter Bruza,et al.  Hyperindices: A Novel Aid for Searching in Hypermedia , 1992, ECHT.

[42]  Hsinchun Chen,et al.  Interactive term suggestion for users of digital libraries: using subject thesauri and co-occurrence lists for information retrieval , 1996, DL '96.

[43]  Peter Bruza,et al.  Preferential Models of Query by Navigation , 1998 .

[44]  Henderik Alex Proper,et al.  What Is Information Discovery About? , 1999, J. Am. Soc. Inf. Sci..

[45]  Debra A Hiom,et al.  The Social Science Information Gateway. , 1995 .

[46]  Terje Brasethvik,et al.  A semantic modeling approach to metadata , 1998, Internet Res..

[47]  Chanathip Namprempre,et al.  HyPursuit: a hierarchical network search engine that exploits content-link hypertext clustering , 1996, HYPERTEXT '96.

[48]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[49]  Mike P. Papazoglou,et al.  Content-Based Organization of the Information Space in Multi-Database Networks , 1998, CAiSE.

[50]  Gerald Kowalski,et al.  Information Retrieval Systems: Theory and Implementation , 1997 .