Complex relationships and knowledge discovery support in the InfoQuilt system

Abstract. Support for semantic content is becoming more common in Web-accessible information systems. We see this support emerging with the use of ontologies and machine-readable, annotated documents. The practice of domain modeling coupled with the extraction of domain-specific, contextually relevant metadata also supports the use of semantics. These advancements enable knowledge discovery approaches that define complex relationships between data that is autonomously collected and managed. The InfoQuilt (One of the incarnations of the InfoQuilt system, as applied to the geographic information as part of the NSF Digital Library II initiative is the ADEPT-UGA system [Ade]. This research was funded in part by National Science Foundation grant IIS-9817432.) system supports one such knowledge discovery approach. This paper presents (parts of) the InfoQuilt system with the focus on its use for modeling and utilizing complex semantic inter-domain relationships to enable human-assisted knowledge discovery over Web-accessible heterogeneous data. This includes the specification and execution of Information Scale (IScapes), a semantically rich information request and correlation mechanism.

[1]  Amit P. Sheth,et al.  Infoharness: Managing Distributed, Heterogeneous Information , 1999, IEEE Internet Comput..

[2]  Amit P. Sheth,et al.  Managing Semantic Content for the Web , 2002, IEEE Internet Comput..

[3]  Wendy G. Lehnert,et al.  Information extraction , 1996, CACM.

[4]  Stephen Soderland,et al.  Learning Information Extraction Rules for Semi-Structured and Free Text , 1999, Machine Learning.

[5]  Laura M. Haas,et al.  Transforming Heterogeneous Data with Database Middleware: Beyond Integration , 1999, IEEE Data Eng. Bull..

[6]  Raymond J. Mooney,et al.  Relational Learning of Pattern-Match Rules for Information Extraction , 1999, CoNLL.

[7]  Amit P. Sheth,et al.  Data Semantics: What, Where, and How? , 1995 .

[8]  Ion Muslea,et al.  Extraction Patterns for Information Extraction Tasks: A Survey , 1999 .

[9]  Joann J. Ordille,et al.  Querying Heterogeneous Information Sources Using Source Descriptions , 1996, VLDB.

[10]  Amit P. Sheth Panel: Data Semantics: what, where and how? , 1995, DS-6.

[11]  Gerd Stumme,et al.  FCA-MERGE: Bottom-Up Merging of Ontologies , 2001, IJCAI.

[12]  Vipul Kashyap,et al.  OBSERVER: An Approach for Query Processing in Global Information Systems Based on Interoperation Across Pre-Existing Ontologies , 2000, Distributed and Parallel Databases.

[13]  Wolfgang Wahlster,et al.  Creating of Semantic Web , 2022 .

[14]  Craig A. Knoblock,et al.  Query processing in the SIMS information mediator , 1997 .

[15]  Craig A. Knoblock,et al.  The Ariadne Approach to Web-Based Information Integration , 2001, Int. J. Cooperative Inf. Syst..

[16]  Divesh Srivastava,et al.  Data model and query evaluation in global information systems , 1995, Journal of Intelligent Information Systems.

[17]  Amit P. Sheth,et al.  Logical information modeling of Web-accessible heterogeneous digital assets , 1998, Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries -ADL'98-.

[18]  Amit P. Sheth,et al.  Semantic Content Management for Enterprises and the Web , 2001 .

[19]  Maria T. Pazienza,et al.  Information Extraction , 2002, Lecture Notes in Computer Science.

[20]  Sriram Lakshminarayan SEMANTIC INTEROPERABILITY IN DIGITAL LIBRARIES USING INTER-ONTOLOGICAL RELATIONSHIPS , 2000 .

[21]  Craig A. Knoblock,et al.  Planning by Rewriting: E ciently Generating High-Quality Plans , 1999 .

[22]  Gio Wiederhold,et al.  Value-added Mediation in Large-Scale Information Systems , 1995, DS-6.

[23]  Vipul Kashyap,et al.  Information Brokering Across Heterogeneous Digital Data , 2000, Advances in Database Systems.

[24]  Frederico T. Fonseca,et al.  Semantic Granularity in Ontology-Driven Geographic Information Systems , 2002, Annals of Mathematics and Artificial Intelligence.

[25]  Dilpreet Singh,et al.  AN AGENT BASED ARCHITECTURE FOR QUERY PLANNING AND COST MODELING OF WEB SOURCES , 2000 .

[26]  Terence R. Smith,et al.  The Alexandria digital earth prototype , 2001, JCDL '01.

[27]  Ellen Riloff,et al.  Automatically Constructing a Dictionary for Information Extraction Tasks , 1993, AAAI.

[28]  Amit P. Sheth,et al.  Planning and Optimizing Semantic Information Requests Using Domain Modeling and Resource Characteristics , 2001, CoopIS.

[29]  Jennifer Widom,et al.  The TSIMMIS Approach to Mediation: Data Models and Languages , 1997, Journal of Intelligent Information Systems.

[30]  Calton Pu,et al.  XWRAP: an XML-enabled wrapper construction system for Web information sources , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[31]  Arnaud Sahuguet,et al.  WysiWyg Web Wrapper Factory (W4F) , 1999 .

[32]  Vipul Kashyap,et al.  Semantic Information Brokering: How Can a Multi-agent Approach Help? , 1999, CIA.

[33]  Jennifer Widom,et al.  The TSIMMIS Project: Integration of Heterogeneous Information Sources , 1994, IPSJ.

[34]  Michel Klein,et al.  Combining and relating ontologies: an analysis of problems and solutions , 2001, OIS@IJCAI.

[35]  George R. Krupka SRA: Description of the SRA System as Used for MUC-6 , 1995, MUC.

[36]  Sasikumar Mukundan,et al.  Spinning the Semantic Web , 2004 .

[37]  Mike P. Papazoglou,et al.  Cooperative Information Systems : Current Trends and Directions , 1997 .

[38]  Vipul Kashyap,et al.  InfoSleuth: agent-based semantic integration of information in open and dynamic environments , 1997, SIGMOD '97.

[39]  Jerry R. Hobbs,et al.  Accessing Information and Services on the DAML-Enabled Web , 2001, SemWeb.

[40]  Vipul Kashyap,et al.  Media-independent correlation of Information: What? How? , 1996, MD.

[41]  Arnaud Sahuguet,et al.  W4f: a Wysiwyg Web Wrapper Factory for Minute-made Wrappers , 1998 .

[42]  B. Harris,et al.  Urban Growth Models , 2001 .

[43]  A. Sheth,et al.  Information Brokering Across Heterogeneous Digital Data , 2000, Advances in Database Systems.

[44]  Sibel Adali,et al.  A uniform framework for integrating knowledge in heterogeneous knowledge systems , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[45]  Vipul Kashyap,et al.  So Far (Schematically) yet So Near (Semantically) , 1992, DS-5.

[46]  Hector Garcia-Molina,et al.  Extracting Semistructured Information from the Web. , 1997 .

[47]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[48]  Vipul Kashyap,et al.  Semantic and schematic similarities between database objects: a context-based approach , 1996, The VLDB Journal.

[49]  Gio Wiederhold,et al.  Mediators in the architecture of future information systems , 1992, Computer.

[50]  Vipul Kashyap,et al.  Semantic heterogeneity in global information systems: The role of metadata , 1996 .

[51]  Craig A. Knoblock,et al.  Query reformulation for dynamic information integration , 1996, Journal of Intelligent Information Systems.

[52]  David Fisher,et al.  CRYSTAL: Inducing a Conceptual Dictionary , 1995, IJCAI.