SEAL - Tying Up Information Integration and Web Site Management by Ontologies

Community web sites exhibit two dominating properties: They often need to integrate many different information sources and they require an adequate web site management system. SEAL (SEmantic portAL) is a conceptual model that exploits ontologies for fulfilling the requirements set forth by these two properties at once. The ontology provides a high level of sophistication for web information integration as well as for web site management. We describe the SEAL conceptual architecture as well as its current implementation in KAON.

[1]  S. Boag,et al.  XQuery 1.0 : An XML query language, W3C Working Draft 12 November 2003 , 2003 .

[2]  Ian Horrocks,et al.  Ontology Reasoning in the SHOQ(D) Description Logic , 2001, IJCAI.

[3]  Susan T. Dumais,et al.  Hierarchical classification of Web content , 2000, SIGIR '00.

[4]  Anand Rajaraman,et al.  Answering Queries Using Limited External Processors. , 1996, ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems.

[5]  Sebastian Thrun,et al.  Text Classification from Labeled and Unlabeled Documents using EM , 2000, Machine Learning.

[6]  Stefano Paraboschi,et al.  Data-Driven, One-To-One Web Site Generation for Data-Intensive Applications , 1999, VLDB.

[7]  Laura M. Haas,et al.  Towards heterogeneous multimedia information systems: the Garlic approach , 1995, Proceedings RIDE-DOM'95. Fifth International Workshop on Research Issues in Data Engineering-Distributed Object Management.

[8]  Dan Suciu,et al.  Query containment for conjunctive queries with regular expressions , 1998, PODS.

[9]  Eugene J. Shekita,et al.  Querying XML Views of Relational Data , 2001, VLDB.

[10]  Serge Abiteboul,et al.  Adaptive on-line page importance computation , 2003, WWW '03.

[11]  Alan M. Frieze,et al.  A General Model of Undirected Web Graphs , 2001, ESA.

[12]  Diego Calvanese,et al.  Information integration: conceptual modeling and reasoning support , 1998, Proceedings. 3rd IFCIS International Conference on Cooperative Information Systems (Cat. No.98EX122).

[13]  Soumen Chakrabarti,et al.  Accelerated focused crawling through online relevance feedback , 2002, WWW.

[14]  Joann J. Ordille,et al.  Querying Heterogeneous Information Sources Using Source Descriptions , 1996, VLDB.

[15]  Raphael Volz,et al.  Migrating data-intensive web sites into the Semantic Web , 2002, SAC '02.

[16]  Sylvie Ranwez,et al.  Ontology-supported and ontology-driven conceptual navigation on the World Wide Web , 2000, HYPERTEXT '00.

[17]  Nick Roussopoulos,et al.  Interoperability of multiple autonomous databases , 1990, CSUR.

[18]  Deborah L. McGuinness Ontologies for Electronic Commerce , 2003 .

[19]  Dan Suciu,et al.  Declarative specification of Web sites with Strudel , 2000, The VLDB Journal.

[20]  Jennifer Widom,et al.  The TSIMMIS Approach to Mediation: Data Models and Languages , 1997, Journal of Intelligent Information Systems.

[21]  Michael R. Genesereth,et al.  The Conceptual Basis for Mediation Services , 1997, IEEE Expert.

[22]  Franz Baader,et al.  A Scheme for Integrating Concrete Domains into Concept Languages , 1991, IJCAI.

[23]  Russ Bubley,et al.  Randomized algorithms , 1995, CSUR.

[24]  Ian Horrocks,et al.  The Semantic Web: The Roles of XML and RDF , 2000, IEEE Internet Comput..

[25]  James P. Callan,et al.  Query-based sampling of text databases , 2001, TOIS.

[26]  Hector Garcia-Molina,et al.  Template-based wrappers in the TSIMMIS system , 1997, SIGMOD '97.

[27]  David J. DeWitt,et al.  Following the paths of XML Data: An algebraic framework for XML query evaluation , 2001 .

[28]  James P. Callan,et al.  Automatic discovery of language models for text databases , 1999, SIGMOD '99.

[29]  Bertram Ludäscher,et al.  Navigation-Driven Evaluation of Virtual Mediated Views , 2000, EDBT.

[30]  Marco Gori,et al.  Focused Crawling Using Context Graphs , 2000, VLDB.

[31]  Guido Moerkotte,et al.  Evaluating queries with generalized path expressions , 1996, SIGMOD '96.

[32]  Guijun Wang,et al.  ProFusion*: Intelligent Fusion from Multiple, Distributed Search Engines , 1996, J. Univers. Comput. Sci..

[33]  Gio Wiederhold,et al.  Intelligent integration of information , 1993, SIGMOD Conference.

[34]  Ian Horrocks,et al.  OilEd: a Reason-able Ontology Editor for the Semantic Web , 2001, Description Logics.

[35]  Alon Y. Halevy,et al.  The nimble integration engine , 2001, SIGMOD '01.

[36]  Laura M. Haas,et al.  Optimizing Queries Across Diverse Data Sources , 1997, VLDB.

[37]  Yannis Papakonstantinou,et al.  Query rewriting for semistructured data , 1999, SIGMOD '99.

[38]  Sophie Cluet,et al.  Your mediators need data conversion! , 1998, SIGMOD '98.

[39]  Valter Crescenzi,et al.  The (Short) Araneus Guide to Web-Site Development , 1999, WebDB.

[40]  Clement T. Yu,et al.  Concept hierarchy based text database categorization in a metasearch engine environment , 2000, Proceedings of the First International Conference on Web Information Systems Engineering.

[41]  Norbert Fuhr,et al.  A probabilistic relational algebra for the integration of information retrieval and database systems , 1997, TOIS.

[42]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[43]  Yannis Papakonstantinou,et al.  XML query forms (XQForms): declarative specification of XML query interfaces , 2001, WWW '01.

[44]  Werner Nutt,et al.  The Complexity of Concept Languages , 1997, KR.

[45]  E Davio,et al.  The times they are a'changin'. , 1970, The Journal of practical nursing.

[46]  Jorma Rissanen,et al.  Stochastic Complexity in Statistical Inquiry , 1989, World Scientific Series in Computer Science.

[47]  Krishna Bharat,et al.  Improved algorithms for topic distillation in a hyperlinked environment , 1998, SIGIR '98.

[48]  Andrew McCallum,et al.  Using Reinforcement Learning to Spider the Web Efficiently , 1999, ICML.

[49]  Jennifer Widom,et al.  Object exchange across heterogeneous information sources , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[50]  Paolo Paolini,et al.  A Conceptual Model and a Tool Environment for Developing More Scalable, Dynamic, and Customizable Web Applications , 1998, EDBT.

[51]  Ian Horrocks,et al.  OIL: An Ontology Infrastructure for the Semantic Web , 2001, IEEE Intell. Syst..

[52]  Rudi Studer,et al.  How to structure and access XML documents with ontologies , 2001, Data Knowl. Eng..

[53]  Laura M. Haas,et al.  Capabilities-Based Query Rewriting in Mediator Systems , 2004, Distributed and Parallel Databases.

[54]  Dan Suciu,et al.  Aggregation and Accumulation of XML Data. , 2001 .

[55]  Diego Calvanese,et al.  Rewriting of regular expressions and regular path queries , 1999, PODS '99.

[56]  Yiming Yang,et al.  A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.

[57]  David J. DeWitt,et al.  Relational Databases for Querying XML Documents: Limitations and Opportunities , 1999, VLDB.

[58]  Volker Haarslev,et al.  High Performance Reasoning with Very Large Knowledge Bases: A Practical Case Study , 2000, IJCAI.

[59]  Ravi Kumar,et al.  On Semi-Automated Web Taxonomy Construction , 2001, WebDB.

[60]  Andreas Witt,et al.  Lessons Learned from Applying AI to the Web , 2000, Int. J. Cooperative Inf. Syst..

[61]  Jennifer Widom,et al.  Database Systems: The Complete Book , 2001 .

[62]  Gerhard Weikum,et al.  The Index-Based XXL Search Engine for Querying XML Data with Relevance Ranking , 2002, EDBT.

[63]  Taher H. Haveliwala Efficient Computation of PageRank , 1999 .

[64]  Jon M. Kleinberg,et al.  Mining the Web's Link Structure , 1999, Computer.

[65]  Deborah L. McGuinness,et al.  The Chimaera Ontology Environment , 2000, AAAI/IAAI.

[66]  Amar Gupta,et al.  Integration of Information Systems: Bridging Heterogeneous Databases , 1989 .

[67]  Dieter Fensel,et al.  Ontobroker: Ontology Based Access to Distributed and Semi-Structured Information , 1999, DS-8.

[68]  Sriram Raghavan,et al.  Crawling the Hidden Web , 2001, VLDB.

[69]  Martin van den Berg,et al.  Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery , 1999, Comput. Networks.

[70]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[71]  Yannis Papakonstantinou,et al.  The Enosys Markets data integration platform: lessons from the trenches , 2001, CIKM '01.

[72]  Soumen Chakrabarti,et al.  Integrating the document object model with hyperlinks for enhanced topic distillation and information extraction , 2001, WWW '01.

[73]  Deborah L. McGuinness,et al.  Matching in Description Logics , 1999, J. Log. Comput..

[74]  Ian Horrocks,et al.  Using an Expressive Description Logic: FaCT or Fiction? , 1998, KR.

[75]  Hamid Pirahesh,et al.  Extensible query processing in starburst , 1989, SIGMOD '89.

[76]  Vassilis Christophides,et al.  On wrapping query languages and efficient XML integration , 2000, SIGMOD '00.

[77]  Alon Y. Halevy,et al.  Declarative Web Site Management with Tiramisu , 1999, WebDB.

[78]  Henry F. Korth,et al.  Query Languages for Nested Relational Databases , 1987, NF².

[79]  Ian Horrocks The FaCT System , 1998, TABLEAUX.

[80]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[81]  Dan Suciu,et al.  Efficient evaluation of XML middle-ware queries , 2001, SIGMOD '01.

[82]  Steffen Staab,et al.  The Times They Are A-Changin' - The Corporate History Analyzer , 2000, PAKM.

[83]  Peter F. Patel-Schneider,et al.  DLP System Description , 1998, Description Logics.

[84]  Steffen Staab,et al.  Leveraging Corporate Skill Knowledge - From ProPer to OntoProPer , 2000, PAKM.

[85]  Dan Brickley,et al.  Resource Description Framework (RDF) Model and Syntax Specification , 2002 .

[86]  Luis Gravano,et al.  Probe, count, and classify: categorizing hidden web databases , 2001, SIGMOD '01.

[87]  Ian Horrocks,et al.  Practical Reasoning for Expressive Description Logics , 1999, LPAR.

[88]  Stefano Ceri,et al.  Web Modeling Language (WebML): a modeling language for designing Web sites , 2000, Comput. Networks.

[89]  Anand Rajaraman,et al.  Answering queries using templates with binding patterns (extended abstract) , 1995, PODS.

[90]  Hector Garcia-Molina,et al.  Efficient Crawling Through URL Ordering , 1998, Comput. Networks.

[91]  C. Lee Giles,et al.  Accessibility of information on the web , 1999, Nature.

[92]  Dan Suciu,et al.  A query language and optimization techniques for unstructured data , 1996, SIGMOD '96.

[93]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[94]  Divyakant Agrawal,et al.  Scalable collection summarization and selection , 1999, DL '99.

[95]  Norbert Fuhr,et al.  XIRQL: a query language for information retrieval in XML documents , 2001, SIGIR '01.

[96]  Carole A. Goble,et al.  Conceptual Open Hypermedia = The Semantic Web? , 2001, SemWeb.

[97]  Gustavo Rossi,et al.  Navigating between objects. Lessons from an object-oriented framework perspective , 2000, CSUR.

[98]  Yannis Papakonstantinou,et al.  Expressive Capabilities Description Languages and Query Rewriting Algorithms , 2000, J. Log. Program..

[99]  Henrik Eriksson,et al.  Knowledge modeling at the millennium : The design and evolution of Protégé-2000 , 1999 .

[100]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[101]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[102]  Hamid Pirahesh,et al.  Efficiently publishing relational data as XML documents , 2001, The VLDB Journal.

[103]  Daphne Koller,et al.  Hierarchically Classifying Documents Using Very Few Words , 1997, ICML.

[104]  Sivan Toledo,et al.  Improving the memory-system performance of sparse-matrix vector multiplication , 1997, IBM J. Res. Dev..

[105]  Yannis Papakonstantinou,et al.  Object Fusion in Mediator Systems , 1996, VLDB.

[106]  Asunción Gómez-Pérez,et al.  (KA)2: building ontologies for the Internet: a mid-term report , 1999, Int. J. Hum. Comput. Stud..

[107]  Guido Moerkotte,et al.  Nested Queries in Object Bases , 1993, DBPL.

[108]  Volker Haarslev,et al.  RACER System Description , 2001, IJCAR.

[109]  Soumen Chakrabarti,et al.  Enhanced topic distillation using text, markup tags, and hyperlinks , 2001, SIGIR '01.

[110]  Ralf Küsters,et al.  Computing the Least Common Subsumer and the Most Specific Concept in the Presence of Cyclic ALN-Concept Descriptions , 1998, KI.

[111]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[112]  Abraham Silberschatz,et al.  Extended algebra and calculus for nested relational databases , 1988, TODS.

[113]  Michael Kifer,et al.  Logical foundations of object-oriented and frame-based languages , 1995, JACM.

[114]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[115]  Gerhard Weikum,et al.  The BINGO! focused crawler: from bookmarks to archetypes , 2002, Proceedings 18th International Conference on Data Engineering.

[116]  Steffen Staab,et al.  Semantic community Web portals , 2000, Comput. Networks.

[117]  Stephen Fox,et al.  Heterogeneous distributed database systems for production use , 1990, ACM Comput. Surv..

[118]  François Bancilhon,et al.  A Query Language for the O2 Object-Oriented Database System , 1989, DBPL.