Security, caching and self-management in distributed information systems

In this thesis, we investigate three different aspects of distributed information systems: security, caching, and self-management. We describe our concept of a security system for distributed and open systems using our query processing system ObjectGlobe as an example. One part of this concept is our OperatorCheck server, which validates the semantics of an operator and analyzes its quality before the operator is actually used in queries. This is done semi-automatically using an oracle-based approach to compare a formal specification of an operator against its implementation. Further security measures are integrated into the query processing engine: secure communication channels are established, authentication and authorization are performed, and overload situations are avoided by admission control. Operators are guarded using Java's security model to prevent unauthorized resource access and leakage of data. The resource consumption of operators is monitored and limited to avoid resource monopolization. We present a semantic caching scheme suitable for caching responses from Web services on the SOAP protocol level. Web services are typically described using WSDL documents. For semantic caching we developed an XML-based declarative language to annotate WSDL documents with information about the caching-relevant semantics of requests and responses. Using this information, our semantic cache answers requests based on the responses of similar previously executed requests. Performance experiments---based on the scenarios of TPC-W and TPC-W Version 2---conducted using our prototype implementation demonstrate the effectiveness of the proposed semantic caching scheme. We present a novel autonomic computing concept which is hiding the ever increasing complexity of managing IT infrastructures. For this purpose, we virtualize, pool, and monitor hardware to provide a dynamic computing infrastructure. A fuzzy-logic-based controller supervises all services running on this virtual platform. Higher-level services such as business applications profit from running on this platform. For example, failed services are restarted automatically. A service overload is detected and remedied by either starting additional service instances or by moving the service to a more powerful server. The capabilities and constraints of the services and the hardware environment are specified in a declarative XML language.

[1]  Chris I. Dalton,et al.  An operating system approach to securing e-services , 2001, CACM.

[2]  Gerhard Weikum,et al.  The LRU-K page replacement algorithm for database disk buffering , 1993, SIGMOD Conference.

[3]  Michael W. Godfrey,et al.  Secure and portable database extensibility , 1998, SIGMOD '98.

[4]  Gerhard Weikum,et al.  Self-tuning Database Technology and Information Services: from Wishful Thinking to Viable Engineering , 2002, VLDB.

[5]  Goetz Graefe,et al.  Query evaluation techniques for large databases , 1993, CSUR.

[6]  Jeffrey D. Ullman,et al.  Principles of Database and Knowledge-Base Systems, Volume II , 1988, Principles of computer science series.

[7]  Alfons Kemper,et al.  A Framework for Context-Aware Adaptable Web Services , 2004, EDBT.

[8]  Alfons Kemper,et al.  ObjectGlobe: Ubiquitous query processing on the Internet , 2001, The VLDB Journal.

[9]  George J. Klir,et al.  Fuzzy sets and fuzzy logic - theory and applications , 1995 .

[10]  Alfons Kemper,et al.  A publish and subscribe architecture for distributed metadata management , 2002, Proceedings 18th International Conference on Data Engineering.

[11]  Alfons Kemper,et al.  Quality of service in an information economy , 2003, TOIT.

[12]  Guido Moerkotte,et al.  A Blackboard Architecture for Query Optimization in Object Bases , 1993, VLDB.

[13]  Hamid Pirahesh,et al.  Starburst Mid-Flight: As the Dust Clears , 1990, IEEE Trans. Knowl. Data Eng..

[14]  Michael Ian Shamos,et al.  Computational geometry: an introduction , 1985 .

[15]  Richard S. Bird,et al.  Introduction to functional programming , 1988, Prentice Hall International series in computer science.

[16]  Tony Bourke Server Load Balancing , 2001 .

[17]  Alfons Kemper,et al.  Security for Distributed E-Service Composition , 2001, TES.

[18]  Steven J. DeRose,et al.  XML Path Language (XPath) Version 1.0 , 1999 .

[19]  Daniela Florescu,et al.  XL: an XML programming language for web service specification and composition , 2002, Comput. Networks.

[20]  Ming Zhang,et al.  Autonomia: an autonomic computing environment , 2003, Conference Proceedings of the 2003 IEEE International Performance, Computing, and Communications Conference, 2003..

[21]  Alfons Kemper,et al.  Sicherheit in einem Java-basierten verteilten System autonomer Objekte , 1999, BTW.

[22]  Lei Gao,et al.  Application specific data replication for edge services , 2003, WWW '03.

[23]  Scott Oaks,et al.  Java Security , 1998 .

[24]  Dongwon Lee,et al.  Semantic caching via query matching for web sources , 1999, CIKM '99.

[25]  Christopher Allen,et al.  The TLS Protocol Version 1.0 , 1999, RFC.

[26]  Zhen Li,et al.  AutoMate: Enabling Autonomic Applications on the Grid , 2006, Cluster Computing.

[27]  Matthew MacDonald,et al.  Web Services Architecture , 2004 .

[28]  Gerhard Weikum,et al.  The Web in 2010: Challenges and Opportunities for Database Research , 2001, Informatics.

[29]  Russ Housley,et al.  Internet X.509 Public Key Infrastructure Certificate and CRL Profile , 1999, RFC.

[30]  Wei Sun,et al.  Solving satisfiability and implication problems in database systems , 1996, TODS.

[31]  C. M. Sperberg-McQueen,et al.  Extensible Markup Language (XML) , 1997, World Wide Web J..

[32]  M. Tamer Özsu,et al.  Evaluation of Strong Consistency Web Caching Techniques , 2002, World Wide Web.

[33]  Anees Shaikh,et al.  Enhancing Web Performance , 2002, Communication Systems: The State of the Art.

[34]  Dennis Shasha,et al.  2Q: A Low Overhead High Performance Buffer Management Replacement Algorithm , 1994, VLDB.

[35]  Prashant J. Shenoy,et al.  Cooperative leases: scalable consistency maintenance in content distribution networks , 2002, WWW.

[36]  Lada A. Adamic,et al.  Zipf's law and the Internet , 2002, Glottometrics.

[37]  Anne H. H. Ngu,et al.  Declarative composition and peer-to-peer provisioning of dynamic Web services , 2002, Proceedings 18th International Conference on Data Engineering.

[38]  Alfons Kemper,et al.  Flexible and Reliable Web Service Execution , 2002 .

[39]  Gottfried Vossen,et al.  Web & Datenbanken. Konzepte, Architekturen, Anwendungen , 2003 .

[40]  Alfons Kemper,et al.  ServiceGlobe: Flexible and Reliable Web Services on the Internet , 2003, WWW.

[41]  Boris Chidlovskii,et al.  Semantic caching of Web queries , 2000, The VLDB Journal.

[42]  Michael Stonebraker,et al.  The design of POSTGRES , 1986, SIGMOD '86.

[43]  Michael J. Franklin,et al.  Cache investment: integrating query optimization and distributed data placement , 2000, TODS.

[44]  J. Hopcroft,et al.  Independence results in computer science , 1976, SIGA.

[45]  Alfons Kemper,et al.  Towards context-aware adaptable web services , 2004, WWW Alt. '04.

[46]  Richard S. Bird,et al.  Introduction to functional programming using haskeu , 1998 .

[47]  Volker Markl,et al.  LEO: An autonomic query optimizer for DB2 , 2003, IBM Syst. J..

[48]  D. Agrawal,et al.  View Invalidation for Dynamic Content Caching in Multitiered Architectures , 2002, Very Large Data Bases Conference.

[49]  Gio Wiederhold,et al.  Intelligent integration of information , 1993, SIGMOD Conference.

[50]  Kevin Wilkinson,et al.  The Iris Architecture and Implementation , 1990, IEEE Trans. Knowl. Data Eng..

[51]  Petr Jan Horn,et al.  Autonomic Computing: IBM's Perspective on the State of Information Technology , 2001 .

[52]  Alfons Kemper,et al.  ServiceGlobe: Distributing E-Services Across the Internet , 2002, VLDB.

[53]  Valérie Issarny,et al.  Caching Strategies for Data-Intensive Web Sites , 2000, VLDB.

[54]  Birgitta König-Ries,et al.  Strategies for Semantic Caching , 2001, DEXA.

[55]  Robert Breton Replication Strategies for High Availability and Disaster Recovery , 1998, IEEE Data Eng. Bull..

[56]  Regina Dunlea,et al.  Simple Object Access Protocol (SOAP) , 2005 .

[57]  Michael J. Carey,et al.  Reducing the Braking Distance of an SQL Query Engine , 1998, VLDB.

[58]  Edsger W. Dijkstra,et al.  Structured programming , 1972, A.P.I.C. Studies in data processing.

[59]  Alfons Kemper,et al.  Building Scalable Electronic Market Places Using HyperQuery-Based Distributed Query Processing , 2005, World Wide Web.

[60]  Alfons Kemper,et al.  Reliable Web Service Execution and Deployment in Dynamic Environments , 2003, TES.

[61]  D. Box,et al.  Simple object access protocol (SOAP) 1.1 , 2000 .

[62]  Philip S. Yu,et al.  The state of the art in locally distributed Web-server systems , 2002, CSUR.

[63]  Sam Lightstone,et al.  SMART: Making DB2 (More) Autonomic , 2002, VLDB.

[64]  Glenford J. Myers,et al.  Art of Software Testing , 1979 .

[65]  David J. DeWitt,et al.  A performance study of three high availability data replication strategies , 2005, Distributed and Parallel Databases.

[66]  Alfons Kemper,et al.  ObjectGlobe: Open Distributed Query Processing Services on the Internet , 2001, IEEE Data Eng. Bull..

[67]  Dongwon Lee,et al.  Towards Intelligent Semantic Caching for Web Sources , 2001, Journal of Intelligent Information Systems.

[68]  Alfons Kemper,et al.  Kontext-basierte Personalisierung von Web Services , 2003, BTW.

[69]  Thorsten von Eicken,et al.  Resource Control for Database Extensions , 1998 .

[70]  Peter C. Lockemann,et al.  The Network as a Global Database: Challenges of Interoperability, Proactivity, Interactiveness, Legacy , 1997, VLDB.

[71]  Alfons Kemper,et al.  Hyperqueries: Dynamic Distributed Query Processing on the Internet , 2001, VLDB.

[72]  Divesh Srivastava,et al.  Semantic Data Caching and Replacement , 1996, VLDB.

[73]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.

[74]  Douglas B. Terry,et al.  Caching XML Web Services for Mobility , 2003, ACM Queue.

[75]  David J. DeWitt,et al.  The Wisconsin Benchmark: Past, Present, and Future , 1991, The Benchmark Handbook.