Evaluations of architectural designs and implementation for database-driven web sites

Response time is a key differentiation point among electronic commerce (e-commerce) applications. For many e-commerce applications, Web pages are created dynamically based on the current business state, stored in database systems. The architecture of database-driven e-commerce Web sites are more complex than that of typical Web sites. It requires integration of Web servers, application servers, and back-end database systems as well as synchronization of multiple databases if caches are used for acceleration of content delivery. In this paper, we analyze the factors that impact the performance and scalability of a database-driven Web site. We experimentally test (1) the performance metrics of database update, query, and synchronization; (2) trigger overhead; and (3) application server (AS) and database connection overhead and constraints. We describe several architectural design approaches for database-driven Web sites and present experimental results on their performance under various conditions, including varying request rates, update-to-request ratio, cache hit ratio, number of regional AS/data cache suites deployed, and database size. We also discuss how to handle Web page requests that involve SSL, cookies, and fragment pages (i.e. by frame or JSP).

[1]  Eric Levy-Abegnoli,et al.  Design and performance of a Web server accelerator , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[2]  Alexandros Labrinidis,et al.  On the Materialization of WebViews , 1999, WebDB.

[3]  Alexandros Labrinidis,et al.  Adaptive WebView Materialization , 2001, WebDB.

[4]  Arun Iyengar,et al.  A Scalable and Highly Available System for Serving Dynamic Data at Frequently Accessed Web Sites , 1998, Proceedings of the IEEE/ACM SC98 Conference.

[5]  Mark Crovella,et al.  On the network impact of dynamic server selection , 1999, Comput. Networks.

[6]  Divyakant Agrawal,et al.  Cache Portal: Technology for Accelerating Database-driven e-commerce Web Sites , 2001, VLDB.

[7]  Michael Dahlin,et al.  Coordinated Placement and Replacement for Large-Scale Distributed Caches , 2002, IEEE Trans. Knowl. Data Eng..

[8]  Pablo Rodriguez,et al.  SPREAD: Scalable platform for reliable and efficient automated distribution , 2000, Comput. Networks.

[9]  Michael Dahlin,et al.  Design considerations for distributed caching on the Internet , 1999, Proceedings. 19th IEEE International Conference on Distributed Computing Systems (Cat. No.99CB37003).

[10]  Balachander Krishnamurthy,et al.  Analyzing factors that influence end-to-end Web performance , 2000, Comput. Networks.

[11]  Fred Douglis,et al.  HPP: HTML Macro-Preprocessing to Support Dynamic Document Caching , 1997, USENIX Symposium on Internet Technologies and Systems.

[12]  Scott Shenker,et al.  A scalable Web cache consistency architecture , 1999, SIGCOMM '99.

[13]  Arun Iyengar,et al.  A scalable system for consistently caching dynamic Web data , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[14]  Margo I. Seltzer,et al.  World Wide Web Cache Consistency , 1996, USENIX Annual Technical Conference.

[15]  Chengjie Liu,et al.  Maintaining Strong Cache Consistency in the World Wide Web , 1998, IEEE Trans. Computers.

[16]  Dennis Shasha,et al.  The dangers of replication and a solution , 1996, SIGMOD '96.

[17]  Gerti Kappel,et al.  The TriGS active object-oriented database system— an overview , 1998, SGMD.

[18]  Tao Yang,et al.  Exploiting Result Equivalence in Caching Dynamic Web Content , 1999, USENIX Symposium on Internet Technologies and Systems.

[19]  Sanjoy Paul,et al.  Distributed caching with centralized control , 2001, Comput. Commun..

[20]  Allan Kuchinsky,et al.  Integrating user-perceived quality into Web server design , 2000, Comput. Networks.

[21]  Abdelsalam Heddaya,et al.  WebWave: globally load balanced fully distributed caching of hot published documents , 1997, Proceedings of 17th International Conference on Distributed Computing Systems.

[22]  Alexandros Labrinidis,et al.  WebView materialization , 2000, SIGMOD '00.

[23]  Pei Cao,et al.  WCIP: Web Cache Invalidation Protocol , 2001 .

[24]  Divyakant Agrawal,et al.  Enabling dynamic content caching for database-driven web sites , 2001, SIGMOD '01.