Assessing the Suitability of In-Memory Databases in an Enterprise Context

It is still not fully clear if the increased query execution speed offered by in-memory databases unfolds its potential benefits over traditional disk-based databases in an enterprise context. This paper aims at comparing the performance of in-memory versus disk-based databases in such context to assess the added value of in-memory database technology for businesses. To achieve this goal, the authors conducted a literature review to find performance comparisons and benchmarks of the two database types in business contexts. Based on the results of this review an own performance comparison methodology assembling workloads mostly found in enterprise environments is compiled. The devised methodology is then applied by comparing two implementations of each database technology. The results show, that not all businesses profit equally from deploying in-memory databases but rather only those, which are dealing with large amounts of data and many concurrent users. Additionally, the tests revealed that disk-based and in-memory databases outperform each other at different query types. Limitations were that our research took place in a virtualized setup, that the SQL queries generating the workload were not subjected to any performance optimizations and that no database tuning or data staging took place. The research's genuine value lies in the compilation of a performance comparison for the two database technologies in an enterprise context, which can be used to further examine the suitability of in-memory databases for enterprise data storage requirements.

[1]  Michael J. Carey,et al.  A Study of Index Structures for a Main Memory Database Management System , 1986, HPTS.

[2]  Alexander Thomasian,et al.  Analysis of database performance with dynamic locking , 1990, JACM.

[3]  Michael J. Carey,et al.  A performance evaluation of pointer-based joins , 1990, SIGMOD '90.

[4]  Paul W. P. J. Grefen,et al.  PRISMA/DB: A Parallel Main Memory Relational DBMS , 1992, IEEE Trans. Knowl. Data Eng..

[5]  Hector Garcia-Molina,et al.  Main Memory Database Systems: An Overview , 1992, IEEE Trans. Knowl. Data Eng..

[6]  Vijay Kumar,et al.  Performance Measurement of Main Memory Database Recovery Algorithms Based on Update-in-Place and Shadow Approaches , 1992, IEEE Trans. Knowl. Data Eng..

[7]  Suzanne W. Dietrich,et al.  A Practitioner's Introduction to Database Performance Benchmarks and Measurements , 1992, Comput. J..

[8]  S. Sudarshan,et al.  Dalí: A High Performance Main Memory Storage Manager , 1994, VLDB.

[9]  Jonathan J. Hull,et al.  A Database for Handwritten Text Recognition Research , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Alfons Kemper,et al.  Database performance in the real world: TPC-D and SAP R/3 , 1997, SIGMOD '97.

[11]  S. Sudarshan,et al.  DataBlitz: A High Performance Main-Memory Storage Manager , 1994, VLDB.

[12]  Jeffrey Scott Vitter,et al.  Scalable mining for classification rules in relational databases , 1998, Proceedings. IDEAS'98. International Database Engineering and Applications Symposium (Cat. No.98EX156).

[13]  Peter M. Chen,et al.  Integrating reliable memory in databases , 1998, The VLDB Journal.

[14]  Gary Valentin,et al.  DB2 Universal Database Performance Tuning , 1999, IEEE Data Eng. Bull..

[15]  Josep Torrellas,et al.  Cache optimization for memory-resident decision support commercial workloads , 1999, Proceedings 1999 IEEE International Conference on Computer Design: VLSI in Computers and Processors (Cat. No.99CB37040).

[16]  Hongjun Lu,et al.  T-tree or B-tree: main memory database index structure revisited , 2000, Proceedings 11th Australasian Database Conference. ADC 2000 (Cat. No.PR00528).

[17]  Sven Helmer,et al.  The implementation and performance of compressed databases , 2000, SGMD.

[18]  Tom Jones,et al.  An enterprise directory solution with DB2 , 2000, IBM Syst. J..

[19]  Ramesh C. Agarwal,et al.  Block oriented processing of relational database operations in modern computer architectures , 2001, Proceedings 17th International Conference on Data Engineering.

[20]  Christian Böhm,et al.  Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases , 2001, CSUR.

[21]  Sam Lightstone,et al.  Toward autonomic computing with DB2 universal database , 2002, SGMD.

[22]  Hamid Pirahesh,et al.  WinMagic: subquery elimination using window aggregation , 2003, SIGMOD '03.

[23]  M. Tamer Özsu,et al.  XBench benchmark and performance testing of XML DBMSs , 2004, Proceedings. 20th International Conference on Data Engineering.

[24]  Kenneth A. Ross,et al.  Buffering databse operations for enhanced instruction cache performance , 2004, SIGMOD '04.

[25]  David B. Skillicorn,et al.  Developing a characterization of business intelligence workloads for sizing new database systems , 2004, DOLAP '04.

[26]  Kai-Uwe Sattler,et al.  Depth-first frequent itemset mining in relational databases , 2005, SAC '05.

[27]  Yuanyuan Zhou,et al.  Hibernator: helping disk arrays sleep through the winter , 2005, SOSP '05.

[28]  Darcy G. Benoit,et al.  Automatic Diagnosis of Performance Problems in Database Management Systems , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[29]  Daniel J. Abadi,et al.  Performance tradeoffs in read-optimized databases , 2006, VLDB.

[30]  Dinesh Manocha,et al.  GPUTeraSort: high performance graphics co-processor sorting for large database management , 2006, SIGMOD Conference.

[31]  Raghunath Othayoth Nambiar,et al.  The making of TPC-DS , 2006, VLDB.

[32]  Michael Stonebraker,et al.  The End of an Architectural Era (It's Time for a Complete Rewrite) , 2007, VLDB.

[33]  Haixun Wang,et al.  Semantic Data Management: Towards Querying Data with their Meaning , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[34]  Matthias Nicola,et al.  An XML transaction processing benchmark , 2007, SIGMOD '07.

[35]  Raghunath Othayoth Nambiar,et al.  Why You Should Run TPC-DS: A Workload Analysis , 2007, VLDB.

[36]  Xueqi Cheng,et al.  Text Feature Ranking Based on Rough-set Theory , 2007, IEEE/WIC/ACM International Conference on Web Intelligence (WI'07).

[37]  Michael Stonebraker,et al.  H-store: a high-performance, distributed main memory transaction processing system , 2008, Proc. VLDB Endow..

[38]  Marwan Mattar,et al.  Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments , 2008 .

[39]  Wei Cao,et al.  Simulation of main memory database parallel recovery , 2009, SpringSim '09.

[40]  Martin L. Kersten,et al.  Database Architecture Evolution: Mammals Flourished long before Dinosaurs became Extinct , 2009, Proc. VLDB Endow..

[41]  Carsten Binnig,et al.  Dictionary-based order-preserving string compression for main memory column stores , 2009, SIGMOD Conference.

[42]  Christoph Koch,et al.  DBToaster: A SQL Compiler for High-Performance Delta Processing in Main-Memory Databases , 2009, Proc. VLDB Endow..

[43]  Alexander Zeier,et al.  Optimizing Write Performance for Read Optimized Databases , 2010, DASFAA.

[44]  Alexander Zeier,et al.  Data structures for mixed workloads in in-memory databases , 2010, 5th International Conference on Computer Sciences and Convergence Information Technology.

[45]  Alexander Zeier,et al.  The effects of virtualization on main memory systems , 2010, DaMoN '10.

[46]  Alexander Zeier,et al.  Hauptspeicherdatenbanken für Unternehmensanwendungen , 2010, Datenbank-Spektrum.

[47]  Alexander Zeier,et al.  Application-aware aggregations in a columnar in-memory database , 2011, The 3rd International Conference on Data Mining and Intelligent Information Technology Applications.

[48]  Gottfried Vossen,et al.  In-Memory-Datenmanagement in betrieblichen Anwendungssystemen , 2011, Wirtschaftsinf..

[49]  Jignesh M. Patel,et al.  High-Performance Concurrency Control Mechanisms for Main-Memory Databases , 2011, Proc. VLDB Endow..

[50]  Maurizio Marchese,et al.  Text Clustering with Seeds Affinity Propagation , 2011, IEEE Transactions on Knowledge and Data Engineering.

[51]  Calisto Zuzarte,et al.  DB2 performance measurement and tuning hands on exercises , 2011, CASCON.

[52]  Kamesh Munagala,et al.  Interaction-aware scheduling of report-generation workloads , 2011, The VLDB Journal.

[53]  Alexander Zeier,et al.  In-memory data management: an inflection point for enterprise applications , 2011 .

[54]  Martin Bichler,et al.  Efficient Deployment of Main-Memory DBMS in Virtualized Data Centers , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[55]  Patrick Martin,et al.  Discovering Indicators for Congestion in DBMSs , 2012, 2012 IEEE 28th International Conference on Data Engineering Workshops.

[56]  Alfons Kemper,et al.  Massively Parallel Sort-Merge Joins in Main Memory Multi-Core Database Systems , 2012, Proc. VLDB Endow..

[57]  Wolfgang Lehner,et al.  SAP HANA database: data management for modern business applications , 2012, SGMD.

[58]  Zhen He,et al.  MCJoin: a memory-constrained join for column-store main-memory databases , 2012, SIGMOD Conference.

[59]  Ippokratis Pandis,et al.  OLTP on Hardware Islands , 2012, Proc. VLDB Endow..

[60]  Pavol Zavarsky,et al.  Performance Analysis of Oracle Database in Virtual Environments , 2012, 2012 26th International Conference on Advanced Information Networking and Applications Workshops.

[61]  P. Bagade,et al.  Designing performance monitoring tool for NoSQL Cassandra distributed database , 2012, International Conference on Education and e-Learning Innovations.

[62]  Subrahmanyam Murala,et al.  Local Tetra Patterns: A New Feature Descriptor for Content-Based Image Retrieval , 2012, IEEE Transactions on Image Processing.

[63]  M. Courtney Puzzling out big data , 2012 .

[64]  Tilmann Rabl,et al.  Efficient update data generation for DBMS benchmarks , 2012, ICPE '12.

[65]  Philipp Rösch,et al.  A Storage Advisor for Hybrid-Store Databases , 2012, Proc. VLDB Endow..

[66]  Sap Ag Harnessing the Power of Big Data in Real Time through In-Memory Technology and Analytics , 2012 .

[67]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[68]  Peter Kilpatrick,et al.  WIQ: Work-Intensive Query Scheduling for In-Memory Database Systems , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[69]  Jörg Becker,et al.  Auswirkungen von In-Memory-Datenmanagement auf Geschäftsprozesse im Business Intelligence , 2013, Wirtschaftsinformatik.

[70]  Martin Courtney Upwardly mobile [Communications Enterprise Mobile] , 2013 .