Open Access Open Access  Restricted Access Subscription or Fee Access

A Comprehensive Survey on Optimizing Storage Models, Data Layouts, and System Catalogs

Md. Shifatul Ahsan Apurba

Abstract


This survey compares four papers that propose solutions for optimizing the storage model, data layout, and system catalogs in data management systems for hybrid workloads, which consist of both analytical and transactional queries. These solutions include a main memory hybrid storage engine that separates the storage of analytical and transactional data and uses a sophisticated query optimizer, a hands-free adaptive store that adjusts the storage layout based on access patterns, a hybrid storage engine that combines the strengths of row- and column- store systems and a column layout optimization method that considers both analytical and transactional access patterns and uses ghost values to support updates. These papers highlight the importance of designing storage systems specifically for hybrid workloads and the need for further research in this area.


Full Text:

PDF

References


Grund M, Krüger J, Plattner H, Zeier A, Cudre-Mauroux P, Madden S. Hyrise: a main memory hybrid storage engine. Proc VLDB Endow. 2010 Nov 1; 4(2): 105–16.

Alagiannis I, Idreos S, Ailamaki A. H2O: a hands-free adaptive store. In Proceedings of the 2014 ACM SIGMOD international conference on Management of data. 2014 Jun 18; 1103–1114.

Antcheva I, Ballintijn M, Bellenot B, Biskup M, Brun R, Buncic N, Canal P, Casadei D, Couet O, Fine V, Franco L. ROOT—A C++ framework for petabyte data storage, statistical analysis and visualization. Comput Phys Commun. 2009 Dec 1; 180(12): 2499–512.

Arulraj J, Pavlo A, Menon P. Bridging the archipelago between row-stores and column-stores for hybrid workloads. In Proceedings of the 2016 International Conference on Management of Data. 2016 Jun 14; 583–598.

Huang D, Liu Q, Cui Q, Fang Z, Ma X, Xu F, Shen L, Tang L, Zhou Y, Huang M, Wei W. TiDB: a Raft-based HTAP database. Proc VLDB Endow. 2020 Aug 1; 13(12): 3072–84.

Backasch R, Hempel G, Blochwitz C, Werner S, Groppe S, Pionteck T. An architectural template for composing application specific datapaths at runtime. In 2015 IEEE International Conference on ReConFigurable Computing and FPGAs (ReConFig). 2015 Dec 7; 1–6.

Athanassoulis M, Bøgh KS, Idreos S. Optimal column layout for hybrid workloads. Proc VLDB Endow. 2019 Sep 1; 12(13): 2393–407.

Wang JC, Ding D, Wang H, Christensen C, Wang Z, Chen H, Li J. Polyjuice: High-Performance Transactions via Learned Concurrency Control. In 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI). 2021 Jul 14; 198–216.

Pinnecke M, Broneske D, Durand GC, Saake G. Are databases fit for hybrid workloads on GPUs? A storage engine's perspective. In 2017 IEEE 33rd International Conference on Data Engineering (ICDE). 2017 Apr 19; 1599–1606.


Refbacks

  • There are currently no refbacks.