Home
Hasso-Plattner-Institut
Hasso-Plattner-Institut
Prof. Dr. h.c. mult. Hasso Plattner
schließen
schließen

Our team is giving a series of lectures and seminars with a focus on enterprise systems design and in-memory data management. Strong links to the industry ensure a close connection between theory and its implementation in the real world.

schließen
schließen
schließen

If you are having questions regarding one of our publications, please contact the authors.

schließen
  1. HOME
  2. > Projects
  3. > Project Archive
  4. > HANA Load Simulator

HANA Load Simulator

Motivation

The HANA Load Simulator creates a realistic enterprise workload of thousands of concurrent users and executes that workload on different database configurations simultaneously. A dashboard monitors several performance indicators of each database, incl. data footprint, transaction latencies, throughput, and overall CPU utilization. The dashboard can also be used to configure several workload parameters like OLTP and OLAP query frequencies or the ratio of current and historical queries. This provides a simple and interactive tool to assess key performance characteristics of different database setups (e.g., single- vs. multi-node) side-by-side and in real-time.

Experiment

We compare a) a single database node with b) a multi-node setup consisting of a master node (current/actual data only), one replica node of the master for running OLAP transactions, and a cold node for historical data. Both setups have an equal total amount of cores and main memory. The usage of the replica node can be switched on and off. The workload consists of three types of transactions (ratio configurable): invoice postings (sFIN-adapted), read-only transactions incl. transactional queries (incl. BKPF-BSEG-joins), and OLAP transactions incl. read-heavy analytical queries.

With the partitioning into current and historical and replication of the current data, we see the following improvements (90% current-only OLAP transactions, 100% current-only OLTP transactions, one of 100 queries being analytical):

Improved performance

  • Transactional processing is improved even without the use of a replica due to the smaller data set. Activating the replica, the multi-node setup is faster by a factor of up to 4x for mixed workloads.

  • The higher the skew is towards an current-only workload, the more the new architecture outperforms the traditional setup.

  • When adding analytical users to the system, a replica of the current master node lowers the latency of OLTP transactions due to better load distribution.

Reduced costs

  • Historical data can be purged and better compressed to decrease the memory footprint and require less main memory than the traditional setup with all data being memory-resident.

  • Overall system costs potentially decrease as smaller servers for the historical nodes can be deployed, hence avoiding disproportional prices for large server systems.

Thousands of SAP customers in the fully occupied Orange County Convention Center in Orlando, FL and watchers of the live stream saw the impact of current / historical data optimization for SAP HANA in terms of database performance and system load.

Martin Boissier and Carsten Meyer had the chance to present the master project -HANA Load Simulator- (Daniel Kurzynski, Rui Ruhrlaender, Christopher Schmidt, Jannik Marten, Jan-Peer Rudolph, Alexander Franke, Jasper Schulz, and Pedro Flemming) live on stage during Prof. Hasso Plattner's keynote speech.

Desirable: Seeing and comparing the impact of fundamental system changes, helps to understand the meaning of those changes and the true value behind them. Changing simulation parameter and getting direct feedback allows to explore system behavior and to consider the odds of various options.

Feasible: SAP HANA features read-only replication and current/historical data partitioning. Workload and data set can be generated and adapted close to a productive load and deployed to different hardware setups.

Viable: A technical description is less convincing than a running system. If people see positive effects of a system setup they are more willing to test it. If people can evaluate the impact of changes in their own environment, they are more willing to buy it.

Vision

The HANA Load Simulator shall visualize the impact of current/historical partitioning and read-only replication on a customer's productive system. Also the possibility of increased reliability (via high availability) using hot-standby replicas will be shown in future versions. With access to a customer’s production data and a corresponding workload trace, the simulator can mimic the real production system in order to show the feasibility and benefits of the mentioned concepts on different hardware setups.

News

22.09.2023 | Trends and Concepts in the Softwareindustry Seminar offered in WiSe 2023/2024

Trends and Concepts in the Softwareindustry Seminar offered in WiSe 2023/2024 > Zum Artikel

22.05.2023 | Christopher Hagedorn Successfully Defended His PhD Thesis

Christopher Hagedorn Successfully Defended His PhD Thesis > Zum Artikel

03.03.2023 | Last Trends and Concepts course of Prof. Hasso Plattner

After more than 20 years of teaching, our founder and benefactor Prof. Hasso Plattner visited the HPI this week for his … > Zum Artikel

01.03.2023 | Jan Kossmann Successfully Defended His PhD Thesis

Last week, Jan Kossmann another PhD student of our EPIC group successfully defended his thesis on the topic of … > Zum Artikel

26.02.2023 | Paper on Data Tiering in Hyrise Published in BTW Proceedings

Our latest paper on data tiering in Hyrise "Workload-Driven Data Placement for Tierless In-Memory Database Systems" by … > Zum Artikel

24.02.2023 | Paper on EPIC Research Group Published in SIGMOD Record

Our report "Enterprise Platform and Integration Concepts Research at HPI" has been published in the December issue of … > Zum Artikel

30.11.2022 | Paper on Database Optimizations for Spatio-Temporal Data published in PVLDB

Our paper "Robust and Budget-Constrained Encoding Configurations for In-Memory Database Systems" has been published in … > Zum Artikel

04.10.2022 | Günter Hesse Successfully Defended His PhD Thesis

Last week, Günter Hesse another PhD student of our EPIC group successfully defended his thesis on the topic of "A … > Zum Artikel

08.07.2022 | Successful PhD Defense by Markus Dreseler

Markus Dreseler has successfully defended his PhD thesis on Automatic Tiering for In-Memory Database Systems. > Zum Artikel

Literature

"A Course in In-Memory Data Management" by Prof. Dr. h.c. Hasso Plattner. This book is the culmination of six years work of in-memory research. As such, it provides the technical foundation for combined transactional and analytical workloads inside one single database as well as examples of new applications that are now possible given the availability of the new technology. The book is available at Springer.

Contact

Dr. Michael Perscheid

Chair Representative

Tel.: +49 (331) 5509-566

E-Mail:


Office:

Room: V-2.12

Tel.: +49 (331) 5509-560

Fax: +49 (331) 5509-579

E-Mail:

Follow us on Twitter

Contact Details

AltStyle によって変換されたページ (->オリジナル) /