Inproceedings,

Buzzy: Towards Realistic DBMS Benchmarking via Tailored, Representative, Synthetic Workloads

, , , , , and .
Companion of the ACM/SPEC International Conference on Performance Engineering, ICPE '21, page 175–178. New York, NY, USA, Association for Computing Machinery, (2021)
DOI: 10.1145/3447545.3451175

Abstract

Distributed Database Management Systems (DBMS) are a crucial component of modern IT applications. Understanding their performance and non-functional properties is of paramount importance. Yet, benchmarking distributed DBMS has proven to be difficult in practice. Either, a realistic workload is often mapped to a synthetic workload without knowing if this mapping is correct or available workload traces are replayed. While the latter approach provides more realistic results, real-world traces are hard to obtain and their scope is limited in time scale and variance.We propose collecting real-world traces and then applying data generation techniques to synthesize similar realistic traces based on it. Based in this approach, we can obtain workloads for benchmarking, exhibit variability with respect to different aspects of interest while still being similar to the original traces. Varying generation parameters, we are able to support benchmarking what-if scenarios with hypothetical workloads and introduced anomalies.

Tags

Users

  • @se-group
  • @simon.eismann
  • @joh.grohmann
  • @samuel.kounev

Comments and Reviews