DOAJ Open Access 2021

Automatic configuration of the Cassandra database using irace

Moisés Silva-Muñoz Alberto Franzin Hugues Bersini

Abstrak

Database systems play a central role in modern data-centered applications. Their performance is thus a key factor in the efficiency of data processing pipelines. Modern database systems expose several parameters that users and database administrators can configure to tailor the database settings to the specific application considered. While this task has traditionally been performed manually, in the last years several methods have been proposed to automatically find the best parameter configuration for a database. Many of these methods, however, use statistical models that require high amounts of data and fail to represent all the factors that impact the performance of a database, or implement complex algorithmic solutions. In this work we study the potential of a simple model-free general-purpose configuration tool to automatically find the best parameter configuration of a database. We use the irace configurator to automatically find the best parameter configuration for the Cassandra NoSQL database using the YCBS benchmark under different scenarios. We establish a reliable experimental setup and obtain speedups of up to 30% over the default configuration in terms of throughput, and we provide an analysis of the configurations obtained.

Penulis (3)

M

Moisés Silva-Muñoz

A

Alberto Franzin

H

Hugues Bersini

Format Sitasi

Silva-Muñoz, M., Franzin, A., Bersini, H. (2021). Automatic configuration of the Cassandra database using irace. https://doi.org/10.7717/peerj-cs.634

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.7717/peerj-cs.634
Informasi Jurnal
Tahun Terbit
2021
Sumber Database
DOAJ
DOI
10.7717/peerj-cs.634
Akses
Open Access ✓