arXiv Open Access 2024

Parallel Gaussian process with kernel approximation in CUDA

Davide Carminati
Lihat Sumber

Abstrak

This paper introduces a parallel implementation in CUDA/C++ of the Gaussian process with a decomposed kernel. This recent formulation, introduced by Joukov and Kulić (2022), is characterized by an approximated -- but much smaller -- matrix to be inverted compared to plain Gaussian process. However, it exhibits a limitation when dealing with higher-dimensional samples which degrades execution times. The solution presented in this paper relies on parallelizing the computation of the predictive posterior statistics on a GPU using CUDA and its libraries. The CPU code and GPU code are then benchmarked on different CPU-GPU configurations to show the benefits of the parallel implementation on GPU over the CPU.

Topik & Kata Kunci

Penulis (1)

D

Davide Carminati

Format Sitasi

Carminati, D. (2024). Parallel Gaussian process with kernel approximation in CUDA. https://arxiv.org/abs/2403.12797

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓