arXiv Open Access 2024

Communication efficient application of sequences of planar rotations to a matrix

Thijs Steel Julien Langou
Lihat Sumber

Abstrak

We present an efficient algorithm for the application of sequences of planar rotations to a matrix. Applying such sequences efficiently is important in many numerical linear algebra algorithms for eigenvalues. Our algorithm is novel in three main ways. First, we introduce a new kernel that is optimized for register reuse in a novel way. Second, we introduce a blocking and packing scheme that improves the cache efficiency of the algorithm. Finally, we thoroughly analyze the memory operations of the algorithm which leads to important theoretical insights and makes it easier to select good parameters. Numerical experiments show that our algorithm outperforms the state-of-the-art and achieves a flop rate close to the theoretical peak on modern hardware.

Topik & Kata Kunci

Penulis (2)

T

Thijs Steel

J

Julien Langou

Format Sitasi

Steel, T., Langou, J. (2024). Communication efficient application of sequences of planar rotations to a matrix. https://arxiv.org/abs/2412.01852

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓