arXiv Open Access 2025

Performance Optimization of 3D Stencil Computation on ARM Scalable Vector Extension

Hongguang Chen
Lihat Sumber

Abstrak

Stencil computation is essential in high-performance computing, especially for large-scale tasks like liquid simulation and weather forecasting. Optimizing its performance can reduce both energy consumption and computation time, which is critical in disaster prediction. This paper explores optimization techniques for 7-point 3D stencil computation on ARM's Scalable Vector Extension (SVE), using the Roofline model and tools like Gem5 and cacti. We evaluate software optimizations such as vectorization and tiling, as well as hardware adjustments in ARM SVE vector lengths and cache configurations. The study also examines performance, power consumption, and chip area trade-offs to identify optimal configurations for ARM-based systems.

Topik & Kata Kunci

Penulis (1)

H

Hongguang Chen

Format Sitasi

Chen, H. (2025). Performance Optimization of 3D Stencil Computation on ARM Scalable Vector Extension. https://arxiv.org/abs/2503.01348

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓