Semantic Scholar · Open Access · 2014 · 303 citations

Management of an academic HPC cluster: The UL experience

Sébastien Varrette P. Bouvry Hyacinthe Cartiaux F. Georgatos

Abstract

The intensive growth of processing power, data storage and transmission capabilities has revolutionized many aspects of science. These resources are essential to achieve high-quality results in many application areas. In this context, the University of Luxembourg (UL) has operated a High Performance Computing (HPC) facility and its related storage since 2007, with a very small team. Bridging computing and storage is a requirement of the UL service, for reasons that are both legal (certain data may not be moved) and performance-related. Today, people from the three faculties and the two interdisciplinary centers within the UL are users of this facility. More specifically, key research priorities such as Systems Bio-medicine (by LCSB) and Security, Reliability & Trust (by SnT) require access to such HPC facilities in order to function in an adequate environment. The management of HPC solutions is a complex enterprise and a constant area for discussion and improvement. The UL HPC facility and its derived services form a computing system that is complex to manage because of its scale: at the time of writing, it consists of 150 servers, 368 nodes (3880 computing cores) and 1996 TB of shared storage, all configured, monitored and operated by only three people using advanced IT automation solutions based on Puppet [1], FAI [2] and Capistrano [3]. This paper covers all aspects of the management of such a complex infrastructure, whether technical or administrative. Most design choices and implemented approaches have been motivated by several years of experience in addressing research needs, mainly in the HPC area but also in complementary services (typically Web-based). In this context, we have tried to address many technological issues in a flexible and convenient way.
This experience report may be of interest to other research centers and universities, in either the public or the private sector, that are looking for good, if not best, practices in cluster architecture and management.

Authors (4)

Sébastien Varrette
P. Bouvry
Hyacinthe Cartiaux
F. Georgatos

Citation Format

Varrette, S., Bouvry, P., Cartiaux, H., Georgatos, F. (2014). Management of an academic HPC cluster: The UL experience. https://doi.org/10.1109/HPCSim.2014.6903792

Journal Information

Publication Year: 2014
Language: en
Total Citations: 303
Source Database: Semantic Scholar
DOI: 10.1109/HPCSim.2014.6903792
Access: Open Access