Multi-tenant virtual GPUs for optimising performance of a financial risk application

[EN] Graphics Processing Units (GPUs) are becoming popular accelerators in modern High-Performance Computing (HPC) clusters. Installing GPUs on each node of the cluster is not efficient resulting in high costs and power consumption as well as underutilisation of the accelerator. The research reporte...

Descripción completa

Detalles Bibliográficos
Autores: Prades, Javier|||0000-0003-3349-2200, Silla, Federico|||0000-0002-6435-1200, Varghese, Blesson, Reaño González, Carlos
Tipo de recurso: artículo
Fecha de publicación:2017
País:España
Institución:Universitat Politècnica de València (UPV)
Repositorio:RiuNet. Repositorio Institucional de la Universitat Politécnica de Valéncia
Idioma:inglés
OAI Identifier:oai:riunet.upv.es:10251/103729
Acceso en línea:https://riunet.upv.es/handle/10251/103729
Access Level:acceso abierto
Palabra clave:GPU virtualisation
Acceleration-as-a-Service
rCUDA
Multi-tenancy
Energy efficiency
ARQUITECTURA Y TECNOLOGIA DE COMPUTADORES
Descripción
Sumario:[EN] Graphics Processing Units (GPUs) are becoming popular accelerators in modern High-Performance Computing (HPC) clusters. Installing GPUs on each node of the cluster is not efficient resulting in high costs and power consumption as well as underutilisation of the accelerator. The research reported in this paper is motivated towards the use of few physical GPUs by providing cluster nodes access to remote GPUs on-demand for a financial risk application. We hypothesise that sharing GPUs between several nodes, referred to as multi-tenancy, reduces the execution time and energy consumed by an application. Two data transfer modes between the CPU and the GPUs, namely concurrent and sequential, are explored. The key result from the experiments is that multi-tenancy with few physical GPUs using sequential data transfers lowers the execution time and the energy consumed, thereby improving the overall performance of the application.