Multi-tenant virtual GPUs for optimising performance of a financial risk application

Bibliographic details
Authors: Prades, Javier (ORCID 0000-0003-3349-2200); Silla, Federico (ORCID 0000-0002-6435-1200); Varghese, Blesson; Reaño González, Carlos
Format: article
Publication date: 2017
Country: Spain
Resources: Universitat Politècnica de València (UPV)
Repository: RiuNet. Repositorio Institucional de la Universitat Politècnica de València
Language: English
OAI Identifier: oai:riunet.upv.es:10251/103729
Online access: https://riunet.upv.es/handle/10251/103729
Access Level: open access
Keywords: GPU virtualisation
Acceleration-as-a-Service
rCUDA
Multi-tenancy
Energy efficiency
COMPUTER ARCHITECTURE AND TECHNOLOGY
Description
Abstract: Graphics Processing Units (GPUs) are becoming popular accelerators in modern High-Performance Computing (HPC) clusters. Installing GPUs on each node of the cluster is not efficient, resulting in high costs and power consumption as well as underutilisation of the accelerator. The research reported in this paper is motivated by the goal of using few physical GPUs, providing cluster nodes with on-demand access to remote GPUs for a financial risk application. We hypothesise that sharing GPUs between several nodes, referred to as multi-tenancy, reduces the execution time and energy consumed by an application. Two data transfer modes between the CPU and the GPUs, namely concurrent and sequential, are explored. The key result from the experiments is that multi-tenancy with few physical GPUs using sequential data transfers lowers the execution time and the energy consumed, thereby improving the overall performance of the application.