Increasing the effectiveness of directory caches by avoiding the tracking of noncoherent memory blocks

A key aspect in the design of efficient multiprocessor systems is the cache coherence protocol. Although directory-based protocols constitute the most scalable approach, the limited size of the directory caches together with the growing size of systems may cause frequent evictions and, consequently,...

Descripción completa

Detalles Bibliográficos
Autores: Cuesta Sáez, Blas Antonio, Ros Bardisa, Alberto, Robles Martínez, Antonio, Duato Marín, José Francisco, Gómez Requena, María Engracia|||0000-0003-1466-4118
Tipo de recurso: artículo
Fecha de publicación:2013
País:España
Institución:Universitat Politècnica de València (UPV)
Repositorio:RiuNet. Repositorio Institucional de la Universitat Politécnica de Valéncia
Idioma:inglés
OAI Identifier:oai:riunet.upv.es:10251/38109
Acceso en línea:https://riunet.upv.es/handle/10251/38109
Access Level:acceso abierto
Palabra clave:Multiprocessor
Cache coherence
Directory cache
Operating system
Coherence deactivation
Noncoherent blocks
ARQUITECTURA Y TECNOLOGIA DE COMPUTADORES
Descripción
Sumario:A key aspect in the design of efficient multiprocessor systems is the cache coherence protocol. Although directory-based protocols constitute the most scalable approach, the limited size of the directory caches together with the growing size of systems may cause frequent evictions and, consequently, the invalidation of cached blocks, which jeopardizes system performance. Directory caches keep track of every memory block stored in processor caches in order to provide coherent access to the shared memory. However, a significant fraction of the cached memory blocks do not require coherence maintenance (even in parallel applications) because they are either accessed by just one processor or they are never modified. In this paper, we propose to deactivate the coherence protocol for those blocks that do not require coherence. This deactivation means directory caches do not have to keep track of noncoherent blocks, which reduces directory cache occupancy and increases its effectiveness. Since the detection of noncoherent blocks is carried out by the operating system, our proposal only requires minor hardware modifications. Simulation results show that, thanks to our proposal, directory caches can avoid the tracking of about 66 percent (on average) of the blocks accessed by a wide range of applications, thereby improving the efficiency of directory caches. This contributes either to shortening the runtime of parallel applications by 15 percent (on average) while keeping directory cache size or to maintaining performance while using directory caches 16 times smaller.