The Impact of Research Data Infrastructures: The Case of the AlphaFold Database

While the scientific output of research infrastructures is well documented, the broader effects of their secondary outputs, such as computational resources and datasets, remain poorly understood. To better understand the benefits of these public resources, this study explores the AlphaFold (AFDB) da...

Descripción completa

Detalles Bibliográficos
Autores: Romasanta, Angelo Kenneth, Wareham, Jonathan, Pujol Priego, Laia
Tipo de recurso: artículo
Fecha de publicación:2025
País:España
Institución:Universitat Ramon Llull (URL)
Repositorio:DAU Arxiu Digital de la Universitat Ramon Llull
OAI Identifier:oai:dau.url.edu:20.500.14342/5868
Acceso en línea:http://hdl.handle.net/20.500.14342/5868
https://doi.org/10.23726/cij.2025.1597
Access Level:acceso abierto
Palabra clave:Research infrastructure
AlphaFold
Bibliometrics
Scientific impact
Descripción
Sumario:While the scientific output of research infrastructures is well documented, the broader effects of their secondary outputs, such as computational resources and datasets, remain poorly understood. To better understand the benefits of these public resources, this study explores the AlphaFold (AFDB) database, a collaboration between DeepMind and the European Molecular Biology Laboratory (EMBL) that democratizes access to protein structure data. Employing a quantitative case study strategy using bibliometric analysis, this study compares publications indexed in the Web of Science Core Collection citing the original AF paper (Jumper et al., 2021) with those citing the AlphaFold database (Varadi et al., 2022), covering publications up to August 2024. We examine the impact of the EMBL AlphaFold database on research themes, collaboration patterns, and scientific impact. Our exploratory analysis identifies several impacts: studies leveraging the AF database investigate application-focused themes and require collaboration between fewer institutions. This research highlights the wide-ranging impacts of research infrastructures, emphasizing the need for comprehensive impact assessments to inform future research policy and funding decisions.