Interactive multidimensional modeling of linked data for exploratory OLAP

Exploratory OLAP aims at coupling the precision and detail of corporate data with the information wealth of LOD. While some techniques to create, publish, and query RDF cubes are already available, little has been said about how to contextualize these cubes with situational data in an on-demand fash...

Descripción completa

Detalles Bibliográficos
Autores: Gallinucci, Enrico, Golfarelli, Matteo, Rizzi Bach, Stefano, Abelló Gamazo, Alberto|||0000-0002-3223-2186, Romero Moral, Óscar|||0000-0001-6350-8328
Tipo de recurso: artículo
Fecha de publicación:2018
País:España
Institución:Universitat Politècnica de Catalunya (UPC)
Repositorio:UPCommons. Portal del coneixement obert de la UPC
Idioma:inglés
OAI Identifier:oai:upcommons.upc.edu:2117/119270
Acceso en línea:https://hdl.handle.net/2117/119270
https://dx.doi.org/10.1016/j.is.2018.06.004
Access Level:acceso abierto
Palabra clave:Data warehousing
Multidimensional modeling
Linked data
Exploratory OLAP
Gestor de dades
Àrees temàtiques de la UPC::Informàtica::Sistemes d'informació
Descripción
Sumario:Exploratory OLAP aims at coupling the precision and detail of corporate data with the information wealth of LOD. While some techniques to create, publish, and query RDF cubes are already available, little has been said about how to contextualize these cubes with situational data in an on-demand fashion. In this paper we describe an approach, called iMOLD, that enables non-technical users to enrich an RDF cube with multidimensional knowledge by discovering aggregation hierarchies in LOD. This is done through a user-guided process that recognizes in the LOD the recurring modeling patterns that express roll-up relationships between RDF concepts, then translates these patterns into aggregation hierarchies to enrich the RDF cube. Two families of aggregation patterns are identified, based on associations and generalization respectively, and the algorithms for recognizing them are described. To evaluate iMOLD in terms of efficiency and effectiveness we compare it with a related approach in the literature, we propose a case study based on DBpedia, and we discuss the results of a test made with real users.