Automatic generation of ETL processes from conceptual models

Data warehouses (DW) integrate different data sources in order to give a multidimensional view of them to the decision-maker. To this aim, the ETL (Extraction, Transformation and Load) processes are responsible for extracting data from heterogeneous operational data sources, their transformation (co...

Descripción completa

Detalles Bibliográficos
Autores: Muñoz, Lilia, Trujillo, Juan, Mazón, Jose Norberto
Tipo de recurso: artículo
Estado:Versión publicada
Fecha de publicación:2018
País:Panamá
Institución:Universidad Tecnológica de Panamá
Repositorio:Repositorio Institucional de documento digitales de acceso abierto de la UTP
Idioma:inglés
OAI Identifier:oai:ridda2.utp.ac.pa:123456789/4915
Acceso en línea:https://dl.acm.org/citation.cfm?id=1651298
http://ridda2.utp.ac.pa/handle/123456789/4915
Access Level:acceso embargado
Palabra clave:Automatic
ETL processes
conceptual models
Descripción
Sumario:Data warehouses (DW) integrate different data sources in order to give a multidimensional view of them to the decision-maker. To this aim, the ETL (Extraction, Transformation and Load) processes are responsible for extracting data from heterogeneous operational data sources, their transformation (conversion, cleaning, standardization, etc.), and its load in the DW. In recent years, several conceptual modeling approaches have been proposed for designing ETL processes. Although these approaches are very useful for documenting ETL processes and supporting the designer tasks, these proposals fail to give mechanisms to carry out an automatic code generation stage. Such a stage should be required to both avoid fails and save development time in the implementation of complex ETL process. Therefore, in this paper we define an approach for the automatic code generation of ETL processes. To this aim, we align the modeling of ETL processes in DW with MDA (Model Driven Architecture) by formally defining a set of QVT (Query, View, Transformation) transformations.