MODEL FOR THE EXTRACTION, TRANSFORMATION AND LOAD PROCESS IN DATA WAREHOUSES. AN APPLICATION WITH ENVIRONMENTAL DATA

Data warehouse management requires a procedure that ensures the accuracy, completeness, and centralization of data when there are several sources of information, which makes specialized applications for Extraction, Transformation, and Loading (ETL) of data necessary. These applications often suffer from parameterization conflicts, lack correction filters that adapt to the characteristics of the data, and can be costly to implement. This article presents a generic model that applies the ETL stages and allows the process to be monitored, keeping a historical record of the filtered errors and calculating indicators of processing quality. The model was validated on a case study with environmental data and showed satisfactory results. Future work will validate the model in other domains, including new data types and structures.
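
To make the idea of monitored ETL stages concrete, the following is a minimal sketch and not the model described in the article. It assumes a hypothetical raw format of (station, temperature text) pairs, a hypothetical range-based correction filter, and a simple quality indicator defined here as the share of records that pass every filter. All names (`EtlRun`, `range_filter`, `run_etl`, `to_record`) are illustrative assumptions.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, List, Optional

# A filter returns None when a record is acceptable,
# or a short error description otherwise (assumed convention).
Filter = Callable[[dict], Optional[str]]


@dataclass
class EtlRun:
    """Outcome of one ETL run: loaded records plus a historical error log."""
    loaded: List[dict] = field(default_factory=list)
    error_log: List[dict] = field(default_factory=list)

    def quality(self) -> float:
        # Hypothetical indicator: fraction of records that passed all filters.
        total = len(self.loaded) + len(self.error_log)
        return len(self.loaded) / total if total else 1.0


def range_filter(name: str, low: float, high: float) -> Filter:
    """Build a filter that rejects missing or out-of-range values."""
    def check(record: dict) -> Optional[str]:
        value = record.get(name)
        if value is None:
            return f"{name}: missing value"
        if not (low <= value <= high):
            return f"{name}: {value} outside [{low}, {high}]"
        return None
    return check


def run_etl(rows: Iterable, transform: Callable, filters: List[Filter]) -> EtlRun:
    run = EtlRun()
    for index, raw in enumerate(rows):      # extraction
        record = transform(raw)             # transformation
        errors = [msg for f in filters if (msg := f(record)) is not None]
        if errors:
            # Rejected records are kept in the log instead of being dropped silently.
            run.error_log.append({"row": index, "errors": errors})
        else:
            run.loaded.append(record)       # load
    return run


def to_record(raw) -> dict:
    # Assumed raw format: (station id, temperature as text).
    station, temp = raw
    try:
        temp_c = float(temp)
    except ValueError:
        temp_c = None
    return {"station": station, "temp_c": temp_c}


rows = [("A1", "21.5"), ("A1", "999"), ("B2", "n/a"), ("B2", "18.0")]
run = run_etl(rows, to_record, [range_filter("temp_c", -50.0, 60.0)])
print(run.quality())
for entry in run.error_log:
    print(entry)
```

In this sketch the filters are passed in as parameters, so the same pipeline could be reused with different data characteristics; whether the article's model is parameterized in this way is not stated in the abstract.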
