Data warehousing provides an interesting alternative to the traditional approach of heterogeneous database integration. Rather than using a query driven approach, data warehousing employs an update driven approach in which information from multiple, heterogeneous sources is integrated in advance and stored in a warehouse for direct querying and analysis. To build a data warehouse various tools are used like modeling tools to design a warehouse, database tools to physically build the database and loading the data and programming languages to extract the data from sources, apply business transformations and load it in consistent format. The conventional process of developing custom code or scripts for this is always a costly, error prone and time consuming. In this paper we propose a web based ETL framework with unique feature of preconfigured multi source connection which can be stored and used in future if needed to perform sequence of transformations. A viewable transformation report with time taken to perform the transformations and mapping source to target metadata is made available that provides scope to user to measure data quality and accuracy. Also new feature of entire loading process of data movement from source to target system is made visible to the user. The entire above mentioned things have been modeled using UML for web based approach.
[1]
David W. Embley,et al.
Conceptual-Model-Based Data Extraction from Multiple-Record Web Pages
,
1999,
Data Knowl. Eng..
[2]
Radha Krishna Author,et al.
An Object Oriented Modeling and Implementation of Web Based ETL Process
,
2010
.
[3]
Patrick Valduriez,et al.
Scaling Access to Heterogeneous Data Sources with DISCO
,
1998,
IEEE Trans. Knowl. Data Eng..
[4]
Panos Vassiliadis,et al.
Modeling ETL activities as graphs
,
2002,
DMDW.
[5]
Panos Vassiliadis,et al.
On the Logical Modeling of ETL Processes
,
2002,
CAiSE.
[6]
Ralph Kimball,et al.
The Data Warehouse Lifecycle Toolkit
,
2009
.
[7]
Panos Vassiliadis,et al.
A Framework for the Design of ETL Scenarios
,
2003,
CAiSE.