Eradicate Inconsistencies and Gaffe from Data Sets Using Data Sanitization

The problem of data cleaning, which consists of removing inconsistencies and errors from original data sets, is well known in the area of decision support systems and data warehouses. This holds regardless of the application - relational database joining, web-related, or scientific. In all cases, existing ETL (Extraction Transformation Loading) and data cleaning tools for writing data cleaning programs are insufficient. The main reason for using the computers is to organize the data in an efficient and effective manner .In early days for valuable data can be organization sake we have to use the tools like Queries. In these some problems are arises. That is why these languages are called as Data Management systems. There were so many limitations in the management system like data inconsistency, inconvenience in retrieval of data etc. Because of all these limitations we have to face the problems like memory inefficiency and heavy in consumption of time and also lack of quality. To overcome all these problems we have designed software (mean ETL tool) which organizes the data in a very efficient manner with respect to redundant data. Our project deals with the data organization by giving all data oriented features and by solving the problems like data inconsistency and data redundancy.