Agent based preprocessing

The current data mining tools is used to build knowledge based on a huge historical data. At present, businesses are facing with fast growing data that are very valuable in contributing knowledge. Knowledge should be updated regularly in order to ensure its quality and precision thus improve the decision making process. Data mining has shown great potential in extracting valuable knowledge from large databases. However, current data mining algorithms and tools are costly and several are too complex in their operations when dealing with large databases. In recent years, agents have become a popular paradigm in computing, because its autonomous, flexible and provides intelligence. Embedding agents in the current data mining processes and tools are believed to be able to solve the obstacle. One of the most important process in data mining is data preprocessing. It is reported that 60% of the data mining project is on preprocessing. Data preprocessing involves integration, selection, cleaning and transformation of data set that will be used for mining. This paper focuses on an agent-based preprocessing framework. The aims is to provides an auto preprocessing a set of new data, which suite to data mining novice user. The proposed agent based preprocessing framework consists of seven agents: user interface agents, coordinator agent, identify agent, CleanMiss agent, CleanNoisy agent, transformation agent and discretization agent. User interface agent is designed in such a way to provide interface suite to novice users. Coordinator agent is responsible for coordinating and cooperating with all other agents to achieve the goals. Identify agent responsible to provide an adaptive user data cleaning profiling. CleanMiss agent, CleanNoisy agent, transformation agent and discretization agent provide various types of techniques autonomously, which ended with proposing the best cleaning techniques from various types of techniques to keep in the preprocessing profile. This paper is start by introducing the data mining process problem includes data preprocessing which agent can solve data mining problems. By applying agent in data preprocessing, a tool that intelligence yet flexible can be produced.

[1]  Bo Yang,et al.  Research and design of distributed training algorithm for neural networks , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[2]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[3]  Michael Luck,et al.  Understanding Agent Systems , 2001, Springer Series on Agent Technology.

[4]  Florin Leon,et al.  Efficient Distributed Data Mining using Intelligent Agents , .

[5]  Qian Wei,et al.  An Agent Based Fault Diagnosis Support System and Its Application , 2006, 2006 IEEE International Conference on Service Operations and Logistics, and Informatics.

[6]  Pericles A. Mitkas,et al.  Agent Intelligence Through Data Mining (Multiagent Systems, Artificial Societies, and Simulated Organizations) , 2005 .

[7]  John-Jules Ch. Meyer Agent Technology , 2008, Wiley Encyclopedia of Computer Science and Engineering.

[8]  Pericles A. Mitkas,et al.  Information agents cooperating with heterogenous data sources for customer-order management , 2004, SAC '04.

[9]  Alexander B. Bordetsky,et al.  Agent-based support for collaborative data mining in systems management , 2001, Proceedings of the 34th Annual Hawaii International Conference on System Sciences.

[10]  A. Roadmapof A Roadmap of Agent Research and Development , 1995 .

[11]  A.M.B. Ahmad,et al.  An architecture design of the intelligent agent for speech recognition and translation , 2004, 2004 IEEE Region 10 Conference TENCON 2004..

[12]  F. Leon,et al.  Mining Association Rules in Geographic Information Systems using Intelligent Agents , 2004 .

[13]  Chunsheng Li,et al.  Agent-Based Pattern Mining of Discredited Activities in Public Services , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology Workshops.

[14]  Timo Steffens,et al.  Understanding Agent Systems , 2004, Künstliche Intell..

[15]  Richard Murch,et al.  Intelligent Software Agents , 1998 .

[16]  Sung Wook Baik,et al.  Agent Based Distributed Data Mining , 2004, PDCAT.

[17]  Dr. Alex A. Freitas Data Mining and Knowledge Discovery with Evolutionary Algorithms , 2002, Natural Computing Series.

[18]  Patrik Floréen,et al.  An Architecture for Distributed Agent-Based Data Preprocessing , 2005, AIS-ADM.

[19]  Ramasamy Uthurusamy,et al.  Data mining and knowledge discovery in databases , 1996, CACM.

[20]  R. Garduno-Ramirez,et al.  An architecture of multi-agent system applied to fossil-fuel power unit , 2004, IEEE Power Engineering Society General Meeting, 2004..

[21]  Reda Alhajj,et al.  Multiagent reinforcement learning using OLAP-based association rules mining , 2003, IEEE/WIC International Conference on Intelligent Agent Technology, 2003. IAT 2003..

[22]  P. C. Janca,et al.  Practical design of intelligent agent systems , 1998 .

[23]  Ayse Yasemin Seydim INTELLIGENT AGENTS: A DATA MINING PERSPECTIVE , 2001 .

[24]  Chengqi Zhang,et al.  A Human-Friendly MAS for Mining Stock Data , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology Workshops.

[25]  Zhong-Yang Xiong,et al.  Distributed intrusion detection based on clustering , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[26]  Margaret H. Dunham,et al.  Data Mining: Introductory and Advanced Topics , 2002 .

[27]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[28]  Emil Jovanov,et al.  An agent based framework for virtual medical devices , 2002, AAMAS '02.

[29]  Reda Alhajj,et al.  Fuzzy OLAP association rules mining-based modular reinforcement learning approach for multiagent systems , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[30]  Rahul Ramachandran,et al.  Agent framework for intelligent data processing , 2004, IGARSS 2004. 2004 IEEE International Geoscience and Remote Sensing Symposium.

[31]  Qing He,et al.  Execution Engine of Meta-learning System for KDD in Multi-agent Environment , 2005, AIS-ADM.

[32]  Tak-Chung Fu,et al.  Agent-based network intrusion detection system using data mining approaches , 2005, Third International Conference on Information Technology and Applications (ICITA'05).

[33]  Zili Zhang,et al.  Agents and Data Mining: Mutual Enhancement by Integration , 2005, AIS-ADM.

[34]  Yun-Lan Wang,et al.  Mobile-agent-based distributed and incremental techniques for association rules , 2003, Proceedings of the 2003 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.03EX693).

[35]  Piotr Jedrzejowicz,et al.  An agent-based approach to ANN training , 2005, Knowl. Based Syst..

[36]  Jerzy Bala,et al.  Applications of Distributed Mining Techniques For Knowledge Discovery in Dispersed Sensory Data , 2004 .

[37]  Lulu Zhang,et al.  A multiagent data warehousing (MADWH) and multiagent data mining (MADM) approach to brain modeling and neurofuzzy control , 2004, Inf. Sci..

[38]  Anália Lourenço,et al.  Agent-based knowledge extraction services inside enterprise data warehousing systems environments , 2001, 12th International Workshop on Database and Expert Systems Applications.

[39]  Matthias Klusch,et al.  Distributed data mining and agents , 2005, Eng. Appl. Artif. Intell..

[40]  Vijayan Sugumaran,et al.  Application of intelligent agent technology for managerial data analysis and mining , 1999, DATB.