Experience Report: A Component-Based Data Management and Knowledge Discovery Framework for Aviation Studies

Organizations are beginning to apply data mining and knowledge discovery techniques to their corporate data sets, thereby enabling the identification of trends and the discovery of inductive knowledge. Since traditional transaction databases are not optimized for analytical processing, they must be transformed. This article proposes the use of modular components to decrease the overall amount of human processing and intervention necessary for the transformation process. Our approach configures components to extract data-sets using a set of “extraction hints.†Our framework incorporates decentralized, generic components that are reusable across domains and databases. Finally, we detail an implementation of our component-based framework for an aviation data set.

[1]  Marian H. Nodine,et al.  Active Information Gathering in InfoSleuth , 1999, CODAS.

[2]  Robert Engels Component-based user guidance in knowledge discovery and data mining , 1999, DISKI.

[3]  Surajit Chaudhuri,et al.  Integrating data mining with SQL databases: OLE DB for data mining , 2001, Proceedings 17th International Conference on Data Engineering.

[4]  Desney S. Tan,et al.  Developing a Generic Augmented-Reality Interface , 2002, Computer.

[5]  M. Brian Blake,et al.  Developmental and Operational Processes for Agent-Oriented Database Navigation for Knowledge Discovery , 2003, SEKE.

[6]  Padhraic Smyth,et al.  Knowledge Discovery and Data Mining: Towards a Unifying Framework , 1996, KDD.

[7]  Yike Guo,et al.  An Architecture for Distributed Enterprise Data Mining , 1999, HPCN Europe.

[8]  Kyuseok Shim,et al.  Developing Tightly-Coupled Data Mining Applications on a Relational Database System , 1996, KDD.

[9]  Lisa Singh,et al.  A Component-Based Data Management and Knowledge Discovery Framework for Aviation Studies , 2006 .

[10]  Yun Wang,et al.  Determining the Minimum Sample Size of Audit Data Required to Profile User Behavior and Detect Anomaly Intrusion , 2006, Int. J. Bus. Data Commun. Netw..

[11]  James S. DeArmon,et al.  - 1-- Assessing NAS Performance : Normalizing for the Effects of Weather , 2001 .

[12]  Jianping Zhang,et al.  Mining aviation data to understand impacts of severe weather on airspace system performance , 2002, Proceedings. International Conference on Information Technology: Coding and Computing.

[13]  Jennifer Widom,et al.  The TSIMMIS Project: Integration of Heterogeneous Information Sources , 1994, IPSJ.

[14]  J. Bueno,et al.  KDCOM: A Knowledge Discovery Component Framework , 1998 .

[15]  M. Brian Blake,et al.  Agent-Based Communication for Distributed Workflow Management Using Jini Technologies , 2003, Int. J. Artif. Intell. Tools.

[16]  Michael R. Smith,et al.  The Theory and Implementation of InputValidator: A Semi-Automated Value-Level Bypass Testing Tool , 2008, Int. J. Inf. Technol. Web Eng..

[17]  Hwan-Seung Yong,et al.  The Chamois Component-Based Knowledge Engineering Framework , 2002, Computer.

[18]  Ralph Kimball,et al.  The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses , 1996 .