A web-aware interoperable data mining system

Abstract The development of web-aware data mining systems has received a great deal of attention in recent years. It plays a key enabling role for competitive businesses in the E-commerce era. One of the challenges in developing web-aware data mining systems is to integrate and coordinate existing data mining applications in a seamless manner so that cost-effective systems can be developed without the need of costly proprietary products. In this paper we present an approach for developing an interoperable web-aware data mining system to achieve this purpose. This approach applies Remote Method Invocation and high level code wrapper of Java distributed object computing to address the issues of interoperability in heterogeneous environments, which includes programming language, platform, and visual object model. The effectiveness of the proposed system is demonstrated through the integration and enhancement of the two well-known standalone data mining tools, SOM_PAK and Nenet, and runs with the iris data and air pollution data.

[1]  Guido Deboeck Software Tools for Self-Organizing Maps , 1998 .

[2]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[3]  Alex Berson,et al.  Building Data Mining Applications for CRM , 1999 .

[4]  F. P. Goyla Legacy integration-changing perspectives [Cobol] , 2000 .

[5]  Dan Harkey,et al.  Client/Server programming with Java and Corba , 1997 .

[6]  Fang Wei,et al.  Web-enabling legacy applications , 1998, Proceedings 1998 International Conference on Parallel and Distributed Systems (Cat. No.98TB100250).

[7]  Sheng-Tun Li,et al.  A Java-centric Distributed Object-based Paradigm for Surveillance Services and Visual Message Exchange , 1999, J. Vis. Lang. Comput..

[8]  Mu-Chun Su,et al.  Application of neural networks in cluster analysis , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[9]  Walter D. Potter,et al.  Using DCOM to support interoperability in forest ecosystem management decision support systems , 2000 .

[10]  Amjad Umar,et al.  Application (re)engineering: building Web-based applications and dealing with legacies , 1997 .

[11]  David West,et al.  A comparison of SOM neural network and hierarchical clustering methods , 1996 .

[12]  Uday R. Kulkarni,et al.  Self-organizing map network as an interactive clustering tool - An application to group technology , 1995, Decis. Support Syst..

[13]  Rudolf F. Albrecht,et al.  Artificial Neural Nets and Genetic Algorithms , 1995, Springer Vienna.

[14]  Shaw K. Chen,et al.  The comparative ability of self-organizing neural networks to define cluster structure , 1995 .

[15]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[16]  Jim Conallen,et al.  Modeling Web application architectures with UML , 1999, CACM.

[17]  B. Thuraisingham A primer for understanding and applying data mining , 2000 .

[18]  David E. Booth,et al.  The use of fuzzy clustering algorithm and self-organizing neural networks for identifying potentially failing banks: an experimental study , 2000 .

[19]  Sheng-Tun Li,et al.  Multi-resolution spatio-temporal data mining for the study of air pollutant regionalization , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[20]  Paddy Nixon,et al.  Bridging Boundaries: CORBA in Perspective , 1997, IEEE Internet Comput..

[21]  Michael Stonebraker,et al.  Migrating Legacy Systems: Gateways, Interfaces, and the Incremental Approach , 1995 .

[22]  Robert L. Probert,et al.  The distributed object computing paradigm: concepts and applications , 1999, J. Syst. Softw..

[23]  Esa Alhoniemi,et al.  Clustering of the self-organizing map , 2000, IEEE Trans. Neural Networks Learn. Syst..

[24]  T. Kohonen,et al.  Visual Explorations in Finance with Self-Organizing Maps , 1998 .

[25]  Nicolás J. Medrano-Marqués,et al.  Feature Map Architectures for Pattern Recognition: Techniques for Automatic Region Selection , 1995, ICANNGA.