Data Mining with Independent Component Analysis

Independent component analysis (ICA)/blind source separation (BSS) has received many attentions in neural network and signal processing area recent years. In this paper, we consider the data mining problem with ICA. The data model of under-complete ICA in data mining is given and then gives the most popular ICA algorithm-natural gradient algorithm (NGA). Several applications of data mining with ICA is considered, such as latent variable decompositions, multivariate time series analysis and prediction, text document data analysis, extracting hidden signals in satellite images, weather data mining and so on. All these discussions suggest the huge potential outlook of data mining using ICA. The other contribution of this paper is it contains several literature surveys on various aspects of data mining using ICA

[1]  Terrence J. Sejnowski,et al.  An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.

[2]  F.S. Wang,et al.  Novel algorithm for independent component analysis with flexible score functions , 2004, Proceedings 7th International Conference on Signal Processing, 2004. Proceedings. ICSP '04. 2004..

[3]  Andrzej Cichocki,et al.  Adaptive blind signal and image processing , 2002 .

[4]  Jerome H. Friedman,et al.  DATA MINING AND STATISTICS: WHAT''S THE CONNECTION , 1997 .

[5]  Aapo Hyvärinen,et al.  Fast and robust fixed-point algorithms for independent component analysis , 1999, IEEE Trans. Neural Networks.

[6]  Shun-ichi Amari,et al.  Natural Gradient Works Efficiently in Learning , 1998, Neural Computation.

[7]  Schuster,et al.  Separation of a mixture of independent signals using time delayed correlations. , 1994, Physical review letters.

[8]  Jayanta Basak,et al.  Weather Data Mining Using Independent Component Analysis , 2004, J. Mach. Learn. Res..

[9]  Aapo Hyvärinen,et al.  Complexity Pursuit: Separating Interesting Components from Time Series , 2001, Neural Computation.

[10]  Pierre Comon,et al.  Independent component analysis, A new concept? , 1994, Signal Process..

[11]  Lars Kai Hansen,et al.  Signal Detection Using Ica: Application to Chat Room Topic Spotting , 2002 .

[12]  Erkki Oja,et al.  Independent Component Analysis for Parallel Financial Time Series , 1998, International Conference on Neural Information Processing.

[13]  Shun-ichi Amari,et al.  Natural Gradient Learning for Over- and Under-Complete Bases in ICA , 1999, Neural Computation.

[14]  H. Schneeweiß,et al.  Factor Analysis and Principal Components , 1995 .

[15]  Mohamed Ben Ahmed,et al.  Satellite image analysis based on the method of blind separation of sources for the extraction of information , 2002, IEEE International Geoscience and Remote Sensing Symposium.

[16]  Ella Bingham,et al.  Advances in independent component analysis with applications to data mining , 2003 .

[17]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[18]  Paul A. Viola,et al.  Restructuring Sparse High Dimensional Data for Effective Retrieval , 1998, NIPS.

[19]  Ella Bingham,et al.  ICA and SOM in text document analysis , 2002, SIGIR '02.

[20]  T. Eltoft,et al.  Data augmentation using a combination of independent component analysis and non-linear time-series prediction , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[21]  Y. Kosugi,et al.  ICA AIDED LINEAR SPECTRAL MIXTURE ANALYSIS OF AGRICULTURAL REMOTE SENSING IMAGES , 2003 .

[22]  Ying Han,et al.  FINDING UNDERLYING FACTORS IN TIMESERIES , 2002, Cybern. Syst..

[23]  Heikki Mannila,et al.  Random projection in dimensionality reduction: applications to image and text data , 2001, KDD '01.

[24]  Andrew D. Back,et al.  A First Application of Independent Component Analysis to Extracting Structure from Stock Returns , 1997, Int. J. Neural Syst..

[25]  C. H. Chen The use of independent component analysis as a tool for data mining , 2002, IEEE International Geoscience and Remote Sensing Symposium.

[26]  L. K. Hansen,et al.  Independent Components in Text , 2000 .

[27]  Ata Kabán,et al.  A Dynamic Probabilistic Model to Visualise Topic Evolution in Text Streams , 2002, Journal of Intelligent Information Systems.

[28]  Ata Kabán,et al.  Topic Identification in Dynamical Text by Complexity Pursuit , 2003, Neural Processing Letters.