A New MDS Algorithm for Textual Data Analysis

MDS algorithms are data analysis techniques that have been successfully applied to generate a visual representation of multivariate object relationships considering only a similarity matrix. However in high dimensional spaces the concept of proximity become meaningless due to the data sparsity and the maps generated by common MDS algorithms fail often to reflect the object proximities.

[1]  Jonathan Goldstein,et al.  When Is ''Nearest Neighbor'' Meaningful? , 1999, ICDT.

[2]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .

[3]  Manuel Martín-Merino,et al.  Self Organizing Map and Sammon Mapping for Asymmetric Proximities , 2001, ICANN.

[4]  Manuel Martín-Merino,et al.  Visualizing asymmetric proximities with MDS models , 2003, The European Symposium on Artificial Neural Networks.

[5]  A. Morineau,et al.  Multivariate descriptive statistical analysis , 1984 .

[6]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[7]  Anil K. Jain,et al.  Artificial neural networks for feature extraction and multivariate data projection , 1995, IEEE Trans. Neural Networks.

[8]  Philip S. Yu,et al.  Redefining Clustering for High-Dimensional Applications , 2002, IEEE Trans. Knowl. Data Eng..

[9]  Elizabeth R. Jessup,et al.  Matrices, Vector Spaces, and Information Retrieval , 1999, SIAM Rev..

[10]  Alberto Muòoz,et al.  Compound Key Word Generation from Document Databases Using A Hierarchical Clustering ART Model , 1997 .

[11]  Malcolm P. Atkinson,et al.  Issues Raised by Three Years of Developing PJama: An Orthogonally Persistent Platform for Java , 1999, ICDT.

[12]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[13]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[14]  Charu C. Aggarwal,et al.  Re-designing distance functions and distance-based applications for high dimensional data , 2001, SGMD.

[15]  R. Mooney,et al.  Impact of Similarity Measures on Web-page Clustering , 2000 .

[16]  Keinosuke Fukunaga,et al.  Statistical Pattern Recognition , 1993, Handbook of Pattern Recognition and Computer Vision.

[17]  A. Buja,et al.  Inequalities and Positive-Definite Functions Arising from a Problem in Multidimensional Scaling , 1994 .

[18]  Kurt Hornik,et al.  Artificial Neural Networks — ICANN 2001 , 2001, Lecture Notes in Computer Science.