Aalto University, P.O. Box 11000, FI-00076 Aalto www.aalto.fi Author Teuvo Kohonen Name of the publication New Developments of Nonlinear Projections for the Visualization of Structures in Nonvectorial Data Sets Publisher School of Science Unit Department of Information and Computer Science Series Aalto University publication series SCIENCE + TECHNOLOGY 8/2011 Field of research Computer science Abstract New nonlinear projections for the visualization of structures in nonvectorial data sets are suggested. Since there exist problems with the convergence of the traditional multidimensional scaling (MDS) when the data are nonvectorial, a new version of the MDS, called the nearest-neighbors multidimensional scaling (NN-MDS), is introduced. While it represents the local data structures more accurately and converges fast, two amendments had to be added, in order to describe the global structures as well. A new initialization method called the GENINIT is also introduced. It is very fast and may be used as a nonlinear projection, too, but it is more suitable for the initialization of the more accurate learning algorithms.New nonlinear projections for the visualization of structures in nonvectorial data sets are suggested. Since there exist problems with the convergence of the traditional multidimensional scaling (MDS) when the data are nonvectorial, a new version of the MDS, called the nearest-neighbors multidimensional scaling (NN-MDS), is introduced. While it represents the local data structures more accurately and converges fast, two amendments had to be added, in order to describe the global structures as well. A new initialization method called the GENINIT is also introduced. It is very fast and may be used as a nonlinear projection, too, but it is more suitable for the initialization of the more accurate learning algorithms.
[1]
Rolf Apweiler,et al.
The SWISS-PROT protein sequence data bank and its supplement TrEMBL
,
1997,
Nucleic Acids Res..
[2]
J. Kruskal.
Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis
,
1964
.
[3]
John W. Sammon,et al.
A Nonlinear Mapping for Data Structure Analysis
,
1969,
IEEE Transactions on Computers.
[4]
M.Kleinberg Jon,et al.
Advances in Self-Organizing Maps, 7th International Workshop, WSOM 2009, St. Augustine, FL, USA, June 8-10, 2009. Proceedings
,
2009,
WSOM.
[5]
Myron Wish,et al.
Three-Way Multidimensional Scaling
,
1978
.
[6]
Teuvo Kohonen,et al.
Self-Organizing Maps
,
2010
.
[7]
D. Lipman,et al.
Improved tools for biological sequence comparison.
,
1988,
Proceedings of the National Academy of Sciences of the United States of America.
[8]
R. Shepard.
The analysis of proximities: Multidimensional scaling with an unknown distance function. II
,
1962
.
[9]
R. Shepard.
The analysis of proximities: Multidimensional scaling with an unknown distance function. I.
,
1962
.
[10]
Tamotsu Kasai,et al.
A Method for the Correction of Garbled Words Based on the Levenshtein Metric
,
1976,
IEEE Transactions on Computers.