论文信息 - Self-Organizing-Map-Based Metamodeling for Massive Text Data Exploration

Self-Organizing-Map-Based Metamodeling for Massive Text Data Exploration

In this study, we describe the use of the self-organizing map (SOM) as a metamodeling technique to design a parallel text data exploration system. Firstly, the large textual collections are divided into various small data subsets. Based on the different subsets, different unitary SOM models, i.e., base models, are then trained for word clustering map. In this phase, different SOM models are implemented in parallel to gain greater computational efficiency. Finally, a SOM-based metamodel can be produced to formulate a text category map through learning from all base models. For illustration the proposed metamodel is applied to a massive text data collection.

Kin Keung Lai | Lean Yu | Ligang Zhou | Shouyang Wang

[1] Timo Honkela,et al. WEBSOM - Self-organizing maps of document collections , 1998, Neurocomputing.

[2] Jau-Hsiung Huang,et al. On Parallel Processing Systems: Amdahl's Law Generalized and Some Results on Optimal Design , 1992, IEEE Trans. Software Eng..

[3] Samuel Kaski,et al. Self organization of a massive document collection , 2000, IEEE Trans. Neural Networks Learn. Syst..

[4] Yeuvo Jphonen,et al. Self-Organizing Maps , 1995 .

[5] T. Landauer,et al. Indexing by Latent Semantic Analysis , 1990 .

[6] Hsin-Chang Yang,et al. A Multilingual Text Mining Approach Based on Self-Organizing Maps , 2004, Applied Intelligence.

[7] Teuvo Kohonen,et al. Self-Organizing Maps , 2010 .