Paper Information - Self Organizing Maps for the Clustering of Large Sets of Labeled Graphs

Self Organizing Maps for the Clustering of Large Sets of Labeled Graphs

Data mining on Web documents is one of the most challenging tasks in machine learning because of the large number of documents on the Web, their underlying structure (one document may refer to another), and the fact that the data is commonly unlabeled (the class to which a document belongs is not known a priori). This paper considers the latest developments in Self-Organizing Maps (SOM), a machine learning approach, as one way of classifying documents on the Web. The most recent development, the Probability Mapping Graph Self-Organizing Map (PMGraphSOM), is an extension of the earlier GraphSOM approach; it encodes undirected and cyclic graphs in a scalable fashion. This paper illustrates empirically the advantages of the PMGraphSOM over the original GraphSOM model in a data mining application involving graph-structured information. It is shown that the performance achieved can exceed that of current state-of-the-art techniques on a given benchmark problem.

Ah Chung Tsoi | Markus Hagenbuchner | Alessandro Sperduti | Shujia Zhang
