Comparison of Centrality Indexes in Network Japanese Text Analysis

Abstract—In recent years, representing the results of text analysis as a network of relationships between words has become a popular research approach. Clarifying the relationships between words is important for future text mining technologies, such as deriving the conceptual meanings of words and the relationships among them, defining new indexes for evaluating the similarity between documents, and visualizing those relationships. In text network analysis, however, no method for evaluating nodes has yet been clearly established. Against this background, we carried out an intensive comparative evaluation of three typical network analysis indexes: degree centrality, closeness centrality, and betweenness centrality. We conclude that betweenness centrality gave the best results.
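For readers unfamiliar with the three indexes named in the abstract, the following minimal sketch computes each of them on a small undirected word network using their standard textbook definitions. It is an illustration only and not the implementation evaluated in this paper: the normalizations, the assumption of a connected and unweighted graph, the helper names (build_graph, degree_centrality, closeness_centrality, betweenness_centrality), and the toy word edges are all assumptions made for this example.

```python
from collections import deque


def build_graph(edges):
    """Build an undirected adjacency map {node: set(neighbors)} from word pairs."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def degree_centrality(graph):
    """Number of neighbors divided by the maximum possible (n - 1)."""
    n = len(graph)
    return {v: len(nbrs) / (n - 1) for v, nbrs in graph.items()}


def closeness_centrality(graph):
    """Reachable node count divided by the sum of shortest-path distances.

    For a connected graph this equals (n - 1) / (sum of distances).
    """
    result = {}
    for source in graph:
        dist = {source: 0}
        queue = deque([source])
        while queue:  # breadth-first search from source
            v = queue.popleft()
            for w in graph[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
        total = sum(dist.values())
        result[source] = (len(dist) - 1) / total if total > 0 else 0.0
    return result


def betweenness_centrality(graph):
    """Share of shortest paths passing through each node (Brandes' algorithm)."""
    score = {v: 0.0 for v in graph}
    for source in graph:
        stack = []
        preds = {v: [] for v in graph}   # predecessors on shortest paths
        sigma = {v: 0 for v in graph}    # number of shortest paths from source
        sigma[source] = 1
        dist = {source: 0}
        queue = deque([source])
        while queue:
            v = queue.popleft()
            stack.append(v)
            for w in graph[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in graph}
        while stack:  # accumulate dependencies in order of decreasing distance
            w = stack.pop()
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != source:
                score[w] += delta[w]
    n = len(graph)
    if n <= 2:
        return score
    # Each unordered pair is counted twice in an undirected graph, so dividing
    # by (n - 1)(n - 2) normalizes the halved score by (n - 1)(n - 2) / 2.
    scale = (n - 1) * (n - 2)
    return {v: s / scale for v, s in score.items()}


# Hypothetical co-occurrence edges, invented for this illustration only.
toy_graph = build_graph([
    ("text", "mining"), ("text", "word"), ("word", "network"),
    ("network", "node"), ("network", "centrality"),
])

for name, index in [("degree", degree_centrality),
                    ("closeness", closeness_centrality),
                    ("betweenness", betweenness_centrality)]:
    ranking = sorted(index(toy_graph).items(), key=lambda kv: -kv[1])
    print(name, [(word, round(value, 3)) for word, value in ranking])
```

Degree centrality looks only at a word's immediate neighbors, closeness centrality at its distance to every other word, and betweenness centrality at how often it lies on shortest paths between other words, so the three indexes can rank the same words differently.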