Embedded Representations of Wikipedia Categories

In this paper, we present an approach to building neural representations of the Wikipedia category graph. We test four different methods and examine the neural embeddings in terms of preservation of graphs edges, neighborhood coverage in representation space, and their influence on the results of a task predicting parent of two categories. The main contribution of this paper is application of neural representations for improving the structure of Wikipedia categories graph. We also show that a neural representation based solely on categories’ names can be an alternative to the other representations build using more complex approaches.