Document Categorization using Multilingual Associative Networks based on Wikipedia

Associative networks are a connectionist language model with the ability to categorize large sets of documents. In this research we combine monolingual associative networks based on Wikipedia to create a larger, multilingual associative network, using the cross-lingual connections between Wikipedia articles. We prove that such multilingual associative networks perform better than monolingual associative networks in tasks related to document categorization by comparing the results of both types of associative network on a multilingual dataset.

[1]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[2]  Jian Hu,et al.  Cross lingual text classification by mining multilingual topics from wikipedia , 2011, WSDM '11.

[3]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[4]  Roger C. Schank,et al.  SCRIPTS, PLANS, GOALS, AND UNDERSTANDING , 1988 .

[5]  John Algeo,et al.  Problems in the origins and development of the English language , 1964 .

[6]  W. Bruce Croft,et al.  Cross-lingual relevance models , 2002, SIGIR '02.

[7]  Franciska de Jong,et al.  Using Wikipedia with associative networks for document classification , 2013, ESANN.

[8]  G. Marcus The Algebraic Mind: Integrating Connectionism and Cognitive Science , 2001 .

[9]  Francis Bond,et al.  Linking and Extending an Open Multilingual Wordnet , 2013, ACL.

[10]  Gerard de Melo,et al.  Multilingual Text Classification Using Ontologies , 2007, ECIR.

[11]  Mounia Lalmas,et al.  Hierarchical Text Categorisation based on Neural Networks and Dempster-Shafer Theory of Evidence , 2002 .

[12]  Simone Paolo Ponzetto,et al.  BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network , 2012, Artif. Intell..

[13]  Núria Bel,et al.  Cross-Lingual Text Categorization , 2003, ECDL.

[14]  C SchankRoger,et al.  Dynamic Memory: A Theory of Reminding and Learning in Computers and People , 1983 .

[15]  Franciska de Jong,et al.  Hierarchical document categorization using associative networks , 2013 .

[16]  T. Abma,et al.  qualitative research: is meaning lost in translation? , 2022 .

[17]  Fabian M. Suchanek,et al.  YAGO3: A Knowledge Base from Multilingual Wikipedias , 2015, CIDR.

[18]  Hsin-Chang Yang,et al.  Construction of supervised and unsupervised learning systems for multilingual text categorization , 2009, Expert Syst. Appl..

[19]  Chih-Ming Chen,et al.  A Hierarchical Neural Network Document Classifier with Linguistic Feature Selection , 2005, Applied Intelligence.

[20]  W. Bechtel Connectionism and the Philosophy of Mind: An Overview , 2010 .

[21]  Niels Bloom Using Natural Language Processing to Improve Document Categorization with Associative Networks , 2012, NLDB.