Cross-lingual keyword recommendation using latent topics

Multi-lingual text processing is important for content-based and hybrid recommender systems. It helps recommender systems extract content information from broader sources. It also enables systems to recommend items in a user's native language. We propose a cross-lingual keyword recommendation method, which is built on an extended latent Dirichlet allocation model, for extracting latent features from parallel corpora. With this model, the proposed method can recommend keywords from text written in different languages. We evaluate the proposed method using a cross-lingual bibliographic database that contains both English and Japanese abstracts and keywords and show that the proposed method can recommend keywords from abstracts in a cross-lingual environment with almost the same accuracy as in a monolingual environment.

[1]  Carol Peters,et al.  Cross-Language Information Retrieval (CLIR) Track Overview , 1997, TREC.

[2]  Susan T. Dumais,et al.  Automatic Cross-Language Information Retrieval Using Latent Semantic Indexing , 1998 .

[3]  Sanjeev Khudanpur,et al.  Cross-lingual latent semantic analysis for language modeling , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Wei Wang,et al.  Cross language information retrieval based on LDA , 2009, 2009 IEEE International Conference on Intelligent Computing and Intelligent Systems.

[5]  Ralf Krestel,et al.  Latent dirichlet allocation for tag recommendation , 2009, RecSys '09.

[6]  Douglas W. Oard,et al.  Bilingual topic aspect classification with a few training examples , 2008, SIGIR '08.

[7]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[8]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Atsuhiro Takasu,et al.  Latent Topic Extraction from Relational Table for Record Matching , 2009, Discovery Science.

[10]  Stefano Battiston,et al.  Personalised and dynamic trust in social networks , 2009, RecSys '09.