Cross-Lingual Text Categorization

This article deals with the problem of Cross-Lingual Text Categorization (CLTC), which arises when documents in different languages must be classified according to the same classification tree. We describe practical and cost-effective solutions for automatic Cross-Lingual Text Categorization, both in case a sufficient number of training examples is available for each new language and in the case that for some language no training examples are available.

[1]  Amita Goyal Chin Text Databases and Document Management: Theory and Practice , 2000 .

[2]  Stan Matwin,et al.  A learner-independent evaluation of the usefulness of statistical phrases for automated text categorization , 2001 .

[3]  James Mayfield,et al.  Comparing cross-language query expansion techniques by degrading translation resources , 2002, SIGIR '02.

[4]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[5]  Cornelis H. A. Koster,et al.  Taming Wild Phrases , 2003, ECIR.

[6]  M. Teresa Cabré Castellví,et al.  Automatic term detection: A review of current systems , 2001 .

[7]  W. Bruce Croft,et al.  Cross-lingual relevance models , 2002, SIGIR '02.

[8]  Cornelis H. A. Koster,et al.  Uncertainty-Based Noise Reduction and Term Selection in Text Categorization , 2002, ECIR.

[9]  Ido Dagan,et al.  Mistake-Driven Learning in Text Categorization , 1997, EMNLP.

[10]  Djoerd Hiemstra,et al.  Disambiguation Strategies for Cross-Language Information Retrieval , 1999, ECDL.

[11]  Irene A. Stegun,et al.  Handbook of Mathematical Functions. , 1966 .

[12]  Leah S. Larkey,et al.  A patent search and classification system , 1999, DL '99.

[13]  Ellen Riloff,et al.  Little words can make a big difference for text classification , 1995, SIGIR '95.

[14]  David D. Lewis,et al.  An evaluation of phrasal and clustered representations on a text categorization task , 1992, SIGIR '92.

[15]  Douglas W. Oard,et al.  Improved Cross-Language Retrieval using Backoff Translation , 2001, HLT.

[16]  John D. Lafferty,et al.  Information Retrieval as Statistical Translation , 2017 .

[17]  Dale Schuurmans,et al.  General Convergence Results for Linear Discriminant Updates , 1997, COLT '97.