Creating Chinese-English Comparable Corpora

[1]  Bao Hong,et al.  An Extended Keyword Extraction Method , 2012 .

[2]  Mitsuru Ishizuka,et al.  Keyword extraction from a single document using word co-occurrence statistical information , 2004, Int. J. Artif. Intell. Tools.

[3]  Martin Braschler,et al.  Multilingual Information Retrieval Based on Document Alignment Techniques , 1998, ECDL.

[4]  Tao Tao,et al.  Mining comparable bilingual text corpora for cross-language information integration , 2005, KDD '05.

[5]  Azadeh Shakery,et al.  Topic Based Creation of a Persian-English Comparable Corpus , 2011, AIRS.

[6]  Martti Juhola,et al.  Focused web crawling in the acquisition of comparable corpora , 2008, Information Retrieval.

[7]  Ian H. Witten,et al.  Thesaurus-based index term extraction for agricultural documents , 2005 .

[8]  José Gabriel Pereira Lopes,et al.  Using LocalMaxs Algorithm for the Extraction of Contiguous and Non-contiguous Multiword Lexical Units , 1999, EPIA.

[9]  Martti Juhola,et al.  Creating and exploiting a comparable corpus in cross-language information retrieval , 2007, TOIS.

[10]  Philip Resnik,et al.  Mining the Web for Bilingual Text , 1999, ACL.

[11]  Bruno Pouliquen,et al.  Navigating multilingual news collections using automatically extracted information , 2005 .

[12]  Huang De-gen Chinese Word Segmentation Based on the Marginal Probabilities Generated by CRFs , 2009 .

[13]  Haitao Yu,et al.  Mining Large-scale Comparable Corpora from Chinese-English News Collections , 2010, COLING.

[14]  Kuo Zhang,et al.  Keyword extraction based on tf/idf for Chinese news document , 2007, Wuhan University Journal of Natural Sciences.

[15]  Dragos Stefan Munteanu,et al.  Improving Machine Translation Performance by Exploiting Non-Parallel Corpora , 2005, CL.

[16]  Christopher C. Yang,et al.  Building parallel corpora by automatic title alignment using length-based and text-based approaches , 2004, Inf. Process. Manag..

[17]  Xiaojun Wan,et al.  CollabRank: Towards a Collaborative Approach to Single-Document Keyphrase Extraction , 2008, COLING.