English-to-Traditional Chinese Cross-lingual Link Discovery in Articles with Wikipedia Corpus

In this paper, we design a processing flow that produces linked data in articles, providing additional information for anchor-based terms together with related terms in another language (English to Traditional Chinese). Wikipedia has become a very important corpus and knowledge bank. Although Wikipedia describes itself as an encyclopedia rather than a dictionary, it has high potential value for applications and data mining research. Link discovery is a useful information retrieval (IR) application that builds on data mining and natural language processing (NLP) algorithms and has been used in several fields. Our experimental results indicate that the proposed method improves the linking results.