Extracting parallel corpora from web comparable documents to improve the quality of an English-Farsi translation system