Evaluation Methods to Improve Information Content Extraction from the Web