Information retrieval, mining, and extraction of content from the Web