Making Web Results Relevant with SAS
Many companies search the Web to learn about their competition and understand their potential customers. But how accurate are these search results? For instance, have you ever submitted the query "SAS", only to get results back about "Scandinavian Airlines System"? This paper presents a SAS-based solution for accessing and clustering Yahoo! search engine results by using SAS® Text Miner. We demonstrate how to use matrix factorization techniques, clustering algorithms, and visualizations to discriminate between subsets of documents that are returned as the result of a query.
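To make the idea of factorizing and then clustering query results concrete, here is a minimal sketch in plain Python rather than SAS Text Miner. It is not the paper's implementation. It assumes a truncated singular value decomposition as the matrix factorization (computed by power iteration on the document Gram matrix) and 2-means as the clustering algorithm. The snippets and all function names are hypothetical, invented for illustration.

```python
import math
from collections import Counter

# Hypothetical snippets standing in for results of the ambiguous query "SAS".
SNIPPETS = [
    "SAS analytics software for data mining",
    "SAS text miner software clusters documents",
    "SAS statistical software and data analytics",
    "SAS airline flights to Scandinavia",
    "SAS Scandinavian airline ticket booking",
    "book SAS flights airline tickets",
]

def term_document_matrix(docs):
    """Raw term counts: one row per term, one column per document."""
    counts = [Counter(d.lower().split()) for d in docs]
    vocab = sorted({t for c in counts for t in c})
    return [[c[t] for c in counts] for t in vocab]

def top_eigenpairs(sym, k, iters=500):
    """Top-k eigenpairs of a symmetric PSD matrix (power iteration + deflation)."""
    n = len(sym)
    work = [row[:] for row in sym]
    pairs = []
    for _ in range(k):
        vec = [1.0 + i / n for i in range(n)]  # fixed start keeps the run deterministic
        for _ in range(iters):
            nxt = [sum(work[i][j] * vec[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in nxt))
            vec = [x / norm for x in nxt]
        val = sum(vec[i] * work[i][j] * vec[j] for i in range(n) for j in range(n))
        pairs.append((val, vec))
        for i in range(n):  # deflate: remove the component just found
            for j in range(n):
                work[i][j] -= val * vec[i] * vec[j]
    return pairs

def two_means(points, rounds=20):
    """2-means seeded with the first point and the point farthest from it."""
    def dist2(p, q):
        return sum((a - b) ** 2 for a, b in zip(p, q))
    centers = [points[0], max(points, key=lambda p: dist2(p, points[0]))]
    labels = []
    for _ in range(rounds):
        labels = [min((0, 1), key=lambda c: dist2(p, centers[c])) for p in points]
        for c in (0, 1):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = [sum(x) / len(members) for x in zip(*members)]
    return labels

def cluster_snippets(docs):
    a = term_document_matrix(docs)
    n = len(docs)
    # Eigenvectors of A^T A are the right singular vectors of A;
    # its eigenvalues are the squared singular values.
    gram = [[sum(row[i] * row[j] for row in a) for j in range(n)] for i in range(n)]
    pairs = top_eigenpairs(gram, 2)
    coords = [[math.sqrt(max(val, 0.0)) * vec[d] for val, vec in pairs]
              for d in range(n)]
    # Unit-normalize so clustering compares direction rather than snippet length.
    unit = [[x / math.hypot(*p) for x in p] for p in coords]
    return two_means(unit)

labels = cluster_snippets(SNIPPETS)
print(labels)
```

On this toy input, the first latent dimension mostly reflects the term shared by every snippet ("sas"), while the second separates the software snippets from the airline snippets. That is the kind of distinction the clustering step is meant to surface. A real pipeline would add term weighting, stop-word removal, and more than two dimensions.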