论文信息 - Using Web Text Mining to Predict Future Events: A Test of the Wisdom of Crowds Hypothesis

Using Web Text Mining to Predict Future Events: A Test of the Wisdom of Crowds Hypothesis

This chapter describes an algorithm that predicts events by mining Internet data. A number of specialized Internet search engine queries were designed to summarize results from relevant web pages. At the core of these queries was a set of algorithms that embody the wisdom of crowds hypothesis. This hypothesis states that under the proper conditions the aggregated opinion of a number of nonexperts is more accurate than the opinion of a set of experts. Natural language processing techniques were used to summarize the opinions expressed from all relevant web pages. The specialized queries predicted event results at a statistically significant level. It was hypothesized that predictions from the entire Internet would outperform the predictions of a smaller number of highly ranked web pages. This hypothesis was not confirmed. This data replicated results from an earlier study and indicated that the Internet can make accurate predictions of future events. Evidence that the Internet can function as a wise crowd as predicted by the wisdom of crowds hypothesis was mixed.

Lutz Hamel | Scott Ryan

[1] Sergey Brin,et al. The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[2] James P. Bagrow,et al. How famous is a scientist?—Famous to those who know us , 2004 .

[3] Sandip Debnath,et al. Information incorporation in online in-Game sports betting markets , 2003, EC '03.

[4] Lutz Hamel,et al. The Internet Democracy: A Predictive Model Based on Web Text Mining , 2007, DMIN.

[5] E. Fama. Random Walks in Stock Market Prices , 1965 .

[6] M. V. SIMKIN,et al. Theory of Aces: Fame by Chance or Merit? , 2003 .

[7] James Surowiecki. The wisdom of crowds: Why the many are smarter than the few and how collective wisdom shapes business, economies, societies, and nations Doubleday Books. , 2004 .