Predicting the long tail of book sales: unearthing the power-law exponent.

The concept of the long tail has recently been used to explain the phenomenon in e-commerce where the total volume of sales of the items in the tail is comparable to that of the most popular items. In the case of online book sales, the proportion of tail sales has been estimated using regression techniques on the assumption that the data obeys a power-law distribution. Here we propose a different technique for estimation based on a generative model of book sales that results in an asymptotic power-law distribution of sales, but which does not suffer from the problems related to power-law regression techniques. We show that the proportion of tail sales predicted is very sensitive to the estimated power-law exponent. In particular, if we assume that the power-law exponent of the cumulative distribution is closer to 1.1 rather than to 1.2 (estimates published in 2003, calculated using regression by two groups of researchers), then our computations suggest that the tail sales of Amazon.com, rather than being 40% as estimated by Brynjolfsson, Hu and Smith in 2003, are actually closer to 20%, the proportion estimated by its CEO.

[1]  Ravi Kumar,et al.  Self-similarity in the web , 2001, TOIT.

[2]  Gary G. Yen,et al.  Fitting to the Power-Law Distribution , 2004 .

[3]  Yu Jeffrey Hu,et al.  From Niches to Riches: Anatomy of the Long Tail , 2006 .

[4]  H. Simon,et al.  ON A CLASS OF SKEW DISTRIBUTION FUNCTIONS , 1955 .

[5]  chris-anderson The Long Tail: How Endless Choice is Creating Unlimited Demand , 2007 .

[6]  A. Chris,et al.  Long Tail: How Endless Choice is Creating Unlimited Demand, London: Random House. , 1996 .

[7]  Alison L Gibbs,et al.  On Choosing and Bounding Probability Metrics , 2002, math/0209021.

[8]  Manfred Schroeder,et al.  Fractals, Chaos, Power Laws: Minutes From an Infinite Paradise , 1992 .

[9]  M. Newman Power laws, Pareto distributions and Zipf's law , 2005 .

[10]  Erik Brynjolfsson,et al.  Consumer Surplus in the Digital Economy: Estimating the Value of Increased Product Variety at Online Booksellers , 2003, Manag. Sci..

[11]  Mark Levene,et al.  A stochastic evolutionary model exhibiting power-law behaviour with an exponential cutoff , 2002, cond-mat/0209463.

[12]  J. Chevalier,et al.  Measuring Prices and Price Competition Online: Amazon.com and BarnesandNoble.com , 2003 .

[13]  Derek de Solla Price,et al.  A general theory of bibliometric and other cumulative advantage processes , 1976, J. Am. Soc. Inf. Sci..

[14]  Ronald L. Graham,et al.  Concrete Mathematics, a Foundation for Computer Science , 1991, The Mathematical Gazette.

[15]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[16]  Mark Levene,et al.  A stochastic model for the evolution of the Web allowing link deletion , 2006, TOIT.