Functional Data Analysis for Sparse Auction Data

Bid arrivals of eBay auctions often exhibit “bid sniping”, a phenomenon where “snipers” place their bids at the last moments of an auction. This is one reason why bid histories for eBay auctions tend to have sparse data in the middle and denser data both in the beginning and at the end of the auction. Time spacing of the bids is thus irregular and sparse. For nearly identical products that are auctioned repeatedly, one may view the price history of each of these auctions as realization of an underlying smooth stochastic process, the price process. While the traditional Functional Data Analysis (FDA) approach requires that entire trajectories of the underlying process are observed without noise, this assumption is not satisfied for typical auction data. We provide a review of a recently developed version of functional principal component analysis (Yao et al., 2005), which is geared towards sparse, irregularly observed and noisy data, the principal analysis through conditional expectation (PACE) method. The PACE method borrows and pools information from the sparse data in all auctions. This allows the recovery of the price process even in situations where only few bids are observed. In a modified approach, we adapt PACE to summarize the bid history for varying current times during an ongoing auction through time-varying principal component scores. These scores then serve as time-varying predictors for the closing price. We study the resulting time-varying predictions using both linear regression and generalized additive modelling, with current scores as predictors. These methods will be illustrated with a case study for 157 Palm M515 PDA auctions from e-Bay, and the proposed methods are seen to work reasonably well. Other related issues will also be discussed.

[1]  Wolfgang Jank,et al.  Visualizing Online Auctions , 2005 .

[2]  Mayukh Dass,et al.  Modeling On-Line Art Auction Dynamics Using Functional Data Analysis , 2006, math/0609292.

[3]  Jianqing Fan,et al.  Statistical Estimation in Varying-Coefficient Models , 1999 .

[4]  H. Müller,et al.  Shrinkage Estimation for Functional Principal Component Scores with Application to the Population Kinetics of Plasma Folate , 2003, Biometrics.

[5]  Li Ping Yang,et al.  Nonparametric smoothing estimates of time-varying coefficient models with longitudinal data , 1998 .

[6]  Jianqing Fan,et al.  Local polynomial modelling and its applications , 1994 .

[7]  Ying Zhang,et al.  Time‐Varying Functional Regression for Predicting Remaining Lifetime Distributions from Longitudinal Trajectories , 2005, Biometrics.

[8]  P. Hall,et al.  NONPARAMETRIC KERNEL REGRESSION SUBJECT TO MONOTONICITY CONSTRAINTS , 2001 .

[9]  S. Wood Modelling and smoothing parameter estimation with multiple quadratic penalties , 2000 .

[10]  B. Silverman,et al.  Estimating the mean and covariance structure nonparametrically when the data are curves , 1991 .

[11]  R. Tibshirani,et al.  The Monotone Smoothing of Scatterplots , 1984 .

[12]  Jianqing Fan,et al.  Two‐step estimation of functional linear models with applications to longitudinal data , 1999 .

[13]  Susan A. Murphy,et al.  Monographs on statistics and applied probability , 1990 .

[14]  H. D. Brunk,et al.  Statistical inference under order restrictions : the theory and application of isotonic regression , 1973 .

[15]  Henry W. Altland,et al.  Applied Functional Data Analysis , 2003, Technometrics.

[16]  R. Shibata An optimal selection of regression variables , 1981 .

[17]  William B. Capra,et al.  An Accelerated-Time Model for Response Curves , 1997 .

[18]  H. Müller,et al.  Functional Data Analysis for Sparse Longitudinal Data , 2005 .

[19]  Wolfgang Jank,et al.  Functional Data Analysis in Electronic Commerce Research , 2006, math/0609173.

[20]  Colin O. Wu,et al.  Nonparametric Mixed Effects Models for Unequally Sampled Noisy Curves , 2001, Biometrics.

[21]  J. Kalbfleisch Statistical Inference Under Order Restrictions , 1975 .

[22]  Dr. M. G. Worster Methods of Mathematical Physics , 1947, Nature.

[23]  Wolfgang Jank,et al.  Profiling Price Dynamics in Online Auctions Using Curve Clustering , 2005 .

[24]  Stanley R. Johnson,et al.  Varying Coefficient Models , 1984 .

[25]  J. O. Ramsay,et al.  Functional Data Analysis (Springer Series in Statistics) , 1997 .

[26]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[27]  Catherine A. Sugar,et al.  Principal component models for sparse functional data , 1999 .