A faster estimation method for the probability of informed trading using hierarchical agglomerative clustering

The probability of informed trading (PIN) is a commonly used market microstructure measure of the level of information asymmetry. Estimating PIN can be problematic because of boundary (corner) solutions, local maxima and floating point exceptions (FPE). Yan and Zhang [J. Bank. Finance, 2012, 36, 454–467] show that, whilst factorization can solve FPE, boundary solutions appear frequently in maximum likelihood estimation of PIN, and they suggest a grid search over initial values to overcome this problem. We present a faster method, based on hierarchical agglomerative clustering (HAC), for reducing the likelihood of boundary solutions and local maxima. We show that HAC yields a fast and accurate approximation of the starting values for PIN, which improves both the speed and the accuracy of the maximum likelihood estimation.
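The abstract does not spell out the clustering procedure, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes the standard model with parameters α (probability of an information event), δ (probability that the event is bad news), μ (informed arrival rate) and ε_b, ε_s (uninformed buy and sell arrival rates), for which PIN = αμ / (αμ + ε_b + ε_s). It further assumes that days are clustered on the daily order imbalance (buys minus sells) into three groups read as bad-news, no-news and good-news days, and that complete linkage is used. The function names and the toy data are hypothetical.

```python
from statistics import fmean


def complete_linkage_clusters(values, k):
    """Naive agglomerative clustering of 1-D values into k clusters.

    Starts with one cluster per observation and repeatedly merges the two
    clusters whose farthest members are closest (complete linkage).
    """
    clusters = [[i] for i in range(len(values))]
    while len(clusters) > k:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = max(abs(values[i] - values[j])
                        for i in clusters[a] for j in clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a].extend(clusters.pop(b))  # b > a, so index a stays valid
    return clusters


def hac_initial_values(buys, sells):
    """Starting values for PIN estimation from clustered order imbalances.

    Assumption: the cluster with the lowest mean imbalance holds bad-news
    days, the middle one no-news days, the highest one good-news days.
    """
    if len(buys) != len(sells) or len(buys) < 3:
        raise ValueError("need matching buy/sell counts for at least 3 days")
    imbalance = [b - s for b, s in zip(buys, sells)]
    clusters = complete_linkage_clusters(imbalance, 3)
    clusters.sort(key=lambda c: fmean(imbalance[i] for i in c))
    bad, none, good = clusters

    n_event = len(bad) + len(good)
    alpha = n_event / len(buys)
    delta = len(bad) / n_event
    # Uninformed rates: buys on days without good news, sells on days
    # without bad news.
    eps_b = fmean(buys[i] for i in bad + none)
    eps_s = fmean(sells[i] for i in good + none)
    # Informed rate: excess buys on good-news days and excess sells on
    # bad-news days, weighted by cluster size and floored at zero.
    mu_b = max(fmean(buys[i] for i in good) - eps_b, 0.0)
    mu_s = max(fmean(sells[i] for i in bad) - eps_s, 0.0)
    mu = (len(good) * mu_b + len(bad) * mu_s) / n_event
    pin = alpha * mu / (alpha * mu + eps_b + eps_s)
    return {"alpha": alpha, "delta": delta, "mu": mu,
            "eps_b": eps_b, "eps_s": eps_s, "pin": pin}


# Toy data (hypothetical): six quiet days, two good-news, two bad-news days.
buys = [100, 102, 98, 101, 99, 100, 160, 158, 100, 101]
sells = [100, 98, 101, 99, 102, 100, 100, 102, 161, 157]
init = hac_initial_values(buys, sells)
```

The returned dictionary is meant only as a starting point to hand to a maximum likelihood optimizer, in place of a grid of candidate initial values. For larger samples, library routines such as SciPy's `scipy.cluster.hierarchy.linkage` and `fcluster` could replace the naive clustering loop.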
