Using XBRL to Conduct a Large-Scale Study of Discrepancies between the Accounting Numbers in Compustat and SEC 10-K Filings

ABSTRACT:  Compustat is frequently used for both research and decision making. It has been documented that information found in the Compustat database differs from both the information found in other accounting databases and the information disclosed in corporate financial filings (San Miguel 1977; Rosenberg and Houglet 1974; Yang, Vasarhelyi, and Liu 2003; Tallapally, Luehlfing, and Motha 2011, 2012; Boritz and No 2013). In this study, we conduct the first large-scale comparison of Compustat and 10-K data. Specifically, we compare 30 accounting line items of approximately 5,000 companies for the period from October 1, 2011 to September 30, 2012. We find that the values reported in Compustat significantly differ from the values reported in 10-K filings. We also find that the amount and magnitude of the original data alterations introduced by Compustat depend on the type of the accounting item and company characteristics, such as industry and size.

[1]  D. Altman,et al.  STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT , 1986, The Lancet.

[2]  Michael S. Luehlfing,et al.  The Partnership Of EDGAR Online And XBRL - Should Compustat Care? , 2011, BIS 2011.

[3]  Barr Rosenberg,et al.  Error Rates in CRSP and Compustat Data Bases and their Implications , 1974 .

[4]  W. L. Beedles,et al.  A Note on Skewness and Data Errors , 1978 .

[5]  J. Efrim Boritz,et al.  The Quality of Interactive Data: XBRL Versus Compustat, Yahoo Finance, and Google Finance , 2013 .

[6]  Victor R. Basili,et al.  Comparing the Effectiveness of Software Testing Strategies , 1987, IEEE Transactions on Software Engineering.

[7]  Beth B. Kern,et al.  Differences in the COMPUSTAT and expanded Value Line databases and the potential impact , 1994 .

[8]  Caixing Liu,et al.  A note on the using of accounting databases , 2003, Ind. Manag. Data Syst..

[9]  E. Fama,et al.  Industry costs of equity , 1997 .

[10]  Miklos A. Vasarhelyi,et al.  XBRL Mandate: Thousands of Filing Errors and So What? , 2013, J. Inf. Syst..

[11]  J. Efrim Boritz,et al.  The Sec's XBRL Voluntary Filing Program on EDGAR: A Case for Quality Assurance , 2008 .

[12]  R. Bennin,et al.  Error Rates in CRSP and COMPUSTAT: A Second Look , 1980 .

[13]  R. Freeman,et al.  An earnings prediction approach to examining intercompany information transfers , 1992 .

[14]  Carsten Felden,et al.  Flex or Break? Extensions in XBRL Disclosures to the SEC , 2011 .

[15]  Edward I. Altman,et al.  FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND THE PREDICTION OF CORPORATE BANKRUPTCY , 1968 .

[16]  Richard W. McEnally A NOTE ON THE RETURN BEHAVIOR OF HIGH RISK COMMON STOCKS , 1974 .

[17]  Kathleen M. Kahle,et al.  The Impact of Industry Classifications on Financial Research , 1996, Journal of Financial and Quantitative Analysis.

[18]  Glenford J. Myers,et al.  A controlled experiment in program testing and code walkthroughs/inspections , 1978, CACM.

[20]  Barbara D. Klein,et al.  Data Quality in Linear Regression Models: Effect of Errors in Test Data and Errors in Training Data on Predictive Accuracy , 1999, Informing Sci. Int. J. an Emerg. Transdiscipl..

[21]  P. Jaccard,et al.  Etude comparative de la distribution florale dans une portion des Alpes et des Jura , 1901 .

[22]  Andrew J. Rosman,et al.  Differences between COMPUSTAT and CRSP SIC codes and related effects on research , 1994 .