Research Commentary - Too Big to Fail: Large Samples and the p-Value Problem

The Internet has given IS researchers the opportunity to conduct studies with extremely large samples, frequently well over 10,000 observations. Large samples have many advantages, but researchers using statistical inference must be aware of the p-value problem associated with them. In very large samples, p-values quickly approach zero, and relying solely on p-values can lead the researcher to claim support for results of no practical significance. In a survey of large-sample IS research, we found that a substantial number of papers rely on a low p-value and the sign of a regression coefficient alone to support their hypotheses. This research commentary recommends a series of actions the researcher can take to mitigate the p-value problem in large samples and illustrates them with an example of over 300,000 camera sales on eBay. We believe that addressing the p-value problem will increase the credibility of large-sample IS research and provide more insight for readers.
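The claim that p-values shrink toward zero as the sample grows can be illustrated with a small sketch. It is not the commentary's eBay analysis: it assumes a simple one-sample z-test with known variance, and the effect size of 0.01 standard deviations and the sample sizes are hypothetical values chosen only for illustration. The function name `two_sided_p_value` is likewise an assumption of this sketch.

```python
import math


def two_sided_p_value(effect_in_sd: float, n: int) -> float:
    """Two-sided p-value of a one-sample z-test.

    effect_in_sd is the observed mean shift expressed in standard
    deviations (assumed known); n is the number of observations.
    """
    z = effect_in_sd * math.sqrt(n)          # test statistic grows with sqrt(n)
    return math.erfc(abs(z) / math.sqrt(2))  # P(|Z| > |z|) for standard normal Z


# Hypothetical, practically negligible effect: 1% of a standard deviation.
EFFECT = 0.01
SAMPLE_SIZES = [100, 10_000, 300_000, 1_000_000]

# The effect is held fixed; only the sample size changes.
p_values = {n: two_sided_p_value(EFFECT, n) for n in SAMPLE_SIZES}

for n, p in p_values.items():
    print(f"n = {n:>9,}  p = {p:.3g}")
```

Under these assumptions the same tiny effect is far from significant at n = 100 yet has a p-value well below conventional thresholds at the largest sample sizes, which is why a low p-value alone says little about practical significance.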
