Calibrating Survey Data using Iterative Proportional Fitting (Raking)

In this article, I introduce the ipfraking package, which implements weight-calibration procedures known as iterative proportional fitting, or raking, of complex survey weights. The package can handle a large number of control variables and trim the weights in various ways. It also provides diagnostic tools for the weights it creates. I provide examples of its use and a suggested workflow for creating raked replicate weights.
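To make the raking idea concrete, here is a minimal sketch in Python of iterative proportional fitting on unit-level weights. It is not the ipfraking implementation and does not reproduce its syntax: the function names `rake` and `weighted_margin`, the argument layout, and the toy data are assumptions made purely for illustration. Trimming and diagnostics are omitted. The sketch cycles through the control variables, scales the weights within each category so that the weighted total matches the control total, and stops once a full pass changes no weight by more than a small relative tolerance.

```python
def weighted_margin(weights, labels):
    """Sum the weights within each category of one control variable."""
    totals = {}
    for w, lab in zip(weights, labels):
        totals[lab] = totals.get(lab, 0.0) + w
    return totals


def rake(weights, categories, targets, tol=1e-8, max_iter=100):
    """Rake base weights to marginal control totals (illustrative sketch).

    weights    : list of base weights, one per respondent
    categories : dict {variable: list of category labels, one per respondent}
    targets    : dict {variable: {category: control total}}
    Assumes every category is observed in the sample and that the control
    totals of all variables sum to the same population size.
    """
    w = list(weights)
    for _ in range(max_iter):
        max_rel_change = 0.0
        for var, labels in categories.items():
            current = weighted_margin(w, labels)
            for i, lab in enumerate(labels):
                # Proportional adjustment toward this variable's control total
                factor = targets[var][lab] / current[lab]
                max_rel_change = max(max_rel_change, abs(factor - 1.0))
                w[i] *= factor
        if max_rel_change < tol:
            break
    return w


# Toy data (hypothetical): six respondents, two control variables.
base = [1.0] * 6
cats = {
    "sex": ["M", "M", "M", "F", "F", "F"],
    "region": ["N", "N", "S", "N", "S", "S"],
}
ctrl = {
    "sex": {"M": 48.0, "F": 52.0},
    "region": {"N": 40.0, "S": 60.0},
}
raked = rake(base, cats, ctrl)
```

After the call, `weighted_margin(raked, cats["sex"])` and `weighted_margin(raked, cats["region"])` should agree with the control totals up to the tolerance. Respondents in the same cross-classified cell started with equal base weights and receive identical factors, so they end with equal raked weights.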
