Analysis of Complex Sample Survey Data

Social survey data are often collected using complex sample designs that involve unequal selection probabilities, stratification, and clustering. Because such designs depart from the assumptions of simple random sampling, they require special methods for variance estimation and inference. This article reviews the issues and techniques involved in analyzing complex survey data and illustrates those techniques with data from several complex surveys. The discussion centers on methodological and some practical issues in data analysis and on the need to analyze complex survey data properly.
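To make the variance-estimation point concrete, the following is a minimal sketch, not the article's own procedure, of a Taylor-linearization variance estimate for a weighted mean under a stratified cluster design. It is compared with the naive estimate that treats the same observations as a simple random sample. The field names (`stratum`, `psu`, `w`, `y`), the function names, and the toy data are all assumptions made for illustration. The sketch also assumes primary sampling units (PSUs) are drawn with replacement within strata and ignores finite population corrections.

```python
from collections import defaultdict
from statistics import variance


def weighted_mean(rows):
    """Ratio estimate of the mean: sum(w * y) / sum(w)."""
    total_w = sum(r["w"] for r in rows)
    return sum(r["w"] * r["y"] for r in rows) / total_w


def linearized_variance(rows):
    """Linearization variance of the weighted mean.

    Assumes PSUs are sampled with replacement within each stratum
    (illustrative simplification; no finite population correction).
    """
    total_w = sum(r["w"] for r in rows)
    ybar = weighted_mean(rows)

    # Sum the linearized values z_i = w_i * (y_i - ybar) / sum(w) per PSU.
    psu_totals = defaultdict(lambda: defaultdict(float))
    for r in rows:
        z = r["w"] * (r["y"] - ybar) / total_w
        psu_totals[r["stratum"]][r["psu"]] += z

    var = 0.0
    for psus in psu_totals.values():
        n_h = len(psus)
        if n_h < 2:
            raise ValueError("each stratum needs at least two PSUs")
        mean_z = sum(psus.values()) / n_h
        # Between-PSU variation within the stratum.
        var += n_h / (n_h - 1) * sum((z - mean_z) ** 2 for z in psus.values())
    return var


def naive_srs_variance(rows):
    """Variance of the mean if the data were a simple random sample."""
    ys = [r["y"] for r in rows]
    return variance(ys) / len(ys)


# Hypothetical toy data: 2 strata, 2 PSUs per stratum, 2 respondents per PSU.
rows = [
    {"stratum": "A", "psu": 1, "w": 1.0, "y": 1.0},
    {"stratum": "A", "psu": 1, "w": 1.0, "y": 1.0},
    {"stratum": "A", "psu": 2, "w": 1.0, "y": 3.0},
    {"stratum": "A", "psu": 2, "w": 1.0, "y": 3.0},
    {"stratum": "B", "psu": 1, "w": 1.0, "y": 2.0},
    {"stratum": "B", "psu": 1, "w": 1.0, "y": 2.0},
    {"stratum": "B", "psu": 2, "w": 1.0, "y": 4.0},
    {"stratum": "B", "psu": 2, "w": 1.0, "y": 4.0},
]

design_var = linearized_variance(rows)
srs_var = naive_srs_variance(rows)
design_effect = design_var / srs_var  # ratio above 1 means SRS understates variance
```

In this toy data, respondents within a PSU give identical answers, so the clustering carries most of the variability and the design-based variance is larger than the simple-random-sampling figure. Real surveys would normally rely on established survey software rather than a hand-rolled routine like this one.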
