Construct Validity and Error Components of Survey Measures: A Structural Modeling Approach

Measurement errors can have profound effects on statistical relationships, and better information on the quality of measures seems needed. This study uses a new technology-structural modeling of data from special supplements to regular surveys-to generate estimates of construct validity, method effects (a major source of correlated error), and residual error (mainly random error) for a broad set of measures obtained from five national surveys and an organizational survey (total respondents = 7,706). Analysis of these estimates suggested that a typical survey item, when administered by a respected survey organization to a general population sample, can be expected to yield 50-83 percent valid variance, 0-7 percent method effects variance, and 14-48 percent residual variance. Multivariate analysis showed that over two-thirds of the variation in measurement quality could be explained by 13 survey design characteristics; characteristics of respondents explained a small additional portion. Results provide: (a) information on design conditions associated with better (or worse) measurement quality, (b) empirically based suggestions for improving measurement quality in future surveys, and (c) a set of coefficients for predicting the quality of measures not studied here. Frank M. Andrews is Program Director in the Survey Research Center and Professor of Psychology and of Population Planning at the University of Michigan. Gerald A. Cole and Mary Grace Moore made numerous and substantial contributions to the work reported here. The author is grateful to David Bowers, Angus Campbell, Charles Cannell, Philip Converse, Richard Curtin, Daniel Denison, and Robert Groves for allowing us to include methodological supplements in some of their surveys. Earlier versions of this paper were presented at the 1980 Annual Meeting of the American Psychological Association, at the 1982 Annual Meeting of the American Association for Public Opinion Research, and at the 1982 Conference on Health Survey Research Methods. This research was supported by grant #SOC78-07676 from the National Science Foundation. Public Opinion Quarterly Vol. 48:409-442 ? 1984 by the Trustees of Columbia University Published by Elsevier Science Publishing Co., Inc. 0033-362X/84/0048-409/$2.50 This content downloaded from 157.55.39.174 on Wed, 31 May 2017 18:33:53 UTC All use subject to http://about.jstor.org/terms 410 FRANK M. ANDREWS complex. Under certain combinations of error, an observed relationship can be "wrong" in both direction and magnitude. However, if one has information about the validity and error composition of the measures being analyzed, more informed judgments can be made about the underlying relationships that are of primary interest. Insightful survey researchers have always been interested in the quality of their data, and new information about data quality has been a major contributor to the development of survey technology. Much attention has been devoted to sampling errors, and there now exist good ways to estimate their magnitudes and much knowledge about how to reduce them. One kind of measurement error, bias (a consistent tendency for a measure to be higher or lower than it "should be") has also received considerable attention (Sudman and Bradburn, 1974). However, while bias can produce serious distortions in percentages, means, and other measures of central tendency, and hence is a threat that must always be considered, a bias that is constant for all respondents does not affect linear relationships at either the bivariate or multivariate level. It is other kinds of measurement errors that intrude on relationships-random and correlated measurement errors. 1 (Key terms are defined below.) These are the kinds of measurement errors investigated in this study. For each of the measures included in any particular analysis, one would like, ideally, to be able to apportion the total variance into three components: valid variance, correlated error variance, and random error variance. From this, one could know the extent to which the true bivariate relationships (i.e., the relationships among the concepts being investigated) were being attenuated (because of random measurement error) and/or inflated (because of correlated measurement error). In addition, one could sort out the complex effects that random and correlated measurement errors have on multivariate statistics such as regression coefficients, multiple and partial correlation coefficients, and path coefficients.2 A pair of examples, taken from the data of this study, will illustrate how misleading even a simple bivariate relationship between observed measures can be when allowance is not made for the effects of measurement errors. In Survey 2, the observed product-moment correlation between items having to do with perceptions about changes in I The conceptualization of measurement quality used here is similar to that discussed by Heise and Bohrnstedt (1970) and by Zeller and Carmines (1980). 2 The important impact that measurement errors have on statistics of relationships has received some attention in recent years (e.g., in sociology by Bohrnstedt and Carter, 1971; in psychology by Linn and Werts, 1973; in political science by Asher, 1974; in statistics by Cochran, 1970), but it still goes unrecognized by many data analysts. This content downloaded from 157.55.39.174 on Wed, 31 May 2017 18:33:53 UTC All use subject to http://about.jstor.org/terms CONSTRUCT VALIDITY AND ERROR COMPONENTS OF SURVEYS 411 business conditions over the past year and in the coming year averaged .41.3 After allowing for measurement error, however, the true relationship between respondents' perceptions was estimated to be .70. Thus in terms of overlapped variance, the observed relationship was only about one-third of what it should have been (17 percent versus 49 percent). In this case, random errors led to a gross deflation of the relationship. However, this does not always occur. Survey 5 produced a relationship of .44 between evaluations of own health and of work that had to be done around the house; but after allowing for measurement error, the true relationship was estimated to be .30. Here, correlated error overwhelmed random error, and the observed percentage of overlapped variance was more than double what it should have been (19 percent versus 9 percent). This study has four major goals, none of which has been pursued previously in a large-scale and systematic way: (1) Test the feasibility of incorporating a particular kind of methodological supplement in regular ongoing national and organizational surveys and of using structural model estimation techniques to generate estimates of measurement quality. (2) Provide descriptive information about estimated construct validity, method effects, and residual error for a broad range of survey measures as implemented by the standard data collection procedures of a respected survey organization. (3) Account for why some survey measures have higher (or lower) measurement quality than others. (4) Provide a means for predicting the construct validity and error components of other survey measures not actually examined in this study. These goals lead to a more general outcome of considerable importance to survey researchers and other users of survey data: more knowledge about how to produce better data. 1. Basic Notions About Validity and Measurement Error

[1]  Frank M. Andrews,et al.  The Quality of Survey Data as Related to Age of Respondent , 1986 .

[2]  G. Bohrnstedt,et al.  Social Measurement: Current Issues , 1981 .

[3]  K. Bollen,et al.  Pearson's R and Coarsely Categorized Measures , 1981 .

[4]  L. F. Carter,et al.  Sociological Methodology 1980. , 1981 .

[5]  Eli P. Cox,et al.  The Optimal Number of Response Alternatives for a Scale: A Review , 1980 .

[6]  P. Bentler,et al.  Significance Tests and Goodness of Fit in the Analysis of Covariance Structures , 1980 .

[7]  M. Corcoran,et al.  Sex Differences in Measurement Error in Status Attainment Models , 1980 .

[8]  D. Marlowe,et al.  The Approval Motive: Studies in Evaluative Dependence , 1980 .

[9]  Edward G. Carmines,et al.  Measurement in the social sciences , 1980 .

[10]  P. Bentler MULTIVARIATE ANALYSIS WITH LATENT VARIABLES: CAUSAL MODELING , 1980 .

[11]  Duane F. Alwin,et al.  MEASUREMENT MODELS FOR RESPONSE ERRORS IN SURVEYS: ISSUES AND APPLICATIONS , 1980 .

[12]  K. Jöreskog Structural analysis of covariance and correlation matrices , 1978 .

[13]  Daniel H. Hill A Methodological note on obtaining MCA coefficients and standard errors , 1978 .

[14]  Robert M. Hauser,et al.  Response Errors of Nonblack Males in Models of the Stratification Process , 1977 .

[15]  H. Gintis,et al.  Schooling and Achievement in American Society , 1977 .

[16]  Robert M. Hauser,et al.  Response Error in Earnings Functions for Nonblack Males , 1977 .

[17]  Bryan W. Coyle,et al.  A Review And Critique Of Analyses Of Multitrait-Multimethod Matrices. , 1977, Multivariate behavioral research.

[18]  James R. Kluegel,et al.  Subjective Class Identification: A Multiple Indicator Approach , 1977 .

[19]  Robert M. Hauser,et al.  Response Errors of Black and Nonblack Males in Models of the Intergenerational Transmission of Socioeconomic Status , 1977, American Journal of Sociology.

[20]  B. Muthén,et al.  Assessing Reliability and Stability in Panel Models , 1977 .

[21]  F. M. Andrews,et al.  The validity of measures of self-reported well-being , 1976 .

[22]  R. Lissitz,et al.  Effect of the number of scale points on reliability: A Monte Carlo approach. , 1975 .

[23]  Measurement errors in the estimation of home value , 1975 .

[24]  H. Blalock,et al.  Applied multivariate analysis and experimental designs , 1975 .

[25]  I. Fellegi An Improved Method of Estimating the Correlated Response Variance , 1974 .

[26]  James O. Ramsay,et al.  The effect of number of categories in rating scales on precision of estimation of scale values , 1973 .

[27]  R. Linn,et al.  Errors of Inference Due to Errors of Measurement , 1973 .

[28]  D. Alwin Approaches to the Interpretation of Relationships in the Multitrait-Multimethod Matrix , 1973 .

[29]  Robert J. Connor,et al.  Grouping for Testing Trends in Categorical Data , 1972 .

[30]  G. Bohrnstedt,et al.  Robustness in Regression Analysis , 1971 .

[31]  R. Boruch,et al.  Alternative Methods of Analysis: Multitrait-Multimethod Data , 1970 .

[32]  William G. Cochran,et al.  Some Effects of Errors of Measurement on Multiple Correlation , 1970 .

[33]  David R. Heise,et al.  VALIDITY, INVALIDITY, AND RELIABILITY , 1970 .

[34]  W. G. Cochran The effectiveness of adjustment by subclassification in removing bias in observational studies. , 1968, Biometrics.

[35]  Frank M. Andrews,et al.  Multiple classification analysis , 1967 .

[36]  D. Campbell,et al.  Convergent and discriminant validation by the multitrait-multimethod matrix. , 1959, Psychological bulletin.

[37]  L. Cronbach,et al.  Construct validity in psychological tests. , 1955, Psychological bulletin.

[38]  J. Guilford Psychometric methods, 2nd ed. , 1954 .

[39]  K. Pearson On the Measurement of the Influence of “Broad Categories” on Correlation , 1913 .