Structural Equations and Path Analysis for Discrete Data

This article proposes a solution to the long-standing methodological problem of incorporating discrete variables inoto causal models of social phenomena. Only a subset of the variety of ways in which discrete data arise in empirical social research can be satisfactorily modeled by conventional log-linear or logit approaches. Drawing on the insights of several literatures, this article exposits a general approach to causal models in which some or all variables are discretely measured and shows that path analytic methods are available which permit quantification of causal relationships among variables with the same flexibility and power of interpretation as is feasible in models that include only continous variables. It presents methods of identifying and estimating these models and shows how the direct and indirect effects of independent varibles can be calculated by extensions of usual path analysis methods for continuos variables An important distincion developed here is that discrete variables can play two roles: (1) as measures of inherently discrete phenomena and (2) as indicators of underlying continous variables. The value of this distinction is shown in two empirical examples examined previously by other authors. In examining the effects of social background and parental enouragement on college plans of high school seniors, the article shows that modeling a discrete measure of encouragement as an indicator of a latent continous variable rather than as an inherently discrete variable (as has been done in previous analyses) provides a clearer interpretation and a superior fit to the data. In examining the effects of state Fair-Employment-Practices Legislation on black-white wage differentials, this study shows that two distinct effects on the relative wage can be detected: the direct ameliorative effect of the law itself and the effects of the popular progressive sentiment for racial equality of which the law is an indicator. The methods and models presented here are not only natural generalizations of structural equation and path analysis methods for continuous varibles to include discrete variables but also provide a means of investigating a richer variety of substantive hypotheses than is feasible with methods for discrete data commonly used in the sociological literature to date.

[1]  J. Heckman,et al.  A Method for Minimizing the Impact of Distributional Assumptions in Econometric Models for Duration Data , 1984 .

[2]  D. J. Jackson,et al.  Factor Analysis and Measurement in Sociological Research. , 1983 .

[3]  Lars Peter Hansen,et al.  Multiperiod Probit Models and Orthogonality Condition Estimation , 1983 .

[4]  Leo A. Goodman,et al.  Association models and the bivariate normal for contingency tables with ordered categories , 1981 .

[5]  Charles F. Manski,et al.  STRUCTURAL MODELS FOR DISCRETE DATA: THE ANALYSIS OF DISCRETE CHOICE , 1981 .

[6]  T. Amemiya QUALITATIVE RESPONSE MODELS: A SURVEY , 1981 .

[7]  Leo A. Goodman,et al.  The Limitations of Log-Linear Analysis@@@Analyzing Qualitative/Categorical Data: Log-Linear Models and Latent Structure Analysis. , 1980 .

[8]  R. Stolzenberg The Measurement and Decomposition of Causal Effects in Nonlinear and Nonadditive Models , 1980 .

[9]  F. Krauss Latent Structure Analysis , 1980 .

[10]  P. McCullagh,et al.  The GLIM System, Release 3: Generalized linear interactive modelling , 1979 .

[11]  Bengt Muthén,et al.  A Structural Probit Model with Latent Variables , 1979 .

[12]  S. Gortmaker Poverty and infant mortality in the United States. , 1979, American sociological review.

[13]  L. A. Goodman A Brief Guide to the Causal Analysis of Data from Surveys , 1979, American Journal of Sociology.

[14]  S. Brier THE UTILITY OF SYSTEMS OF SIMULTANEOUS LOGISTIC RESPONSE EQUATIONS , 1979 .

[15]  Stephen E. Fienberg,et al.  The analysis of cross-classified categorical data , 1980 .

[16]  D. Wise,et al.  A CONDITIONAL PROBIT MODEL FOR QUALITATIVE CHOICE: DISCRETE DECISIONS RECOGNIZING INTERDEPENDENCE AND HETEROGENEOUS PREFERENCES' , 1978 .

[17]  J. Heckman Dummy Endogenous Variables in a Simultaneous Equation System , 1977 .

[18]  R. Hauser,et al.  Education, Occupation and Earnings. , 1977 .

[19]  Takeshi Amemiya,et al.  The Maximum Likelihood, the Minimum Chi-Square and the Nonlinear Weighted Least-Squares Estimator in the General Qualitative Response Model , 1976 .

[20]  R. Leik Causal Models with Nominal and Ordinal Data: Retrospective , 1976 .

[21]  James A. Davis,et al.  ANALYZING CONTINGENCY TABLES WITH LINEAR FLOW GRAPHS: D SYSTEMS , 1976 .

[22]  Takeshi Amemiya,et al.  A Modified Logit Model: A Note , 1975 .

[23]  R. Hauser,et al.  The Decomposition of Effects in Path Analysis , 1975 .

[24]  R. McKelvey,et al.  A statistical model for the analysis of ordinal level dependent variables , 1975 .

[25]  Takeshi Amemiya,et al.  Bivariate Probit Analysis: Minimum Chi-Square Methods , 1974 .

[26]  James A. Davis,et al.  Nonparametric path analysis—The multivariate structure of dichotomous data when using the odds Ratio or Yule's Q , 1974 .

[27]  A. Goldberger,et al.  Structural Equation Models in the Social Sciences. , 1974 .

[28]  L. A. Goodman The Analysis of Systems of Qualitative Variables When Some of the Variables Are Unobservable. Part I-A Modified Latent Structure Approach , 1974, American Journal of Sociology.

[29]  Leo A. Goodman,et al.  The analysis of multidimensional contingency tables when some variables are posterior to others: a modified path analysis approach , 1973 .

[30]  O. D. Duncan,et al.  Socioeconomic Background and Achievement. , 1974 .

[31]  Richard E. Quandt,et al.  Nonlinear methods in econometrics , 1973 .

[32]  Leo A. Goodman,et al.  A General Model for the Analysis of Surveys , 1972, American Journal of Sociology.

[33]  D. McFadden Conditional logit analysis of qualitative choice behavior , 1972 .

[34]  Arthur S. Goldberger,et al.  Econometrics and psychometrics: A survey of communalities , 1971 .

[35]  David R. Cox The analysis of binary data , 1970 .

[36]  K. Jöreskog A General Method for Estimating a Linear Structural Equation System. , 1970 .

[37]  J. Ashford,et al.  Multi-variate probit analysis. , 1970, Biometrics.

[38]  Paul F. Lazarsfeld,et al.  Latent Structure Analysis. , 1969 .

[39]  W. Landes The Economics of Fair Employment Laws , 1968, Journal of Political Economy.

[40]  W. H. Sewell,et al.  Social Class, Parental Encouragement, and Educational Aspirations , 1968, American Journal of Sociology.

[41]  O. D. Duncan Path Analysis: Sociological Examples , 1966, American Journal of Sociology.

[42]  Arnold Zellner,et al.  JOINT ESTIMATION OF RELATIONSHIPS INVOLVING DISCRETE RANDOM VARIABLES , 1965 .

[43]  E. Gumbel Bivariate Logistic Distributions , 1961 .

[44]  Lillian Cohen,et al.  Statistical Methods for Social Scientists. , 1954 .

[45]  J. Berkson Why I Prefer Logits to Probits , 1951 .

[46]  J. Gaddum Probit Analysis , 1948, Nature.

[47]  DAVID G. KENDALL,et al.  Introduction to Mathematical Statistics , 1947, Nature.

[48]  G. Yule On the Association of Attributes in Statistics: With Illustrations from the Material of the Childhood Society, &c , 1900 .