Test validity and the ethics of assessment.

Questions of the adequacy of a test as a measure of the characteristic it is interpreted to assess are answerable on scientific grounds by appraising psychometric evidence, especially construct validity. Questions of the appropriateness of test use in proposed applications are answerable on ethical grounds by appraising potential social consequences of the testing. The first set of answers provides an evidential basis for test interpretation, and the second set provides a consequential basis for test use. In addition, this article stresses (a) the importance of construct validity for test use because it provides a rational foundation for predictiveness and relevance, and (b) the importance of taking into account the value implications of test interpretations per se. By thus considering both the evidential and consequential bases of both test interpretation and test use, the roles of evidence and social values in the overall validation process are illuminated, and test validity comes to be based on ethical as well as
