Coefficient alpha and the internal structure of tests

Abstract

A general formula (α) of which a special case is the Kuder-Richardson coefficient of equivalence is shown to be the mean of all split-half coefficients resulting from different splittings of a test. α is therefore an estimate of the correlation between two random samples of items from a universe of items like those in the test. α is found to be an appropriate index of equivalence and, except for very short tests, of the first-factor concentration in the test. Tests divisible into distinct subtests should be so divided before using the formula. The index $$\bar r_{ij}$$, derived from α, is shown to be an index of inter-item homogeneity. Comparison is made to the Guttman and Loevinger approaches. Parallel split coefficients are shown to be unnecessary for tests of common types. In designing tests, maximum interpretability of scores is obtained by increasing the first-factor concentration in any separately-scored subtest and avoiding substantial group-factor clusters within a subtest. Scalability is not a requisite.
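The claim that α is the mean of all split-half coefficients can be checked numerically. The sketch below is illustrative only and makes several assumptions not stated in the abstract: α is computed in its usual variance form, k/(k−1) · (1 − Σ item variances / total-score variance); each split-half coefficient uses the Flanagan-Rulon/Guttman form, 2(1 − (V_a + V_b)/V_t); the test has an even number of items split into equal halves; and the index of inter-item homogeneity is taken to be the Spearman-Brown step-down of α to a single item. The function names and the simulated 0/1 item scores are hypothetical.

```python
from itertools import combinations
import random


def variance(xs):
    """Unbiased sample variance."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)


def coefficient_alpha(scores):
    """scores: one row per person, each row a list of k item scores."""
    k = len(scores[0])
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def split_half(scores, half_a):
    """Flanagan-Rulon/Guttman coefficient for one split (assumed form)."""
    k = len(scores[0])
    half_b = [i for i in range(k) if i not in half_a]
    a = [sum(row[i] for i in half_a) for row in scores]
    b = [sum(row[i] for i in half_b) for row in scores]
    total = [x + y for x, y in zip(a, b)]
    return 2 * (1 - (variance(a) + variance(b)) / variance(total))


def mean_split_half(scores):
    """Average the coefficient over every split into two equal halves."""
    k = len(scores[0])
    # Keep item 0 in the first half so that each split is counted once.
    splits = [(0,) + rest for rest in combinations(range(1, k), k // 2 - 1)]
    return sum(split_half(scores, s) for s in splits) / len(splits)


def mean_interitem_r(alpha, k):
    """Spearman-Brown step-down of alpha to a one-item test (assumed form)."""
    return alpha / (k - (k - 1) * alpha)


# Hypothetical data: 50 simulated examinees, 6 right/wrong items.
random.seed(0)
scores = []
for _ in range(50):
    ability = random.gauss(0, 1)
    scores.append([1 if ability + random.gauss(0, 1) > 0 else 0
                   for _ in range(6)])

alpha = coefficient_alpha(scores)
mean_sh = mean_split_half(scores)
r_bar = mean_interitem_r(alpha, k=6)
```

Under these assumptions `alpha` and `mean_sh` agree up to floating-point error, whatever data are supplied. The agreement depends on the split-half form chosen: correlating the two halves and stepping up with Spearman-Brown gives the same average only when the halves have equal variances.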
