Seven programmed test procedures were developed and investigated using existing data on 190 verbal-type items for 4,885 students. Tests were scored as if the student had taken items in the order implied by the test. Shortened conventional tests were also scored for comparative purposes. The criteria against which these procedures were evaluated were the 190-item total test score and four outside criterion tests.
Against the criterion of reproducing the 190-item total test score in the cross-validation sample the programmed tests were found to he only slightly superior to the shortened conventional tests. However, the programmed tests had correlations with the outside criterion tests that were substantially higher than the corresponding shortened conventional tests. It was estimated that a test which was parallel to the 190-item total test would have to be 3.36 times as long as the best programmed test to have an equal median correlation with the outside criterion tests.
[1]
A G Bayroff,et al.
AN EXPLORATORY STUDY OF BRANCHING TESTS
,
1967
.
[2]
William G. Mollenkopf,et al.
An experimental study of the effects on item-analysis data of changing item placement and test time limit
,
1950
.
[3]
P. Armitage.
Sequential Analysis with More than Two Alternative Hypotheses, and its Relation to Discriminant Function Analysis
,
1950
.
[4]
Donald A. Rock,et al.
An Exploratory Study of Programmed Tests
,
1968
.
[5]
Donald A. Rock,et al.
REPRODUCTION OF TOTAL TEST SCORE THROUGH THE USE OF SEQUENTIAL PROGRAMMED TESTS1
,
1968
.
[6]
Charles T. Myers,et al.
Item Rearrangement Under Typical Test Conditions
,
1968
.
[7]
Abraham Wald,et al.
Statistical Decision Functions
,
1951
.