Traditional methods of developing tests have been driven by item content specification and have relied on the use of summed ordinal scores in an attempt to create a quantitative index of ability. There are fundamental problems with this approach to test development. The first problem is that the summing of qualitative ordinal counts to create a total score does not result in a number that is a valid means of making quantitative comparisons of performances. The second problem is that reliance on content experts does not ensure that the test items indeed test the construct in question. As a result, the development of functional assessments based on Rasch measurement models is becoming a preferred method among rehabilitation professionals for constructing tests. Rasch measurement models offer an alternative approach to instrument development that results in unidimensional linear measures based on additive numbers. Rasch analyses also generate goodness-of-fit statistics that can be used to perform confirmatory construct validity analyses of the constructed scales based on (a) formulating expectations about what should happen when a group of persons take a test, and then (b) confirming that the test items fit that model. This paper describes how the many-faceted Rasch model was used to develop the motor scale of the Assessment of Motor and Process Skills (AMPS) (Fisher, 1992). The process of beginning test development with a theory of expectations related to a constructed scale and then confirming that the scale conforms to these expectations also is demonstrated.
[1]
J C Grip,et al.
Ordinal scales and foundations of misinference.
,
1989,
Archives of physical medicine and rehabilitation.
[2]
L. George,et al.
OARS Methodology
,
1985,
Journal of the American Geriatrics Society.
[3]
L. Rubenstein,et al.
Systematic biases in functional status assessment of elderly adults: effects of different data sources.
,
1984,
Journal of gerontology.
[4]
R. Hambleton.
Principles and selected applications of item response theory.
,
1989
.
[5]
Mary E. Lunz,et al.
Judge Consistency and Severity Across Grading Periods
,
1990
.
[6]
B. Wright,et al.
Observations are always ordinal; measurements, however, must be interval.
,
1989,
Archives of physical medicine and rehabilitation.
[7]
S. Doble.
Intrinsic Motivation and Clinical Practice: The Key to Understanding the Unmotivated Client
,
1988
.
[8]
W. Fisher.
Measurement-related problems in functional assessment.
,
1993,
The American journal of occupational therapy : official publication of the American Occupational Therapy Association.
[9]
Mary E. Lunz,et al.
Measuring the Impact of Judge Severity on Examination Scores
,
1990
.