Paper information - Estimating within-group interrater reliability with and without response bias.

Abstract: This article presents methods for assessing agreement among the judgments made by a single group of judges on a single variable in regard to a single target. For example, the group of judges could be editorial consultants, members of an assessment center, or members of a team. The single target could be a manuscript, a lower-level manager, or a team. The variable on which the target is judged could be overall publishability in the case of the manuscript, managerial potential for the lower-level manager, or team cooperativeness for the team. The methods presented are based on new procedures for estimating interrater reliability. For situations such as the above, these procedures are shown to furnish more accurate and interpretable estimates of agreement than estimates provided by procedures commonly used to estimate agreement, consistency, or interrater reliability. In addition, the proposed methods include processes for controlling for the spurious influences of response biases (e.g., positive leniency, social desirability) on estimates of interrater reliability. (Author)
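The abstract does not state a formula. As a rough illustration only, the single-item within-group index usually attributed to this article compares the observed variance of the judges' ratings with the variance expected if they responded at random: r_wg = 1 - (observed variance / expected variance). The sketch below assumes that form. The function names, the example ratings, and the skewed probabilities standing in for a leniency-type response bias are all hypothetical and are not taken from the text above.

```python
from statistics import variance


def uniform_null_variance(num_options: int) -> float:
    """Variance expected if judges chose among num_options scale points
    uniformly at random (no response bias): (A^2 - 1) / 12."""
    return (num_options ** 2 - 1) / 12


def null_variance_from_probs(probs) -> float:
    """Variance of a discrete null distribution over scale points 1..A.

    probs[i] is the assumed chance of picking scale point i + 1 when
    responding with no true agreement. A skewed list is one way to
    represent a response bias such as positive leniency.
    """
    points = range(1, len(probs) + 1)
    mean = sum(p * x for p, x in zip(probs, points))
    return sum(p * (x - mean) ** 2 for p, x in zip(probs, points))


def rwg_single_item(ratings, expected_variance: float) -> float:
    """Single-item agreement index: 1 - (observed / expected variance).

    Uses the sample variance (n - 1 denominator) of the judges' ratings.
    """
    return 1 - variance(ratings) / expected_variance


# Hypothetical data: five judges rate one target on a 5-point scale.
ratings = [4, 4, 5, 4, 3]

# Without response bias: uniform null distribution.
rwg_uniform = rwg_single_item(ratings, uniform_null_variance(5))

# With an assumed leniency bias: illustrative skewed null distribution.
skewed_probs = [0.05, 0.15, 0.20, 0.35, 0.25]
rwg_skewed = rwg_single_item(ratings, null_variance_from_probs(skewed_probs))

print(rwg_uniform, rwg_skewed)
```

Because a skewed null distribution has a smaller expected variance than the uniform one, the same ratings yield a lower agreement estimate once the assumed bias is taken into account.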

L. James | R. Demaree | Gerrit Wolf

[1] L. Cronbach. Response Sets and Test Validity, 1946.

[2] Raymond B. Cattell, et al. rp and other coefficients of pattern similarity, 1949, Psychometrika.

[3] L. Cronbach. Further Evidence on Response Sets and Test Design, 1950.

[4] L. Cronbach, et al. Assessing similarity between profiles, 1953, Psychological Bulletin.

[5] I. A. Berg, et al. Response Bias in an Unstructured Questionnaire, 1954.

[6] G. A. Miller. The magical number seven, plus or minus two: some limits on our capacity for processing information, 1956, Psychological Review.

[7] J. Overall. Note on multivariate methods for profile analysis, 1964, Psychological Bulletin.

[8] Leonard G. Rorer. The great response-style myth, 1965.

[9] A. Parducci. Category judgment: a range-frequency model, 1965, Psychological Review.

[10] S. Messick, et al. Response styles as personality variables: a theoretical integration of multivariate research, 1965.

[11] M. R. Novick, et al. Statistical Theories of Mental Test Scores, 1971.