Tests of data quality, scaling assumptions, and reliability of the Danish SF-36.

We used general population data (n = 4084) to examine data completeness, response consistency, scaling assumptions, and reliability of the Danish SF-36 Health Survey. We compared traditional multitrait scaling analyses with analyses based on polychoric and Spearman correlations. The frequency of missing values was low, except among elderly people and people with lower levels of education. Response consistency was high and compared well with results for the U.S. SF-36. For respondents with computable scales in all eight domains, scaling assumptions (item internal consistency, item discriminant validity, equal item-own scale correlations, and equal variances) were satisfactory in the total sample and in all subgroups. The SF-36 could discriminate between levels of health in all subgroups, but skewness, kurtosis, and ceiling effects were present in many subgroups (elderly people and people with chronic diseases excepted). The comparison of correlation methods revealed differences that indicate advantages of supplementing traditional methods with methods that do not assume normally distributed responses.
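The scaling checks named above can be illustrated with a minimal sketch. It is not the authors' analysis: the toy responses, the two three-item scales, and all function names are hypothetical. It computes an item-own scale correlation corrected for overlap (the item is removed from its own scale total), compares it with the item's correlation with another scale, and does so with both Pearson and Spearman correlations. Cronbach's alpha is used as the reliability coefficient as an assumption, since the abstract does not say which coefficient was used. Polychoric correlations are omitted because they require maximum-likelihood estimation beyond a short example.

```python
from statistics import mean, variance


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of the 1-based positions i+1..j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def item_scale_correlation(rows, item, scale_items, corr=pearson):
    """Correlate one item with a scale total, excluding the item itself
    from the total when it belongs to that scale (overlap correction)."""
    others = [j for j in scale_items if j != item]
    totals = [sum(row[j] for j in others) for row in rows]
    return corr([row[item] for row in rows], totals)


def cronbach_alpha(rows, scale_items):
    """Cronbach's alpha for the items of one scale."""
    k = len(scale_items)
    item_var = sum(variance([row[j] for row in rows]) for j in scale_items)
    total_var = variance([sum(row[j] for j in scale_items) for row in rows])
    return k / (k - 1) * (1 - item_var / total_var)


# Hypothetical toy data: 6 respondents, items 0-2 form scale A, items 3-5 scale B.
responses = [
    [1, 1, 2, 3, 3, 4],
    [2, 2, 2, 1, 1, 1],
    [3, 3, 4, 4, 5, 4],
    [4, 5, 4, 2, 2, 1],
    [5, 5, 5, 3, 2, 3],
    [2, 3, 3, 5, 5, 5],
]
SCALE_A = [0, 1, 2]
SCALE_B = [3, 4, 5]

for name, corr in (("Pearson", pearson), ("Spearman", spearman)):
    own = item_scale_correlation(responses, 0, SCALE_A, corr)
    other = item_scale_correlation(responses, 0, SCALE_B, corr)
    # Item discriminant validity holds for this item when own > other.
    print(f"{name}: item 0 own-scale r = {own:.2f}, other-scale r = {other:.2f}")

print(f"Cronbach's alpha, scale A = {cronbach_alpha(responses, SCALE_A):.2f}")
```

Passing the correlation function as a parameter keeps the multitrait logic the same while swapping the correlation method, which mirrors the kind of comparison the abstract describes.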
