A comparison of self-report and clinical diagnostic interviews for depression: diagnostic interview schedule and schedules for clinical assessment in neuropsychiatry in the Baltimore epidemiologic catchment area follow-up.

BACKGROUND The field of psychiatric epidemiology continues to employ self-report instruments, but the low degree of agreement between diagnoses achieved using these instruments vs. that achieved by psychiatrists in the clinical modality threatens the credibility of the results. METHODS In the Baltimore Epidemiologic Catchment Area follow-up, 349 individuals who had a Diagnostic Interview Schedule (DIS) interview were blindly examined by psychiatrists using the Schedules for Clinical Assessment in Neuropsychiatry (SCAN). Comparisons were made at the level of diagnosis, syndrome, and DSM-IV symptom group. Indexes of agreement were computed and characteristics of discrepant cases were identified. RESULTS Agreement on diagnosis of major depressive disorder was only fair (kappa = 0.20), with the DIS missing many cases judged to meet criteria for diagnosis using the SCAN (29% sensitivity). A major source of discrepancy was respondents with false-negative diagnoses who repeatedly failed to report DIS symptoms attributed to life crises or medical conditions. Older age, male sex, and lower impairment were associated with underdetection by the DIS, using logistic regression analysis. In spite of the diagnostic discrepancy, there was substantial correlation in numbers of symptom groups in the 2 modalities (r = 0.49). Agreement was highest (about 55% sensitivity and 90% specificity) when both the SCAN and DIS thresholds were set at the level of depression syndrome instead of diagnosis. CONCLUSIONS Weak agreement at the level of diagnosis continues to threaten the credibility of estimates of prevalence of specific disorders. A bias toward underreporting, as well as stronger agreement at the level of the depression syndrome and on ordinal measures of depressive symptoms, suggests that associations with risk factors are conservative.

[1]  T. Brugha,et al.  SCAN. Schedules for Clinical Assessment in Neuropsychiatry. , 1990, Archives of general psychiatry.

[2]  John K. Wing,et al.  Measurement and Classification of Psychiatric Symptoms: An Instruction Manual for the PSE and Catego Program , 1974 .

[3]  A. Farmer,et al.  A comparison of clinical and diagnostic interview schedule diagnoses. Physician reexamination of lay-interviewed cases in the general population. , 1985, Archives of general psychiatry.

[4]  L. Robins,et al.  National Institute of Mental Health Diagnostic Interview Schedule. Its history, characteristics, and validity. , 1981, Archives of general psychiatry.

[5]  Jennifer L. Kelsey,et al.  Methods in Observational Epidemiology , 1986 .

[6]  J. Anthony,et al.  Natural history of Diagnostic Interview Schedule/DSM-IV major depression. The Baltimore Epidemiologic Catchment Area follow-up. , 1997, Archives of general psychiatry.

[7]  R. Spitzer,et al.  Diagnosis and need for treatment are not the same. , 1998, Archives of general psychiatry.

[8]  G. Nestadt,et al.  Screening for Psychosis in the General Population with a Self‐Report Interview , 1991, The Journal of nervous and mental disease.

[9]  A. Frances Problems in defining clinical significance in epidemiological studies. , 1998, Archives of general psychiatry.

[10]  D. Blazer,et al.  The NIMH Epidemiologic Catchment Area program. Historical context, major objectives, and study population characteristics. , 1984, Archives of general psychiatry.

[11]  M. Folstein,et al.  Comparison of the lay Diagnostic Interview Schedule and a standardized psychiatric diagnosis. Experience in eastern Baltimore. , 1985, Archives of general psychiatry.

[12]  W. Eaton,et al.  Psychopathology and attrition in the Baltimore ECA 15-year follow-up 1981–1996 , 1999, Social Psychiatry and Psychiatric Epidemiology.

[13]  Maxim J. Schlossberg,et al.  Epidemiologic Field Methods in Psychiatry: The NIMH Epidemiologic Catchment Area Program. , 1986 .

[14]  T. Schlaepfer,et al.  Onset and recovery from panic disorder in the Baltimore Epidemiologic Catchment Area follow-up , 1998, British Journal of Psychiatry.

[15]  G. Parker Are the lifetime prevalence estimates in the ECA study accurate? , 1987, Psychological Medicine.

[16]  A. Farmer,et al.  The Composite International Diagnostic Interview. An epidemiologic Instrument suitable for use in conjunction with different diagnostic systems and in different cultures. , 1988, Archives of general psychiatry.

[17]  D S Rae,et al.  Limitations of diagnostic criteria and assessment instruments for mental disorders. Implications for research and policy. , 1998, Archives of general psychiatry.