Maximum likelihood estimation of reviewers' acumen in central review setting: categorical data

Successfully evaluating pathologists' acumen could be very useful in improving the concordance of their calls on histopathologic variables. We are proposing a new method to estimate the reviewers' acumen based on their histopathologic calls. The previously proposed method includes redundant parameters that are not identifiable and results are incorrect. The new method is more parsimonious and through extensive simulation studies, we show that the new method relies less on the initial values and converges to the true parameters. The result of the anesthetist data set by the new method is more convincing.

[1]  J. Fleiss Statistical methods for rates and proportions , 1974 .

[2]  A. Meysamie,et al.  Reproducibility determination of WHO classification of endometrial hyperplasia/well differentiated adenocarcinoma and comparison with computerized morphometric data in curettage specimens in Iran , 2009, Diagnostic pathology.

[3]  Geoffrey J. McLachlan,et al.  The EM Algorithm , 2012 .

[4]  B Stenkvist,et al.  Histopathological systems of breast cancer classification: reproducibility and clinical significance. , 1983, Journal of clinical pathology.

[5]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[6]  H. Toutenburg Fleiss, J. L.: Statistical Methods for Rates and Proportions. John Wiley & Sons, New York‐London‐Sydney‐Toronto 1973. XIII, 233 S. , 1974 .

[7]  S. T. Buckland,et al.  An Introduction to the Bootstrap. , 1994 .

[8]  G. Casella,et al.  Statistical Inference , 2003, Encyclopedia of Social Network Analysis and Mining.

[9]  A. P. Dawid,et al.  Maximum Likelihood Estimation of Observer Error‐Rates Using the EM Algorithm , 1979 .

[10]  D. L. Page Interobserver Agreement and Reproducibility in Classification of Invasive Breast Carcinoma: An NCI Breast Cancer Family Registry Study , 2007 .

[11]  James M Boyett,et al.  The influence of central review on outcome associations in childhood malignant gliomas: results from the CCG-945 experience. , 2003, Neuro-oncology.

[12]  Wei Zhao,et al.  A Fast Algorithm for Functional Mapping of Complex Traits , 2004, Genetics.

[13]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[14]  Hanina Hibshoosh,et al.  Interobserver agreement and reproducibility in classification of invasive breast carcinoma: an NCI breast cancer family registry study , 2006, Modern Pathology.

[15]  H. Barnhart,et al.  Modeling Concordance Correlation via GEE to Evaluate Reproducibility , 2001, Biometrics.

[16]  T. Zhou,et al.  The prognostic value of histological grading of posterior fossa ependymomas in children: a Children's Oncology Group study and a review of prognostic factors , 2008, Modern Pathology.

[17]  S. Hui,et al.  Evaluation of diagnostic tests without gold standards , 1998, Statistical methods in medical research.

[18]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[19]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .