Consistency tests for key comparison data

Results of International Key Comparisons of National Measurement Standards provide the technical basis for the Mutual Recognition Arrangement (MRA) formulated by Le Comite International des Poids et Mesures (CIPM). With many key comparisons already completed and a number of new key comparison experiments currently under way, we now have a better understanding of the statistical issues that need to be addressed for successfully analysing data from key comparisons and making proper interpretations of the results. There is clearly a need for a systematic approach to statistical analyses of key comparison data that can be implemented routinely by all participating laboratories.The determination of a key comparison reference value (KCRV) and its associated uncertainty and the degrees of equivalence are the central tasks in the evaluation of key comparison data. A satisfactory definition of a KCRV, however, is based on the assumption that all laboratories are estimating the same unknown quantity of the common circulating artefact, that is, the results from the different laboratories are mutually consistent. In this paper, we compare a number of statistical procedures for testing the consistency assumption.

[1]  Thomas Mathew,et al.  Models and Confidence Intervals for True Values in Interlaboratory Trials , 2004 .

[2]  Sam Weerahandi,et al.  Exact Statistical Methods for Data Analysis , 1998, Journal of the American Statistical Association.

[3]  Gerhard Ulm,et al.  Radiometry with synchrotron radiation , 2003 .

[4]  Dallas E. Johnson,et al.  Analysis of messy data , 1992 .

[5]  Raghu N. Kacker,et al.  Combined result and associated uncertainty from interlaboratory evaluations based on the ISO Guide , 2002 .

[6]  K Beissner,et al.  On a measure of consistency in comparison measurements , 2002 .

[7]  A. Steele,et al.  Exclusive statistics: simple treatment of the unavoidable correlations from key comparison reference values , 2001 .

[8]  Kam-Wah Tsui,et al.  Generalized p-Values in Significance Testing of Hypotheses in the Presence of Nuisance Parameters , 1989 .

[9]  N. F. Zhang,et al.  An Approach to Combining Results From Multiple Methods Motivated by the ISO GUM , 2000, Journal of research of the National Institute of Standards and Technology.

[10]  B. L. Welch ON THE COMPARISON OF SEVERAL MEAN VALUES: AN ALTERNATIVE APPROACH , 1951 .

[11]  M. Cox The evaluation of key comparison data , 2002 .

[12]  Lara S Schmidt,et al.  Atomic clock models using fractionally integrated noise processes , 2003 .

[13]  M. Cox,et al.  Evaluating degrees of equivalence using 'exclusive' statistics , 2003 .

[14]  R Willink,et al.  On the interpretation and analysis of a degree-of-equivalence , 2003 .

[15]  A G Steele,et al.  Data pooling and key comparison reference values , 2002 .