Evaluation of Estimation Algorithms: Credibility Tests

Assessments of estimation performance are often available. For example, many statistical estimators and filters provide assessments of the first two moments of their own estimation error (i.e., mean-square error [MSE] or error covariance matrix, and bias). Are these assessments credible, in the sense that they reflect the true situation? This paper addresses this important yet little-studied topic, referred to as the credibility of the assessments (or of the estimators that make the assessments). We define the concept of credibility and formulate three classes of commonly encountered credibility-testing problems: MSE alone, bias alone, and MSE and bias jointly. Taking advantage of results in multivariate statistical analysis, we present several statistical tests for the credibility problems formulated, and we analyze in detail the pros and cons of the proposed tests, contrasting them with the existing test. Representative numerical examples illustrate how these tests can be used and how they perform. For the existing MSE credibility test, we explain its underlying principle and analyze, discuss, and demonstrate its drawbacks and limitations. We also propose a test for comparing different credibility assessments.
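To make the MSE-alone problem concrete, here is a minimal sketch of one widely used check from the tracking literature: a chi-square test on the normalized squared estimation error. It is not claimed to be any of the tests proposed in the paper. It assumes scalar, zero-mean, Gaussian errors that are independent across Monte Carlo runs, and it uses the Wilson-Hilferty approximation for chi-square quantiles. The names `chi2_quantile` and `nees_credibility_test` are hypothetical, chosen for illustration only.

```python
import math
import random
from statistics import NormalDist


def chi2_quantile(p, dof):
    """Approximate chi-square quantile via the Wilson-Hilferty approximation."""
    z = NormalDist().inv_cdf(p)
    c = 2.0 / (9.0 * dof)
    return dof * (1.0 - c + z * math.sqrt(c)) ** 3


def nees_credibility_test(errors, assessed_mse, alpha=0.05):
    """Two-sided chi-square check of self-assessed MSE for scalar errors.

    errors       -- actual estimation errors, one per independent run
    assessed_mse -- the estimator's own MSE assessment for each run
    Returns (average normalized squared error, lower bound, upper bound, accepted).
    """
    n = len(errors)
    # Sum of e^2 / p is chi-square with n degrees of freedom if the
    # assessments are credible (under the assumptions stated above).
    stat = sum(e * e / p for e, p in zip(errors, assessed_mse))
    lo = chi2_quantile(alpha / 2, n)
    hi = chi2_quantile(1 - alpha / 2, n)
    return stat / n, lo / n, hi / n, lo <= stat <= hi


# Simulated errors with true variance 4 (an assumption made for this demo).
rng = random.Random(0)
true_sigma = 2.0
errors = [rng.gauss(0.0, true_sigma) for _ in range(500)]

# An assessment matching the true MSE, and an optimistic one that claims 1.0.
honest = nees_credibility_test(errors, [true_sigma ** 2] * 500)
optimistic = nees_credibility_test(errors, [1.0] * 500)
print("honest:", honest)
print("optimistic:", optimistic)
```

An average normalized squared error well above the upper bound suggests the assessed MSE is too small (an optimistic estimator); a value below the lower bound suggests it is too large (a pessimistic one). Note that this check folds bias and MSE together and relies on the Gaussian assumption.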
