ABSTRACT Outlier detection has always been of interest for researchers and data miners. It has been well researched in different knowledge and application domains. This study aims at exploring the correctly identifying outliers using most commonly applied statistics. We evaluate the performance of AO, IO, LS, and TC as vulnerability to spurious outliers by means of empirical level of significance (ELS), power of the test indicating the sensitivity of the statistical tests in detecting changes and the vulnerability to masking of outliers in terms of misspecification frequencies are determined. We have observed that the sampling distribution of test statistic ηtp; tp = AO, IO, LS, TC in case of AR(1) model is connected with the values of n and φ. The sampling distribution of ηTC is less concentrated than the sampling distribution of ηAO, ηIO, and ηLS. In AR(1) process, empirical critical values for 1%, 5%, and 10% upper percentiles are found to be higher than those generally used. We have also found the evidence that the test statistics for transient change (TC) needs to be revisited as the test statistics ηTC is found to be eclipsed by ηAO, ηLS and ηIO at different δ values. TC keeps on confusing with IO and AO, and at extreme δ values it just gets equal to AO and LS.
[1]
Lon-Mu Liu,et al.
Joint Estimation of Model Parameters and Outlier Effects in Time Series
,
1993
.
[2]
Joel L. Horowitz,et al.
Empirically relevant critical values for hypothesis tests: A bootstrap approach
,
2000
.
[3]
R. Tsay.
Outliers, Level Shifts, and Variance Changes in Time Series
,
1988
.
[4]
G. C. Tiao,et al.
Estimation of time series parameters in the presence of outliers
,
1988
.
[5]
R. Tsay.
Time Series Model Specification in the Presence of Outliers
,
1986
.
[6]
M. Otto,et al.
Outliers in Time Series
,
1972
.
[7]
G. Box,et al.
Bayesian analysis of some outlier problems in time series
,
1979
.
[8]
V. Yohai,et al.
Influence Functionals for Time Series
,
1986
.