A clustering approach to detect multiple outliers in linear functional relationship model for circular data

ABSTRACT Outlier detection has been used extensively in data analysis to detect anomalous observation in data. It has important applications such as in fraud detection and robust analysis, among others. In this paper, we propose a method in detecting multiple outliers in linear functional relationship model for circular variables. Using the residual values of the Caires and Wyatt model, we applied the hierarchical clustering approach. With the use of a tree diagram, we illustrate the detection of outliers graphically. A Monte Carlo simulation study is done to verify the accuracy of the proposed method. Low probability of masking and swamping effects indicate the validity of the proposed approach. Also, the illustrations to two sets of real data are given to show its practical applicability.

[1]  R. Reyment,et al.  Statistics and Data Analysis in Geology. , 1988 .

[2]  David M. Sebert,et al.  A clustering algorithm for identifying multiple outliers in linear regression , 1998 .

[3]  I. Ibrahim COVRATIO Statistic for Simple Circular Regression Model , 2011 .

[4]  Ding-Zhu Du,et al.  A Decision Criterion for the Optimal Number of Clusters in Hierarchical Clustering , 2003, J. Glob. Optim..

[5]  A Simple Linear Functional Relationship Model for Circular Variables and Its Application , 2015 .

[6]  Abdul Ghapor Hussin,et al.  COVRATIO statistic for simple circular regression model. , 2011 .

[7]  Halim Setan,et al.  Multiple Outliers Detection Procedures in Linear Regression , 2003 .

[8]  Richard A. Johnson,et al.  Applied Multivariate Statistical Analysis , 1983 .

[9]  Ibrahim Mohamed,et al.  Detection of outliers in simple circular regression models using the mean circular error statistic , 2013 .

[10]  F. E. Grubbs Procedures for Detecting Outlying Observations in Samples , 1969 .

[11]  A. G. Hussin,et al.  Identification of Influential Observations in Circular Regression Model , 2010 .

[12]  L. Wyatt,et al.  A linear functional relationship model for circular data with an application to the assessment of ocean wave measurements , 2003 .

[13]  Satari Siti Zanariah Parameter estimation and outlier detection for some types of circular model , 2015 .

[14]  Abdul Ghapor Hussin,et al.  Estimation of Functional Relationship Model for Circular Variables and Its Application in Measurement Problems , 2010 .

[15]  Hans-Georg Müller,et al.  Functional Data Analysis , 2016 .

[16]  A. G. Hussin,et al.  PARAMETER ESTIMATION OF SIMULTANEOUS LINEAR FUNCTIONAL RELATIONSHIP MODEL FOR CIRCULAR VARIABLES ASSUMING EQUAL ERROR VARIANCES , 2015 .

[17]  A. G. Hussin,et al.  Asymptotic covariance and detection of influential observations in a linear functional relationship model for circular data with application to the measurements of wind directions , 2010 .

[18]  S. R. Jammalamadaka,et al.  Topics in Circular Statistics , 2001 .

[19]  N. I. Fisher Problems with the Current Definitions of the Standard Deviation of Wind Direction , 1987 .

[20]  K. Senthamarai Kannan,et al.  Multiple Linear Regression Models in Outlier Detection , 2012 .

[21]  S. R. Jammalamadaka,et al.  Directional Statistics, I , 2011 .