Bayesian Identification of Outliers and Change-Points in Measurement Error Models

The problem of outlier and change-point identification has received considerable attention in traditional linear regression models from both, classical and Bayesian standpoints. In contrast, for the case of regression models with measurement errors, also known as error-in-variables models, the corresponding literature is scarce and largely focused on classical solutions for the normal case. The main object of this paper is to propose clustering algorithms for outlier detection and change-point identification in scale mixture of error-in-variables models. We propose an approach based on product partition models (PPMs) which allows one to study clustering for the models under consideration. This includes the change-point problem and outlier detection as special cases. The outlier identification problem is approached by adapting the algorithms developed by Quintana and Iglesias [32] for simple linear regression models. A special algorithm is developed for the change-point problem which can be applied in a more general setup. The methods are illustrated with two applications: (i) outlier identification in a problem involving the relationship between two methods for measuring serum kanamycin in blood samples from babies, and (ii) change-point identification in the relationship between the monthly dollar volume of sales on the Boston Stock Exchange and the combined monthly dollar volumes for the New York and American Stock Exchanges.

[1]  P. Dellaportas,et al.  BAYESIAN ANALYSIS OF ERRORS-IN-VARIABLES REGRESSION MODELS , 1995 .

[2]  David J. Olive,et al.  Inconsistency of Resampling Algorithms for High-Breakdown Regression Estimators and a New Algorithm , 2002 .

[3]  F. Vilca-Labra Elliptical Functional Models , 1998 .

[4]  Andy H. Lee,et al.  Assessing Local Influence in Measurement Error Models , 1996 .

[5]  S. Richardson,et al.  Conditional independence models for epidemiological studies with covariate measurement error. , 1993, Statistics in medicine.

[6]  Jun S. Liu Nonparametric hierarchical Bayes via sequential imputations , 1996 .

[7]  C. Antoniak Mixtures of Dirichlet Processes with Applications to Bayesian Nonparametric Problems , 1974 .

[8]  Donald Holbert,et al.  A Bayesian analysis of a switching linear model , 1982 .

[9]  Elliptical structural models , 1996 .

[10]  Sudhir Gupta,et al.  Statistical Regression With Measurement Error , 1999, Technometrics.

[11]  H. Bolfarine,et al.  Estimation of a structural linear regression model with a known reliability ratio , 1993 .

[12]  Mokhtar B. Abdullah Detection of influential observations in functional errors-in-variables model , 1995 .

[13]  Yi-Ping Chang,et al.  Inferences for the Linear Errors-in-Variables with Changepoint Models , 1997 .

[14]  S. MacEachern,et al.  Estimating mixture of dirichlet process models , 1998 .

[15]  E. M. Crowley Product Partition Models for Normal Means , 1997 .

[16]  M. Stephens Bayesian analysis of mixture models with an unknown number of components- an alternative to reversible jump methods , 2000 .

[17]  S. MacEachern,et al.  A semiparametric Bayesian model for randomised block designs , 1996 .

[18]  T. Ferguson A Bayesian Analysis of Some Nonparametric Problems , 1973 .

[19]  R. Arellano-Valle,et al.  Bayesian sensitivity analysis in elliptical linear regression models , 2000 .

[20]  F. Quintana,et al.  Bayesian clustering and product partition models , 2003 .

[21]  J. Hartigan,et al.  Product Partition Models for Change Point Problems , 1992 .

[22]  D. V. Lindley,et al.  The Bayesian Estimation of a Linear Functional Relationship , 1968 .

[23]  Vic Barnett,et al.  Outliers in Statistical Data , 1980 .

[24]  Lyle D. Broemeling,et al.  Bayesian Analysis of Linear Models. , 1986 .

[25]  J. Hartigan,et al.  A Bayesian Analysis for Change Point Problems , 1993 .

[26]  H. Bolfarine,et al.  Influence diagnostics for the structural errors-in-variables model under the Student-t distribution , 2002 .

[27]  Rosangela Helena Loschi,et al.  Extension to the product partition model: computing the probability of a change , 2005, Comput. Stat. Data Anal..