论文信息 - Exploring process data

Exploring process data

Abstract With the growth of computer usage at all levels in the process industries, the volume of available data has also grown enormously, sometimes to levels that render analysis difficult. Most of this data may be characterized as historical in the sense that it was not collected on the basis of experiments designed to test specific statistical hypotheses. Consequently, the resulting datasets are likely to contain unexpected features (e.g. outliers from various sources, unsuspected correlations between variables, etc.). This observation is important for two reasons: first, these data anomalies can completely negate the results obtained by standard analysis procedures, particularly those based on squared error criteria (a large class that includes many SPC and chemometrics techniques). Secondly and sometimes more importantly, an understanding of these data anomalies may lead to extremely valuable insights. For both of these reasons, it is important to approach the analysis of large historical datasets with the initial objective of uncovering and understanding their gross structure and character. This paper presents a brief survey of some simple procedures that have been found to be particularly useful at this preliminary stage of analysis.

R. K. Pearson | R. Pearson

[1] Edward Tufte,et al. Visual Explanations , 1997 .

[2] R. Martin,et al. Leave‐K‐Out Diagnostics for Time Series , 1989 .

[3] Peter J. Rousseeuw,et al. Robust regression and outlier detection , 1987 .

[4] John W. Tukey,et al. Exploratory Data Analysis. , 1979 .

[5] Jaroslav Hájek,et al. Theory of rank tests , 1969 .

[6] Laurie Davies,et al. The identification of multiple outliers , 1993 .

[7] Chris Aldrich,et al. Effect of fluid properties on two-phase froth characteristics , 1999 .

[8] Ralph B. D'Agostino,et al. Goodness-of-Fit-Techniques , 2020 .

[9] Irene A. Stegun,et al. Handbook of Mathematical Functions. , 1966 .

[10] A. Mackay,et al. A Dictionary of Scientific Quotations , 2019, A Dictionary of Scientific Quotations.

[11] Jordan Stoyanov,et al. Counterexamples in Probability , 1988 .

[12] VERNON J. CLANCEY,et al. Statistical Methods in Chemical Analyses , 1947, Nature.

[13] S. T. Buckland,et al. An Introduction to the Bootstrap. , 1994 .

[14] A. Negiz,et al. Statistical monitoring of multivariable dynamic processes with state-space models , 1997 .

[15] W. W. Muir,et al. Regression Diagnostics: Identifying Influential Data and Sources of Collinearity , 1980 .

[16] J. Jobson. Applied Multivariate Data Analysis , 1995 .

[17] S. J. Devlin,et al. Robust Estimation of Dispersion Matrices and Principal Components , 1981 .

[18] N. L. Johnson,et al. Continuous Univariate Distributions. , 1995 .