Determination of minimum sample size for fault diagnosis of automobile hydraulic brake system using power analysis

Abstract Hydraulic brake in automobile engineering is considered to be one of the important components. Condition monitoring and fault diagnosis of such a component is very essential for safety of passengers, vehicles and to minimize the unexpected maintenance time. Vibration based machine learning approach for condition monitoring of hydraulic brake system is gaining momentum. Training and testing the classifier are two important activities in the process of feature classification. This study proposes a systematic statistical method called power analysis to find the minimum number of samples required to train the classifier with statistical stability so as to get good classification accuracy. Descriptive statistical features have been used and the more contributing features have been selected by using C4.5 decision tree algorithm. The results of power analysis have also been verified using a decision tree algorithm namely, C4.5.

[1]  A Donner,et al.  A goodness-of-fit approach to inference procedures for the kappa statistic: confidence interval construction, significance-testing and sample size estimation. , 1992, Statistics in medicine.

[2]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[3]  P. D. McFadden,et al.  Early Detection of Gear Failure by Vibration Analysis--I. Calculation of the Time Frequency Distribution , 1993 .

[4]  P Feigl,et al.  A graphical aid for determining sample size when comparing two independent proportions. , 1978, Biometrics.

[5]  K. I. Ramachandran,et al.  Automatic rule learning using decision tree for fuzzy classifier in fault diagnosis of roller bearing , 2007 .

[6]  S. N. Kavuri,et al.  Using fuzzy clustering with ellipsoidal units in neural networks for robust fault classification , 1993 .

[7]  V. Sugumaran,et al.  Comparative study of decision tree classifier and best first tree classifier for fault diagnosis of automobile hydraulic brake system using statistical features , 2013 .

[8]  Fabrice Bolaers,et al.  Early Detection of Gear Failure by Vibration Analysis , 2014 .

[9]  V. Sugumaran,et al.  Minimum sample size determination of vibration signals in machine learning approach to fault diagnosis using power analysis , 2010, Expert Syst. Appl..

[10]  S. Day,et al.  Internal pilot studies for estimating sample size. , 1994, Statistics in medicine.

[11]  S B Bull Sample size and power determination for a binary outcome and an ordinal exposure when logistic regression analysis is planned. , 1993, American journal of epidemiology.

[12]  N A Obuchowski,et al.  Computing Sample Size for Receiver Operating Characteristic Studies , 1994, Investigative radiology.

[13]  N L Geller,et al.  Interim analyses in randomized clinical trials: ramifications and guidelines for practitioners. , 1987, Biometrics.

[14]  J D Lantos Sample size: profound implications of mundane calculations. , 1993, Pediatrics.

[15]  N A Obuchowski,et al.  Sample size tables for receiver operating characteristic studies. , 2000, AJR. American journal of roentgenology.

[16]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[17]  S. Day,et al.  Sample size estimation for comparing two or more treatment groups in clinical trials. , 1991, Statistics in medicine.

[18]  George Stephanopoulos,et al.  Determination of minimum sample size and discriminatory expression patterns in microarray data , 2002, Bioinform..

[19]  N. R. Sakthivel,et al.  A method for calculation of optimum data size and bin size of histogram features in fault diagnosis of mono-block centrifugal pump , 2011, Expert Syst. Appl..

[20]  R. H. Browne On the use of a pilot sample for sample size determination. , 1995, Statistics in medicine.

[21]  Edgar Erdfelder,et al.  G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences , 2007, Behavior research methods.

[22]  S Greenland,et al.  On sample-size and power calculations for studies using confidence intervals. , 1988, American journal of epidemiology.

[23]  S L Beal,et al.  Sample size determination for confidence intervals on the population mean and on the difference between two population means. , 1989, Biometrics.

[24]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[25]  V. Sugumaran,et al.  Fault diagnostics of roller bearing using kernel based neighborhood score multi-class support vector machine , 2008, Expert Syst. Appl..

[26]  J. R. Quilan Decision trees and multi-valued attributes , 1988 .

[27]  Venkat Venkatasubramanian,et al.  Representing and diagnosing dynamic process data using neural networks , 1992 .