Wavelet feature extraction for high-dimensional microarray data

It is well known that the problem arising from high dimensionality of data should be considered in pattern recognition. Original microarray data are usually of high dimensionality, whereas, only limited training samples are available. Therefore, dimensionality reduction is an important strategy to greatly improve the classification performance of microarray data. A novel method of feature extraction and dimensionality reduction for high-dimensional microarray data is proposed in this study. A set of orthogonal wavelet detail coefficients based on wavelet decomposition at different levels is extracted to characterize the localized features of microarray data. Experiments are carried out on four datasets. A highly competitive accuracy is achieved in comparison with the performance of other models.

[1]  Guy Perrière,et al.  Between-group analysis of microarray data , 2002, Bioinform..

[2]  Constantin F. Aliferis,et al.  A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis , 2004, Bioinform..

[3]  Ilya Levner,et al.  Feature selection and nearest centroid classification for protein mass spectrometry , 2005, BMC Bioinformatics.

[4]  A. Grossmann,et al.  DECOMPOSITION OF HARDY FUNCTIONS INTO SQUARE INTEGRABLE WAVELETS OF CONSTANT SHAPE , 1984 .

[5]  Li Shang,et al.  Feature selection in independent component subspace for microarray data classification , 2006, Neurocomputing.

[6]  Aik Choon Tan,et al.  Ensemble machine learning on gene expression data for cancer classification. , 2003, Applied bioinformatics.

[7]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Marcel Dettling,et al.  BagBoosting for tumor classification with gene expression data , 2004, Bioinform..

[9]  E. Lander,et al.  Gene expression correlates of clinical prostate cancer behavior. , 2002, Cancer cell.

[10]  R.S.H. Istepanian,et al.  Application of wavelet modulus maxima in microarray spots recognition , 2003, IEEE Transactions on NanoBioscience.

[11]  Carlo Di Bello,et al.  PCA disjoint models for multiclass cancer analysis using gene expression data , 2003, Bioinform..

[12]  Robert R. Klevecz,et al.  Dynamic architecture of the yeast cell cycle uncovered by wavelet decomposition of expression microarray data , 2000, Functional & Integrative Genomics.

[13]  J. Downing,et al.  Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling. , 2002, Cancer cell.

[14]  Huiqing Liu,et al.  Discovery of significant rules for classifying cancer diagnosis data , 2003, ECCB.

[15]  Danh V. Nguyen,et al.  Tumor classification by partial least squares using microarray gene expression data , 2002, Bioinform..

[16]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[17]  Pierre Baldi,et al.  A Bayesian framework for the analysis of microarray expression data: regularized t -test and statistical inferences of gene changes , 2001, Bioinform..

[18]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[19]  Marina Vannucci,et al.  Gene selection: a Bayesian variable selection approach , 2003, Bioinform..

[20]  Dimitrios Vogiatzis,et al.  Active Learning with Wavelets for Microarray Data , 2005, WILF.

[21]  De-Shuang Huang,et al.  Independent component analysis-based penalized discriminant method for tumor classification using gene expression data , 2006, Bioinform..

[22]  E. Boerwinkle,et al.  Computational methods for gene expression-based tumor classification. , 2000, BioTechniques.

[23]  Ju Wang,et al.  Normalization of cDNA microarray data using wavelet regressions. , 2004, Combinatorial chemistry & high throughput screening.

[24]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.