Feature Generation II

This chapter examines feature generation with a focus on image and audio classification. Feature generation is a procedure that computes new variables that in one way or another originate from the stored values of the image array I ( m , n ). Some of the feature generation techniques can be considered common and can be applicable in both visual and audio modalities. A large number of features are the result of different approaches to exploit the specific nature of the signals and encode the required classification information in a more efficient way. This chapter also focuses on first- and second-order statistics features as well as the run-length method. The chapter discusses typical features used to characterize and classify audio information. The chapter presents a description of statistical properties of signals and images and the ways these can be exploited to extract information-rich features for classification. The chapter examines whether the notion of self-similarity is extendable to stochastic processes and, if it is, how useful it can be. The chain code for shape description is discussed in this chapter. Computer exercises are then offered to generate these features and use them for classification for some case studies.

[1]  R. Chellappa Two-Dimensional Discrete Gaussian Markov Random Field Models for Image Processing , 1989 .

[2]  William E. Higgins,et al.  Efficient Gabor filter design for texture segmentation , 1996, Pattern Recognit..

[3]  J. Robson,et al.  Application of fourier analysis to the visibility of gratings , 1968, The Journal of physiology.

[4]  Judith C. Brown,et al.  Musical frequency tracking using the methods of conventional and , 1991 .

[5]  Gregory H. Wakefield,et al.  Audio thumbnailing of popular music using chroma-based representations , 2005, IEEE Transactions on Multimedia.

[6]  J. Woods Markov image modeling , 1976 .

[7]  Hideyuki Tamura,et al.  Textural Features Corresponding to Visual Perception , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[8]  Matti Karjalainen,et al.  A computationally efficient multipitch analysis model , 2000, IEEE Trans. Speech Audio Process..

[9]  JEFFREY WOOD,et al.  Invariant pattern recognition: A review , 1996, Pattern Recognit..

[10]  H. B. Barlow,et al.  Unsupervised Learning , 1989, Neural Computation.

[11]  Harold H. Szu,et al.  Neural network adaptive wavelets for signal representation and classification , 1992 .

[12]  Richard C. Dubes,et al.  Performance evaluation for four classes of textural features , 1992, Pattern Recognit..

[13]  D. Esteban,et al.  Application of quadrature mirror filters to split band voice coding schemes , 1977 .

[14]  J. Daugman Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[15]  Oh-Wook Kwon,et al.  Phoneme recognition using ICA-based feature extraction and transformation , 2004, Signal Process..

[16]  Mary M. Galloway,et al.  Texture analysis using gray level run lengths , 1974 .

[17]  Manos Papadakis,et al.  Character recognition using a biorthogonal discrete wavelet transform , 1996, Optics & Photonics.

[18]  B. Mandelbrot,et al.  Fractional Brownian Motions, Fractional Noises and Applications , 1968 .

[19]  Morten Daehlen,et al.  Recognition of handwritten symbols , 1990, Pattern Recognit..

[20]  George-Othon Glentis,et al.  Efficient algorithm for two-dimensional finite impulse response (FIR) filtering and system identification , 1994, Other Conferences.

[21]  Danny Coomans,et al.  Classification Using Adaptive Wavelets for Feature Extraction , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Anil K. Jain,et al.  Unsupervised texture segmentation using Gabor filters , 1990, 1990 IEEE International Conference on Systems, Man, and Cybernetics Conference Proceedings.

[23]  Anil K. Jain,et al.  Feature extraction methods for character recognition-A survey , 1996, Pattern Recognit..

[24]  Whoi-Yul Kim,et al.  A novel approach to the fast computation of Zernike moments , 2006, Pattern Recognit..

[25]  Wesley E. Snyder,et al.  Application of Affine-Invariant Fourier Descriptors to Recognition of 3-D Objects , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Christian Jutten,et al.  Blind separation of sources, part I: An adaptive algorithm based on neuromimetic architecture , 1991, Signal Process..

[27]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[28]  Chee-Way Chong,et al.  Translation invariants of Zernike moments , 2003, Pattern Recognit..

[29]  Samir Al-Emami,et al.  On-Line Recognition of Handwritten Arabic Characters , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Michèle Basseville,et al.  Modeling and estimation of multiresolution stochastic processes , 1992, IEEE Trans. Inf. Theory.

[31]  Bedrich J. Hosticka,et al.  A comparison of texture feature extraction using adaptive gabor filtering, pyramidal and tree structured wavelet transforms , 1996, Pattern Recognit..

[32]  S. Kay,et al.  Fractional Brownian Motion: A Maximum Likelihood Estimator and Its Application to Image Texture , 1986, IEEE Transactions on Medical Imaging.

[33]  Sergios Theodoridis,et al.  Classification of musical patterns using variable duration hidden Markov models , 2004, IEEE Transactions on Audio, Speech, and Language Processing.

[34]  Masataka Goto,et al.  A real-time music-scene-description system: predominant-F0 estimation for detecting melody and bass lines in real-world audio signals , 2004, Speech Commun..

[35]  Larry S. Davis,et al.  Texture Analysis Using Generalized Co-Occurrence Matrices , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Douglas Keislar,et al.  Content-Based Classification, Search, and Retrieval of Audio , 1996, IEEE Multim..

[37]  Constantin Papaodysseus,et al.  On the automated recognition of seriously distorted musical recordings , 2001, IEEE Trans. Signal Process..

[38]  Aleksandra Mojsilovic,et al.  On the Selection of an Optimal Wavelet Basis for Texture Characterization , 1998, ICIP.

[39]  Frank Kurth,et al.  A unified approach to content-based and fault-tolerant music recognition , 2004, IEEE Transactions on Multimedia.

[40]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[41]  Gustavo Deco,et al.  Linear redundancy reduction learning , 1995, Neural Networks.

[42]  Wilson S. Geisler,et al.  Multichannel Texture Analysis Using Localized Spatial Filters , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[43]  Chi-Man Pun,et al.  Log-Polar Wavelet Energy Signatures for Rotation and Scale Invariant Texture Classification , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[44]  Alexander G. Mamistvalov n-Dimensional Moment Invariants and Conceptual Mathematical Theory of Recognition n-Dimensional Solids , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[45]  Lijuan Cao,et al.  A comparison of PCA, KPCA and ICA for dimensionality reduction in support vector machine , 2003, Neurocomputing.

[46]  B. Mandelbrot Fractal Geometry of Nature , 1984 .

[47]  M. Fox,et al.  Fractal feature analysis and classification in medical imaging. , 1989, IEEE transactions on medical imaging.

[48]  Sergios Theodoridis,et al.  Optical character recognition of the Orthodox Hellenic Byzantine Music notation , 2002, Pattern Recognit..

[49]  Ching Y. Suen,et al.  Automatic recognition of characters by Fourier descriptors and boundary line encodings , 1981, Pattern Recognit..

[50]  Pierre Comon,et al.  Independent component analysis, A new concept? , 1994, Signal Process..

[51]  Patrick Flandrin,et al.  Wavelet analysis and synthesis of fractional Brownian motion , 1992, IEEE Trans. Inf. Theory.

[52]  Alessandro Vinciarelli,et al.  A survey on off-line Cursive Word Recognition , 2002, Pattern Recognit..

[53]  Qian Huang,et al.  Can the fractal dimension of images be measured? , 1994, Pattern Recognit..

[54]  Jan Flusser,et al.  Affine moment invariants: a new tool for character recognition , 1994, Pattern Recognit. Lett..

[55]  Anil K. Jain,et al.  Texture classification and segmentation using multiresolution simultaneous autoregressive models , 1992, Pattern Recognit..

[56]  Anil K. Jain,et al.  Markov Random Field Texture Models , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[57]  Jan Flusser,et al.  Pattern recognition by affine moment invariants , 1993, Pattern Recognit..

[58]  Ronald W. Schafer,et al.  Digital Processing of Speech Signals , 1978 .

[59]  David J. Field,et al.  What Is the Goal of Sensory Coding? , 1994, Neural Computation.

[60]  Anjan Sarkar,et al.  A new approach for subset 2-D AR model identification for describing textures , 1997, IEEE Trans. Image Process..

[61]  Jian Fan,et al.  Texture Classification by Wavelet Packet Signatures , 1993, MVA.

[62]  Alan I. Penn,et al.  Estimating fractal dimension with fractal interpolation function models , 1997, IEEE Transactions on Medical Imaging.

[63]  Ching Y. Suen,et al.  Historical review of OCR research and development , 1992, Proc. IEEE.

[64]  Mandyam D. Srinath,et al.  Invariant character recognition with Zernike and orthogonal Fourier-Mellin moments , 2002, Pattern Recognit..

[65]  N. Kalouptsidis,et al.  Spectral analysis , 1993 .

[66]  J. L. Véhel,et al.  The Generalized Multifractional Brownian Motion , 2000 .

[67]  C.-C. Jay Kuo,et al.  Texture Roughness Analysis and Synthesis via Extended Self-Similar (ESS) Model , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[68]  Peter N. Heller,et al.  Theory of regular M-band wavelet bases , 1993, IEEE Trans. Signal Process..

[69]  Ronald R. Coifman,et al.  Wavelet analysis and signal processing , 1990 .

[70]  Sergios Theodoridis,et al.  Concurrent algorithms for a class of 1-D and 2-D Wiener FIR filters with symmetrical impulse response , 1989, IEEE Trans. Acoust. Speech Signal Process..

[71]  Ching Y. Suen,et al.  Hierarchical attributed graph representation and recognition of handwritten chinese characters , 1991, Pattern Recognit..

[72]  B. S. Manjunath,et al.  Rotation-invariant texture classification using modified Gabor filters , 1995, Proceedings., International Conference on Image Processing.

[73]  Sugata Ghosal,et al.  A moment-based unified approach to image feature detection , 1997, IEEE Trans. Image Process..

[74]  Chun-Shin Lin,et al.  New forms of shape invariants from elliptic fourier descriptors , 1987, Pattern Recognit..

[75]  Robin Sibson,et al.  What is projection pursuit , 1987 .

[76]  Thomas R. Crimmins A Complete Set of Fourier Descriptors for Two-Dimensional Shapes , 1982, IEEE Transactions on Systems, Man, and Cybernetics.

[77]  John W. Woods,et al.  Two-dimensional discrete Markovian fields , 1972, IEEE Trans. Inf. Theory.

[78]  M. R. Turner,et al.  Texture discrimination by Gabor functions , 1986, Biological Cybernetics.

[79]  Alireza Khotanzad,et al.  Classification of invariant image representations using a neural network , 1990, IEEE Trans. Acoust. Speech Signal Process..

[80]  Lance M. Kaplan Extended fractal analysis for texture classification and segmentation , 1999, IEEE Trans. Image Process..

[81]  C.-C. Jay Kuo,et al.  Texture analysis and classification with tree-structured wavelet transform , 1993, IEEE Trans. Image Process..

[82]  Alireza Khotanzad,et al.  Invariant Image Recognition by Zernike Moments , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[83]  Gil-Jin Jang,et al.  Feature vector transformation using independent component analysis and its application to speaker identification , 1999, EUROSPEECH.

[84]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[85]  Glenn Healey,et al.  Using Zernike moments for the illumination and geometry invariant classification of multispectral texture , 1998, IEEE Trans. Image Process..

[86]  Rama Chellappa,et al.  Texture classification using features derived from random field models , 1982, Pattern Recognit. Lett..

[87]  Jian Fan,et al.  Frame representations for texture segmentation , 1996, IEEE Trans. Image Process..

[88]  John G. Proakis,et al.  Digital Signal Processing: Principles, Algorithms, and Applications , 1992 .

[89]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[90]  Rangasami L. Kashyap,et al.  A Model-Based Method for Rotation Invariant Texture Classification , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[91]  J. Nachmias,et al.  Discrimination of simple and complex gratings , 1975, Vision Research.

[92]  Martin Vetterli,et al.  Wavelets and filter banks: theory and design , 1992, IEEE Trans. Signal Process..

[93]  Trygve Randen,et al.  Filtering for Texture Classification: A Comparative Study , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[94]  C.-C. Jay Kuo,et al.  Audio content analysis for online audiovisual data segmentation and classification , 2001, IEEE Trans. Speech Audio Process..

[95]  C.-C. Jay Kuo,et al.  Extending self-similarity for fractional Brownian motion , 1994, IEEE Trans. Signal Process..

[96]  J. L. Véhel,et al.  Stochastic fractal models for image processing , 2002, IEEE Signal Process. Mag..

[97]  M. Teague Image analysis via the general theory of moments , 1980 .

[98]  Mohamed A. Deriche,et al.  Signal modeling with filtered discrete fractional noise processes , 1993, IEEE Trans. Signal Process..

[99]  W. Ditto,et al.  Chaos: From Theory to Applications , 1992 .

[100]  Stéphane Mallat,et al.  Multifrequency channel decompositions of images and wavelet models , 1989, IEEE Trans. Acoust. Speech Signal Process..

[101]  Athanasios Papoulis,et al.  Probability, Random Variables and Stochastic Processes , 1965 .

[102]  B. S. Manjunath,et al.  Rotation-invariant texture classification using a complete space-frequency model , 1999, IEEE Trans. Image Process..

[103]  Francesco Camastra,et al.  Data dimensionality estimation methods: a survey , 2003, Pattern Recognit..

[104]  Anssi Klapuri,et al.  Multiple fundamental frequency estimation based on harmonicity and spectral smoothness , 2003, IEEE Trans. Speech Audio Process..

[105]  Dennis Gabor,et al.  Theory of communication , 1946 .

[106]  Alex Pentland,et al.  Fractal-Based Description of Natural Scenes , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[107]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[108]  Juha Karhunen,et al.  Representation and separation of signals using nonlinear PCA type learning , 1994, Neural Networks.

[109]  Jr. W.B. Richardson Applying wavelets to mammograms , 1995 .

[110]  Paul W. Fieguth,et al.  Fractal estimation using models on multiscale trees , 1996, IEEE Trans. Signal Process..

[111]  M. Unser Local linear transforms for texture measurements , 1986 .

[112]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[113]  K. R. Ramakrishnan,et al.  Fast computation of Legendre and Zernike moments , 1995, Pattern Recognit..

[114]  George-Othon Glentis,et al.  An Efficient Algorithm for Two-Dimensional FIR filtering and System Identification , 1994 .

[115]  Sergios Theodoridis,et al.  Recognition of isolated musical patterns using Context Dependent Dynamic Time Warping , 2002, 2002 11th European Signal Processing Conference.

[116]  King-Sun Fu,et al.  Shape Discrimination Using Fourier Descriptors , 1977, IEEE Trans. Syst. Man Cybern..

[117]  Chandan Singh Improved quality of reconstructed images using floating point arithmetic for moment calculation , 2006, Pattern Recognit..

[118]  F. Campbell,et al.  Orientational selectivity of the human visual system , 1966, The Journal of physiology.

[119]  M. I. Heywood,et al.  Fractional central moment method for movement-invariant object classification , 1995 .

[120]  J. Geweke,et al.  THE ESTIMATION AND APPLICATION OF LONG MEMORY TIME SERIES MODELS , 1983 .

[121]  Andrew F. Laine,et al.  Wavelet descriptors for multiresolution recognition of handprinted characters , 1995, Pattern Recognit..

[122]  Thomas H. Reiss,et al.  The revised Fundamental Theorem of Moment Invariants , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[123]  Michael Unser,et al.  Multiresolution Feature Extraction and Selection for Texture Segmentation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[124]  Chee-Way Chong,et al.  Translation and scale invariants of Legendre moments , 2004, Pattern Recognit..

[125]  Gösta H. Granlund,et al.  Fourier Preprocessing for Hand Print Character Recognition , 1972, IEEE Transactions on Computers.

[126]  Miroslaw Pawlak,et al.  On Image Analysis by Moments , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[127]  Raveendran Paramesran,et al.  Efficient computation of radial moment functions using symmetrical property , 2006, Pattern Recognit..

[128]  Joseph Naor,et al.  Multiple Resolution Texture Analysis and Classification , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[129]  M. Narasimha Murty,et al.  Growing subspace pattern recognition methods and their neural-network models , 1997, IEEE Trans. Neural Networks.

[130]  Theodosios Pavlidis,et al.  Direct Gray-Scale Extraction of Features for Character Recognition , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[131]  Charles R. Giardina,et al.  Elliptic Fourier features of a closed contour , 1982, Comput. Graph. Image Process..

[132]  Phil Brodatz,et al.  Textures: A Photographic Album for Artists and Designers , 1966 .

[133]  C.-C. Jay Kuo,et al.  Wavelet descriptor of planar curves: theory and applications , 1996, IEEE Trans. Image Process..

[134]  Zhu Liu,et al.  Multimedia content analysis-using both audio and visual clues , 2000, IEEE Signal Process. Mag..

[135]  R. Mukundan,et al.  Moment Functions in Image Analysis: Theory and Applications , 1998 .

[136]  Joseph Picone,et al.  Signal modeling techniques in speech recognition , 1993, Proc. IEEE.

[137]  Xiaoou Tang,et al.  Texture information in run-length matrices , 1998, IEEE Trans. Image Process..

[138]  Kenneth Falconer,et al.  Fractal Geometry: Mathematical Foundations and Applications , 1990 .

[139]  Petros Maragos,et al.  Measuring the Fractal Dimension of Signals: Morphological Covers and Iterative Optimization , 1993, IEEE Trans. Signal Process..

[140]  Michael Unser,et al.  Texture classification and segmentation using wavelet frames , 1995, IEEE Trans. Image Process..

[141]  Herbert Freeman,et al.  On the Encoding of Arbitrary Geometric Configurations , 1961, IRE Trans. Electron. Comput..

[142]  Ramesh C. Jain,et al.  Determining Motion Parameters for Scenes with Translation and Rotation , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[143]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[144]  Sergios Theodoridis,et al.  A Speech/Music Discriminator of Radio Recordings Based on Dynamic Programming and Bayesian Networks , 2008, IEEE Transactions on Multimedia.

[145]  D. Cavouras,et al.  Image analysis methods for solitary pulmonary nodule characterization by computed tomography. , 1992, European journal of radiology.

[146]  Nicolai Petkov,et al.  Comparison of texture features based on Gabor filters , 2002, IEEE Trans. Image Process..

[147]  Alan C. Bovik,et al.  Analysis of multichannel narrow-band filters for image texture segmentation , 1991, IEEE Trans. Signal Process..

[148]  S. Haykin,et al.  Adaptive Filter Theory , 1986 .

[149]  Guy J. Brown,et al.  A multi-pitch tracking algorithm for noisy speech , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[150]  F. Arduini,et al.  Multifractals and texture classification , 1992 .

[151]  Marian Stewart Bartlett,et al.  Face recognition by independent component analysis , 2002, IEEE Trans. Neural Networks.

[152]  Sabri A. Mahmoud,et al.  Arabic character recognition using fourier descriptors and character contour encoding , 1994, Pattern Recognit..

[153]  A. Oppenheim,et al.  Signal processing with fractals: a wavelet-based approach , 1996 .

[154]  M. Schroeder Period histogram and product spectrum: new methods for fundamental-frequency measurement. , 1968, The Journal of the Acoustical Society of America.

[155]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[156]  Anil K. Jain,et al.  Goal-Directed Evaluation of Binarization Methods , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[157]  Roland T. Chin,et al.  On Image Analysis by the Methods of Moments , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[158]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[159]  Matti Pietikäinen,et al.  A comparative study of texture measures with classification based on featured distributions , 1996, Pattern Recognit..

[160]  James M. Keller,et al.  Characteristics of Natural Scenes Related to the Fractal Dimension , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[161]  Joseph J. Atick Entropy Minimization: a Design Principle for Sensory Perception? , 1992, Int. J. Neural Syst..