Block-level discrete cosine transform coefficients for autonomic face recognition

This dissertation presents a novel method of autonomic face recognition based on the recently proposed biologically plausible network of networks (NoN) model of information processing. The NoN model is based on locally parallel and globally coordinated transformations. In the NoN architecture, the neurons or computational units form distributed networks, which themselves link to form larger networks. In the general case, an n-level hierarchy of nested distributed networks is constructed. This models the structures in the cerebral cortex described by Mountcastle and the architecture based on that proposed for information processing by Sutton. In the implementation proposed in the dissertation, the image is processed by a nested family of locally operating networks along with a hierarchically superior network that classifies the information from each of the local networks. The implementation of this approach helps obtain sensitivity to the contrast sensitivity function (CSF) in the middle of the spectrum, as is true for the human vision system. The input images are divided into N x N blocks to define the local regions of processing. The N x N two-dimensional Discrete Cosine Transform (DCT), a spatial frequency transform, is used to transform the data into the frequency domain. Thereafter, statistical operators that calculate various functions of spatial frequency in the block are used to produce a block-level DCT coefficient. The image is now transformed into a variable length vector that is trained with respect to the data set. The classification was done by the use of a backpropagation neural network. The proposed method yields excellent results on a benchmark database. The results of the experiments yielded a maximum of 98.5% recognition accuracy and an average of 97.4% recognition accuracy. An advanced version of the method where the local processing is done on offset blocks has also been developed. This has validated the NoN approach and further research using local processing as well as more advanced global operators is likely to yield even better results.

[1]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[2]  C Blakemore,et al.  On the existence of neurones in the human visual system selectively sensitive to the orientation and size of retinal images , 1969, The Journal of physiology.

[3]  Kari Karhunen,et al.  Über lineare Methoden in der Wahrscheinlichkeitsrechnung , 1947 .

[4]  A. Ginsburg Psychological Correlates of a Model of the Human Visual System , 1971 .

[5]  A. G. Goldstein Race-related variation of facial features: Anthropometric data I , 1979 .

[6]  D. Broadbent,et al.  Some experiments bearing on the hypothesis that the visual system analyses spatial patterns in independent bands of spatial frequency , 1975, Vision Research.

[7]  Keinosuke Fukunaga,et al.  Statistical Pattern Recognition , 1993, Handbook of Pattern Recognition and Computer Vision.

[8]  A. Ardeshir Goshtasby,et al.  Description and Discrimination of Planar Shapes Using Shape Matrices , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[10]  Charilaos A. Christopoulos,et al.  Efficient computation of the two-dimensional fast cosine transform , 1994, Defense, Security, and Sensing.

[11]  Mark J. T. Smith,et al.  Enhancement of block transform coded images using residual spectra adaptive postfiltering , 1994, Proceedings of IEEE Data Compression Conference (DCC'94).

[12]  Jacques Gautrais,et al.  SpikeNET: A simulator for modeling large networks of integrate and fire neurons , 1999, Neurocomputing.

[13]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[14]  John G. Apostolopoulos,et al.  Postprocessing for very low bit-rate video compression , 1999, IEEE Trans. Image Process..

[15]  Michael David Kelly,et al.  Visual identification of people by computer , 1970 .

[16]  Robert J. Baron,et al.  Mechanisms of Human Facial Recognition , 1981, Int. J. Man Mach. Stud..

[17]  J. Robson,et al.  Application of fourier analysis to the visibility of gratings , 1968, The Journal of physiology.

[18]  Y. Kaya,et al.  A BASIC STUDY ON HUMAN FACE RECOGNITION , 1972 .

[19]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[20]  Farokh Marvasti,et al.  Application of Nonuniform Sampling to Error Concealment , 2001 .

[21]  Theodosios Pavlidis,et al.  Structural pattern recognition , 1977 .

[22]  Jing-Yu Yang,et al.  Face recognition based on the uncorrelated discriminant transformation , 2001, Pattern Recognit..

[23]  Monson H. Hayes,et al.  Hidden Markov models for face recognition , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[24]  L. D. Harmon The recognition of faces. , 1973, Scientific American.

[25]  Ian Craw,et al.  Finding Face Features , 1992, ECCV.

[26]  Zixiang Xiong,et al.  A simple deblocking algorithm for JPEG compressed images using overcomplete wavelet representations , 1997, Proceedings of 1997 IEEE International Symposium on Circuits and Systems. Circuits and Systems in the Information Age ISCAS '97.

[27]  Ferdinando Silvestro Samaria,et al.  Face recognition using Hidden Markov Models , 1995 .

[28]  Rod Adams,et al.  IMAGE RECOGNITION USING DISCRETE COSINE TRANSFORMS AS DIMENSIONALITY REDUCTION , 2001 .

[29]  Gutfreund Neural networks with hierarchically correlated patterns. , 1988, Physical review. A, General physics.

[30]  J. W. Shepherd,et al.  An Interactive Computer System for Retrieving Faces , 1986 .

[31]  Te-Won Lee,et al.  Independent Component Analysis , 1998, Springer US.

[32]  Allen M. Waxman,et al.  Recognizing faces from their parts , 1992, Other Conferences.

[33]  Bernd Girod,et al.  Classification of compound images based on transform coefficient likelihood , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[34]  Thomas Fromherz,et al.  A Survey of Face Recognition , 1997 .

[35]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[36]  Hua Yu,et al.  A direct LDA algorithm for high-dimensional data - with application to face recognition , 2001, Pattern Recognit..

[37]  Venu Govindaraju,et al.  A computational model for face location , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[38]  Subhash C. Kak,et al.  A class of instantaneously trained neural networks , 2002, Inf. Sci..

[39]  Ling Guan,et al.  A network of networks processing model for image regularization , 1997, IEEE Trans. Neural Networks.

[40]  Saad Ahmed Sirohey,et al.  Human Face Segmentation and Identification , 1998 .

[41]  A. Young,et al.  The human face , 1982 .

[42]  J. Beis,et al.  Hierarchical model of memory and memory loss , 1988 .

[43]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  Rama Chellappa,et al.  A feature based approach to face recognition , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[45]  B. Hunt,et al.  The discreteW transform , 1985 .

[46]  Erik Hjelmås Feature-Based Face Recognition , 2000 .

[47]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1992 .

[48]  James A. Anderson,et al.  Radar signal categorization using a neural network , 1990, Proc. IEEE.

[49]  Arun N. Netravali,et al.  Digital Pictures: Representation and Compression , 1988 .

[50]  F. Girosi,et al.  Networks for approximation and learning , 1990, Proc. IEEE.

[51]  Its'hak Dinstein,et al.  Variable block-size transform image coder , 1990, IEEE Trans. Commun..

[52]  Meng Joo Er,et al.  High-speed face recognition based on discrete cosine transform and RBF neural networks , 2005, IEEE Transactions on Neural Networks.

[53]  Juha Karhunen,et al.  Generalizations of principal component analysis, optimization problems, and neural networks , 1995, Neural Networks.

[54]  Aapo Hyvärinen,et al.  Survey on Independent Component Analysis , 1999 .

[55]  Joseph W. Carl,et al.  The Application of Filtered Transforms to the General Classification Problem , 1972, IEEE Transactions on Computers.

[56]  H. D. Ellis,et al.  Introduction to Aspects of Face Processing: Ten Questions in Need of Answers , 1986 .

[57]  Makoto Nagao,et al.  Line extraction and pattern detection in a photograph , 1969, Pattern Recognit..

[58]  Alex Pentland,et al.  View-based and modular eigenspaces for face recognition , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[59]  C. Blakemore,et al.  Lateral Inhibition between Orientation Detectors in the Human Visual System , 1970, Nature.

[60]  Mark W. Cannon,et al.  Contrast sensation: A linear function of stimulus contrast , 1979, Vision Research.

[61]  J. Friedman Exploratory Projection Pursuit , 1987 .

[62]  Jiebo Luo,et al.  Applications of Gibbs random field in image processing: from segmentation to enhancement , 1994, Other Conferences.

[63]  Corinna Cortes,et al.  Hierarchical associative networks , 1987 .

[64]  Arcot Sowmya,et al.  Neural network approach to component versus holistic recognition of facial expressions in images , 1992, Other Conferences.

[65]  H. Abdi,et al.  Principal Component and Neural Network Analyses of Face Images: What Can Be Generalized in Gender Classification? , 1997, Journal of mathematical psychology.

[66]  N. Graham,et al.  Detection of grating patterns containing two spatial frequencies: a comparison of single-channel and multiple-channels models. , 1971, Vision research.

[67]  Jie Yang,et al.  An Efficient LDA Algorithm for Face Recognition , 2000 .

[68]  R. V. Prasad,et al.  Techniques and Standards for Image, Video and Audio Coding , 1998 .

[69]  V. Mountcastle,et al.  An organizing principle for cerebral function : the unit module and the distributed system , 1978 .

[70]  J. Daugman Spatial visual channels in the fourier plane , 1984, Vision Research.

[71]  Andrew B. Watson,et al.  Image Compression Using the Discrete Cosine Transform , 1994 .

[72]  K. Kim,et al.  Face recognition using kernel principal component analysis , 2002, IEEE Signal Process. Lett..

[73]  Erkki Oja,et al.  Independent Component Analysis , 2001 .

[74]  A. Grossmann,et al.  DECOMPOSITION OF HARDY FUNCTIONS INTO SQUARE INTEGRABLE WAVELETS OF CONSTANT SHAPE , 1984 .

[75]  Alistair G. Rust,et al.  Image redundancy reduction for neural network classification using discrete cosine transforms , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[76]  Anil K. Jain,et al.  Unsupervised texture segmentation using Gabor filters , 1990, 1990 IEEE International Conference on Systems, Man, and Cybernetics Conference Proceedings.

[77]  Ramesh A. Gopinath,et al.  Wavelet-based post-processing of low bit rate transform coded images , 1994, Proceedings of 1st International Conference on Image Processing.

[78]  Keinosuke Fukunaga,et al.  A Branch and Bound Algorithm for Feature Subset Selection , 1977, IEEE Transactions on Computers.

[79]  Subhash Kak,et al.  Fast Classification Networks For Signal Processing , 2002 .

[80]  Alex Pentland,et al.  Face recognition using eigenfaces , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[81]  Stanislas Dehaene,et al.  Networks of Formal Neurons and Memory Palimpsests , 1986 .

[82]  Pierre Comon,et al.  Independent component analysis, A new concept? , 1994, Signal Process..

[83]  Ling Guan Image restoration by a neural network with hierarchical cluster architecture , 1994, J. Electronic Imaging.

[84]  Osamu Nakamura,et al.  Identification of human faces based on isodensity maps , 1991, Pattern Recognit..

[85]  László Györfi,et al.  A Probabilistic Theory of Pattern Recognition , 1996, Stochastic Modelling and Applied Probability.

[86]  T. J. Stonham,et al.  Practical Face Recognition and Verification with Wisard , 1986 .

[87]  Takeo Kanade,et al.  Computer recognition of human faces , 1980 .

[88]  J. Szentágothai The Ferrier Lecture, 1977 The neuron network of the cerebral cortex: a functional interpretation , 1978, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[89]  F. Marvasti Nonuniform sampling : theory and practice , 2001 .

[90]  S. Carey,et al.  Development of face recognition: A maturational component? , 1980 .

[91]  Juha Karhunen,et al.  Principal component neural networks — Theory and applications , 1998, Pattern Analysis and Applications.

[92]  R. Weale Vision. A Computational Investigation Into the Human Representation and Processing of Visual Information. David Marr , 1983 .

[93]  Didier J. Le Gall,et al.  The MPEG video compression algorithm , 1992, Signal Process. Image Commun..

[94]  G. F. Cooper,et al.  The spatial selectivity of the visual cells of the cat , 1969, The Journal of physiology.

[95]  Nikolas P. Galatsanos,et al.  Projection-based spatially adaptive reconstruction of block-transform compressed images , 1995, IEEE Trans. Image Process..

[96]  Yasuhito Suenaga,et al.  Robust face identification scheme: KL expansion of an invariant feature space , 1992, Other Conferences.

[97]  Guodong Guo,et al.  Face recognition by support vector machines , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[98]  Jean-Francois Cardoso,et al.  Blind signal separation: statistical principles , 1998, Proc. IEEE.

[99]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .

[100]  King-Sun Fu,et al.  Syntactic Pattern Recognition And Applications , 1968 .

[101]  David Beymer,et al.  Face recognition under varying pose , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[102]  Kannan Ramchandran,et al.  A simple algorithm for removing blocking artifacts in block-transform coded images , 1998, IEEE Signal Processing Letters.

[103]  Thomas S. Huang,et al.  Human face detection in a scene , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[104]  John J. Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities , 1999 .

[105]  A. Aertsen,et al.  Dynamics of neuronal interactions in monkey cortex in relation to behavioural events , 1995, Nature.

[106]  Andrew R. Webb,et al.  Statistical Pattern Recognition , 1999 .

[107]  N. Ahmed,et al.  Discrete Cosine Transform , 1996 .

[108]  William K. Pratt,et al.  Scene Adaptive Coder , 1984, IEEE Trans. Commun..

[109]  Anil K. Jain,et al.  Fingerprint classification and matching using a filterbank , 2001 .

[110]  J. Sergent Microgenesis of Face Perception , 1986 .

[111]  Joachim M. Buhmann,et al.  Size and distortion invariant object recognition by hierarchical graph matching , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[112]  Brunelli Poggio,et al.  HyberBF Networks for Gender Classification , 1992 .

[113]  Arnaud Delorme,et al.  Face identification using one spike per neuron: resistance to image degradations , 2001, Neural Networks.

[114]  J. Movshon,et al.  Spatial and temporal contrast sensitivity of neurones in areas 17 and 18 of the cat's visual cortex. , 1978, The Journal of physiology.

[115]  E. M. Wright,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[116]  Dennis Gabor,et al.  Theory of communication , 1946 .

[117]  Jin Zhong AN OPTIMAL SET OF UNCORRELATED DISCRIMINANT FEATURES , 1999 .

[118]  Ian Craw,et al.  Automatic extraction of face-features , 1987, Pattern Recognit. Lett..

[119]  Craig Partridge,et al.  Gigabit networking , 1993, Addison-Wesley professional computing series.

[120]  C. Blakemore,et al.  Size Adaptation: A New Aftereffect , 1969, Science.

[121]  Terrence J. Sejnowski,et al.  An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.

[122]  Terrence J. Sejnowski,et al.  SEXNET: A Neural Network Identifies Sex From Human Faces , 1990, NIPS.

[123]  Arthur P Ginsburg,et al.  Visual Information Processing Based on Spatial Filters Constrained by Biological Data. , 1978 .

[124]  Joachim M. Buhmann,et al.  Distortion Invariant Object Recognition in the Dynamic Link Architecture , 1993, IEEE Trans. Computers.

[125]  Ke Liu,et al.  A robust algebraic method for human face recognition , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[126]  L UHR,et al.  Pattern recognition over distortions, by human subjects and by a computer simulation of a model for human form perception. , 1962, Journal of experimental psychology.

[127]  M. Banks,et al.  The development of basic mechanisms of pattern vision: spatial frequency channels. , 1985, Journal of experimental child psychology.

[128]  Janette Atkinson,et al.  Channels in Vision: Basic Aspects , 1978 .

[129]  Ah Chung Tsoi,et al.  Face Recognition: A Hybrid Neural Network Approach , 1998 .

[130]  James A. Anderson,et al.  Computational and Neurobiological Features of a Network of Networks , 1995 .

[131]  Demetri Psaltis,et al.  Optical Neural Computers , 1987, Topical Meeting on Optical Computing.

[132]  P. O. Bishop,et al.  Spatial vision. , 1971, Annual review of psychology.

[133]  R. Haber,et al.  Visual Perception , 2018, Encyclopedia of Database Systems.

[134]  M S Banks,et al.  Infant pattern vision: a new approach based on the contrast sensitivity function. , 1981, Journal of experimental child psychology.

[135]  Farokh Marvasti,et al.  Subimage error concealment techniques , 1998, ISCAS '98. Proceedings of the 1998 IEEE International Symposium on Circuits and Systems (Cat. No.98CH36187).

[136]  Tomaso A. Poggio,et al.  Face recognition with support vector machines: global versus component-based approach , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[137]  Josef Kittler,et al.  Pattern recognition : a statistical approach , 1982 .

[138]  P. Yip,et al.  Discrete Cosine Transform: Algorithms, Advantages, Applications , 1990 .

[139]  Jerry D. Gibson,et al.  Distributions of the Two-Dimensional DCT Coefficients for Images , 1983, IEEE Trans. Commun..

[140]  Gilbert Strang,et al.  Wavelets and Dilation Equations: A Brief Introduction , 1989, SIAM Rev..

[141]  Cheng-Tie Chen Transform coding of digital images using variable block-size DCT with adaptive thresholding and quantization , 1990, Optics & Photonics.

[142]  Steve De Backer,et al.  Unsupervised Pattern Recognition - Dimensionality Reduction and Classification , 2002 .

[143]  Jiebo Luo,et al.  On the application of Gibbs random field in image processing: from segmentation to enhancement , 1995, J. Electronic Imaging.

[144]  A. W. Ellis Normality and pathology in cognitive functions , 1982 .

[145]  Sun-Yuan Kung,et al.  Face recognition/detection by probabilistic decision-based neural network , 1997, IEEE Trans. Neural Networks.

[146]  D. G. Albrecht,et al.  Visual cortical neurons: are bars or gratings the optimal stimuli? , 1980, Science.

[147]  Alan Bundy,et al.  Spatial Frequency Channels , 1984 .

[148]  M Kabrisky,et al.  A theory of pattern perception based on human physiology. , 1970, Ergonomics.

[149]  Thomas S. Huang,et al.  Image processing , 1971 .

[150]  Steve J. Young,et al.  HMM-based architecture for face identification , 1994, Image Vis. Comput..

[151]  Pierre Comon Independent component analysis - a new concept? signal processing , 1994 .

[152]  Ke Liu,et al.  Human face recognition method based on the statistical model of small sample size , 1992, Other Conferences.

[153]  Ah Chung Tsoi,et al.  Face recognition: a convolutional neural-network approach , 1997, IEEE Trans. Neural Networks.

[154]  Nikil Jayant,et al.  Adaptive postprocessing algorithms for low bit rate video signals , 1995, IEEE Trans. Image Process..

[155]  Ulf Grenander,et al.  Hands: A Pattern Theoretic Study of Biological Shapes , 1990 .

[156]  A. G. Goldstein Facial feature variation: Anthropometric data II , 1979 .