Computational Intelligence in Multimedia Processing

Computational intelligence (CI) is a well-established paradigm that incorporates characteristics of biological computers (brains) to perform a variety of tasks that are difficult or impossible to carry out with conventional computers. This paper reviews some of the applications of CI in multimedia processing, including shot detection in video, logotype detection, video copy detection and retrieval, and face coding in video sequences.
