Audio surveillance in unstructured environments

[1]  Francesco Beritelli,et al.  Human identity verification based on Mel frequency analysis of digital heart sounds , 2009, 2009 16th International Conference on Digital Signal Processing.

[2]  Masataka Goto,et al.  RWC Music Database: Music genre database and musical instrument sound database , 2003, ISMIR.

[3]  Andrzej Czyzewski,et al.  Audio-Visual Surveillance System for Application in Bank Operating Room , 2013, MCSS.

[4]  Kuldip K. Paliwal,et al.  Robust speech recognition in noisy environments based on subband spectral centroid histograms , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  Mohammad Bagher Menhaj,et al.  Training feedforward networks with the Marquardt algorithm , 1994, IEEE Trans. Neural Networks.

[6]  Jhing-Fa Wang,et al.  Robust Environmental Sound Recognition for Home Automation , 2008, IEEE Transactions on Automation Science and Engineering.

[7]  S. Suresh Kumar,et al.  Color based Urban and Agricultural Land classification by GLCM Texture Features , 2012 .

[8]  Shrikanth Narayanan,et al.  Environmental Sound Recognition With Time–Frequency Audio Features , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[10]  Vesa T. Peltonen,et al.  Audio-based context recognition , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[11]  György Fazekas,et al.  Automatic Ontology Generation for Musical Instruments Based on Audio Analysis , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Jérôme Louradour,et al.  Audio Events Detection in Public Transport Vehicle , 2006, 2006 IEEE Intelligent Transportation Systems Conference.

[13]  Yangsheng Xu,et al.  A surveillance robot with human recognition based on video and audio , 2010, 2010 IEEE International Conference on Robotics and Biomimetics.

[14]  Banshidhar Majhi,et al.  Mammogram classification using two dimensional discrete wavelet transform and gray-level co-occurrence matrix for detection of breast cancer , 2015, Neurocomputing.

[15]  David V. Anderson,et al.  Improving the noise-robustness of mel-frequency cepstral coefficients for speech processing , 2006, SAPA@INTERSPEECH.

[16]  A. Ayatollahi,et al.  Comparing Gaussian and chirplet dictionaries for time-frequency analysis using matching pursuit decomposition , 2003, Proceedings of the 3rd IEEE International Symposium on Signal Processing and Information Technology (IEEE Cat. No.03EX795).

[17]  Nello Cristianini,et al.  Learning the Kernel Matrix with Semidefinite Programming , 2002, J. Mach. Learn. Res..

[18]  Jinhai Cai,et al.  Sensor Network for the Monitoring of Ecosystem: Bird Species Recognition , 2007, 2007 3rd International Conference on Intelligent Sensors, Sensor Networks and Information.

[19]  Tom J. Moir,et al.  Comparison of multiclass SVM classification techniques in an audio surveillance application under mismatched conditions , 2014, 2014 19th International Conference on Digital Signal Processing.

[20]  Wai Lok Woo,et al.  Wearable Audio Monitoring: Content-Based Processing Methodology and Implementation , 2014, IEEE Transactions on Human-Machine Systems.

[21]  Dejan Gjorgjevikj,et al.  Evaluation of Distance Measures for Multi-class Classification in Binary SVM Decision Tree , 2010, ICAISC.

[22]  H. Jaafar,et al.  Automatic syllables segmentation for frog identification system , 2013, 2013 IEEE 9th International Colloquium on Signal Processing and its Applications.

[23]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[24]  Zbigniew W. Ras,et al.  Multi-way Hierarchic Classification of Musical Instrument Sounds , 2007, 2007 International Conference on Multimedia and Ubiquitous Engineering (MUE'07).

[25]  Tara N. Sainath,et al.  Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.

[26]  Rémi Gribonval,et al.  Fast matching pursuit with a multiscale dictionary of Gaussian chirps , 2001, IEEE Trans. Signal Process..

[27]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[28]  Zhen Zhang,et al.  Auto-classification of insect images based on color histogram and GLCM , 2010, 2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery.

[29]  Luiz Eduardo Soares de Oliveira,et al.  Selection of Training Instances for Music Genre Classification , 2010, 2010 20th International Conference on Pattern Recognition.

[30]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[31]  Steven Kay,et al.  Modern Spectral Estimation: Theory and Application , 1988 .

[32]  Monique Thonnat,et al.  Audio-Video Event Recognition System for Public Transport Security , 2006 .

[33]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[34]  Reza Sabzevari,et al.  Improvement of learning algorithms for RBF neural networks in a helicopter sound identification system , 2007, Neurocomputing.

[35]  S. Viazzi,et al.  A novel method to automatically measure the feed intake of broiler chickens by sound technology , 2014 .

[36]  Ying Li,et al.  Environmental Sound Recognition Using Double-Level Energy Detection , 2013 .

[37]  Guang Yang,et al.  Matching-pursuit-based adaptive wavelet-packet atomic decomposition applied in ultrasonic inspection , 2007 .

[38]  Yan Song,et al.  Robust Sound Event Classification Using Deep Neural Networks , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[39]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[40]  Koji Abe,et al.  Sound classification for hearing aids using time-frequency images , 2011, Proceedings of 2011 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing.

[41]  Mohan S. Kankanhalli,et al.  Audio Based Event Detection for Multimedia Surveillance , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[42]  Augusto Sarti,et al.  Scream and gunshot detection and localization for audio-surveillance systems , 2007, 2007 IEEE Conference on Advanced Video and Signal Based Surveillance.

[43]  Lie Lu,et al.  Digital Object Identifier (DOI) 10.1007/s00530-002-0065-0 Multimedia Systems , 2003 .

[44]  R. Patterson,et al.  Complex Sounds and Auditory Images , 1992 .

[45]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[46]  Francesc Alías,et al.  Gammatone Cepstral Coefficients: Biologically Inspired Features for Non-Speech Audio Classification , 2012, IEEE Transactions on Multimedia.

[47]  Hanseok Ko,et al.  Acoustic and visual signal based context awareness system for mobile application , 2011, 2011 IEEE International Conference on Consumer Electronics (ICCE).

[48]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[49]  Jean-Marie Aerts,et al.  Original papers: Real-time recognition of sick pig cough sounds , 2008 .

[50]  Karthikeyan Umapathy,et al.  Multigroup classification of audio signals using time-frequency parameters , 2005, IEEE Transactions on Multimedia.

[51]  Geoffrey E. Hinton Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[52]  M. Tabacchi,et al.  A statistical pattern recognition approach for the classification of cooking stages. The boiling water case , 2013 .

[53]  C.-C. Jay Kuo,et al.  Environmental sound recognition using MP-based features , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[54]  Kuldip K. Paliwal,et al.  Spectral subband centroid features for speech recognition , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[55]  Paul Smolensky,et al.  Information processing in dynamical systems: foundations of harmony theory , 1986 .

[56]  Stan Z. Li,et al.  Content-based audio classification and retrieval using the nearest feature line method , 2000, IEEE Trans. Speech Audio Process..

[57]  Guy J. Brown,et al.  Fundamentals of Computational Auditory Scene Analysis , 2006 .

[58]  Douglas Keislar,et al.  Content-Based Classification, Search, and Retrieval of Audio , 1996, IEEE Multim..

[59]  Luis Alejandro Sánchez-Pérez,et al.  Aircraft take-off noises classification based on human auditory’s matched features extraction , 2014 .

[60]  Yang Peng,et al.  Audio sensors fusion based on vote for robot navigation , 2013, 2013 25th Chinese Control and Decision Conference (CCDC).

[61]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[62]  Bin Guo,et al.  Social Activity Recognition and Recommendation Based on Mobile Sound Sensing , 2013, 2013 IEEE 10th International Conference on Ubiquitous Intelligence and Computing and 2013 IEEE 10th International Conference on Autonomic and Trusted Computing.

[63]  J. Makhoul,et al.  Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.

[64]  Diego H. Milone,et al.  Automatic recognition of ingestive sounds of cattle based on hidden Markov models , 2012, Computers and Electronics in Agriculture.

[65]  Michel Vacher,et al.  Information extraction from sound for medical telemonitoring , 2006, IEEE Transactions on Information Technology in Biomedicine.

[66]  Insu Song,et al.  Content-based classification of breath sound with enhanced features , 2014, Neurocomputing.

[67]  Buket D. Barkana,et al.  NON-SPEECH ENVIRONMENTAL SOUND CLASSIFICATION USING SVMS WITH A NEW SET OF FEATURES , 2012 .

[68]  Satoshi Nakamura,et al.  Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition , 2000, LREC.

[69]  Yuan Yan Tang,et al.  Recognizing complex events in real movies by combining audio and video features , 2014, Neurocomputing.

[70]  Arivazhagan Selvaraj,et al.  Texture classification using wavelet transform , 2003, Pattern Recognit. Lett..

[71]  Xiaoli Z. Fern,et al.  Acoustic classification of multiple simultaneous bird species: a multi-instance multi-label approach. , 2012, The Journal of the Acoustical Society of America.

[72]  Herman J. M. Steeneken,et al.  Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems , 1993, Speech Commun..

[73]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[74]  Jaakko Astola,et al.  Audio based solutions for detecting intruders in wild areas , 2012, Signal Process..

[75]  Rémi Gribonval,et al.  Harmonic decomposition of audio signals with matching pursuit , 2003, IEEE Trans. Signal Process..

[76]  R Piccinini,et al.  Cough sound description in relation to respiratory diseases in dairy calves. , 2010, Preventive veterinary medicine.

[77]  Christian Wellekens,et al.  On desensitizing the Mel-cepstrum to spurious spectral components for robust speech recognition , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[78]  E. B. Newman,et al.  A Scale for the Measurement of the Psychological Magnitude Pitch , 1937 .

[79]  M. Chmulik,et al.  Bio-inspired optimization of acoustic features for generic sound recognition , 2012, 2012 19th International Conference on Systems, Signals and Image Processing (IWSSIP).

[80]  Delia Mitrea,et al.  Texture based characterization and automatic diagnosis of the abdominal tumors from ultrasound images using third order GLCM features , 2011, 2011 4th International Congress on Image and Signal Processing.

[81]  Tom J. Moir,et al.  An overview of applications and advancements in automatic sound recognition , 2016, Neurocomputing.

[82]  Pedro Antonio Gutiérrez,et al.  Ensembles of evolutionary product unit or RBF neural networks for the identification of sound for pass-by noise test in vehicles , 2013, Neurocomputing.

[83]  Tom J. Moir,et al.  Subband Time-Frequency Image Texture Features for Robust Audio Surveillance , 2015, IEEE Transactions on Information Forensics and Security.

[84]  Lonce Wyse,et al.  Audio events classification using hierarchical structure , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[85]  Paris Smaragdis,et al.  Hidden Markov and Gaussian mixture models for automatic call classification. , 2009, The Journal of the Acoustical Society of America.

[86]  Malcolm Slaney,et al.  Lyon's Cochlear Model , 1997 .

[87]  Keikichi Hirose,et al.  Spectrogram based features selection using multiple kernel learning for speech/music discrimination , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[88]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[89]  Jonathan William Dennis,et al.  Sound event recognition in unstructured environments using spectrogram image processing , 2014 .

[90]  Alaa Eleyan,et al.  Co-occurrence matrix and its statistical features as a new approach for face recognition , 2011, Turkish Journal of Electrical Engineering and Computer Sciences.

[91]  Michael A. Saunders,et al.  Atomic Decomposition by Basis Pursuit , 1998, SIAM J. Sci. Comput..

[92]  Md. Sumon Shahriar,et al.  A context aware sound classifier applied to prawn feed monitoring and energy disaggregation , 2013, Knowl. Based Syst..

[93]  Asma Rabaoui,et al.  Using One-Class SVMs and Wavelets for Audio Surveillance , 2008, IEEE Transactions on Information Forensics and Security.

[94]  Donald F. Specht,et al.  Probabilistic neural networks , 1990, Neural Networks.

[95]  Tom J. Moir,et al.  Cochleagram image feature for improved robustness in sound recognition , 2015, 2015 IEEE International Conference on Digital Signal Processing (DSP).

[96]  Boonserm Kijsirikul,et al.  Adaptive Directed Acyclic Graphs for Multiclass Classification , 2002, PRICAI.

[97]  Tom J. Moir,et al.  Subband spectral histogram feature for improved sound recognition in low SNR conditions , 2015, 2015 IEEE International Conference on Digital Signal Processing (DSP).

[98]  Andry Rakotonirainy,et al.  Acoustic Hazard Detection for Pedestrians With Obscured Hearing , 2011, IEEE Transactions on Intelligent Transportation Systems.

[99]  Jhing-Fa Wang,et al.  Environmental Sound Classification using Hybrid SVM/KNN Classifier and MPEG-7 Audio Low-Level Descriptor , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[100]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[101]  Jason Weston,et al.  Multi-Class Support Vector Machines , 1998 .

[102]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[103]  Michael S. Lewicki,et al.  Efficient Coding of Time-Relative Structure Using Spikes , 2005, Neural Computation.

[104]  Zhu Le-Qing,et al.  Insect Sound Recognition Based on MFCC and PNN , 2011, 2011 International Conference on Multimedia and Signal Processing.

[105]  Bin Gao,et al.  Cochleagram-based audio pattern separation using two-dimensional non-negative matrix factorization with automatic sparsity adaptation. , 2014, The Journal of the Acoustical Society of America.

[106]  Gaël Richard,et al.  ENST-Drums: an extensive audio-visual database for drum signals processing , 2006, ISMIR.

[107]  Manuel Rosa-Zurera,et al.  Transient modeling by matching pursuits with a wavelet dictionary for parametric audio coding , 2004, IEEE Signal Processing Letters.

[108]  Waleed H. Abdulla,et al.  Performance Evaluation of Front-end Processing for Speech Recognition Systems , 2005 .

[109]  DeLiang Wang,et al.  An algorithm to improve speech recognition in noise for hearing-impaired listeners. , 2013, The Journal of the Acoustical Society of America.

[110]  Koby Crammer,et al.  On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines , 2002, J. Mach. Learn. Res..

[111]  Guodong Guo,et al.  Content-based audio classification and retrieval by support vector machines , 2003, IEEE Trans. Neural Networks.

[112]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[113]  R.A. Goubran,et al.  Security-Monitoring using Microphone Arrays and Audio Classification , 2005, 2005 IEEE Instrumentationand Measurement Technology Conference Proceedings.

[114]  Tom J. Moir,et al.  Audio surveillance under noisy conditions using time-frequency image feature , 2014, 2014 19th International Conference on Digital Signal Processing.

[115]  Naotoshi Seo,et al.  A Comparison of Multi-class Support Vector Machine Methods for Face Recognition , 2007 .

[116]  Liu Yi Lung sound feature extraction based on parametric bispectrum analysis of higher-order cumulants , 2005 .

[117]  Xin Wang,et al.  GLCM texture based fractal method for evaluating fabric surface roughness , 2009, 2009 Canadian Conference on Electrical and Computer Engineering.

[118]  Richard M. Stern,et al.  Power-Normalized Cepstral Coefficients (PNCC) for Robust Speech Recognition , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[119]  Rasmus Berg Palm,et al.  Prediction as a candidate for learning deep hierarchical models of data , 2012 .

[120]  Hendrik Purwins,et al.  Sparse Approximations for Drum Sound Classification , 2011, IEEE Journal of Selected Topics in Signal Processing.

[121]  Haizhou Li,et al.  Spectrogram Image Feature for Sound Event Classification in Mismatched Conditions , 2011, IEEE Signal Processing Letters.

[122]  Emanuele Menegatti,et al.  Combining Audio and Video Surveillance with a Mobile Robot , 2007, Int. J. Artif. Intell. Tools.

[123]  John H. L. Hansen,et al.  Analysis of the root-cepstrum for acoustic modeling and fast decoding in speech recognition , 2001, INTERSPEECH.

[124]  Douglas D. O'Shaughnessy,et al.  Speech communication : human and machine , 1987 .

[125]  Steve Young,et al.  The HTK book version 3.4 , 2006 .

[126]  D. D. Greenwood A cochlear frequency-position function for several species--29 years later. , 1990, The Journal of the Acoustical Society of America.

[127]  DeLiang Wang,et al.  Boosted deep neural networks and multi-resolution cochleagram features for voice activity detection , 2014, INTERSPEECH.

[128]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[129]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[130]  Aapo Hyvärinen,et al.  Fast and robust fixed-point algorithms for independent component analysis , 1999, IEEE Trans. Neural Networks.

[131]  Gwo-Ching Chang,et al.  Investigation of noise effect on lung sound recognition , 2008, 2008 International Conference on Machine Learning and Cybernetics.

[132]  S. Mallat A wavelet tour of signal processing , 1998 .

[133]  Ulrich H.-G. Kreßel,et al.  Pairwise classification and support vector machines , 1999 .

[134]  Patrice Alexandre,et al.  Root cepstral analysis: A unified view. Application to speech processing in car noise environments , 1993, Speech Commun..

[135]  Tom J. Moir,et al.  Robust audio surveillance using spectrogram image texture feature , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[136]  Rajat Raina,et al.  Large-scale deep unsupervised learning using graphics processors , 2009, ICML '09.

[137]  Brian R Glasberg,et al.  Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[138]  Luiz S. Oliveira,et al.  Music genre recognition using spectrograms , 2011, 2011 18th International Conference on Systems, Signals and Image Processing.

[139]  Zahra Moussavi,et al.  Automatic and Unsupervised Snore Sound Extraction From Respiratory Sound Signals , 2011, IEEE Transactions on Biomedical Engineering.

[140]  Enrique Alexandre,et al.  Feature Selection for Sound Classification in Hearing Aids Through Restricted Search Driven by Genetic Algorithms , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[141]  Antonia Papandreou-Suppappola,et al.  Classification of Acoustic Emissions Using Modified Matching Pursuit , 2004, EURASIP J. Adv. Signal Process..

[142]  Tom J. Moir,et al.  Noise robust audio surveillance using reduced spectrogram image feature and one-against-all SVM , 2015, Neurocomputing.

[143]  George Kalliris,et al.  Bowel-sound pattern analysis using wavelets and neural networks with application to long-term, unsupervised, gastrointestinal motility monitoring , 2008, Expert Syst. Appl..

[144]  C.-C. Jay Kuo,et al.  Where am I? Scene Recognition for Mobile Robots using Audio Features , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[145]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[146]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[147]  Nello Cristianini,et al.  Large Margin DAGs for Multiclass Classification , 1999, NIPS.

[148]  V. Vapnik Pattern recognition using generalized portrait method , 1963 .

[149]  Alessandro L. Koerich,et al.  The Latin Music Database , 2008, ISMIR.

[150]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[151]  Tong Feng,et al.  Application of evolutionary neural network in impact acoustics based nondestructive inspection of tile-wall , 2005, Proceedings. 2005 International Conference on Communications, Circuits and Systems, 2005..

[152]  Thomas C. Walters Auditory-based processing of communication sounds , 2011 .

[153]  Lie Lu,et al.  Content analysis for audio classification and segmentation , 2002, IEEE Trans. Speech Audio Process..

[154]  Fernando Pérez-Cruz,et al.  Enhancing genetic feature selection through restricted search and Walsh analysis , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[155]  R. Radhakrishnan,et al.  Audio analysis for surveillance applications , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..

[156]  Oh-Wook Kwon,et al.  Cardiac disorder classification by heart sound signals using murmur likelihood and hidden Markov model state likelihood , 2012, IET Signal Process..

[157]  Malcolm Slaney,et al.  An Efficient Implementation of the Patterson-Holdsworth Auditory Filter Bank , 1997 .

[158]  Christian Breiteneder,et al.  Features for Content-Based Audio Retrieval , 2010, Adv. Comput..