论文信息 - Audio surveillance in unstructured environments - 字舞流文

Audio surveillance in unstructured environments

Roneel Vikash Sharan | R. Sharan

[1] Francesco Beritelli,et al. Human identity verification based on Mel frequency analysis of digital heart sounds , 2009, 2009 16th International Conference on Digital Signal Processing.

[2] Masataka Goto,et al. RWC Music Database: Music genre database and musical instrument sound database , 2003, ISMIR.

[3] Andrzej Czyzewski,et al. Audio-Visual Surveillance System for Application in Bank Operating Room , 2013, MCSS.

[4] Kuldip K. Paliwal,et al. Robust speech recognition in noisy environments based on subband spectral centroid histograms , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[5] Mohammad Bagher Menhaj,et al. Training feedforward networks with the Marquardt algorithm , 1994, IEEE Trans. Neural Networks.

[6] Jhing-Fa Wang,et al. Robust Environmental Sound Recognition for Home Automation , 2008, IEEE Transactions on Automation Science and Engineering.

[7] S. Suresh Kumar,et al. Color based Urban and Agricultural Land classification by GLCM Texture Features , 2012 .

[8] Shrikanth Narayanan,et al. Environmental Sound Recognition With Time–Frequency Audio Features , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[9] Bernhard E. Boser,et al. A training algorithm for optimal margin classifiers , 1992, COLT '92.

[10] Vesa T. Peltonen,et al. Audio-based context recognition , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[11] György Fazekas,et al. Automatic Ontology Generation for Musical Instruments Based on Audio Analysis , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[12] Jérôme Louradour,et al. Audio Events Detection in Public Transport Vehicle , 2006, 2006 IEEE Intelligent Transportation Systems Conference.

[13] Yangsheng Xu,et al. A surveillance robot with human recognition based on video and audio , 2010, 2010 IEEE International Conference on Robotics and Biomimetics.

[14] Banshidhar Majhi,et al. Mammogram classification using two dimensional discrete wavelet transform and gray-level co-occurrence matrix for detection of breast cancer , 2015, Neurocomputing.

[15] David V. Anderson,et al. Improving the noise-robustness of mel-frequency cepstral coefficients for speech processing , 2006, SAPA@INTERSPEECH.

[16] A. Ayatollahi,et al. Comparing Gaussian and chirplet dictionaries for time-frequency analysis using matching pursuit decomposition , 2003, Proceedings of the 3rd IEEE International Symposium on Signal Processing and Information Technology (IEEE Cat. No.03EX795).

[17] Nello Cristianini,et al. Learning the Kernel Matrix with Semidefinite Programming , 2002, J. Mach. Learn. Res..

[18] Jinhai Cai,et al. Sensor Network for the Monitoring of Ecosystem: Bird Species Recognition , 2007, 2007 3rd International Conference on Intelligent Sensors, Sensor Networks and Information.

[19] Tom J. Moir,et al. Comparison of multiclass SVM classification techniques in an audio surveillance application under mismatched conditions , 2014, 2014 19th International Conference on Digital Signal Processing.

[20] Wai Lok Woo,et al. Wearable Audio Monitoring: Content-Based Processing Methodology and Implementation , 2014, IEEE Transactions on Human-Machine Systems.

[21] Dejan Gjorgjevikj,et al. Evaluation of Distance Measures for Multi-class Classification in Binary SVM Decision Tree , 2010, ICAISC.

[22] H. Jaafar,et al. Automatic syllables segmentation for frog identification system , 2013, 2013 IEEE 9th International Colloquium on Signal Processing and its Applications.

[23] Yee Whye Teh,et al. A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[24] Zbigniew W. Ras,et al. Multi-way Hierarchic Classification of Musical Instrument Sounds , 2007, 2007 International Conference on Multimedia and Ubiquitous Engineering (MUE'07).

[25] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.

[26] Rémi Gribonval,et al. Fast matching pursuit with a multiscale dictionary of Gaussian chirps , 2001, IEEE Trans. Signal Process..

[27] Geoffrey E. Hinton,et al. Learning representations by back-propagating errors , 1986, Nature.

[28] Zhen Zhang,et al. Auto-classification of insect images based on color histogram and GLCM , 2010, 2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery.

[29] Luiz Eduardo Soares de Oliveira,et al. Selection of Training Instances for Music Genre Classification , 2010, 2010 20th International Conference on Pattern Recognition.

[30] M.G. Bellanger,et al. Digital processing of speech signals , 1980, Proceedings of the IEEE.

[31] Steven Kay,et al. Modern Spectral Estimation: Theory and Application , 1988 .

[32] Monique Thonnat,et al. Audio-Video Event Recognition System for Public Transport Security , 2006 .

[33] James Kennedy,et al. Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[34] Reza Sabzevari,et al. Improvement of learning algorithms for RBF neural networks in a helicopter sound identification system , 2007, Neurocomputing.

[35] S. Viazzi,et al. A novel method to automatically measure the feed intake of broiler chickens by sound technology , 2014 .

[36] Ying Li,et al. Environmental Sound Recognition Using Double-Level Energy Detection , 2013 .

[37] Guang Yang,et al. Matching-pursuit-based adaptive wavelet-packet atomic decomposition applied in ultrasonic inspection , 2007 .

[38] Yan Song,et al. Robust Sound Event Classification Using Deep Neural Networks , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[39] Bernhard Schölkopf,et al. Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[40] Koji Abe,et al. Sound classification for hearing aids using time-frequency images , 2011, Proceedings of 2011 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing.

[41] Mohan S. Kankanhalli,et al. Audio Based Event Detection for Multimedia Surveillance , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[42] Augusto Sarti,et al. Scream and gunshot detection and localization for audio-surveillance systems , 2007, 2007 IEEE Conference on Advanced Video and Signal Based Surveillance.

[43] Lie Lu,et al. Digital Object Identifier (DOI) 10.1007/s00530-002-0065-0 Multimedia Systems , 2003 .

[44] R. Patterson,et al. Complex Sounds and Auditory Images , 1992 .

[45] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[46] Francesc Alías,et al. Gammatone Cepstral Coefficients: Biologically Inspired Features for Non-Speech Audio Classification , 2012, IEEE Transactions on Multimedia.

[47] Hanseok Ko,et al. Acoustic and visual signal based context awareness system for mobile application , 2011, 2011 IEEE International Conference on Consumer Electronics (ICCE).

[48] Stan Davis,et al. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[49] Jean-Marie Aerts,et al. Original papers: Real-time recognition of sick pig cough sounds , 2008 .

[50] Karthikeyan Umapathy,et al. Multigroup classification of audio signals using time-frequency parameters , 2005, IEEE Transactions on Multimedia.

[51] Geoffrey E. Hinton. Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[52] M. Tabacchi,et al. A statistical pattern recognition approach for the classification of cooking stages. The boiling water case , 2013 .

[53] C.-C. Jay Kuo,et al. Environmental sound recognition using MP-based features , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[54] Kuldip K. Paliwal,et al. Spectral subband centroid features for speech recognition , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[55] Paul Smolensky,et al. Information processing in dynamical systems: foundations of harmony theory , 1986 .

[56] Stan Z. Li,et al. Content-based audio classification and retrieval using the nearest feature line method , 2000, IEEE Trans. Speech Audio Process..

[57] Guy J. Brown,et al. Fundamentals of Computational Auditory Scene Analysis , 2006 .

[58] Douglas Keislar,et al. Content-Based Classification, Search, and Retrieval of Audio , 1996, IEEE Multim..

[59] Luis Alejandro Sánchez-Pérez,et al. Aircraft take-off noises classification based on human auditory’s matched features extraction , 2014 .

[60] Yang Peng,et al. Audio sensors fusion based on vote for robot navigation , 2013, 2013 25th Chinese Control and Decision Conference (CCDC).

[61] Chih-Jen Lin,et al. A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[62] Bin Guo,et al. Social Activity Recognition and Recommendation Based on Mobile Sound Sensing , 2013, 2013 IEEE 10th International Conference on Ubiquitous Intelligence and Computing and 2013 IEEE 10th International Conference on Autonomic and Trusted Computing.

[63] J. Makhoul,et al. Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.

[64] Diego H. Milone,et al. Automatic recognition of ingestive sounds of cattle based on hidden Markov models , 2012, Computers and Electronics in Agriculture.

[65] Michel Vacher,et al. Information extraction from sound for medical telemonitoring , 2006, IEEE Transactions on Information Technology in Biomedicine.

[66] Insu Song,et al. Content-based classification of breath sound with enhanced features , 2014, Neurocomputing.

[67] Buket D. Barkana,et al. NON-SPEECH ENVIRONMENTAL SOUND CLASSIFICATION USING SVMS WITH A NEW SET OF FEATURES , 2012 .

[68] Satoshi Nakamura,et al. Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition , 2000, LREC.

[69] Yuan Yan Tang,et al. Recognizing complex events in real movies by combining audio and video features , 2014, Neurocomputing.

[70] Arivazhagan Selvaraj,et al. Texture classification using wavelet transform , 2003, Pattern Recognit. Lett..

[71] Xiaoli Z. Fern,et al. Acoustic classification of multiple simultaneous bird species: a multi-instance multi-label approach. , 2012, The Journal of the Acoustical Society of America.

[72] Herman J. M. Steeneken,et al. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems , 1993, Speech Commun..

[73] Corinna Cortes,et al. Support-Vector Networks , 1995, Machine Learning.

[74] Jaakko Astola,et al. Audio based solutions for detecting intruders in wild areas , 2012, Signal Process..

[75] Rémi Gribonval,et al. Harmonic decomposition of audio signals with matching pursuit , 2003, IEEE Trans. Signal Process..

[76] R Piccinini,et al. Cough sound description in relation to respiratory diseases in dairy calves. , 2010, Preventive veterinary medicine.

[77] Christian Wellekens,et al. On desensitizing the Mel-cepstrum to spurious spectral components for robust speech recognition , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[78] E. B. Newman,et al. A Scale for the Measurement of the Psychological Magnitude Pitch , 1937 .

[79] M. Chmulik,et al. Bio-inspired optimization of acoustic features for generic sound recognition , 2012, 2012 19th International Conference on Systems, Signals and Image Processing (IWSSIP).

[80] Delia Mitrea,et al. Texture based characterization and automatic diagnosis of the abdominal tumors from ultrasound images using third order GLCM features , 2011, 2011 4th International Congress on Image and Signal Processing.

[81] Tom J. Moir,et al. An overview of applications and advancements in automatic sound recognition , 2016, Neurocomputing.

[82] Pedro Antonio Gutiérrez,et al. Ensembles of evolutionary product unit or RBF neural networks for the identification of sound for pass-by noise test in vehicles , 2013, Neurocomputing.

[83] Tom J. Moir,et al. Subband Time-Frequency Image Texture Features for Robust Audio Surveillance , 2015, IEEE Transactions on Information Forensics and Security.

[84] Lonce Wyse,et al. Audio events classification using hierarchical structure , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[85] Paris Smaragdis,et al. Hidden Markov and Gaussian mixture models for automatic call classification. , 2009, The Journal of the Acoustical Society of America.

[86] Malcolm Slaney,et al. Lyon's Cochlear Model , 1997 .

[87] Keikichi Hirose,et al. Spectrogram based features selection using multiple kernel learning for speech/music discrimination , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[88] S. Chiba,et al. Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[89] Jonathan William Dennis,et al. Sound event recognition in unstructured environments using spectrogram image processing , 2014 .

[90] Alaa Eleyan,et al. Co-occurrence matrix and its statistical features as a new approach for face recognition , 2011, Turkish Journal of Electrical Engineering and Computer Sciences.

[91] Michael A. Saunders,et al. Atomic Decomposition by Basis Pursuit , 1998, SIAM J. Sci. Comput..

[92] Md. Sumon Shahriar,et al. A context aware sound classifier applied to prawn feed monitoring and energy disaggregation , 2013, Knowl. Based Syst..

[93] Asma Rabaoui,et al. Using One-Class SVMs and Wavelets for Audio Surveillance , 2008, IEEE Transactions on Information Forensics and Security.

[94] Donald F. Specht,et al. Probabilistic neural networks , 1990, Neural Networks.

[95] Tom J. Moir,et al. Cochleagram image feature for improved robustness in sound recognition , 2015, 2015 IEEE International Conference on Digital Signal Processing (DSP).

[96] Boonserm Kijsirikul,et al. Adaptive Directed Acyclic Graphs for Multiclass Classification , 2002, PRICAI.

[97] Tom J. Moir,et al. Subband spectral histogram feature for improved sound recognition in low SNR conditions , 2015, 2015 IEEE International Conference on Digital Signal Processing (DSP).

[98] Andry Rakotonirainy,et al. Acoustic Hazard Detection for Pedestrians With Obscured Hearing , 2011, IEEE Transactions on Intelligent Transportation Systems.

[99] Jhing-Fa Wang,et al. Environmental Sound Classification using Hybrid SVM/KNN Classifier and MPEG-7 Audio Low-Level Descriptor , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[100] Robert M. Haralick,et al. Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[101] Jason Weston,et al. Multi-Class Support Vector Machines , 1998 .

[102] S. P. Lloyd,et al. Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[103] Michael S. Lewicki,et al. Efficient Coding of Time-Relative Structure Using Spikes , 2005, Neural Computation.

[104] Zhu Le-Qing,et al. Insect Sound Recognition Based on MFCC and PNN , 2011, 2011 International Conference on Multimedia and Signal Processing.

[105] Bin Gao,et al. Cochleagram-based audio pattern separation using two-dimensional non-negative matrix factorization with automatic sparsity adaptation. , 2014, The Journal of the Acoustical Society of America.

[106] Gaël Richard,et al. ENST-Drums: an extensive audio-visual database for drum signals processing , 2006, ISMIR.

[107] Manuel Rosa-Zurera,et al. Transient modeling by matching pursuits with a wavelet dictionary for parametric audio coding , 2004, IEEE Signal Processing Letters.

[108] Waleed H. Abdulla,et al. Performance Evaluation of Front-end Processing for Speech Recognition Systems , 2005 .

[109] DeLiang Wang,et al. An algorithm to improve speech recognition in noise for hearing-impaired listeners. , 2013, The Journal of the Acoustical Society of America.

[110] Koby Crammer,et al. On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines , 2002, J. Mach. Learn. Res..

[111] Guodong Guo,et al. Content-based audio classification and retrieval by support vector machines , 2003, IEEE Trans. Neural Networks.

[112] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.

[113] R.A. Goubran,et al. Security-Monitoring using Microphone Arrays and Audio Classification , 2005, 2005 IEEE Instrumentationand Measurement Technology Conference Proceedings.

[114] Tom J. Moir,et al. Audio surveillance under noisy conditions using time-frequency image feature , 2014, 2014 19th International Conference on Digital Signal Processing.

[115] Naotoshi Seo,et al. A Comparison of Multi-class Support Vector Machine Methods for Face Recognition , 2007 .

[116] Liu Yi. Lung sound feature extraction based on parametric bispectrum analysis of higher-order cumulants , 2005 .

[117] Xin Wang,et al. GLCM texture based fractal method for evaluating fabric surface roughness , 2009, 2009 Canadian Conference on Electrical and Computer Engineering.

[118] Richard M. Stern,et al. Power-Normalized Cepstral Coefficients (PNCC) for Robust Speech Recognition , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[119] Rasmus Berg Palm,et al. Prediction as a candidate for learning deep hierarchical models of data , 2012 .

[120] Hendrik Purwins,et al. Sparse Approximations for Drum Sound Classification , 2011, IEEE Journal of Selected Topics in Signal Processing.

[121] Haizhou Li,et al. Spectrogram Image Feature for Sound Event Classification in Mismatched Conditions , 2011, IEEE Signal Processing Letters.

[122] Emanuele Menegatti,et al. Combining Audio and Video Surveillance with a Mobile Robot , 2007, Int. J. Artif. Intell. Tools.

[123] John H. L. Hansen,et al. Analysis of the root-cepstrum for acoustic modeling and fast decoding in speech recognition , 2001, INTERSPEECH.

[124] Douglas D. O'Shaughnessy,et al. Speech communication : human and machine , 1987 .

[125] Steve Young,et al. The HTK book version 3.4 , 2006 .

[126] D. D. Greenwood. A cochlear frequency-position function for several species--29 years later. , 1990, The Journal of the Acoustical Society of America.

[127] DeLiang Wang,et al. Boosted deep neural networks and multi-resolution cochleagram features for voice activity detection , 2014, INTERSPEECH.

[128] George Tzanetakis,et al. Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[129] Vladimir Vapnik,et al. Statistical learning theory , 1998 .

[130] Aapo Hyvärinen,et al. Fast and robust fixed-point algorithms for independent component analysis , 1999, IEEE Trans. Neural Networks.

[131] Gwo-Ching Chang,et al. Investigation of noise effect on lung sound recognition , 2008, 2008 International Conference on Machine Learning and Cybernetics.

[132] S. Mallat. A wavelet tour of signal processing , 1998 .

[133] Ulrich H.-G. Kreßel,et al. Pairwise classification and support vector machines , 1999 .

[134] Patrice Alexandre,et al. Root cepstral analysis: A unified view. Application to speech processing in car noise environments , 1993, Speech Commun..

[135] Tom J. Moir,et al. Robust audio surveillance using spectrogram image texture feature , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[136] Rajat Raina,et al. Large-scale deep unsupervised learning using graphics processors , 2009, ICML '09.

[137] Brian R Glasberg,et al. Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[138] Luiz S. Oliveira,et al. Music genre recognition using spectrograms , 2011, 2011 18th International Conference on Systems, Signals and Image Processing.

[139] Zahra Moussavi,et al. Automatic and Unsupervised Snore Sound Extraction From Respiratory Sound Signals , 2011, IEEE Transactions on Biomedical Engineering.

[140] Enrique Alexandre,et al. Feature Selection for Sound Classification in Hearing Aids Through Restricted Search Driven by Genetic Algorithms , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[141] Antonia Papandreou-Suppappola,et al. Classification of Acoustic Emissions Using Modified Matching Pursuit , 2004, EURASIP J. Adv. Signal Process..

[142] Tom J. Moir,et al. Noise robust audio surveillance using reduced spectrogram image feature and one-against-all SVM , 2015, Neurocomputing.

[143] George Kalliris,et al. Bowel-sound pattern analysis using wavelets and neural networks with application to long-term, unsupervised, gastrointestinal motility monitoring , 2008, Expert Syst. Appl..

[144] C.-C. Jay Kuo,et al. Where am I? Scene Recognition for Mobile Robots using Audio Features , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[145] Stéphane Mallat,et al. Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[146] E. Parzen. On Estimation of a Probability Density Function and Mode , 1962 .

[147] Nello Cristianini,et al. Large Margin DAGs for Multiclass Classification , 1999, NIPS.

[148] V. Vapnik. Pattern recognition using generalized portrait method , 1963 .

[149] Alessandro L. Koerich,et al. The Latin Music Database , 2008, ISMIR.

[150] Heekuck Oh,et al. Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[151] Tong Feng,et al. Application of evolutionary neural network in impact acoustics based nondestructive inspection of tile-wall , 2005, Proceedings. 2005 International Conference on Communications, Circuits and Systems, 2005..

[152] Thomas C. Walters. Auditory-based processing of communication sounds , 2011 .

[153] Lie Lu,et al. Content analysis for audio classification and segmentation , 2002, IEEE Trans. Speech Audio Process..

[154] Fernando Pérez-Cruz,et al. Enhancing genetic feature selection through restricted search and Walsh analysis , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[155] R. Radhakrishnan,et al. Audio analysis for surveillance applications , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..

[156] Oh-Wook Kwon,et al. Cardiac disorder classification by heart sound signals using murmur likelihood and hidden Markov model state likelihood , 2012, IET Signal Process..

[157] Malcolm Slaney,et al. An Efficient Implementation of the Patterson-Holdsworth Auditory Filter Bank , 1997 .

[158] Christian Breiteneder,et al. Features for Content-Based Audio Retrieval , 2010, Adv. Comput..