Real-Time monophonic and polyphonic audio classification from power spectra

[1]  Haibin Ling,et al.  Attention guided deep audio-face fusion for efficient speaker naming , 2019, Pattern Recognit..

[2]  Roberto Togneri,et al.  Random forest classification based acoustic event detection utilizing contextual-information and bottleneck features , 2018, Pattern Recognit..

[3]  Ankit Shah,et al.  DCASE2017 Challenge Setup: Tasks, Datasets and Baseline System , 2017, DCASE.

[4]  Maxime Baelde,et al.  Classification de signaux audio en temps-réel par un modèle de mélanges d'histogrammes , 2017 .

[5]  Christophe Biernacki,et al.  A mixture model-based real-time audio sources classification method , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[6]  Gaël Richard,et al.  Overlapping sound event detection with supervised Nonnegative Matrix Factorization , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[7]  Heikki Huttunen,et al.  Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[8]  Yanmin Qian,et al.  Very Deep Convolutional Neural Networks for Noise Robust Speech Recognition , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[9]  Tuomas Virtanen,et al.  TUT database for acoustic scene classification and sound event detection , 2016, 2016 24th European Signal Processing Conference (EUSIPCO).

[10]  Joachim Flocon-Cholet Classification audio sous contrainte de faible latence , 2016 .

[11]  Annamaria Mesaros,et al.  Metrics for Polyphonic Sound Event Detection , 2016 .

[12]  Mathieu Lagrange,et al.  Detection of overlapping acoustic events using a temporally-constrained probabilistic model , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13]  Jesús Favela,et al.  Scalable identification of mixed environmental sounds, recorded from heterogeneous sources , 2015, Pattern Recognit. Lett..

[14]  Elmer P. Dadios,et al.  Neural network classification for detecting abnormal events in a public transport vehicle , 2015, 2015 International Conference on Humanoid, Nanotechnology, Information Technology,Communication and Control, Environment and Management (HNICEM).

[15]  Karol J. Piczak Environmental sound classification with convolutional neural networks , 2015, 2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP).

[16]  Karol J. Piczak ESC: Dataset for Environmental Sound Classification , 2015, ACM Multimedia.

[17]  Dan Stowell,et al.  Detection and Classification of Acoustic Scenes and Events , 2015, IEEE Transactions on Multimedia.

[18]  Onur Dikmen,et al.  Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[19]  Dimitri Palaz,et al.  Convolutional Neural Networks-based continuous speech recognition using raw speech signal , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[20]  Francis R. Bach,et al.  An online em algorithm in hidden (semi-)Markov models for audio segmentation and clustering , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[21]  Gursimran Kour,et al.  Music Genre Classification using MFCC, SVM and BPNN , 2015 .

[22]  Souli S. Sameh,et al.  On the Use of Time–Frequency Reassignment and SVM-Based Classifier for Audio Surveillance Applications , 2014 .

[23]  R. Biondi,et al.  Low Cost Real Time Robust Identification of Impulsive Signals , 2014 .

[24]  Markus Flierl,et al.  Bayesian estimation of Dirichlet mixture model with variational inference , 2014, Pattern Recognit..

[25]  Wei Jiang,et al.  Latent topic model for audio retrieval , 2014, Pattern Recognit..

[26]  Hakan Erdogan,et al.  Deep neural networks for single channel source separation , 2013, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[27]  Moncef Gabbouj,et al.  Supervised model training for overlapping sound events based on unsupervised source separation , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[28]  Joon-Hyuk Chang,et al.  On using acoustic environment classification for statistical model-based speech enhancement , 2012, Speech Commun..

[29]  Patrick Susini,et al.  The Timbre Toolbox: extracting audio descriptors from musical signals. , 2011, The Journal of the Acoustical Society of America.

[30]  Cédric Richard,et al.  Abnormal events detection using unsupervised One-Class SVM - Application to audio surveillance and evaluation - , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[31]  Tuomas Virtanen,et al.  Audio context recognition using audio event histograms , 2010, 2010 18th European Signal Processing Conference.

[32]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[33]  Dan Istrate,et al.  Real time sound analysis for medical remote monitoring , 2008, 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[34]  Grigorios Tsoumakas,et al.  Random k -Labelsets: An Ensemble Method for Multilabel Classification , 2007, ECML.

[35]  Andrey Temko,et al.  Classification of acoustic events using SVM-based clustering schemes , 2006, Pattern Recognit..

[36]  Chloé Clavel,et al.  Events Detection for an Audio-Based Surveillance System , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[37]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[38]  G. Celeux,et al.  Assessing a Mixture Model for Clustering with the Integrated Classification Likelihood , 1998 .

[39]  P. Paatero,et al.  Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values† , 1994 .

[40]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[41]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[42]  E. Hellinger,et al.  Neue Begründung der Theorie quadratischer Formen von unendlichvielen Veränderlichen. , 1909 .

[43]  Qasem A. Al-Radaideh,et al.  A Multi-Label Classification Approach Based on Correlations Among Labels , 2015 .

[44]  Gaël Richard,et al.  Temporal Integration for Audio Classification With Application to Musical Instrument Classification , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[45]  Bhiksha Raj,et al.  A Probabilistic Latent Variable Model for Acoustic Modeling , 2006 .