Real-Time monophonic and polyphonic audio classification from power spectra

Christophe Biernacki | Maxime Baelde | Raphaël Greff

[1] Haibin Ling, et al. Attention guided deep audio-face fusion for efficient speaker naming, 2019, Pattern Recognit.

[2] Roberto Togneri, et al. Random forest classification based acoustic event detection utilizing contextual-information and bottleneck features, 2018, Pattern Recognit.

[3] Ankit Shah, et al. DCASE2017 Challenge Setup: Tasks, Datasets and Baseline System, 2017, DCASE.

[4] Maxime Baelde, et al. Classification de signaux audio en temps-réel par un modèle de mélanges d'histogrammes, 2017.

[5] Christophe Biernacki, et al. A mixture model-based real-time audio sources classification method, 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[6] Gaël Richard, et al. Overlapping sound event detection with supervised Nonnegative Matrix Factorization, 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[7] Heikki Huttunen, et al. Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection, 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[8] Yanmin Qian, et al. Very Deep Convolutional Neural Networks for Noise Robust Speech Recognition, 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[9] Tuomas Virtanen, et al. TUT database for acoustic scene classification and sound event detection, 2016, 2016 24th European Signal Processing Conference (EUSIPCO).

[10] Joachim Flocon-Cholet. Classification audio sous contrainte de faible latence, 2016.

[11] Annamaria Mesaros, et al. Metrics for Polyphonic Sound Event Detection, 2016.

[12] Mathieu Lagrange, et al. Detection of overlapping acoustic events using a temporally-constrained probabilistic model, 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13] Jesús Favela, et al. Scalable identification of mixed environmental sounds, recorded from heterogeneous sources, 2015, Pattern Recognit. Lett.

[14] Elmer P. Dadios, et al. Neural network classification for detecting abnormal events in a public transport vehicle, 2015, 2015 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM).

[15] Karol J. Piczak. Environmental sound classification with convolutional neural networks, 2015, 2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP).

[16] Karol J. Piczak. ESC: Dataset for Environmental Sound Classification, 2015, ACM Multimedia.

[17] Dan Stowell, et al. Detection and Classification of Acoustic Scenes and Events, 2015, IEEE Transactions on Multimedia.

[18] Onur Dikmen, et al. Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations, 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[19] Dimitri Palaz, et al. Convolutional Neural Networks-based continuous speech recognition using raw speech signal, 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[20] Francis R. Bach, et al. An online em algorithm in hidden (semi-)Markov models for audio segmentation and clustering, 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[21] Gursimran Kour, et al. Music Genre Classification using MFCC, SVM and BPNN, 2015.

[22] Souli S. Sameh, et al. On the Use of Time–Frequency Reassignment and SVM-Based Classifier for Audio Surveillance Applications, 2014.

[23] R. Biondi, et al. Low Cost Real Time Robust Identification of Impulsive Signals, 2014.

[24] Markus Flierl, et al. Bayesian estimation of Dirichlet mixture model with variational inference, 2014, Pattern Recognit.

[25] Wei Jiang, et al. Latent topic model for audio retrieval, 2014, Pattern Recognit.

[26] Hakan Erdogan, et al. Deep neural networks for single channel source separation, 2013, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[27] Moncef Gabbouj, et al. Supervised model training for overlapping sound events based on unsupervised source separation, 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[28] Joon-Hyuk Chang, et al. On using acoustic environment classification for statistical model-based speech enhancement, 2012, Speech Commun.

[29] Patrick Susini, et al. The Timbre Toolbox: extracting audio descriptors from musical signals, 2011, The Journal of the Acoustical Society of America.

[30] Cédric Richard, et al. Abnormal events detection using unsupervised One-Class SVM - Application to audio surveillance and evaluation -, 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[31] Tuomas Virtanen, et al. Audio context recognition using audio event histograms, 2010, 2010 18th European Signal Processing Conference.

[32] Geoff Holmes, et al. Classifier chains for multi-label classification, 2009, Machine Learning.

[33] Dan Istrate, et al. Real time sound analysis for medical remote monitoring, 2008, 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[34] Grigorios Tsoumakas, et al. Random k-Labelsets: An Ensemble Method for Multilabel Classification, 2007, ECML.

[35] Andrey Temko, et al. Classification of acoustic events using SVM-based clustering schemes, 2006, Pattern Recognit.

[36] Chloé Clavel, et al. Events Detection for an Audio-Based Surveillance System, 2005, 2005 IEEE International Conference on Multimedia and Expo.

[37] Eric R. Ziegel, et al. The Elements of Statistical Learning, 2003, Technometrics.

[38] G. Celeux, et al. Assessing a Mixture Model for Clustering with the Integrated Classification Likelihood, 1998.

[39] P. Paatero, et al. Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values, 1994.

[40] Lawrence R. Rabiner, et al. A tutorial on hidden Markov models and selected applications in speech recognition, 1989, Proc. IEEE.

[41] J. H. Ward. Hierarchical Grouping to Optimize an Objective Function, 1963.

[42] E. Hellinger, et al. Neue Begründung der Theorie quadratischer Formen von unendlichvielen Veränderlichen, 1909.

[43] Qasem A. Al-Radaideh, et al. A Multi-Label Classification Approach Based on Correlations Among Labels, 2015.

[44] Gaël Richard, et al. Temporal Integration for Audio Classification With Application to Musical Instrument Classification, 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[45] Bhiksha Raj, et al. A Probabilistic Latent Variable Model for Acoustic Modeling, 2006.