GTH-UPM System for Albayzin Multimodal Diarization Challenge 2020
Fernando Fernández Martínez | José Manuel Pardo Muñoz | Ricardo Kleinlein | Cristina Luna Jiménez | José Manuel Moya-Fernández
[1] Jimmy Ba, et al. Adam: A Method for Stochastic Optimization, 2014, ICLR.
[2] Kenneth Ward Church, et al. Third DIHARD Challenge Evaluation Plan, 2020, ArXiv.
[3] Yu Qiao, et al. Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks, 2016, IEEE Signal Processing Letters.
[4] James Philbin, et al. FaceNet: A unified embedding for face recognition and clustering, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[5] Sanjeev Khudanpur, et al. X-Vectors: Robust DNN Embeddings for Speaker Recognition, 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[6] Nicholas W. D. Evans, et al. ODESSA/PLUMCOT at Albayzin Multimodal Diarization Challenge 2018, 2018, IberSPEECH.
[7] Marie Kunesová, et al. Audio-Video Speaker Diarization for Unsupervised Speaker and Face Model Creation, 2014, TSD.
[8] Laura Docío Fernández, et al. The GTM-UVIGO System for Audiovisual Diarization, 2018, IberSPEECH.
[9] Eduardo Lleida, et al. Domain Adaptation of PLDA Models in Broadcast Diarization by Means of Unsupervised Speaker Clustering, 2017, INTERSPEECH.
[10] Gaël Varoquaux, et al. Scikit-learn: Machine Learning in Python, 2011, J. Mach. Learn. Res.
[11] Hans-Peter Kriegel, et al. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise, 1996, KDD.
[12] Naoyuki Kanda, et al. End-to-End Neural Speaker Diarization with Self-Attention, 2019, 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).
[13] Josep Ramon Morros, et al. UPC Multimodal Speaker Diarization System for the 2018 Albayzin Challenge, 2018, IberSPEECH.
[14] Suramya Tomar, et al. Converting video formats with FFmpeg, 2006.
[15] Jitendra Ajmera, et al. A robust speaker clustering algorithm, 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).
[16] Quan Wang, et al. Fully Supervised Speaker Diarization, 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[17] P. Rousseeuw. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis, 1987.
[18] Javier Lorenzo-Navarro, et al. Who is Really Talking? A Visual-Based Speaker Diarization Strategy, 2017, EUROCAST.