Centroid-aware local discriminative metric learning in speaker verification
暂无分享,去创建一个
Wei Li | Feiyue Huang | Weiming Dong | Bao-Gang Hu | Kekai Sheng | Joseph Razik | Weiming Dong | Joseph Razik | Wei Li | Feiyue Huang | Kekai Sheng | Bao-Gang Hu
[1] Yun Lei,et al. Robust feature front-end for speaker identification , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] Themos Stafylakis,et al. A Study of the Cosine Distance-Based Mean Shift for Telephone Speech Diarization , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[3] Zeyuan Allen Zhu,et al. Exploiting the Structure: Stochastic Gradient Methods Using Raw Clusters , 2016, NIPS.
[4] Wei Yang,et al. Fast neighborhood component analysis , 2012, Neurocomputing.
[5] Lianwen Jin,et al. DropSample: A New Training Method to Enhance Deep Convolutional Neural Networks for Large-Scale Unconstrained Handwritten Chinese Character Recognition , 2015, Pattern Recognit..
[6] Qingming Huang,et al. Relay Backpropagation for Effective Learning of Deep Convolutional Neural Networks , 2015, ECCV.
[7] Geoffrey E. Hinton,et al. Neighbourhood Components Analysis , 2004, NIPS.
[8] N. Lack. Non-Parametric Discriminant Analysis , 1988 .
[9] Patrick Kenny,et al. Bayesian Speaker Verification with Heavy-Tailed Priors , 2010, Odyssey.
[10] James R. Glass,et al. A channel-blind system for speaker verification , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[11] Douglas A. Reynolds,et al. Deep Neural Network Approaches to Speaker and Language Recognition , 2015, IEEE Signal Processing Letters.
[12] J. van Leeuwen,et al. Neural Networks: Tricks of the Trade , 2002, Lecture Notes in Computer Science.
[13] François Fleuret,et al. Large Scale Hard Sample Mining with Monte Carlo Tree Search , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[14] Kilian Q. Weinberger,et al. Stochastic triplet embedding , 2012, 2012 IEEE International Workshop on Machine Learning for Signal Processing.
[15] Seyed Omid Sadjadi,et al. The IBM 2016 Speaker Recognition System , 2016, Odyssey.
[16] P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .
[17] Douglas A. Reynolds,et al. Language Recognition via i-vectors and Dimensionality Reduction , 2011, INTERSPEECH.
[18] Chen Huang,et al. Local Similarity-Aware Deep Feature Embedding , 2016, NIPS.
[19] Patrick Kenny,et al. Eigenvoice modeling with sparse training data , 2005, IEEE Transactions on Speech and Audio Processing.
[20] Xiao Liu,et al. Deep Speaker: an End-to-End Neural Speaker Embedding System , 2017, ArXiv.
[21] Patrick Kenny,et al. An i-vector Extractor Suitable for Speaker Recognition with both Microphone and Telephone Speech , 2010, Odyssey.
[22] Silvio Savarese,et al. Deep Metric Learning via Lifted Structured Feature Embedding , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[23] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .
[24] H L HansenJohn. Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition , 1996 .
[25] Manohar Paluri,et al. Metric Learning with Adaptive Density Discrimination , 2015, ICLR.
[26] Christopher D. Manning,et al. Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..
[27] Jian Sun,et al. Bayesian Face Revisited: A Joint Formulation , 2012, ECCV.
[28] Abhinav Gupta,et al. Training Region-Based Object Detectors with Online Hard Example Mining , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[29] Björn Ommer,et al. CliqueCNN: Deep Unsupervised Exemplar Learning , 2016, NIPS.
[30] Yann LeCun,et al. Dimensionality Reduction by Learning an Invariant Mapping , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).
[31] James H. Elder,et al. Probabilistic Linear Discriminant Analysis for Inferences About Identity , 2007, 2007 IEEE 11th International Conference on Computer Vision.
[32] Hugo Van hamme,et al. Speaker age estimation using i-vectors , 2014, Eng. Appl. Artif. Intell..
[33] Inderjit S. Dhillon,et al. Information-theoretic metric learning , 2006, ICML '07.
[34] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[35] Andrzej Drygajlo,et al. Speaker verification in score-ageing-quality classification space , 2013, Comput. Speech Lang..
[36] John H. L. Hansen,et al. An Investigation into Back-end Advancements for Speaker Recognition in Multi-Session and Noisy Enrollment Scenarios , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[37] Andreas Stolcke,et al. Within-class covariance normalization for SVM-based speaker recognition , 2006, INTERSPEECH.
[38] Sergey Ioffe,et al. Probabilistic Linear Discriminant Analysis , 2006, ECCV.
[39] Kilian Q. Weinberger,et al. Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.
[40] Léon Bottou,et al. Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.
[41] Youfu Li,et al. Adaptive weighted learning for linear regression problems via Kullback-Leibler divergence , 2013, Pattern Recognit..