Titouan Parcollet | Fabien Ringeval | Solange Rossato | Benjamin Lecouteux | Didier Schwab | Alexandre Allauzen | Laurent Besacier | Marco Dinarelli | Salima Mdhaffar | Ha Nguyen | Marcely Zanon Boito | Yannick Esteve | Hang Le | Sina Alisamir | Natalia Tomashenko | Solene Evain | Ziyi Tong | Francois Portet
[1] Francis M. Tyers, et al. Common Voice: A Massively-Multilingual Speech Corpus, 2020, LREC.
[2] J. Pino, et al. CoVoST 2 and Massively Multilingual Speech-to-Text Translation, 2020.
[3] Olivier Galibert, et al. The ETAPE corpus for the evaluation of speech-based TV content processing in the French language, 2012, LREC.
[4] Isabelle Tellier, et al. Un grand corpus oral « disponible » : le corpus d'Orléans 1968-2012, 2011.
[5] Yannick Estève, et al. Recent Advances in End-to-End Spoken Language Understanding, 2019, SLSP.
[6] Nicolas Obin, et al. Att-HACK: An Expressive Speech Database with Social Attitudes, 2020, Speech Prosody 2020.
[7] K. Scherer, et al. Introducing the Geneva Multimodal expression corpus for experimental research on emotion perception, 2012, Emotion.
[8] Ming-Wei Chang, et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, 2019, NAACL.
[9] Yiming Wang, et al. Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks, 2018, INTERSPEECH.
[10] Geoffrey E. Hinton, et al. A Simple Framework for Contrastive Learning of Visual Representations, 2020, ICML.
[11] Sanjeev Khudanpur, et al. A time delay neural network architecture for efficient modeling of long temporal contexts, 2015, INTERSPEECH.
[12] George Trigeorgis, et al. Adieu features? End-to-end speech emotion recognition using a deep convolutional recurrent network, 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[13] Rico Sennrich, et al. Neural Machine Translation of Rare Words with Subword Units, 2015, ACL.
[14] Frédéric Béchet, et al. The EPAC Corpus: Manual and Automatic Annotations of Conversational Speech in French Broadcast News, 2010, LREC.
[15] Frédéric Béchet, et al. Results of the French Evalda-Media evaluation campaign for literal understanding, 2006, LREC.
[16] Chris Dyer, et al. Learning Robust and Multilingual Speech Representations, 2020, FINDINGS.
[17] Ronan Collobert, et al. wav2vec: Unsupervised Pre-training for Speech Recognition, 2019, INTERSPEECH.
[18] Ronan Collobert, et al. Unsupervised Cross-lingual Representation Learning for Speech Recognition, 2020, Interspeech.
[19] Yoshua Bengio, et al. Neural Machine Translation by Jointly Learning to Align and Translate, 2014, ICLR.
[20] Quoc V. Le, et al. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition, 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[21] Fabien Ringeval, et al. Introducing the RECOLA multimodal corpus of remote collaborative and affective interactions, 2013, 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).
[22] James R. Glass, et al. Improved Speech Representations with Multi-Target Autoregressive Predictive Coding, 2020, ACL.
[23] Yu-An Chung, et al. Generative Pre-Training for Speech with Autoregressive Predictive Coding, 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[24] Laurent Besacier, et al. A Data Efficient End-to-End Spoken Language Understanding Architecture, 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[25] Yannick Estève, et al. End-To-End Named Entity And Semantic Concept Extraction From Speech, 2018, 2018 IEEE Spoken Language Technology Workshop (SLT).
[26] Alexei Baevski, et al. wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations, 2020, NeurIPS.
[27] Myle Ott, et al. fairseq: A Fast, Extensible Toolkit for Sequence Modeling, 2019, NAACL.
[28] Gabriel Synnaeve, et al. MLS: A Large-Scale Multilingual Dataset for Speech Research, 2020, INTERSPEECH.
[29] Emmanuelle Canut, et al. Mise à disposition de corpus oraux interactifs : le projet TCOF (Traitement de Corpus Oraux en Français), 2010.
[30] Hao Tang, et al. An Unsupervised Autoregressive Model for Speech Representation Learning, 2019, INTERSPEECH.
[31] L. Lin, et al. A concordance correlation coefficient to evaluate reproducibility, 1989, Biometrics.
[32] Philippe Gournay, et al. A Canadian French emotional speech dataset, 2018, MMSys.
[33] Dmytro Okhonko, et al. fairseq S2T: Fast Speech-to-Text Modeling with fairseq, 2020, AACL.
[34] Yannick Estève, et al. AlloSat: A New Call Center French Corpus for Satisfaction and Frustration Analysis, 2020, LREC.
[35] Emmanuel Dupoux, et al. VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation, 2021, ACL.
[36] Armand Joulin, et al. Unsupervised Pretraining Transfers Well Across Languages, 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[37] Yiming Wang, et al. Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI, 2016, INTERSPEECH.
[38] R Devon Hjelm, et al. Learning Representations by Maximizing Mutual Information Across Views, 2019, NeurIPS.
[39] Fethi Bougares, et al. Investigating Self-Supervised Pre-Training for End-to-End Speech Translation, 2020, INTERSPEECH.
[40] Luke S. Zettlemoyer, et al. Deep Contextualized Word Representations, 2018, NAACL.
[41] Elizabeth Salesky, et al. The Multilingual TEDx Corpus for Speech Recognition and Translation, 2021, Interspeech 2021.
[42] Daniel Povey, et al. The Kaldi Speech Recognition Toolkit, 2011.
[43] Fabien Ringeval, et al. Discriminatively Trained Recurrent Neural Networks for Continuous Dimensional Emotion Recognition from Audio, 2016, IJCAI.
[44] Alexei Baevski, et al. Effectiveness of self-supervised pre-training for speech recognition, 2019, ArXiv.
[45] Lukasz Kaiser, et al. Attention is All you Need, 2017, NIPS.
[46] Yannick Estève, et al. Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability, 2019, INTERSPEECH.