暂无分享,去创建一个
Vijay Janapa Reddi | Maximilian Lam | Mark Mazumder | David Kanter | Juan Ciro | Daniel Galvez | Greg Diamos | Juan Felipe Cer'on | Keith Achorn | Anjali Gopi | Maximilian Lam | G. Diamos | David Kanter | V. Reddi | Mark Mazumder | Juan Ciro | Keith Achorn | Daniel Galvez | Anjali Gopi
[1] Quinten McNamara,et al. Earnings-21: A Practical Benchmark for ASR in the Wild , 2021, Interspeech 2021.
[2] Xiangang Li,et al. GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10, 000 Hours of Transcribed Audio , 2021, Interspeech.
[3] Armand Joulin,et al. Libri-Light: A Benchmark for ASR with Limited or No Supervision , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[4] Boris Ginsburg,et al. Quartznet: Deep Automatic Speech Recognition with 1D Time-Channel Separable Convolutions , 2019, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[5] Sanjeev Khudanpur,et al. Librispeech: An ASR corpus based on public domain audio books , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[6] Geoffrey Zweig,et al. Large scale weakly and semi-supervised learning for low-resource video ASR , 2020, INTERSPEECH.
[7] Roland Vollgraf,et al. Contextual String Embeddings for Sequence Labeling , 2018, COLING.
[8] Sanjeev Khudanpur,et al. X-Vectors: Robust DNN Embeddings for Speaker Recognition , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[9] Chong Wang,et al. Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin , 2015, ICML.
[10] Piotr Indyk,et al. Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.
[11] Vijay Janapa Reddi,et al. Data Engineering for Everyone , 2021, ArXiv.
[12] Timothy Baldwin,et al. langid.py: An Off-the-shelf Language Identification Tool , 2012, ACL.
[13] Douglas A. Reynolds,et al. Language Recognition via i-vectors and Dimensionality Reduction , 2011, INTERSPEECH.
[14] Gabriel Synnaeve,et al. MLS: A Large-Scale Multilingual Dataset for Speech Research , 2020, INTERSPEECH.
[15] Pavel Korshunov,et al. Pyannote.Audio: Neural Building Blocks for Speaker Diarization , 2019, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[16] Francis M. Tyers,et al. Common Voice: A Massively-Multilingual Speech Corpus , 2020, LREC.
[17] Aren Jansen,et al. Audio Set: An ontology and human-labeled dataset for audio events , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[18] Justin Luitjens,et al. Gpu-Accelerated Viterbi Exact Lattice Decoder for Batched Online and Offline Speech Recognition , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).