暂无分享,去创建一个
Ariya Rastrow | Grant P. Strimel | Jinru Su | Jonathan Macoskey | A. Rastrow | J. Macoskey | Jinru Su
[1] Rohit Prabhavalkar,et al. On the Efficient Representation and Execution of Deep Acoustic Models , 2016, INTERSPEECH.
[2] Suyog Gupta,et al. To prune, or not to prune: exploring the efficacy of pruning for model compression , 2017, ICLR.
[3] Jürgen Schmidhuber,et al. LSTM: A Search Space Odyssey , 2015, IEEE Transactions on Neural Networks and Learning Systems.
[4] Ding Zhao,et al. Dynamic Sparsity Neural Networks for Automatic Speech Recognition , 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[5] Tara N. Sainath,et al. A Streaming On-Device End-To-End Model Surpassing Server-Side Conventional Model Quality and Latency , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[6] Zhong Meng,et al. Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability , 2020, INTERSPEECH.
[7] Alex Graves,et al. Adaptive Computation Time for Recurrent Neural Networks , 2016, ArXiv.
[8] Jiangyan Yi,et al. Self-Attention Transducers for End-to-End Speech Recognition , 2019, INTERSPEECH.
[9] Tomas Mikolov,et al. Variable Computation in Recurrent Neural Networks , 2016, ICLR.
[10] Sanjeev Khudanpur,et al. Librispeech: An ASR corpus based on public domain audio books , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[11] Trevor Hastie,et al. Statistical Learning with Sparsity: The Lasso and Generalizations , 2015 .
[12] Tara N. Sainath,et al. Compression of End-to-End Models , 2018, INTERSPEECH.
[13] Yashesh Gaur,et al. On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition , 2020, INTERSPEECH.
[14] Ling Li,et al. Addressing Sparsity in Deep Neural Networks , 2019, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.
[15] Navdeep Jaitly,et al. Hybrid speech recognition with Deep Bidirectional LSTM , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[16] Erich Elsen,et al. The State of Sparsity in Deep Neural Networks , 2019, ArXiv.
[17] A. Bruckstein,et al. K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .
[18] Ben Poole,et al. Categorical Reparameterization with Gumbel-Softmax , 2016, ICLR.
[19] Yu Zhang,et al. Simple Recurrent Units for Highly Parallelizable Recurrence , 2017, EMNLP.
[20] Varun Sharma,et al. Fast Intent Classification for Spoken Language Understanding Systems , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[21] Gholamreza Anbarjafari,et al. Lossy image compression using singular value decomposition and wavelet difference reduction , 2014, Digit. Signal Process..
[22] Tara N. Sainath,et al. Streaming End-to-end Speech Recognition for Mobile Devices , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[23] Ian McGraw,et al. On the compression of recurrent neural networks with an application to LVCSR acoustic modeling for embedded speech recognition , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[24] Yun Jiang,et al. SVD Compression for Magnetic Resonance Fingerprinting in the Time Domain , 2014, IEEE Transactions on Medical Imaging.
[25] Tara N. Sainath,et al. Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model , 2019, INTERSPEECH.
[26] Chong-Min Kyung,et al. Efficient Neural Network Compression , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[27] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[28] Sridhar Swaminathan,et al. Sparse low rank factorization for deep neural network compression , 2020, Neurocomputing.
[29] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[30] H. Andrews,et al. Singular Value Decomposition (SVD) Image Coding , 1976, IEEE Trans. Commun..
[31] Joelle Pineau,et al. Conditional Computation in Neural Networks for faster models , 2015, ArXiv.
[32] Geoffrey Zweig,et al. Exploring multidimensional lstms for large vocabulary ASR , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[33] Daniel S. Park,et al. Efficient Knowledge Distillation for RNN-Transducer Models , 2020, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[34] Ariya Rastrow,et al. Bifocal Neural ASR: Exploiting Keyword Spotting for Inference Optimization , 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[35] Chng Eng Siong,et al. Cross Attention with Monotonic Alignment for Speech Transformer , 2020, INTERSPEECH.
[36] Kaile Su,et al. Long Short-Term Memory Projection Recurrent Neural Network Architectures for Piano's Continuous Note Recognition , 2017, J. Robotics.
[37] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[38] Hieu Duy Nguyen,et al. Quantization Aware Training with Absolute-Cosine Regularization for Automatic Speech Recognition , 2020, INTERSPEECH.
[39] E. Gumbel. Bivariate Exponential Distributions , 1960 .
[40] Liang Qiao,et al. Optimizing Speech Recognition For The Edge , 2019, ArXiv.
[41] Venkatesh Saligrama,et al. Adaptive Neural Networks for Efficient Inference , 2017, ICML.
[42] Rohit Prabhavalkar,et al. Exploring architectures, data and units for streaming end-to-end speech recognition with RNN-transducer , 2017, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).