论文信息 - Development of a computationally efficient voice conversion system on mobile phones - 字舞流文

Development of a computationally efficient voice conversion system on mobile phones

Cheng Xiang | Dong-Yan Huang | Xiaoling Wu | Shuhua Gao | Shuhua Gao | Dong Huang | Cheng Xiang | Xiaoling Wu

[1] Cláudio T. Silva,et al. Robust Smooth Feature Extraction from Point Clouds , 2007, IEEE International Conference on Shape Modeling and Applications 2007 (SMI '07).

[2] Daniel Erro,et al. On combining statistical methods and frequency warping for high-quality voice conversion , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[3] Li-Rong Dai,et al. Voice Conversion Using Deep Neural Networks With Layer-Wise Generative Training , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[4] Ricardo Gutierrez-Osuna,et al. Can voice conversion be used to reduce non-native accents? , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5] Donald J. Berndt,et al. Using Dynamic Time Warping to Find Patterns in Time Series , 1994, KDD Workshop.

[6] José Luis Martínez,et al. Energy efficient low-cost video communications , 2012, IEEE Transactions on Consumer Electronics.

[7] Haizhou Li,et al. Exemplar-based voice conversion using non-negative spectrogram deconvolution , 2013, SSW.

[8] Mark D. Hill,et al. Amdahl's Law in the Multicore Era , 2008 .

[9] Borko Furht,et al. Parallel programming for multimedia applications , 2010, Multimedia Tools and Applications.

[10] Kuldip K. Paliwal,et al. Interpolation properties of linear prediction parametric representations , 1995, EUROSPEECH.

[11] Moncef Gabbouj,et al. Voice Conversion Using Partial Least Squares Regression , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[12] Haizhou Li,et al. Voice conversion versus speaker verification: an overview , 2014 .

[13] Tomoki Toda,et al. GMM-based voice conversion applied to emotional speech synthesis , 2003, INTERSPEECH.

[14] Eric Moulines,et al. High-quality speech modification based on a harmonic + noise model , 1995, EUROSPEECH.

[15] Alexander Kain,et al. Spectral voice conversion for text-to-speech synthesis , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[16] Amro El-Jaroudi,et al. Discrete all-pole modeling , 1991, IEEE Trans. Signal Process..

[17] Daniel Erro,et al. Voice Conversion Based on Weighted Frequency Warping , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[18] Tomoki Toda,et al. One-to-Many and Many-to-One Voice Conversion Based on Eigenvoices , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[19] Ning Xu,et al. Voice conversion based on Gaussian processes by coherent and asymmetric training with limited training data , 2014, Speech Commun..

[20] Chng Eng Siong,et al. High quality voice conversion using prosodic and high-resolution spectral features , 2015, Multimedia Tools and Applications.

[21] Antonio Bonafonte,et al. Voice conversion using k-histograms and frame selection , 2009, INTERSPEECH.

[22] Hideki Kawahara,et al. Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds , 1999, Speech Commun..

[23] Seyed Hamidreza Mohammadi,et al. An overview of voice conversion systems , 2017, Speech Commun..

[24] Keikichi Hirose,et al. One-to-Many Voice Conversion Based on Tensor Representation of Speaker Space , 2011, INTERSPEECH.

[25] P. Depalle,et al. Extraction of spectral peak parameters using a short-time Fourier transform modeling and no sidelobe windows , 1997, Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics.

[26] Jun Zhou,et al. Use of SIMD Vector Operations to Accelerate Application Code Performance on Low-Powered ARM and Intel Platforms , 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.

[27] Peng Song,et al. Voice conversion using support vector regression , 2011 .

[28] Tomoki Toda,et al. Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[29] Samuel Williams,et al. The Landscape of Parallel Computing Research: A View from Berkeley , 2006 .

[30] Tack-Don Han,et al. Mobile digital image stabilisation using SIMD data path , 2012 .

[31] Tomoki Toda,et al. Implementation of Computationally Efficient Real-Time Voice Conversion , 2012, INTERSPEECH.

[32] Kishore Prahallad,et al. Spectral Mapping Using Artificial Neural Networks for Voice Conversion , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[33] L. Dagum,et al. OpenMP: an industry standard API for shared-memory programming , 1998 .

[34] Haizhou Li,et al. Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[35] Bayya Yegnanarayana,et al. Transformation of formants for voice conversion using artificial neural networks , 1995, Speech Commun..

[36] Xia Wang,et al. Phoneme cluster based state mapping for text-independent voice conversion , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[37] Tetsuya Takiguchi,et al. Voice conversion in high-order eigen space using deep belief nets , 2013, INTERSPEECH.

[38] Ricardo Gutierrez-Osuna,et al. Foreign accent conversion in computer assisted pronunciation training , 2009, Speech Commun..

[39] Olivier Rosec,et al. Voice Conversion Using Dynamic Frequency Warping With Amplitude Scaling, for Parallel or Nonparallel Corpora , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[40] John-Paul Hosom,et al. Improving the intelligibility of dysarthric speech , 2007, Speech Commun..

[41] Haohong Wang,et al. A mobile world made of functions , 2017, APSIPA Transactions on Signal and Information Processing.

[42] Eric Moulines,et al. Continuous probabilistic transform for voice conversion , 1998, IEEE Trans. Speech Audio Process..

[43] Inma Hernáez,et al. Parametric Voice Conversion Based on Bilinear Frequency Warping Plus Amplitude Scaling , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[44] Wojciech Kwedlo. A Parallel EM Algorithm for Gaussian Mixture Models Implemented on a NUMA System Using OpenMP , 2014, 2014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing.

[45] Tomoki Toda,et al. Statistical singing voice conversion with direct waveform modification based on the spectrum differential , 2014, INTERSPEECH.

[46] Tomoki Toda,et al. The Voice Conversion Challenge 2016 , 2016, INTERSPEECH.

[47] Yung-Hwan Oh,et al. Hidden Markov model based voice conversion using dynamic characteristics of speaker , 1997, EUROSPEECH.

[48] Satoshi Nakamura,et al. Voice conversion through vector quantization , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[49] Eric Moulines,et al. Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones , 1989, Speech Commun..

[50] Daniel Erro,et al. Flexible harmonic/stochastic speech synthesis , 2007, SSW.