Neural Networks for Sequential Data: a Pre-training Approach based on Hidden Markov Models
暂无分享,去创建一个
[1] Geoffrey E. Hinton. Learning to represent visual input , 2010, Philosophical Transactions of the Royal Society B: Biological Sciences.
[2] Vasant Honavar,et al. Discriminatively trained Markov model for sequence classification , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).
[3] David Haussler,et al. Exploiting Generative Models in Discriminative Classifiers , 1998, NIPS.
[4] Mehryar Mohri,et al. Rational Kernels: Theory and Algorithms , 2004, J. Mach. Learn. Res..
[5] Peter N. Yianilos,et al. Learning String-Edit Distance , 1996, IEEE Trans. Pattern Anal. Mach. Intell..
[6] Li Wei,et al. Semi-supervised time series classification , 2006, KDD '06.
[7] Peter E. Hart,et al. Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.
[8] Yoshua Bengio,et al. Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.
[9] Marc'Aurelio Ranzato,et al. Building high-level features using large scale unsupervised learning , 2011, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[10] Philipp Slusallek,et al. Introduction to real-time ray tracing , 2005, SIGGRAPH Courses.
[11] Mehryar Mohri,et al. Rational Kernels , 2002, NIPS.
[12] Enrique Vidal,et al. Computation of Normalized Edit Distance and Applications , 1993, IEEE Trans. Pattern Anal. Mach. Intell..
[13] Alessandro Sperduti,et al. A general framework for adaptive processing of data structures , 1998, IEEE Trans. Neural Networks.
[14] Jason Weston,et al. A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.
[15] Rajat Raina,et al. Large-scale deep unsupervised learning using graphics processors , 2009, ICML '09.
[16] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[17] John F. Kolen,et al. From Sequences to Data Structures: Theory and Applications , 2001 .
[18] Laurens van der Maaten,et al. Learning Discriminative Fisher Kernels , 2011, ICML.
[19] Isabelle Guyon,et al. An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..
[20] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.
[21] Antonia J. Jones,et al. Feature selection for genetic sequence classification , 1998, Bioinform..
[22] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.
[23] Razvan Pascanu,et al. Advances in optimizing recurrent networks , 2012, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[24] Peter E. Hart,et al. The condensed nearest neighbor rule (Corresp.) , 1968, IEEE Trans. Inf. Theory.
[25] Yoshua Bengio,et al. Modeling Temporal Dependencies in High-Dimensional Sequences: Application to Polyphonic Music Generation and Transcription , 2012, ICML.
[26] Markus Hagenbuchner,et al. Learning Nonsparse Kernels by Self-Organizing Maps for Structured Data , 2009, IEEE Transactions on Neural Networks.
[27] Ilya Sutskever,et al. Learning Recurrent Neural Networks with Hessian-Free Optimization , 2011, ICML.
[28] Geoffrey E. Hinton,et al. Unsupervised learning : foundations of neural computation , 1999 .
[29] Alberto Maria Segre,et al. Programs for Machine Learning , 1994 .
[30] Geoffrey E. Hinton,et al. Acoustic Modeling Using Deep Belief Networks , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[31] Christina S. Leslie,et al. Fast String Kernels using Inexact Matching for Protein Sequences , 2004, J. Mach. Learn. Res..
[32] Jeffrey L. Elman,et al. Finding Structure in Time , 1990, Cogn. Sci..
[33] Geoffrey E. Hinton,et al. Factored conditional restricted Boltzmann Machines for modeling motion style , 2009, ICML '09.
[34] Geoffrey E. Hinton,et al. The Recurrent Temporal Restricted Boltzmann Machine , 2008, NIPS.
[35] Michele De Filippo De Grazia,et al. Deep Unsupervised Learning on a Desktop PC: A Primer for Cognitive Scientists , 2013, Front. Psychol..
[36] Mert Bay,et al. Evaluation of Multiple-F0 Estimation and Tracking Systems , 2009, ISMIR.
[37] Corinna Cortes,et al. Support-Vector Networks , 1995, Machine Learning.
[38] Hava T. Siegelmann,et al. The Computational Power of Interactive Recurrent Neural Networks , 2012, Neural Computation.
[39] Nir Friedman,et al. Probabilistic Graphical Models - Principles and Techniques , 2009 .
[40] M. Inés Torres,et al. Comparative Study of the Baum-Welch and Viterbi Training Algorithms Applied to Read and Spontaneous Speech Recognition , 2003, IbPRIA.
[41] Geoffrey E. Hinton. Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.
[42] Li Wei,et al. Fast time series classification using numerosity reduction , 2006, ICML.
[43] Paul J. Werbos,et al. Backpropagation Through Time: What It Does and How to Do It , 1990, Proc. IEEE.
[44] Eamonn J. Keogh,et al. Time series shapelets: a new primitive for data mining , 2009, KDD.
[45] Yves Chauvin,et al. Backpropagation: theory, architectures, and applications , 1995 .
[46] Yoshua Bengio,et al. Why Does Unsupervised Pre-training Help Deep Learning? , 2010, AISTATS.
[47] J. Elman. Learning and development in neural networks: the importance of starting small , 1993, Cognition.
[48] J. Ross Quinlan,et al. C4.5: Programs for Machine Learning , 1992 .
[49] Jason Weston,et al. Curriculum learning , 2009, ICML '09.
[50] Alessandro Sperduti,et al. A HMM-based pre-training approach for sequential data , 2014, ESANN.
[51] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[52] Bilge Mutlu,et al. How Do Humans Teach: On Curriculum Learning and Teaching Dimension , 2011, NIPS.