Learning Efficient Representations for Sequence Retrieval

Background

In many domains, the most natural representation for data is as sequences of feature vectors. For example, in speech recognition, recorded utterances are typically transformed into a series of vectors describing the frequency content over short periods of time [1]. Similarly, in natural language processing tasks, sentences are often represented as sequences of vectors where each word corresponds to a unique vector [2]. Many off-the-shelf machine learning approaches assume that feature vectors are independent, so modeling the sequential nature of these representations often requires special treatment.
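To make these two representations concrete, the sketch below (hypothetical helper names, NumPy only) constructs both: short-time magnitude spectra for an audio signal, and one-hot indicator vectors for words in a sentence. This is a minimal illustration, not the exact pipeline of [1] or [2]; frame lengths, hop sizes, and the indicator encoding are assumptions chosen for simplicity.

```python
import numpy as np

def frames_from_signal(signal, frame_length=1024, hop=512):
    """Slice a 1-D audio signal into overlapping windowed frames and
    return the magnitude spectrum of each frame, yielding a sequence
    of feature vectors (one per short time window)."""
    window = np.hanning(frame_length)
    n_frames = 1 + (len(signal) - frame_length) // hop
    frames = np.stack([signal[i * hop:i * hop + frame_length] * window
                       for i in range(n_frames)])
    # The real FFT of each frame describes its frequency content
    return np.abs(np.fft.rfft(frames, axis=1))

def one_hot_sequence(sentence, vocabulary):
    """Map each word in a sentence to a unique indicator vector,
    yielding a sequence of feature vectors (one per word)."""
    index = {word: i for i, word in enumerate(vocabulary)}
    matrix = np.zeros((len(sentence), len(vocabulary)))
    for t, word in enumerate(sentence):
        matrix[t, index[word]] = 1.0
    return matrix

# Example usage
audio = np.random.randn(16000)           # one second of synthetic audio at 16 kHz
spectra = frames_from_signal(audio)      # shape: (n_frames, frame_length // 2 + 1)

vocab = ["the", "cat", "sat"]
word_vectors = one_hot_sequence(["the", "cat", "sat"], vocab)  # shape: (3, 3)
```

In both cases the result is a matrix whose rows are ordered feature vectors, and it is exactly this ordering that independence-assuming models discard.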