Réseaux de neurones de quaternions pour le traitement du langage

Machine Learning algorithms reach great performances on different Natural Language Processing tasks. Among these methods, Neural Networks (NN or MLP) recently received a great interest from researchers due to their capability to represent complex internal structures. However, MLPs employ basic word level or thème-based features and, therefore, reveal little in way of document statistical structure. We propose to address this issue by extending the NN to Quaternion called QMLP to take into consideration features dependencies. A well-dedicated segmentation of document approach is also compared to the one proposed in (Morchid et al., 2013). Experiments made on a SLU task with spoken dialogues show that our QMLP associated with the proposed document segmentation outperforms other approaches, with an gain of 2% and 3% compared to MLP and (Morchid et al., 2013) respectively. We finally demonstrated that less iterations are needed by QMLPs to reach better accuracies than MLP. MOTS-CLÉS : Réseaux de neurones, Quaternions, Traitement du langage.

[1]  Nobuyuki Matsui,et al.  Quaternionic Neural Networks: Fundamental Properties and Applications , 2009 .

[2]  Michael Fox Quaternions and rotation sequences, by Jack B. Kuipers. Pp. 371. £24.95 (pbk), £59.00 (hbk). 2002. ISBN 0 691 10298 8 (pbk), 0 691 05872 5 (hbk) (Princeton University Press). , 2006, The Mathematical Gazette.

[3]  Norbert Jankowski,et al.  Survey of Neural Transfer Functions , 1999 .

[4]  이주연,et al.  Latent Dirichlet Allocation (LDA) 모델 기반의 인공지능(A.I.) 기술 관련 연구 활동 및 동향 분석 , 2018 .

[5]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[6]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Geoffrey E. Hinton A Practical Guide to Training Restricted Boltzmann Machines , 2012, Neural Networks: Tricks of the Trade.

[8]  Nikos A. Aspragathos,et al.  A comparative study of three methods for robot kinematics , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[9]  Fuzhen Zhang Quaternions and matrices of quaternions , 1997 .

[10]  Andrew W. Senior,et al.  Fast and accurate recurrent neural network acoustic models for speech recognition , 2015, INTERSPEECH.

[11]  Giovanni Muscato,et al.  Multilayer Perceptrons to Approximate Quaternion Valued Functions , 1997, Neural Networks.

[12]  A. S. Solodovnikov,et al.  Hypercomplex Numbers: An Elementary Introduction to Algebras , 1989 .

[13]  Frédéric Béchet,et al.  DECODA: a call-centre human-human spoken conversation corpus , 2012, LREC.

[14]  Mohamed Morchid,et al.  Theme identification in telephone service conversations using quaternions of speech features , 2013, INTERSPEECH.

[15]  Tom Minka,et al.  Expectation-Propogation for the Generative Aspect Model , 2002, UAI.

[16]  J. P. Ward Quaternions and Cayley Numbers , 1997 .

[17]  Georges Linarès,et al.  The LIA Speech Recognition System: From 10xRT to 1xRT , 2007, TSD.

[18]  Yoshimi Suzuki,et al.  Keyword Extraction using Term-Domain Interdependence for Dictation of Radio News , 1998, COLING-ACL.

[19]  Gregor Heinrich Parameter estimation for text analysis , 2009 .