Neural Network Approaches for Noisy Language Modeling

Text entry from people is not only grammatical and distinct, but also noisy. For example, a user's typing stream contains all the information about the user's interaction with computer using a QWERTY keyboard, which may include the user's typing mistakes as well as specific vocabulary, typing habit, and typing performance. In particular, these features are obvious in disabled users' typing streams. This paper proposes a new concept called noisy language modeling by further developing information theory and applies neural networks to one of its specific application-typing stream. This paper experimentally uses a neural network approach to analyze the disabled users' typing streams both in general and specific ways to identify their typing behaviors and subsequently, to make typing predictions and typing corrections. In this paper, a focused time-delay neural network (FTDNN) language model, a time gap model, a prediction model based on time gap, and a probabilistic neural network model (PNN) are developed. A 38% first hitting rate (HR) and a 53% first three HR in symbol prediction are obtained based on the analysis of a user's typing history through the FTDNN language modeling, while the modeling results using the time gap prediction model and the PNN model demonstrate that the correction rates lie predominantly in between 65% and 90% with the current testing samples, and 70% of all test scores above basic correction rates, respectively. The modeling process demonstrates that a neural network is a suitable and robust language modeling tool to analyze the noisy language stream. The research also paves the way for practical application development in areas such as informational analysis, text prediction, and error correction by providing a theoretical basis of neural network approaches for noisy language modeling.

[1]  U. Ohler,et al.  Promoter Prediction on a Genomic Scale – the Adh Experience , 2000 .

[2]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[3]  Khashayar Khorasani,et al.  Adaptive time delay neural network structures for nonlinear system identification , 2002, Neurocomputing.

[4]  Michael R. Davenport,et al.  Continuous-time temporal back-propagation with adaptable time delays , 1993, IEEE Trans. Neural Networks.

[5]  Wei Xu,et al.  Can artificial neural networks learn language models? , 2000, INTERSPEECH.

[6]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[7]  Mukul Agarwal,et al.  Long-range prediction for poorly-known systems , 1995 .

[8]  Hassan B. Kazemian,et al.  A neural network based solution for automatic typing errors correction , 2010, Neural Computing and Applications.

[9]  Jean-Luc Gauvain,et al.  Training Neural Network Language Models on Very Large Corpora , 2005, HLT.

[10]  Hongjian Zhang,et al.  Time-delay neural network for the prediction of carbonation tower's temperature , 2003, IEEE Trans. Instrum. Meas..

[11]  Claude E. Shannon,et al.  Prediction and Entropy of Printed English , 1951 .

[12]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[13]  Philip D. Wasserman,et al.  Advanced methods in neural computing , 1993, VNR computer library.

[14]  Khalid Sayood,et al.  Introduction to data compression (2nd ed.) , 2000 .

[15]  Shari Trewin,et al.  A model of keyboard configuration requirements , 1998, Assets '98.

[16]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[17]  A Probabilistic Neural Network for Human Face Identification based on Fuzzy Logic Chromatic Rules , .

[18]  I. Scott MacKenzie,et al.  Input-based Language Modelling in the Design of High Performance Text Input Techniques , 2003, Graphics Interface.

[19]  Geoffrey E. Hinton,et al.  Learning distributed representations of concepts. , 1989 .

[20]  Geoffrey E. Hinton,et al.  Extracting distributed representations of concepts and relations from positive and negative propositions , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[21]  P. Fitts The information capacity of the human motor system in controlling the amplitude of movement. , 1954, Journal of experimental psychology.

[22]  Alan F. Blackwell,et al.  Dasher—a data entry interface using continuous gestures and language models , 2000, UIST '00.

[23]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[24]  Yoshua Bengio,et al.  A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..

[25]  G. Blelloch Introduction to Data Compression * , 2022 .

[26]  Kwok-wing Chau,et al.  A hybrid adaptive time-delay neural network model for multi-step-ahead prediction of sunspot activity , 2006 .

[27]  Shari Trewin Automating accessibility: the dynamic keyboard , 2003, Assets '04.

[28]  Darrel Hankerson,et al.  Introduction to Information Theory and Data Compression , 2003 .

[29]  Geoffrey E. Hinton,et al.  Learning Distributed Representations of Concepts Using Linear Relational Embedding , 2001, IEEE Trans. Knowl. Data Eng..

[30]  Peter Norvig,et al.  Artificial intelligence - a modern approach, 2nd Edition , 2003, Prentice Hall series in artificial intelligence.

[31]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[32]  P. Haffner,et al.  Multi-State Time Delay Neural Networks for Continuous Speech Recognition , 1991 .

[33]  Chushin Hosaka A CRITICAL BIOGRAPHY , 1987 .

[34]  P. Werbos,et al.  Long-term predictions of chemical processes using recurrent neural networks: a parallel training approach , 1992 .

[35]  Nikos Fakotakis,et al.  Probabilistic neural networks combined with GMMs for speaker recognition over telephone channels , 2002, 2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628).

[36]  Lukás Burget,et al.  Recurrent neural network based language model , 2010, INTERSPEECH.

[37]  Michael J. Watts,et al.  IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS Publication Information , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[38]  Wilfred Beeching Century of the Typewriter , 1974 .

[39]  Ian H. Witten,et al.  Data Compression Using Adaptive Coding and Partial String Matching , 1984, IEEE Trans. Commun..

[40]  Pavel Pudil,et al.  Probabilistic neural network playing and learning Tic-Tac-Toe , 2005, Pattern Recognit. Lett..