A Dataset and a Novel Neural Approach for Optical Gregg Shorthand Recognition

Gregg shorthand is the most popular form of pen stenography in the United States. It has been adapted for many other languages. In order to substantially explore the potentialities of performing optical recognition of Gregg shorthand, we develop and present Gregg-1916, a dataset that comprises Gregg shorthand scripts of about 16 thousand common English words. In addition, we present a novel architecture for shorthand recognition which exhibits promising performance and opens up the path for various further directions.

[1]  Zheru Chi,et al.  Segmentation and recognition of on-line Pitman shorthand outlines using neural networks , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[2]  Razvan Pascanu,et al.  On the difficulty of training recurrent neural networks , 2012, ICML.

[3]  Graham Leedham,et al.  Segmentation and recognition of phonetic features in handwritten Pitman shorthand , 2008, Pattern Recognit..

[4]  Cordelia Schmid,et al.  Areas of Attention for Image Captioning , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[5]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[6]  François Chollet,et al.  Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[8]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[9]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[10]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[11]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[12]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[13]  Fei-Fei Li,et al.  Deep visual-semantic alignments for generating image descriptions , 2015, CVPR.

[14]  K. Ramar,et al.  Handwritten Gregg Shorthand Recognition , 2012 .