A Non-image-based Subcharacter-level Method to Encode the Shape of Chinese Characters

[1] Alexander M. Rush, et al. Character-Aware Neural Language Models, 2015, AAAI.

[2] Yann Dauphin, et al. Convolutional Sequence to Sequence Learning, 2017, ICML.

[3] Dumitru Erhan, et al. Going deeper with convolutions, 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Sergey Ioffe, et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift, 2015, ICML.

[5] Masafumi Hagiwara, et al. CNN-encoded Radical-level Representation for Japanese Processing, 2018.

[6] D. Allport, et al. What Are the Functional Orthographic Units in Chinese Word Recognition: The Stroke or the Stroke Pattern?, 1996.

[7] Masafumi Hagiwara, et al. Radical-level Ideograph Encoder for RNN-based Sentiment Analysis of Chinese and Japanese, 2017, ACML.

[8] Daiki Shimada, et al. Document classification through image-based character embedding and wildcard training, 2016, 2016 IEEE International Conference on Big Data (Big Data).

[9] Hao Xin, et al. Joint Embeddings of Chinese Words, Characters, and Fine-grained Subcharacter Components, 2017, EMNLP.

[10] Rico Sennrich, et al. Neural Machine Translation of Rare Words with Subword Units, 2015, ACL.

[11] Xiang Zhang, et al. Character-level Convolutional Networks for Text Classification, 2015, NIPS.

[12] Jörg Tiedemann, et al. Character-based Joint Segmentation and POS Tagging for Chinese using Bidirectional RNN-CRF, 2017, IJCNLP.

[13] Lukasz Kaiser, et al. Attention is All you Need, 2017, NIPS.

[14] Timothy Baldwin, et al. Sub-character Neural Language Modelling in Japanese, 2017, SWCN@EMNLP.

[15] Frederick Liu, et al. Learning Character-level Compositionality with Visual Features, 2017, ACL.

[16] Ming-Wei Chang, et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, 2019, NAACL.

[17] Yoshua Bengio, et al. Understanding the difficulty of training deep feedforward neural networks, 2010, AISTATS.

[18] Yoshua Bengio, et al. Deep Sparse Rectifier Neural Networks, 2011, AISTATS.

[19] Taku Kudo, et al. SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing, 2018, EMNLP.

[20] Rui Li, et al. Multi-Granularity Chinese Word Embedding, 2016, EMNLP.

[21] Bofang Li, et al. Subcharacter Information in Japanese Embeddings: When Is It Worth It?, 2018.

[22] Mamoru Komachi, et al. Neural Machine Translation of Logographic Language Using Sub-character Level Information, 2018, WMT.

[23] Wenjie Li, et al. Component-Enhanced Chinese Character Embeddings, 2015, EMNLP.

[24] Tomas Mikolov, et al. Bag of Tricks for Efficient Text Classification, 2016, EACL.

[25] Jason Lee, et al. Fully Character-Level Neural Machine Translation without Explicit Segmentation, 2016, TACL.