Ancient Asian Character Recognition for Literature Preservation and Understanding

This paper introduces a project for automatically recognizing ancient Asian characters by image processing and deep learning with the aim of preserving Asian culture. The ancient characters examined include Chinese and Indian characters, which are the most mysterious, wildly used, and historic in the ancient world, and also feature multiply types. The automatic recognition method consists of preprocessing and recognition processing. The preprocessing includes character segmentation and noise reduction, and the recognition processing has a conventional recognition and deep learning. The conventional recognition method consists of feature extraction and similarity calculation or classification, and data augmentation is a key part of the deep learning. Experimental results show that deep learning achieves a better recognition accuracy than conventional image processing. Our aim is to preserve ancient literature by digitizing it and clarifying the characters and how they change throughout history by means of accurate character recognition. We also hope to help people discover new knowledge from ancient literature.