BanglaLekha-Isolated: A multi-purpose comprehensive dataset of Handwritten Bangla Isolated characters

BanglaLekha-Isolated, a Bangla handwritten isolated character dataset is presented in this article. This dataset contains 84 different characters comprising of 50 Bangla basic characters, 10 Bangla numerals and 24 selected compound characters. 2000 handwriting samples for each of the 84 characters were collected, digitized and pre-processed. After discarding mistakes and scribbles, 1,66,105 handwritten character images were included in the final dataset. The dataset also includes labels indicating the age and the gender of the subjects from whom the samples were collected. This dataset could be used not only for optical handwriting recognition research but also to explore the influence of gender and age on handwriting. The dataset is publicly available at https://data.mendeley.com/datasets/hf6sf8zrkc/2.

[1]  E. Kimes,et al.  Evaluation of Vancomycin TDM Strategies: Prediction and Prevention of Kidney Injuries Based on Vancomycin TDM Results , 2023, Journal of Korean medical science.

[2]  Bidyut Baran Chaudhuri,et al.  Handwritten Numeral Databases of Indian Scripts and Multistage Recognition of Mixed Numerals , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[4]  Subhadip Basu,et al.  A benchmark image database of isolated Bangla handwritten compound characters , 2014, International Journal on Document Analysis and Recognition (IJDAR).

[5]  K. Murphy,et al.  Overview of Machine Learning , 2022, International Journal of Advanced Research in Science, Communication and Technology.

[6]  Yann LeCun,et al.  Regularization of Neural Networks using DropConnect , 2013, ICML.

[7]  Sebastiano Impedovo,et al.  More than twenty years of advancements on Frontiers in handwriting recognition , 2014, Pattern Recognit..