A document image classification system fusing deep and machine learning models

[1]  Yousri Kessentini,et al.  Transformer-Based Approach for Joint Handwriting and Named Entity Recognition in Historical documents , 2021, Pattern Recognit. Lett..

[2]  C. V. Jawahar,et al.  Asking questions on handwritten document collections , 2021, International Journal on Document Analysis and Recognition (IJDAR).

[3]  Leen-Kiat Soh,et al.  Investigating coupling preprocessing with shallow and deep convolutional neural networks in document image classification , 2021, J. Electronic Imaging.

[4]  Margrit Betke,et al.  Extracting text from scanned Arabic books: a large-scale benchmark dataset and a fine-tuned Faster-R-CNN model , 2021, International Journal on Document Analysis and Recognition (IJDAR).

[5]  Mickaël Coustaty,et al.  EAML: ensemble self-attention-based mutual learning network for document image classification , 2021, International Journal on Document Analysis and Recognition (IJDAR).

[6]  Zekai Chen,et al.  A concatenated approach based on transfer learning and PCA for classifying bees and wasps , 2021, Journal of Physics: Conference Series.

[7]  Ekin Ekinci,et al.  Searchable Turkish OCRed historical newspaper collection 1928–1942 , 2021, J. Inf. Sci..

[8]  Mruthyunjaya Mendu,et al.  Cloud based Machine learning with advanced predictive Analytics using Google Colaboratory , 2021 .

[9]  Enrique Vidal,et al.  Textual-Content-Based Classification of Bundles of Untranscribed Manuscript Images , 2021, 2020 25th International Conference on Pattern Recognition (ICPR).

[10]  Kirk Roberts,et al.  Automatic classification of scanned electronic health record documents , 2020, Int. J. Medical Informatics.

[11]  Perumadura De Silva,et al.  A convolutional neural network model for robust classification of document-images under real-world hard conditions , 2020 .

[12]  Zheng Huang,et al.  Attention-Based Graph Neural Network with Global Context Awareness for Document Understanding , 2020, CCL.

[13]  Zuheng Ming,et al.  Cross-Modal Deep Networks For Document Image Classification , 2020, 2020 IEEE International Conference on Image Processing (ICIP).

[14]  Jordi Torres,et al.  Improving Accuracy and Speeding Up Document Image Classification Through Parallel Systems , 2020, ICCS.

[15]  Mickaël Coustaty,et al.  Visual and Textual Deep Feature Fusion for Document Image Classification , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[16]  Marius Popescu,et al.  Self-Supervised Representation Learning on Document Images , 2020, DAS.

[17]  Furu Wei,et al.  LayoutLM: Pre-training of Text and Layout for Document Image Understanding , 2019, KDD.

[18]  Hamed Malek,et al.  Document Image Classification using SqueezeNet Convolutional Neural Network , 2019, 2019 5th Iranian Conference on Signal Processing and Intelligent Systems (ICSPIS).

[19]  Lovekesh Vig,et al.  Character Keypoint-Based Homography Estimation in Scanned Documents for Efficient Information Extraction , 2019, 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW).

[20]  Curtis Wigington,et al.  Multimodal Document Image Classification , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[21]  Muhammad Imran Malik,et al.  Two Stream Deep Network for Document Image Classification , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[22]  Süleyman Eken,et al.  DoCA: A Content-Based Automatic Classification System Over Digital Documents , 2019, IEEE Access.

[23]  Nicolas Audebert,et al.  Multimodal deep networks for text and image-based document classification , 2019, PKDD/ECML Workshops.

[24]  Quoc V. Le,et al.  EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks , 2019, ICML.

[25]  Yuliant Sibaroni,et al.  Sentiment analysis on hotel reviews using Multinomial Naïve Bayes classifier , 2019, Journal of Physics: Conference Series.

[26]  Mouloud Koudil,et al.  A Novel Active Learning Method Using SVM for Text Classification , 2018, Int. J. Autom. Comput..

[27]  Ujjwal Bhattacharya,et al.  Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[28]  Marcus Liwicki,et al.  Real-Time Document Image Classification Using Deep CNN and Extreme Learning Machines , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[29]  Chris Tensmeyer,et al.  Analysis of Convolutional Neural Networks for Document Image Classification , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[30]  Vijay Vasudevan,et al.  Learning Transferable Architectures for Scalable Image Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[31]  Hazim Kemal Ekenel,et al.  Comparison of convolutional neural network models for document image classification , 2017, 2017 25th Signal Processing and Communications Applications Conference (SIU).

[32]  Marcus Liwicki,et al.  Cutting the Error by Half: Investigation of Very Deep CNN and Advanced Training Strategies for Document Image Classification , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[33]  Sergey Zavalishin,et al.  Document Image Classification on the Basis of Layout Information , 2017, Visual Information Processing and Communication.

[34]  Ujjwal Bhattacharya,et al.  Generalized stacking of layerwise-trained Deep Convolutional Neural Networks for document image classification , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[35]  Ignazio Gallo,et al.  Embedded Textual Content for Document Image Classification with Convolutional Neural Networks , 2016, DocEng.

[36]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[38]  Bo Tang,et al.  Toward Optimal Feature Selection in Naive Bayes for Text Categorization , 2016, IEEE Transactions on Knowledge and Data Engineering.

[39]  Gabriela Csurka,et al.  Document image classification, with a specific view on applications of patent images , 2016, ArXiv.

[40]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  Marcus Liwicki,et al.  Deepdocclassifier: Document classification with deep Convolutional Neural Network , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[43]  Konstantinos G. Derpanis,et al.  Evaluation of deep convolutional nets for document image classification and retrieval , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[44]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Yi Li,et al.  Convolutional Neural Networks for Document Image Classification , 2014, 2014 22nd International Conference on Pattern Recognition.

[46]  Jayant Kumar,et al.  Structural similarity for document image classification and retrieval , 2014, Pattern Recognit. Lett..

[47]  Mark Hedges,et al.  Ocropodium: open source OCR for small-scale historical archives , 2012, J. Inf. Sci..

[48]  Anthony W. Kay,et al.  Tesseract: an open-source optical character recognition engine , 2007 .

[49]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[50]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[51]  Taorong Qiu,et al.  Document image classification: Progress over two decades , 2021, Neurocomputing.

[52]  Shoaib Ahmed Siddiqui,et al.  Self-Supervised Representation Learning for Document Image Classification , 2021, IEEE Access.

[53]  K. P. Soman,et al.  Performance Analysis of NASNet on Unconstrained Ear Recognition , 2019 .

[54]  L. Breiman Random Forests , 2001, Machine Learning.