Image Captioning for Thai Cultures

Before a trip, tourists generally gather information and photos about the places they plan to visit. This work aims to provide additional information about tourist sites in Thailand through automatic image captioning, the process of generating a textual description for a given image. In recent years, methods that combine image processing with natural language processing have gained attention worldwide. Image captioning can be regarded as a sequence-to-sequence modeling problem, since it converts an image, treated as a sequence of pixels, into a sequence of words. This work proposes a fine-tuned model that combines CNNs with an LSTM to generate image descriptions. In the experiments, we evaluate the model with BLEU.
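As a rough illustration of the evaluation metric named above, the following is a minimal sketch of sentence-level BLEU: clipped n-gram precision combined with a brevity penalty. It is not the evaluation code used in this work. The function names `ngrams` and `bleu`, the whitespace tokenization, the example captions, and the choice to return 0 when any n-gram order has no match (no smoothing) are all assumptions made for illustration.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count every contiguous n-gram in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(candidate, references, max_n=4):
    """Unsmoothed sentence-level BLEU with uniform n-gram weights (illustrative)."""
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(candidate, n)
        total = sum(cand_counts.values())
        if total == 0:
            return 0.0  # candidate is shorter than n tokens
        # For each n-gram, keep the highest count seen in any single reference.
        max_ref = Counter()
        for ref in references:
            for gram, count in ngrams(ref, n).items():
                max_ref[gram] = max(max_ref[gram], count)
        # Clip candidate counts so repeated words are not over-rewarded.
        clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
        if clipped == 0:
            return 0.0  # no smoothing: one empty order zeroes the score
        log_precisions.append(math.log(clipped / total))
    # Brevity penalty, using the reference length closest to the candidate length.
    c = len(candidate)
    r = min((abs(len(ref) - c), len(ref)) for ref in references)[1]
    bp = 1.0 if c > r else math.exp(1 - r / c)
    return bp * math.exp(sum(log_precisions) / max_n)


# Hypothetical captions, tokenized by whitespace.
reference = "a golden temple by the river".split()
generated = "a golden temple beside the river".split()
print(bleu(generated, [reference], max_n=2))
```

Because the score is a geometric mean over n-gram orders, short captions often score zero at higher orders without smoothing, which is why the usage line above restricts the computation to unigrams and bigrams.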
