Construction of reliable image captioning system for web camera based traffic analysis on road transport application

The automated captioning of natural images with appropriate descriptions is an intriguing and complicated task in the field of image processing. On the other hand, Deep learning, which combines computer vision with natural language, has emerged in recent years. Image emphasization is a record file representation that allows a computer to understand the visual information of an image in one or more words. When it comes to connecting high-quality images, the expressive process not only requires the credentials of the primary item and scene but also the ability to analyse the status, physical characteristics, and connections. Many traditional algorithms substitute the image to the front image. The image characteristics are dynamic depending on the ambient condition of natural photographs. Image processing techniques fail to extract several characteristics from the specified image. Nonetheless, four properties from the images are accurately described by using our proposed technique. Based on the various filtering layers in the convolutional neural network (CNN), it is an advantage to extract different characteristics. The caption for the image is based on long short term memory (LSTM), which comes under recurrent neural network. In addition, the precise subtitling is compared to current conventional techniques of image processing and different deep learning models. The proposed method is performing well in natural images and web camera based images for traffic analysis. Besides, the proposed algorithm leverages good accuracy and reliable image captioning.

[1]  Cyrus Rashtchian,et al.  Every Picture Tells a Story: Generating Sentences from Images , 2010, ECCV.

[2]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[3]  Akey Sungheetha,et al.  A Comparative Machine Learning Study on IT Sector Edge Nearer to Working From Home (WFH) Contract Category for Improving Productivity , 2021 .

[4]  Yejin Choi,et al.  Composing Simple Image Descriptions using Web-scale N-grams , 2011, CoNLL.

[5]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[6]  Jitendra Malik,et al.  Region-Based Convolutional Networks for Accurate Object Detection and Segmentation , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Varsha Hole,et al.  Real-Time Environment Description Application for Visually Challenged People , 2019 .

[8]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[9]  Sathish,et al.  Faultless Decision Making for False Information in Online: A Systematic Approach , 2021 .

[10]  M. Tripathi Analysis of Convolutional Neural Network based Image Classification Techniques , 2021, Journal of Innovative Image Processing.

[11]  Yejin Choi,et al.  Baby talk: Understanding and generating simple image descriptions , 2011, CVPR 2011.

[12]  Samy Bengio,et al.  Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Ramakant Nevatia,et al.  Automatic Concept Discovery from Parallel Text and Visual Corpora , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[14]  Mihir Mistry,et al.  Machine Learning Based User Interface Generation , 2019 .

[15]  Vipul Salunke,et al.  Image Sentiment Analysis Using Deep Learning , 2021 .

[16]  J SamuelManoharan Capsule Network Algorithm for Performance Optimization of Text Classification , 2021 .

[17]  Geoffrey Zweig,et al.  From captions to visual concepts and back , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Fei-Fei Li,et al.  Deep visual-semantic alignments for generating image descriptions , 2015, CVPR.

[19]  Geoffrey Zweig,et al.  Language Models for Image Captioning: The Quirks and What Works , 2015, ACL.

[20]  Vijayakumar T,et al.  Fusion based Feature Extraction Analysis of ECG Signal Interpretation - A Systematic Approach , 2021, March 2021.

[21]  Svetlana Lazebnik,et al.  Improving Image-Sentence Embeddings Using Large Weakly Annotated Photo Collections , 2014, ECCV.

[22]  E. E. Babikir Adam,et al.  Construction of Accurate Crack Identification on Concrete Structure using Hybrid Deep Learning Approach , 2021 .

[23]  Smys S,et al.  Naive Bayes and Entropy based Analysis and Classification of Humans and Chat Bots , 2021 .

[24]  Samuel Manoharan J,et al.  A Novel User Layer Cloud Security Model based on Chaotic Arnold Transformation using Fingerprint Biometric Traits , 2021 .

[25]  S. Revathy,et al.  An Automated Invoice Handling Method Using OCR , 2021 .

[26]  Edriss E. B. Adam,et al.  Evaluation of Fingerprint Liveness Detection by Machine Learning Approach - A Systematic View , 2021 .

[27]  V. Geetha Lekshmy,et al.  Role of Non-textual Contents and Citations in Plagiarism Detection , 2021 .

[28]  Vicente Ordonez,et al.  Im2Text: Describing Images Using 1 Million Captioned Photographs , 2011, NIPS.

[29]  Xu Yu,et al.  SVMs Classification Based Two-side Cross Domain Collaborative Filtering by inferring intrinsic user and item features , 2018, Knowl. Based Syst..

[30]  Kottilingam Dr Dr,et al.  A DYNAMIC ROUTING MODEL FOR HYBRID ELECTRIC VEHICLES , 2019, Journal of Electrical Engineering and Automation.