Visual Image Caption Generation for Service Robotics and Industrial Applications


Abstract:

Image caption generation is the task of generating a sentence from a raw image, mimicking the human ability to acquire knowledge from what is seen. The difficulty of this task lies in combining multimodal knowledge learning, i.e. the recognition of objects, actions, scenes, humans, etc. To support semantic understanding in service robotics and other industrial applications, the captioning model must be enhanced to recognize the objects in a confined environment. We propose a template-based augmentation method that improves the object recognition capability of an image caption model while retaining its other capabilities. This work opens the way to a training procedure for image caption generation in which a caption dataset and a classification dataset are combined to train the deep captioning model. Our experiments show that the improved model outperforms the original model by a factor of four on the SPICE metric.
Date of Conference: 06-09 May 2019
Date Added to IEEE Xplore: 01 August 2019
Conference Location: Taipei, Taiwan

I. Introduction

Image captioning is the task of automatically generating a description of an image. The artificial intelligence must recognize the content of the given picture, including people, objects and their relationships, and form a semantically and syntactically correct sentence.
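
The template-based augmentation mentioned in the abstract can be illustrated with a minimal sketch. The templates, the function names (`caption_from_label`, `build_training_set`) and the data layout of `(image_id, text)` pairs are all assumptions made for illustration; the excerpt does not specify the templates or the training pipeline actually used by the authors.

```python
import random

# Hypothetical caption templates; the paper's actual templates are not
# given in this excerpt.
TEMPLATES = [
    "there is {article} {label} in the picture",
    "{article} {label} is shown in the image",
    "a photo of {article} {label}",
]


def indefinite_article(noun):
    """Pick 'a' or 'an' with a simple first-letter heuristic."""
    return "an" if noun[:1].lower() in "aeiou" else "a"


def caption_from_label(label, rng):
    """Turn a classification label into a synthetic caption via a template."""
    noun = label.replace("_", " ")
    template = rng.choice(TEMPLATES)
    return template.format(article=indefinite_article(noun), label=noun)


def build_training_set(caption_pairs, classification_pairs, seed=0):
    """Combine real (image_id, caption) pairs with template captions
    generated from (image_id, label) classification pairs."""
    rng = random.Random(seed)
    augmented = [
        (image_id, caption_from_label(label, rng))
        for image_id, label in classification_pairs
    ]
    combined = list(caption_pairs) + augmented
    rng.shuffle(combined)  # mix both sources before training
    return combined


if __name__ == "__main__":
    captions = [("img1", "a man rides a horse on the beach")]
    labels = [("img2", "coffee_mug"), ("img3", "umbrella")]
    for image_id, text in build_training_set(captions, labels):
        print(image_id, "->", text)
```

In this sketch the classification labels simply become additional caption targets, so a single captioning model could be trained on the mixed set; how the two sources are weighted or sampled during training is left open here.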
