Reference Hub3
ISCG: An Intelligent Sensing and Caption Generation System for Object Detection and Captioning Using Deep Learning

ISCG: An Intelligent Sensing and Caption Generation System for Object Detection and Captioning Using Deep Learning

Aahan Singh, Nithin Nagaraj, Srinidhi Hiriyannaiah, Lalit Mohan Patnaik
Copyright: © 2020 |Volume: 16 |Issue: 4 |Pages: 17
ISSN: 1548-3657|EISSN: 1548-3665|EISBN13: 9781799805144|DOI: 10.4018/IJIIT.2020100104
Cite Article Cite Article

MLA

Singh, Aahan, et al. "ISCG: An Intelligent Sensing and Caption Generation System for Object Detection and Captioning Using Deep Learning." IJIIT vol.16, no.4 2020: pp.51-67. http://doi.org/10.4018/IJIIT.2020100104

APA

Singh, A., Nagaraj, N., Hiriyannaiah, S., & Patnaik, L. M. (2020). ISCG: An Intelligent Sensing and Caption Generation System for Object Detection and Captioning Using Deep Learning. International Journal of Intelligent Information Technologies (IJIIT), 16(4), 51-67. http://doi.org/10.4018/IJIIT.2020100104

Chicago

Singh, Aahan, et al. "ISCG: An Intelligent Sensing and Caption Generation System for Object Detection and Captioning Using Deep Learning," International Journal of Intelligent Information Technologies (IJIIT) 16, no.4: 51-67. http://doi.org/10.4018/IJIIT.2020100104

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

Artificial intelligence has paved the way for different areas of computing such as speech recognition and translation, object detection, machine translation, and others. One of the goals of artificial general intelligence is to simulate human thinking and rationality within machines such that they are able to perceive their environment and then perform reasonable actions based on their perception. Creating a single model that performs every single task from visual perception to actuation is currently impossible. The system must be divided into several models each of which functions independently as well also contribute to the operation of the whole intelligent machine. In this paper, an intelligent sensing and caption generation (ISCG) system is proposed which is capable of detecting living/non-living objects and states of motion in images. The system consists of two separate modules of caption generator and intelligence engine with a Convolutional Neural Network (CNN) for determining the different objects in the images. Our model yields state-of-the-art performance on benchmarked dataset.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.