skip to main content
10.1145/2499788.2499823acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicimcsConference Proceedingsconference-collections
research-article

Automatic image description by using word-level features

Published: 17 August 2013 Publication History

Abstract

Automatic image description is one of the challenging tasks of image recognitions. However, there are image descriptions that contain some too specific phrases that cannot be judged only from appearance of images. In this paper, we propose a novel approach to collect general phrases for generating image descriptions. On the assumption that there are high frequency phrases related to an query image in the image descriptions of similar images, we select nouns and their attribute phrases from the image descriptions of similar images based on their frequency. In order to evaluate the relevance of our image description, we conduct comparative experiments with existing approaches. Our experimental results show that our image descriptions are short, concise and visually relevant to query images.

References

[1]
N. Chen, Q. Y. Zhou, and V. Prasanna. Understanding web images by object relation network. In Proceedings of the 21st international conference on World Wide Web, pages 291--300. ACM, 2012.
[2]
A. Farhadi, M. Hejrati, M. Sadeghi, P. Young, C. Rashtchian, J. Hockenmaier, and D. Forsyth. Every picture tells a story: Generating sentences from images. Computer Vision-ECCV 2010, pages 15--29, 2010.
[3]
G. Kulkarni, V. Premraj, S. Dhar, S. Li, Y. Choi, A. C. Berg, and T. L. Berg. Baby talk: Understanding and generating simple image descriptions. In Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, pages 1601--1608. IEEE, 2011.
[4]
P. Kuznetsova, V. Ordonez, A. C. Berg, T. L. Berg, and Y. Choi. Collective generation of natural image descriptions. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1, ACL '12, pages 359--368, Stroudsburg, PA, USA, 2012. Association for Computational Linguistics.
[5]
V. Ordonez, G. Kulkarni, and T. L. Berg. Im2text: Describing images using 1 million captioned photographs. In Proceedings of NIPS, 2011.
[6]
B. Z. Yao, X. Yang, L. Lin, M. W. Lee, and S. C. Zhu. I2t: Image parsing to text description. Proceedings of the IEEE, 98(8):1485--1508, 2010.

Cited By

View all
  • (2021)Building A Voice Based Image Caption Generator with Deep Learning2021 5th International Conference on Intelligent Computing and Control Systems (ICICCS)10.1109/ICICCS51141.2021.9432091(943-948)Online publication date: 6-May-2021
  • (2018)A Survey on Automatic Image CaptioningMathematics and Computing10.1007/978-981-13-0023-3_8(74-83)Online publication date: 14-Apr-2018

Index Terms

  1. Automatic image description by using word-level features

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICIMCS '13: Proceedings of the Fifth International Conference on Internet Multimedia Computing and Service
    August 2013
    419 pages
    ISBN:9781450322522
    DOI:10.1145/2499788
    • Conference Chair:
    • Tat-Seng Chua,
    • General Chairs:
    • Ke Lu,
    • Tao Mei,
    • Xindong Wu
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    • NSF of China: National Natural Science Foundation of China
    • University of Sciences & Technology, Hefei: University of Sciences & Technology, Hefei
    • Beijing ACM SIGMM Chapter

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 August 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. automatic image annotation
    2. image description
    3. image recognition

    Qualifiers

    • Research-article

    Conference

    ICIMCS '13
    Sponsor:
    • NSF of China
    • University of Sciences & Technology, Hefei

    Acceptance Rates

    ICIMCS '13 Paper Acceptance Rate 20 of 94 submissions, 21%;
    Overall Acceptance Rate 163 of 456 submissions, 36%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 28 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Building A Voice Based Image Caption Generator with Deep Learning2021 5th International Conference on Intelligent Computing and Control Systems (ICICCS)10.1109/ICICCS51141.2021.9432091(943-948)Online publication date: 6-May-2021
    • (2018)A Survey on Automatic Image CaptioningMathematics and Computing10.1007/978-981-13-0023-3_8(74-83)Online publication date: 14-Apr-2018

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media