short-paper

A Supervised Approach for Text Illustration

Authors:

Harsh Jhamtani,

Midhun Gundapuneni,

Siddhartha Kumar DuttaAuthors Info & Claims

MM '16: Proceedings of the 24th ACM international conference on Multimedia

Pages 217 - 221

https://doi.org/10.1145/2964284.2967214

Published: 01 October 2016 Publication History

Abstract

In this paper we propose a novel method to illustrate text articles with pictures from a tagged repository. Certain types of documents, like news articles, are often accompanied by a few pictures only. Prior works leverage topics or key phrases from the text to suggest relevant pictures. We propose a supervised model based on features like readability, picturability, sentiment polarity, and presence of important phrases, to identify and rank key sentences. The proposed method then suggests some relevant pictures based on the top ranked sentences thus identified.

References

[1]

The gunning's fog index readability formula. Accessed: 2015-07--19.

[2]

Agrawal, R., Gollapudi, S., Kannan, A., and Kenthapadi, K. Enriching textbooks with images. In Proceedings of the 20th ACM international conference on Information and knowledge management (2011), ACM, pp. 1847--1856.

Digital Library

[3]

Aletras, N., and Stevenson, M. Representing topics using images. In HLT-NAACL (2013), pp. 158--167.

[4]

Carbonell, J., and Goldstein, J. The use of mmr, diversity-based reranking for reordering documents and producing summaries. In Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval (1998), ACM, pp. 335--336.

Digital Library

[5]

Delgado, D., Magalhaes, J., and Correia, N. Automated illustration of news stories. In Semantic Computing (ICSC), 2010 IEEE Fourth International Conference on (2010), IEEE, pp. 73--78.

Digital Library

[6]

Goldberg, A. B., Zhu, X., Dyer, C. R., Eldawy, M., and Heng, L. Easy as abc?: facilitating pictorial communication via semantically enhanced layout. In Proceedings of the Twelfth Conference on Computational Natural Language Learning (2008), Association for Computational Linguistics, pp. 119--126.

Digital Library

[7]

Gupta, V., Varshney, D., Jhamtani, H., Kedia, D., and Karwa, S. Identifying purchase intent from social posts. In ICWSM (2014).

[8]

Jhamtani, H., Chhaya, N., Karwa, S., Varshney, D., Kedia, D., and Gupta, V. Identifying suggestions for improvement of product features from online product reviews. In International Conference on Social Informatics (2015), Springer, pp. 112--119.

[9]

Joshi, D., Wang, J. Z., and Li, J. The story picturing engine--a system for automatic text illustration. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 2, 1 (2006), 68--89.

Digital Library

[10]

Kendall, M. G. A new measure of rank correlation. Biometrika (1938), 81--93.

[11]

Kosala, R., and Blockeel, H. Web mining research: A survey. ACM Sigkdd Explorations Newsletter 2, 1 (2000), 1--15.

Digital Library

[12]

Leong, C. W., Mihalcea, R., and Hassan, S. Text mining for automatic image tagging. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters (Stroudsburg, PA, USA, 2010), COLING '10, Association for Computational Linguistics, pp. 647--655.

Digital Library

[13]

LLC, O. Alchemyapi, 2009.

[14]

Lu, X., Pang, Y., Hao, Q., and Zhang, L. Visualizing textual travelogue with location-relevant images. In Proceedings of the 2009 International Workshop on Location Based Social Networks (2009), ACM, pp. 65--68.

Digital Library

[15]

Mankiewicz, R. The story of mathematics. Princeton University Press, 2000.

[16]

Mihalcea, R., and Tarau, P. Textrank: Bringing order into texts. In Proceedings of EMNLP 2004 (Barcelona, Spain, July 2004), D. Lin and D. Wu, Eds., Association for Computational Linguistics, pp. 404--411.

[17]

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., et al. Scikit-learn: Machine learning in python. The Journal of Machine Learning Research 12 (2011), 2825--2830.

Digital Library

[18]

Salton, G., Wong, A., and Yang, C.-S. A vector space model for automatic indexing. Communications of the ACM 18, 11 (1975), 613--620.

Digital Library

[19]

Spearman, C. The proof and measurement of association between two things. The American journal of psychology 15, 1 (1904), 72--101.

[20]

Steinberger, J., and Jezek, K. Using latent semantic analysis in text summarization and summary evaluation. In Proc. ISIM'04 (2004), pp. 93--100.

[21]

UzZaman, N., Bigham, J. P., and Allen, J. F. Multimodal summarization of complex sentences. In Proceedings of the 16th international conference on Intelligent user interfaces (2011), ACM, pp. 43--52.

Digital Library

[22]

Zhu, X., Goldberg, A. B., Eldawy, M., Dyer, C. R., and Strock, B. A text-to-picture synthesis system for augmenting communication. In AAAI (2007), vol. 7, pp. 1590--1595.

Digital Library

Cited By

Zhang LXu JGong YYu LZhang JShen J(2022)Unsupervised Image and Text Fusion for Travel Information EnhancementIEEE Transactions on Multimedia10.1109/TMM.2021.306440824(1415-1425)Online publication date: 2022
https://doi.org/10.1109/TMM.2021.3064408
Zhang LXu JZhang JGong Y(2018)Information Enhancement for Travelogues via a Hybrid Clustering Model2018 Digital Image Computing: Techniques and Applications (DICTA)10.1109/DICTA.2018.8615849(1-8)Online publication date: Dec-2018
https://doi.org/10.1109/DICTA.2018.8615849

Index Terms

A Supervised Approach for Text Illustration
1. Information systems
  1. Information systems applications
    1. Multimedia information systems
      1. Multimedia content creation

Recommendations

Weakly Supervised Joint Sentiment-Topic Detection from Text

Sentiment analysis or opinion mining aims to use automated tools to detect subjective information such as opinions, attitudes, and feelings expressed in text. This paper proposes a novel probabilistic modeling framework called joint sentiment-topic (JST)...
Semi-supervised latent Dirichlet allocation for multi-label text classification
IEA/AIE'13: Proceedings of the 26th International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

This paper proposes a semi-supervised latent Dirichlet allocation (ssLDA) method, which differs from the existing supervised topic models for multi-label classification in mainly two aspects. Firstly both labeled and unlabeled learning data are used in ...
Semi-supervised text categorization: Exploiting unlabeled data using ensemble learning algorithms

Text categorization is one of the fundamental tasks in text mining. Classical supervised methods need lot of labeled data to train a classifier. Since assigning labels to the large amount of data is very costly and time consuming, it is useful to use ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '16: Proceedings of the 24th ACM international conference on Multimedia

October 2016

1542 pages

ISBN:9781450336031

DOI:10.1145/2964284

General Chairs:
Alan Hanjalic
Delft University of Technology
,
Cees Snoek
Qualcomm Research Netherlands / University of Amsterdam
,
Marcel Worring
University of Amsterdam
,
Moderator:
Dick Bulterman
CWI / VU University Amsterdam
,
Program Chairs:
Benoit Huet
EURECOM
,
Aisling Kelliher
Virginia Tech
,
Yiannis Kompatsiaris
CERTH-ITI
,
Jin Li
Microsoft

Copyright © 2016 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 October 2016

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Conference

MM '16

Sponsor:

SIGMM

MM '16: ACM Multimedia Conference

October 15 - 19, 2016

Amsterdam, The Netherlands

Acceptance Rates

MM '16 Paper Acceptance Rate 52 of 237 submissions, 22%;

Overall Acceptance Rate 1,291 of 5,076 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
226
Total Downloads

Downloads (Last 12 months)2
Downloads (Last 6 weeks)0

Reflects downloads up to 15 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Zhang LXu JGong YYu LZhang JShen J(2022)Unsupervised Image and Text Fusion for Travel Information EnhancementIEEE Transactions on Multimedia10.1109/TMM.2021.306440824(1415-1425)Online publication date: 2022
https://doi.org/10.1109/TMM.2021.3064408
Zhang LXu JZhang JGong Y(2018)Information Enhancement for Travelogues via a Hybrid Clustering Model2018 Digital Image Computing: Techniques and Applications (DICTA)10.1109/DICTA.2018.8615849(1-8)Online publication date: Dec-2018
https://doi.org/10.1109/DICTA.2018.8615849

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten