skip to main content
10.1145/2964284.2967214acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

A Supervised Approach for Text Illustration

Published: 01 October 2016 Publication History

Abstract

In this paper we propose a novel method to illustrate text articles with pictures from a tagged repository. Certain types of documents, like news articles, are often accompanied by a few pictures only. Prior works leverage topics or key phrases from the text to suggest relevant pictures. We propose a supervised model based on features like readability, picturability, sentiment polarity, and presence of important phrases, to identify and rank key sentences. The proposed method then suggests some relevant pictures based on the top ranked sentences thus identified.

References

[1]
The gunning's fog index readability formula. Accessed: 2015-07--19.
[2]
Agrawal, R., Gollapudi, S., Kannan, A., and Kenthapadi, K. Enriching textbooks with images. In Proceedings of the 20th ACM international conference on Information and knowledge management (2011), ACM, pp. 1847--1856.
[3]
Aletras, N., and Stevenson, M. Representing topics using images. In HLT-NAACL (2013), pp. 158--167.
[4]
Carbonell, J., and Goldstein, J. The use of mmr, diversity-based reranking for reordering documents and producing summaries. In Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval (1998), ACM, pp. 335--336.
[5]
Delgado, D., Magalhaes, J., and Correia, N. Automated illustration of news stories. In Semantic Computing (ICSC), 2010 IEEE Fourth International Conference on (2010), IEEE, pp. 73--78.
[6]
Goldberg, A. B., Zhu, X., Dyer, C. R., Eldawy, M., and Heng, L. Easy as abc?: facilitating pictorial communication via semantically enhanced layout. In Proceedings of the Twelfth Conference on Computational Natural Language Learning (2008), Association for Computational Linguistics, pp. 119--126.
[7]
Gupta, V., Varshney, D., Jhamtani, H., Kedia, D., and Karwa, S. Identifying purchase intent from social posts. In ICWSM (2014).
[8]
Jhamtani, H., Chhaya, N., Karwa, S., Varshney, D., Kedia, D., and Gupta, V. Identifying suggestions for improvement of product features from online product reviews. In International Conference on Social Informatics (2015), Springer, pp. 112--119.
[9]
Joshi, D., Wang, J. Z., and Li, J. The story picturing engine--a system for automatic text illustration. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 2, 1 (2006), 68--89.
[10]
Kendall, M. G. A new measure of rank correlation. Biometrika (1938), 81--93.
[11]
Kosala, R., and Blockeel, H. Web mining research: A survey. ACM Sigkdd Explorations Newsletter 2, 1 (2000), 1--15.
[12]
Leong, C. W., Mihalcea, R., and Hassan, S. Text mining for automatic image tagging. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters (Stroudsburg, PA, USA, 2010), COLING '10, Association for Computational Linguistics, pp. 647--655.
[13]
LLC, O. Alchemyapi, 2009.
[14]
Lu, X., Pang, Y., Hao, Q., and Zhang, L. Visualizing textual travelogue with location-relevant images. In Proceedings of the 2009 International Workshop on Location Based Social Networks (2009), ACM, pp. 65--68.
[15]
Mankiewicz, R. The story of mathematics. Princeton University Press, 2000.
[16]
Mihalcea, R., and Tarau, P. Textrank: Bringing order into texts. In Proceedings of EMNLP 2004 (Barcelona, Spain, July 2004), D. Lin and D. Wu, Eds., Association for Computational Linguistics, pp. 404--411.
[17]
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., et al. Scikit-learn: Machine learning in python. The Journal of Machine Learning Research 12 (2011), 2825--2830.
[18]
Salton, G., Wong, A., and Yang, C.-S. A vector space model for automatic indexing. Communications of the ACM 18, 11 (1975), 613--620.
[19]
Spearman, C. The proof and measurement of association between two things. The American journal of psychology 15, 1 (1904), 72--101.
[20]
Steinberger, J., and Jezek, K. Using latent semantic analysis in text summarization and summary evaluation. In Proc. ISIM'04 (2004), pp. 93--100.
[21]
UzZaman, N., Bigham, J. P., and Allen, J. F. Multimodal summarization of complex sentences. In Proceedings of the 16th international conference on Intelligent user interfaces (2011), ACM, pp. 43--52.
[22]
Zhu, X., Goldberg, A. B., Eldawy, M., Dyer, C. R., and Strock, B. A text-to-picture synthesis system for augmenting communication. In AAAI (2007), vol. 7, pp. 1590--1595.

Cited By

View all
  • (2022)Unsupervised Image and Text Fusion for Travel Information EnhancementIEEE Transactions on Multimedia10.1109/TMM.2021.306440824(1415-1425)Online publication date: 2022
  • (2018)Information Enhancement for Travelogues via a Hybrid Clustering Model2018 Digital Image Computing: Techniques and Applications (DICTA)10.1109/DICTA.2018.8615849(1-8)Online publication date: Dec-2018

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '16: Proceedings of the 24th ACM international conference on Multimedia
October 2016
1542 pages
ISBN:9781450336031
DOI:10.1145/2964284
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 October 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. supervised approach
  2. text illustration

Qualifiers

  • Short-paper

Conference

MM '16
Sponsor:
MM '16: ACM Multimedia Conference
October 15 - 19, 2016
Amsterdam, The Netherlands

Acceptance Rates

MM '16 Paper Acceptance Rate 52 of 237 submissions, 22%;
Overall Acceptance Rate 1,291 of 5,076 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Unsupervised Image and Text Fusion for Travel Information EnhancementIEEE Transactions on Multimedia10.1109/TMM.2021.306440824(1415-1425)Online publication date: 2022
  • (2018)Information Enhancement for Travelogues via a Hybrid Clustering Model2018 Digital Image Computing: Techniques and Applications (DICTA)10.1109/DICTA.2018.8615849(1-8)Online publication date: Dec-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media