Skip to main content

Identifying Persons in News Article Images Based on Textual Analysis

  • Conference paper
The Role of Digital Libraries in a Time of Global Change (ICADL 2010)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6102))

Included in the following conference series:

  • 1409 Accesses

Abstract

A large portion of news articles contains images of persons whose names appear in the news stories. To provide image search of persons, most search engines construct an index from textual descriptions (such as headline and caption) of images. The index search approach, although very simple and scalable, has one serious drawback. A query of a person name could match some news articles which do not contain images of the target person. Therefore, some irrelevant images could be returned as search results. Our main goal is to improve the performance of the index search approach based on the syntactic analysis of person name entities in the news articles. Given sentences containing person names, we construct a set of syntactic rules for identifying persons in news images. The set of syntactic rules is used to filter out images of non-target persons from the results returned by the index search. From the experimental results, our approach improved the performance over the basic index search by 10% based on the F1-measure.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 74.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abney, S.: Parsing by chunks. In: Berwick, R., Abney, S., Tenny, C. (eds.) Principle-Based Parsing. Kluwer Academic Publishers, Dordrecht (1991)

    Google Scholar 

  2. Berg, T.L., Berg, A.C., Edwards, J., Maire, M., White, R., Yee-Whye, T., Learned-Miller, E., Forsyth, D.A.: Names and Faces in the News. In: Proc. of the 2004 IEEE Conf. on Computer Vision and Pattern Recognition, pp. 848–854 (2004)

    Google Scholar 

  3. Chinchor, N.: MUC-7 Named Entity Task Definition (Version 3.5). MUC-7, Fairfax, Virginia (1998)

    Google Scholar 

  4. Datta, R., Joshi, D., Li, J., Wang, J.Z.: Image retrieval: Ideas, influences, and trends of the new age. ACM Computing Surveys (CSUR) 40(2), 1–60 (2008)

    Article  Google Scholar 

  5. Edwards, J., White, R., Forsyth, D.: Words and pictures in the news. In: Proc. of the HLT-NAACL 2003 workshop on learning word meaning from non-linguistic data, pp. 6–13 (2003)

    Google Scholar 

  6. He, X., Cai, D., Wen, J.-R., Ma, W.-Y., Zhang, H.-J.: Clustering and searching WWW images using link and page layout analysis. ACM Trans. on Multimedia Computing, Communications, and Applications 3(2) (2007)

    Google Scholar 

  7. Hörster, E., Lienhart, R., Slaney, M.: Image retrieval on large-scale image databases. In: Proc. of the 6th ACM int. conf. on image and video retrieval, pp. 17–24 (2007)

    Google Scholar 

  8. Kherfi, M.L., Ziou, D., Bernardi, A.: Image Retrieval from the World Wide Web: Issues, Techniques, and Systems. ACM Computing Surveys (CSUR) 36(1), 35–67 (2004)

    Article  Google Scholar 

  9. Kitahara, A., Joutou, T., Yanai, K.: Associating Faces and Names in Japanese Photo News Articles on the Web. In: Proc. of the 22nd Int. Conf. on Advanced Information Networking and Applications - Workshops, pp. 1156–1161 (2008)

    Google Scholar 

  10. Liu, C., Jiang, S., Huang, Q.: Naming faces in broadcast news video by image google. In: Proc. of the 16th ACM int. conf. on multimedia, pp. 717–720 (2008)

    Google Scholar 

  11. Ozkan, D., Duygulu, P.: A Graph Based Approach for Naming Faces in News Photos. In: Proc. of the 2006 IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1477–1482 (2006)

    Google Scholar 

  12. Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-Based Image Retrieval at the End of the Early Years. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(12), 1349–1380 (2000)

    Article  Google Scholar 

  13. Srihari, R.K.: Automatic Indexing and Content-Based Retrieval of Captioned Images. Computer 28(9), 49–56 (1995)

    Article  Google Scholar 

  14. Yagnik, J., Islam, A.: Learning people annotation from the web via consistency learning. In: Proc. of the int. workshop on multimedia information retrieval, pp. 285–290 (2007)

    Google Scholar 

  15. Yang, J., Hauptmann, A.G.: Naming every individual in news video monologues. In: Proc. of the 12th ACM int. conf. on multimedia, pp. 580–587 (2004)

    Google Scholar 

  16. Zhao, W., Chellappa, R., Phillips, P.J., Rosenfeld, A.: Face recognition: A literature survey. ACM Computing Surveys (CSUR) 35(4), 399–458 (2003)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Haruechaiyasak, C., Damrongrat, C. (2010). Identifying Persons in News Article Images Based on Textual Analysis. In: Chowdhury, G., Koo, C., Hunter, J. (eds) The Role of Digital Libraries in a Time of Global Change. ICADL 2010. Lecture Notes in Computer Science, vol 6102. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13654-2_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-13654-2_27

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-13653-5

  • Online ISBN: 978-3-642-13654-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics