Skip to main content

Uncovering Hidden Links Between Images Through Their Textual Context

  • Conference paper
  • First Online:
Book cover Enterprise Information Systems (ICEIS 2018)

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 363))

Included in the following conference series:

Abstract

Using hyperlinks to enhance page ranking has been widely studied in the literature. The main motivation is that an hyperlink underlines a page relevance. However, several hyperlinks in the web are used for navigation or marketing purposes. In addition, hyperlinks are created manually, so it is impossible to semantically link all similar pages. In our work, we propose to uncover hidden semantic links and create them automatically between all the collection’s images. For this aim, we propose first to format textual context of images into topic distributions via LDA technique, and then compute semantic similarities to create links. Experiments carried out in the Wikipedia Retrieval Task of ImageClef 2011 showed that the whole textual context of images is useful for uncovering hidden links and consequently enhancing the retrieval accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://lucene.apache.org/.

  2. 2.

    http://mallet.cs.umass.edu/.

References

  1. Alzu’bi, A., Amira, A., Ramzan, N.: Semantic content-based image retrieval: a comprehensive study. J. Vis. Commun. Image Represent. 32, 20–54 (2015)

    Article  Google Scholar 

  2. Aouadi, H., Khemakhem, M.T., Jemaa, M.B.: Combination of document structure and links for multimedia object retrieval. J. Inf. Sci. 38(5), 442–458 (2012)

    Article  Google Scholar 

  3. Aouadi, H., Khemakhem, M.T., Jemaa, M.B.: An LDA topic model adaptation for context-based image retrieval. In: Stuckenschmidt, H., Jannach, D. (eds.) EC-Web 2015. LNBIP, vol. 239, pp. 69–80. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-27729-5_6

    Chapter  Google Scholar 

  4. Aouadi, H., Khemakhem, M.T., Jemaa, M.B.: Building contextual implicit links for image retrieval. In: Proceedings of the 20th International Conference on Enterprise Information Systems, ICEIS 2018, Funchal, 21–24 March 2018, vol. 1, pp. 81–91 (2018)

    Google Scholar 

  5. Belmouhcine, A., Benkhalifa, M.: Implicit links based web page representation for web page classification. In: Proceedings of the 5th International Conference on Web Intelligence, Mining and Semantics, p. 12. ACM (2015)

    Google Scholar 

  6. Bharat, K., Henzinger, M.R.: Improved algorithms for topic distillation in a hyperlinked environment. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 104–111. ACM (1998)

    Google Scholar 

  7. Bhatt, C., Pappas, N., Habibi, M., Popescu-Belis, A.: Multimodal reranking of content-based recommendations for hyperlinking video snippets. In: Proceedings of International Conference on Multimedia Retrieval, p. 225. ACM (2014)

    Google Scholar 

  8. Blei, D., Ng, A., Jordan, M.: Latent Dirichlet Allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2003)

    MATH  Google Scholar 

  9. Brandes, U.: A faster algorithm for betweenness centrality. J. Math. Sociol. 25(2), 163–177 (2001)

    Article  Google Scholar 

  10. Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. In: Proceedings of the 7th International Conference on World Wide Web (WWW), Brisbane, pp. 107–117 (1998)

    Google Scholar 

  11. Cai, D., He, X., Ma, W.Y., Wen, J.R., Zhang, H.: Organizing www images based on the analysis of page layout and web link structure. In: 2004 IEEE International Conference on Multimedia and Expo 2004, ICME 2004, vol. 1, pp. 113–116. IEEE (2004)

    Google Scholar 

  12. Cai, D., Yu, S., Wen, J.R., Ma, W.Y.: VIPS: a vision based page segmentation algorithm. Microsoft Technical Report 79, MSR-TR (2003)

    Google Scholar 

  13. Chakrabarti, S., Dom, B., Raghavan, P., Rajagopalan, S., Gibson, D., Kleinberg, J.: Automatic resource compilation by analyzing hyperlink structure and associated text. Comput. Netw. ISDN Syst. 30(1–7), 65–74 (1998)

    Article  Google Scholar 

  14. Chen, S., Eskevich, M., Jones, G.J.F., O’Connor, N.E.: An investigation into feature effectiveness for multimedia hyperlinking. In: Gurrin, C., Hopfgartner, F., Hurst, W., Johansen, H., Lee, H., O’Connor, N. (eds.) MMM 2014. LNCS, vol. 8326, pp. 251–262. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-04117-9_23

    Chapter  Google Scholar 

  15. Chibane, I., Doan, B.L.: Relevance propagation model for large hypertext document collections. In: Large scale Semantic Access to Content (text, image, video, and sound), pp. 585–595. Le Centre de Hautes Etudes Internationales D’Informatique Documentaire (2007)

    Google Scholar 

  16. Cohn, D.A., Hofmann, T.: The missing link-a probabilistic model of document content and hypertext connectivity. In: Advances in Neural Information Processing Systems, pp. 430–436 (2001)

    Google Scholar 

  17. Datta, R., Joshi, D., Li, J., Wang, J.Z.: Image retrieval: ideas, influences, and trends of the new age. ACM Comput. Surv. (CSUR) 40(2), 5 (2008)

    Article  Google Scholar 

  18. Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41(6), 391 (1990)

    Article  Google Scholar 

  19. Dunlop, M.D.: Multimedia information retrieval. Ph.D. thesis, University of Glasgow (1991)

    Google Scholar 

  20. Eskevich, M., et al.: Multimedia information seeking through search and hyperlinking. In: Proceedings of the 3rd ACM Conference on International Conference on Multimedia Retrieval, pp. 287–294. ACM (2013)

    Google Scholar 

  21. Harmandas, V., Sanderson, M., Dunlop, M.D.: Image retrieval by hypertext links. In: ACM SIGIR Forum, vol. 31, pp. 296–303. ACM (1997)

    Google Scholar 

  22. Hashemi, H.B., Yazdani, N., Shakery, A., Naeini, M.P.: Application of ensemble models in web ranking. In: 2010 5th International Symposium on Telecommunications (IST), pp. 726–731. IEEE (2010)

    Google Scholar 

  23. Haveliwala, T.H.: Topic-sensitive PageRank. In: Proceedings of the 11th international conference on World Wide Web, pp. 517–526. ACM (2002)

    Google Scholar 

  24. He, X., Cai, D., Wen, J.R., Ma, W.Y., Zhang, H.J.: Clustering and searching WWW images using link and page layout analysis. ACM Trans. Multimed. Comput. Commun. Appl. (TOMM) 3(2), 10 (2007)

    Article  Google Scholar 

  25. Hsu, W.H., Kennedy, L.S., Chang, S.H.: Video search reranking through random walk over document-level context graph. In: Proceedings of the 15th ACM International Conference on Multimedia, pp. 971–980. ACM (2007)

    Google Scholar 

  26. Ingongngam, P., Rungsawang, A.: Topic-centric algorithm: a novel approach to web link analysis. In: 2004 18th International Conference on Advanced Information Networking and Applications, AINA 2004, vol. 2, pp. 299–301. IEEE (2004)

    Google Scholar 

  27. Jeh, G., Widom, J.: Scaling personalized web search. In: Proceedings of the 12th International Conference on World Wide Web, pp. 271–279. ACM (2003)

    Google Scholar 

  28. Jing, Y., Baluja, S.: PageRank for product image search. In: Proceedings of the 17th International Conference on World Wide Web, pp. 307–316. ACM (2008)

    Google Scholar 

  29. Jing, Y., Baluja, S.: VisualRank: applying PageRank to large-scale image search. IEEE Trans. Pattern Anal. Mach. Intell. 30(11), 1877–1890 (2008)

    Article  Google Scholar 

  30. Khasanova, R., Dong, X., Frossard, P.: Multi-modal image retrieval with random walk on multi-layer graphs. arXiv preprint arXiv:1607.03406 (2016)

  31. Kim, D.J., Lee, S.C., Son, H.Y., Kim, S.W., Lee, J.B.: C-rank and its variants: a contribution-based ranking approach exploiting links and content. J. Inf. Sci. 40(6), 761–778 (2014)

    Article  Google Scholar 

  32. Kleinberg, J.: Authoritative sources in a hyperlinked environment. In: Proceedings of 9th Annual ACM-SIAM Symposium Discrete Algorithms, pp. 668–677 (1998)

    Google Scholar 

  33. Kolda, T., Bader, B.: The TOPHITS model for higher-order web link analysis. In: Workshop on Link Analysis, Counterterrorism and Security (2006)

    Google Scholar 

  34. Kurland, O., Lee, L.: PageRank without hyperlinks: structural reranking using links induced by language models. In: Proceedings of the Annual International ACM/SIGIR Conference on Research and Development in Information Retrieval, pp. 306–313. ACM (2005)

    Google Scholar 

  35. Kurland, O., Lee, L.: Respect my authority!: hits without hyperlinks, utilizing cluster-based language models. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 83–90. ACM (2006)

    Google Scholar 

  36. Lempel, R., Soffer, A.: PicASHOW: pictorial authority search by hyperlinks on the web. In: Proceedings of the 10th International Conference on World Wide Web, pp. 438–448. ACM (2001)

    Google Scholar 

  37. Li, B., Han, L.: Distance weighted cosine similarity measure for text classification. In: Yin, H., et al. (eds.) IDEAL 2013. LNCS, vol. 8206, pp. 611–618. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-41278-3_74

    Chapter  Google Scholar 

  38. Lin, J.: PageRank without hyperlinks: reranking with pubmed related article networks for biomedical text retrieval. BMC Bioinformatics 9(1), 270 (2008)

    Article  Google Scholar 

  39. Liu, J., Lai, W., Hua, X.S., Huang, Y., Li, S.: Video search re-ranking via multi-graph propagation. In: Proceedings of the 15th ACM International Conference on Multimedia, pp. 208–217. ACM (2007)

    Google Scholar 

  40. Liu, Z., Wang, S., Zheng, L., Tian, Q.: Robust imagegraph: rank-level feature fusion for image search. IEEE Trans. Image Process. 26(7), 3128–3141 (2017)

    Article  MathSciNet  Google Scholar 

  41. Mikawa, K., Ishida, T., Goto, M.: A proposal of extended cosine measure for distance metric learning in text classification. In: 2011 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 1741–1746. IEEE (2011)

    Google Scholar 

  42. Najork, M., Zaragoza, H., Taylor, M.: Hits on the web: how does it compare? In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (2007)

    Google Scholar 

  43. Petrakis, E.G.: Intelligent search for image information on the web through text and link structure analysis. In: Maragos, P., Potamianos, A., Gros, P. (eds.) Multimodal Processing and Interaction. Multimedia Systems and Applications, pp. 1–17. Springer, Boston (2008). https://doi.org/10.1007/978-0-387-76316-3_12

    Chapter  Google Scholar 

  44. Ricardo, B.-Y., Berthier, R.N.: Modern Information Retrieval-the Concepts and Technology Behind Search, 2nd edn. Addison Wesley, New Jersey (2011)

    Google Scholar 

  45. Richardson, M., Domingos, P.: The intelligent surfer: probabilistic combination of link and content information in PageRank. In: Advances in Neural Information Processing Systems, vol. 2, pp. 1441–1448 (2002)

    Google Scholar 

  46. Robertson, S.E., Walker, S.: Okapi/keenbow at TREC-8. TREC 8, 151–162 (1999)

    Google Scholar 

  47. Salton, G.: Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley, Reading (1989)

    Google Scholar 

  48. Shakery, A., Zhai, C.: A probabilistic relevance propagation model for hypertext retrieval. In: Proceedings of the 15th ACM International Conference on Information and Knowledge Management, pp. 550–558. ACM (2006)

    Google Scholar 

  49. Shen, D., Sun, J.T., Yang, Q., Chen, Z.: A comparison of implicit and explicit links for web page classification. In: Proceedings of the 15th International Conference on World Wide Web, pp. 643–650. ACM (2006)

    Google Scholar 

  50. Simon, A.R., Sicre, R., Bois, R., Gravier, G., SĂ©billot, P.: Irisa at trecvid2015: Leveraging multimodal LDA for video hyperlinking. In: TRECVID 2015 Workshop (2015)

    Google Scholar 

  51. T. Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 50–57. ACM (1999)

    Google Scholar 

  52. Torjmen, M., Pinel-Sauvagnat, K., Boughanem, M.: Using textual and structural context for searching multimedia elements. Int. J. Bus. Intell. Data Mining 5(4), 323–352 (2010)

    Article  Google Scholar 

  53. Tsikrika, T., et al.: Structured document retrieval, multimedia retrieval, and entity ranking using PF/Tijah. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 306–320. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-85902-4_27

    Chapter  Google Scholar 

  54. Voutsakis, E., Petrakis, E.G., Milios, E.: Weighted link analysis for logo and trademark image retrieval on the web. In: Proceedings of the 2005 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 581–585. IEEE Computer Society (2005)

    Google Scholar 

  55. Wang, M., Li, H., Tao, D., Lu, K., Wu, X.: Multimodal graph-based reranking for web image search. IEEE Trans. Image Process. 21(11), 4649–4661 (2012)

    Article  MathSciNet  Google Scholar 

  56. Wang, X., Zhou, W., Tian, Q., Li, H.: Adaptively weighted graph fusion for image retrieval. In: Proceedings of the International Conference on Internet Multimedia Computing and Service, pp. 18–21. ACM (2016)

    Google Scholar 

  57. Wilcoxon, F.: Individual comparisons by ranking methods. Biometrics Bull. 1(6), 80–83 (1945)

    Article  Google Scholar 

  58. Xie, L., Tian, Q., Zhou, W., Zhang, B.: Fast and accurate near-duplicate image search with affinity propagation on the imageweb. Comput. Vis. Image Underst. 124, 31–41 (2014)

    Article  Google Scholar 

  59. Xu, G., Ma, W.Y.: Building implicit links from content for forum search. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 300–307. ACM (2006)

    Google Scholar 

  60. Xue, G.R., Zeng, H.J., Chen, Z., Ma, W.Y., Zhang, H.J., Lu, C.J.: Implicit link analysis for small web search. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 56–63. ACM (2003)

    Google Scholar 

  61. Zaragoza, H., Craswell, N., Taylor, M.J., Saria, S., Robertson, S.E.: Microsoft cambridge at TREC 13: web and hard tracks. TREC 4, 1–1 (2004)

    Google Scholar 

  62. Zhang, S., Yang, M., Cour, T., Yu, K., Metaxas, D.N.: Query specific fusion for image retrieval. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, pp. 660–673. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33709-3_47

    Chapter  Google Scholar 

  63. Zhang, W., Ngo, C.W., Cao, X.: Hyperlink-aware object retrieval. IEEE Trans. Image Process. 25(9), 4186–4198 (2016)

    Article  MathSciNet  Google Scholar 

  64. Zhang, X., Hu, X., Zhou, X.: A comparative evaluation of different link types on enhancing document clustering. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 555–562. ACM (2008)

    Google Scholar 

  65. Zhou, W., Li, H., Tian, Q.: Recent advance in content-based image retrieval: a literature survey. arXiv preprint arXiv:1706.06064 (2017)

  66. Zhou, W., Tian, Q., Li, H.: Visual block link analysis for image re-ranking. In: Proceedings of the First International Conference on Internet Multimedia Computing and Service, pp. 10–16. ACM (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hatem Aouadi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Aouadi, H., Khemakhem, M.T., Jemaa, M.B. (2019). Uncovering Hidden Links Between Images Through Their Textual Context. In: Hammoudi, S., Śmiałek, M., Camp, O., Filipe, J. (eds) Enterprise Information Systems. ICEIS 2018. Lecture Notes in Business Information Processing, vol 363. Springer, Cham. https://doi.org/10.1007/978-3-030-26169-6_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-26169-6_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-26168-9

  • Online ISBN: 978-3-030-26169-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics