Abstract
Using hyperlinks to enhance page ranking has been widely studied in the literature. The main motivation is that an hyperlink underlines a page relevance. However, several hyperlinks in the web are used for navigation or marketing purposes. In addition, hyperlinks are created manually, so it is impossible to semantically link all similar pages. In our work, we propose to uncover hidden semantic links and create them automatically between all the collection’s images. For this aim, we propose first to format textual context of images into topic distributions via LDA technique, and then compute semantic similarities to create links. Experiments carried out in the Wikipedia Retrieval Task of ImageClef 2011 showed that the whole textual context of images is useful for uncovering hidden links and consequently enhancing the retrieval accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alzu’bi, A., Amira, A., Ramzan, N.: Semantic content-based image retrieval: a comprehensive study. J. Vis. Commun. Image Represent. 32, 20–54 (2015)
Aouadi, H., Khemakhem, M.T., Jemaa, M.B.: Combination of document structure and links for multimedia object retrieval. J. Inf. Sci. 38(5), 442–458 (2012)
Aouadi, H., Khemakhem, M.T., Jemaa, M.B.: An LDA topic model adaptation for context-based image retrieval. In: Stuckenschmidt, H., Jannach, D. (eds.) EC-Web 2015. LNBIP, vol. 239, pp. 69–80. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-27729-5_6
Aouadi, H., Khemakhem, M.T., Jemaa, M.B.: Building contextual implicit links for image retrieval. In: Proceedings of the 20th International Conference on Enterprise Information Systems, ICEIS 2018, Funchal, 21–24 March 2018, vol. 1, pp. 81–91 (2018)
Belmouhcine, A., Benkhalifa, M.: Implicit links based web page representation for web page classification. In: Proceedings of the 5th International Conference on Web Intelligence, Mining and Semantics, p. 12. ACM (2015)
Bharat, K., Henzinger, M.R.: Improved algorithms for topic distillation in a hyperlinked environment. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 104–111. ACM (1998)
Bhatt, C., Pappas, N., Habibi, M., Popescu-Belis, A.: Multimodal reranking of content-based recommendations for hyperlinking video snippets. In: Proceedings of International Conference on Multimedia Retrieval, p. 225. ACM (2014)
Blei, D., Ng, A., Jordan, M.: Latent Dirichlet Allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2003)
Brandes, U.: A faster algorithm for betweenness centrality. J. Math. Sociol. 25(2), 163–177 (2001)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. In: Proceedings of the 7th International Conference on World Wide Web (WWW), Brisbane, pp. 107–117 (1998)
Cai, D., He, X., Ma, W.Y., Wen, J.R., Zhang, H.: Organizing www images based on the analysis of page layout and web link structure. In: 2004 IEEE International Conference on Multimedia and Expo 2004, ICME 2004, vol. 1, pp. 113–116. IEEE (2004)
Cai, D., Yu, S., Wen, J.R., Ma, W.Y.: VIPS: a vision based page segmentation algorithm. Microsoft Technical Report 79, MSR-TR (2003)
Chakrabarti, S., Dom, B., Raghavan, P., Rajagopalan, S., Gibson, D., Kleinberg, J.: Automatic resource compilation by analyzing hyperlink structure and associated text. Comput. Netw. ISDN Syst. 30(1–7), 65–74 (1998)
Chen, S., Eskevich, M., Jones, G.J.F., O’Connor, N.E.: An investigation into feature effectiveness for multimedia hyperlinking. In: Gurrin, C., Hopfgartner, F., Hurst, W., Johansen, H., Lee, H., O’Connor, N. (eds.) MMM 2014. LNCS, vol. 8326, pp. 251–262. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-04117-9_23
Chibane, I., Doan, B.L.: Relevance propagation model for large hypertext document collections. In: Large scale Semantic Access to Content (text, image, video, and sound), pp. 585–595. Le Centre de Hautes Etudes Internationales D’Informatique Documentaire (2007)
Cohn, D.A., Hofmann, T.: The missing link-a probabilistic model of document content and hypertext connectivity. In: Advances in Neural Information Processing Systems, pp. 430–436 (2001)
Datta, R., Joshi, D., Li, J., Wang, J.Z.: Image retrieval: ideas, influences, and trends of the new age. ACM Comput. Surv. (CSUR) 40(2), 5 (2008)
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41(6), 391 (1990)
Dunlop, M.D.: Multimedia information retrieval. Ph.D. thesis, University of Glasgow (1991)
Eskevich, M., et al.: Multimedia information seeking through search and hyperlinking. In: Proceedings of the 3rd ACM Conference on International Conference on Multimedia Retrieval, pp. 287–294. ACM (2013)
Harmandas, V., Sanderson, M., Dunlop, M.D.: Image retrieval by hypertext links. In: ACM SIGIR Forum, vol. 31, pp. 296–303. ACM (1997)
Hashemi, H.B., Yazdani, N., Shakery, A., Naeini, M.P.: Application of ensemble models in web ranking. In: 2010 5th International Symposium on Telecommunications (IST), pp. 726–731. IEEE (2010)
Haveliwala, T.H.: Topic-sensitive PageRank. In: Proceedings of the 11th international conference on World Wide Web, pp. 517–526. ACM (2002)
He, X., Cai, D., Wen, J.R., Ma, W.Y., Zhang, H.J.: Clustering and searching WWW images using link and page layout analysis. ACM Trans. Multimed. Comput. Commun. Appl. (TOMM) 3(2), 10 (2007)
Hsu, W.H., Kennedy, L.S., Chang, S.H.: Video search reranking through random walk over document-level context graph. In: Proceedings of the 15th ACM International Conference on Multimedia, pp. 971–980. ACM (2007)
Ingongngam, P., Rungsawang, A.: Topic-centric algorithm: a novel approach to web link analysis. In: 2004 18th International Conference on Advanced Information Networking and Applications, AINA 2004, vol. 2, pp. 299–301. IEEE (2004)
Jeh, G., Widom, J.: Scaling personalized web search. In: Proceedings of the 12th International Conference on World Wide Web, pp. 271–279. ACM (2003)
Jing, Y., Baluja, S.: PageRank for product image search. In: Proceedings of the 17th International Conference on World Wide Web, pp. 307–316. ACM (2008)
Jing, Y., Baluja, S.: VisualRank: applying PageRank to large-scale image search. IEEE Trans. Pattern Anal. Mach. Intell. 30(11), 1877–1890 (2008)
Khasanova, R., Dong, X., Frossard, P.: Multi-modal image retrieval with random walk on multi-layer graphs. arXiv preprint arXiv:1607.03406 (2016)
Kim, D.J., Lee, S.C., Son, H.Y., Kim, S.W., Lee, J.B.: C-rank and its variants: a contribution-based ranking approach exploiting links and content. J. Inf. Sci. 40(6), 761–778 (2014)
Kleinberg, J.: Authoritative sources in a hyperlinked environment. In: Proceedings of 9th Annual ACM-SIAM Symposium Discrete Algorithms, pp. 668–677 (1998)
Kolda, T., Bader, B.: The TOPHITS model for higher-order web link analysis. In: Workshop on Link Analysis, Counterterrorism and Security (2006)
Kurland, O., Lee, L.: PageRank without hyperlinks: structural reranking using links induced by language models. In: Proceedings of the Annual International ACM/SIGIR Conference on Research and Development in Information Retrieval, pp. 306–313. ACM (2005)
Kurland, O., Lee, L.: Respect my authority!: hits without hyperlinks, utilizing cluster-based language models. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 83–90. ACM (2006)
Lempel, R., Soffer, A.: PicASHOW: pictorial authority search by hyperlinks on the web. In: Proceedings of the 10th International Conference on World Wide Web, pp. 438–448. ACM (2001)
Li, B., Han, L.: Distance weighted cosine similarity measure for text classification. In: Yin, H., et al. (eds.) IDEAL 2013. LNCS, vol. 8206, pp. 611–618. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-41278-3_74
Lin, J.: PageRank without hyperlinks: reranking with pubmed related article networks for biomedical text retrieval. BMC Bioinformatics 9(1), 270 (2008)
Liu, J., Lai, W., Hua, X.S., Huang, Y., Li, S.: Video search re-ranking via multi-graph propagation. In: Proceedings of the 15th ACM International Conference on Multimedia, pp. 208–217. ACM (2007)
Liu, Z., Wang, S., Zheng, L., Tian, Q.: Robust imagegraph: rank-level feature fusion for image search. IEEE Trans. Image Process. 26(7), 3128–3141 (2017)
Mikawa, K., Ishida, T., Goto, M.: A proposal of extended cosine measure for distance metric learning in text classification. In: 2011 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 1741–1746. IEEE (2011)
Najork, M., Zaragoza, H., Taylor, M.: Hits on the web: how does it compare? In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (2007)
Petrakis, E.G.: Intelligent search for image information on the web through text and link structure analysis. In: Maragos, P., Potamianos, A., Gros, P. (eds.) Multimodal Processing and Interaction. Multimedia Systems and Applications, pp. 1–17. Springer, Boston (2008). https://doi.org/10.1007/978-0-387-76316-3_12
Ricardo, B.-Y., Berthier, R.N.: Modern Information Retrieval-the Concepts and Technology Behind Search, 2nd edn. Addison Wesley, New Jersey (2011)
Richardson, M., Domingos, P.: The intelligent surfer: probabilistic combination of link and content information in PageRank. In: Advances in Neural Information Processing Systems, vol. 2, pp. 1441–1448 (2002)
Robertson, S.E., Walker, S.: Okapi/keenbow at TREC-8. TREC 8, 151–162 (1999)
Salton, G.: Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley, Reading (1989)
Shakery, A., Zhai, C.: A probabilistic relevance propagation model for hypertext retrieval. In: Proceedings of the 15th ACM International Conference on Information and Knowledge Management, pp. 550–558. ACM (2006)
Shen, D., Sun, J.T., Yang, Q., Chen, Z.: A comparison of implicit and explicit links for web page classification. In: Proceedings of the 15th International Conference on World Wide Web, pp. 643–650. ACM (2006)
Simon, A.R., Sicre, R., Bois, R., Gravier, G., SĂ©billot, P.: Irisa at trecvid2015: Leveraging multimodal LDA for video hyperlinking. In: TRECVID 2015 Workshop (2015)
T. Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 50–57. ACM (1999)
Torjmen, M., Pinel-Sauvagnat, K., Boughanem, M.: Using textual and structural context for searching multimedia elements. Int. J. Bus. Intell. Data Mining 5(4), 323–352 (2010)
Tsikrika, T., et al.: Structured document retrieval, multimedia retrieval, and entity ranking using PF/Tijah. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 306–320. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-85902-4_27
Voutsakis, E., Petrakis, E.G., Milios, E.: Weighted link analysis for logo and trademark image retrieval on the web. In: Proceedings of the 2005 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 581–585. IEEE Computer Society (2005)
Wang, M., Li, H., Tao, D., Lu, K., Wu, X.: Multimodal graph-based reranking for web image search. IEEE Trans. Image Process. 21(11), 4649–4661 (2012)
Wang, X., Zhou, W., Tian, Q., Li, H.: Adaptively weighted graph fusion for image retrieval. In: Proceedings of the International Conference on Internet Multimedia Computing and Service, pp. 18–21. ACM (2016)
Wilcoxon, F.: Individual comparisons by ranking methods. Biometrics Bull. 1(6), 80–83 (1945)
Xie, L., Tian, Q., Zhou, W., Zhang, B.: Fast and accurate near-duplicate image search with affinity propagation on the imageweb. Comput. Vis. Image Underst. 124, 31–41 (2014)
Xu, G., Ma, W.Y.: Building implicit links from content for forum search. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 300–307. ACM (2006)
Xue, G.R., Zeng, H.J., Chen, Z., Ma, W.Y., Zhang, H.J., Lu, C.J.: Implicit link analysis for small web search. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 56–63. ACM (2003)
Zaragoza, H., Craswell, N., Taylor, M.J., Saria, S., Robertson, S.E.: Microsoft cambridge at TREC 13: web and hard tracks. TREC 4, 1–1 (2004)
Zhang, S., Yang, M., Cour, T., Yu, K., Metaxas, D.N.: Query specific fusion for image retrieval. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, pp. 660–673. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33709-3_47
Zhang, W., Ngo, C.W., Cao, X.: Hyperlink-aware object retrieval. IEEE Trans. Image Process. 25(9), 4186–4198 (2016)
Zhang, X., Hu, X., Zhou, X.: A comparative evaluation of different link types on enhancing document clustering. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 555–562. ACM (2008)
Zhou, W., Li, H., Tian, Q.: Recent advance in content-based image retrieval: a literature survey. arXiv preprint arXiv:1706.06064 (2017)
Zhou, W., Tian, Q., Li, H.: Visual block link analysis for image re-ranking. In: Proceedings of the First International Conference on Internet Multimedia Computing and Service, pp. 10–16. ACM (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Aouadi, H., Khemakhem, M.T., Jemaa, M.B. (2019). Uncovering Hidden Links Between Images Through Their Textual Context. In: Hammoudi, S., Śmiałek, M., Camp, O., Filipe, J. (eds) Enterprise Information Systems. ICEIS 2018. Lecture Notes in Business Information Processing, vol 363. Springer, Cham. https://doi.org/10.1007/978-3-030-26169-6_18
Download citation
DOI: https://doi.org/10.1007/978-3-030-26169-6_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-26168-9
Online ISBN: 978-3-030-26169-6
eBook Packages: Computer ScienceComputer Science (R0)