Content-Irrelevant Tag Cleansing via Bi-Layer Clustering and Peer Cooperation

Xia, Zhaoqiang; Feng, Xiaoyi; Peng, Jinye; Fan, Jianping

doi:10.1007/s11265-014-0895-y

Content-Irrelevant Tag Cleansing via Bi-Layer Clustering and Peer Cooperation

Published: 15 May 2014

Volume 81, pages 29–44, (2015)
Cite this article

Journal of Signal Processing Systems Aims and scope Submit manuscript

Zhaoqiang Xia¹,
Xiaoyi Feng¹,
Jinye Peng¹ &
…
Jianping Fan²

359 Accesses
7 Citations
Explore all metrics

Abstract

User-provided tags for social images have facilitated many fields, such as social image organization, summarization and retrieval. Since the users utilize their own knowledge and personalized language to describe the visual content of social images, these social tags are too imprecise and ambiguous to exploit the social image tagging. In this paper, we discover the content-similar images (peers) and leverage the relationships among these images (peer cooperation) to handle the problem of content-irrelevant tags. A bi-layer clustering framework for discovering content-similar images is proposed to divide image collection into different groups, and the tags of peers in these groups are cleaned jointly based on tag statistics and relevance. The relevance of tags measured by Google Distance is used to generate the first-layer clustering and then the bi-modality similarity of images is used to perform the second-layer clustering. Based on the bi-layer clustering, we utilize peers in a group to identify their content-irrelevant tags. Finally, an extended Fisher’s criterion is proposed to decide the proper number of content-irrelevant tags. To verify the effectiveness of our proposed technique, we conduct the experiments on the social images of Flickr and the standard benchmark. The comparison experiments show that our proposed algorithm achieves positive results for tag cleansing and image retrieval.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Knowledge-aware progressive clustering for social image

Article Open access 30 October 2023

Multi-level diversification approach of semantic-based image retrieval results

Article 22 June 2019

A Tags Mining Approach for Automatic Image Annotation Using Neighbor Images Tree

Notes

In [11], authors use the Google search engine and name the approach with ‘Google distance’. We still use the same name even if we utilize the BING search engine and other database in this paper.
The NMF method constructs the estimated image-tag matrix V ^′ using the matrix factorization V≈W H (V is the original image-tag matrix). Most zero entities in V would be updated to non-zero by updating W and H through minimizing the objective function F= ∑i=1n ∑j=1m[V _{i
j} l o g(W H)_{i
j}−(W H)_{i
j}].

References

Bing. http://www.bing.com/images.
Flickr. http://www.flickr.com/.
Flickr blog. http://www.flickr.com/photos/franckmichel/6471458477/in/photostream.
Picasa web alums. https://picasaweb.google.com/home.
Wordnet. http://wordnet.princeton.edu/.
Ames, M., & Naaman, M. (2007). Why we tag: motivations for annotation in mobile and online media. In: Proceedings of the SIGCHI conference on Human factors in computing systems, (pp. 971–980). ACM.
Bird, S., Klein, E., Loper, E. (2009). Natural language processing with Python. O’Reilly Media, Incorporated.
Cattuto, C., Benz, D., Hotho, A., Stumme, G. (2008). Semantic analysis of tag similarity measures in collaborative tagging systems. arXiv preprint arXiv:0805.2045.
Chua, T.S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y. (2009). Nus-wide: a real-world web image database from national university of singapore. In: Proceedings of the ACM International Conference on Image and Video Retrieval, CIVR ’09, (pp. 48:1–48:9). ACM, New York.
Cilibrasi, R., & Vitányi, P.M. (2005). Clustering by compression. IEEE Transactions on Information Theory, 51(4), 1523–1545.
Article Google Scholar
Cilibrasi, R.L., & Vitanyi, P.M.B. (2007). The google similarity distance. IEEE Transactions on Knowledge and Data Engineering, 19(3), 370–383.
Article Google Scholar
Fan, J., Shen, Y., Zhou, N., Gao, Y. (2010). Harvesting large-scale weakly-tagged image databases from the web. In: CVPR’10, (pp. 802–809).
Frey, B., & Dueck, D. (2007). Clustering by passing messages between data points. Science, 315(5814), 972.
Article Google Scholar
Garg, N., & Weber, I. (2008). Personalized, interactive tag recommendation for flickr. In: Proceedings of the 2008 ACM conference on Recommender systems, (pp. 67–74). ACM.
Jia, Y., Wang, J., Zhang, C., Hua, X.S. (2008). Finding image exemplars using fast sparse affinity propagation. In: Proceedings of the 16th ACM international conference on Multimedia, MM ’08, (pp. 639–642). ACM.
Kang, F.Automatic image annotation. Ph.D. thesis (2007). AAI3282132.
Kennedy, L., Naaman, M., Ahern, S., Nair, R., Rattenbury, T. (2007). How flickr helps us make sense of the world: context and content in community-contributed media collections. In: Proceedings of the 15th international conference on Multimedia, (pp. 631–640). ACM.
Lee, D.D., Seung, H., et al. (1999). Learning the parts of objects by non-negative matrix factorization. Nature, 401(6755), 788–791.
Article Google Scholar
Lee, S., De Neve, W., Plataniotis, K., Ro, Y. (2010). Map-based image tag recommendation using a visual folksonomy. Pattern Recognition Letters, 31(9), 976–982.
Article Google Scholar
Lee, S., De Neve, W., Ro, Y.M. (2010). Tag refinement in an image folksonomy using visual similarity and tag co-occurrence statistics. Image Commun, 25(10), 761–773.
Google Scholar
Li, X., Snoek, C.G., Worring, M. (2008). Learning tag relevance by neighbor voting for social image retrieval. In: Proceedings of the 1st ACM international conference on Multimedia information retrieval, MIR ’08, (pp. 180–187). ACM.
Lin, C.J. (2007). Projected gradient methods for nonnegative matrix factorization. Neural computation, 19(10), 2756–2779.
Article Google Scholar
Liu, D., Hua, X.S., Wang, M., Zhang, H.J. (2010). Image retagging. In: Proceedings of the international conference on Multimedia, MM ’10, (pp. 491–500). ACM.
Liu, J., Li, Z., Lu, H. (2013). Sparse semantic metric learning for image retrieval. Multimedia Systems, 1–9.
Olson, D.L., & Delen, D. (2008). Advanced data mining techniques. Springer.
Qiu, G. (2002). Indexing chromatic and achromatic patterns for content-based colour image retrieval. Pattern Recognition, 35(8), 1675–1686.
Article Google Scholar
Sang, J., Xu, C., Liu, J. (2012). User-aware image tag refinement via ternary semantic analysis. IEEE Transactions on Multimedia, 14(3), 883–895.
Article Google Scholar
Shen, Y., & Fan, J. (2010). Leveraging loosely-tagged images and inter-object correlations for tag recommendation. In: Proceedings of the international conference on Multimedia, (pp. 5–14). ACM.
Sigurbjörnsson, B., & Van Zwol, R. (2008). Flickr tag recommendation based on collective knowledge. In: Proceedings of the 17th international conference on World Wide Web, (pp. 327–336). ACM.
Solskinnsbakk, G., & Gulla, J.A. (2011). Mining tag similarity in folksonomies. In: Proceedings of the 3rd international workshop on Search and mining user-generated contents, (pp. 53–60). ACM.
Tang, J., Hong, R., Yan, S., Chua, T.S., Qi, G.J., Jain, R. (2011). Image annotation by k nn-sparse graph-based label propagation over noisily tagged web images. ACM Transactions on Intelligent Systems and Technology (TIST), 2(2), 14.
Google Scholar
Tang, J., Yan, S., Hong, R., Qi, G.J., Chua, T.S. (2009). Inferring semantic concepts from community-contributed images and noisy tags. In: Proceedings of the 17th ACM international conference on Multimedia, MM ’09, (pp. 223–232). ACM.
Wang, M., Ni, B., Hua, X., Chua, T. (2012). Assistive tagging: A survey of multimedia tagging with human-computer joint exploration. ACM Computing Surveys (CSUR), 44(4), 25.
Article Google Scholar
Wu, L., Jin, R., Jain, A.K. (2013). Tag completion for image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(3), 716–727.
Article Google Scholar
Wu, L., Yang, L., Yu, N., Hua, X.S. (2009). Learning to tag. In: Proceedings of the 18th international conference on World wide web, WWW ’09, (pp. 361–370). ACM.
Xia, Z., Peng, J., Feng, X., Fan, J. (2012). Social tag enrichment via automatic abstract tag refinement. In: Advances in Multimedia Information Processing PCM 2012, Lecture Notes in Computer Science, vol. 7674, (pp. 198–209). Springer, Berlin.
Zhu, G., Yan, S., Ma, Y. (2010). Image tag refinement towards low-rank, content-tag prior and error sparsity. In: Proceedings of the international conference on Multimedia, MM ’10, (pp. 461–470). ACM.

Download references

Acknowledgments

This work is partly supported by the doctorate foundation of Northwestern Polytechnical University (Grant No: CX201113), Doctoral Program of Higher Education of China (Grant No. 201161 02110027) and National Natural Science Foundation of China (under Grant No.61075014).

Author information

Authors and Affiliations

Northwestern Polytechnical University, Xi’an, China
Zhaoqiang Xia, Xiaoyi Feng & Jinye Peng
University of North Carolina at Charlotte, Charlotte, NC, USA
Jianping Fan

Authors

Zhaoqiang Xia
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoyi Feng
View author publications
You can also search for this author in PubMed Google Scholar
Jinye Peng
View author publications
You can also search for this author in PubMed Google Scholar
Jianping Fan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhaoqiang Xia.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Xia, Z., Feng, X., Peng, J. et al. Content-Irrelevant Tag Cleansing via Bi-Layer Clustering and Peer Cooperation. J Sign Process Syst 81, 29–44 (2015). https://doi.org/10.1007/s11265-014-0895-y

Download citation

Received: 30 July 2013
Revised: 03 April 2014
Accepted: 09 April 2014
Published: 15 May 2014
Issue Date: October 2015
DOI: https://doi.org/10.1007/s11265-014-0895-y

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Content-Irrelevant Tag Cleansing via Bi-Layer Clustering and Peer Cooperation

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Knowledge-aware progressive clustering for social image

Multi-level diversification approach of semantic-based image retrieval results

A Tags Mining Approach for Automatic Image Annotation Using Neighbor Images Tree

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Content-Irrelevant Tag Cleansing via Bi-Layer Clustering and Peer Cooperation

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Knowledge-aware progressive clustering for social image

Multi-level diversification approach of semantic-based image retrieval results

A Tags Mining Approach for Automatic Image Annotation Using Neighbor Images Tree

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation