research-article

Image label completion by pursuing contextual decomposability

Authors:

Xiaobai Liu,

Shuicheng Yan,

Tat-Seng Chua,

Hai JinAuthors Info & Claims

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 8, Issue 2

Article No.: 21, Pages 1 - 20

https://doi.org/10.1145/2168996.2169001

Published: 22 May 2012 Publication History

Get Access

Abstract

This article investigates how to automatically complete the missing labels for the partially annotated images, without image segmentation. The label completion procedure is formulated as a nonnegative data factorization problem, to decompose the global image representations that are used for describing the entire images, for instance, various image feature descriptors, into their corresponding label representations, that are used for describing the local semantic regions within images. The solution provided in this work is motivated by following observations. First, label representations of the regions with the same label often share certain commonness, yet may be essentially different due to the large intraclass variations. Thus, each label or concept should be represented by using a subspace spanned by an ensemble of basis, instead of a single one, to characterize the intralabel diversities. Second, the subspaces for different labels are different from each other. Third, while two images are similar with each other, the corresponding label representations should be similar. We formulate this cross-image context as well as the given partial label annotations in the framework of nonnegative data factorization and then propose an efficient multiplicative nonnegative update rules to alternately optimize the subspaces and the reconstruction coefficients. We also provide the theoretic proof of algorithmic convergence and correctness. Extensive experiments over several challenging image datasets clearly demonstrate the effectiveness of our proposed solution in boosting the quality of image label completion and image annotation accuracy. Based on the same formulation, we further develop a label ranking algorithms, to refine the noised image labels without any manual supervision. We compare the proposed label ranking algorithm with the state-of-the-arts over the popular evaluation databases and achieve encouragingly improvements.

References

[1]

Ahonen, T., Hadid, A., and Pietikäinen, M. 2006. Face description with local binary patterns: application to face recognition. IEEE Trans. Pattern Anal. Mach. Intel. 28, 12, 2037--2041.

Abstract

References

Cited By

Index Terms

Recommendations

Multi-label learning with missing labels for image annotation and facial action unit recognition

Recurrent Image Annotation with Explicit Inter-label Dependencies

Instance Annotation for Multi-Instance Multi-Label Learning

Comments

Information

Published In

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations