Abstract
Recognizing figures in portrait Thangka images is fundamental to the appreciation of this unique Tibetan art. In this paper, we focus on the problem of portrait Thangka image retrieval from the perspective of re-identifying the figures in the images. Based on state-of-the-art re-identification methods, we further improve them by exploiting several tricks. We also investigate the impact of different cropping methods to evaluate the contribution of different features. Our evaluation results on an annotated portrait Thangka image dataset collected by ourselves demonstrate the necessity of further study on this challenging problem. We will release the data and code to promote the research on Thangka.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Liu, H., Wang, X., Bi, X., Wang, X., Zhao, J.: A multi-feature SVM classification of Thangka headdress. In: 2015 8th International Symposium on Computational Intelligence and Design (ISCID), Hangzhou, China, pp. 160–163. IEEE, December 2015
Chen, Y., Liu, X.: Thangka religious tools classification and detection based on HOG+SVM. In: 2019 IEEE 3rd Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC), Chongqing, China, pp. 967–971. IEEE, October 2019
Qian, J., Wang, W.: Religious portrait Thangka image retrieval based on gesture feature. In: 2009 Chinese Conference on Pattern Recognition, Nanjing, China, pp. 1–5. IEEE, November 2009
Wang, W.: Study of Thangka image retrieval and multimedia presentation management system. In: 2009 Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, Kyoto, Japan, pp. 981–984. IEEE, September 2009
Bryan, B., Gong, Y., Zhang, Y., Poellabauer, C.: Second-order non-local attention networks for person re-identification. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South), pp. 3759–3768. IEEE, October 2019
Weyand, T., Araujo, A., Cao, B., Sim, J.: Google landmarks dataset v2 – a large-scale benchmark for instance-level recognition and retrieval. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, pp. 2572–2581. IEEE, June 2020
Ji, Y.H., et al.: An effective pipeline for a real-world clothes retrieval system (2020)
Ge, Y., Zhang, R., Wang, X., Tang, X., Luo, P.: DeepFashion2: a versatile benchmark for detection, pose estimation, segmentation and re-identification of clothing images. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, pp. 5332–5340. IEEE, June 2019
Jegou, H., Douze, M., Schmid, C., Perez, P.: Aggregating local descriptors into a compact image representation. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, pp. 3304–3311. IEEE, June 2010
Radenovic, F., Tolias, G., Chum, O.: Fine-tuning CNN image retrieval with no human annotation. IEEE Trans. Pattern Anal. Mach. Intell. 41(7), 1655–1668 (2019)
Ng, T., Balntas, V., Tian, Y., Mikolajczyk, K.: SOLAR: second-order loss and attention for image retrieval. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12370, pp. 253–270. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58595-2_16
Cao, B., Araujo, A., Sim, J.: Unifying deep local and global features for image search. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12365, pp. 726–743. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58565-5_43
Wang, G., Yuan, Y., Chen, X., Li, J., Zhou, X.: Learning discriminative features with multiple granularities for person re-identification. In: Proceedings of the 26th ACM International Conference on Multimedia, MM 2018, pp. 274–282. Association for Computing Machinery, New York (2018)
Luo, H., Gu, Y., Liao, X., Lai, S., Jiang, W.: Bag of tricks and a strong baseline for deep person re-identification. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Long Beach, CA, USA, pp. 1487–1495. IEEE, June 2019
Ye, M., Shen, J., Lin, G., Xiang, T., Shao, L., Hoi, S.C.H.: Deep learning for person re-identification: a survey and outlook. IEEE Trans. Pattern Anal. Mach. Intell. (2021)
Cubuk, E.D., Zoph, B., Mane, D., Vasudevan, V., Le, Q.V.: AutoAugment: learning augmentation strategies from data. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, pp. 113–123. IEEE, June 2019
Pan, X., Luo, P., Shi, J., Tang, X.: Two at once: enhancing learning and generalization capacities via IBN-Net. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11208, pp. 484–500. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01225-0_29
He, L., Liao, X., Liu, W., Liu, X., Cheng, P., Mei, T.: FastReID: a Pytorch toolbox for general instance re-identification (2020)
Acknowledgments
This work was supported by the National Natural Science Foundation of China [NO. 62066042; NO. 61971005], and First-Class Discipline Cultivation Projects of Tibet University (No. 00060704/004).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Danzeng, X. et al. (2021). Portrait Thangka Image Retrieval via Figure Re-identification. In: Feng, J., Zhang, J., Liu, M., Fang, Y. (eds) Biometric Recognition. CCBR 2021. Lecture Notes in Computer Science(), vol 12878. Springer, Cham. https://doi.org/10.1007/978-3-030-86608-2_9
Download citation
DOI: https://doi.org/10.1007/978-3-030-86608-2_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86607-5
Online ISBN: 978-3-030-86608-2
eBook Packages: Computer ScienceComputer Science (R0)