Portrait Thangka Image Retrieval via Figure Re-identification

Danzeng, Xire; Yang, Yuchao; Yang, Yufan; Hou, Zhao; Xi, Rui; Li, Xinsheng; Zhao, Qijun; Danzeng, Pubu; Duoji, Gesang; Gao, Dingguo

doi:10.1007/978-3-030-86608-2_9


Part of the book series: Lecture Notes in Computer Science (LNIP, volume 12878)

Included in the following conference series:

Chinese Conference on Biometric Recognition


Abstract

Recognizing figures in portrait Thangka images is fundamental to the appreciation of this unique Tibetan art. In this paper, we approach portrait Thangka image retrieval as a problem of re-identifying the figures in the images. We build on state-of-the-art re-identification methods and further improve them by exploiting several tricks. We also investigate the impact of different cropping methods to evaluate the contribution of different features. Evaluation results on an annotated portrait Thangka image dataset that we collected demonstrate the necessity of further study on this challenging problem. We will release the data and code to promote research on Thangka.
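To make the retrieval-as-re-identification framing concrete, the following is a minimal sketch of how a query figure could be matched against a gallery once each cropped image has been mapped to a feature vector. It is not the authors' released code: the feature vectors are toy values standing in for the output of some re-identification model, and the names `l2_normalize`, `rank_gallery`, and `rank1_accuracy` are hypothetical helpers introduced only for illustration. Cosine similarity and rank-1 accuracy are assumed here as a common choice in re-identification work; the abstract does not state which metric the paper uses.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products equal cosine similarity."""
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return x / np.maximum(norms, eps)


def rank_gallery(query_feats, gallery_feats):
    """For each query, return gallery indices sorted from most to least similar."""
    q = l2_normalize(np.asarray(query_feats, dtype=np.float64))
    g = l2_normalize(np.asarray(gallery_feats, dtype=np.float64))
    similarity = q @ g.T  # shape: (num_queries, num_gallery)
    return np.argsort(-similarity, axis=1, kind="stable")


def rank1_accuracy(ranking, query_ids, gallery_ids):
    """Fraction of queries whose top-ranked gallery image shows the same figure."""
    top_ids = np.asarray(gallery_ids)[ranking[:, 0]]
    return float(np.mean(top_ids == np.asarray(query_ids)))


# Toy data (assumption): 2-D features for three gallery figures and two queries.
gallery_feats = np.array([[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]])
gallery_ids = np.array([0, 1, 2])
query_feats = np.array([[0.9, 0.1], [0.1, 0.9]])
query_ids = np.array([0, 1])

ranking = rank_gallery(query_feats, gallery_feats)
accuracy = rank1_accuracy(ranking, query_ids, gallery_ids)
print(ranking.tolist(), accuracy)
```

In this sketch, changing the cropping method would only change which image region is fed to the feature extractor; the ranking and scoring steps stay the same, which is what would make different crops directly comparable.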



Acknowledgments

This work was supported by the National Natural Science Foundation of China (Nos. 62066042 and 61971005) and the First-Class Discipline Cultivation Projects of Tibet University (No. 00060704/004).

Author information

Authors and Affiliations

School of Information Science and Technology, Tibet University, Lhasa, Tibet, People’s Republic of China
Xire Danzeng, Yufan Yang, Zhao Hou, Qijun Zhao, Pubu Danzeng, Gesang Duoji & Dingguo Gao
College of Computer Science, Sichuan University, Chengdu, Sichuan, People’s Republic of China
Yuchao Yang, Rui Xi, Xinsheng Li & Qijun Zhao


Corresponding author

Correspondence to Gesang Duoji.

Editor information

Editors and Affiliations

Tsinghua University, Beijing, China
Jianjiang Feng
Fudan University, Shanghai, China
Junping Zhang
Shanghai Jiao Tong University, Shanghai, China
Manhua Liu
Shanghai University, Shanghai, China
Yuchun Fang


About this paper

Cite this paper

Danzeng, X. et al. (2021). Portrait Thangka Image Retrieval via Figure Re-identification. In: Feng, J., Zhang, J., Liu, M., Fang, Y. (eds) Biometric Recognition. CCBR 2021. Lecture Notes in Computer Science, vol 12878. Springer, Cham. https://doi.org/10.1007/978-3-030-86608-2_9


DOI: https://doi.org/10.1007/978-3-030-86608-2_9
Published: 08 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86607-5
Online ISBN: 978-3-030-86608-2
eBook Packages: Computer Science, Computer Science (R0)
