Skip to main content

A Clothing Classification Network with Manifold Structure Based on Second-Order Convolution

  • Conference paper
  • First Online:
International Conference on Neural Computing for Advanced Applications (NCAA 2023)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1870))

Included in the following conference series:

  • 672 Accesses

Abstract

Currently, AI-based clothing image classification techniques mostly use traditional deep learning methods, which are based on monocular clothing images for classification. However, the diversity of perspectives of realistic clothing images bring great difficulties and challenges to clothing classification. Moreover, deep convolutional networks have limitations of their own. They treat data as vectors in Euclidean space and fail to make full use of the potential low dimensional non-linear geometric structure information within high-dimensional clothing image data. Therefore, this paper explores and exploits the geometric structure information inherent of clothing image data from the perspective of a non-Euclidean manifold learning method, and designs and implements a clothing classification network with manifold structure based on second-order convolution to classify images using the second-order statistics of clothing features for image classification. Firstly, the input clothing image features extracted by the convolution neural network are pooled with the covariance pooling module to obtain the second-order statistical covariance, which is converted into SPD manifold to characterize the feature information of the clothing image set, and then a complete manifold structure neural network is constructed to enhance the feature representation ability of the model on the geometric intrinsic structure of the clothing image set. The experimental results of this method on the multi view clothing image dataset MVC show that it has good effectiveness, robustness, and accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Liu, Z., Luo, P., Qiu, S., Wang, X., Tang, X.: DeepFashion: powering robust clothes recognition and retrieval with rich annotations. In: Computer Vision & Pattern Recognition. IEEE (2016). https://doi.org/10.1109/CVPR.2016.124

  2. Xiao, L., Yichao, X.: Exact clothing retrieval approach based on deep neural network. In: 2016 IEEE Information Technology, Networking, Electronic and Automation Control Conference (ITNEC 2016) (2016)

    Google Scholar 

  3. Liu, K.H., Chen, T.Y., Chen, C.S.: MVC: a dataset for view-invariant clothing retrieval and attribute prediction. In: Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, pp. 313–316 (2016)

    Google Scholar 

  4. Ge, Y., Zhang, R., Wang, X., Tang, X., Luo, P.: Deepfashion2: a versatile benchmark for detection, pose estimation, segmentation and re-identification of clothing images. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5337–5345 (2019)

    Google Scholar 

  5. Sidnev, A., Krapivin, A., Trushkov, A., Krasikova, E., Kazakov, M.: DeepMark++: CenterNet-based clothing detection (2020). 10.48550

    Google Scholar 

  6. He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. IEEE Trans. Pattern Anal. Mach. Intell. (2017). https://doi.org/10.1109/TPAMI.2018.2844175

  7. Duan, K., Bai, S., Xie, L., Qi, H., Huang, Q., Tian, Q.: CenterNet: keypoint triplets for object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6569–6578 (2019)

    Google Scholar 

  8. Lin, M., Ji, R., Chen, B., Chao, F., Ji, R.: Training compact CNNs for image classification using dynamic-coded filter fusion. IEEE Trans. Pattern Anal. Mach. Intell. (2023)

    Google Scholar 

  9. Liu, S., Zhang, L., Yang, X., Su, H., Zhu, J.: Query2label: a simple transformer way to multi-label classification. arXiv preprint arXiv:2107.10834 (2021)

  10. Mallavarapu, T., Cranfill, L., Kim, E.H., Parizi, R.M., Morris, J., Son, J.: A federated approach for fine-grained classification of fashion apparel. Mach. Learn. Appl. 6, 100118 (2021)

    Google Scholar 

  11. Eshwar, S.G., Rishikesh, A.V., Charan, N.A., Umadevi, V.: Apparel classification using convolutional neural networks. In: 2016 International Conference on ICT in Business Industry & Government (ICTBIG), pp. 1–5. IEEE (2016)

    Google Scholar 

  12. Iliukovich-Strakovskaia, A., Dral, A., Dral, E.: Using pre-trained models for fine-grained image classification in fashion field. In: Proceedings of the First International Workshop on Fashion and KDD, KDD, pp. 31–40 (2016)

    Google Scholar 

  13. Seo, Y., Shin, K.S.: Hierarchical convolutional neural networks for fashion image classification. Expert Syst. Appl. 116, 328–339 (2019)

    Article  Google Scholar 

  14. Qu, X., Che, H., Huang, J., Xu, L., Zheng, X.: Multi-layered semantic representation network for multi-label image classification. Int. J. Mach. Learn. Cybern., 1–9 (2023)

    Google Scholar 

  15. Cheng, X., et al.: MLTR: multi-label classification with transformer. In: 2022 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2022)

    Google Scholar 

  16. Lin T H.: Aggregation and finetuning for clothes landmark detection. arXiv preprint arXiv:2005.00419 (2020)

  17. Zhang, Z., Song, C., Zou, Q.: Fusing hierarchical convolutional features for human body segmentation and clothing fashion classification. arXiv preprint arXiv:1803.03415 (2018)

  18. Masci, J., Boscaini, D., Bronstein, M., Vandergheynst, P.: Geodesic convolutional neural networks on Riemannian manifolds. In: Proceedings of the IEEE International Conference on Computer Vision Workshops, pp. 37–45 (2015)

    Google Scholar 

  19. Poulenard, A., Ovsjanikov, M.: Multi-directional geodesic neural networks via equivariant convolution. ACM Trans. Graph. (TOG) 37(6), 1–14 (2018)

    Article  Google Scholar 

  20. Ionescu, C., Vantzos, O., Sminchisescu, C.: Matrix backpropagation for deep networks with structured layers. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2965–2973 (2015)

    Google Scholar 

  21. Huang, Z., Van Gool, L.: A riemannian network for SPD matrix learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 31, no. 1 (2017)

    Google Scholar 

  22. Huang, Z., Wu, J., Van Gool, L.: Building deep networks on Grassmann manifolds. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32, no. 1 (2018)

    Google Scholar 

  23. Huang, Z., Wan, C., Probst, T., Van Gool, L.: Deep learning on lie groups for skeleton-based action recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6099–6108 (2017)

    Google Scholar 

  24. Qiao, S., Wang, R., Shan, S., Chen, X.: Deep heterogeneous hashing for face video retrieval. IEEE Trans. Image Process. 29, 1299–1312 (2019)

    Article  MathSciNet  MATH  Google Scholar 

  25. Li, C., et al.: Deep manifold structure transfer for action recognition. IEEE Trans. Image Process. 28(9), 4646–4658 (2019)

    Article  MathSciNet  MATH  Google Scholar 

  26. Chakraborty, R., Bouza, J., Manton, J., Vemuri, B.C.: ManifoldNet: a deep network framework for manifold-valued data. arXiv preprint arXiv:1809.06211 (2018)

  27. Brooks, D., Schwander, O., Barbaresco, F., Schneider, J.Y., Cord, M.: Riemannian batch normalization for SPD neural networks. In: Advances in Neural Information Processing Systems, 32 (2019)

    Google Scholar 

  28. Yu, K., Salzmann, M.: Second-order convolutional neural networks. Clin. Immunol. Immunopathol. (2017). https://doi.org/10.1006/clin.1993.1030

  29. Sra, S.: Positive definite matrices and the S-divergence. Proc. Am. Math. Soc. 144(7), 2787–2797 (2016)

    Article  MathSciNet  MATH  Google Scholar 

  30. Tran, D., Bourdev, L., Fergus, R., Torresani, L., Paluri, M.: Learning spatiotemporal features with 3D convolutional networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 4489–4497 (2015)

    Google Scholar 

  31. Donahue, J., et al.: Long-term recurrent convolutional networks for visual recognition and description. In: Proceedings of the IEEE Conference on computer Vision and Pattern Recognition, pp. 2625–2634 (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Cheng Quan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

He, R., Quan, C. (2023). A Clothing Classification Network with Manifold Structure Based on Second-Order Convolution. In: Zhang, H., et al. International Conference on Neural Computing for Advanced Applications. NCAA 2023. Communications in Computer and Information Science, vol 1870. Springer, Singapore. https://doi.org/10.1007/978-981-99-5847-4_10

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-5847-4_10

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-5846-7

  • Online ISBN: 978-981-99-5847-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics