Abstract
Currently, AI-based clothing image classification mostly relies on conventional deep learning methods that classify monocular clothing images. However, the diversity of viewpoints in real-world clothing images makes classification considerably more difficult. Moreover, deep convolutional networks have limitations of their own: they treat data as vectors in Euclidean space and fail to make full use of the low-dimensional non-linear geometric structure latent in high-dimensional clothing image data. Therefore, this paper exploits the geometric structure inherent in clothing image data from the perspective of non-Euclidean manifold learning, and designs and implements a clothing classification network with manifold structure based on second-order convolution, which classifies images using the second-order statistics of clothing features. First, the clothing image features extracted by a convolutional neural network are passed through a covariance pooling module to obtain their second-order statistic, the covariance matrix, which is mapped onto the SPD (symmetric positive definite) manifold to characterize the feature information of the clothing image set. A complete manifold-structured neural network is then constructed to enhance the model's ability to represent the intrinsic geometric structure of the clothing image set. Experimental results on the multi-view clothing image dataset MVC show that the method is effective, robust, and accurate.
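The covariance pooling step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `covariance_pool` and `log_map`, the regularization constant `eps`, and the use of a matrix logarithm to flatten the SPD matrix into a Euclidean tangent space are all assumptions made for illustration, and the abstract does not state which manifold layers the network actually uses.

```python
import numpy as np


def covariance_pool(features, eps=1e-4):
    """Pool a (C, H, W) feature map into a (C, C) SPD covariance matrix.

    `eps` is an assumed regularizer: adding eps * I keeps the result
    strictly positive definite even when H * W < C.
    """
    c = features.shape[0]
    x = features.reshape(c, -1)              # C x N, with N = H * W
    x = x - x.mean(axis=1, keepdims=True)    # center each channel
    n = x.shape[1]
    cov = x @ x.T / (n - 1)                  # second-order statistic
    return cov + eps * np.eye(c)


def log_map(spd):
    """Matrix logarithm of an SPD matrix via eigendecomposition.

    An assumed, commonly used way to map an SPD matrix to a flat
    (Euclidean) space before a standard classifier.
    """
    w, v = np.linalg.eigh(spd)               # eigenvalues w are all > 0
    return (v * np.log(w)) @ v.T             # V diag(log w) V^T


# Hypothetical usage: a random array stands in for CNN features.
rng = np.random.default_rng(0)
feature_map = rng.standard_normal((8, 5, 5))
spd = covariance_pool(feature_map)
tangent = log_map(spd)
```

Here each of the C channels is treated as a variable and each of the H × W spatial positions as an observation, so the pooled descriptor has a size that depends only on the number of channels, not on the spatial resolution.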
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
He, R., Quan, C. (2023). A Clothing Classification Network with Manifold Structure Based on Second-Order Convolution. In: Zhang, H., et al. International Conference on Neural Computing for Advanced Applications. NCAA 2023. Communications in Computer and Information Science, vol 1870. Springer, Singapore. https://doi.org/10.1007/978-981-99-5847-4_10
DOI: https://doi.org/10.1007/978-981-99-5847-4_10
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-5846-7
Online ISBN: 978-981-99-5847-4
eBook Packages: Computer Science, Computer Science (R0)