DOI: 10.1145/3318299.3318331
research-article

Multi-Path and Multi-Loss Network for Person Re-Identification

Published: 22 February 2019

ABSTRACT

In person re-identification (re-ID), most state-of-the-art models extract features with convolutional neural networks and compare them by similarity, which makes feature representation the key task. However, features learned by a single-path, single-loss network are not good enough, because the learning objective reaches only one of multiple minima. To improve feature representation, we propose a multi-path and multi-loss network (MPMLN) and concatenate the multi-path features to represent a pedestrian. We design MPMLN on top of ResNet-50 as an end-to-end architecture. The backbone of the proposed network shares its local parameters across the multiple paths and multiple losses, so it has fewer parameters than multiple independent networks. Experimental results show that MPMLN achieves state-of-the-art performance on the public Market1501, DukeMTMC-reID and CUHK03 person re-ID benchmarks.
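The idea in the abstract can be illustrated with a toy sketch. It is not the authors' implementation: small bias-free linear layers stand in for ResNet-50, the layer sizes are arbitrary, and the abstract does not say which losses are used, so one softmax cross-entropy loss per path, summed with equal weight, is an assumption. All function and variable names are hypothetical.

```python
import math
import random

def linear(x, weights):
    """Apply a bias-free linear layer; weights is a list of rows."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]

def relu(x):
    return [max(0.0, v) for v in x]

def cross_entropy(logits, label):
    """Softmax cross-entropy for one sample, computed stably."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]

def random_matrix(rng, rows, cols):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

def count_params(weights):
    return sum(len(row) for row in weights)

def forward(x, label, backbone, paths):
    """Run one shared backbone, then every path with its own loss."""
    shared = relu(linear(x, backbone))      # computed once, reused by all paths
    features, total_loss = [], 0.0
    for embed, classifier in paths:
        f = linear(shared, embed)           # path-specific feature
        features.extend(f)                  # concatenate multi-path features
        total_loss += cross_entropy(linear(f, classifier), label)
    return features, total_loss

# Toy sizes (assumptions, far smaller than a real re-ID model).
IN_DIM, HIDDEN, EMBED, NUM_IDS, NUM_PATHS = 8, 6, 4, 3, 3

rng = random.Random(0)
backbone = random_matrix(rng, HIDDEN, IN_DIM)
paths = [
    (random_matrix(rng, EMBED, HIDDEN), random_matrix(rng, NUM_IDS, EMBED))
    for _ in range(NUM_PATHS)
]
x = [rng.uniform(0.0, 1.0) for _ in range(IN_DIM)]

features, loss = forward(x, 1, backbone, paths)

# Parameter count: one shared backbone versus one backbone per path.
head_params = sum(count_params(e) + count_params(c) for e, c in paths)
shared_params = count_params(backbone) + head_params
independent_params = NUM_PATHS * count_params(backbone) + head_params
```

With these toy sizes the concatenated feature has `NUM_PATHS * EMBED` entries, and the shared design stores the backbone weights once instead of once per path, which is the source of the parameter saving the abstract mentions.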

    • Published in

      ICMLC '19: Proceedings of the 2019 11th International Conference on Machine Learning and Computing
      February 2019
      563 pages
      ISBN:9781450366007
      DOI:10.1145/3318299

      Copyright © 2019 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Qualifiers

      • research-article
      • Research
      • Refereed limited