Partially Separated Networks for Person Search

Chen, Chuanchuan; Fan, Jingbo; Zhu, Yuesheng; Luo, Guibo

doi:10.1007/978-3-030-00764-5_71

Chuanchuan Chen¹⁸,
Jingbo Fan¹⁸,
Yuesheng Zhu¹⁸ &
…
Guibo Luo¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11166))

Included in the following conference series:

Pacific Rim Conference on Multimedia

3161 Accesses

Abstract

The multi-task learning framework that considers pedestrian detection and person re-identification jointly is an effective solution for person search. However, the existing joint frameworks simply share the backbone network without considering the negative interaction between the two tasks. To alleviate this conflict and meet the different requirements in detection and re-identification, a Partially Separated Network (PSN) for person search is proposed in this paper. Unlike the traditional joint frameworks, our backbone network is partially separated for detection and identification, and feature maps with different scales are provided according to different characteristics. Our experiment results have demonstrated that on CUHK-SYSU dataset our mAP and top-1 on ResNet-50 are 5.4% and 4.4% higher, and on PRW dataset our mAP and top-1 on PVANet are 8.0% and 5.0% higher compared with the state-of-the-art methods. Specially, the improvements can be more impressive in the case of large gallery, occlusion and low resolution.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Ahmed, E., Jones, M., Marks, T.K.: An improved deep learning architecture for person re-identification. In: Computer Vision and Pattern Recognition, pp. 3908–3916 (2015)
Google Scholar
Chen, D., Yuan, Z., Chen, B., Zheng, N.: Similarity learning with spatial constraints for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1268–1277 (2016)
Google Scholar
Chen, D., Yuan, Z., Hua, G., Zheng, N.: Similarity learning on an explicit polynomial kernel feature map for person re-identification. In: Conference on Computer Vision and Pattern Recognition, pp. 1565–1573 (2015)
Google Scholar
Chen, W., Chen, X., Zhang, J., Huang, K.: Beyond triplet loss: a deep quadruplet network for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1320–1329 (2017)
Google Scholar
Ding, S., Lin, L., Wang, G., Chao, H.: Deep feature learning with relative distance comparison for person re-identification. Pattern Recognit. 48(10), 2993–3003 (2015)
Article Google Scholar
Dollar, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. IEEE Trans. Pattern Anal. Mach. Intell. 36(8), 1532–1545 (2014)
Article Google Scholar
Dollr, P., Tu, Z., Perona, P., Belongie, S.: Integral channel features. In: British Machine Vision Conference (2009)
Google Scholar
Felzenszwalb, P.F., Girshick, R.B., Mcallester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 47(2), 6–7 (2014)
Google Scholar
Gheissari, N., Sebastian, T.B., Hartley, R.: Person reidentification using spatiotemporal appearance. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1528–1535 (2006)
Google Scholar
Girshick, R.: Fast R-CNN. In: IEEE International Conference on Computer Vision (2015)
Google Scholar
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition (2014)
Google Scholar
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 38(1), 142 (2016)
Article Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1904–1916 (2015)
Article Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Hong, S., Roh, B., Kim, K.H., Cheon, Y., Park, M.: PVANet: lightweight deep neural networks for real-time object detection. In: International Workshop on Efficient Methods for Deep Neural Networks (2016)
Google Scholar
Kong, T., Yao, A., Chen, Y., Sun, F.: Hypernet: towards accurate region proposal generation and joint object detection. In: Computer Vision and Pattern Recognition, pp. 845–853 (2016)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: International Conference on Neural Information Processing Systems, pp. 1097–1105 (2012)
Google Scholar
Li, W., Zhao, R., Xiao, T., Wang, X.: DeepReID: deep filter pairing neural network for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 152–159 (2014)
Google Scholar
Liao, S., Hu, Y., Zhu, X., Li, S.Z.: Person re-identification by local maximal occurrence representation and metric learning. In: Computer Vision and Pattern Recognition, pp. 2197–2206 (2015)
Google Scholar
Liu, H., et al.: Neural person search machines, pp. 493–501 (2017)
Google Scholar
Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Chapter Google Scholar
Nam, W., Dollr, P., Han, J.H.: Local decorrelation for improved pedestrian detection. In: Advances in Neural Information Processing Systems (2014)
Google Scholar
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: NIPS (2015)
Google Scholar
Roth, P.M., Wohlhart, P., Hirzer, M., Kostinger, M., Bischof, H.: Large scale metric learning from equivalence constraints. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2288–2295 (2012)
Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556 (2014)
Sun, Y., Zheng, L., Deng, W., Wang, S.: SVDNet for pedestrian retrieval. In: IEEE International Conference on Computer Vision, pp. 3820–3828 (2017)
Google Scholar
Varior, R.R., Haloi, M., Wang, G.: Gated siamese convolutional neural network architecture for human re-identification. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9912, pp. 791–808. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46484-8_48
Chapter Google Scholar
Xiao, J., Xie, Y., Tillo, T., Huang, K., Wei, Y., Feng, J.: IAN: the individual aggregation network for person search. arXiv: 1705.05552 (2017)
Xiao, T., Li, S., Wang, B., Lin, L., Wang, X.: Joint detection and identification feature learning for person search. In: CVPR (2017)
Google Scholar
Yang, B., Yan, J., Lei, Z., Li, S.Z.: Convolutional channel features. In: IEEE International Conference on Computer Vision, pp. 82–90 (2015)
Google Scholar
Yang, Y., Liao, S., Lei, Z., Li, S.Z.: Large scale similarity learning using similar pairs for person verification. In: Thirtieth AAAI Conference on Artificial Intelligence, pp. 3655–3661 (2016)
Google Scholar
Yu, S.I., Yang, Y., Hauptmann, A.: Harry Potter’s marauder’s map: localizing and tracking multiple persons-of-interest by nonnegative discretization. In: Computer Vision and Pattern Recognition, pp. 3714–3720 (2013)
Google Scholar
Zajdel, W., Zivkovic, Z., Krose, B.J.A.: Keeping track of humans: have I seen this person before? In: IEEE International Conference on Robotics and Automation, pp. 2081–2086 (2005)
Google Scholar
Zhang, L., Lin, L., Liang, X., He, K.: Is faster R-CNN doing well for pedestrian detection? In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 443–457. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_28
Chapter Google Scholar
Zhao, H., et al.: Spindle net: person re-identification with human body region guided feature decomposition and fusion. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 907–915 (2017)
Google Scholar
Zhao, R., Ouyang, W., Wang, X.: Unsupervised salience learning for person re-identification. In: Computer Vision and Pattern Recognition, pp. 3586–3593 (2013)
Google Scholar
Zheng, L., Shen, L., Tian, L., Wang, S., Wang, J., Tian, Q.: Scalable person re-identification: a benchmark. In: IEEE International Conference on Computer Vision, pp. 1116–1124 (2015)
Google Scholar
Zheng, L., Zhang, H., Sun, S., Chandraker, M., Yang, Y., Tian, Q.: Person re-identification in the wild. In: CVPR (2017)
Google Scholar

Download references

Acknowledgement

This work is supported by the Shenzhen Municipal Development and Reform Commission (Disciplinary Development Program for Date Science and Intelligent Computing), and by Shenzhen International cooperative research projects GJHZ20170313150021171.

Author information

Authors and Affiliations

Communication and Information Security Lab, Shenzhen Graduate School, Peking University, Shenzhen, China
Chuanchuan Chen, Jingbo Fan, Yuesheng Zhu & Guibo Luo

Authors

Chuanchuan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Jingbo Fan
View author publications
You can also search for this author in PubMed Google Scholar
Yuesheng Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Guibo Luo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuesheng Zhu .

Editor information

Editors and Affiliations

Hefei University of Technology, Hefei, China
Richang Hong
National Chiao Tung University, Hsinchu, Taiwan
Wen-Huang Cheng
University of Tokyo, Tokyo, Japan
Toshihiko Yamasaki
Hefei University of Technology, Hefei, China
Meng Wang
City University of Hong Kong, Hong Kong, Hong Kong
Chong-Wah Ngo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, C., Fan, J., Zhu, Y., Luo, G. (2018). Partially Separated Networks for Person Search. In: Hong, R., Cheng, WH., Yamasaki, T., Wang, M., Ngo, CW. (eds) Advances in Multimedia Information Processing – PCM 2018. PCM 2018. Lecture Notes in Computer Science(), vol 11166. Springer, Cham. https://doi.org/10.1007/978-3-030-00764-5_71

Download citation

DOI: https://doi.org/10.1007/978-3-030-00764-5_71
Published: 18 September 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00763-8
Online ISBN: 978-3-030-00764-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics