Abstract
Common target detection is usually based on single frame images, which is vulnerable to affected by the similar targets in the image and not applicable to video images. In this paper, anchor mask is proposed to add the prior knowledge for target detection and an anchor mask net is designed to improve the RPN performance for single target detection. Tested in the VOT2016, the model perform better.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition (2015)
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. (2014)
Krähenbühl, P., Koltun, V.: Efficient inference in fully connected CRFs with Gaussian edge potentials (2012)
Zheng, S., Jayasumana, S., Romera-Paredes, B., et al.: Conditional random fields as recurrent neural networks (2015)
Girshick, R., Donahue, J., Darrell, T., et al.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society (2014)
Girshick, R.: Fast R-CNN. In: Computer Science (2015)
Ren, S., He, K., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. In: International Conference on Neural Information Processing Systems (2015)
Redmon, J., Divvala, S., Girshick, R., et al.: You only look once: unified, real-time object detection (2015)
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: IEEE Conference on Computer Vision & Pattern Recognition, Honolulu, HI, 21–26 July 2017, pp. 6517–6525. IEEE (2017)
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement (2018)
Dai, J., Li, Y., He, K., et al.: R-FCN: object detection via region-based fully convolutional networks. In: Proceedings of the 30th International Conference on Neural Information Processing Systems. Curran Associates Inc. (2016)
Liu, W., Anguelov, D., Erhan, D., et al.: SSD: single shot multibox detector (2015)
Tran, D., Bourdev, L., Fergus, R., et al.: Learning spatiotemporal features with 3D convolutional networks (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Li, M., Feng, Y., Yin, Z., Zhou, C., Dong, F. (2019). Improved RPN for Single Targets Detection Based on the Anchor Mask Net. In: Wang, Y., Huang, Q., Peng, Y. (eds) Image and Graphics Technologies and Applications. IGTA 2019. Communications in Computer and Information Science, vol 1043. Springer, Singapore. https://doi.org/10.1007/978-981-13-9917-6_2
Download citation
DOI: https://doi.org/10.1007/978-981-13-9917-6_2
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9916-9
Online ISBN: 978-981-13-9917-6
eBook Packages: Computer ScienceComputer Science (R0)