Abstract
As a task in object detection, small object detection mainly focuses on detecting objects of small size, which is more complex than general object detection. It is pivotal in various applications, e.g., small tumor detection, national defense and security, and traffic surveillance. Small objects have low pixels, few effective features, and a large influence of background noise, making small object detection extremely challenging. Currently, most object detection algorithms fail to take advantage of global context information to improve accuracy. Moreover, the conventional bounding-box proposal generation method will cause the missed detection of small targets since the target features are few and challenging to locate. To address the above problems, this paper uses the position enhancement method to improve the proposals’ generation to improve the recall rate and accuracy rate of small object detection. First, the location enhancement module adds additional keypoint location supervision to obtain target latent keypoints. Besides, we propose a keypoint expansion method to get more accurate keypoint locations. A global contextual attention mechanism is further introduced, enabling the detector to learn fine-grained features with contextual location information better. The experimental results on two datasets show that the proposed method can significantly improve the accuracy and recall of small target detection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ashbee, H.R., Barnes, R.A., Johnson, E.M., Richardson, M.D., Gorton, R., Hope, W.W.: Therapeutic drug monitoring (TDM) of antifungal agents: guidelines from the British society for medical mycology. J. Antimicrob. Chemother. 69(5), 1162–1176 (2014)
Chen, K., et al.: MMdetection: Open mmlab detection toolbox and benchmark. arXiv preprint arXiv:1906.07155 (2019)
Chu, X., et al.: Twins: Revisiting the design of spatial attention in vision transformers. In: NeurIPS, vol. 34 (2021)
Duan, K., Bai, S., Xie, L., Qi, H., Huang, Q., Tian, Q.: Centernet: keypoint triplets for object detection. In: ICCV, pp. 6569–6578 (2019)
Jiang, N., Yu, X., Peng, X., Gong, Y., Han, Z.: Sm+: refined scale match for tiny person detection. In: ICASSP, pp. 1815–1819. IEEE (2021)
Kim, L.S., Axelrod, L., Howard, P., Buratovich, N., Waters, R.: Efficacy of methylsulfonylmethane (MSM) in osteoarthritis pain of the knee: a pilot clinical trial. Osteoarthritis Cartilage 14(3), 286–294 (2006)
Krishna, K., Murty, M.N.: Genetic k-means algorithm. IEEE Trans. Syst. Man. Cybern. Part B (Cybern.) 29(3), 433–439 (1999)
Law, H., Deng, J.: Cornernet: detecting objects as paired keypoints. In: ECCV, pp. 734–750 (2018)
Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: ICCV, pp. 2980–2988 (2017)
Lin, T.Y., et al.: Microsoft coco: common objects in context. In: ECCV, pp. 740–755 (2014)
Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Mariscal, S.L., Garcia, V.S., Fernández-García, J.C., de Villarreal, E.S.: Acute effects of ballistic vs. passive static stretching involved in a prematch warm-up on vertical jump and linear sprint performance in soccer players. J. Strength Conditioning Res. 35(1), 147–153 (2021)
Rao, Z., He, M., Dai, Y., Zhu, Z., Li, B., He, R.: Nlca-net: a non-local context attention network for stereo matching. APSIPA Transactions on Signal and Information Processing 9 (2020)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. NeurIPS 28, 91–99 (2015)
Sang, J., et al.: An improved YOLOv2 for vehicle detection. Sensors 18(12), 4272 (2018)
Targ, S., Almeida, D., Lyman, K.: Resnet in resnet: Generalizing residual architectures. arXiv preprint arXiv:1603.08029 (2016)
Tian, Z., Shen, C., Chen, H., He, T.: FCOS: fully convolutional one-stage object detection. In: ICCV, pp. 9627–9636 (2019)
Vicente, S., Carreira, J., Agapito, L., Batista, J.: Reconstructing pascal voc. In: CVPR, pp. 41–48 (2014)
Wang, K., Liew, J.H., Zou, Y., Zhou, D., Feng, J.: Panet: few-shot image semantic segmentation with prototype alignment. In: ICCV, pp. 9197–9206 (2019)
Wang, Y., Wang, C., Zhang, H., Dong, Y., Wei, S.: Automatic ship detection based on RetinaNet using multi-resolution Gaofen-3 imagery. Remote Sens. 11(5), 531 (2019)
Yu, X., Gong, Y., Jiang, N., Ye, Q., Han, Z.: Scale match for tiny person detection. In: WACV, pp. 1257–1265 (2020)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhou, X., Luo, Z., Li, S. (2023). A Improved Prior Box Generation Method for Small Object Detection. In: Sun, Y., et al. Computer Supported Cooperative Work and Social Computing. ChineseCSCW 2022. Communications in Computer and Information Science, vol 1682. Springer, Singapore. https://doi.org/10.1007/978-981-99-2385-4_35
Download citation
DOI: https://doi.org/10.1007/978-981-99-2385-4_35
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-2384-7
Online ISBN: 978-981-99-2385-4
eBook Packages: Computer ScienceComputer Science (R0)