Abstract
Cone-beam computed tomography (CBCT) is widely used in clinical diagnosis of vertical root fractures (VRFs) which presents as crack on the teeth. However, manually checking the VRFs from a larger number of CBCT images is time-consuming and error-prone. Although the Convolutional Neural Networks (CNN) have achieved unprecedented progress in natural image recognition, end-to-end CNN is unsuitable to identify VRFs due to crack appears to be multi-scales and their complex relationships with surroundings tissues. We proposed a novel Feature Pyramids Attention Convolutional Neural Network (FPA-CNN), which incorporates saliency mask and multi-scale feature to boost the classification performance. Saliency map is viewed as spatial probability map where a person might look first to make a discriminative conclusion. Therefore it plays a role of high-level hint to guide the network focusing on the discriminative region. Experimental results demonstrate that our proposed FPA-CNN overcomes the challenge arised from multi-scale crack and complex contextual relationships.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ahn, J., Cho, S., Kwak, S.: Weakly supervised learning of instance segmentation with inter-pixel relations. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2209–2218 (2019)
Boykov, Y.Y., Jolly, M.P.: Interactive graph cuts for optimal boundary & region segmentation of objects in nd images. In: Proceedings eighth IEEE International Conference on Computer Vision (ICCV), vol. 1, pp. 105–112. IEEE (2001)
Cohen, S., Berman, L.H., Blanco, L., Bakland, L., Kim, J.S.: A demographic analysis of vertical root fractures. J. Endodontics 32(12), 1160–1163 (2006)
Corbella, S., Del Fabbro, M., Tamse, A., Rosen, E., Tsesis, I., Taschieri, S.: Cone beam computed tomography for the diagnosis of vertical root fractures: a systematic review of the literature and meta-analysis. Oral Surg. Oral Med. Oral Pathol. Oral Radiol. 118(5), 593–602 (2014)
Felzenszwalb, P.F., Huttenlocher, D.P.: Efficient graph-based image segmentation. Int. J. Comput. Vision 59(2), 167–181 (2004)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2016)
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on Computer Vision and Pattern Recognition (CVPR), pp. 7132–7141 (2018)
Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4700–4708 (2017)
Ioffe, S., Szegedy, C.: Batch normalization: ccelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167 (2015)
Johari, M., Esmaeili, F., Andalib, A., Garjani, S., Saberkari, H.: Detection of vertical root fractures in intact and endodontically treated premolar teeth by designing a probabilistic neural network: an ex vivo study. Dentomaxillofacial Radiol. 46(2), 20160107 (2017)
Kass, M., Witkin, A., Terzopoulos, D.: Snakes: active contour models. Int. J. Comput. Vision 1(4), 321–331 (1988)
Kositbowornchai, S., Plermkamon, S., Tangkosol, T.: Performance of an artificial neural network for vertical root fracture detection: an ex vivo study. Dent. Traumatol. 29(2), 151–155 (2013)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classfication with deep convolutional neural networks. In: Advances in Neural Information Processing Systems (NIPS), pp. 1097–1105 (2012)
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Li, C., Xu, C., Gui, C., Fox, M.D.: Distance regularized level set evolution and its application to image segmentation. IEEE Trans. Image Process. 19(12), 3243–3254 (2010)
Lin, D., Dai, J., Jia, J., He, K., Sun, J.: Scribblesup: scribble-supervised convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3159–3167 (2016)
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2117–2125 (2017)
Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp. 2980–2988 (2017)
Long, H., et al.: Diagnostic accuracy of CBCT for tooth fractures: a meta-analysis. J. Dent. 42(3), 240–248 (2014)
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2015
Ma, R., Ge, Z., Li, G.: Detection accuracy of root fractures in cone-beam computed tomography images: a systematic review and meta-analysis. Int. Endod. J. 49(7), 646–654 (2016)
Oh, S.J., Benenson, R., Khoreva, A., Akata, Z., Fritz, M., Schiele, B.: Exploiting saliency for object segmentation from image level labels. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5038–5047. IEEE (2017)
Pinheiro, P.O., Collobert, R.: From image-level to pixel-level labeling with convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1713–1721 (2015)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems (NIPS), pp. 91–99 (2015)
Rivera, E., Walton, R.: Cracking the cracked tooth code: detection and treatment of various longitudinal tooth fractures. Am. Assoc. Endodontists Colleagues Excellence News Lett. 2, 1–19 (2008)
Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Rother, C., Kolmogorov, V., Blake, A.: Grabcut interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. (TOG) 23(3), 309–314 (2004)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of the IEEE conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–9 (2015)
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2818–2826 (2016)
Yamada, K., Sugano, Y., Okabe, T., Sato, Y., Sugimoto, A., Hiraki, K.: Can saliency map models predict human egocentric visual attention? In: Koch, R., Huang, F. (eds.) ACCV 2010. LNCS, vol. 6468, pp. 420–429. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-22822-3_42
Zhou, B., Khosla, A., Lapedriza: Learning deep features for discriminative localization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2921–2929 (2016)
Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., Torralba, A.: Object detectors emerge in deep scene CNNs. arXiv preprint arXiv:1412.6856 (2014)
Zhou, Z.H.: A brief introduction to weakly supervised learning. Nat. Sci. Rev. 5(1), 44–53 (2018)
Zhu, Y., Zhou, Y., Ye, Q., Qiu, Q., Jiao, J.: Soft proposal networks for weakly supervised object localization. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1841–1850 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Xu, Z., Wan, P., Aihemaiti, G., Zhang, D. (2021). Exploiting Saliency in Attention Based Convolutional Neural Network for Classification of Vertical Root Fractures. In: Del Bimbo, A., et al. Pattern Recognition. ICPR International Workshops and Challenges. ICPR 2021. Lecture Notes in Computer Science(), vol 12661. Springer, Cham. https://doi.org/10.1007/978-3-030-68763-2_29
Download citation
DOI: https://doi.org/10.1007/978-3-030-68763-2_29
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-68762-5
Online ISBN: 978-3-030-68763-2
eBook Packages: Computer ScienceComputer Science (R0)