Abstract
Usually, different convolutional neural networks (CNN) are designed for different tasks and trained separately. However, the same convolutional encoder can be shared between classification, segmentation, and object detection network. Based on this observation, we wonder if it is possible to combine those different parts and solve multiple tasks in a single network to save the cost of training multiple networks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Chen, L.-C., Papandreou, G., Schroff, F., & Adam, H. (2017). Rethinking atrous convolution for semantic image segmentation. arXiv:1706.05587.
Chu, Q., Ouyang, W., Li, H., Wang, X., Liu, B., & Yu, N. (2017). Online multi-object tracking using CNN-based single object tracker with spatial-temporal attention mechanism. In Proceedings of the IEEE International Conference on Computer Vision (pp. 4836–4845).
Huang, G., Liu, Z., Van Der Maaten, L., & Weinberger, K. Q. (2017). Densely connected convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4700–4708).
Li, K., Wu, Z., Peng, K.-C., Ernst, J., & Fu, Y. (2018). Tell me where to look: Guided attention inference network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 9215–9223).
Liu, J., Li, W., Zhao, N., Cao, K., Yin, Y., Song, Q., et al. (2018). Integrate domain knowledge in training CNN for ultrasonography breast cancer diagnosis. In International Conference on Medical Image Computing and Computer-Assisted Intervention (pp. 868–875). Springer.
Lucas, B. D., & Kanade, T. (1981). An iterative image registration technique with an application to stereo vision. In Proceedings DARPA Image Understanding (p. 121430).
Merkow, J., Marsden, A., Kriegman, D., & Tu, Z. (2016). Dense volume-to-volume vascular boundary detection. In International Conference on Medical Image Computing and Computer-Assisted Intervention (pp. 371–379). Springer.
Ronneberger, O., Fischer, P., & Brox, T. (2015). U-Net: Convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention (pp. 234–241). Springer.
Xie, S., & Tu, Z. (2015). Holistically-nested edge detection. In Proceedings of the IEEE International Conference on Computer Vision (pp. 1395–1403).
Zhao, H., Shi, J., Qi, X., Wang, X., & Jia, J. (2017). Pyramid scene parsing network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 2881–2890).
Zheng, H., Chen, H., Huang, J., Li, X., Han, X., & Yao, J. (2019). Polyp tracking in video colonoscopy using optical flow with an on-the-fly trained CNN. In 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019) (pp. 79–82). IEEE.
Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., & Torralba, A. (2016). Learning deep features for discriminative localization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 2921–2929).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Chen, H. (2021). Multi-resolution Multi-task Network and Polyp Tracking. In: Bernal, J., Histace, A. (eds) Computer-Aided Analysis of Gastrointestinal Videos. Springer, Cham. https://doi.org/10.1007/978-3-030-64340-9_10
Download citation
DOI: https://doi.org/10.1007/978-3-030-64340-9_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-64339-3
Online ISBN: 978-3-030-64340-9
eBook Packages: Computer ScienceComputer Science (R0)