Semantic Segmentation of Railway Images Considering Temporal Continuity

Furitsu, Yuki; Deguchi, Daisuke; Kawanishi, Yasutomo; Ide, Ichiro; Murase, Hiroshi; Mukojima, Hiroki; Nagamine, Nozomi

doi:10.1007/978-3-030-41404-7_45

Semantic Segmentation of Railway Images Considering Temporal Continuity

Yuki Furitsu¹²,
Daisuke Deguchi¹²,
Yasutomo Kawanishi¹²,
Ichiro Ide¹²,
Hiroshi Murase¹²,
Hiroki Mukojima¹³ &
…
Nozomi Nagamine¹³

Conference paper
First Online: 23 February 2020

1440 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12046))

Abstract

In this paper, we focus on the semantic segmentation of images taken from a camera mounted on the front end of trains for measuring and managing rail-side facilities. Improving the efficiency and perhaps automating such tasks are crucial as they are currently done manually. We aim to realize this by capturing information about the railway environment through the semantic segmentation of train front-view camera images. Specifically, assuming that the lateral movement of trains are smooth, we propose a method to use information from multiple frames to consider temporal continuity during semantic segmentation. Based on the densely estimated optical flow between sequential frames, the weighted mean of class likelihoods of corresponding pixels of the focused frame are calculated. We also construct a new dataset consisting of train front-view camera images and its annotations for semantic segmentation. The proposed method outperforms a conventional single-frame semantic segmentation model, and the use of class likelihoods for the frame combination also proved effective.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2481–2495 (2017)
Article Google Scholar
Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 833–851. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_49
Chapter Google Scholar
Cordts, M., et al.: The cityscapes dataset for semantic urban scene understanding. In: Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 3213–3223, June 2016
Google Scholar
Gibert, X., Patel, V.M., Chellappa, R.: Material classification and semantic segmentation of railway track images with deep convolutional neural networks. In: Proceedings of the 2015 IEEE International Conference of Image Processing, pp. 621–625, September 2015
Google Scholar
Kundu, A., Li, Y., Dellaert, F., Li, F., Rehg, J.M.: Joint semantic segmentation and 3D reconstruction from monocular video. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8694, pp. 703–718. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10599-4_45
Chapter Google Scholar
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440, June 2015
Google Scholar
Niina, Y., Oketani, E., Yokouchi, H., Honma, R., Tsuji, K., Kondo, K.: Monitoring of railway structures by MMS. J. Jpn. Soc. Photogramm. Remote Sens. 55(2), 95–99 (2016). https://doi.org/10.4287/jsprs.55.95
Article Google Scholar
Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Proceedings of the 2015 International Conference on Medical Image Computing and Computer-Assisted Intervention. pp. 234–241, November 2015
Google Scholar
Schonberger, J.L., Frahm, J.M.: Structure-from-motion revisited. In: Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 4104–4113, June 2016
Google Scholar
Sun, D., Yang, X., Liu, M.Y., Kautz, J.: PWC-Net: CNNs for optical flow using pyramid, warping, and cost volume. In: Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, pp. 8934–8943, June 2018
Google Scholar

Download references

Acknowledgement

Parts of this research were supported by MEXT, Grant-in-Aid for Scientific Research.

Author information

Authors and Affiliations

Nagoya University, Nagoya, Japan
Yuki Furitsu, Daisuke Deguchi, Yasutomo Kawanishi, Ichiro Ide & Hiroshi Murase
Railway Technical Research Institute, Tokyo, Japan
Hiroki Mukojima & Nozomi Nagamine

Authors

Yuki Furitsu
View author publications
You can also search for this author in PubMed Google Scholar
Daisuke Deguchi
View author publications
You can also search for this author in PubMed Google Scholar
Yasutomo Kawanishi
View author publications
You can also search for this author in PubMed Google Scholar
Ichiro Ide
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Murase
View author publications
You can also search for this author in PubMed Google Scholar
Hiroki Mukojima
View author publications
You can also search for this author in PubMed Google Scholar
Nozomi Nagamine
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuki Furitsu .

Editor information

Editors and Affiliations

University of Malaya, Kuala Lumpur, Malaysia
Shivakumara Palaiahnakote
Consiglio Nazionale delle Ricerche, ICAR, Naples, Italy
Gabriella Sanniti di Baja
Chinese Academy of Sciences, Beijing, China
Liang Wang
Auckland University of Technology, Auckland, New Zealand
Wei Qi Yan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Furitsu, Y. et al. (2020). Semantic Segmentation of Railway Images Considering Temporal Continuity. In: Palaiahnakote, S., Sanniti di Baja, G., Wang, L., Yan, W. (eds) Pattern Recognition. ACPR 2019. Lecture Notes in Computer Science(), vol 12046. Springer, Cham. https://doi.org/10.1007/978-3-030-41404-7_45

Download citation

DOI: https://doi.org/10.1007/978-3-030-41404-7_45
Published: 23 February 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-41403-0
Online ISBN: 978-3-030-41404-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics