research-article

Multi-Objective Deep CNN for Outdoor Auto-Navigation

Authors:

DongLiang Wang,

Yao YeboahAuthors Info & Claims

ICDLT '18: Proceedings of the 2018 2nd International Conference on Deep Learning Technologies

Pages 81 - 85

https://doi.org/10.1145/3234804.3234823

Published: 27 June 2018 Publication History

Abstract

Target-guided navigation establishes the foundation for efficiently addressing vision-based multi-agent coordination for robotics. This work proposes a multi-objective deep convolution network which consists of two parallel branches built atop a shared feature extractor. The proposed network is capable of concurrently constructing semantic maps while achieving efficient visual detection of a designated guider robot or landmark towards outdoor navigation. In order to achieve the low latency requirements of the navigation controller, the structure and parameters of the network have been meticulously designed to boost run-time performance. The model is trained and tested on an altered version of the Cityscape outdoor dataset. We further finetune using a collected dataset in order to improve generalization performance on unseen outdoor scenes. Experimental results on an outdoor navigation robot equipped with an RGBD camera and GPU mini PC verifies the feasibility of the model.

References

[1]

Kragic, D., & Christensen, H. I. (2002). Survey on visual servoing for manipulation. Computational Vision and Active Perception Laboratory, Fiskartorpsv, 15, 2002.

[2]

Dickmanns, E. D., & Graefe, V. (1988). Dynamic monocular machine vision. Machine vision and applications, 1(4), 223--240.

Digital Library

[3]

Dudek, G., Jenkin, M., Milios, E., & Wilkes, D. (1991). Robotic exploration as graph construction. IEEE transactions on robotics and automation, 7(6), 859--865.

[4]

Briggs, A. J., Scharstein, D., Braziunas, D., Dima, C., & Wall, P. (2000). Mobile robot navigation using self-similar landmarks. In Robotics and Automation, 2000. Proceedings. ICRA'00. IEEE International Conference on (Vol. 2, pp. 1428--1434). IEEE.

[5]

Zhu, D. Q., & Yan, M. Z. (2010). Survey on technology of mobile robot path planning. Control and Decision, 25(7), 961--967.

[6]

Leonard, J., Durrant-Whyte, H., & Cox, I. J. (1990, July). Dynamic map building for autonomous mobile robot. In Intelligent Robots and Systems' 90.'Towards a New Frontier of Applications', Proceedings. IROS'90. IEEE International Workshop on (pp. 89--96). IEEE.

[7]

Guivant, J., Nebot, E., & Baiker, S. (2000). Autonomous navigation and map building using laser range sensors in outdoor applications. Journal of robotic systems, 17(10), 565--583.

[8]

Royer, E., Lhuillier, M., Dhome, M., & Lavest, J. M. (2007). Monocular vision for mobile robot localization and autonomous navigation. International Journal of Computer Vision, 74(3), 237--260.

Digital Library

[9]

Konolige, K., Agrawal, M., Bolles, R. C., Cowan, C., Fischler, M., & Gerkey, B. (2008). Outdoor mapping and navigation using stereo vision. In Experimental Robotics (pp. 179--190). Springer, Berlin, Heidelberg.

[10]

Bailey, T., & Durrant-Whyte, H. (2006). Simultaneous localization and mapping (SLAM): Part II. IEEE Robotics & Automation Magazine, 13(3), 108--117.

[11]

Becker, C., Salas, J., Tokusei, K., & Latombe, J. C. (1995, May). Reliable navigation using landmarks. In Robotics and Automation, 1995. Proceedings., 1995 IEEE International Conference on (Vol. 1, pp. 401--406). IEEE.

[12]

Basiri, A., Amirian, P., & Winstanley, A. (2014). The use of quick response (QR) codes in landmark-based pedestrian navigation. International Journal of Navigation and Observation, 2014.

[13]

Yang, R. (2011, July). The study and improvement of Augmented reality based on feature matching. In Software Engineering and Service Science (ICSESS), 2011 IEEE 2nd International Conference on (pp. 586--589). IEEE.

[14]

LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. nature, 521(7553), 436.

[15]

Valipour, S. (2017). Deep Learning in Robotics (Doctoral dissertation, University of Alberta).

[16]

Howard, A. G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., … & Adam, H. (2017). Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861.

[17]

Simonyan, K., & Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556.

[18]

He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770--778).

[19]

Long, J., Shelhamer, E., & Darrell, T. (2015). Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3431--3440).

[20]

Chen, L. C., Papandreou, G., Kokkinos, I., Murphy, K., & Yuille, A. L. (2018). Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE transactions on pattern analysis and machine intelligence, 40(4), 834--848.

[21]

Chen, L. C., Papandreou, G., Schroff, F., & Adam, H. (2017). Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587.

[22]

Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C. Y., & Berg, A. C. (2016, October). Ssd: Single shot multibox detector. In European conference on computer vision (pp. 21--37). Springer, Cham.

[23]

Lin, T. Y., Goyal, P., Girshick, R., He, K., & Dollár, P. (2017). Focal loss for dense object detection. arXiv preprint arXiv:1708.02002.

[24]

M. Cordts, M. Omran, S. Ramos, T. Rehfeld, M. Enzweiler, R. Benenson, U. Franke, S. Roth, and B. Schiele, "The Cityscapes Dataset for Semantic Urban Scene Understanding," in Proc. of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.

Cited By

Hirotsu THirota MAraki TEndo MIshikawa H(2019)Tourism application with CNN-Based Classification specialized for cultural informationProceedings of the 21st International Conference on Information Integration and Web-based Applications & Services10.1145/3366030.3366073(8-14)Online publication date: 2-Dec-2019
https://dl.acm.org/doi/10.1145/3366030.3366073

Index Terms

Multi-Objective Deep CNN for Outdoor Auto-Navigation
1. Information systems
  1. Information systems applications
    1. Mobile information processing systems

Recommendations

Autonomous Indoor Robot Navigation via Siamese Deep Convolutional Neural Network
AIPR '18: Proceedings of the 2018 International Conference on Artificial Intelligence and Pattern Recognition

The vast majority of indoor navigation algorithms either rely on manual scene augmentation and labelling or exploit multi-sensor fusion techniques in achieving simultaneous localization and mapping (SLAM), leading to high computational costs, hardware ...
Task-oriented navigation algorithms for an outdoor environment with colored borders and obstacles

This paper presents task-oriented navigation algorithms used for an outdoor environment. The goals of the navigation are recognizing colored border lines on both sides of a path, avoiding obstacles on the path, and navigating the given path. To ...
Robot Navigation in Multi-terrain Outdoor Environments

This paper presents a methodology for motion planning in outdoor environments that takes into account specific characteristics of the terrain. Instead of decomposing the robot configuration space into "free" and "occupied", we consider the existence of ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

ICDLT '18: Proceedings of the 2018 2nd International Conference on Deep Learning Technologies

June 2018

112 pages

ISBN:9781450364737

DOI:10.1145/3234804

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

Chongqing University of Posts and Telecommunications
University of Electronic Science and Technology of China: University of Electronic Science and Technology of China

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 June 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

NSFC
Science and Technology Planning Project of Guangzhou
Guangzhou University Innovation and Entrepreneurship Education Project
Science and Technology Planning Project of Guangdong

Conference

ICDLT '18

ICDLT '18: 2018 2nd International Conference on Deep Learning Technologies

June 27 - 29, 2018

Chongqing, China

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
136
Total Downloads

Downloads (Last 12 months)2
Downloads (Last 6 weeks)1

Reflects downloads up to 19 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Hirotsu THirota MAraki TEndo MIshikawa H(2019)Tourism application with CNN-Based Classification specialized for cultural informationProceedings of the 21st International Conference on Information Integration and Web-based Applications & Services10.1145/3366030.3366073(8-14)Online publication date: 2-Dec-2019
https://dl.acm.org/doi/10.1145/3366030.3366073

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten