Optimal Truncated MobileNet-Based Image Binarization for Pose-Based Visual Servoing of Autonomous Mobile Robot

Ho, Chian C.; Lin, Cian-Duo

doi:10.1007/978-981-97-1714-9_17

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2075))

Included in the following conference series:

International Conference on Technologies and Applications of Artificial Intelligence

294 Accesses

Abstract

Pose-based visual servoing (PBVS) can complement the frequent drift issue of light detection and ranging (LiDAR) coordinate of LiDAR-based simultaneous localization and mapping (SLAM), navigation, and servoing technology, especially when autonomous mobile robot (AMR) works on the automatic docking alignment missions for automatic pallet engaging or automatic battery charging whose alignment precision requirement is extremely stricter. But PBVS occasionally suffers from the poor detected image quality of ARTag landmark to cause the reading drift or error. This paper proposes a light-weight deep-learning image binarization method based on optimal truncated MobileNet model to preprocess the image quality of ARTag landmarks so that PBVS can evaluate the distance and pose between the ARTag landmark and the camera sensor more accurately, promptly, and steadily, for better feasibility of PBVS on the automatic docking alignment missions. Experimental results show, against conventional image-processing-based image binarization, conventional computer-vision-based image binarization, and conventional deep-learning-based image binarization, the proposed optimal truncated MobileNet-based image binarization not only raises the accuracy and reliability of ARTag’s reading, but also apparently improves the effectiveness and efficiency of PBVS’s operation, especially under environmental conditions of shadow occlusion, image blurring, low contrast, uneven illumination, or complex background.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 79.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Pose Estimation of Mobile Robot Using Image and Point-Cloud Data

Article 04 September 2024

Deep 6-DoF camera relocalization in variable and dynamic scenes by multitask learning

Article 26 March 2023

Deep-trained illumination-robust precision positioning for real-time manipulation of embedded objects

Article 26 October 2020

References

Azadeh, K., Koster, R., Roy, D.: Robotized and automated warehouse systems: review and recent developments. Transp. Sci. 53(4), 917–945 (2019)
Article Google Scholar
Hossain, S.G.M., Jamil, H., Ali, M.Y., Haq, M.Z.: Automated guided vehicles for industrial logistics - development of intelligent prototypes using appropriate technology. In: Proceedings of International Conference on Computer and Automation Engineering (ICCAE), pp. 237–241 (2010)
Google Scholar
Zhao, L., Ding, Y.: Design of AGV low-delay visual servo system. In: Proceedings of International Conference on Audio, Language and Image Processing (ICALIP), pp. 6–10 (2018)
Google Scholar
Kelly, A., Nagy, B., Stager, D., Unnikrishnan, R.: Field and service applications - an infrastructure-free automated guided vehicle based on computer vision - an effort to make an industrial robot vehicle that can operate without supporting infrastructure. IEEE Robot. Autom. Mag. 14(3), 24–34 (2007)
Article Google Scholar
Fiala, M.: ARTag, a fiducial marker system using digital techniques. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), vol. 2, pp. 590–596 (2005)
Google Scholar
Niekum, S.: ar_track_alvar. ROS Wiki [Online]. http://www.ros.org/wiki/ar_track_alvar
Wang, H., Wei, S., Chen, Y.: An improved Rao-Blackwellized particle filter for SLAM. In: Proceedings of International Symposium on Intelligent Information Technology Application Workshops, pp. 515–518 (2008)
Google Scholar
Collins, T., Collins, J.J., Ryan, D.: Occupancy grid mapping: an empirical evaluation. In: Proceedings of Mediterranean Conference on Control and Automation, pp. 1–6 (2007)
Google Scholar
Hanten, R., et al.: Vector-AMCL: vector based adaptive Monte Carlo localization for indoor maps. Intell. Auton. Syst. 14, 403–416 (2017)
Google Scholar
Deng, Y., et al.: Fuzzy Dijkstra algorithm for shortest path problem under uncertain environment. Appl. Soft Comput. 12, 1231–1237 (2012)
Google Scholar
Keller, M., et al.: Planning of optimal collision avoidance trajectories with timed elastic bands. IFAC Proc. Volumes 47, 9822–9827 (2014)
Article Google Scholar
Nor, R.M., et al.: Mobile robot stable-target navigation control via encoder data feedback. In: Proceedings of International Malaysia-Ireland Joint Symposium on Engineering, Science and Business (2012)
Google Scholar
Wang, K., Liu, Y., Li, L.: Visual servoing trajectory tracking of nonholonomic mobile robots without direct position measurement. IEEE Trans. Robot. 30, 1026–1035 (2014)
Google Scholar
Al-amri, S.S., Kalyankar, N.V., Khamitkar, S.D.: Image segmentation by using threshold techniques. J. Comput. 2, 83–86 (2010)
Google Scholar
Eyupoglu, C.: Implementation of Bernsen’s locally adaptive binarization method for gray scale images. Online J. Sci. Technol. 7, 68–72 (2017)
Google Scholar
Huang, D.-Y., Wang, C.-H.: Optimal multi-level thresholding using a two-stage Otsu optimization approach. Pattern Recogn. Lett. 30, 275–284 (2009)
Article Google Scholar
Zhou, S.-F., et al.: An improved adaptive document image binarization method. In: Proceedings of International Congress on Image and Signal Processing, pp. 1–5 (2009)
Google Scholar
Calvo-Zaragoza, J., Gallego, A.-J.: A selectional auto-encoder approach for document image binarization. Pattern Recogn. 86, 37–47 (2019)
Article Google Scholar
Tensmeyer, C., Martinez, T.: Document image binarization with fully convolutional neural networks. In: Proceedings of IAPR International Conference on Document Analysis and Recognition (ICDAR), pp. 99–104 (2017)
Google Scholar
Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., Chen, L.-C.: MobileNetV2: inverted residuals and linear bottlenecks. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4510–4520 (2018)
Google Scholar
Gavish, M., Donoho, D.L.: The optimal hard threshold for singular values is 4/√3. IEEE Trans. Inf. Theory 60(8), 5040–5053 (2014)
Article MathSciNet Google Scholar

Download references

Acknowledgement

This work was financially supported by the “Intelligent Recognition Industry Service Center” from The Featured Areas Research Center Program within the framework of Higher Education Sprout Project, Ministry of Education (MOE), Taiwan.

Author information

Authors and Affiliations

Department of Electrical Engineering, National Yunlin University of Science and Technology, Yunlin, Taiwan
Chian C. Ho & Cian-Duo Lin

Authors

Chian C. Ho
View author publications
You can also search for this author in PubMed Google Scholar
Cian-Duo Lin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chian C. Ho .

Editor information

Editors and Affiliations

National Yunlin University of Science and Technology, Douliu, Taiwan
Chao-Yang Lee
National Yunlin University of Science and Technology, Douliou, Taiwan
Chun-Li Lin
National Yunlin University of Science and Technology, Douliou, Taiwan
Hsuan-Ting Chang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ho, C.C., Lin, CD. (2024). Optimal Truncated MobileNet-Based Image Binarization for Pose-Based Visual Servoing of Autonomous Mobile Robot. In: Lee, CY., Lin, CL., Chang, HT. (eds) Technologies and Applications of Artificial Intelligence. TAAI 2023. Communications in Computer and Information Science, vol 2075. Springer, Singapore. https://doi.org/10.1007/978-981-97-1714-9_17

Download citation

DOI: https://doi.org/10.1007/978-981-97-1714-9_17
Published: 28 March 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-1713-2
Online ISBN: 978-981-97-1714-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics