MSIN: An Efficient Multi-head Self-attention Framework for Inertial Navigation

Shi, Gaotao; Pan, Bingjia; Ni, Yuzhi

doi:10.1007/978-981-97-0834-5_26

Gaotao Shi^10,11,
Bingjia Pan^10,11 &
Yuzhi Ni¹²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14487))

Included in the following conference series:

International Conference on Algorithms and Architectures for Parallel Processing

111 Accesses

Abstract

Inertial Measurement Unit (IMU) makes an outstanding contribution to indoor inertial navigation in the era of ubiquitous computing, as it is widely integrated into portable devices. Many prominent works have been proposed by taking gyroscope and accelerometer readings as input to estimate the velocity and orientation. However, most of them focus on the local features of IMU (i.e., single sensor temporal feature or local spatial feature), eventually leading to drift on the trajectory. In this paper, we propose a robust model to mitigate the problem of jitters and drifts in trajectory prediction by exploiting the spatial dependence in accelerometer and gyroscope readings, as well as the contextual relation in motion terms through in-depth analyses of IMU readings. In particular, we design a framework (MSIN) to fuse the local spatial dependence of multiple sensors and incorporate the local spatial and global temporal features by using the multi-head self-attention mechanism. We have conducted extensive experiments on two public datasets and the results show that MSIN achieves a significant improvement in RTE (Relative Trajectory Error) performance, with improvements of up to 6.14% and 15.19% over state-of-the-art methods for RoNIN-Unseen and RIDI-Unseen, respectively.

This work was supported by the National Natural Science Foundation of China (NSFC) under Grant No. 61872266.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 79.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bai, S., Kolter, J.Z., Koltun, V.: An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv preprint arXiv:1803.01271 (2018)
Brossard, M., Barrau, A., Bonnabel, S.: Ai-imu dead-reckoning. IEEE Trans. Intell. Veh. 5(4), 585–595 (2020)
Article Google Scholar
Brown, T., et al.: Language models are few-shot learners. Adv. Neural. Inf. Process. Syst. 33, 1877–1901 (2020)
Google Scholar
Chen, C., Lu, X., Markham, A., Trigoni, N.: Ionet: learning to cure the curse of drift in inertial odometry. Proc. AAAI Conf. Artif. Intell. 32, 6468–6476 (2018)
Google Scholar
Chen, C., et al.: Motiontransformer: transferring neural inertial tracking between domains. Proc. AAAI Conf. Artif. Intell. 33, 8009–8016 (2019)
Google Scholar
Cummins, C., Orr, R., O’Connor, H., West, C.: Global positioning systems (GPS) and microtechnology sensors in team sports: a systematic review. Sports Med. 43, 1025–1042 (2013)
Google Scholar
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Dosovitskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
Einicke, G.A., White, L.B.: Robust extended kalman filtering. IEEE Trans. Signal Process. 47(9), 2596–2599 (1999)
Article Google Scholar
Gao, Z., Li, Q., Li, C., Liu, N.: Iekf-swcs method for pedestrian self-navigation and location. J. Syst. Simulat. 27(9), 1944–1950 (2015)
Google Scholar
Guo, H., Uradziński, M., Yin, H., Yu, M.: Indoor positioning based on foot-mounted imu. Bull. Polish Acad. Sci. Tech. Sci. 63(3), 629–634 (2015)
Google Scholar
Han, C., Zhang, L., Tang, Y., Huang, W., Min, F., He, J.: Human activity recognition using wearable sensors by heterogeneous convolutional neural networks. Expert Syst. Appl. 198, 116764 (2022)
Article Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Herath, S., Yan, H., Furukawa, Y.: Ronin: robust neural inertial navigation in the wild: benchmark, evaluations, and new methods. In: 2020 IEEE International Conference on Robotics and Automation (ICRA), pp. 3146–3152. IEEE (2020)
Google Scholar
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Ilyas, M., Cho, K., Baeg, S.H., Park, S.: Drift reduction in pedestrian navigation system by exploiting motion constraints and magnetic field. Sensors 16(9), 1455 (2016)
Article Google Scholar
Jiang, W., Yin, Z.: Human activity recognition using wearable sensors by deep convolutional neural networks. In: Proceedings of the 23rd ACM International Conference on Multimedia, pp. 1307–1310 (2015)
Google Scholar
Jiang, Y., Li, Z., Wang, J.: Ptrack: enhancing the applicability of pedestrian tracking with wearables. IEEE Trans. Mob. Comput. 18(2), 431–443 (2018)
Article Google Scholar
Joshi, M., Chen, D., Liu, Y., Weld, D.S., Zettlemoyer, L., Levy, O.: Spanbert: improving pre-training by representing and predicting spans. Trans. Assoc. Comput. Linguist. 8, 64–77 (2020)
Article Google Scholar
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Levi, R.W., Judd, T.: Dead reckoning navigational system using accelerometer to measure foot impacts. US Patent 5,583,776 (1996)
Google Scholar
Li, W., Liu, D., Chen, K., Li, K., Qi, H.: Hone: mitigating stragglers in distributed stream processing with tuple scheduling. IEEE Trans. Parall. Distrib. Syst. 32(8), 2021–2034 (2021)
Google Scholar
Li, W., et al.: Efficient coflow transmission for distributed stream processing. In: IEEE Conference on Computer Communications (IEEE INFOCOM 2020), pp. 1319–1328. IEEE (2020)
Google Scholar
Liu, H., Li, Q.: 12-dimensional zero velocity state updating intelligent algorithm for pedestrian dead reckoning. J. Syst. Simulat. 30(11), 4387 (2012)
Google Scholar
Liu, W., et al.: Tlio: tight learned inertial odometry. IEEE Robot. Automat. Lett. 5(4), 5653–5660 (2020)
Google Scholar
Liu, Y., Li, Z., Liu, Z., Wu, K.: Real-time arm skeleton tracking and gesture inference tolerant to missing wearable sensors. In: Proceedings of the 17th Annual International Conference on Mobile Systems, Applications, and Services, pp. 287–299 (2019)
Google Scholar
Nilsson, J.O., Skog, I., Händel, P., Hari, K.: Foot-mounted ins for everybody-an open-source embedded implementation. In: Proceedings of the 2012 IEEE/ION Position, Location and Navigation Symposium, pp. 140–145. IEEE (2012)
Google Scholar
Pathak, D., Agrawal, P., Efros, A.A., Darrell, T.: Curiosity-driven exploration by self-supervised prediction. In: International Conference on Machine Learning, pp. 2778–2787. PMLR (2017)
Google Scholar
Rao, B., Kazemi, E., Ding, Y., Shila, D.M., Tucker, F.M., Wang, L.: CTIN: robust contextual transformer network for inertial navigation. Proc. AAAI Conf. Artif. Intell. 36, 5413–5421 (2022)
Google Scholar
Saha, S.S., Sandha, S.S., Garcia, L.A., Srivastava, M.: Tinyodom: hardware-aware efficient neural inertial navigation. Proc. ACM Interact. Mobile Wearable Ubiquit. Technol. 6(2), 1–32 (2022)
Google Scholar
Shoaib, M., Bosch, S., Incel, O.D., Scholten, H., Havinga, P.J.: Fusion of smartphone motion sensors for physical activity recognition. Sensors 14(6), 10146–10176 (2014)
Article Google Scholar
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929–1958 (2014)
Google Scholar
Vaswani, A., et al.: Attention is all you need. Adv. Neural. Inf. Process. Syst. 30, 6000–6010 (2017)
Google Scholar
Woodman, O.J.: An Introduction to Inertial Navigation. University of Cambridge, Computer Laboratory, Tech. Rep. (2007)
Google Scholar
Xu, H., Zhou, P., Tan, R., Li, M., Shen, G.: Limu-bert: unleashing the potential of unlabeled data for IMU sensing applications. In: Proceedings of the 19th ACM Conference on Embedded Networked Sensor Systems, pp. 220–233 (2021)
Google Scholar
Yan, H., Shan, Q., Furukawa, Y.: RIDI: robust IMU double integration. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11217, pp. 641–656. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01261-8_38
Yang, C., Shao, H.R.: Wifi-based indoor positioning. IEEE Commun. Mag. 53(3), 150–157 (2015)
Article Google Scholar
Yang, S., Quan, Z., Nie, M., Yang, W.: Transpose: keypoint localization via transformer. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 11802–11812 (2021)
Google Scholar
Yao, S., Hu, S., Zhao, Y., Zhang, A., Abdelzaher, T.: Deepsense: a unified deep learning framework for time-series mobile sensing data processing. In: Proceedings of the 26th International Conference on World Wide Web, pp. 351–360 (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

College of Intelligence and Computing, Tianjin University, Tianjin, China
Gaotao Shi & Bingjia Pan
Tianjin Key Laboratory of Advanced Networking (TANK), Tianjin, China
Gaotao Shi & Bingjia Pan
Tianjin International Engineering Institute, Tianjin University, Tianjin, China
Yuzhi Ni

Authors

Gaotao Shi
View author publications
You can also search for this author in PubMed Google Scholar
Bingjia Pan
View author publications
You can also search for this author in PubMed Google Scholar
Yuzhi Ni
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gaotao Shi .

Editor information

Editors and Affiliations

Royal Melbourne Institute of Technology, Melbourne, VIC, Australia
Zahir Tari
Tianjin University, Tianjin, China
Keqiu Li
University of Arizona, Tucson, AZ, USA
Hongyi Wu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shi, G., Pan, B., Ni, Y. (2024). MSIN: An Efficient Multi-head Self-attention Framework for Inertial Navigation. In: Tari, Z., Li, K., Wu, H. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2023. Lecture Notes in Computer Science, vol 14487. Springer, Singapore. https://doi.org/10.1007/978-981-97-0834-5_26

Download citation

DOI: https://doi.org/10.1007/978-981-97-0834-5_26
Published: 12 March 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0833-8
Online ISBN: 978-981-97-0834-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

MSIN: An Efficient Multi-head Self-attention Framework for Inertial Navigation