Skip to main content

MSIN: An Efficient Multi-head Self-attention Framework forĀ Inertial Navigation

  • Conference paper
  • First Online:
Algorithms and Architectures for Parallel Processing (ICA3PP 2023)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14487))

  • 111 Accesses

Abstract

Inertial Measurement Unit (IMU) makes an outstanding contribution to indoor inertial navigation in the era of ubiquitous computing, as it is widely integrated into portable devices. Many prominent works have been proposed by taking gyroscope and accelerometer readings as input to estimate the velocity and orientation. However, most of them focus on the local features of IMU (i.e., single sensor temporal feature or local spatial feature), eventually leading to drift on the trajectory. In this paper, we propose a robust model to mitigate the problem of jitters and drifts in trajectory prediction by exploiting the spatial dependence in accelerometer and gyroscope readings, as well as the contextual relation in motion terms through in-depth analyses of IMU readings. In particular, we design a framework (MSIN) to fuse the local spatial dependence of multiple sensors and incorporate the local spatial and global temporal features by using the multi-head self-attention mechanism. We have conducted extensive experiments on two public datasets and the results show that MSIN achieves a significant improvement in RTE (Relative Trajectory Error) performance, with improvements of up to 6.14% and 15.19% over state-of-the-art methods for RoNIN-Unseen and RIDI-Unseen, respectively.

This work was supported by the National Natural Science Foundation of China (NSFC) under Grant No. 61872266.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bai, S., Kolter, J.Z., Koltun, V.: An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv preprint arXiv:1803.01271 (2018)

  2. Brossard, M., Barrau, A., Bonnabel, S.: Ai-imu dead-reckoning. IEEE Trans. Intell. Veh. 5(4), 585ā€“595 (2020)

    ArticleĀ  Google ScholarĀ 

  3. Brown, T., et al.: Language models are few-shot learners. Adv. Neural. Inf. Process. Syst. 33, 1877ā€“1901 (2020)

    Google ScholarĀ 

  4. Chen, C., Lu, X., Markham, A., Trigoni, N.: Ionet: learning to cure the curse of drift in inertial odometry. Proc. AAAI Conf. Artif. Intell. 32, 6468ā€“6476 (2018)

    Google ScholarĀ 

  5. Chen, C., et al.: Motiontransformer: transferring neural inertial tracking between domains. Proc. AAAI Conf. Artif. Intell. 33, 8009ā€“8016 (2019)

    Google ScholarĀ 

  6. Cummins, C., Orr, R., Oā€™Connor, H., West, C.: Global positioning systems (GPS) and microtechnology sensors in team sports: a systematic review. Sports Med. 43, 1025ā€“1042 (2013)

    Google ScholarĀ 

  7. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)

  8. Dosovitskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)

  9. Einicke, G.A., White, L.B.: Robust extended kalman filtering. IEEE Trans. Signal Process. 47(9), 2596ā€“2599 (1999)

    ArticleĀ  Google ScholarĀ 

  10. Gao, Z., Li, Q., Li, C., Liu, N.: Iekf-swcs method for pedestrian self-navigation and location. J. Syst. Simulat. 27(9), 1944ā€“1950 (2015)

    Google ScholarĀ 

  11. Guo, H., Uradziński, M., Yin, H., Yu, M.: Indoor positioning based on foot-mounted imu. Bull. Polish Acad. Sci. Tech. Sci. 63(3), 629ā€“634 (2015)

    Google ScholarĀ 

  12. Han, C., Zhang, L., Tang, Y., Huang, W., Min, F., He, J.: Human activity recognition using wearable sensors by heterogeneous convolutional neural networks. Expert Syst. Appl. 198, 116764 (2022)

    ArticleĀ  Google ScholarĀ 

  13. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770ā€“778 (2016)

    Google ScholarĀ 

  14. Herath, S., Yan, H., Furukawa, Y.: Ronin: robust neural inertial navigation in the wild: benchmark, evaluations, and new methods. In: 2020 IEEE International Conference on Robotics and Automation (ICRA), pp. 3146ā€“3152. IEEE (2020)

    Google ScholarĀ 

  15. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735ā€“1780 (1997)

    ArticleĀ  Google ScholarĀ 

  16. Ilyas, M., Cho, K., Baeg, S.H., Park, S.: Drift reduction in pedestrian navigation system by exploiting motion constraints and magnetic field. Sensors 16(9), 1455 (2016)

    ArticleĀ  Google ScholarĀ 

  17. Jiang, W., Yin, Z.: Human activity recognition using wearable sensors by deep convolutional neural networks. In: Proceedings of the 23rd ACM International Conference on Multimedia, pp. 1307ā€“1310 (2015)

    Google ScholarĀ 

  18. Jiang, Y., Li, Z., Wang, J.: Ptrack: enhancing the applicability of pedestrian tracking with wearables. IEEE Trans. Mob. Comput. 18(2), 431ā€“443 (2018)

    ArticleĀ  Google ScholarĀ 

  19. Joshi, M., Chen, D., Liu, Y., Weld, D.S., Zettlemoyer, L., Levy, O.: Spanbert: improving pre-training by representing and predicting spans. Trans. Assoc. Comput. Linguist. 8, 64ā€“77 (2020)

    ArticleĀ  Google ScholarĀ 

  20. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)

  21. Levi, R.W., Judd, T.: Dead reckoning navigational system using accelerometer to measure foot impacts. US Patent 5,583,776 (1996)

    Google ScholarĀ 

  22. Li, W., Liu, D., Chen, K., Li, K., Qi, H.: Hone: mitigating stragglers in distributed stream processing with tuple scheduling. IEEE Trans. Parall. Distrib. Syst. 32(8), 2021ā€“2034 (2021)

    Google ScholarĀ 

  23. Li, W., et al.: Efficient coflow transmission for distributed stream processing. In: IEEE Conference on Computer Communications (IEEE INFOCOM 2020), pp. 1319ā€“1328. IEEE (2020)

    Google ScholarĀ 

  24. Liu, H., Li, Q.: 12-dimensional zero velocity state updating intelligent algorithm for pedestrian dead reckoning. J. Syst. Simulat. 30(11), 4387 (2012)

    Google ScholarĀ 

  25. Liu, W., et al.: Tlio: tight learned inertial odometry. IEEE Robot. Automat. Lett. 5(4), 5653ā€“5660 (2020)

    Google ScholarĀ 

  26. Liu, Y., Li, Z., Liu, Z., Wu, K.: Real-time arm skeleton tracking and gesture inference tolerant to missing wearable sensors. In: Proceedings of the 17th Annual International Conference on Mobile Systems, Applications, and Services, pp. 287ā€“299 (2019)

    Google ScholarĀ 

  27. Nilsson, J.O., Skog, I., HƤndel, P., Hari, K.: Foot-mounted ins for everybody-an open-source embedded implementation. In: Proceedings of the 2012 IEEE/ION Position, Location and Navigation Symposium, pp. 140ā€“145. IEEE (2012)

    Google ScholarĀ 

  28. Pathak, D., Agrawal, P., Efros, A.A., Darrell, T.: Curiosity-driven exploration by self-supervised prediction. In: International Conference on Machine Learning, pp. 2778ā€“2787. PMLR (2017)

    Google ScholarĀ 

  29. Rao, B., Kazemi, E., Ding, Y., Shila, D.M., Tucker, F.M., Wang, L.: CTIN: robust contextual transformer network for inertial navigation. Proc. AAAI Conf. Artif. Intell. 36, 5413ā€“5421 (2022)

    Google ScholarĀ 

  30. Saha, S.S., Sandha, S.S., Garcia, L.A., Srivastava, M.: Tinyodom: hardware-aware efficient neural inertial navigation. Proc. ACM Interact. Mobile Wearable Ubiquit. Technol. 6(2), 1ā€“32 (2022)

    Google ScholarĀ 

  31. Shoaib, M., Bosch, S., Incel, O.D., Scholten, H., Havinga, P.J.: Fusion of smartphone motion sensors for physical activity recognition. Sensors 14(6), 10146ā€“10176 (2014)

    ArticleĀ  Google ScholarĀ 

  32. Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929ā€“1958 (2014)

    Google ScholarĀ 

  33. Vaswani, A., et al.: Attention is all you need. Adv. Neural. Inf. Process. Syst. 30, 6000ā€“6010 (2017)

    Google ScholarĀ 

  34. Woodman, O.J.: An Introduction to Inertial Navigation. University of Cambridge, Computer Laboratory, Tech. Rep. (2007)

    Google ScholarĀ 

  35. Xu, H., Zhou, P., Tan, R., Li, M., Shen, G.: Limu-bert: unleashing the potential of unlabeled data for IMU sensing applications. In: Proceedings of the 19th ACM Conference on Embedded Networked Sensor Systems, pp. 220ā€“233 (2021)

    Google ScholarĀ 

  36. Yan, H., Shan, Q., Furukawa, Y.: RIDI: robust IMU double integration. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11217, pp. 641ā€“656. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01261-8_38

  37. Yang, C., Shao, H.R.: Wifi-based indoor positioning. IEEE Commun. Mag. 53(3), 150ā€“157 (2015)

    ArticleĀ  Google ScholarĀ 

  38. Yang, S., Quan, Z., Nie, M., Yang, W.: Transpose: keypoint localization via transformer. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 11802ā€“11812 (2021)

    Google ScholarĀ 

  39. Yao, S., Hu, S., Zhao, Y., Zhang, A., Abdelzaher, T.: Deepsense: a unified deep learning framework for time-series mobile sensing data processing. In: Proceedings of the 26th International Conference on World Wide Web, pp. 351ā€“360 (2017)

    Google ScholarĀ 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gaotao Shi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

Ā© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Shi, G., Pan, B., Ni, Y. (2024). MSIN: An Efficient Multi-head Self-attention Framework forĀ Inertial Navigation. In: Tari, Z., Li, K., Wu, H. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2023. Lecture Notes in Computer Science, vol 14487. Springer, Singapore. https://doi.org/10.1007/978-981-97-0834-5_26

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-0834-5_26

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-0833-8

  • Online ISBN: 978-981-97-0834-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics