skip to main content
research-article

Robust Inertial Motion Tracking through Deep Sensor Fusion across Smart Earbuds and Smartphone

Published: 24 June 2021 Publication History

Abstract

IMU based inertial tracking plays an indispensable role in many mobility centric tasks, such as robotic control, indoor navigation and virtual reality gaming. Despite its mature application in rigid machine mobility (e.g., robot and aircraft), tracking human users via mobile devices remains a fundamental challenge due to the intractable gait/posture patterns. Recent data-driven models have tackled sensor drifting, one key issue that plagues inertial tracking. However, these systems still assume the devices are held or attached to the user body with a relatively fixed posture. In practice, natural body activities may rotate/translate the device which may be mistaken as whole body movement. Such motion artifacts remain as the dominating factor that fails existing inertial tracing systems in practical uncontrolled settings.
Inspired by the observation that human heads induces far less intensive movement relative to the body during walking, compared to other parts, we propose a novel multi-stage sensor fusion pipeline called DeepIT, which realizes inertial tracking by synthesizing the IMU measurements from a smartphone and an associated earbud. DeepIT introduces a data-driven reliability aware attention model, which assesses the reliability of each IMU and opportunistically synthesizes their data to mitigate the impacts of motion noise. Furthermore, DeepIT uses a reliability aware magnetometer compensation scheme to combat the angular drifting problem caused by unrestricted motion artifacts. We validate DeepIT on the first large-scale inertial navigation dataset involving both smartphone and earbud IMUs. The evaluation results show that DeepIT achieves multiple folds of accuracy improvement on the challenging uncontrolled natural walking scenarios, compared with state-of-the-art closed-form and data-driven models.

References

[1]
Agata Brajdic and Robert Harle. 2013. Walk detection and step counting on unconstrained smartphones. In Proceedings of ACM international joint conference on Pervasive and ubiquitous computing.
[2]
MA Brodie, Alan Walmsley, and Wyatt Page. 2008. The static accuracy and calibration of inertial measurement units for 3D orientation. (2008).
[3]
Changhao Chen, Xiaoxuan Lu, Andrew Markham, and Niki Trigoni. 2018. IONet: Learning to Cure the Curse of Drift in Inertial Odometry.
[4]
Changhao Chen, Xiaoxuan Lu, Johan Wahlstrom, Andrew Markham, and Niki Trigoni. 2019. Deep neural network based inertial odometry using low-cost inertial measurement units. IEEE Transactions on Mobile Computing (2019).
[5]
Changhao Chen, Stefano Rosa, Yishu Miao, Chris Xiaoxuan Lu, Wei Wu, Andrew Markham, and Niki Trigoni. 2019. Selective sensor fusion for neural visual-inertial odometry. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[6]
Changhao Chen, Peijun Zhao, Chris Xiaoxuan Lu, Wei Wang, Andrew Markham, and Niki Trigoni. 2020. Deep-Learning-Based Pedestrian Inertial Navigation: Methods, Data Set, and On-Device Inference. IEEE Internet of Things Journal 7, 5 (2020).
[7]
Zhenghua Chen, Han Zou, Hao Jiang, Qingchang Zhu, Yeng Chai Soh, and Lihua Xie. 2015. Fusion of WiFi, smartphone sensors and landmarks using the Kalman filter for indoor localization. Sensors 15, 1 (2015), 715--732.
[8]
Seungeun Chung, Jiyoun Lim, Kyoung Ju Noh, Gague Kim, and Hyuntae Jeong. 2019. Sensor Data Acquisition and Multimodal Sensor Fusion for Human Activity Recognition Using Deep Learning. Sensors 19, 7 (2019). https://www.mdpi.com/1424-8220/19/7/1716
[9]
Santiago Cortés, Arno Solin, and Juho Kannala. 2018. Deep learning based speed estimation for constraining strapdown inertial navigation on smartphones. In 2018 IEEE 28th International Workshop on Machine Learning for Signal Processing (MLSP).
[10]
Andreas Eitel, Jost Springenberg, Luciano Spinello, Martin Riedmiller, and Wolfram Burgard. 2015. Multimodal Deep Learning for Robust RGB-D Object Recognition. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[11]
eSense. 2020. eSense website. https://www.esense.io/
[12]
Andrea Ferlini, Alessandro Montanari, Cecilia Mascolo, and Robert Harle. 2019. Head Motion Tracking Through in-Ear Wearables. Proceedings of the 1st International Workshop on Earable Computing (2019).
[13]
Google. 2020. tango phone. https://support.google.com/faqs/faq/6029402?hl=en
[14]
Guoquan Huang. 2019. Visual-inertial navigation: A concise review. In 2019 International Conference on Robotics and Automation (ICRA). IEEE, 9572--9582.
[15]
Benoit Huyghe, Jan Doutreloigne, and Jan Vanfleteren. 2009. 3D orientation tracking based on unscented Kalman filtering of accelerometer and magnetometer data. In IEEE Sensors Applications Symposium.
[16]
Antonio R Jimenez, Fernando Seco, Carlos Prieto, and Jorge Guevara. 2009. A comparison of pedestrian dead-reckoning algorithms using a low-cost MEMS IMU. In 2009 IEEE International Symposium on Intelligent Signal Processing. IEEE, 37--42.
[17]
Wonho Kang and Youngnam Han. 2014. SmartPDR: Smartphone-based pedestrian dead reckoning for indoor localization. IEEE Sensors journal 15, 5 (2014), 2906--2916.
[18]
Stefan Leutenegger, Simon Lynen, Michael Bosse, Roland Siegwart, and Paul Furgale. 2015. Keyframe-based visual-inertial odometry using nonlinear optimization. The International Journal of Robotics Research 34, 3 (2015), 314--334.
[19]
Chris Xiaoxuan Lu, Muhamad Risqi U Saputra, Peijun Zhao, Yasin Almalioglu, Pedro PB de Gusmao, Changhao Chen, Ke Sun, Niki Trigoni, and Andrew Markham. 2020. milliEgo: single-chip mmWave radar aided egomotion estimation via deep sensor fusion. In Proceedings of ACM SenSys.
[20]
Sebastian Madgwick. 2010. An efficient orientation filter for inertial and inertial/magnetic sensor arrays. Report x-io and University of Bristol (UK) 25 (2010).
[21]
Robert Mahony, Tarek Hamel, and Jean-Michel Pflimlin. 2008. Nonlinear complementary filters on the special orthogonal group. IEEE Transactions on automatic control 53, 5 (2008), 1203--1218.
[22]
Thibaud Michel, Hassen Fourati, Pierre Geneves, and Nabil Layaïda. 2015. A comparative analysis of attitude estimation for pedestrian navigation with smartphones. In 2015 International Conference on Indoor Positioning and Indoor Navigation (IPIN). IEEE, 1--10.
[23]
Sebastian Münzner, Philip Schmidt, Attila Reiss, Michael Hanselmann, Rainer Stiefelhagen, and Robert Dürichen. 2017. CNN-based sensor fusion techniques for multimodal human activity recognition.
[24]
Armando Neto, Douglas Macharet, Víctor Campos, and Mario Campos. 2009. Adaptive complementary filtering algorithm for mobile robot localization. J. Braz. Comp. Soc. (2009).
[25]
Nonnarit O.-Larnnithipong and Armando Barreto. 2016. Gyroscope drift correction algorithm for inertial measurement unit used in hand motion tracking. 2016 IEEE SENSORS (2016), 1--3.
[26]
N. Patel, A. Choromanska, P. Krishnamurthy, and F. Khorrami. 2017. Sensor modality fusion with CNNs for UGV autonomous driving in indoor environments. In 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). 1531--1536.
[27]
Jay Prakash, Zhijian Yang, Yu-Lin Wei, and Romit Choudhury. 2019. STEAR: Robust Step Counting from Earables. https://doi.org/10.1145/3345615.3361133
[28]
Pytorch. 2020. Pytorch website. https://pytorch.org/
[29]
Kun Qian, Zhaoyuan He, and Xinyu Zhang. 2020. 3D Point Cloud Generation with Millimeter-Wave Radar. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 4, 4 (2020).
[30]
Kun Qian, Shilin Zhu, Xinyu Zhang, and Li Erran Li. 2020. Robust Multimodal Vehicle Detection in Adverse Weather using Complementary Lidar and Radar Signals. In Proceedings of IEEE CVPR.
[31]
Muhamad Risqi U Saputra, Pedro PB de Gusmao, Chris Xiaoxuan Lu, Yasin Almalioglu, Stefano Rosa, Changhao Chen, Johan Wahlström, Wei Wang, Andrew Markham, and Niki Trigoni. 2020. Deeptio: A deep thermal-inertial odometry with visual hallucination. IEEE Robotics and Automation Letters 5, 2 (2020).
[32]
Paul G. Savage. 1998. Strapdown Inertial Navigation Integration Algorithm Design Part 1: Attitude Algorithms. Journal of Guidance, Control, and Dynamics 21, 1 (1998).
[33]
Sheng Shen, Mahanth Gowda, and Romit Roy Choudhury. 2018. Closing the gaps in inertial motion tracking. In Proceedings of the 24th Annual International Conference on Mobile Computing and Networking. 429--444.
[34]
Yuanchao Shu, Kang G Shin, Tian He, and Jiming Chen. 2015. Last-mile navigation using smartphones. In Proceedings of ACM MobiSys.
[35]
Jürgen Sturm, Stéphane Magnenat, Nikolas Engelhard, François Pomerleau, Francis Colas, Daniel Cremers, Roland Siegwart, and Wolfram Burgard. 2011. Towards a benchmark for RGB-D SLAM evaluation.
[36]
Qinglin Tian, Zoran Salcic, I Kevin, Kai Wang, and Yun Pan. 2015. An enhanced pedestrian dead reckoning approach for pedestrian tracking using smartphones. In 2015 IEEE Tenth International Conference on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP). IEEE, 1--6.
[37]
Sam Tregillus and Eelke Folmer. 2016. Vr-step: Walking-in-place using inertial sensing for hands free navigation in mobile vr environments. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. 1250--1255.
[38]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, undefinedukasz Kaiser, and Illia Polosukhin. 2017. Attention is All You Need. In Proceedings of the International Conference on Neural Information Processing Systems.
[39]
B. Wagstaff and J. Kelly. 2018. LSTM-Based Zero-Velocity Detection for Robust Inertial Navigation. In International Conference on Indoor Positioning and Indoor Navigation (IPIN).
[40]
Gerard Wilkinson, Ahmed Kharrufa, Jonathan Hook, Bradley Pursglove, Gavin Wood, Hendrik Haeuser, Nils Y Hammerla, Steve Hodges, and Patrick Olivier. 2016. Expressy: Using a wrist-worn inertial measurement unit to add expressiveness to touch-based interactions. In Proceedings of the ACM CHI.
[41]
Hongfei Xue, Wenjun Jiang, Chenglin Miao, Fenglong Ma, Shiyang Wang, Ye Yuan, Shuochao Yao, Aidong Zhang, and Lu Su. 2020. DeepMV: Multi-view deep learning for device-free human activity recognition. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 4, 1 (2020), 1--26.
[42]
Hongfei Xue, Wenjun Jiang, Chenglin Miao, Ye Yuan, Fenglong Ma, Xin Ma, Yijiang Wang, Shuochao Yao, Wenyao Xu, Aidong Zhang, et al. 2019. Deepfusion: A deep learning framework for the fusion of heterogeneous sensory data. In Proceedings of the Twentieth ACM International Symposium on Mobile Ad Hoc Networking and Computing. 151--160.
[43]
Hang Yan, Sachini Herath, and Yasutaka Furukawa. 2019. RoNIN: Robust Neural Inertial Navigation in the Wild: Benchmark, Evaluations, and New Methods. arXiv preprint arXiv:1905.12853 (2019).
[44]
Hang Yan, Qi Shan, and Yasutaka Furukawa. 2018. RIDI: Robust IMU double integration. In Proceedings of the European Conference on Computer Vision (ECCV).
[45]
S. Yao, Y. Zhao, H. Shao, D. Liu, S. Liu, Y. Hao, A. Piao, S. Hu, S. Lu, and T. F. Abdelzaher. 2019. SADeepSense: Self-Attention Deep Learning Framework for Heterogeneous On-Device Sensors in Internet of Things Applications. In IEEE INFOCOM 2019 - IEEE Conference on Computer Communications. 1243--1251.
[46]
Seanglidet Yean, Bu Sung Lee, Chai Kiat Yeo, and Chan Hua Vun. 2016. Algorithm for 3D orientation estimation based on Kalman filter and gradient descent. In 2016 IEEE 7th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON). IEEE, 1--6.
[47]
Ming Zeng, Haoxiang Gao, Tong Yu, Ole J Mengshoel, Helge Langseth, Ian Lane, and Xiaobing Liu. 2018. Understanding and improving recurrent networks for human activity recognition by continuous attention. In Proceedings of the 2018 ACM International Symposium on Wearable Computers. 56--63.

Cited By

View all
  • (2024)A comprehensive evaluation of marker-based, markerless methods for loose garment scenarios in varying camera configurationsFrontiers in Computer Science10.3389/fcomp.2024.13799256Online publication date: 5-Apr-2024
  • (2024)MSense: Boosting Wireless Sensing Capability Under Motion InterferenceProceedings of the 30th Annual International Conference on Mobile Computing and Networking10.1145/3636534.3649350(108-123)Online publication date: 29-May-2024
  • (2024)Spatial-Temporal Masked Autoencoder for Multi-Device Wearable Human Activity RecognitionProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36314157:4(1-25)Online publication date: 12-Jan-2024
  • Show More Cited By

Index Terms

  1. Robust Inertial Motion Tracking through Deep Sensor Fusion across Smart Earbuds and Smartphone

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
    Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies  Volume 5, Issue 2
    June 2021
    932 pages
    EISSN:2474-9567
    DOI:10.1145/3472726
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 24 June 2021
    Published in IMWUT Volume 5, Issue 2

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. deep learning
    2. sensor fusion
    3. smart earbuds
    4. smartphone

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)145
    • Downloads (Last 6 weeks)15
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A comprehensive evaluation of marker-based, markerless methods for loose garment scenarios in varying camera configurationsFrontiers in Computer Science10.3389/fcomp.2024.13799256Online publication date: 5-Apr-2024
    • (2024)MSense: Boosting Wireless Sensing Capability Under Motion InterferenceProceedings of the 30th Annual International Conference on Mobile Computing and Networking10.1145/3636534.3649350(108-123)Online publication date: 29-May-2024
    • (2024)Spatial-Temporal Masked Autoencoder for Multi-Device Wearable Human Activity RecognitionProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36314157:4(1-25)Online publication date: 12-Jan-2024
    • (2024)Noisy Labels Make Sense: Data-Driven Smartphone Inertial Tracking without Tedious Annotations2024 IEEE 25th International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM)10.1109/WoWMoM60985.2024.00061(339-348)Online publication date: 4-Jun-2024
    • (2024)Online Path Description Learning Based on IMU Signals From IoT DevicesIEEE Transactions on Mobile Computing10.1109/TMC.2024.340643623:12(11889-11906)Online publication date: Dec-2024
    • (2024)Deep Learning for Inertial Positioning: A SurveyIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2024.338116125:9(10506-10523)Online publication date: 4-Apr-2024
    • (2024)BLEAR: Practical Wireless Earphone Tracking under BLE protocol2024 IEEE International Conference on Pervasive Computing and Communications (PerCom)10.1109/PerCom59722.2024.10494424(164-173)Online publication date: 11-Mar-2024
    • (2024)Centaur: Robust Multimodal Fusion for Human Activity RecognitionIEEE Sensors Journal10.1109/JSEN.2024.338889324:11(18578-18591)Online publication date: 1-Jun-2024
    • (2024)A Learning-Based Multi-Node Fusion Positioning Method Using Wearable Inertial SensorsICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP48485.2024.10447298(1976-1980)Online publication date: 14-Apr-2024
    • (2023)Non-intrusive Human Vital Sign Detection Using mmWave Sensing Technologies: A ReviewACM Transactions on Sensor Networks10.1145/362716120:1(1-36)Online publication date: 3-Nov-2023
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media