Skip to main content

Analysis and Impact of Training Set Size in Cross-Subject Human Activity Recognition

  • Conference paper
  • First Online:
Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications (CIARP 2023)

Abstract

The ubiquity of consumer devices with sensing and computational capabilities, such as smartphones and smartwatches, has increased interest in their use in human activity recognition for healthcare monitoring applications, among others. When developing such a system, researchers rely on input data to train recognition models. In the absence of openly available datasets that meet the model requirements, researchers face a hard and time-consuming process to decide which sensing device to use or how much data needs to be collected. In this paper, we explore the effect of the amount of training data on the performance (i.e., classification accuracy and activity-wise F1-scores) of a CNN model by performing an incremental cross-subject evaluation using data collected from a consumer smartphone and smartwatch. Systematically studying the incremental inclusion of subject data from a set of 22 training subjects, the results show that the model’s performance initially improves significantly with each addition, yet this improvement slows down the larger the number of included subjects. We compare the performance of models based on smartphone and smartwatch data. The latter option is significantly better with smaller sizes of training data, while the former outperforms with larger amounts of training data. In addition, gait-related activities show significantly better results with smartphone-collected data, while non-gait-related activities, such as standing up or sitting down, were better recognized with smartwatch-collected data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.st.com/en/mems-and-sensors/lsm6dso.html.

  2. 2.

    https://time.google.com.

References

  1. Banos, O., et al.: Window size impact in human activity recognition. Sensors 14(4), 6474–6499 (2014). https://doi.org/10.3390/s140406474

    Article  Google Scholar 

  2. Chen, H., et al.: Assessing impacts of data volume and data set balance in using deep learning approach to human activity recognition. In: IEEE International Conference on Bioinformatics and Biomedicine, pp. 1160–1165. IEEE (2017). https://doi.org/10.1109/BIBM.2017.8217821

  3. Chen, W., et al.: Sensecollect: we need efficient ways to collect on-body sensor-based human activity data! Proc. ACM Interact. Mobile Wearable Ubiquitous Technol. 5(3), 1–27 (2021). https://doi.org/10.1145/3478119

    Article  Google Scholar 

  4. Coviello, G., Avitabile, G.: Multiple synchronized inertial measurement unit sensor boards platform for activity monitoring. IEEE Sens. J. 20(15), 8771–8777 (2020). https://doi.org/10.1109/JSEN.2020.2982744

    Article  Google Scholar 

  5. De-La-Hoz-Franco, E., Ariza-Colpas, P., Quero, J.M., Espinilla, M.: Sensor-based datasets for human activity recognition-a systematic review of literature. IEEE Access 6, 59192–59210 (2018). https://doi.org/10.1109/ACCESS.2018.2873502

    Article  Google Scholar 

  6. Demrozi, F., et al.: Human activity recognition using inertial, physiological and environmental sensors: a comprehensive survey. IEEE Access 8, 210816–210836 (2020). https://doi.org/10.1109/ACCESS.2020.3037715

    Article  Google Scholar 

  7. Gholamiangonabadi, D., Kiselov, N., Grolinger, K.: Deep neural networks for human activity recognition with wearable sensors: leave-one-subject-out cross-validation for model selection. IEEE Access 8, 133982–133994 (2020). https://doi.org/10.1109/ACCESS.2020.3010715

    Article  Google Scholar 

  8. Hossin, M., Sulaiman, M.N.: A review on evaluation metrics for data classification evaluations. International journal of data mining & knowledge management process 5(2), 1 (2015). https://doi.org/10.5121/ijdkp.2015.5201

    Article  Google Scholar 

  9. Jaén-Vargas, M., et al.: Effects of sliding window variation in the performance of acceleration-based human activity recognition using deep learning models. PeerJ Comput. Sci. 8, e1052 (2022). https://doi.org/10.7717/peerj-cs.1052

    Article  Google Scholar 

  10. Lara, O.D., Labrador, M.A.: A survey on human activity recognition using wearable sensors. IEEE Commun. Surv. Tutor. 15(3), 1192–1209 (2012). https://doi.org/10.1109/SURV.2012.110112.00192

    Article  Google Scholar 

  11. Leightley, D., Darby, J., Li, B., McPhee, J.S., Yap, M.H.: Human activity recognition for physical rehabilitation. In: IEEE International Conference on Systems, Man, and Cybernetics, pp. 261–266 (2013). https://doi.org/10.1109/SMC.2013.51

  12. Li, H., Shrestha, A., Heidari, H., Le Kernec, J., Fioranelli, F.: Bi-lstm network for multimodal continuous human activity recognition and fall detection. IEEE Sens. J. 20(3), 1191–1201 (2019). https://doi.org/10.1109/JSEN.2019.2946095

    Article  Google Scholar 

  13. Mann, H.B., Whitney, D.R.: On a test of whether one of two random variables is stochastically larger than the other. The annals of mathematical statistics, pp. 50–60 (1947)

    Google Scholar 

  14. Matey-Sanz, M.: Reproducible Package for Analysis and Impact of Training Set Size in Cross-Subject Human Activity Recognition (Jul 2023). https://doi.org/10.5281/zenodo.8163542

  15. Matey-Sanz, M., et al.: Instrumented timed up and go test using inertial sensors from consumer wearable devices. In: 20th International Conference on Artificial Intelligence in Medical, Proceedings, pp. 144–154. Springer (2022). https://doi.org/10.1007/978-3-031-09342-5_14

  16. Mills, D.L.: Internet time synchronization: the network time protocol. IEEE Trans. Commun. 39(10), 1482–1493 (1991). https://doi.org/10.1109/26.103043

    Article  Google Scholar 

  17. Moënne-Loccoz, N., Brémond, F., Thonnat, M.: Recurrent bayesian network for the recognition of human behaviors from video. In: Crowley, J.L., Piater, J.H., Vincze, M., Paletta, L. (eds.) ICVS 2003. LNCS, vol. 2626, pp. 68–77. Springer, Heidelberg (2003). https://doi.org/10.1007/3-540-36592-3_7

    Chapter  Google Scholar 

  18. Narayana, P.A., et al.: Deep-learning-based neural tissue segmentation of mri in multiple sclerosis: effect of training set size. J. Magn. Reson. Imaging 51(5), 1487–1496 (2020). https://doi.org/10.1002/jmri.26959

    Article  Google Scholar 

  19. Oluwalade., B., et al.: Human activity recognition using deep learning models on smartphones and smartwatches sensor data. In: Proc. of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies, HEALTHINF, pp. 645–650. INSTICC, SciTePress (2021). https://doi.org/10.5220/0010325906450650

  20. Ramezan, C.A., et al.: Effects of training set size on supervised machine-learning land-cover classification of large-area high-resolution remotely sensed data. Remote Sensing 13(3), 368 (2021). https://doi.org/10.3390/rs13030368

    Article  Google Scholar 

  21. Saez, Y., Baldominos, A., Isasi, P.: A comparison study of classifier algorithms for cross-person physical activity recognition. Sensors 17(1), 66 (2016). https://doi.org/10.3390/s17010066

    Article  Google Scholar 

  22. Sandha, S.S., et al.: Time awareness in deep learning-based multimodal fusion across smartphone platforms. In: IEEE/ACM Fifth International Conference on IoT Design and Implementation, pp. 149–156. IEEE (2020). https://doi.org/10.1109/IOTDI49375.2020.00022

  23. Sansano, E., et al.: A study of deep neural networks for human activity recognition. Comput. Intell. 36(3), 1113–1139 (2020). https://doi.org/10.1111/coin.12318

    Article  MathSciNet  Google Scholar 

  24. Vallat, R.: Pingouin: statistics in python. J. Open Source Soft. 3(31), 1026 (2018). https://doi.org/10.21105/joss.01026

  25. Yazdansepas, D., et al.: A multi-featured approach for wearable sensor-based human activity recognition. In: IEEE International Conference on Healthcare Informatics, pp. 423–431 (2016). https://doi.org/10.1109/ICHI.2016.81

Download references

Acknowledgments

M. Matey-Sanz and A. González-Pérez are funded by the Spanish Ministry of Universities [grants FPU19/05352 and FPU17/03832]. This study was supported by project PID2020-120250RB-I00 (SyMptOMS-ET) funded by MCIN/AEI/10.13039/501100011033.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Miguel Matey-Sanz .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Matey-Sanz, M., Torres-Sospedra, J., González-Pérez, A., Casteleyn, S., Granell, C. (2024). Analysis and Impact of Training Set Size in Cross-Subject Human Activity Recognition. In: Vasconcelos, V., Domingues, I., Paredes, S. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2023. Lecture Notes in Computer Science, vol 14469. Springer, Cham. https://doi.org/10.1007/978-3-031-49018-7_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-49018-7_28

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-49017-0

  • Online ISBN: 978-3-031-49018-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics