Skip to main content

Examining the Relationship Between Playing a Chord with Expressions and Hand Movements Using MediaPipe

  • Conference paper
  • First Online:
Human Interface and the Management of Information (HCII 2024)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14691))

Included in the following conference series:

  • 298 Accesses

Abstract

This paper examined whether hand movements are responsible for expression in a piano performance. A player played a chord 100 times each with 12 different performance expressions, which consisted of three articulations (tenuto, heavy staccato, and light staccato) and four-level dynamics. The landmarks’ coordinates of her right fingers, wrist, elbow, and shoulder estimated as she played the chord (12 \(\times \) 100 times), judged by MediaPipe Pose and Hands, were used for machine learning training and testing. In the results for the learning model, the testing accuracy rate was 0.99. In each performance expression, F1-scores were 0.94–1.00. This suggested a relationship between performance expressions and hand movements. Moreover, when the player happened to play a different, unintended type of expression, her landmarks’ coordinates were close to those when she had aimed exactly to play that type of performance expression.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Lugaresi, C., et alet al.: Mediapipe: a framework for building perception pipelines, arXiv:1906.08172 (2019)

  2. Google MediaPipe, Pose landmark detection guide. https://developers.google.com/mediapipe/solutions/vision/pose_landmarker. (Accessed 28 Jan 2024)

  3. Google MediaPipe, Hand landmark detection guide. https://developers.google.com/mediapipe/solutions/vision/hand_landmarker. (Accessed 28 Jan 2024)

  4. Zhang, F., et al.: Mediapipe hands: On-device real-time hand tracking. arXiv:2006.10214 (2020)

  5. Bazarevsky, V., Zhang, F.: On-Device, Real-Time hand Tracking with mediaPipe. https://blog.research.google/2019/08/on-device-real-time-hand-tracking-with.html. (Accessed 28 Jan 2024)

  6. Wu, E., Nishioka, H., Furuya, S., Koike, H.: Marker-removal networks to collect precise 3D hand data for RGB-based estimation and its application in piano. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 2977-2986 (2023)

    Google Scholar 

  7. Jones, T.: The Art of Piano Articulation: Mastering Musical Expression. https://tamecajones.com/. (Accessed 28 Jan 2024)

  8. Creative Piano Teacher: The Long and Short of Articulations: How to Correctly Interpret Piano Articulations, https://creativepianoteacher.com/. (Accessed 28 Jan 2024)

  9. Take Note: Learn How to Read Sheet Music: Dynamics, Articulations and Tempo, https://blog.sheetmusicplus.com/. (Accessed 28 Jan 2024)

  10. Guan, Y., Plötz, T.: Ensembles of deep lstm learners for activity recognition using wearables. Proc. ACM interactive, Mobile, Wearable Ubiquitous Technol. 1(2), 1–28 (2017)

    Article  Google Scholar 

  11. TensorFlow. https://github.com/tensorflow/tensorflow. (Accessed 28 Jan 2024)

  12. Pytorch, https://pytorch.org/. (Accessed 28 Jan 2024)

  13. Yanagisawa, Y., Akahani, J., Satoh, T.: Shape-based similarity query for trajectory of mobile objects. In: Chen, M.-S., Chrysanthis, P.K., Sloman, M., Zaslavsky, A. (eds.) MDM 2003. LNCS, vol. 2574, pp. 63–77. Springer, Heidelberg (2003). https://doi.org/10.1007/3-540-36389-0_5

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chika Oshima .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Oshima, C., Takatsu, T., Nakayama, K. (2024). Examining the Relationship Between Playing a Chord with Expressions and Hand Movements Using MediaPipe. In: Mori, H., Asahi, Y. (eds) Human Interface and the Management of Information. HCII 2024. Lecture Notes in Computer Science, vol 14691. Springer, Cham. https://doi.org/10.1007/978-3-031-60125-5_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-60125-5_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-60124-8

  • Online ISBN: 978-3-031-60125-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics