research-article

Tangible and body-related interaction techniques for a singing voice synthesis installation

Authors:
Jochen Feitsch

University of Applied Sciences Düsseldorf

University of Applied Sciences Düsseldorf
View Profile

,
Marco Strobel

University of Applied Sciences Düsseldorf

University of Applied Sciences Düsseldorf
View Profile

,
Stefan Meyer

University of Applied Sciences Düsseldorf

University of Applied Sciences Düsseldorf
View Profile

,
Christian Geiger

University of Applied Sciences Düsseldorf, Düsseldorf, Germany

University of Applied Sciences Düsseldorf, Düsseldorf, Germany
View Profile

TEI '14: Proceedings of the 8th International Conference on Tangible, Embedded and Embodied InteractionFebruary 2014Pages 157–164https://doi.org/10.1145/2540930.2540962

Published:16 February 2014Publication History

TEI '14: Proceedings of the 8th International Conference on Tangible, Embedded and Embodied Interaction

Pages 157–164

ABSTRACT

This paper presents an interactive media installation that aims at providing users with the experience to sing like an opera singer from the 19th century. We designed a set of tangible and body-related interaction and feedback techniques and developed a singing voice synthesizer system that is controlled by the user's mouth shapes and gestures. This musical interface allows users to perform an aria without real singing. We adapted techniques from 3D body tracking, face recognition, singing voice synthesis, 3D rendering and tangible interaction to integrate them into an interactive musical interface.

References

D. Butler, S. Izadi, O. Hilliges, D. Molyneaux, S. Hodges, and D. Kim. Shake'n'sense: Reducing interference for overlapping structured light depth cameras. In Proc. of the SIGCHI Conference on Human Factors in Computing Systems, CHI '12, pages 1933--1936, New York, NY, USA, 2012. ACM. Google ScholarDigital Library
P. Cano, A. Loscos, J. Bonada, M. D. Boer, and X. Serra. Voice morphing system for impersonating in karaoke applications. In In Proceedings of the ICMC, 2000.Google Scholar
J. Cheng and P. Huang. Real-time mouth tracking and 3d reconstruction. In Image and Signal Processing (CISP), 2010 3rd International Congress on, volume 4, pages 1524--1528, 2010.Google ScholarCross Ref
N. D'Alessandro, C. d'Alessandro, S. L. Beux, and B. Doval. Real-time calm synthesizer: New approaches in hands-controlled voice synthesis. In NIME, pages 266--271. IRCAM, 2006. Google ScholarDigital Library
G. C. de Silva, T. Smyth, and M. J. Lyons. A novel face-tracking mouth controller and its application to interacting with bioacoustic models. In Y. Nagashima and M. J. Lyons, editors, NIME, pages 169--172. Shizuoka University of Art and Culture, 2004. Google ScholarDigital Library
G. Fant. Acoustic Theory of Speech Production. Mouton De Gruyter, 1960.Google Scholar
J. Feitsch, M. Strobel, and C. Geiger. Caruso - singen wie ein tenor. In Mensch & Computer Workshopband, pages 531--534, 2013.Google ScholarCross Ref
J. Feitsch, M. Strobel, and C. Geiger. Singing like a tenor without a real voice. In Advances in Computer Entertainment, pages 258--269. Springer, 2013.Google ScholarDigital Library
C. Geiger, H. Reckter, D. Paschke, F. Schulz, and C. Poepel. Towards Participatory Design and Evaluation of Theremin-based Musical Interfaces. In Proc. of the Int. Conference on New Interfaces for Musical Expression, pages 303--306, 2008.Google Scholar
M. Goto, T. Nakano, S. Kajita, Y. Matsusaka, S. Nakaoka, and K. Yokoi. Voca-listener and voca-watcher: Imitating a human singer by using signal processing. In Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on, pages 5393--5396, 2012.Google ScholarCross Ref
R. Linggard. Electronic Synthesis of Speech. Cambridge University Press, 1985. Google ScholarDigital Library
M. J. Lyons, M. Haehnel, and N. Tetsutani. Designing, playing, and performing with a vision-based mouth interface. In Proceedings of the 2003 Conference on New Interfaces for Musical Expression, NIME '03, pages 116--121, Singapore, Singapore, 2003. National University of Singapore. Google ScholarDigital Library
G. Odowichuk, S. Trail, P. Driessen, W. Nie, and W. Page. Sensor fusion: Towards a fully expressive 3d music control interface. In Communications, Computers and Signal Processing (PacRim), 2011 IEEE Pacific Rim Conference on, pages 836--841, 2011.Google ScholarCross Ref
G. E. Peterson and H. L. Barney. Control methods used in a study of the vowels. The Journal of the Acoustical Society of America, 24(2):175--184, 1952.Google ScholarCross Ref
X. Rodet. Synthesis and processing of the singing voice. In In Proc. 1st IEEE Benelux Workshop on Model based Processing and Coding of Audio (MPCA-2002, 2002.Google Scholar
T. Yonezawa, N. Suzuki, K. Mase, and K. Kogure. Handysinger: Expressive singing voice morphing using personified hand-puppet interface. In Proc. of the Int. Conference on New Interfaces for Musical Expression, NIME '05, pages 121--126, Singapore, 2005. Google ScholarDigital Library

Index Terms

Tangible and body-related interaction techniques for a singing voice synthesis installation
1. Human-centered computing

Recommendations

DeepSinger: Singing Voice Synthesis with Data Mined From the Web
KDD '20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

In this paper, we develop DeepSinger, a multi-lingual multi-singer singing voice synthesis (SVS) system, which is built from scratch using singing training data mined from music websites. The pipeline of DeepSinger consists of several steps, including ...
Read More
Singing Voice Database
Speech and Computer
Abstract
The first publicly available singing voice database, which was first released in 2012, is presented in this paper. This database contains recordings of professional singers including one Grammy Award winner. The database includes so-called plain ...
Read More
Rhythm Speech Lyrics Input for MIDI-Based Singing Voice Synthesis
PCM '09: Proceedings of the 10th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing

This paper presents useful techniques and considerations in implementing underlying mandarin singing voice synthesis system using the RSLI unit. The system can receive the continuous speech of the lyrics of a song, and can synthesize the intended song ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
TEI '14: Proceedings of the 8th International Conference on Tangible, Embedded and Embodied Interaction
February 2014
401 pages
ISBN:9781450326353
DOI:10.1145/2540930
Conference Chairs:
Andreas Butz
University of Munich (LMU), Germany
,
Saul Greenberg
University of Calgary, Canada
,
Program Chairs:
Saskia Bakker
Eindhoven University of Technology, the Netherlands
,
Lian Loke
University of Sydney, Australia
,
Alexander De Luca
University of Munich (LMU), Germany
Copyright © 2014 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 16 February 2014
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
3D character performance
interactive media installation
singing voice synthesis
tangible musical interfaces
Qualifiers
- research-article
Conference

Acceptance Rates
TEI '14 Paper Acceptance Rate46of172submissions,27%Overall Acceptance Rate393of1,367submissions,29%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 267
  Total Downloads
- Downloads (Last 12 months)10
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Tangible and body-related interaction techniques for a singing voice synthesis installation

TEI '14: Proceedings of the 8th International Conference on Tangible, Embedded and Embodied Interaction

ABSTRACT

References

Cited By

Index Terms

Recommendations

DeepSinger: Singing Voice Synthesis with Data Mined From the Web

Singing Voice Database

Rhythm Speech Lyrics Input for MIDI-Based Singing Voice Synthesis