Abstract
We investigate a method for detecting wrongly sung lyrics from a singing voice. The proposed method aligns the input singing voice with a reference singing voice using dynamic time warping, then examines the frame-by-frame distance to locate errors. However, the absolute value of the distance is affected by the difference in singer individuality between the reference and input singing voices. We therefore attempted to adapt the input singer's individuality to that of the reference singer by a linear transformation. Experimental results showed that wrong lyrics could be detected with high accuracy when the differing part of the lyrics was long. We also investigated iterating the linear transformation and found no benefit from a second or third transformation.
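The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the Euclidean frame distance, the per-dimension affine fit (a diagonal simplification of a general linear transformation), the fixed threshold, and all function names (`dtw_path`, `fit_affine`, `apply_affine`, `flag_errors`) are assumptions made here for illustration only.

```python
import math

def frame_dist(a, b):
    """Euclidean distance between two feature vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def dtw_path(ref, inp):
    """Align two frame sequences with basic DTW; return (ref_idx, inp_idx) pairs."""
    n, m = len(ref), len(inp)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_dist(ref[i - 1], inp[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    # Backtrack from the last cell to recover the warping path.
    path, i, j = [], n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min((cost[i - 1][j - 1], i - 1, j - 1),
                      (cost[i - 1][j], i - 1, j),
                      (cost[i][j - 1], i, j - 1))
    path.reverse()
    return path

def fit_affine(ref, inp, path):
    """Per-dimension least-squares fit ref ~ a * inp + b over aligned frames."""
    params = []
    for k in range(len(ref[0])):
        xs = [inp[j][k] for _, j in path]
        ys = [ref[i][k] for i, _ in path]
        mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
        var = sum((x - mx) ** 2 for x in xs)
        cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
        a = cov / var if var > 0 else 1.0
        params.append((a, my - a * mx))
    return params

def apply_affine(frames, params):
    """Map input frames toward the reference singer's feature space."""
    return [[a * x + b for x, (a, b) in zip(f, params)] for f in frames]

def flag_errors(ref, inp, path, threshold):
    """Return input frame indices whose aligned distance exceeds the threshold."""
    return [j for i, j in path if frame_dist(ref[i], inp[j]) > threshold]

# Toy usage with one-dimensional "features" (hypothetical data).
reference = [[0.0], [1.0], [2.0], [3.0]]
sung = [[0.0], [1.0], [9.0], [3.0]]       # third frame deviates
alignment = dtw_path(reference, sung)
suspect_frames = flag_errors(reference, sung, alignment, threshold=2.0)
```

In a fuller version of this sketch, the alignment and the transformation would be re-estimated in turn (align, fit, transform, re-align), which corresponds to the iterative linear transformation that the abstract reports as giving no benefit beyond the first pass.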
© 2018 Springer International Publishing AG
Cite this paper
Miyagawa, I., Chiba, Y., Nose, T., Ito, A. (2018). Detection of Singing Mistakes from Singing Voice. In: Pan, JS., Tsai, PW., Watada, J., Jain, L. (eds) Advances in Intelligent Information Hiding and Multimedia Signal Processing. IIH-MSP 2017. Smart Innovation, Systems and Technologies, vol 82. Springer, Cham. https://doi.org/10.1007/978-3-319-63859-1_17
DOI: https://doi.org/10.1007/978-3-319-63859-1_17
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63858-4
Online ISBN: 978-3-319-63859-1
eBook Packages: Engineering, Engineering (R0)