Abstract
This paper describes a new multipitch estimation method. The proposed approach is based on the calculation of the autocorrelation function of the Multi-scale product of the composite signal and its filtered version by a comb filter. After analyzing the composite speech signal, the autocorrelation applied on the multi-scale product (MP) of the signal allows us to find the first pitch; it’s the dominant one. After applying the comb filter, we substract the resulting signal from the original one. Then we apply the same analysis to the residue to obtain the pitch estimation of the intrusion. Besides, this method is applied and evaluated on the Cooke database. It’s also compared to other well known algorithms. Experimental results show the robustness and the effectiveness of our approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
De Cheveigné, A.: Separation of concurrent harmonic sounds: Fundamental frequency estimation and a time-domain cancellation model for auditory processing. J. Acoust. Soc. Amer. 93(6), 3271–3290 (1993)
Gilbert, K.D., Payton, K.L.: Source Enumeration of Speech Mixtures Using Pitch Harmonics. In: IEEE Workshop on Applications of Audio and Acoustics (2009)
Ben Messaoud, M.A., Bouzid, A., Ellouze, N.: Using multi-scale product spectrum for single and multi-pitch estimation. IET Signal Process 5(3), 344–355 (2011)
Huang, Q., Wang, D.: Multipitch Estimation for Speech Mixture Based on Multi-Length Windows Harmonic Model. In: 4th International Joint Conference on Computational Sciences and Optimization (2011)
Meddis, R., O’Mard, L.: A unitary model of pitch perception. J. Acoust. Soc. Am. 3, 1811–1820 (1997)
Vishnubhotla, S., Espy-Wilson, C.: An algorithm for Multipitch tracking in Co-Channel speech, Brisbane, Australia (2008)
Stark, M., Wohlmayr, M., Pernkopf, F.: Single Channel Speech Separation Using Source-Filter Representation. Pattern Recognition (2010)
Ben Messaoud, M.A., Bouzid, A., Ellouze, N.: Autocorrelation of the Speech Multi-Scale Product for Voicing Decision and Pitch Estimation. Cognitive Computation 2(3), 151–159 (2010)
Bouzid, A.: Contribution à la détection des instants d’ouverture et de fermeture de la glotte sur les signaux de parole voisée par transformée en ondelettes. Rapport de thèse de doctorat (2004)
Gainza, M., Lawlor, B., Coyle, E.: Multi pitch estimation by using modified IIR Comb Filters. In: 47th International Symposium ELMAR (2005)
Cooke, M.P.: Modeling auditory processing and organization. Doctoral thesis, Sheffield University, Sheffield, UK (1993)
Wu, M., Wang, D., Brown, G.J.: A multipitch tracking algorithm for noisy speech. IEEE Trans. Speech and Audio Process 11(3), 229–241 (2003)
Tolonen, T., Karjalainen, M.: A computationally efficient multipitch analysis model. IEEE Trans. Speech and Audio Process. 8(6), 708–716 (2000)
Gu, Y.H., Van Bokhoven, W.M.G.: Co-channel speech separation using frequency bin non-linear adaptive filter. In: Proc. Int. Conf. IEEE on Acoust. Speech and Signal Process, ICASSP 1991, pp. 949–952 (1991)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zeremdini, J., Ben Messaoud, M.A., Bouzid, A., Ellouze, N. (2013). Contribution to the Multipitch Estimation by Multi-scale Product Analysis. In: Drugman, T., Dutoit, T. (eds) Advances in Nonlinear Speech Processing. NOLISP 2013. Lecture Notes in Computer Science(), vol 7911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38847-7_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-38847-7_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38846-0
Online ISBN: 978-3-642-38847-7
eBook Packages: Computer ScienceComputer Science (R0)