Modeling Speech Based on Harmonic Plus Noise Models

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3445))

Abstract

Hybrid models of speech have received increasing interest from the speech processing community. Splitting the speech signal into a periodic and a non-periodic part increases the quality of the prosodic modifications necessary in concatenative speech synthesis systems. This paper focuses on the decomposition of the speech signal into a periodic and a non-periodic part based on a Harmonic plus Noise Model (HNM). While the harmonic part is modeled explicitly, the non-periodic part (or noise part) is obtained by subtracting the harmonic part from the original speech signal in the time domain. Three versions of HNM are discussed with respect to their effectiveness in performing this decomposition. The objective of the discussion is to determine which of these versions could be useful for prosodic modifications and synthesis of speech.
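The decomposition described in the abstract can be illustrated with a minimal sketch. It is not one of the three HNM versions discussed in the paper. It assumes a single analysis frame with a known, constant fundamental frequency that spans a whole number of pitch periods, so that projecting the frame onto cosine/sine pairs at the harmonic frequencies coincides with a least-squares fit. The function names (`harmonic_part`, `decompose`) and all parameter values are hypothetical and chosen for illustration only.

```python
import math
import random


def harmonic_part(frame, f0, fs, num_harmonics):
    """Estimate the periodic part of a frame as a sum of harmonics of f0.

    Assumption (not from the paper): f0 is known and constant, and the frame
    covers a whole number of pitch periods, so the cos/sin basis is orthogonal
    and the projections below equal the least-squares amplitudes.
    """
    n_samples = len(frame)
    harmonic = [0.0] * n_samples
    for k in range(1, num_harmonics + 1):
        w = 2.0 * math.pi * k * f0 / fs  # k-th harmonic, radians per sample
        a = 2.0 / n_samples * sum(x * math.cos(w * n) for n, x in enumerate(frame))
        b = 2.0 / n_samples * sum(x * math.sin(w * n) for n, x in enumerate(frame))
        for n in range(n_samples):
            harmonic[n] += a * math.cos(w * n) + b * math.sin(w * n)
    return harmonic


def decompose(frame, f0, fs, num_harmonics):
    """Return (harmonic, noise); noise is the time-domain residual."""
    harmonic = harmonic_part(frame, f0, fs, num_harmonics)
    noise = [x - h for x, h in zip(frame, harmonic)]
    return harmonic, noise


# Synthetic test signal: 5 harmonics of 100 Hz plus white Gaussian noise.
fs, f0, num_harmonics = 8000, 100, 5
n_samples = 800  # exactly 10 pitch periods of 80 samples each
rng = random.Random(0)
clean = [
    sum(math.cos(2.0 * math.pi * k * f0 * n / fs + 0.3 * k) / k
        for k in range(1, num_harmonics + 1))
    for n in range(n_samples)
]
noisy = [c + 0.05 * rng.gauss(0.0, 1.0) for c in clean]

harmonic, noise = decompose(noisy, f0, fs, num_harmonics)
```

Because the noise part is defined as the residual, adding `harmonic` and `noise` sample by sample returns the input frame. Real speech has a time-varying fundamental frequency and harmonic amplitudes, which this fixed-frequency sketch does not account for.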

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Stylianou, Y. (2005). Modeling Speech Based on Harmonic Plus Noise Models. In: Chollet, G., Esposito, A., Faundez-Zanuy, M., Marinaro, M. (eds) Nonlinear Speech Modeling and Applications. NN 2004. Lecture Notes in Computer Science, vol 3445. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11520153_11

  • DOI: https://doi.org/10.1007/11520153_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-27441-4

  • Online ISBN: 978-3-540-31886-6

  • eBook Packages: Computer Science, Computer Science (R0)
