Skip to main content

Waveform-Aligned Adaptive Windows for Spectral Component Tracking and Noise Rejection

  • Conference paper
  • First Online:
Sound, Music, and Motion (CMMR 2013)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8905))

Included in the following conference series:

Abstract

A new Short-Time Fourier Transform (STFT) pipeline is presented which uses a two-pass method to adapt the size of the analysis window to the period of the signal, which is assumed in this case to be at least pseudo-periodic. The pipeline begins with pitch estimation, followed by upsampling, to construct a new analysis window that matches theĀ length of a single period. This reduces or eliminates the spectral leakage problems which are typical of traditional STFT analysis techniques. The result is a discrete and accurate spectral representation that provides highly accurate location of partials from one analysis frame to the next. We have extended this method to allow noise cancellation by selecting an analysis window that contains a small whole number of complete cycles. We also present a new display method based on this pipeline which greatly improves the spectrogram through enhanced distinction among partials. Finally, validation is performed by signal restoration on 40 clips, showing the superiority of the pipeline for true periodic signals and comparability for pseudo-periodic signals.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The threshold is also subtracted from any coefficient that is greater than the threshold in a soft thresholding. This not only smooths the time series, but moves it toward zero, which is not desired.

References

  1. Auger, F., Flandrin, P.: Improving the readability of time-frequency and time-scale representations by the reassignment method. IEEE Trans. Signal Process. 43, 1068ā€“1089 (1995)

    ArticleĀ  Google ScholarĀ 

  2. Breitenbach, A.: Against spectral leakage. Measurement 25(2), 135ā€“142 (1999). http://www.sciencedirect.com/science/article/B6V42-3X652TH-C/2/0e63c15ee62b1307b871d9463caa6438

    ArticleĀ  Google ScholarĀ 

  3. Canazza, S.: Noise and Representation Systems: A Comparison among Audio Restoration Algorithms. Lulu Enterprise, USA (2007)

    Google ScholarĀ 

  4. de CheveignƩ, A.: Hearing science and speech recognition. Technical report, IRCAM, February 1998

    Google ScholarĀ 

  5. de CheveignĆ©, A., Kawahara, H.: YIN, a fundamental frequency estimator for speech and music. J. Acoust. Soc. Am. 111(4), 1917ā€“1930 (2002)

    ArticleĀ  Google ScholarĀ 

  6. Gerhard, D.: Pitch extraction and fundamental frequency: history and current techniques. Technical report, Department of Computer Science, University of Regina, November 2003

    Google ScholarĀ 

  7. Hidalgo, R., Fernandez, J., Rivera, R., Larrondo, H.: A simple adjustable window algorithm to improve FFT measurements. IEEE Trans. Instrum. Measur. 51(1), 31ā€“36 (2002)

    ArticleĀ  Google ScholarĀ 

  8. Liuni, M., Robel, A., Matusiak, E., Romito, M., Rodet, X.: Automatic adaptation of the time-frequency resolution for sound analysis and re-synthesis. IEEE Trans. Audio Speech Lang. Process. 21(5), 959ā€“970 (2013)

    ArticleĀ  Google ScholarĀ 

  9. Min, K., Chien, D., Li, S., Jones, C.: Automated two speaker separation system. In: Proceedings of IEEE ICASSP, pp. 537ā€“540 (1988)

    Google ScholarĀ 

  10. Morise, M.: A method to estimate a temporally stable spectral envelope for periodic signals. In: Proceedings of Meetings on Acoustics, vol. 19, p. 060022. Acoustical Society of America (2013)

    Google ScholarĀ 

  11. VĆ­ch, R., Vondra, M.: Pitch synchronous transform warping in voice conversion. In: Esposito, A., Esposito, A.M., Vinciarelli, A., Hoffmann, R., MĆ¼ller, V.C. (eds.) COST 2102. LNCS, vol. 7403, pp. 280ā€“289. Springer, Heidelberg (2012)

    ChapterĀ  Google ScholarĀ 

  12. Xu, F.: Algorithm to remove spectral leakage, close-in noise, and its application to converter test. In: Proceedings of the IEEE Instrumentation and Measurement Technology Conference, pp. 1038ā€“1042, April 2006

    Google ScholarĀ 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to David Gerhard .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

Ā© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Zhao, Y., Gerhard, D. (2014). Waveform-Aligned Adaptive Windows for Spectral Component Tracking and Noise Rejection. In: Aramaki, M., Derrien, O., Kronland-Martinet, R., Ystad, S. (eds) Sound, Music, and Motion. CMMR 2013. Lecture Notes in Computer Science(), vol 8905. Springer, Cham. https://doi.org/10.1007/978-3-319-12976-1_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-12976-1_27

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12975-4

  • Online ISBN: 978-3-319-12976-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics