Abstract
This paper presents a comparison of different spectral re-synthesis algorithms. This study describes more particularly the Di Martino and Pierron (D&P) and real-time iterative spectrogram inversion with look-ahead algorithms from an architectural point of view because they are dedicated to real-time process. We use Python (as simulation language) because it allows easily the comparison of performances of the all the algorithms studied according to some important algorithm parameters as the number of iterations or the number of look-ahead frames. This comparison confirms the advantage of using D&P for real-time process from an architectural point of view.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Beauregard, G. T., Zhu, X., & Wyse, L. (2005). An efficient algorithm for real-time spectrogram inversion. In Proceedings of the 8th International Conference on Digital Audio Effects (DAFX-05) (pp. 116–221).
Brown, M. C. (2001). Python. New York: McGraw-Hill.
Chami, M., Di Martino, J., Pierron, L., & Ibn Elhaj, E. (2012) Real-time signal reconstruction from short-time Fourier transform magnitude spectra using FPGAs. In Proceedings of the 5th International Conference on Information Systems and Economic Intelligence.
Di Martino, J., & Pierron, L. (2010) Synthétiseur numérique audio amélioré. Patent “Oesovox”, 10/02674 INPI, Paris.
Fluckiger, F. (1995). Understanding networked multimedia: Applications and technology, Paperback.
Griffin, D. W., & Lim, J. S. (1984). Signal estimation from short-time Fourier transform. IEEE Transactions on Acoustics, Speech and Signal Processing ASSP, 32(2), 236–243.
Griffin, D. W. & Lim, J. S. (1984). Speech synthesis from short-time Fourier transform magnitude and its application to speech processing. In Proceedings of the IEEE International Conference Acoustics, Speech, Signal Processing, vol. 9 (pp. 61–64).
Lawrence, R. (1975). Rabiner and Bernard Gold: Theory and application of digital signal processing. Englewood Cliffs: Prentice-Hall Inc.
Lutz, M., & Ascher, D. (1998). Learning python. O’Reilly & Associates.
Nawab, S. H., Quartieri, T. F., & Lim, J. S. (1983). Signal reconsctruction from short-time Fourier transform magnitude. IEEE Transactions on Acoustics, Speech and Signal Processing ASSP, 31(4), 986–998.
Zhu, X., Beauregard, G. T., & Wyse, L. (2007). Real-time signal estimation from modified short-time Fourier transform magnitude spectra. IEEE Transactions on Audio, Speech and Language Processing, 15(5), 1645–1653.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chami, M., Immassi, M. & Martino, J.D. An architectural comparison of signal reconstruction algorithms from short-time Fourier transform magnitude spectra. Int J Speech Technol 18, 433–441 (2015). https://doi.org/10.1007/s10772-015-9281-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10772-015-9281-9