Ill-condition enhancement for BC speech using RMC method

Ohidujjaman; Hasan, Mahmudul; Zhang, Shiming; Huda, Mohammad Nurul; Uddin, Mohammad Shorif

doi:10.1007/s10772-024-10159-9

Ill-condition enhancement for BC speech using RMC method

Published: 19 October 2024

Volume 27, pages 1085–1092, (2024)
Cite this article

International Journal of Speech Technology Aims and scope Submit manuscript

Ohidujjaman ORCID: orcid.org/0000-0002-8776-7145¹,
Mahmudul Hasan²,
Shiming Zhang³,
Mohammad Nurul Huda⁴ &
…
Mohammad Shorif Uddin^5,6

76 Accesses
Explore all metrics

Abstract

This paper improves the ill-condition of bone-conducted (BC) speech signal by reducing the eigenvalue expansion. BC speech commonly contains a large spectral dynamic range that causes ill-condition for the classical linear prediction (LP) methods. In the field of numerical analysis, we often face the situation where an ill-conditioned case occurs in finding the solution. Principally, eigenvalue expansion causes ill-condition in numerical analysis. To mitigate this problem, the regularized least squares (RLS) technique is commonly used. Motivated by the RLS concept, we derive the regularized modified covariance (RMC) method for BC speech analysis in this study. The RMC method reduces eigenvalue expansion by compressing the spectral dynamic range of the speech signal. Thus, the RMC method resolves the ill-conditioned problem of LP. In experiments, we show that the RMC method provides compressed eigenvalue expansion than the conventional methods for BC speech where synthetic and real BC speeches are considered. The performance of the RMC method is affected by the setting of the regularization parameter. In this paper, the regularization parameter in practice is iteratively and rule-based derived. The RMC method with such a setting provides the best performance for BC speech analysis.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Spectral analysis of bone-conducted speech using modified linear prediction

Article 16 October 2024

Bone Conducted Speech Signal Enhancement Using LPC and MFCC

Speech Random Impulse Noise Elimination Method Based on Robust PCA Inexact ALM Algorithm

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Data availibility

Data will be made available on request.

Materials availability

Materials will be made available on request.

Code availability

This is private. But we can share the idea on request.

References

Amino, K., Osanai, T., Kamada, H. T. Makinae, & Arai, T. (2011). Bone-conducted speech synthesis based on least squares method. In A. Neustein & H. Patil (Eds.), Forensic speaker recognition: Law enforcement and counter-terrorism (pp. 275–308).
Atal, B. S., & Hanauer, S. L. (1971). Speech analysis and synthesis by linear prediction of the speech wave. Journal of the Acoustical Society of America, 50(2), 637–655.
Article Google Scholar
Bojanczyk, A. W.(1988). The QR decomposition of Toeplitz matrices. Asilomar Conference, 307–311.
Cheng, L., Dou, Y., Zhou, J., Wang, H., & Tao, L.(2023). Speaker-independent spectral enhancement for bone-conducted speech. Algorithms,16(153).
Demeure, C. J., & Scharf, L. L. (1990). Sliding windows and lattice algorithms for computing AR factors in the least squares theory of linear prediction. IEEE Transactions on Acoustics, Speech, and Signal Processing, 38(4), 721–725.
Article Google Scholar
Kabal, P. (2003). Ill-conditioning and bandnwidth expansion in linear prediction of speech. In Proceedings of the IEEE international conference on acoustics, speech, and signal processing (ICASSP) (pp. 824–827).
Makhoul, J. (1975). Linear prediction: A tutorial review. Proceedings of the IEEE, 63(4), 561–580.
Article Google Scholar
Marple, S. L. (1990). A fast computational algorithm for the QR-like decomposition of the modified covariance method of linear prediction. In International conference on acoustics, speech, and signal processing.
Marple, L. (1980). A new autoregressive spectrum analysis algorithm. IEEE Transactions on Acoustics, Speech, and Signal Processing, 28(4), 441–454.
Article Google Scholar
Marple, S. L. (1991). A fast computational algorithm for the modified covariance method of linear prediction. Digital Signal Processing, 1(3), 124–133.
Article Google Scholar
Martin, D. R., & Reichel, L. (2013). Minimization of functionals on the solution of a large-scale discrete ill-posed problem. BIT Numerical Mathematics, 53(1), 153–173.
Article MathSciNet Google Scholar
Ohidujjaman, Sugiura, Y., Shimamura, T., & Makinae, H. (2024). Packet loss concealment using regularized modified linear prediction through bone-conducted speech. In 2024 6th international conference on image, video and signal processing (IVSP 2024) (pp. 142–146).
Ohidujjaman, Sugiura, Y., Yasui, N., Shimamura, T., & Makinae, H. (2024). Regularized modified covariance method for spectral analysis of bone-conducted speech. Journal of Signal Processing, 28(3), 77–87.
Ohidujjaman, Yasui, N., Sugiura, Y., Shimamura, T., & Makinae, H.(2023). Packet loss compensation for voip through bone-conducted speech using modified linear prediction. IEEJ Transaction on Electrical and Electronic Engineering, 18(11), 1781–1790.
Paliwal, K. K., & Rao, P. V. S. (1981). A modified autocorrelation method of linear prediction for pitch-synchronous analysis of voiced speech. Signal Processing, 3(2), 181–185.
Article Google Scholar
Rabiner, L. R., & Schafer, R. W. (2011). Theory and application of digital speech processing. Prentice-Hall.
Google Scholar
Rahman, M.S., & Shimamura, T. (2016). Pitch determination from bone conducted speech. IEICE Transactions on Information and Systems, E99-D, 283–287
Rahman, M. S., & Shimamura, T. (2019). Amplitude variation of bone-conducted speech compared with air-conducted speech. Acoustical Society of Japan, 40(5), 293–301.
Google Scholar
Rahman, M. A., Sugiura, Y., & Shimamura, T. (2017). Spectrum compensation method for speech signals based on prediction error filtering. WSEAS Transactions on Systems and Control, 12, 213–220.
Google Scholar
Rahman, M. A., Sugiura, Y., & Shimamura, T. (2017). Accurate power spectrum estimation of speech with spectrum compensation based on prediction error filtering. WSEAS Transactions on Signal Processing, 13, 21–25.
Google Scholar
Rialan, C. P., & Scharf, L L.(1988). Fast algorithms for computing QR and Cholesky factors of Toeplitz operators. IEEE Transactions on Acoustics Speech Signal Processing, 36(11), 1740–1748.
Wu, Y. (2012). Parametric inverse of severely ill-conditioned Hermitian matrices in signal processing. Journal of the Franklin Institute, 349(3), 1048–1060.
Article MathSciNet Google Scholar
Xingsheng, D., Liangbo, Y., Sichun, P., & Meiqing, D. (2015). An iterative algorithm for solving ill-conditioned linear least squares problems. Geodesy and Geodynamics, 6(6), 453–459.
Article Google Scholar
Zhang, S., Sugiura, Y., & Shimamura, T. (2022). Bone-conducted speech synthesis based on least squares method. IEEJ Transactions on Electrical and Electronic Engineering, 17(3), 425–435.
Article Google Scholar

Download references

Acknowledgements

We sincerely express our gratitude to United International University (UIU) for support in making this research happen. This research was funded by the Institute for Advanced Research Publication Grant of United International University, Ref. No.: IAR-2024-Pub-058.

Funding

This research was funded by the Institute for Advanced Research Publication Grant of United International University, Ref. No.: IAR-2024-Pub-058.

Author information

Authors and Affiliations

Computer Science and Engineering, Daffodil International University, Dhaka, 1216, Bangladesh
Ohidujjaman
Computer Science and Engineering, Comilla University, Comilla, 3506, Bangladesh
Mahmudul Hasan
School of Electrical and Information, Northeast Agricultural University, Harbin, 150030, China
Shiming Zhang
Computer Science and Engineering, United International University, Dhaka, 1212, Bangladesh
Mohammad Nurul Huda
Computer Science and Engineering, Green University of Bangladesh, Kanchon, 1460, Bangladesh
Mohammad Shorif Uddin
Computer Science and Engineering, Jahangirnagar University, Savar, 1342, Bangladesh
Mohammad Shorif Uddin

Authors

Ohidujjaman
View author publications
You can also search for this author inPubMed Google Scholar
Mahmudul Hasan
View author publications
You can also search for this author inPubMed Google Scholar
Shiming Zhang
View author publications
You can also search for this author inPubMed Google Scholar
Mohammad Nurul Huda
View author publications
You can also search for this author inPubMed Google Scholar
Mohammad Shorif Uddin
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

Ohidujjaman: Writing -review & editing, Conceptualization, Formal analysis, Data curation. Mahmudul Hasan: Conceptualization, Formal analysis, Data curation. Shiming Zhang: Writing -review & editing, Data curation. Mohammad Nurul Huda: Conceptualization, Methodology. Mohammad Shorif Uddin: Supervision.

Corresponding authors

Correspondence to Ohidujjaman or Mahmudul Hasan.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Consent for publication

We affirm that this manuscript is unique, unpublished, and not under consideration for publication elsewhere. We confirm that the manuscript has been read and approved by all named authors and that there are no other people who meet the criteria for authorship but are not listed. We further affirm that we have all approved the order of authors listed in the manuscript. We understand that the corresponding author is the sole contact for the editorial process. He is responsible for communicating with the other authors about progress, submissions of revisions, and final approval of proofs.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Ohidujjaman, Hasan, M., Zhang, S. et al. Ill-condition enhancement for BC speech using RMC method. Int J Speech Technol 27, 1085–1092 (2024). https://doi.org/10.1007/s10772-024-10159-9

Download citation

Received: 25 August 2024
Accepted: 05 October 2024
Published: 19 October 2024
Issue Date: December 2024
DOI: https://doi.org/10.1007/s10772-024-10159-9

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Ill-condition enhancement for BC speech using RMC method

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Spectral analysis of bone-conducted speech using modified linear prediction

Bone Conducted Speech Signal Enhancement Using LPC and MFCC

Speech Random Impulse Noise Elimination Method Based on Robust PCA Inexact ALM Algorithm

Explore related subjects

Data availibility

Materials availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Conflict of interest

Consent for publication

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now