Skip to main content

Changing the Voice of a Subscriber on the Example of an Implementation of the PSOLA Algorithm for the iOS and Android Mobile Platforms

  • Conference paper
Multimedia Communications, Services and Security (MCSS 2013)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 368))

  • 845 Accesses

Abstract

This paper describes the implementation of the PSOLA algorithm for two mobile platforms with embedded operating systems: Android and iOS. In order to mask the voice identity of a telephony subscriber using a virtual voice, a modification of the time scale of the utterance and of the pitch of the speaker have been implemented, with the influence of these modifications on the recognition of the identity by listeners being studied. Mobile platforms were compared in terms of signal processing time, including the read and write times.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hui, Y., Young, S.: Quality-enhanced voice morphing using maximum likelihood transformations. IEEE Transactions on Audio, Speech, and Language Processing 14, 1301–1312 (2006)

    Article  Google Scholar 

  2. Hui, Y., Young, S.: High quality voice morphing. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 9–12. IEEE Press, New York (2004)

    Google Scholar 

  3. Duxans, H., Bonafonte, A.: Residual Conversion Versus Prediction on Voice Morphing Systems. In: IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 1. IEEE Press, New York (2006)

    Google Scholar 

  4. Ning, X., Xi, S., Zhen, Y.: A Novel Voice Morphing System Using Bi-GMM for High Quality Transformation. In: Ninth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, pp. 485–489 (2008)

    Google Scholar 

  5. Ning, X., Zhen, Y.: A precise estimation of vocal tract parameters for high quality voice morphing. In: 9th International Conference on Signal Processing, pp. 684–687 (2008)

    Google Scholar 

  6. Furuya, K., Moriyama, T., Ozawa, S.: Generation of Speaker Mixture Voice using Spectrum Morphing. In: IEEE International Conference on Multimedia and Expo, pp. 344–347. IEEE Press, New York (2007)

    Google Scholar 

  7. Drgas, S., Zamorski, D., Dabrowski, A.: Speaker verification using various prosodic kernels, Signal Processing Algorithms, Architectures, Arrangements, and Applications Conference Proceedings (SPA), pp. 1–5 (2011)

    Google Scholar 

  8. Drgas, S., Cetnarowicz, D., Dabrowski, A.: Speaker verification based on prosodic features, Signal Processing Algorithms, Architectures, Arrangements, and Applications (SPA), pp. 79–82 (2008)

    Google Scholar 

  9. Lopatka, K., Suchomski, P., Czyzewski, A.: Time-domain prosodic modifications for Text-To-Speech Synthesizer, Signal Processing Algorithms, Architectures, Arrangements, and Applications Conference Proceedings (SPA), pp. 73–77 (2010)

    Google Scholar 

  10. Kupryjanow, A., Czyzewski, A.: Time-scale modification of speech signals for supporting hearing impaired schoolchildren, Signal Processing Algorithms, Architectures, Arrangements, and Applications Conference Proceedings (SPA), pp. 159–162 (2009)

    Google Scholar 

  11. Yinqiu, G., Zhen, Y.: Pitch modification based on syllable units for voice morphing system. In: International Conference on Network and Parallel Computing Workshops, pp. 135–139 (2007)

    Google Scholar 

  12. Kumar, K., Jain, J.: Speech Pitch Shifting using Complex Continuous Wavelet Transform. In: Annual IEEE India Conference, pp. 1–4. IEEE Press, New York (2006)

    Chapter  Google Scholar 

  13. Abe, M.: Speech morphing by gradually changing spectrum parameter and fundamental frequency. In: Fourth International Conference on Spoken Language, vol. 4, pp. 2235–2238 (1996)

    Google Scholar 

  14. Yifeng, S., Jia, J., Lianhong, C.: Detection on PSOLA-modified voices by seeking out duplicated fragments. In: International Conference on Systems and Informatics, pp. 2177–2182 (2012)

    Google Scholar 

  15. Wang, Y., Yang, S.: Speech synthesis based on PSOLA algorithm and modified pitch parameters. In: International Conference on Computational Problem-Solving, pp. 296–299 (2010)

    Google Scholar 

  16. Valbret, H., Moulines, E., Tubach, J.P.: Voice transformation using PSOLA technique. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 145–148 (1992)

    Google Scholar 

  17. Celik, M., Sharma, G., Murat Tekalp, A.: Pitch and Duration Modification for Speech Watermarking. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2, pp. 17–20. IEEE Press, New York (2005)

    Google Scholar 

  18. Nakano, T., Goto, M.: Vocalistener2: A singing synthesis system able to mimic a user’s singing in terms of voice timbre changes as well as pitch and dynamics. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 453–456. IEEE Press, New York (2011)

    Google Scholar 

  19. Lisiecki, B., Meyer, A., Dabrowski, A.: Implementation of sound effects on DSP platform, Signal Processing Algorithms, Architectures, Arrangements, and Applications Conference Proceedings (SPA), pp. 1–3 (2011)

    Google Scholar 

  20. Key Global Telecom Indicators for the World Telecommunication Service Sector, http://www.itu.int/ITU-D/ict/statistics/at_glance/KeyTelecom.html (accessed January 30, 2013)

  21. Smart phones overtake client PCs in 2011, http://www.canalys.com/newsroom/smart-phones-overtake-client-pcs-2011 (accessed January 30, 2013)

  22. Piotrowski, Z., Grabiec, W.: Voice trust in public switched telephone networks. In: Dziech, A., Czyżewski, A. (eds.) MCSS 2012. CCIS, vol. 287, pp. 282–291. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  23. Moulines, E., Charpentier, F.: Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Communication 9(5/6), 453–467 (1990)

    Article  Google Scholar 

  24. Verteletskaya, E., Šimák, B.: Performance Evaluation of Pitch Detection Algorithms, http://access.feld.cvut.cz/view.php?cisloclanku=2009060001 (accessed January 30, 2013)

  25. Zölzer, U.: DAFX: Digital Audio Effects. Wiley, New York (2012)

    Google Scholar 

  26. http://code.google.com/p/sipdroid/ (accessed January 30, 2013)

  27. http://www.ifixit.com/Teardown/iPhone+4S+Teardown/6610/2 (accessed January 30, 2013)

  28. http://samsung.com/global/business/semiconductor/product/application/detail?productId=7644&iaId=844 (accessed January 30, 2013)

  29. Tyler, J.: XDA Developers’ Android Hacker’s Toolkit: The Complete Guide to Rooting, ROMs and Theming. Wiley, New York (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Piotrowski, Z., Ciołek, M. (2013). Changing the Voice of a Subscriber on the Example of an Implementation of the PSOLA Algorithm for the iOS and Android Mobile Platforms. In: Dziech, A., Czyżewski, A. (eds) Multimedia Communications, Services and Security. MCSS 2013. Communications in Computer and Information Science, vol 368. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38559-9_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-38559-9_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38558-2

  • Online ISBN: 978-3-642-38559-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics