Abstract
This paper describes the implementation of the PSOLA algorithm for two mobile platforms with embedded operating systems: Android and iOS. In order to mask the voice identity of a telephony subscriber using a virtual voice, a modification of the time scale of the utterance and of the pitch of the speaker have been implemented, with the influence of these modifications on the recognition of the identity by listeners being studied. Mobile platforms were compared in terms of signal processing time, including the read and write times.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Hui, Y., Young, S.: Quality-enhanced voice morphing using maximum likelihood transformations. IEEE Transactions on Audio, Speech, and Language Processing 14, 1301–1312 (2006)
Hui, Y., Young, S.: High quality voice morphing. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 9–12. IEEE Press, New York (2004)
Duxans, H., Bonafonte, A.: Residual Conversion Versus Prediction on Voice Morphing Systems. In: IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 1. IEEE Press, New York (2006)
Ning, X., Xi, S., Zhen, Y.: A Novel Voice Morphing System Using Bi-GMM for High Quality Transformation. In: Ninth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, pp. 485–489 (2008)
Ning, X., Zhen, Y.: A precise estimation of vocal tract parameters for high quality voice morphing. In: 9th International Conference on Signal Processing, pp. 684–687 (2008)
Furuya, K., Moriyama, T., Ozawa, S.: Generation of Speaker Mixture Voice using Spectrum Morphing. In: IEEE International Conference on Multimedia and Expo, pp. 344–347. IEEE Press, New York (2007)
Drgas, S., Zamorski, D., Dabrowski, A.: Speaker verification using various prosodic kernels, Signal Processing Algorithms, Architectures, Arrangements, and Applications Conference Proceedings (SPA), pp. 1–5 (2011)
Drgas, S., Cetnarowicz, D., Dabrowski, A.: Speaker verification based on prosodic features, Signal Processing Algorithms, Architectures, Arrangements, and Applications (SPA), pp. 79–82 (2008)
Lopatka, K., Suchomski, P., Czyzewski, A.: Time-domain prosodic modifications for Text-To-Speech Synthesizer, Signal Processing Algorithms, Architectures, Arrangements, and Applications Conference Proceedings (SPA), pp. 73–77 (2010)
Kupryjanow, A., Czyzewski, A.: Time-scale modification of speech signals for supporting hearing impaired schoolchildren, Signal Processing Algorithms, Architectures, Arrangements, and Applications Conference Proceedings (SPA), pp. 159–162 (2009)
Yinqiu, G., Zhen, Y.: Pitch modification based on syllable units for voice morphing system. In: International Conference on Network and Parallel Computing Workshops, pp. 135–139 (2007)
Kumar, K., Jain, J.: Speech Pitch Shifting using Complex Continuous Wavelet Transform. In: Annual IEEE India Conference, pp. 1–4. IEEE Press, New York (2006)
Abe, M.: Speech morphing by gradually changing spectrum parameter and fundamental frequency. In: Fourth International Conference on Spoken Language, vol. 4, pp. 2235–2238 (1996)
Yifeng, S., Jia, J., Lianhong, C.: Detection on PSOLA-modified voices by seeking out duplicated fragments. In: International Conference on Systems and Informatics, pp. 2177–2182 (2012)
Wang, Y., Yang, S.: Speech synthesis based on PSOLA algorithm and modified pitch parameters. In: International Conference on Computational Problem-Solving, pp. 296–299 (2010)
Valbret, H., Moulines, E., Tubach, J.P.: Voice transformation using PSOLA technique. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 145–148 (1992)
Celik, M., Sharma, G., Murat Tekalp, A.: Pitch and Duration Modification for Speech Watermarking. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2, pp. 17–20. IEEE Press, New York (2005)
Nakano, T., Goto, M.: Vocalistener2: A singing synthesis system able to mimic a user’s singing in terms of voice timbre changes as well as pitch and dynamics. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 453–456. IEEE Press, New York (2011)
Lisiecki, B., Meyer, A., Dabrowski, A.: Implementation of sound effects on DSP platform, Signal Processing Algorithms, Architectures, Arrangements, and Applications Conference Proceedings (SPA), pp. 1–3 (2011)
Key Global Telecom Indicators for the World Telecommunication Service Sector, http://www.itu.int/ITU-D/ict/statistics/at_glance/KeyTelecom.html (accessed January 30, 2013)
Smart phones overtake client PCs in 2011, http://www.canalys.com/newsroom/smart-phones-overtake-client-pcs-2011 (accessed January 30, 2013)
Piotrowski, Z., Grabiec, W.: Voice trust in public switched telephone networks. In: Dziech, A., Czyżewski, A. (eds.) MCSS 2012. CCIS, vol. 287, pp. 282–291. Springer, Heidelberg (2012)
Moulines, E., Charpentier, F.: Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Communication 9(5/6), 453–467 (1990)
Verteletskaya, E., Šimák, B.: Performance Evaluation of Pitch Detection Algorithms, http://access.feld.cvut.cz/view.php?cisloclanku=2009060001 (accessed January 30, 2013)
Zölzer, U.: DAFX: Digital Audio Effects. Wiley, New York (2012)
http://code.google.com/p/sipdroid/ (accessed January 30, 2013)
http://www.ifixit.com/Teardown/iPhone+4S+Teardown/6610/2 (accessed January 30, 2013)
http://samsung.com/global/business/semiconductor/product/application/detail?productId=7644&iaId=844 (accessed January 30, 2013)
Tyler, J.: XDA Developers’ Android Hacker’s Toolkit: The Complete Guide to Rooting, ROMs and Theming. Wiley, New York (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Piotrowski, Z., Ciołek, M. (2013). Changing the Voice of a Subscriber on the Example of an Implementation of the PSOLA Algorithm for the iOS and Android Mobile Platforms. In: Dziech, A., Czyżewski, A. (eds) Multimedia Communications, Services and Security. MCSS 2013. Communications in Computer and Information Science, vol 368. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38559-9_16
Download citation
DOI: https://doi.org/10.1007/978-3-642-38559-9_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38558-2
Online ISBN: 978-3-642-38559-9
eBook Packages: Computer ScienceComputer Science (R0)