Interactive Evolutionary Computation Improving Voice Impressions with Keeping Speaker Personality for Real-Time Speech

Fukumoto, Makoto; Fukushima, Yuta; Miyamoto, Taichi

doi:10.1007/978-3-031-71115-2_24

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14902)

Included in the following conference series:

International Conference on Computer Information Systems and Industrial Management

Abstract

Meetings are now commonly held over the Internet. In such meetings, participants use background displays to improve the impression they make on other members. To similarly improve the impression of a user’s voice over the Internet, this study proposes an Interactive Evolutionary Computation (IEC) method that adjusts the parameters of a voice filter based on real-time speech while preserving the user’s personality. A concrete system was built using a Genetic Algorithm and Koigoe, a software voice filter. Listening experiments were conducted to evaluate the effectiveness of the proposed IEC in terms of increasing fitness values and preserving the speaker’s personality. The results showed that the proposed IEC is capable of finding a good parameter set for the voice filter; however, its performance needs to be improved, because the best filter obtained did not surpass the impression of the original, unfiltered voice. Furthermore, the results of the evaluation experiment suggest that the proposed IEC preserves the user’s personality.
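
The general shape of an interactive genetic algorithm of the kind the abstract describes might be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the parameter names `pitch` and `formant` and their ranges are hypothetical and are not taken from Koigoe, the `rate_voice` callback stands in for a human listener's rating of a filtered voice, and the population size, number of generations, and genetic operators are arbitrary choices made for the example.

```python
import random

# Hypothetical filter parameters and ranges (assumption: the abstract does
# not list the voice filter's actual parameters).
BOUNDS = {"pitch": (-12.0, 12.0), "formant": (-12.0, 12.0)}


def random_individual(rng):
    """Draw one candidate parameter set uniformly within BOUNDS."""
    return {k: rng.uniform(lo, hi) for k, (lo, hi) in BOUNDS.items()}


def crossover(a, b, rng):
    """Uniform crossover: each parameter is inherited from either parent."""
    return {k: (a[k] if rng.random() < 0.5 else b[k]) for k in BOUNDS}


def mutate(ind, rng, rate=0.2, scale=0.1):
    """Add Gaussian noise to some parameters, clamped to BOUNDS."""
    out = dict(ind)
    for k, (lo, hi) in BOUNDS.items():
        if rng.random() < rate:
            noisy = out[k] + rng.gauss(0.0, scale * (hi - lo))
            out[k] = min(hi, max(lo, noisy))
    return out


def tournament(pop, scores, rng, k=2):
    """Pick k random candidates and return the best-rated one."""
    picks = rng.sample(range(len(pop)), k)
    return pop[max(picks, key=lambda i: scores[i])]


def interactive_ga(rate_voice, pop_size=8, generations=5, seed=0):
    """Evolve filter parameters using ratings supplied by rate_voice.

    In a real IEC system rate_voice would apply the candidate filter to
    the speaker's voice and ask a listener for a subjective score.
    """
    rng = random.Random(seed)
    pop = [random_individual(rng) for _ in range(pop_size)]
    best, best_score, history = None, float("-inf"), []
    for _ in range(generations):
        scores = [rate_voice(ind) for ind in pop]
        gen_best = max(range(pop_size), key=lambda i: scores[i])
        if scores[gen_best] > best_score:
            best, best_score = dict(pop[gen_best]), scores[gen_best]
        history.append(best_score)
        # Elitism: carry the generation's best candidate over unchanged.
        next_pop = [dict(pop[gen_best])]
        while len(next_pop) < pop_size:
            parent_a = tournament(pop, scores, rng)
            parent_b = tournament(pop, scores, rng)
            next_pop.append(mutate(crossover(parent_a, parent_b, rng), rng))
        pop = next_pop
    return best, best_score, history


def simulated_listener(ind):
    """Stand-in for a human rating (assumption): prefers a mild shift."""
    return -abs(ind["pitch"] - 2.0) - abs(ind["formant"] - 1.0)
```

Calling `interactive_ga(simulated_listener)` returns the best parameter set found, its score, and the best score after each generation. Small populations and few generations are typical in interactive settings because each evaluation costs the listener time and attention.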

Author information

Authors and Affiliations

Fukuoka Institute of Technology, Fukuoka, 8110295, Japan
Makoto Fukumoto & Yuta Fukushima
Graduate School of Engineering, Fukuoka Institute of Technology, Fukuoka, 8110295, Japan
Taichi Miyamoto

Corresponding author

Correspondence to Makoto Fukumoto.

Editor information

Editors and Affiliations

Bialystok University of Technology, Białystok, Poland
Khalid Saeed
VSB - Technical University of Ostrava, Ostrava, Czech Republic
Jiří Dvorský

Cite this paper

Fukumoto, M., Fukushima, Y., Miyamoto, T. (2024). Interactive Evolutionary Computation Improving Voice Impressions with Keeping Speaker Personality for Real-Time Speech. In: Saeed, K., Dvorský, J. (eds) Computer Information Systems and Industrial Management. CISIM 2024. Lecture Notes in Computer Science, vol 14902. Springer, Cham. https://doi.org/10.1007/978-3-031-71115-2_24

DOI: https://doi.org/10.1007/978-3-031-71115-2_24
Published: 30 August 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-71114-5
Online ISBN: 978-3-031-71115-2
eBook Packages: Computer Science, Computer Science (R0)
