Skip to main content

Interactive Evolutionary Computation Improving Voice Impressions with Keeping Speaker Personality for Real-Time Speech

  • Conference paper
  • First Online:
Computer Information Systems and Industrial Management (CISIM 2024)

Abstract

Recently, we generally have meetings via the Internet. In this situation, we use background display to improve our impression of other members of the meetings. To improve the users’ voice via the Internet, this study proposes an Interactive Evolutionary Computation (IEC) that adjusts the voice filter based on real-time pronunciations while keeping user’s personality. The concrete system was constructed by employing a Genetic Algorithm and Koigoe, a software voice filter. The listening experiments were conducted to investigate the efficiencies of the proposed IEC from perspectives of increasing the fitness values and keeping the speaker’s personality. The results showed that the proposed IEC has enough possibility to find a good parameter set of the voice filter; however, we need to improve its performance because the obtained best filter did not overcome the impression of the original voice without any filter. Furthermore, the proposed IEC could be considered to keep the user’s personality based on the result of the evaluation experiment.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Zoom Support, Touch up my appearance. https://support.zoom.com/hc/en/article?id=zm_kb&sysparm_article=KB0060352#h_01EEEKSJTQPX33FTK8K1H46YFQ. Accessed 15 March 2024

  2. Naunheim, M.R., Puka, E., Huston, M.N.: Do you like your voice? A population-based survey of voice satisfaction and voice enhancement. Laryngoscope (2023). https://doi.org/10.1002/lary.30822

  3. Uchida, T.: Subjective impressions of speech sound converted with a fundamental frequency and a spectral frequency scale. In: Proceedings the 78th Annual Convention of the Japanese Psychological Association, p. 647 (2014). (in Japanese)

    Google Scholar 

  4. Elian, M.T., Bao, S., Masuko, S., Yamanaka, T.: Designing gender ambiguous voice agents -effects of gender ambiguous voice agents on usability of voice user interfaces. Int. J. Affect. Eng. 22(1), 53–62 (2023)

    Article  Google Scholar 

  5. Yano, S., Niimi, M., Mizumachi, M.: Proposal for Motegoe voice changer providing enriched speech communication -automatic modification of Brisk voice, thankful voice and apology voice. IEICE Tech. Rep. 114(511), EMM2014-84, 43–48 (2015) (in Japanese)

    Google Scholar 

  6. Dawkins, R.: The Blind Watchmaker. Penguin Books, USA (1986)

    Google Scholar 

  7. Takagi, H.: Interactive evolutionary computation: Fusion of the capabilities of EC optimization and human evaluation. Proc. IEEE 89(9), 1275–1296 (2001)

    Article  Google Scholar 

  8. Holland, J.H.: Adaptation in natural and artificial systems: an introductory analysis with applications to biology control and artificial intelligence. The University of Michigan Press, USA (1975)

    Google Scholar 

  9. Fukumoto, M.: An efficiency of interactive differential evolution for optimization of warning sound with reflecting individual preference. Trans. Electric. Electron. Eng. 10(S1), S77–S82 (2015). https://doi.org/10.1002/tee.22167

  10. Watanabe, A., Tanji, M., Iba, H.: Creating singing vocal expressions by means of interactive evolutionary computation. In: Proceedins of the 5th International Workshop on Computational Intelligence & Applications, pp. 278–283 (2009)

    Google Scholar 

  11. Inoue, A., Fukumoto, M.: A proposal of creating ideal UTAU voice based on voice of the user’s own key by interactive differential evolution. In: Proceedings of the 6th International Conference on Computational Science/Intelligence and Applied Informatics (2019). https://doi.org/10.1109/CSII.2019.00017

  12. Miyamoto, T., Gan, H., Fukumoto, M.: Making an english speech similar to the user’s voice using UTAU and interactive differential evolution. Int. J. Affect. Eng. 22(3), 245–251 (2023)

    Article  Google Scholar 

  13. Sato, Y.: Voice quality conversion using interactive evolution of prosodic control. Appl. Soft Comput. 5(2), 181–192 (2005)

    Article  Google Scholar 

  14. Miyamoto, T., Fukumoto, M.: Making english voices similar to user’s voices using voice changer and interactive differential evolution. In: Proceedings of the 10th International Symposium Affective Science and Engineering, AM-1B-05 (2024)

    Google Scholar 

  15. Koigoe. http://koigoemoe.g2.xrea.com/koigoe/koigoe.html. Accessed 15 March 2024

  16. Herdy, M.: Evolutionary optimization based on subjective selection: evolving blends of coffee. In: Proceedings of 5th European Congress on Intelligent Techniques and Soft Computing, pp. 640–644 (1997)

    Google Scholar 

  17. Fukumoto M., Hanada, Y.: A proposal for creation of beverage suited for user by blending juices based on interactive genetic algorithm. In: Proceedings of the IEEE International Conference SMC2019 (2019). https://doi.org/10.1109/SMC.2019.8914494

  18. Fukumoto, M., Inoue, M., Koga S., Imai, J.: Interactive differential evolution using time information required for user’s selection: In a case of optimizing fragrance composition. In: Proceedings of the 2015 IEEE Congress on Evolutionary Computation (CEC), p. 7257155 (2015). https://doi.org/10.1109/CEC.2015.7257155

  19. Fukumoto, M., Ienaga, T.: A proposal for optimization method of vibration pattern of mobile device with interactive genetic algorithm. Lecture Notes in Computer Science (2013). https://doi.org/10.1007/978-3-642-39238-2_29

  20. Fukumoto, M., Miyamoto, T., Gan, H.: Interactive evolutionary computation creating congruent media content composed of different media types. Inform. Eng. Express 10(1) (2024). https://doi.org/10.52731/iee.v10.i1.803

  21. Osgood, C.E., Suci, G.K., Tannenbaum, P.: The measurement of meaning. University of Illinois Press, USA (1957)

    Google Scholar 

  22. Kuwahara, N., Ohgushi, K.: The role of formant frequencies and bandwidths in the perception of speaker. Trans. Inst. Electron. Commun. Eng. Japan A 69(4), 509–517 (1986). (in Japanese)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Makoto Fukumoto .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Fukumoto, M., Fukushima, Y., Miyamoto, T. (2024). Interactive Evolutionary Computation Improving Voice Impressions with Keeping Speaker Personality for Real-Time Speech. In: Saeed, K., Dvorský, J. (eds) Computer Information Systems and Industrial Management. CISIM 2024. Lecture Notes in Computer Science, vol 14902. Springer, Cham. https://doi.org/10.1007/978-3-031-71115-2_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-71115-2_24

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-71114-5

  • Online ISBN: 978-3-031-71115-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics