Abstract
Text-to-image AI models can generate novel images for design inspiration. Yet, their applications for collaborative design (co-design) purposes and interoperability within simulation-based, immersive settings have been scarcely explored. In this paper, we propose a novel, multi-modal approach for interactive public participation in urban design projects. The main objectives of our research are (a) to describe a methodological workflow of integrating text-to-image AI models into VR-mediated co-design workshops, and (b) to investigate the applicability of the proposed workflow through a set of completed and prospective case studies. Both studies are parts of a broader research project, which aims to revitalize the city of Derby, UK through producing a series of sustainable design visions. Through these case studies, we discuss some preliminary results and introduce our future work.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Portman, M.E., Natapov, A., Fisher-Gewirtzman, D.: To go where no man has gone before: virtual reality in architecture, landscape architecture and environmental planning. Comput. Environ. Urban Syst. 54, 376–384 (2015)
Meenar, M., Kitson, J.: Using multi-sensory and multi-dimensional immersive virtual reality in participatory planning. Urban Sci. 4(3), 34 (2020)
Zhang, C., Zeng, W., Liu, L.: UrbanVR: an immersive analytics system for context-aware urban design. Comput. Graph. 99, 128–138 (2021)
Safikhani, S., Keller, S., Schweiger, G., Pirker, J.: Immersive virtual reality for extending the potential of building information modeling in architecture, engineering, and construction sector: systematic review. Int. J. Dig. Earth 15(1), 503–526 (2022)
Liu, X.: Three-dimensional visualized urban landscape planning and design based on virtual reality technology. IEEE Access 8, 149510–149521 (2020)
Schrom-Feiertag, H., Stubenschrott, M., Regal, G., Matyus, T., Seer, S.: An interactive and responsive virtual reality environment for participatory urban planning. In: Proceedings of the 11th Annual Symposium on Simulation for Architecture and Urban Design pp. 1–7 (2020)
Kim, S., Kim, J., Kim, B.: Immersive virtual reality-aided conjoint analysis of urban square preference by living environment. Sustainability 12(16), 6440 (2020)
Yu, R., Gu, N., Lee, G., Khan, A.: A systematic review of architectural design collaboration in immersive virtual environments. Designs 6(5), 93 (2022)
Panya, D.S., Kim, T., Choo, S.: An interactive design change methodology using a BIM-based Virtual Reality and Augmented Reality. J. Build. Eng. 106030 (2023)
Delgado, J.M.D., Oyedele, L., Demian, P., Beach, T.: A research agenda for augmented and virtual reality in architecture, engineering and construction. Adv. Eng. Inform. 45, 101122 (2020)
Michels, A.: Citizen participation in local policy making: design and democracy. Int. J. Public Adm. 35(4), 285–292 (2012)
Binder, T., Brandt, E.: The design: lab as platform in participatory design research. CoDesign 4(2), 115–129 (2008)
Nabatchi, T., Leighninger, M.: Citizenship, outside the public square. In: Public Participation for 21st Century Democracy, pp. 1–12 (2015)
Sanders, E.B.-N., Stappers, P.J.: Co-creation and the new landscapes of design. CoDesign 4(1), 5–18 (2008)
Pillai, A.G., et al.: Communicate, Critique and Co-create (CCC) future technologies through design fictions in VR environment. In: Companion Publication of the 2020 ACM Designing Interactive Systems Conference, pp. 413–416 (2020)
Paasch Knudsen, S., Husted Hansen, H., Ørngreen, R.: Exploring the Learning Potentials of Augmented Reality Through Speculative Design, pp. 156–163 (2022)
Gu, N., Amini Behbahani, P.: A critical review of computational creativity in built environment design. Buildings 11(1), 29 (2021)
Yildirim, E.: Text-to-image generation - AI in architecture. Art Archit. Theory Pract. Exper. 97 (2022)
Kingma, D.P., Welling, M.: Auto-Encoding Variational Bayes. CoRR, abs/1312.6114 (2013)
Goodfellow, I., et al.: Generative adversarial networks. Commun. ACM 63(11), 139–144 (2020)
Open AI: DALL·E 2 (2002). https://openai.com/product/dall-e-2
Stability AI: Stable Diffusion Public Release (2023). https://stability.ai/blog/stable-diffusion-public-release
Google Research: Imagen: Text-to-Image Diffusion Models (2022). https://imagen.research.google/
Gafni, O., Polyak, A., Ashual, O., Sheynin, S., Parikh, D., Taigman, Y.: Make-a-scene: scene-based text-to-image generation with human priors. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, Tal (eds.) Computer Vision – ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XV, pp. 89–106. Springer Nature Switzerland, Cham (2022). https://doi.org/10.1007/978-3-031-19784-0_6
Rombach, R., Blattmann, A., Lorenz, D., Esser, P., Ommer, B.: High-resolution image synthesis with latent diffusion models. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2022-June, pp. 10674–10685 (2022)
Saharia, C., et al.: Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding (2022)
Ruiz, N., Li, Y., Jampani, V., Pritch, Y., Rubinstein, M., Aberman, K.: DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation (2022)
Ho, J., Jain, A., Abbeel, P.: Denoising diffusion probabilistic models. Adv. Neural. Inf. Process. Syst. 33, 6840–6851 (2020)
Dhariwal, P., Nichol, A.: Diffusion models beat GANs on image synthesis. Adv. Neural. Inf. Process. Syst. 11, 8780–8794 (2021)
Gu, S., et al.: Vector quantized diffusion model for text-to-image synthesis. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2022-June, pp. 10686–10696 (2022)
Seneviratne, S., Senanayake, D., Rasnayaka, S., Vidanaarachchi, R., Thompson, J.: DALLE-URBAN: capturing the urban design expertise of large text to image transformers (2022)
UrbanistAI: UrbanistAI (2023). https://www.urbanistai.com/
Ehab, A., Heath, T.: Exploring immersive co-design: comparing human interaction in real and virtual elevated urban spaces in london. Sustain. 15(12), 9184 (2023). https://doi.org/10.3390/su15129184
Funding
This project has been generously supported by the Osborne Legacy. The financial assistance provided by the legacy has been instrumental in the successful completion of this research effort.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Bussell, C., Ehab, A., Hartle-Ryan, D., Kapsalis, T. (2023). Generative AI for Immersive Experiences: Integrating Text-to-Image Models in VR-Mediated Co-design Workflows. In: Stephanidis, C., Antona, M., Ntoa, S., Salvendy, G. (eds) HCI International 2023 Posters. HCII 2023. Communications in Computer and Information Science, vol 1836. Springer, Cham. https://doi.org/10.1007/978-3-031-36004-6_52
Download citation
DOI: https://doi.org/10.1007/978-3-031-36004-6_52
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-36003-9
Online ISBN: 978-3-031-36004-6
eBook Packages: Computer ScienceComputer Science (R0)