Imageability-Based Multi-modal Analysis of Urban Environments for Architects and Artists

Pistola, Theodora; Georgakopoulou, Nefeli; Shvets, Alexander; Chatzistavros, Konstantinos; Xefteris, Vasileios-Rafail; García, Alba Táboas; Koulalis, Ilias; Diplaris, Sotiris; Wanner, Leo; Vrochidis, Stefanos; Kompatsiaris, Ioannis

doi:10.1007/978-3-031-13321-3_18

Theodora Pistola¹¹,
Nefeli Georgakopoulou¹¹,
Alexander Shvets¹²,
Konstantinos Chatzistavros¹¹,
Vasileios-Rafail Xefteris¹¹,
Alba Táboas García¹²,
Ilias Koulalis¹¹,
Sotiris Diplaris¹¹,
Leo Wanner^12,13,
Stefanos Vrochidis¹¹ &
…
Ioannis Kompatsiaris¹¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13373))

Included in the following conference series:

International Conference on Image Analysis and Processing

1070 Accesses
1 Citations

Abstract

According to urban planner Kevin Lynch, imageability is the ability of a physical object to evoke a strong image in any viewer, making it memorable. The concept of imageability is important for architects and urban designers, so that their creations meet the needs of the citizens and improve the aesthetics of the place. Recently, computer vision and textual analysis techniques have been investigated for calculating the imageability of a place. In this paper, we propose a novel multi-modal system that utilises both visual and textual analysis methods to estimate the imageability score of a place. In addition, an image sentiment analysis deep learning model had been developed to provide supplementary information about the sentiment that is evoked to citizens by urban locations. Finally, a text generation algorithm is used to provide an explanation of the information extracted by the data analysis in a form of text to facilitate the works of architects and urban designers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://github.com/ayoolaolafenwa/PixelLib.
2.
https://assetstore.unity.com/3d/environments.
3.
https://www.google.com/forms/about/.
4.
In this work, we experimented with free-text comments that were apriori about spaces, as stated in Sect. 4. Therefore, we left other challenges such as the separation of texts about spaces and happenings in spaces for future research.
5.
The concept “rotonda” is taken from user describing nearby roundabout which is not captured on the image.

References

Paivio, A., Yuille, J.C., Madigan, S.A.: Concreteness, imagery, and meaningfulness values for 925 nouns. J. Exp. Psychol. 76, 1–25 (1968)
Article Google Scholar
Lynch, K.: The Image of the City, vol. 11. MIT press, Cambridge (1960)
Google Scholar
Ortis, A., Farinella, G., Battiato, S.: An overview on image sentiment analysis: methods. datasets and current challenges. ICETE, 290–300 (2019). https://doi.org/10.5220/0007909602900300
You, Q., Luo, J., Jin, H., Yang, J.: Robust image sentiment analysis using progressively trained and domain transferred deep networks. In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, AAAI 2015, pp. 381–388. AAAI Press (2015)
Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4708 (2017)
Google Scholar
Chollet, F.: Xception: deep learning with depthwise separable convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1251–1258 (2017)
Google Scholar
Zhou, B., Lapedriza, À., Khosla, A., Oliva, A., Torralba, A.: Places: a 10 million image database for scene recognition. IEEE Trans. Pattern Anal. Mach. Intell. 40, 1452–1464 (2017). https://doi.org/10.1109/TPAMI.2017.2723009
Article Google Scholar
Rofes, A., et al.: Imageability ratings across languages. Behav. Res. Methods 50(3), 1187–1197 (2017). https://doi.org/10.3758/s13428-017-0936-0
Article Google Scholar
Shvets, A., Wanner, L.: Concept extraction using pointer–generator networks and distant supervision for data augmentation. In: Keet, C.M., Dumontier, M. (eds.) EKAW 2020. LNCS (LNAI), vol. 12387, pp. 120–135. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-61244-3_8
Chapter Google Scholar
Ljubešić, N., Fišer, D., Peti-Stantić, A.: Predicting concreteness and imageability of words within and across languages via word embeddings. arXiv preprint arXiv:1807.02903 (2018)
Scott, G.G., Keitel, A., Becirspahic, M., Yao, B., Sereno, S.C.: The glasgow norms: ratings of 5,500 words on nine scales. Behav. Res. Methods 51(3), 1258–1270 (2018). https://doi.org/10.3758/s13428-018-1099-3
Article Google Scholar
Umemura, K., et al.: Tell as you imagine: sentence imageability-aware image captioning. In: Lokoč, J., et al. (eds.) MMM 2021. LNCS, vol. 12573, pp. 62–73. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-67835-7_6
Chapter Google Scholar
Mille, S., Carlini, R., Burga, A., Wanner, L.: FORGe at SemEval-2017 Task 9: deep sentence generation based on a sequence of graph transducers. In: Proceedings of the 11th International Workshop on Semantic Evaluation, Vancouver, pp. 920–923 (2017)
Google Scholar
Meenar, M., Afzalan, N., Hajrasouliha, A.: Analyzing Lynch’s city imageability in the digital age. J. Plan. Educ. Res., 0739456X19844573 (2019)
Google Scholar
McCunn, L.J., Gifford, R.: Spatial navigation and place imageability in sense of place. Cities 74, 208–218 (2018)
Article Google Scholar
LNCS Homepage. www.springer.com/lncs. Accessed 21 Nov 2016
Quercia, D., O’Hare, N.K., Cramer, H.: Aesthetic capital: what makes London look beautiful, quiet, and happy?. In: Proceedings of the 17th ACM Conference on Computer Supported Cooperative Work & Social Computing, pp. 945–955 (2014)
Google Scholar
Porzi, L., Rota Buló, S., Lepri, B., Ricci, E.: Predicting and understanding urban perception with convolutional neural networks. In: Proceedings of the 23rd ACM International Conference on Multimedia, pp. 139–148 (2015)
Google Scholar
Dubey, A., Naik, N., Parikh, D., Raskar, R., Hidalgo, C.A.: Deep learning the city: quantifying urban perception at a global scale. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 196–212. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_12
Chapter Google Scholar
Qiu, W., Li, W., Liu, X., Huang, X.: Subjective street scene perceptions for Shanghai with large-scale application of computer vision and machine learning (No. 6166). EasyChair (2021)
Google Scholar
Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 801–818 (2018)
Google Scholar
Zhou, B., et al.: Semantic understanding of scenes through the ade20k dataset. Int. J. Comput. Vision 127(3), 302–321 (2019)
Article Google Scholar
Biljecki, F., Ito, K.: Street view imagery in urban analytics and GIS: a review. Landscape Urban Plan. 215, 104217 (2021)
Article Google Scholar
Isola, P., Xiao, J., Parikh, D., Torralba, A., Oliva, A.: What makes a photograph memorable? IEEE Trans. Pattern Anal. Mach. Intell. 36(7), 1469–1482 (2013)
Article Google Scholar
Hasler, D., Suesstrunk, S.E.: Measuring colorfulness in natural images. In: Human Vision and Electronic Imaging VIII, vol. 5007, pp. 87–95. International Society for Optics and Photonics (2003)
Google Scholar
Mel’čuk, I.: Dependency Syntax. State University of New York Press, Albany (1988)
Google Scholar
Huang, J., Obracht-Prondzynska, H., Kamrowska-Zaluska, D., Sun, Y., Li, L.: The image of the city on social media: a comparative study using “Big Data’’ and “Small Data’’ methods in the Tri-City Region in Poland. Landscape Urban Plan. 206, 103977 (2021)
Article Google Scholar
Kastner, M.A., et al.: Estimating the imageability of words by mining visual characteristics from crawled image data. Multimedia Tools Appl. 79(3), 18167–18199 (2020). https://doi.org/10.1007/s11042-019-08571-4
Article Google Scholar
Miller, G.A.: WordNet: a lexical database for English. Commun. ACM 38(11), 39–41 (1995). https://doi.org/10.1145/219717.219748
Article Google Scholar
Manzo, L.C., Perkins, D.D.: Finding common ground: the importance of place attachment to community participation and planning. J. Plan. Lit. 20, 335–350 (2006)
Article Google Scholar

Download references

Acknowledgments.

This work was supported by the EC-funded research and innovation programme H2020 Mindspaces: “Art-driven adaptive outdoors and indoors design” under the grant agreement No.825079.

Author information

Authors and Affiliations

Information Technologies Institute - CERTH, Thessaloniki, Greece
Theodora Pistola, Nefeli Georgakopoulou, Konstantinos Chatzistavros, Vasileios-Rafail Xefteris, Ilias Koulalis, Sotiris Diplaris, Stefanos Vrochidis & Ioannis Kompatsiaris
NLP Group, Pompeu Fabra University, Roc Boronat, 138, Barcelona, Spain
Alexander Shvets, Alba Táboas García & Leo Wanner
Catalan Institute for Research and Advanced Studies (ICREA), Barcelona, Spain
Leo Wanner

Authors

Theodora Pistola
View author publications
You can also search for this author in PubMed Google Scholar
Nefeli Georgakopoulou
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Shvets
View author publications
You can also search for this author in PubMed Google Scholar
Konstantinos Chatzistavros
View author publications
You can also search for this author in PubMed Google Scholar
Vasileios-Rafail Xefteris
View author publications
You can also search for this author in PubMed Google Scholar
Alba Táboas García
View author publications
You can also search for this author in PubMed Google Scholar
Ilias Koulalis
View author publications
You can also search for this author in PubMed Google Scholar
Sotiris Diplaris
View author publications
You can also search for this author in PubMed Google Scholar
Leo Wanner
View author publications
You can also search for this author in PubMed Google Scholar
Stefanos Vrochidis
View author publications
You can also search for this author in PubMed Google Scholar
Ioannis Kompatsiaris
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Theodora Pistola .

Editor information

Editors and Affiliations

National Research Council, Lecce, Italy
Pier Luigi Mazzeo
Università Politecnica delle Marche, Ancona, Italy
Emanuele Frontoni
Boston University, Boston, MA, USA
Stan Sclaroff
National Research Council, Lecce, Italy
Cosimo Distante

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pistola, T. et al. (2022). Imageability-Based Multi-modal Analysis of Urban Environments for Architects and Artists. In: Mazzeo, P.L., Frontoni, E., Sclaroff, S., Distante, C. (eds) Image Analysis and Processing. ICIAP 2022 Workshops. ICIAP 2022. Lecture Notes in Computer Science, vol 13373. Springer, Cham. https://doi.org/10.1007/978-3-031-13321-3_18

Download citation

DOI: https://doi.org/10.1007/978-3-031-13321-3_18
Published: 07 August 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-13320-6
Online ISBN: 978-3-031-13321-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Imageability-Based Multi-modal Analysis of Urban Environments for Architects and Artists