Toward a System of Visual Classification, Analysis and Recognition of Performance-Based Moving Images in the Artistic Field

Castronuovo, Michael; Fiordelmondo, Alessandro; Saba, Cosetta

doi:10.1007/978-3-031-51026-7_29

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14366))

Included in the following conference series:

International Conference on Image Analysis and Processing

204 Accesses

Abstract

This paper proposes a research program focused on the design of a model for the recognition, analysis and classification of video art works and documentations based on their semiotic aspects and audiovisual content. Focusing on a corpus of art cinema, video art, and performance art, the theoretical framework involves bringing together semiotics, film studies, visual studies, and performance studies with the innovative technologies of computer vision and artificial intelligence. The aim is to analyze the performance aspect to interpret contextual references and cultural constructs recorded in artistic contexts, contributing to the classification and analysis of video art works with complex semiotic characteristics. Underlying the conceptual framework is the simultaneous use of a set of technologies, such as pose estimation, facial recognition, object recognition, motion analysis, audio analysis, and natural language processing, to improve recognition accuracy and create a large set of labeled audiovisual data. In addition, the authors propose a prototype application to explore the primary challenges of such a research project.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Archival facilities in the GLAM (Galleries, Libraries, Archives and Museums) and MAB (Museums, Archives, Libraries) sectors are invested in the European Union’s strategic program for digitization, preservation and online accessibility of cultural heritage, supported by the Plan for Recovery, which will be completed by 2030. https://commission.europa.eu/strategy-and-policy/priorities-2019-2024/europe-fit-digital-age/europes-digital-decade-digital-targets-2030_en (last accessed 17 August 2023).

References

Andrea, P., Antonio, S.: Teorie dell’immagine. il dibattito contemporaneo (2009)
Google Scholar
Arcagni, S., et al.: L’occhio della macchina, vol. 705. Einaudi (2018)
Google Scholar
Audry, S.: Art in the age of machine learning. Mit Press (2021)
Google Scholar
Avola, D., Cinque, L., Fagioli, A., Foresti, G.L., Fragomeni, A., Pannone, D.: 3D hand pose and shape estimation from RGB images for keypoint-based hand gesture recognition. Pattern Recogn. 129, 108762 (2022)
Article Google Scholar
Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: Yolov4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)
Bosi, M., Pretto, N., Guarise, M., Canazza, S.: Sound and music computing using AI: Designing a standard. In: Proceedings of the 18th Sound Music Computing Conference (SMC’21) (2021)
Google Scholar
Falcon, A., Serra, G., Lanz, O.: Video question answering supported by a multi-task learning objective. Multimedia Tools and Applications pp. 1–28 (2023). https://doi.org/10.1007/s11042-023-14333-0
Fontanille, J.: Soma & séma. Figures du corps, Maisonneuve et Larose (2004)
Google Scholar
Goldberg, R.: Performance now: Live art from the 21st Century. Thames and Hudson (2018)
Google Scholar
Grespi, B.: Figure del corpo. Gesto e immagine in movimento (2019)
Google Scholar
Hossain, M.S., Muhammad, G.: Emotion recognition using deep learning approach from audio-visual emotional big data. Inform. Fusion 49, 69–78 (2019)
Article Google Scholar
Huyghe, P., et al.: Pierre huyghe. (No Title) (1999)
Google Scholar
Kazakos, E., Nagrani, A., Zisserman, A., Damen, D.: Epic-fusion: audio-visual temporal binding for egocentric action recognition. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 5492–5501 (2019)
Google Scholar
Khurana, D., Koli, A., Khatter, K., Singh, S.: Natural language processing: State of the art, current trends and challenges. Multimed. Tools Appl. 82(3), 3713–3744 (2023)
Article Google Scholar
Kim, J.W., Choi, J.Y., Ha, E.J., Choi, J.H.: Human pose estimation using mediapipe pose and optimization method based on a humanoid model. Appl. Sci. 13(4), 2700 (2023)
Article Google Scholar
Ko, B.C.: A brief review of facial emotion recognition based on visual information. Sensors 18(2), 401 (2018)
Google Scholar
Mitchell, W.J.: Pictorial turn. In: Visual Global Politics, pp. 230–232. Routledge (2018)
Google Scholar
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: Unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
Google Scholar
Redmon, J., Farhadi, A.: Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
Saba, C.: Per un supplemento d’indagine: la forza deterritorializzante del video. In: Valentini V., Saba C. (edited by), Medium senza medium. Amnesia e cannibalizzazione: il video dopo gli anni ‘90, pp. 79–127. Bulzoni (2015)
Google Scholar
Saba, C.G.: Extended cinema: the performative power of cinema in installation practices. Cinéma & Cie 13(1), 123–140 (2013)
Google Scholar
Sun, K., Xiao, B., Liu, D., Wang, J.: Deep high-resolution representation learning for human pose estimation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5693–5703 (2019)
Google Scholar
Wang, L., et al.: Temporal segment networks: Towards good practices for deep action recognition. In: European Conference on Computer Vision pp. 20–36. Springer (2016)
Google Scholar
Yao, G., Lei, T., Zhong, J.: A review of convolutional-neural-network-based action recognition. Pattern Recogn. Lett. 118, 14–22 (2019)
Article Google Scholar
Zamprogno, M., et al.: Video-based convolutional attention for person re-identification. In: Image Analysis and Processing-ICIAP 2019: 20th International Conference, Trento, Italy, September 9–13, 2019, Proceedings, Part I 20. pp. 3–14. Springer (2019)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Humanities and Cultural Heritage (DIUM), University of Udine (UNIUD), Udine, Italy
Michael Castronuovo, Alessandro Fiordelmondo & Cosetta Saba

Authors

Michael Castronuovo
View author publications
You can also search for this author in PubMed Google Scholar
Alessandro Fiordelmondo
View author publications
You can also search for this author in PubMed Google Scholar
Cosetta Saba
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The paper has been conceived, discussed and planned by all three authors. Michael Castronuovo has written Sects. 3-4, Alessandro Fiordelmondo planned and carried out the implementation of a prototype application, and Cosetta Saba has written Sects. 1-2.

Corresponding author

Correspondence to Alessandro Fiordelmondo .

Editor information

Editors and Affiliations

University of Udine, Udine, Italy
Gian Luca Foresti
University of Udine, Udine, Italy
Andrea Fusiello
University of York, York, UK
Edwin Hancock

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Castronuovo, M., Fiordelmondo, A., Saba, C. (2024). Toward a System of Visual Classification, Analysis and Recognition of Performance-Based Moving Images in the Artistic Field. In: Foresti, G.L., Fusiello, A., Hancock, E. (eds) Image Analysis and Processing - ICIAP 2023 Workshops. ICIAP 2023. Lecture Notes in Computer Science, vol 14366. Springer, Cham. https://doi.org/10.1007/978-3-031-51026-7_29

Download citation

DOI: https://doi.org/10.1007/978-3-031-51026-7_29
Published: 21 January 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-51025-0
Online ISBN: 978-3-031-51026-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Toward a System of Visual Classification, Analysis and Recognition of Performance-Based Moving Images in the Artistic Field