Abstract
What is the most important reason for using Computer Vision methods in humanities research? In this article, I argue that numerical representation and data analysis methods offer a new language for describing cultural artifacts, experiences and dynamics. Human languages such as English or Russian, which developed rather recently in human evolution, are not good at capturing the analog properties of human sensorial and cultural experiences. These limitations become particularly worrying if we want to compare thousands, millions or billions of artifacts, i.e. to study contemporary media and cultures at their new twenty-first century scale. When we instead use the numerical measurements of image properties that are standard in Computer Vision, we can better capture the details of a single artifact as well as the visual differences between a number of artifacts, even if these differences are very small. Examples of visual dimensions that numbers can capture better than languages include color, shape, texture, contours, composition, and the visual characteristics of represented faces, bodies and objects. The methods for finding structures and relationships in large numerical datasets developed in statistics and machine learning allow us to extend this analysis to very big datasets of cultural objects. Equally importantly, the numerical image features used in Computer Vision also give us a new language to represent gradual and continuous temporal changes, something that natural languages are also bad at. This applies both to single artworks such as a film or a dance piece (describing movement and rhythm) and to changes in visual characteristics across millions of artifacts over decades or centuries.
Funding
No funding supported writing this article.
Ethics declarations
Conflicts of interest
All authors declare that they have no conflict of interest.
Cite this article
Manovich, L. Computer vision, human senses, and language of art. AI & Soc 36, 1145–1152 (2021). https://doi.org/10.1007/s00146-020-01094-9
DOI: https://doi.org/10.1007/s00146-020-01094-9