ABSTRACT
Over the past decade, the quality of images on the web has improved significantly, and image processing technologies such as monocular depth estimation are opening up new possibilities for various applications. Despite these developments, the way image descriptions are used for image accessibility has remained stagnant since alt text was introduced with HTML 2.0 in 1995. This paper presents the concept of Dimensional alt text, which enables users to navigate image descriptions through three dimensional layers: the foreground, middle ground, and background. Our research findings suggest that providing space for image descriptions on each dimensional layer can help users build a mental image of the photo, resulting in better spatial understanding. For future work, we discuss extending the prototype's use cases to a broader range of users and investigating a hybrid authoring model that combines human authorship with AI assistance.
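To make the layering idea concrete, the following is a minimal sketch of how per-layer descriptions might be stored and read out one layer at a time. Only the three layer names come from the abstract; the `DimensionalAltText` class, its fields, and the fallback message are hypothetical and do not describe the prototype's actual implementation.

```python
from dataclasses import dataclass
from typing import List

# Layer names follow the abstract; everything else is a hypothetical illustration.
LAYERS = ("foreground", "middle ground", "background")


@dataclass
class DimensionalAltText:
    """One short summary plus an optional description per depth layer."""
    summary: str
    foreground: str = ""
    middle_ground: str = ""
    background: str = ""

    def layer(self, name: str) -> str:
        """Return the description for one layer, or a fallback if it is empty."""
        if name not in LAYERS:
            raise ValueError(f"unknown layer: {name!r}")
        text = getattr(self, name.replace(" ", "_"))
        return text or f"No description for the {name}."

    def read_all(self) -> List[str]:
        """Return the summary followed by each layer, front to back."""
        return [self.summary] + [
            f"{name.capitalize()}: {self.layer(name)}" for name in LAYERS
        ]


alt = DimensionalAltText(
    summary="A hiker on a ridge",
    foreground="A hiker in a red jacket",
    background="Snowy peaks",
)
for line in alt.read_all():
    print(line)
```

Keeping the layers as separate fields, rather than one concatenated string, is what would let a reader step through the foreground, middle ground, and background independently.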
Index Terms
- Dimensional alt text: Enhancing Spatial Understanding through Dimensional Layering of Image Descriptions for Screen Reader Users