poster

Tightly coupling visual and linguistic features for enriching audio-based web browsing experience

Authors:

Muhammad Asiful Islam,

Faisal Ahmed,

Yevgen Borodin,

I. V. Ramakrishnan

CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management

Pages 2085 - 2088

https://doi.org/10.1145/2063576.2063896

Published: 24 October 2011

Abstract

People who are blind use screen readers to browse web pages. Because screen readers read content serially, a naive readout tends to mix relevant and irrelevant content, which disrupts the coherence of the material and confuses the listener. To address this problem, a web page can be partitioned into coherent segments, each of which is narrated separately. Existing segmentation methods rely on visual and structural cues without taking semantics into account, and consequently create segments that contain irrelevant material. In this paper, we describe a new technique that creates coherent segments by tightly coupling the visual, structural, and linguistic features present in the content. A notable aspect of the technique is that it produces segments with little irrelevant content. Preliminary experiments indicate that the technique is effective in creating highly coherent segments, and the experiences of an early adopter who is blind suggest that it enriches the overall browsing experience.
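The abstract does not say how the visual, structural, and linguistic features are combined. The Python below is only a rough sketch of the general idea and is not the authors' algorithm: it merges adjacent page blocks when a weighted mix of a visual cue (vertical gap and font size) and a linguistic cue (bag-of-words cosine similarity) is high enough. Every name here (`Block`, `visual_affinity`, `segment`) and every number (`max_gap`, `weight`, `threshold`) is a hypothetical choice made for illustration.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Block:
    """A rendered chunk of page text (hypothetical representation)."""
    text: str
    top: float        # y-coordinate of the block's top edge, in pixels
    height: float     # rendered height, in pixels
    font_size: float  # font size, used as a crude style cue


def bag_of_words(text):
    """Lowercased word counts; a stand-in for richer linguistic features."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def visual_affinity(prev, cur, max_gap=40.0):
    """Score in [0, 1]: high when blocks are vertically close and share a font size."""
    gap = max(0.0, cur.top - (prev.top + prev.height))
    closeness = max(0.0, 1.0 - gap / max_gap)
    same_style = 1.0 if prev.font_size == cur.font_size else 0.5
    return closeness * same_style


def segment(blocks, weight=0.5, threshold=0.3):
    """Greedily group blocks (in reading order) into segments.

    A block joins the current segment when the weighted sum of visual
    affinity and lexical similarity to the previous block reaches `threshold`.
    """
    segments = []
    for block in blocks:
        if segments:
            prev = segments[-1][-1]
            lexical = cosine(bag_of_words(prev.text), bag_of_words(block.text))
            score = weight * visual_affinity(prev, block) + (1 - weight) * lexical
            if score >= threshold:
                segments[-1].append(block)
                continue
        segments.append([block])
    return segments


page = [
    Block("Storm closes city schools", top=0, height=20, font_size=14),
    Block("City schools will stay closed after the storm", top=25, height=20, font_size=14),
    Block("Subscribe now for discount offers", top=200, height=20, font_size=10),
]
for seg in segment(page):
    print([b.text for b in seg])
```

In this toy page the headline and the story sentence end up in one segment, while the distant, lexically unrelated promotion starts a new one. A screen reader could then narrate each segment separately.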

Index Terms

1. Human-centered computing
  1. Human computer interaction (HCI)
2. Information systems
  1. Information retrieval

Published In

CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management

October 2011

2712 pages

ISBN: 9781450307178

DOI: 10.1145/2063576

Editors:
Bettina Berendt,
Arjen de Vries,
Wenfei Fan,
Craig Macdonald (University of Glasgow, UK),
Iadh Ounis (University of Glasgow, UK),
Ian Ruthven (University of Strathclyde, UK)

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Poster

Conference

CIKM '11

CIKM '11: International Conference on Information and Knowledge Management

October 24 - 28, 2011

Glasgow, Scotland, UK

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%
