skip to main content
10.1145/2063576.2063896acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
poster

Tightly coupling visual and linguistic features for enriching audio-based web browsing experience

Published: 24 October 2011 Publication History

Abstract

People who are blind use screen readers for browsing web pages. Since screen readers read out content serially, a naive readout tends to mix irrelevant and relevant content thereby disrupting the coherency of the material being read out and confusing the listener. To address this problem we can partition web pages into coherent segments and narrate each such piece separately. Extant methods to do segmentation use visual and structural cues without taking the semantics into account and consequently create segments containing irrelevant material. In this paper, we describe a new technique for creating coherent segments by tightly coupling visual, structural, and linguistic features present in the content. A notable aspect of the technique is that it produces segments with little irrelevant content. Preliminary experiments indicate that the technique is effective in creating highly coherent segments and the experiences of an early adopter who is blind suggest that it enriches the overall browsing experience.

References

[1]
Apple. Voiceover, screen reader from apple (http://www.apple.com/accessibility/voiceover). 2010.
[2]
Y. Borodin, F. Ahmed, M. A. Islam, Y. Puzis, V. Melnyk, S. Feng, I. V. Ramakrishnan, and G. Dausch. Hearsay: a new generation context-driven multi-modal assistive web browser. In WWW, 2010.
[3]
D. Cai, S. Yu, J.-R. Wen, and W.-Y. Ma. VIPS: a vision-based page segmentation algorithm. Microsoft Technical Report, (MSR-TR-2003-79), 2003.
[4]
D. Chakrabarti, R. Kumar, and K. Punera. A graph-theoretic approach to webpage segmentation. In WWW, pages 377--386, 2008.
[5]
H.-F. Guo, J. Mahmud, Y. Borodin, A. Stent, and I. V. Ramakrishnan. A general approach for partitioning web page content based on geometric and style information. In ICDAR, pages 929--933, 2007.
[6]
JAWS. (http://www.freedomscientific.com). 2010.
[7]
J. Mahmud, Y. Borodin, and I. V. Ramakrishnan. Csurf: a context-driven non-visual web-browser. In WWW, pages 31--40, 2007.
[8]
C. D. Manning, P. Raghavan, and H. Schütze. Introduction to information retrieval. Cambridge University Press, 2008.
[9]
Readability. (https://www.readability.com). 2010.
[10]
A. Strehl. Relationship-based clustering and cluster ensembles for high-dimensional data mining. PhD thesis, UT Austin, May 2002.

Cited By

View all
  • (2023)Speaking with My Screen Reader: Using Audio Fictions to Explore Conversational Access to InterfacesProceedings of the 25th International ACM SIGACCESS Conference on Computers and Accessibility10.1145/3597638.3608404(1-18)Online publication date: 22-Oct-2023
  • (2019)VERSEProceedings of the 21st International ACM SIGACCESS Conference on Computers and Accessibility10.1145/3308561.3353773(414-426)Online publication date: 24-Oct-2019
  • (2019)Alternative Nonvisual Web Browsing TechniquesWeb Accessibility10.1007/978-1-4471-7440-0_32(629-649)Online publication date: 4-Jun-2019
  • Show More Cited By

Index Terms

  1. Tightly coupling visual and linguistic features for enriching audio-based web browsing experience

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management
      October 2011
      2712 pages
      ISBN:9781450307178
      DOI:10.1145/2063576
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 24 October 2011

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. blind
      2. clustering
      3. screen reader
      4. segmentation
      5. singular value decomposition

      Qualifiers

      • Poster

      Conference

      CIKM '11
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

      Upcoming Conference

      CIKM '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)9
      • Downloads (Last 6 weeks)2
      Reflects downloads up to 17 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Speaking with My Screen Reader: Using Audio Fictions to Explore Conversational Access to InterfacesProceedings of the 25th International ACM SIGACCESS Conference on Computers and Accessibility10.1145/3597638.3608404(1-18)Online publication date: 22-Oct-2023
      • (2019)VERSEProceedings of the 21st International ACM SIGACCESS Conference on Computers and Accessibility10.1145/3308561.3353773(414-426)Online publication date: 24-Oct-2019
      • (2019)Alternative Nonvisual Web Browsing TechniquesWeb Accessibility10.1007/978-1-4471-7440-0_32(629-649)Online publication date: 4-Jun-2019
      • (2017)Non-visual Web Browsing: Beyond Web AccessibilityUniversal Access in Human–Computer Interaction. Designing Novel Interactions10.1007/978-3-319-58703-5_24(322-334)Online publication date: 16-May-2017
      • (2015)Haptic gloves for audio-tactile web accessibilityProceedings of the 12th International Web for All Conference10.1145/2745555.2746671(1-2)Online publication date: 18-May-2015
      • (2015)Feel the WebProceedings of the 17th International ACM SIGACCESS Conference on Computers & Accessibility10.1145/2700648.2811385(391-392)Online publication date: 26-Oct-2015
      • (2012)Thematic organization of web content for distraction-free text-to-speech narrationProceedings of the 14th international ACM SIGACCESS conference on Computers and accessibility10.1145/2384916.2384920(17-24)Online publication date: 22-Oct-2012

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media