ABSTRACT
In the past, researchers studied readability enhancement of English articles for non-native English readers, either on paper reading or hypertext reading. Using a variety of methods, researchers were able to enhance the reading comprehension and the users' satisfaction on hypertext reading, such as changing content presentation with visual-syntactic text formatting (VSTF) format or Jenga format. In terms of dynamically changing content presentation for reading, one less explored format is Portable Document Format (PDF), which was traditionally viewed within a modern Web browser or Adobe Acrobat reader on the desktop. PDF format was standardized as an open format in 2008 and has been widely used to keep a fixed-layout content. However, a fixed layout document presents a challenge to apply existing transformation methods, not mention on mobile devices. In this paper, we present a system that uses a novel algorithm to decode a PDF document and apply content transformation to enhance its readability. Although we used Jenga format as an example to enhance the readability of PDF documents, we envision the proposed framework can be used to adopt different transformation methods. The system was implemented in a mobile device and we are able to apply a basic transformation to a PDF document at both the sentence and paragraph levels. The main contribution of this research is we extend previous work of readability enhancement from paper document and hypertext content to PDF documents. Current result is promising, and we believe it is worth further investigation to make PDF documents readable and accessible on the Web for different populations, such as non-native English readers, people with dyslexia or special needs, etc.
- Nicholas Chen et al. Navigation techniques for dual-display e-book readers. In Proceeding of the twenty-sixth annual CHI conference on Human factors in computing systems - CHI '08, page 1779, New York, New York, USA, April 2008. ACM Press.Google Scholar
- Bill N. Schilit, Gene Golovchinsky, and Morgan N. Price. Beyond paper: supporting active reading with free form digital ink annotations. In Proceedings of the SIGCHI conference on Human factors in computing systems - CHI '98, pages 249--256, New York, New York, USA, January 1998. ACM Press.Google ScholarDigital Library
- Aristidis Protopsaltis and Vassiliki Bouki. The effects of reading goals in hypertext reading. In Proceedings of the 24th annual conference on Design of communication - SIGDOC '06, pages 29--34, New York, New York, USA, October 2006. ACM Press.Google ScholarDigital Library
- Aristidis Protopsaltis and Vassiliki Bouki. The effects of reading goals in hypertext reading. In Proceedings of the 24th annual conference on Design of communication - SIGDOC '06, pages 29--34, New York, New York, USA, October 2006. ACM Press.Google ScholarDigital Library
- Stan Walker, Phil Schloss, Charles R Fletcher, Charles A Vogel, and Randall C Walker. Visual-Syntactic Text Formatting: A New Method to Enhance Online Reading. Reading Online An Electronic Journal, 8(6), 2005.Google Scholar
- Chen-Hsiang Yu and Robert C. Miller, Enhancing web page readability for non-native readers, Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, April 10--15, 2010, Atlanta, Georgia, USAGoogle ScholarDigital Library
- https://www.gsmaintelligence.com/research/?file=061ad2d2417d6ed1ab002da0dbc9ce22&downloadGoogle Scholar
- https://w3techs.com/technologies/history_overview/content_language/ms/yGoogle Scholar
- Ammon, U (2015) The Status of the German Language in the World, cited in Noack, R and Gamio, L (2015) The world's languages, in 7 maps and charts. Washington Post 23 April 2015. Available online at: https://www.washingtonpost.com/news/worldviews/wp/2015/04/23/the-worlds-languages-in7-maps-and-charts/?utm_term=.c7342219767bGoogle Scholar
- https://tools.ietf.org/html/rfc3778Google Scholar
- https://www.iso.org/news/2008/07/Ref1141.htmlGoogle Scholar
- Android PdfRender. https://developer.android.com/reference/android/graphics/pdf/PdfRendererGoogle Scholar
- PdfBox-Android. https://github.com/TomRoush/PdfBox-AndroidGoogle Scholar
- Josef B. Baker, Alan P. Sexton and Volker. MaxTract: Converting PDF to LATEX, MathML and Text. In J. Jeuring, J. A. Campbell, J. Carette, G. D. Reis, P. Sojka, M. Wenzel, and V. Sorge, editors, AISC/DML/MKM/Calculemus, volume 7362 of Lecture Notes in Computer Science, pages 422--426. Springer, 2012.Google Scholar
- Masakazu Suzuki, Tamari Fumikazu, Fukuda Ryoji, Uchida Seiichi and Kanahori Toshihiro. Infty: an integrated ocr system for mathematical documents. In DocEng '03: Proceedings of the 2003 ACM symposium on Document engineering, pages 95--104, New York, NY, USA, 2003. ACM Press.Google ScholarDigital Library
Recommendations
Enhancing web page readability for non-native readers
CHI '10: Proceedings of the SIGCHI Conference on Human Factors in Computing SystemsReaders face many obstacles on today's Web, including distracting content competing for the user's attention and other factors interfering with comfortable reading. On today's primarily English-language Web, non-native readers encounter even more ...
Interactive Repair of Tables Extracted from PDF Documents on Mobile Devices
CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing SystemsPDF documents often contain rich data tables that offer opportunities for dynamic reuse in new interactive applications. We describe a pipeline for extracting, analyzing, and parsing PDF tables based on existing machine learning and rule-based ...
Accessibility Devices for Mobile Interfaces Extensions: A Survey
MobileHCI '15: Proceedings of the 17th International Conference on Human-Computer Interaction with Mobile Devices and Services AdjunctThe development of mobile applications that consider accessibility can uniquely make use of software extensions to provide special interfaces to people with impairments. However, such extensions are limited so that the use of external devices is a ...
Comments