Abstract
This essay examines the effects of orthography on information retrieval (IR). IR systems use white-space normalization to produce graphic, or orthographic, words from text. Different database vendor systems then treat the punctuation in these graphic words idiosyncratically. This two-step process produces the index terms that IR users must match when formulating queries. The essay argues that these index terms are often unpredictable and therefore difficult to match. In this manner, textual aesthetics impedes IR. The textual aesthetics of the space, the hyphen, the apostrophe, and stopwords are discussed. Examples are given for the commercial database vendor systems DIALOG, DataStar, and OCLC EPIC. Implications are drawn for the manipulation of written language on the World Wide Web.
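The two-step process described in the abstract can be illustrated with a minimal sketch. It assumes two hypothetical punctuation policies (`policy_drop` and `policy_split`) invented for illustration; they are not the documented behavior of DIALOG, DataStar, or OCLC EPIC. The sketch only shows how the same text can yield different index terms depending on how a system treats hyphens and apostrophes.

```python
import re


def graphic_words(text):
    """Step 1: white-space normalization.

    str.split() with no arguments collapses runs of white space,
    yielding the graphic (orthographic) words of the text.
    """
    return text.split()


def policy_drop(word):
    """Hypothetical policy A: delete hyphens and apostrophes, fusing the pieces."""
    return [re.sub(r"[-']", "", word)]


def policy_split(word):
    """Hypothetical policy B: treat hyphens and apostrophes as separators."""
    return [part for part in re.split(r"[-']", word) if part]


def index_terms(text, policy):
    """Step 2: apply a punctuation policy to each graphic word."""
    terms = []
    for word in graphic_words(text):
        # Trim common leading/trailing punctuation and fold case
        # (an assumption made here to keep the example small).
        core = word.strip(".,;:!?\"()").lower()
        if core:
            terms.extend(policy(core))
    return terms


sample = "The user's on-line  catalog"
print(index_terms(sample, policy_drop))   # ['the', 'users', 'online', 'catalog']
print(index_terms(sample, policy_split))  # ['the', 'user', 's', 'on', 'line', 'catalog']
```

Under the first policy a searcher must enter "online" to match; under the second, "on" and "line" are separate terms. A user who does not know which policy is in force cannot predict the index term, which is the difficulty the essay describes.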
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Brooks, T.A. (1998). The effect of textual aesthetics on information retrieval. In: Hersch, R.D., André, J., Brown, H. (eds) Electronic Publishing, Artistic Imaging, and Digital Typography. RIDT 1998. Lecture Notes in Computer Science, vol 1375. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0053291
DOI: https://doi.org/10.1007/BFb0053291
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64298-5
Online ISBN: 978-3-540-69718-3
eBook Packages: Springer Book Archive