Article

Reprocessing paper-based reference materials for the digital environment

Author:
P. Bryan Heidorn

University of Illinois, Champaign, IL

University of Illinois, Champaign, IL
View Profile

JCDL '02: Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital librariesJuly 2002Pages 377https://doi.org/10.1145/544220.544324

Published:14 July 2002Publication History

JCDL '02: Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries

Pages 377

ABSTRACT

One of the primary challenges for the creation of digital libraries is to enhance the value of paper-based publications by providing digital access to the materials. Simple full-text searching is just a first step in this process. Better functionality may be gained by exploiting the natural structure within text. The following paper describes the process of digital conversion and integration of encyclopedic publications, glossaries and thesauri. The Biological Information Browsing (http://www.biobrowser.org) team developed text-processing tools, and an information retrieval and visualization environment that provides greater functionality for these traditionally paper-based publications [1]. The process includes automatic text segmentation and structuring, automated XML markup, structure-based indexing, automatic thesaurus extraction for query expansion and on-line definitions. Very few other information systems provide complete services for publishing, indexing, XML query and retrieving documents.

References

Heidorn, P. Bryan. (2001) A Tool for Multipurpose Use of Online Flora and Fauna: The Biological Information Browsing Environment (BIBE), First Monday, 6(2) (February 2001). {http://firstmonday.org/}Google Scholar

Index Terms

Reprocessing paper-based reference materials for the digital environment
1. Applied computing
  1. Computers in other domains
    1. Digital libraries and archives
2. Information systems
  1. Information systems applications
    1. Digital libraries and archives

Recommendations

Some results using different approaches to merge visual and text-based features in CLEF'08 photo collection
CLEF'08: Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access

This paper describes the participation of the MIRACLE team at the ImageCLEF Photographic Retrieval task of CLEF 2008. We succeeded in submitting 41 runs. Obtained results from text-based retrieval are better than content-based as previous experiments in ...
Read More
Multimedia retrieval by means of merge of results from textual and content based retrieval subsystems
CLEF'09: Proceedings of the 10th international conference on Cross-language evaluation forum: multimedia experiments

The main goal of this paper it is to present our experiments in ImageCLEF 2009 Campaign (photo retrieval task). In 2008 we proved empirically that the Text-based Image Retrieval (TBIR) methods defeats the Content-based Image Retrieval CBIR "quality" of ...
Read More
An XQuery engine for digital library systems
JCDL '03: Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries

XML is now a standard markup language for web information. Many application areas are producing XML documents on the web. This situation urges digital library systems to deal with not only typical text documents but also XML documents. XML documents are ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
JCDL '02: Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
July 2002
448 pages
ISBN:1581135130
DOI:10.1145/544220
General Chair:
William Hersh
Oregon Health & Science University
,
Program Chair:
Gary Marchionini
University of North Carolina at Chapel Hill
Copyright © 2002 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 14 July 2002
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
XML
electronic publishing
indexing
information retrieval
structured text
Qualifiers
- Article
Conference

Acceptance Rates
JCDL '02 Paper Acceptance Rate69of240submissions,29%Overall Acceptance Rate415of1,482submissions,28%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 250
  Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Reprocessing paper-based reference materials for the digital environment

JCDL '02: Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries

ABSTRACT

References

Cited By

Index Terms

Recommendations

Some results using different approaches to merge visual and text-based features in CLEF'08 photo collection

Multimedia retrieval by means of merge of results from textual and content based retrieval subsystems

An XQuery engine for digital library systems