Abstract. The automatic retrieval of indexing information from colored paper documents is a challenging problem. In order to build up bibliographic databases, editing by humans is usually necessary to provide information about title, authors and keywords. For automating the indexing process, the identification of text elements is essential. In this article an approach to automatic text extraction from colored book and journal covers is proposed. Two methods have been developed for extracting text hypotheses. The results of both methods are combined to robustly distinguish between text and non-text elements.
Similar content being viewed by others
Author information
Authors and Affiliations
Additional information
Received November 3, 1999 / Revised January 21, 2000
Rights and permissions
About this article
Cite this article
Sobottka, K., Kronenberg, H., Perroud, T. et al. Text extraction from colored book and journal covers. IJDAR 2, 163–176 (2000). https://doi.org/10.1007/PL00021523
Issue Date:
DOI: https://doi.org/10.1007/PL00021523