skip to main content
10.1145/1815330.1815333acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdasConference Proceedingsconference-collections
research-article

A framework for the assessment of text extraction algorithms on complex colour images

Published: 09 June 2010 Publication History

Abstract

The availability of open, ground-truthed datasets and clear performance metrics is a crucial factor in the development of an application domain. The domain of colour text image analysis (real scenes, Web and spam images, scanned colour documents) has traditionally suffered from a lack of a comprehensive performance evaluation framework. Such a framework is extremely difficult to specify, and corresponding pixel-level accurate information tedious to define. In this paper we discuss the challenges and technical issues associated with developing such a framework. Then, we describe a complete framework for the evaluation of text extraction methods at multiple levels, provide a detailed ground-truth specification and present a case study on how this framework can be used in a real-life situation.

References

[1]
T. Retornaz and B. Marcotegui, "Scene text localization based on the ultimate opening", Proceedings of the 8th International Symposium on Mathematical Morphology, Rio de Janeiro, Brazil, Oct. 10--13, 2007, Vol. 1, pp. 177--188.
[2]
A. Clavelli and D. Karatzas, "Text Segmentation in Colour Posters from the Spanish Civil War Era", Proceedings of the 10th International Conference on Document Analysis and Recognition, IEEE CPS, pp. 181--185, 2009
[3]
S. J. Perantonis, B. Gatos, V. Maragos, V. Karkaletsis and G. Petasis, "Text Area Identification in Web Images", in "Methods and Applications of Artificial Intelligence", LNCS 3025, pp. 82--92, 2004
[4]
D. Lopresti and J. Zhou, "Locating and Recognizing Text in WWW Images," Information Retrieval, vol. 2, 2000, pp. 177--206
[5]
D. Karatzas, A. Antonacopoulos, "Colour Text Segmentation in Web Images Based on Human Perception", Image and Vision Computing, 25(5), pp. 564--577, 2007
[6]
Kwang In Kim, Keechul Jung, and Jin Hyung Kim, "Texture-Based Approach for Text Detection in Images Using Support Vector Machines and Continuously Adaptive Mean Shift Algorithm", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 25, No, 12, 2003
[7]
S. M. Lucas, A. Panaretos, L. Sosa, A. Tang, S. Wong, R. Young, K. Ashida, H. Nagai, M. Okamoto, H. Yamamoto, H. Miyao, J. Zhu, W. Ou, C. Wolf, J. M. Jolion, L. Todoran, M. Worring, and X. Lin, "ICDAR 2003 robust reading competitions: entries, results, and future directions", International Journal on Document Analysis and Recognition Vol. 7, no. 2--3, pp. 105--122, 2005
[8]
C. Wolf, J.-M. Jolion, "Object count/area graphs for the evaluation of object detection and segmentation algorithms", International Journal of Document Analysis, 8(4), pp. 280--296, 2006
[9]
A. Antonacopoulos, D. Karatzas and D. Bridson, "Ground truth for layout analysis performance evaluation", In Proceedings 7th IAPR Document Analysis Workshop, 2006
[10]
J. Liang, I. Phillips, R. Haralick, "Performance evaluation of document layout analysis algorithms on the UW data set", In: Proceedings of SPIE, Document Recognition IV, pp 149--160, 1997
[11]
F. Einsele, R. Ingold, J. Hennebert, "A Language-Independent, Open-Vocabulary System Based on HMMs for Recognition of Ultra Low Resolution Words", Journal of Universal Computer Science, vol. 14, no. 18, pp. 2982--2997, 2008
[12]
M. A. Moll, H. S. Baird and C. An, "Truthing for Pixel-Accurate Segmentation", Proceedings of the 8th International Workshop on Document Analysis Systems, IEEE CPS, pp. 379--385
[13]
K. Ntirogiannis, B. Gatos and I. Pratikakis, "An Objective Evaluation Methodology for Document Image Binarization Techniques", Proceedings of the 8th International Workshop on Document Analysis Systems, IEEE CPS, pp. 217--224
[14]
http://algoval.essex.ac.uk/icdar/Datasets.html

Cited By

View all
  • (2021)Improving information extraction from visually rich documents using visual span representationsProceedings of the VLDB Endowment10.14778/3446095.344610414:5(822-834)Online publication date: 1-Jan-2021
  • (2021)Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR46437.2021.01187(12040-12050)Online publication date: Jun-2021
  • (2020)Improvement of the end-to-end scene text recognition method for “text-to-speech” conversionInternational Journal of Wavelets, Multiresolution and Information Processing10.1142/S021969132050052618:06(2050052)Online publication date: 15-Sep-2020
  • Show More Cited By

Index Terms

  1. A framework for the assessment of text extraction algorithms on complex colour images

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
    June 2010
    490 pages
    ISBN:9781605587738
    DOI:10.1145/1815330
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 09 June 2010

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. colour
    2. performance evaluation
    3. text extraction

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    DAS '10

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Improving information extraction from visually rich documents using visual span representationsProceedings of the VLDB Endowment10.14778/3446095.344610414:5(822-834)Online publication date: 1-Jan-2021
    • (2021)Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR46437.2021.01187(12040-12050)Online publication date: Jun-2021
    • (2020)Improvement of the end-to-end scene text recognition method for “text-to-speech” conversionInternational Journal of Wavelets, Multiresolution and Information Processing10.1142/S021969132050052618:06(2050052)Online publication date: 15-Sep-2020
    • (2020)Region Growing-Based Scheme for Extraction of Text from Scene ImagesProceedings of International Conference on Frontiers in Computing and Systems10.1007/978-981-15-7834-2_14(149-155)Online publication date: 24-Nov-2020
    • (2019)A new video text extraction using local laplacian filters and mean shiftMultimedia Tools and Applications10.1007/s11042-018-6451-178:6(6989-7004)Online publication date: 1-Mar-2019
    • (2019)SVM and MLP Based Segmentation and Recognition of Text from Scene Images Through an Effective Binarization SchemeComputational Intelligence in Pattern Recognition10.1007/978-981-13-9042-5_20(237-246)Online publication date: 18-Aug-2019
    • (2018)A novel method for binarization of scene text images and its application in text identificationPattern Analysis and Applications10.1007/s10044-018-0687-2Online publication date: 14-Feb-2018
    • (2017)Scene text segmentation using low variation extremal regions and sorting based character groupingNeurocomputing10.1016/j.neucom.2017.05.021266:C(56-65)Online publication date: 29-Nov-2017
    • (2017)Unsupervised refinement of color and stroke features for text binarizationInternational Journal on Document Analysis and Recognition10.1007/s10032-017-0283-920:2(105-121)Online publication date: 1-Jun-2017
    • (2016)From Text Detection to Text Segmentation: A Unified Evaluation SchemeComputer Vision – ECCV 2016 Workshops10.1007/978-3-319-46604-0_28(378-394)Online publication date: 18-Sep-2016
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media