Skip to main content
Log in

Software architecture of PSET: a page segmentation evaluation toolkit

  • Original Research Paper
  • Published:
International Journal on Document Analysis and Recognition Aims and scope Submit manuscript

Abstract.

Empirical performance evaluation of page segmentation algorithms has become increasingly important due to the numerous algorithms that are being proposed each year. In order to choose between these algorithms for a specific domain it is important to empirically evaluate their performance. To accomplish this task the document image analysis community needs: i) standardized document image datasets with groundtruth; ii) evaluation metrics that are agreed upon by researchers; and iii) freely available software for evaluating new algorithms and replicating other researchers' results. In an earlier paper (IEEE Transactions on Pattern Analysis and Machine Intelligence 2001) we published evaluation results for various popular page segmentation algorithms using the University of Washington dataset. In this paper we describe the software architecture of the PSET evaluation package, which was used to evaluate the segmentation algorithms. The description of the architecture will allow researchers to understand the software better, replicate our results, evaluate new algorithms, experiment with new metrics and datasets, etc. The software is written using the C language on the SUN/UNIX platform and is being made available to researchers at no cost.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

Author information

Authors and Affiliations

Authors

Additional information

Received October 2, 2000 / Accepted September 7, 2001

Rights and permissions

Reprints and permissions

About this article

Cite this article

Mao, S., Kanungo, T. Software architecture of PSET: a page segmentation evaluation toolkit. IJDAR 4, 205–217 (2002). https://doi.org/10.1007/s100320200070

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s100320200070

Navigation