Skip to main content
Log in

Abstract.

This paper is concerned with research on OCR (optical character recognition) of printed mathematical expressions. Construction of a representative corpus of technical and scientific documents containing expressions is discussed. A statistical investigation of the corpus is presented, and usefulness of this analysis is demonstrated in the related research problems, namely, (i) identification and segmentation of expression zones from the rest of the document, (ii) recognition of expression symbols, (iii) interpretation of expression structures, and (iv) performance evaluation of a mathematical expression recognition system. Moreover, a groundtruthing format has been proposed to facilitate automatic evaluation of expression recognition techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Fateman RJ, Tokuyasu T (1996) Progress in recognizing typeset mathematics. In: Proc. SPIE, San Jose, CA, 2660:37-50

  2. Blostein D, Grbavec A (1997) Recognition of mathematical notation. In: Bunke H, Wang PSP (eds) Handbook of character recognition and document image analysis. World Scientific, Singapore, pp 557-582

  3. Chan K-F, Yeung D-Y (2000) Mathematical expression recognition: a survey. Int J Doc Anal Recog 3(1):3-15

    Google Scholar 

  4. Phillips I (1998) Methodologies for using UW databases for OCR and image understanding systems. Proc. SPIE, Document Recognition V, 3305:112-127

    Google Scholar 

  5. Chan K-F, Yeung D-Y (2001) Error detection, error correction and performance evaluation in on-line mathematical expression recognition. Pattern Recog 34:1671-1684

    Google Scholar 

  6. Okamoto M, Imai H, Takagi K (2001) Performance evaluation of a robust method for mathematical expression recognition. In: Proc. 6th international conference on document analysis and recognition, Seattle, pp 121-128

  7. Zanibbi R, Blostein D, Cordy JR (2002) Recognizing mathematical expressions using tree transformation. IEEE Trans Pattern Anal Mach Intell 24(11):1455-1467

    Google Scholar 

  8. Raman TV (1994) Audio system for technical readings. Doctoral dissertation, Cornell University, Ithaca, NY

  9. Garain U, Chaudhuri BB (2001) On development and statistical analysis of a corpus for printed and handwritten mathematical expressions. In: Proc. 4th IAPR international workshop on graphics recognition (GREC2001), Canada, 2001, pp 429-439

  10. Jain AK, Yu B (1998) Document representation and its application to page decomposition. IEEE Trans Pattern Anal Mach Intell 20(3):294-308

    Google Scholar 

  11. Pavlidis T, Zhou J (1992) Page segmentation and classification. Comput Vis Graph Image Process 54:484-496

    Google Scholar 

  12. Lee HJ, Wang J-S (1995) Design of a mathematical expression recognition system. In: Proc. 3rd international conference on document analysis and recognition, Montreal, pp 1084--1087

  13. Toumit J-Y, Garcia-Salicetti S, Emptoz H (1999) A hierarchical and recursive model of mathematical expressions for automatic reading of mathematical documents. In: Proc. 5th international conference on document analysis and recognition, Bangalore, India, pp 119-122

  14. Fateman RJ (2000) How to find mathematics on a scanned page. In: Proc. Document Recognition and Retrieval VII, January 2000, San Jose, CA

  15. Kacem A, Belaid A, Ben Ahmed M (2001) Automatic extraction of printed mathematical formulas using fuzzy logic and propagation of context. Int J Doc Anal Recog 4(2):97-108

    Google Scholar 

  16. Chowdhury SP, Mandal S, Das AK, Chanda B (2003) Automated segmentation of math-zones from document images. In: Proc. 7th international conference on document analysis and recognition, Edinburgh, UK, pp 755-759

  17. Jin J, Han X, Wang Q (2003) Mathematical formulas extraction. In: Proc. 7th international conference on document analysis and recognition, Edinburgh, UK, pp 1138-1141

  18. Suzuki M, Tamari F, Fukuda R, Uchida S, Kanahori T (2003) INFTY - An integrated OCR system for mathematical documents. In: Proc. ACM symposium on document engineering (DocEng), Grenoble, France, pp 95-104

  19. Chaudhuri BB, Garain U (1998) Automatic detection of italic, bold and all-capital words in document. In: Proc. 14th international conference on pattern recognition (ICPR), Brisbane, Australia, pp 610-612

  20. Chaudhuri BB, Garain U (2001) Extraction of type style based meta-information from imaged documents. Int J Doc Anal Recog 3(3):138-149

    Google Scholar 

  21. Chaudhuri BB, Garain U (2000) An approach for recognition and interpretation of mathematical expressions in printed documents. Pattern Anal Appl 3:120-131

    Google Scholar 

  22. Mitra J, Garain U, Chaudhuri BB, Swamy HVK, Pal T (2003) Automatic understanding of structures in printed mathematical expressions. In: Proc. 7th international conference on document analysis and recognition, Edinburgh, UK, pp 540-544

  23. Zipf KG (1949) Human behavior and the principle of least effort: an introduction to human ecology. Addison-Wesley, Reading, MA

  24. Phillips I, Chhabra A (1999) Empirical performance evaluation of graphics recognition systems. IEEE Trans Pattern Anal Mach Intell 21(9):849-870

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Utpal Garain.

Additional information

Received: 10 July 2003, Accepted: 22 November 2004, Published online: 18 March 2005

Correspondence to: Utpal Garain

Rights and permissions

Reprints and permissions

About this article

Cite this article

Garain, U., Chaudhuri, B.B. A corpus for OCR research on mathematical expressions. IJDAR 7, 241–259 (2005). https://doi.org/10.1007/s10032-004-0140-5

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10032-004-0140-5

Keywords:

Navigation