A survey of table recognition

Models, observations, transformations, and inferences

Document Analysis and Recognition

Abstract.

Table characteristics vary widely. Consequently, a great variety of computational approaches have been applied to table recognition. In this survey, the table recognition literature is presented as an interaction of table models, observations, transformations, and inferences. A table model defines the physical and logical structure of tables; the model is used to detect tables and to analyze and decompose the detected tables. Observations perform feature measurements and data lookup, transformations alter or restructure data, and inferences generate and test hypotheses. This presentation clarifies both the decisions made by a table recognizer and the assumptions and inferencing techniques that underlie these decisions.

Author information

Corresponding author

Correspondence to Richard Zanibbi.

Additional information

Received: 29 May 2003, Revised: 28 October 2003, Published online: 1 April 2004

About this article

Cite this article

Zanibbi, R., Blostein, D. & Cordy, J.R. A survey of table recognition. IJDAR 7, 1–16 (2004). https://doi.org/10.1007/s10032-004-0120-9

  • DOI: https://doi.org/10.1007/s10032-004-0120-9
