Junction-based table detection in camera-captured document images

Seo, Wonkyo; Koo, Hyung Il; Cho, Nam Ik

doi:10.1007/s10032-014-0226-7

Junction-based table detection in camera-captured document images

Original Paper
Published: 16 July 2014

Volume 18, pages 47–57, (2015)
Cite this article

International Journal on Document Analysis and Recognition (IJDAR) Aims and scope Submit manuscript

Wonkyo Seo¹,
Hyung Il Koo² &
Nam Ik Cho¹

978 Accesses
22 Citations
Explore all metrics

Abstract

In this paper, we present a method that locates tables and their cells in camera-captured document images. In order to deal with this problem in the presence of geometric and photometric distortions, we develop new junction detection and labeling methods, where junction detection means to find candidates for the corners of cells, and junction labeling is to infer their connectivity. We consider junctions as the intersections of curves, and so we first develop a multiple curve detection algorithm. After the junction detection, we encode the connectivity information (including false detection) between the junctions into 12 labels, and design a cost function reflecting pairwise relationships as well as local observations. The cost function is minimized via the belief propagation algorithm, and we can locate tables and their cells from the inferred labels. Also, in order to handle multiple tables in a single page, we propose a table area detection method. Our method is based on the well-known recursive X-Y cut, however, we modify the method so that we can also deal with curved seams caused by the geometric distortions. For the evaluation of our method, we build a data set that includes a variety of camera-captured table images and make the set publicly available. Experimental results on the set show that our method successfully locates tables and their cells in camera-captured images.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Junction Based Table Detection in Mobile Captured Golf Scorecard Images

Digital Line Segment Detection for Table Reconstruction in Document Images

Scene Table Structure Recognition with Segmentation and Key Point Collaboration

References

Arias, J.F., Kasturi, R.: Efficient extraction of primitives from line drawings composed of horizontal and vertical lines. Mach. Vis. Appl. 10(4), 214–221 (1997)
Article Google Scholar
Avidan, S., Shamir, A.: Seam carving for content-aware image resizing. ACM Trans. Graph. 26(3):10 (2007)
Brown, M.S., Seales, W.B.: Image restoration of arbitrarily warped documents. IEEE Trans. Pattern Anal. Mach. Intell. 26(10), 1295–1306 (2004)
Article Google Scholar
Cao, H., Ding, X., Liu, C.: A cylindrical surface model to rectify the bound document image. In: IEEE International Conference on Computer Vision, pp. 228–233 (2003)
Cesarini, F., Marinai, S., Sarti, L., Soda, G.: Trainable table location in document images. In: International Conference on Pattern Recognition, pp. 236–240 (2002)
Gatos, B., Danatsas, D., Pratikakis, I., Perantonis, S.J.: Automatic table detection in document images. In: Pattern Recognition and Data Mining, pp. 609–618 (2005)
Ha, J., Haralick, R., Phillips, I.: Recursive x-y cut using bounding boxes of connected components. In: International Conference on Document Analysis and Recognition, pp. 952–955 (1995)
Hartley, R.I., Zisserman, A.: Multiple View Geometry in Computer Vision. Cambridge University Press, Cambridge (2000)
MATH Google Scholar
Hu, J., Kashi, R.S., Lopresti, D.P., Wilfong, G.: Medium-independent table detection. SPIE Doc. Recognit. Retr. VII 3967, 291–302 (2000)
Google Scholar
Kieninger, T., Dengel, A.: Applying the t-recs table recognition system to the business letter domain. In: IEEE International Conference on Document Analysis and Recognition, pp. 518–522 (2001)
Kieninger, T.G.: Table structure recognition based on robust block segmentation. In: Photonics West’98 Electronic Imaging, pp. 22–32 (1998)
Koo, H.I.: Segmentation and rectification of pictures in the camera-captured images of printed documents. IEEE Trans. Multimed. 15(3), 647–660 (2013)
Article MathSciNet Google Scholar
Koo, H.I., Cho, N.I.: State estimation in a document image and its application in text block identification and text line extraction. In: European Conference on Computer Vision, pp. 421–434 (2010)
Koo, H.I., Kim, J., Cho, N.I.: Composition of a dewarped and enhanced document image from two view images. IEEE Trans. Image Process. 18(7), 1551–1562 (2009)
Article MathSciNet Google Scholar
Liang, J., DeMenthon, D., Doermann, D.: Geometric rectification of camera-captured document images. IEEE Trans. Pattern Anal. Mach. Intell. 30(4), 591–605 (2008)
Article Google Scholar
Mandal, S., Chowdhury, S., Das, A.K., Chanda, B.: A simple and effective table detection system from document images. Int. J. Doc. Anal. Recognit. 8(2–3), 172–182 (2006)
Matas, J., Galambos, C., Kittler, J.: Robust detection of lines using the progressive probabilistic hough transform. Comput. Vis. Image Underst. 78(1), 119–137 (2000)
Article Google Scholar
Nagy, G., Seth, S.: Hierarchical representation of optically scanned documents. In: International Conference on Pattern Recognition, pp. 347–349 (1984)
Pereira Neves, L., Facon, J.: Methodology of automatic extraction of table-form cells. In: XIII Brazilian Symposium on Computer Graphics and Image Processing, pp. 15–21 (2000)
Pilu, M., Pollard, S.: A light-weight text image processing method for handheld embedded cameras. In: British Machine Vision Conference, pp. 547–556 (2002)
Shafait, F.: Document image dewarping contest. In: 2nd International Workshop on Camera-Based Document Analysis and Recognition, pp. 181–188 (2007)
Shafait, F., Smith, R.: Table detection in heterogeneous documents. In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems, pp. 65–72 (2010)
Stamatopoulos, N., Gatos, B., Pratikakis, I., Perantonis, S.: Goal-oriented rectification of camera-based document images. IEEE Trans. Image Process. 20(4), 910–920 (2011)
Article MathSciNet Google Scholar
Szeliski, R., Zabih, R., Scharstein, D., Veksler, O., Kolmogorov, V., Agarwala, A., Tappen, M., Rother, C.: A comparative study of energy minimization methods for markov random fields. In: European Conference on Computer Vision, pp. 16–29 (2006)
Tappen, M.F., Freeman, W.T.: Comparison of graph cuts with belief propagation for stereo, using identical mrf parameters. In: IEEE International Conference on Computer Vision, pp. 900–906 (2003)
Taylor, S.L., Fritzson, R., Pastor, J.A.: Extraction of data from preprinted forms. Mach. Vis. Appl. 5(3), 211–222 (1992)
Article Google Scholar
Tian, Y., Narasimhan, S.: Rectification and 3d reconstruction of curved document images. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 377–384 (2011)
Yamashita, A., Kawarago, A., Kaneko, T., Miura, K.: Shape reconstruction and image restoration for non-flat surfaces of documents with a stereo vision system. In: International Conference on Pattern Recognition, pp. 482–485 (2004)
Zanibbi, R., Blostein, D., Cordy, R.: A survey of table recognition. Int. J. Doc. Anal. Recognit. 7(1), 1–16 (2004)
Article Google Scholar
Zhang, Z., Liang, X., Ma, Y.: Unwrapping low-rank textures on generalized cylindrical surfaces. In: IEEE International Conference on Computer Vision, pp. 1347–1354 (2011)
Zheng, Y., Liu, C., Ding, X., Pan, S.: Form frame line detection with directional single-connected chain. In: IEEE International Conference on Document Analysis and Recognition, pp. 699–703 (2001)
Saabni, R., El-Sana, J.: Language-Independent Text Lines Extraction Using Seam Carving. In: IEEE International Conference on Document Analysis and Recognition, pp. 563–568 (2011)

Download references

Acknowledgments

This work was supported by the Ministry of Science, ICT and Future Planning, Korea, through the Information Technology Research Center support Program supervised by the National IT Industry Promotion Agency under Grant NIPA-2014-H0301-14-1019.

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering and INMC, Seoul National University, Seoul, Korea
Wonkyo Seo & Nam Ik Cho
Department of Electrical and Computer Engineering, Ajou University, Suwon, Korea
Hyung Il Koo

Authors

Wonkyo Seo
View author publications
You can also search for this author in PubMed Google Scholar
Hyung Il Koo
View author publications
You can also search for this author in PubMed Google Scholar
Nam Ik Cho
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hyung Il Koo.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Seo, W., Koo, H.I. & Cho, N.I. Junction-based table detection in camera-captured document images. IJDAR 18, 47–57 (2015). https://doi.org/10.1007/s10032-014-0226-7

Download citation

Received: 12 October 2013
Revised: 24 June 2014
Accepted: 30 June 2014
Published: 16 July 2014
Issue Date: March 2015
DOI: https://doi.org/10.1007/s10032-014-0226-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Junction-based table detection in camera-captured document images

Abstract

Access this article

Similar content being viewed by others

Junction Based Table Detection in Mobile Captured Golf Scorecard Images

Digital Line Segment Detection for Table Reconstruction in Document Images

Scene Table Structure Recognition with Segmentation and Key Point Collaboration

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Junction-based table detection in camera-captured document images

Abstract

Access this article

Similar content being viewed by others

Junction Based Table Detection in Mobile Captured Golf Scorecard Images

Digital Line Segment Detection for Table Reconstruction in Document Images

Scene Table Structure Recognition with Segmentation and Key Point Collaboration

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation