Abstract
In this paper, we present a method that locates tables and their cells in camera-captured document images. In order to deal with this problem in the presence of geometric and photometric distortions, we develop new junction detection and labeling methods, where junction detection means to find candidates for the corners of cells, and junction labeling is to infer their connectivity. We consider junctions as the intersections of curves, and so we first develop a multiple curve detection algorithm. After the junction detection, we encode the connectivity information (including false detection) between the junctions into 12 labels, and design a cost function reflecting pairwise relationships as well as local observations. The cost function is minimized via the belief propagation algorithm, and we can locate tables and their cells from the inferred labels. Also, in order to handle multiple tables in a single page, we propose a table area detection method. Our method is based on the well-known recursive X-Y cut, however, we modify the method so that we can also deal with curved seams caused by the geometric distortions. For the evaluation of our method, we build a data set that includes a variety of camera-captured table images and make the set publicly available. Experimental results on the set show that our method successfully locates tables and their cells in camera-captured images.
Similar content being viewed by others
References
Arias, J.F., Kasturi, R.: Efficient extraction of primitives from line drawings composed of horizontal and vertical lines. Mach. Vis. Appl. 10(4), 214–221 (1997)
Avidan, S., Shamir, A.: Seam carving for content-aware image resizing. ACM Trans. Graph. 26(3):10 (2007)
Brown, M.S., Seales, W.B.: Image restoration of arbitrarily warped documents. IEEE Trans. Pattern Anal. Mach. Intell. 26(10), 1295–1306 (2004)
Cao, H., Ding, X., Liu, C.: A cylindrical surface model to rectify the bound document image. In: IEEE International Conference on Computer Vision, pp. 228–233 (2003)
Cesarini, F., Marinai, S., Sarti, L., Soda, G.: Trainable table location in document images. In: International Conference on Pattern Recognition, pp. 236–240 (2002)
Gatos, B., Danatsas, D., Pratikakis, I., Perantonis, S.J.: Automatic table detection in document images. In: Pattern Recognition and Data Mining, pp. 609–618 (2005)
Ha, J., Haralick, R., Phillips, I.: Recursive x-y cut using bounding boxes of connected components. In: International Conference on Document Analysis and Recognition, pp. 952–955 (1995)
Hartley, R.I., Zisserman, A.: Multiple View Geometry in Computer Vision. Cambridge University Press, Cambridge (2000)
Hu, J., Kashi, R.S., Lopresti, D.P., Wilfong, G.: Medium-independent table detection. SPIE Doc. Recognit. Retr. VII 3967, 291–302 (2000)
Kieninger, T., Dengel, A.: Applying the t-recs table recognition system to the business letter domain. In: IEEE International Conference on Document Analysis and Recognition, pp. 518–522 (2001)
Kieninger, T.G.: Table structure recognition based on robust block segmentation. In: Photonics West’98 Electronic Imaging, pp. 22–32 (1998)
Koo, H.I.: Segmentation and rectification of pictures in the camera-captured images of printed documents. IEEE Trans. Multimed. 15(3), 647–660 (2013)
Koo, H.I., Cho, N.I.: State estimation in a document image and its application in text block identification and text line extraction. In: European Conference on Computer Vision, pp. 421–434 (2010)
Koo, H.I., Kim, J., Cho, N.I.: Composition of a dewarped and enhanced document image from two view images. IEEE Trans. Image Process. 18(7), 1551–1562 (2009)
Liang, J., DeMenthon, D., Doermann, D.: Geometric rectification of camera-captured document images. IEEE Trans. Pattern Anal. Mach. Intell. 30(4), 591–605 (2008)
Mandal, S., Chowdhury, S., Das, A.K., Chanda, B.: A simple and effective table detection system from document images. Int. J. Doc. Anal. Recognit. 8(2–3), 172–182 (2006)
Matas, J., Galambos, C., Kittler, J.: Robust detection of lines using the progressive probabilistic hough transform. Comput. Vis. Image Underst. 78(1), 119–137 (2000)
Nagy, G., Seth, S.: Hierarchical representation of optically scanned documents. In: International Conference on Pattern Recognition, pp. 347–349 (1984)
Pereira Neves, L., Facon, J.: Methodology of automatic extraction of table-form cells. In: XIII Brazilian Symposium on Computer Graphics and Image Processing, pp. 15–21 (2000)
Pilu, M., Pollard, S.: A light-weight text image processing method for handheld embedded cameras. In: British Machine Vision Conference, pp. 547–556 (2002)
Shafait, F.: Document image dewarping contest. In: 2nd International Workshop on Camera-Based Document Analysis and Recognition, pp. 181–188 (2007)
Shafait, F., Smith, R.: Table detection in heterogeneous documents. In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems, pp. 65–72 (2010)
Stamatopoulos, N., Gatos, B., Pratikakis, I., Perantonis, S.: Goal-oriented rectification of camera-based document images. IEEE Trans. Image Process. 20(4), 910–920 (2011)
Szeliski, R., Zabih, R., Scharstein, D., Veksler, O., Kolmogorov, V., Agarwala, A., Tappen, M., Rother, C.: A comparative study of energy minimization methods for markov random fields. In: European Conference on Computer Vision, pp. 16–29 (2006)
Tappen, M.F., Freeman, W.T.: Comparison of graph cuts with belief propagation for stereo, using identical mrf parameters. In: IEEE International Conference on Computer Vision, pp. 900–906 (2003)
Taylor, S.L., Fritzson, R., Pastor, J.A.: Extraction of data from preprinted forms. Mach. Vis. Appl. 5(3), 211–222 (1992)
Tian, Y., Narasimhan, S.: Rectification and 3d reconstruction of curved document images. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 377–384 (2011)
Yamashita, A., Kawarago, A., Kaneko, T., Miura, K.: Shape reconstruction and image restoration for non-flat surfaces of documents with a stereo vision system. In: International Conference on Pattern Recognition, pp. 482–485 (2004)
Zanibbi, R., Blostein, D., Cordy, R.: A survey of table recognition. Int. J. Doc. Anal. Recognit. 7(1), 1–16 (2004)
Zhang, Z., Liang, X., Ma, Y.: Unwrapping low-rank textures on generalized cylindrical surfaces. In: IEEE International Conference on Computer Vision, pp. 1347–1354 (2011)
Zheng, Y., Liu, C., Ding, X., Pan, S.: Form frame line detection with directional single-connected chain. In: IEEE International Conference on Document Analysis and Recognition, pp. 699–703 (2001)
Saabni, R., El-Sana, J.: Language-Independent Text Lines Extraction Using Seam Carving. In: IEEE International Conference on Document Analysis and Recognition, pp. 563–568 (2011)
Acknowledgments
This work was supported by the Ministry of Science, ICT and Future Planning, Korea, through the Information Technology Research Center support Program supervised by the National IT Industry Promotion Agency under Grant NIPA-2014-H0301-14-1019.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Seo, W., Koo, H.I. & Cho, N.I. Junction-based table detection in camera-captured document images. IJDAR 18, 47–57 (2015). https://doi.org/10.1007/s10032-014-0226-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10032-014-0226-7