Document Image Understanding through Iterative Transductive Learning

Ceci, Michelangelo; Loglisci, Corrado; Macchia, Lucrezia; Malerba, Donato; Quercia, Luciano

doi:10.1007/978-3-642-35834-0_13

Part of the book series: Communications in Computer and Information Science (CCIS, volume 354)

Included in the following conference series:

Italian Research Conference on Digital Libraries

Abstract

In Document Image Understanding, a fundamental task is recognizing semantically relevant components in the layout extracted from a document image. This process can be automated by learning classifiers that label such components. However, learning typically assumes the availability of a large set of documents whose layout components have been manually labeled beforehand. This contrasts with the more common situation, in which only a few labeled documents and an abundance of unlabeled ones are available. In addition, labeling layout components is further complicated by their multi-modal nature (textual and spatial information may coexist). In this work, we investigate the application of a relational classifier that works in the transductive setting. The relational setting is justified by the multi-modal nature of the data, while transduction is justified by the possibility of exploiting the large amount of information conveyed by the unlabeled layout components. The classifier bootstraps the labeling process iteratively: reliable classifications are used as training examples in subsequent iterations. The proposed solution has been evaluated on document images of scientific literature.
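The iterative bootstrapping idea described in the abstract can be illustrated with a minimal sketch. This is not the relational classifier used in the chapter: a simple nearest-centroid classifier over numeric feature vectors stands in for it, and the confidence measure, the `threshold` value, the function names, and the toy feature vectors are all assumptions made for illustration only.

```python
from math import dist


def centroids(examples):
    """Mean feature vector for each label in a list of (features, label) pairs."""
    sums, counts = {}, {}
    for x, y in examples:
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: tuple(v / counts[y] for v in acc) for y, acc in sums.items()}


def predict(cents, x):
    """Return (label, confidence) for x; assumes at least two classes.

    Confidence is an illustrative margin: 1 - d_nearest / d_second_nearest.
    """
    ranked = sorted((dist(x, c), y) for y, c in cents.items())
    (d1, y1), (d2, _) = ranked[0], ranked[1]
    conf = 0.0 if d2 == 0 else 1.0 - d1 / d2
    return y1, conf


def iterative_transduction(labeled, unlabeled, threshold=0.5, max_iter=10):
    """Label every unlabeled example, reusing reliable predictions as training data."""
    train = list(labeled)
    pending = list(unlabeled)
    assigned = {}
    for _ in range(max_iter):
        if not pending:
            break
        cents = centroids(train)
        scored = [(x, *predict(cents, x)) for x in pending]
        reliable = [(x, y) for x, y, c in scored if c >= threshold]
        if not reliable:
            break  # no prediction is trusted enough to bootstrap further
        train.extend(reliable)      # reliable predictions become training examples
        assigned.update(reliable)
        pending = [x for x, _, c in scored if c < threshold]
    # Transduction must label the whole working set: fall back to the last model.
    cents = centroids(train)
    for x in pending:
        assigned[x] = predict(cents, x)[0]
    return assigned


# Hypothetical 2-D feature vectors for layout components (values are made up).
labeled = [((0.0, 0.0), "title"), ((10.0, 10.0), "body")]
unlabeled = [(3.0, 3.0), (4.0, 4.0), (9.0, 9.0)]
labels = iterative_transduction(labeled, unlabeled)
```

In this toy run, the component at (4.0, 4.0) falls below the confidence threshold in the first iteration and is only labeled in the second, after the reliable prediction for (3.0, 3.0) has shifted the "title" centroid toward it.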

Author information

Authors and Affiliations

Dipartimento di Informatica, Università degli Studi di Bari “Aldo Moro”, Italy
Michelangelo Ceci, Corrado Loglisci, Lucrezia Macchia, Donato Malerba & Luciano Quercia

Editor information

Editors and Affiliations

Department of Information Engineering, University of Padua, Via Gradenigo, 6/a, 35131, Padua, Italy
Maristella Agosti & Nicola Ferro
Department of Computer Science, University of Bari, Via E. Orabona, 4, 70126, Bari, Italy
Floriana Esposito & Stefano Ferilli

About this paper

Cite this paper

Ceci, M., Loglisci, C., Macchia, L., Malerba, D., Quercia, L. (2013). Document Image Understanding through Iterative Transductive Learning. In: Agosti, M., Esposito, F., Ferilli, S., Ferro, N. (eds) Digital Libraries and Archives. IRCDL 2012. Communications in Computer and Information Science, vol 354. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35834-0_13

DOI: https://doi.org/10.1007/978-3-642-35834-0_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35833-3
Online ISBN: 978-3-642-35834-0
eBook Packages: Computer Science, Computer Science (R0)
