Abstract
This paper aims at presenting the application of first-order logic machine learning techniques to two document domains in order to learn rules for recognizing the semantic role of their logical components. Specifically, the multistrategy incremental learning system INTHELEX has been applied to multi-format scientific papers and documents concerning European films from the 20’s and 30’s. The challenge comes from the different levels of formatting standards in these domains: from (more or less) standardized layouts, in scientific papers, to documents with almost no standard, in historical cultural heritage material. Experimental results in both domains and a comparison with the Progol system assess the advantages that the exploitation of INTHELEX can yield.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Becker, J.M.: Inductive learning of decision rules with exceptions: Methodology and experimentation. B.s. diss., Dept. of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA (1985) UIUCDCS-F-85-945
Dietterich, T.G.: Approximate statistical test for comparing supervised classification learning algorithms. Neural Computation 10(7), 1895–1923 (1998)
Esposito, F., Malerba, D., Lisi, F.A.: Machine learning for intelligent processing of printed documents. Journal of Intelligent Information Systems 14(2/3), 175–198 (2000)
Esposito, F., Semeraro, G., Fanizzi, N., Ferilli, S.: Multistrategy Theory Revision: Induction and abduction in INTHELEX. Machine Learning Journal 38(1/2), 133–156 (2000)
Lamma, E., Mello, P., Riguzzi, F., Esposito, F., Ferilli, S., Semeraro, G.: Co-operation of abduction and induction in logic programming. In: Kakas, A.C., Flach, P. (eds.) Abductive and Inductive Reasoning: Essays on their Relation and Integration. Kluwer, Dordrecht (2000)
Lloyd, J.W.: Foundations of Logic Programming, 2nd edn. Springer, Berlin (1987)
Michalski, R.S.: Inferential theory of learning. developing foundations for multistrategy learning. In: Michalski, R.S., Tecuci, G. (eds.) Machine Learning. A Multistrategy Approach, vol. IV, pp. 3–61. Morgan Kaufmann, San Mateo (1994)
Muggleton, S.: Inverse entailment and Progol. New Generation Computing, special issue on Inductive Logic Programming 13(3-4), 245–286 (1995)
Plotkin, G.D.: A note on inductive generalization. Machine Intelligence 5, 153–163 (1970)
Semeraro, G., Esposito, F., Malerba, D., Fanizzi, N., Ferilli, S.: A logic framework for the incremental inductive synthesis of datalog theories. In: Fuchs, N.E. (ed.) LOPSTR 1997. LNCS, vol. 1463, pp. 300–321. Springer, Heidelberg (1998)
Wrobel, S.: Concept Formation and Knowledge Revision. Kluwer Academic Publishers, Dordrecht (1994)
Zucker, J.-D.: Semantic abstraction for concept representation and learning. In: Michalski, R.S., Saitta, L. (eds.) Proceedings of the 4th International Workshop on Multistrategy Learning, Desenzano del Garda, Italy (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ferilli, S., Di Mauro, N., Basile, T.M.A., Esposito, F. (2003). Incremental Induction of Rules for Document Image Understanding. In: Cappelli, A., Turini, F. (eds) AI*IA 2003: Advances in Artificial Intelligence. AI*IA 2003. Lecture Notes in Computer Science(), vol 2829. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39853-0_15
Download citation
DOI: https://doi.org/10.1007/978-3-540-39853-0_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20119-9
Online ISBN: 978-3-540-39853-0
eBook Packages: Springer Book Archive