Abstract
This paper introduces a new grammatical inference algorithm, the Boisdale algorithm, which can identify a class of context-free unification grammars in the limit from positive data only. The algorithm infers both the syntax and the semantics of a language, where the semantics can be described using arbitrarily complex data structures represented as key-value pairs. It is an alignment-based learning algorithm that runs in time polynomial in the length of the training data, and it can infer a grammar when presented with any set of sentences tagged with any data structure. The paper describes the algorithm, characterizes a class of languages that it can identify in the limit, and reports some experimental results.
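To make the setting concrete, the following is a minimal sketch of what positive training data of this kind might look like, together with a generic alignment step. It is not the Boisdale algorithm itself: the names `TaggedSentence`, `align`, and `differing_keys`, the flat dictionary used for the meaning, and the example sentences are all assumptions made for illustration. The alignment shown simply factors two sentences into a shared prefix, two differing middles, and a shared suffix, which is the kind of observation an alignment-based learner could use to hypothesize that the differing parts belong to the same category.

```python
from typing import Dict, NamedTuple, Set, Tuple


class TaggedSentence(NamedTuple):
    """A positive example: a word sequence plus its meaning.

    The meaning is a flat dict of key-value pairs here (an assumption for
    brevity); the abstract allows arbitrarily complex structures.
    """
    words: Tuple[str, ...]
    meaning: Dict[str, str]


def align(a: Tuple[str, ...], b: Tuple[str, ...]):
    """Split two word sequences into (shared prefix, middle of a,
    middle of b, shared suffix)."""
    shortest = min(len(a), len(b))
    p = 0  # length of the common prefix
    while p < shortest and a[p] == b[p]:
        p += 1
    s = 0  # length of the common suffix, not overlapping the prefix
    while s < shortest - p and a[len(a) - 1 - s] == b[len(b) - 1 - s]:
        s += 1
    return a[:p], a[p:len(a) - s], b[p:len(b) - s], a[len(a) - s:]


def differing_keys(x: Dict[str, str], y: Dict[str, str]) -> Set[str]:
    """Keys whose values differ (or are missing) between two meanings."""
    return {k for k in x.keys() | y.keys() if x.get(k) != y.get(k)}


# Two hypothetical tagged sentences that differ in one word and one value.
s1 = TaggedSentence(("fly", "to", "perth", "today"),
                    {"op": "fly", "to": "perth"})
s2 = TaggedSentence(("fly", "to", "sydney", "today"),
                    {"op": "fly", "to": "sydney"})

prefix, mid1, mid2, suffix = align(s1.words, s2.words)
changed = differing_keys(s1.meaning, s2.meaning)
```

In this toy case the differing words (`perth`, `sydney`) line up with the single differing key (`to`), which hints at how syntax and semantics could be generalized together. How the actual algorithm forms and merges such hypotheses is described in the paper, not here.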
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Starkie, B., Fernau, H. (2004). The Boisdale Algorithm – An Induction Method for a Subclass of Unification Grammar from Positive Data. In: Paliouras, G., Sakakibara, Y. (eds) Grammatical Inference: Algorithms and Applications. ICGI 2004. Lecture Notes in Computer Science(), vol 3264. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30195-0_21
DOI: https://doi.org/10.1007/978-3-540-30195-0_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23410-4
Online ISBN: 978-3-540-30195-0
eBook Packages: Springer Book Archive