Algorithms for Learning Regular Expressions

Fernau, Henning

doi:10.1007/11564089_24

Algorithms for Learning Regular Expressions

Henning Fernau^21,22

Conference paper

2219 Accesses
10 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3734))

Abstract

We describe algorithms that directly infer regular expressions from positive data and characterize the regular language classes that can be learned this way.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ahonen, H.: Disambiguation of SGML content models. In: Nicholas, C., Wood, D. (eds.) PODDP 1996 and PODP 1996. LNCS, vol. 1293, pp. 27–37. Springer, Heidelberg (1997)
Chapter Google Scholar
Ahonen, H., Mannila, H., Nikunen, E.: Forming grammars for structured documents: an application of grammatical inference. In: Carrasco, R.C., Oncina, J. (eds.) ICGI 1994. LNCS (LNAI), vol. 862, pp. 153–167. Springer, Heidelberg (1994)
Google Scholar
Berstel, J., Boasson, L.: Formal properties of XML grammars and languages. Acta Informatica 38(9), 649–671 (2002)
Article MATH MathSciNet Google Scholar
Blackwell, A.F.: SWYN: A visual representation for regular expressions. In: Lieberman, H. (ed.) Your wish is my command: Giving users the power to instruct their software, pp. 245–270. Morgan Kaufmann, San Francisco (2001)
Chapter Google Scholar
Brazma, A.: Efficient learning of regular expressions from approximate examples. In: Greiner, R., Petsche, T., Hanson, S.J. (eds.) Computational Learning Theory and Natural Learning Systems, Making Learning Systems Practical, ch.19, vol. IV, pp. 337–352. MIT Press, Cambridge (1997)
Google Scholar
Chung, Y.D., Kim, J.W., Kim, M.H.: Efficient preprocessing of XML queries using structured signatures. Information Processing Letters 87, 257–264 (2003)
Article MATH MathSciNet Google Scholar
CZ-Redaktion.: Maschinenmenschen plaudern per XML mit der Unternehmens-IT. Computer Zeitung (50), 30 (2000)
Google Scholar
Fernau, H.: Learning XML grammars. In: Perner, P. (ed.) MLDM 2001. LNCS (LNAI), vol. 2123, pp. 73–87. Springer, Heidelberg (2001)
Chapter Google Scholar
Garofalakis, M., Gionis, A., Rastogi, R., Seshadri, S., Shim, K.: XTRACT: learning document type descriptors from XML document collections. Data Mining and Knowledge Discovery 7, 23–56 (2003)
Article MathSciNet Google Scholar
Gold, E.M.: Language identification in the limit. Information and Control (now Information and Computation) 10, 447–474 (1967)
Article MATH Google Scholar
Hopcroft, J.E., Ullman, J.D.: Introduction to Automata Theory, Languages, and Computation. Addison-Wesley, Reading (1979)
MATH Google Scholar
Laird., P.D.: Learning from Good and Bad Data. Kluwer Academic Publishers, Norwell (1988)
Google Scholar
Nevill-Manning, C.G., Witten, I.H.: Online and offline heuristics for inferring hierarchies of repetitions in sequences. Proc. IEEE 88, 1745–1755 (2000)
Article Google Scholar
Smith, T.C., Witten, I.H., Cleary, J.G., Legg, S.: Objective evaluation of inferred context-free grammars. In: Proc. Australian and New Zealand Conference on Intelligent Information Systems, Brisbane, Australia (November 1994)
Google Scholar
van Zaanen, M.: Bootstrapping Structure into Language: Alignment-Based Learning. PhD, School of Computing, University of Leeds, UK (September 2001)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Hertfordshire, College Lane, Hatfield, Herts, AL10 9AB, UK
Henning Fernau
Wilhelm-Schickard-Institut für Informatik, Universität Tübingen, Sand 13, D-72076, Tübingen, Germany
Henning Fernau

Authors

Henning Fernau
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computing, National University of Singapore, 117590, Singapore
Sanjay Jain
Ruhr-Universität Bochum, Germany
Hans Ulrich Simon
Department of Information and Communication Engineering, Faculty of Electro-Communications, The University of Electro-Communications, Chofugaoka 1–5–1, Chofu, 182-8585, Tokyo, Japan
Etsuji Tomita

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fernau, H. (2005). Algorithms for Learning Regular Expressions. In: Jain, S., Simon, H.U., Tomita, E. (eds) Algorithmic Learning Theory. ALT 2005. Lecture Notes in Computer Science(), vol 3734. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11564089_24

Download citation

DOI: https://doi.org/10.1007/11564089_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29242-5
Online ISBN: 978-3-540-31696-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics