Using a More Powerful Teacher to Reduce the Number of Queries of the L* Algorithm in Practical Applications

Martins, André L.; Pinto, H. Sofia; Oliveira, Arlindo L.

doi:10.1007/11595014_33

André L. Martins²¹,
H. Sofia Pinto²¹ &
Arlindo L. Oliveira²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3808))

Included in the following conference series:

Portuguese Conference on Artificial Intelligence

Abstract

In this work we propose to use a more powerful teacher to effectively apply query learning algorithms to identify regular languages in practical, real-world problems. More specifically, we define a more powerful set of replies to the membership queries posed by the L* algorithm that reduces the number of such queries by several orders of magnitude in a practical application. The basic idea is to avoid the needless repetition of membership queries in cases where the reply will be negative as long as a particular condition is met by the string in the membership query. We present an example of the application of this method to a real problem, that of inferring a grammar for the structure of technical articles.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Gold-Style Learning Theory

Learning Domain-Specific Grammars from a Small Number of Examples

Learning Automatic Families of Languages

References

Gold, E.M.: Complexity of automaton identification from given data. Information and Control 37, 302–320 (1978)
Article MathSciNet MATH Google Scholar
Pitt, L., Warmuth, M.: The minimum consistent DFA problem cannot be approximated within any polynomial. Journal of ACM 40, 95–142 (1993)
Article MATH MathSciNet Google Scholar
Angluin, D.: Learning regular sets from queries and counterexamples. Information and Computation 75, 86–106 (1987)
Article MathSciNet Google Scholar
Gold, E.M.: System identification via state characterization. Automatica 8, 621–636 (1972)
Article MATH MathSciNet Google Scholar
Schapire, R.E.: The Design and Analysis of Efficient Learning Algorithms. MIT Press, Cambridge (1992)
Google Scholar
Nevill-Manning, C., Witten, I.H., Maulsby, D.L.: Modeling sequences using grammars and automata. In: Proceedings Canadian Machine Learning Workshop, pp. 15–18 (1994)
Google Scholar
Hsu, C.N., Dung, M.T.: Generating finite-state transducers for semi-structured data extraction from the web. Information Systems 23, 521–538 (1998)
Article Google Scholar
Witten, I.H.: Adaptive text mining: inferring structure from sequences. Journal of Discrete Algorithms 2, 137–159 (2004)
Article MATH MathSciNet Google Scholar
Laender, A.H.F., Ribeiro-Neto, B.A., da Silva, A.S., Teixeira, J.S.: A brief survey of web data extraction tools. SIGMOD Record 31, 84–93 (2002)
Article Google Scholar
Ribeiro-Neto, B.A., Laender, A.H.F., da Silva, A.S.: Extracting semi-structured data through examples. In: Proceedings of the 1999 ACM CIKM International Conference on Information and Knowledge Management, pp. 94–101. ACM, New York (1999)
Chapter Google Scholar
Adelberg, B.: NoDoSE - a tool for semi-automatically extracting semi-structured data from text documents. In: Proceedings ACM SIGMOD International Conference on Management of Data, pp. 283–294 (1998)
Google Scholar
Califf, M.E., Mooney, R.J.: Relational learning of pattern-match rules for information extraction. In: Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, pp. 328–334 (1999)
Google Scholar
Soderland, S.: Learning information extraction rules for semi-structured and free text. Machine Learning 34, 233–272 (1999)
Article MATH Google Scholar
Angluin, D.: Queries and concept learning. Machine Learning 2, 319–342 (1988)
Google Scholar
Martins, A.L., Pinto, H.S., Oliveira, A.L.: Towards automatic learning of a structure ontology for technical articles. In: Semantic Web Workshop at SIGIR 2004 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

INESC-ID/IST, Av. Alves Redol, 9, 1000-029, Lisboa, Portugal
André L. Martins, H. Sofia Pinto & Arlindo L. Oliveira

Authors

André L. Martins
View author publications
You can also search for this author in PubMed Google Scholar
H. Sofia Pinto
View author publications
You can also search for this author in PubMed Google Scholar
Arlindo L. Oliveira
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Portugal Telecom Inovação (PTI), Centro de Informatica e Sistemas da Universidade de Coimbra (CISUC),
Carlos Bento
Department of Informatics Engineering, Coimbra University, Portugal
Amílcar Cardoso
Centre of Human Language Technology and Bioinformatics, University of Beira Interior, 6201-001, Covilhã, Portugal
Gaël Dias

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Martins, A.L., Pinto, H.S., Oliveira, A.L. (2005). Using a More Powerful Teacher to Reduce the Number of Queries of the L* Algorithm in Practical Applications. In: Bento, C., Cardoso, A., Dias, G. (eds) Progress in Artificial Intelligence. EPIA 2005. Lecture Notes in Computer Science(), vol 3808. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11595014_33

Download citation

DOI: https://doi.org/10.1007/11595014_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30737-2
Online ISBN: 978-3-540-31646-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Using a More Powerful Teacher to Reduce the Number of Queries of the L* Algorithm in Practical Applications

Abstract

Access this chapter

Preview

Similar content being viewed by others

Gold-Style Learning Theory

Learning Domain-Specific Grammars from a Small Number of Examples

Learning Automatic Families of Languages

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Using a More Powerful Teacher to Reduce the Number of Queries of the L* Algorithm in Practical Applications

Abstract

Access this chapter

Preview

Similar content being viewed by others

Gold-Style Learning Theory

Learning Domain-Specific Grammars from a Small Number of Examples

Learning Automatic Families of Languages

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation