Automata Learning: An Algebraic Approach

Authors:

Henning Urbat, Lutz Schröder

LICS '20: Proceedings of the 35th Annual ACM/IEEE Symposium on Logic in Computer Science

Pages 900 - 914

https://doi.org/10.1145/3373718.3394775

Published: 08 July 2020

Abstract

We propose a generic categorical framework for learning unknown formal languages of various types (e.g., languages of finite or infinite words, weighted languages, and nominal languages). Our approach is parametric in a monad T that represents the given type of languages and their recognizing algebraic structures. Using the concept of an automata presentation of T-algebras, we demonstrate that the task of learning a T-recognizable language can be reduced to learning an abstract form of algebraic automaton whose transitions are modeled by a functor. For the important case of adjoint automata, we devise a learning algorithm generalizing Angluin's L*. The algorithm is phrased in terms of categorically described extension steps; we provide a termination and complexity analysis based on a dedicated notion of finiteness. Our framework applies to structures like ω-regular languages that were not within the scope of existing categorical accounts of automata learning. In addition, it yields new learning algorithms for several types of languages for which no such algorithms were previously known at all, including sorted languages, nominal languages with name binding, and cost functions.
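For orientation, the classical setting that the abstract generalizes can be sketched in a few lines. The following is a minimal illustration of an L*-style learner for ordinary regular languages over finite words, not the categorical algorithm of the paper. It assumes the counterexample handling commonly attributed to Maler and Pnueli (all suffixes of a counterexample become table columns), which keeps the rows of the access words pairwise distinct so that only closedness has to be checked. The names `lstar`, `accepts`, `bounded_equiv`, `member`, and `equiv` are hypothetical, and the bounded equivalence oracle is only a stand-in for a real teacher.

```python
from itertools import product


def accepts(hyp, word):
    """Run a hypothesis DFA (initial state, transition map, accepting set) on a word."""
    state, delta, accepting = hyp
    for a in word:
        state = delta[(state, a)]
    return state in accepting


def lstar(alphabet, member, equiv):
    """Learn a DFA from membership queries `member(w)` and equivalence queries `equiv(hyp)`.

    `equiv` returns None if the hypothesis is accepted, else a counterexample word.
    """
    S = [""]      # access words (one per state); their rows stay pairwise distinct
    E = [""]      # distinguishing suffixes (table columns)
    cache = {}    # memoized membership answers

    def ask(w):
        if w not in cache:
            cache[w] = member(w)
        return cache[w]

    def row(s):
        return tuple(ask(s + e) for e in E)

    while True:
        # Closedness: every one-letter extension of S must match a row of S.
        changed = True
        while changed:
            changed = False
            rows = {row(s) for s in S}
            for s, a in product(list(S), alphabet):
                if row(s + a) not in rows:
                    S.append(s + a)   # new row found: promote it to a state
                    changed = True
                    break

        # Build the hypothesis: states are the access words in S.
        state_of = {row(s): s for s in S}
        delta = {(s, a): state_of[row(s + a)] for s in S for a in alphabet}
        accepting = {s for s in S if ask(s)}
        hyp = (S[0], delta, accepting)

        cex = equiv(hyp)
        if cex is None:
            return hyp
        # Assumed variant: add every suffix of the counterexample as a column.
        for i in range(len(cex) + 1):
            if cex[i:] not in E:
                E.append(cex[i:])


def bounded_equiv(alphabet, member, max_len):
    """Stand-in teacher (an assumption): compare on all words up to length max_len."""
    def equiv(hyp):
        for n in range(max_len + 1):
            for letters in product(alphabet, repeat=n):
                w = "".join(letters)
                if accepts(hyp, w) != member(w):
                    return w
        return None
    return equiv


if __name__ == "__main__":
    target = lambda w: w.endswith("ab")   # example target language
    learned = lstar("ab", target, bounded_equiv("ab", target, 6))
    print(len({s for (s, _) in learned[1]}), "states")
```

In this sketch the observation table plays the role that the abstract assigns to the hypothesis being extended step by step: each closedness repair or counterexample adds a state or a column. The bounded oracle can miss differences on longer words, so a real teacher or an exact equivalence check would be needed in practice.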

Index Terms

Automata Learning: An Algebraic Approach
1. Theory of computation
  1. Formal languages and automata theory
    1. Formalisms
      1. Algebraic language theory

Published In

LICS '20: Proceedings of the 35th Annual ACM/IEEE Symposium on Logic in Computer Science

July 2020

986 pages

ISBN:9781450371049

DOI:10.1145/3373718

Conference Chairs: Holger Hermanns, Lijun Zhang, Naoki Kobayashi
General Chair: Dale Miller

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGLOG: ACM Special Interest Group on Logic and Computation
EACSL: European Association for Computer Science Logic
IEEE-CS\DATC: IEEE Computer Society

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Conference

LICS '20

LICS '20: 35th Annual ACM/IEEE Symposium on Logic in Computer Science

July 8 - 11, 2020

Saarbrücken, Germany

Acceptance Rates

LICS '20 Paper Acceptance Rate: 69 of 174 submissions, 40%

Overall Acceptance Rate: 215 of 622 submissions, 35%
