On the Rademacher Complexity of Weighted Automata

Balle, Borja; Mohri, Mehryar

doi:10.1007/978-3-319-24486-0_12

Borja Balle¹⁶ &
Mehryar Mohri^17,18

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9355))

Included in the following conference series:

International Conference on Algorithmic Learning Theory

1274 Accesses
1 Citations

Abstract

Weighted automata (WFAs) provide a general framework for the representation of functions mapping strings to real numbers. They include as special instances deterministic finite automata (DFAs), hidden Markov models (HMMs), and predictive states representations (PSRs). In recent years, there has been a renewed interest in weighted automata in machine learning due to the development of efficient and provably correct spectral algorithms for learning weighted automata. Despite the effectiveness reported for spectral techniques in real-world problems, almost all existing statistical guarantees for spectral learning of weighted automata rely on a strong realizability assumption. In this paper, we initiate a systematic study of the learning guarantees for broad classes of weighted automata in an agnostic setting. Our results include bounds on the Rademacher complexity of three general classes of weighted automata, each described in terms of different natural quantities. Interestingly, these bounds underline the key role of different data-dependent parameters in the convergence rates.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Abe, N., Warmuth, M.K.: On the computational complexity of approximating distributions by probabilistic automata. Machine Learning (1992)
Google Scholar
Albert, J., Kari, J.: Digital image compression. In: Handbook of weighted automata. Springer (2009)
Google Scholar
Baier, C., Größer, M., Ciesinski, F.: Model checking linear-time properties of probabilistic systems. In: Handbook of Weighted automata. Springer (2009)
Google Scholar
Bailly, R., Denis, F., Ralaivola, L.: Grammatical inference as a principal component analysis problem. In: ICML (2009)
Google Scholar
Bailly, R., Denis, F.: Absolute convergence of rational series is semi-decidable. Inf. Comput. (2011)
Google Scholar
Balle, B., Carreras, X., Luque, F., Quattoni, A.: Spectral learning of weighted automata: A forward-backward perspective. Machine Learning (2014)
Google Scholar
Balle, B., Hamilton, W., Pineau, J.: Methods of moments for learning stochastic languages: unified presentation and empirical comparison. In: ICML (2014)
Google Scholar
Balle, B., Mohri, M.: Spectral learning of general weighted automata via constrained matrix completion. In: NIPS (2012)
Google Scholar
Balle, B., Mohri, M.: Learning weighted automata. In: CAI (2015)
Google Scholar
Balle, B., Panangaden, P., Precup, D.: A canonical form for weighted automata and applications to approximate minimization. In: Logic in Computer Science (LICS) (2015)
Google Scholar
Bartlett, P.L., Mendelson, S.: Rademacher and gaussian complexities: risk bounds and structural results. In: Helmbold, D.P., Williamson, B. (eds.) COLT 2001 and EuroCOLT 2001. LNCS (LNAI), vol. 2111, pp. 224–240. Springer, Heidelberg (2001)
Chapter Google Scholar
Berstel, J., Reutenauer, C.: Noncommutative rational series with applications. Cambridge University Press (2011)
Google Scholar
Boots, B., Siddiqi, S., Gordon, G.: Closing the learning-planning loop with predictive state representations. In: RSS (2009)
Google Scholar
Carlyle, J.W., Paz, A.: Realizations by stochastic finite automata. J. Comput. Syst. Sci. 5(1) (1971)
Google Scholar
Cortes, C., Mohri, M., Rastogi, A.: Lp distance and equivalence of probabilistic automata. International Journal of Foundations of Computer Science (2007)
Google Scholar
Devroye, L., Lugosi, G.: Combinatorial methods in density estimation. Springer (2001)
Google Scholar
Eilenberg, S.: Automata, Languages and Machines, vol. A. Academic Press (1974)
Google Scholar
Fliess, M.: Matrices de Hankel. Journal de Mathématiques Pures et Appliquées 53 (1974)
Google Scholar
de Gispert, A., Iglesias, G., Blackwood, G., Banga, E., Byrne, W.: Hierarchical phrase-based translation with weighted finite-state transducers and shallow-n grammars. Computational Linguistics (2010)
Google Scholar
Hamilton, W.L., Fard, M.M., Pineau, J.: Modelling sparse dynamical systems with compressed predictive state representations. In: ICML (2013)
Google Scholar
Hsu, D., Kakade, S.M., Zhang, T.: A spectral algorithm for learning hidden Markov models. In: COLT (2009)
Google Scholar
Ishigami, Y., Tani, S.: Vc-dimensions of finite automata and commutative finite automata with k letters and n states. Discrete Applied Mathematics (1997)
Google Scholar
Knight, K., May, J.: Applications of weighted automata in natural language processing. In: Handbook of Weighted Automata. Springer (2009)
Google Scholar
Koltchinskii, V., Panchenko, D.: Rademacher processes and bounding the risk of function learning. In: High Dimensional Probability II, pp. 443–459. Birkhäuser (2000)
Google Scholar
Kuich, W., Salomaa, A.: Semirings, automata, languages. In: EATCS. Monographs on Theoretical Computer Science, vol. 5. Springer-Verlag, Berlin-New York (1986)
Google Scholar
Kulesza, A., Jiang, N., Singh, S.: Low-rank spectral learning with weighted loss functions. In: AISTATS (2015)
Google Scholar
Kulesza, A., Rao, N.R., Singh, S.: Low-rank spectral learning. In: AISTATS (2014)
Google Scholar
Massart, P.: Some applications of concentration inequalities to statistics. In: Annales de la Faculté des Sciences de Toulouse (2000)
Google Scholar
Mirsky, L.: A trace inequality of John von Neumann. Monatshefte für Mathematik (1975)
Google Scholar
Mohri, M.: Weighted automata algorithms. In: Handbook of Weighted Automata. Monographs in Theoretical Computer Science, pp. 213–254. Springer (2009)
Google Scholar
Mohri, M., Pereira, F.C.N., Riley, M.: Speech recognition with weighted finite-state transducers. In: Handbook on Speech Processing and Speech Comm. Springer (2008)
Google Scholar
Mohri, M., Rostamizadeh, A., Talwalkar, A.: Foundations of machine learning. MIT press (2012)
Google Scholar
Salomaa, A., Soittola, M.: Automata-Theoretic Aspects of Formal Power Series. Springer-Verlag, New York (1978)
Book MATH Google Scholar
Tropp, J.A.: An Introduction to Matrix Concentration Inequalities (2015). ArXiv abs/1501.01571
Vershynin, R.: Lectures in Geometrical Functional Analysis. Preprint (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science, McGill University, Montréal, Canada
Borja Balle
Courant Institute of Mathematical Sciences, New York, NY, USA
Mehryar Mohri
Google Research, New York, NY, USA
Mehryar Mohri

Authors

Borja Balle
View author publications
You can also search for this author in PubMed Google Scholar
Mehryar Mohri
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Borja Balle .

Editor information

Editors and Affiliations

University of California, La Jolla, California, USA
Kamalika Chaudhuri
UNIV DEL INSUBRIA, 21100 VARESE, Italy
CLAUDIO GENTILE
REGINA, Saskatchewan, Canada
Sandra Zilles

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Balle, B., Mohri, M. (2015). On the Rademacher Complexity of Weighted Automata. In: Chaudhuri, K., GENTILE, C., Zilles, S. (eds) Algorithmic Learning Theory. ALT 2015. Lecture Notes in Computer Science(), vol 9355. Springer, Cham. https://doi.org/10.1007/978-3-319-24486-0_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-24486-0_12
Published: 31 October 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24485-3
Online ISBN: 978-3-319-24486-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics