Skip to main content

Succinctness of Regular Expressions with Interleaving, Intersection and Counting

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5162))

Abstract

Studying the impact of operations, such as intersection and interleaving, on the succinctness of regular expressions has recently received renewed attention [12,13,14]. In this paper, we study the succinctness of regular expressions (REs) extended with interleaving, intersection and counting operators. We show that in a translation from REs with interleaving to standard regular expressions a double exponential size increase can not be avoided. We also consider the complexity of translations to finite automata. We give a tight exponential lower bound on the translation of REs with intersection to NFAs, and, for each of the three classes of REs, we show that in a translation to a DFA a double exponential size increase can not be avoided. Together with known results, this gives a complete picture of the complexity of translating REs extended with interleaving, intersection or counting into (standard) regular expressions, NFAs, and DFAs.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bex, G., Gelade, W., Neven, F., Vansummeren, S.: Learning deterministic regular expressions for the inference of schemas from XML data. In: WWW, pp. 825–834 (2008)

    Google Scholar 

  2. Bex, G., Neven, F., Schwentick, T., Tuyls, K.: Inference of concise DTDs from XML data. In: VLDB, pp. 115–126 (2006)

    Google Scholar 

  3. Bex, G., Neven, F., Vansummeren, S.: Inferring XML Schema Definitions from XML data. In: VLDB, pp. 998–1009 (2007)

    Google Scholar 

  4. Bruggemann-Klein, A.: Regular expressions into finite automata. Theoretical Computer Science 120(2), 197–213 (1993)

    Article  MathSciNet  Google Scholar 

  5. Clark, J., Murata, M.: RELAX NG Specification. OASIS (December 2001)

    Google Scholar 

  6. Cohen, R.S.: Rank-non-increasing transformations on transition graphs. Information and Control 20(2), 93–113 (1972)

    Article  MATH  MathSciNet  Google Scholar 

  7. Eggan, L.C.: Transition graphs and the star height of regular events. Michigan Mathematical Journal 10, 385–397 (1963)

    Article  MATH  MathSciNet  Google Scholar 

  8. Ehrenfeucht, A., Zeiger, H.: Complexity measures for regular expressions. Journal of Computer and System Sciences 12(2), 134–146 (1976)

    MATH  MathSciNet  Google Scholar 

  9. Ellul, K., Krawetz, B., Shallit, J., Wang, M.: Regular expressions: New results and open problems. Journal of Automata, Languages and Combinatorics 10(4), 407–437 (2005)

    MATH  MathSciNet  Google Scholar 

  10. Fürer, M.: The complexity of the inequivalence problem for regular expressions with intersection. In: ICALP, pp. 234–245 (1980)

    Google Scholar 

  11. Gelade, W., Martens, W., Neven, F.: Optimizing schema languages for XML: Numerical constraints and interleaving. In: Schwentick, T., Suciu, D. (eds.) ICDT 2007. LNCS, vol. 4353, pp. 269–283. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  12. Gelade, W., Neven, F.: Succinctness of the complement and intersection of regular expressions. In: STACS, pp. 325–336 (2008)

    Google Scholar 

  13. Gruber, H., Holzer, M.: Finite automata, digraph connectivity, and regular expression size. In: ICALP (to appear, 2008)

    Google Scholar 

  14. Gruber, H., Holzer, M.: Language operations with regular expressions of polynomial size. In: DCFS (to appear, 2008)

    Google Scholar 

  15. Gruber, H., Johannsen, J.: Optimal lower bounds on regular expression size using communication complexity. In: Amadio, R.M. (ed.) FOSSACS 2008. LNCS, vol. 4962, pp. 273–286. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  16. Hume, A.: A tale of two greps. Software, Practice and Experience 18(11), 1063–1072 (1988)

    Article  Google Scholar 

  17. Jiang, T., Ravikumar, B.: A note on the space complexity of some decision problems for finite automata. Information Processing Letters 40(1), 25–31 (1991)

    Article  MATH  MathSciNet  Google Scholar 

  18. Kilpelainen, P., Tuhkanen, R.: Regular expressions with numerical occurrence indicators — preliminary results. In: SPLST 2003, pp. 163–173 (2003)

    Google Scholar 

  19. Kupferman, O., Zuhovitzky, S.: An improved algorithm for the membership problem for extended regular expressions. In: Diks, K., Rytter, W. (eds.) MFCS 2002. LNCS, vol. 2420, pp. 446–458. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  20. Mayer, A.J., Stockmeyer, L.J.: Word problems-this time with interleaving. Information and Computation 115(2), 293–311 (1994)

    Article  MathSciNet  Google Scholar 

  21. McNaughton, R.: The loop complexity of pure-group events. Information and Control 11(1/2), 167–176 (1967)

    Article  MATH  MathSciNet  Google Scholar 

  22. McNaughton, R.: The loop complexity of regular events. Information Sciences 1(3), 305–328 (1969)

    Article  MathSciNet  Google Scholar 

  23. Meyer, A.R., Stockmeyer, L.J.: The equivalence problem for regular expressions with squaring requires exponential space. In: FOCS, pp. 125–129 (1972)

    Google Scholar 

  24. Petersen, H.: Decision problems for generalized regular expressions. In: DCAGRS, pp. 22–29 (2000)

    Google Scholar 

  25. Petersen, H.: The membership problem for regular expressions with intersection is complete in LOGCFL. In: Alt, H., Ferreira, A. (eds.) STACS 2002. LNCS, vol. 2285, pp. 513–522. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  26. Robson, J.M.: The emptiness of complement problem for semi extended regular expressions requires c\(^{\mbox{n}}\) space. Information Processing Letters 9(5), 220–222 (1979)

    Article  MATH  MathSciNet  Google Scholar 

  27. Schott, R., Spehner, J.C.: Shuffle of words and araucaria trees. Fundamenta Informatica 74(4), 579–601 (2006)

    MATH  MathSciNet  Google Scholar 

  28. Sperberg-McQueen, C.M., Thompson, H.: XML Schema (2005), http://www.w3.org/XML/Schema

  29. Stockmeyer, L.J., Meyer, A.R.: Word problems requiring exponential time: Preliminary report. In: STOC, pp. 1–9 (1973)

    Google Scholar 

  30. Wall, L., Christiansen, T., Orwant, J.: Programming Perl, 3rd edn. OReilly, Sebastopol (2000)

    MATH  Google Scholar 

  31. Yu, S.: Regular languages. In: Rozenberg, G., Salomaa, A. (eds.) Handbook of formal languages, ch. 2, vol. 1, pp. 41–110. Springer, Heidelberg (1997)

    Google Scholar 

  32. Yu, S.: State complexity of regular languages. Journal of Automata, Languages and Combinatorics 6(2), 221–234 (2001)

    MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Edward Ochmański Jerzy Tyszkiewicz

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gelade, W. (2008). Succinctness of Regular Expressions with Interleaving, Intersection and Counting. In: Ochmański, E., Tyszkiewicz, J. (eds) Mathematical Foundations of Computer Science 2008. MFCS 2008. Lecture Notes in Computer Science, vol 5162. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85238-4_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-85238-4_29

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-85237-7

  • Online ISBN: 978-3-540-85238-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics