Abstract
We study the determinism checking problem for regular expressions extended with interleaving. There are two notions of determinism: strong and weak. Interleaving allows child elements to intermix in any order. Although interleaving does not increase the expressive power of regular expressions, it can make them exponentially more succinct. We first give an \(\mathcal {O}(|\varSigma ||E|)\) time algorithm to check the weak determinism of such an expression \(E\), where \(\varSigma \) is the set of distinct symbols in \(E\). Next, we derive an \(\mathcal {O}(|E|)\) method to transform a regular expression with interleaving into its weakly star normal form. This form can be used to rewrite, in linear time, an expression that is weakly but not strongly deterministic into an equivalent strongly deterministic expression. Based on this form, we present an \(\mathcal {O}(|\varSigma ||E|)\) algorithm to check strong determinism. As far as we know, these are the first \(\mathcal {O}(|\varSigma ||E|)\) time algorithms proposed for the weak and strong determinism problems of regular expressions with interleaving.
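To make the interleaving operator concrete, the following is a minimal sketch of interleaving on finite sets of words. It is an illustration only, not one of the algorithms of this paper; the function names `interleave` and `interleave_languages` are hypothetical. It assumes the usual shuffle semantics, in which the symbols of two words are merged while the relative order inside each word is preserved.

```python
from typing import Set


def interleave(u: str, v: str) -> Set[str]:
    """Return all interleavings of the words u and v.

    The relative order of symbols within u and within v is preserved.
    (Hypothetical helper, for illustration only.)
    """
    if not u:
        return {v}
    if not v:
        return {u}
    # The first symbol of the result comes either from u or from v.
    from_u = {u[0] + w for w in interleave(u[1:], v)}
    from_v = {v[0] + w for w in interleave(u, v[1:])}
    return from_u | from_v


def interleave_languages(a: Set[str], b: Set[str]) -> Set[str]:
    """Lift interleaving from words to finite languages."""
    return {w for u in a for v in b for w in interleave(u, v)}


# The word "ab" interleaved with "c": "c" may appear at any position.
print(sorted(interleave("ab", "c")))  # ['abc', 'acb', 'cab']

# The expression a & b & c denotes all 6 orderings of its three symbols.
abc = interleave_languages(interleave_languages({"a"}, {"b"}), {"c"})
print(len(abc))  # 6
```

An interleaving of n distinct symbols denotes n! words from an expression of size linear in n, which gives some intuition for the succinctness gap mentioned above.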
Work supported by the National Natural Science Foundation of China under Grant Nos. 61472405, 61070038.
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Peng, F., Chen, H., Mou, X. (2015). Deterministic Regular Expressions with Interleaving. In: Leucker, M., Rueda, C., Valencia, F. (eds) Theoretical Aspects of Computing - ICTAC 2015. ICTAC 2015. Lecture Notes in Computer Science, vol 9399. Springer, Cham. https://doi.org/10.1007/978-3-319-25150-9_13
DOI: https://doi.org/10.1007/978-3-319-25150-9_13
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25149-3
Online ISBN: 978-3-319-25150-9
eBook Packages: Computer Science (R0)