Abstract
The concepts of pseudo-bordered and pseudo-unbordered words are in large part motivated by research in theoretical DNA computing, wherein the Watson–Crick complementarity of DNA strands is modelled as an antimorphic involution, that is, a function \(\theta \) which is an antimorphism, \(\theta (uv)=\theta (v) \theta (u)\), and an involution, \(\theta (\theta (u))=u\), for all words u, v over the DNA alphabet. In particular, a word w is said to be \(\theta \)-bordered (or pseudo-bordered) if there exists a word \(v \in \Sigma ^{+}\) that is a proper prefix of w, while \(\theta (v)\) is a proper suffix of w. A word which is not \(\theta \)-bordered is \(\theta \)-unbordered. This paper continues the exploration of properties (for the case where \(\theta \) is a morphic involution) of the set of \(\theta \)-unbordered words, \(D_{\theta }(1)\), and the sets of words that have exactly i \(\theta \)-borders, \(D_{\theta }(i)\), \(i \ge 2\). We prove that, under some conditions, the set \(D_{\theta }(i)\) is disjunctive for all \(i \ge 1\), and that the set \(D_{\theta }^i(1){\setminus }D(i)\) is disjunctive for all \(i\ge 2\), where D(i) denotes the set of words with exactly i borders. We also discuss conditions for catenations of languages of \(\theta \)-unbordered words to remain \(\theta \)-unbordered, and anticipate further generalizations by showing that the set of all \(\theta \)-bordered words is not context-free for all morphisms \(\theta \) over an alphabet \(\Sigma \) with \(|\Sigma | \ge 3\) such that \(\theta (a) \ne a\) for all \(a \in \Sigma \) and \(\theta ^{3}\) equals the identity function on \(\Sigma \).
Similar content being viewed by others
Notes
By (anti)morphism we mean either a morphism or an antimorphism.
References
Blondin Massé, A., Gaboury, S., Hallé, S.: Pseudoperiodic words. In: Yen, H.C., Ibarra, O. (eds.) Developments in Language Theory. Lecture Notes in Computer Science, vol. 7410, pp. 308–319. Springer, Berlin Heidelberg (2012)
Carpi, A., de Luca, A.: Periodic-like words, periodicity, and boxes. Acta Inf. 37(8), 597–618 (2001)
Cho, D.J., Han, Y.S., Ko, S.K.: Decidability of involution hypercodes. Theor. Comput. Sci. 550, 90–99 (2014)
Constantinescu, S., Ilie, L.: Fine and Wilf’s theorem for Abelian periods. Bull. EATCS 89, 167–170 (2006)
Crochemore, M., Hancart, C., Lecroq, T.: Algorithms on Strings. Cambridge University Press, Cambridge (2007)
Crochemore, M., Rytter, W.: Jewels of Stringology. World Scientific, Singapore (2002)
Cummings, L.J., Smyth, W.F.: Weak repetitions in strings. J. Comb. Math. Comb. Comput. 24, 33–48 (1997)
Czeizler, E., Kari, L., Seki, S.: On a special class of primitive words. Theor. Comput. Sci. 411, 617–630 (2010)
de Luca, A., De Luca, A.: Pseudopalindrome closure operators in free monoids. Theor. Comput. Sci. 362(13), 282–300 (2006)
Gawrychowski, P., Manea, F., Mercaş, R., Nowotka, D., Tiseanu, C.: Finding pseudo-repetitions. Leibniz Int. Proc. Inf. 20, 257–268 (2013)
Gawrychowski, P., Manea, F., Nowotka, D.: Discovering hidden repetitions in words. In: Bonizzoni, P., Brattka, V., Löwe, B. (eds.) The Nature of Computation. Logic, Algorithms, Applications. Lecture Notes in Computer Science, vol. 7921, pp. 210–219. Springer, Berlin Heidelberg (2013)
Gawrychowski, P., Manea, F., Nowotka, D.: Testing generalised freeness of words. In: Mayr, E.W., Portier, N. (eds.) 31st International Symposium on Theoretical Aspects of Computer Science (STACS 2014). Leibniz International Proceedings in Informatics (LIPIcs), vol. 25, pp. 337–349. Schloss Dagstuhl–Leibniz–Zentrum für Informatik, Dagstuhl, Germany (2014)
Hopcroft, J.E., Ullman, J.D.: Formal Languages and Their Relation to Automata. Addison-Wesley Longman Publishing Co. Inc, Boston (1969)
Hsu, S., Ito, M., Shyr, H.: Some properties of overlapping order and related languages. Soochow J. Math. 15(1), 29–45 (1989)
Huang, C., Hsiao, P.C., Liau, C.J.: A note of involutively bordered words. J. Inf. Opt. Sci. 31(2), 371–386 (2010)
Hussini, S., Kari, L., Konstantinidis, S.: Coding properties of DNA languages. In: Jonoska, N., Seeman, N. (Eds.) Proceedings of DNA7. Lecture Notes in Computer Science, vol. 2340, pp. 57–69. Springer (2002)
Jonoska, N., Kephart, D., Mahalingam, K.: Generating DNA code words. Congr. Numerantium. 156, 99–110 (2002)
Kari, L., Kulkarni, M.S.: Pseudo-identities and bordered words. In: Paun, G., Rozenberg, G., Salomaa, A. (eds.) Discrete Mathematics and Computer Science, pp. 207–222. Editura Academiei Române, Romania (2014)
Kari, L., Mahalingam, K.: Involutively bordered words. Int. J. Found. Comput. Sci. 18(05), 1089–1106 (2007)
Kari, L., Mahalingam, K.: Watson–Crick conjugate and commutative words. In: Garzon, M., Yan, H. (eds.) DNA Computing. Lecture Notes in Computer Science, vol. 4848, pp. 273–283. Springer, Berlin Heidelberg (2008)
Kari, L., Mahalingam, K.: Watson–Crick palindromes in DNA computing. Natural Comput. 9(2), 297–316 (2010)
Kari, L., Seki, S.: On pseudoknot-bordered words and their properties. J. Comput. Syst. Sci. 75, 113–121 (2009)
Kari, L., Seki, S.: An improved bound for an extension of Fine and Wilf’s theorem and its optimality. Fundam. Inf. 101, 215–236 (2010)
Kopecki, S.: Personal communication (2015)
Lothaire, M.: Combinatorics on Words. Cambridge University Press, Cambridge (1997)
Shyr, H., Thierrin, G.: Disjunctive languages and codes. In: Karpinski, M. (ed.) Fundamentals of Computation Theory. Lecture Notes in Computer Science, vol. 56, pp. 171–176. Springer, Berlin Heidelberg (1977)
Shyr, H.J.: Free Monoids and Languages. Department of Mathematics, Soochow University, Taipei, Taiwan (1979)
Trappe, W., Washington, L.C.: Introduction to Cryptography with Coding Theory. Pearson Education India, New Delhi (2006)
Watson, J.D., Crick, F.H.: Molecular structure of nucleic acids. Nature 171(4356), 737–738 (1953)
Wong, F.R.: Algebraic Properties of d-Primitive Words. Master’s thesis. Chuang-Yuan Christian University, Chuang Li, Taiwan (1994)
Yu, S.: d-Minimal languages. Disc. Appl. Math. 89(13), 243–262 (1998)
Yu, S.S.: Languages and Codes. Tsang Hai Book Publishing Co., Taichung (2005)
Ziv, J., Lempel, A.: A universal algorithm for sequential data compression. IEEE Trans. Inf. Theory 23(3), 337–343 (1977)
Acknowledgments
We thank Steffen Kopecki for valuable suggestions, in particular those that lead to shortening the proof of Proposition 12, and Sepinoud Azimi for discussions.
Author information
Authors and Affiliations
Corresponding author
Additional information
This research was supported by Natural Science and Engineering Council of Canada (NSERC) Discovery Grant R2824A01 to L.K.
Rights and permissions
About this article
Cite this article
Kari, L., Kulkarni, M.S. Disjunctivity and other properties of sets of pseudo-bordered words. Acta Informatica 54, 379–398 (2017). https://doi.org/10.1007/s00236-016-0258-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00236-016-0258-1