Abstract
A linear graph is a graph whose vertices are linearly ordered. This linear ordering allows pairs of disjoint edges to be either preceding (\(<\)), nesting (\(\sqsubset\)) or crossing (\(\between\)). Given a family of linear graphs and a non-empty subset \(\mathcal{R} \subseteq \{<,\sqsubset,\between\}\), we are interested in the PBMCSP problem: find a maximum-size edge-disjoint graph, with edge-pairs all comparable by one of the relations in \(\mathcal{R}\), that occurs as a subgraph in each of the linear graphs of the family. In this paper, we generalize the framework of Davydov and Batzoglou by considering patterns comparable by all possible subsets \(\mathcal{R} \subseteq \{<,\sqsubset,\between\}\). This is motivated by the fact that many biological applications require considering crossing structures, and by the fact that different combinations of the relations above give rise to different generalizations of natural combinatorial problems. Our results can be summarized as follows. We give tight hardness results for the PBMCSP problem for \(\{<,\between\}\)-structured patterns and \(\{\sqsubset,\between\}\)-structured patterns. Furthermore, we prove that the problem is approximable within ratios (i) \(2\mathcal{H}(k)\) for \(\{<,\between\}\)-structured patterns, (ii) \(k^{1/2}\) for \(\{\sqsubset,\between\}\)-structured patterns, and (iii) \(\mathcal{O}(\sqrt{k \lg k})\) for \(\{<,\sqsubset,\between\}\)-structured patterns, where \(k\) is the size of the optimal solution and \(\mathcal{H}(k) = \sum_{i=1}^{k} 1/i\) is the \(k\)-th harmonic number.
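The three relations between disjoint edges can be illustrated with a minimal Python sketch. It is not taken from the paper: the representation of an edge as a pair of integer positions `(i, j)` with `i < j`, the string labels `"<"`, `"nest"` and `"cross"`, and the helper names `relation` and `is_structured` are all assumptions made for illustration.

```python
from itertools import combinations

def relation(e, f):
    """Classify two disjoint edges of a linear graph.

    Assumption: an edge is a pair (i, j) of vertex positions with i < j,
    and the two edges share no endpoint.
    """
    (a, b), (c, d) = sorted([e, f])  # put the edge with the smaller left endpoint first
    if b < c:
        return "<"      # a < b < c < d: the first edge ends before the second starts
    if d < b:
        return "nest"   # a < c < d < b: the second edge lies inside the first
    return "cross"      # a < c < b < d: the two edges interleave

def is_structured(edges, allowed):
    """Check that every pair of edges is comparable by a relation in `allowed`."""
    return all(relation(e, f) in allowed for e, f in combinations(edges, 2))
```

For instance, under these assumptions the edge set `[(1, 2), (3, 6), (4, 5)]` is \(\{<,\sqsubset\}\)-structured but not \(\{<,\between\}\)-structured, because `(4, 5)` is nested in `(3, 6)`.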
References
Alon, N.: Private communication (2006)
Alonso, L., Schott, R.: On the tree inclusion problem. In: Borzyszkowski, A.M., Sokolowski, S. (eds.) MFCS 1993. LNCS, vol. 711, pp. 211–221. Springer, Heidelberg (1993)
Apostolico, A., Guerra, C.: The longest common subsequence problem revisited. Algorithmica 2, 315–336 (1987)
Blin, G., Fertin, G., Rizzi, R., Vialette, S.: What makes the arc-preserving subsequence problem hard? In: Sunderam, V.S., van Albada, G.D., Sloot, P.M.A., Dongarra, J.J. (eds.) ICCS 2005. LNCS, vol. 3515, pp. 860–868. Springer, Heidelberg (2005)
Blin, G., Fertin, G., Vialette, S.: New results for the 2-interval pattern problem. In: Sahinalp, S.C., Muthukrishnan, S.M., Dogrusoz, U. (eds.) CPM 2004. LNCS, vol. 3109, Springer, Heidelberg (2004)
Bose, P., Buss, J.F., Lubiw, A.: Pattern matching for permutations. IPL 65(5), 277–283 (1998)
Chang, M.-S., Wang, F.-G.: Efficient algorithms for the maximum weight clique and maximum weight independent set problems on permutation graphs. IPL 43(6), 293–295 (1992)
Chen, W.: More efficient algorithm for ordered tree inclusion. J. Algorithms 26(2), 370–385 (1998)
Crochemore, M., Hermelin, D., Landau, G.M., Vialette, S.: Approximating the 2-interval pattern problem. In: Brodal, G.S., Leonardi, S. (eds.) ESA 2005. LNCS, vol. 3669, pp. 426–437. Springer, Heidelberg (2005)
Davydov, E., Batzoglou, S.: A computational model for RNA multiple structural alignment. In: Sahinalp, S.C., Muthukrishnan, S.M., Dogrusoz, U. (eds.) CPM 2004. LNCS, vol. 3109, pp. 254–269. Springer, Heidelberg (2004)
Dilworth, R.P.: A decomposition theorem for partially ordered sets. Annals of Mathematics Series 2 51, 161–166 (1950)
Eppstein, D., Galil, Z., Giancarlo, R., Italiano, G.F.: Sparse dynamic programming I: Linear cost functions. J. ACM 39(3), 519–545 (1992)
Erdős, P., Szekeres, G.: A combinatorial problem in geometry. Compositio Mathematica 2, 463–470 (1935)
Evans, P.A.: Algorithms and complexity for annotated sequence analysis. PhD thesis, University of Alberta (1999)
Goldman, D., Istrail, S., Papadimitriou, C.H.: Algorithmic aspects of protein structure similarity. In: Proc. 40th Foundations of Computer Science (FOCS), pp. 512–522 (1999)
Gramm, J.: A polynomial-time algorithm for the matching of crossing contact-map patterns. IEEE/ACM Trans. Comp. Biol. and Bioinfo. 1(4), 171–180 (2004)
Gramm, J., Guo, J., Niedermeier, R.: Pattern matching for arc-annotated sequences. In: Agrawal, M., Seth, A.K. (eds.) FST TCS 2002: Foundations of Software Technology and Theoretical Computer Science. LNCS, vol. 2556, pp. 182–193. Springer, Heidelberg (2002)
Gupta, U.I., Lee, D.T., Leung, J.Y.-T.: Efficient algorithms for interval graphs and circular-arc graphs. Networks 12, 459–467 (1982)
Hirschberg, D.S.: Algorithms for the longest common subsequence problem. J. ACM 24(4), 664–675 (1977)
Hunt, J.W., Szymanski, T.G.: A fast algorithm for computing longest common subsequences. Communications of the ACM 20, 350–353 (1977)
Kilpeläinen, P., Mannila, H.: Ordered and unordered tree inclusion. SIAM J. Comp. 24(2), 340–356 (1995)
Klein, P.N.: Computing the edit-distance between unrooted ordered trees. In: Bilardi, G., Pietracaprina, A., Italiano, G.F., Pucci, G. (eds.) ESA 1998. LNCS, vol. 1461, pp. 91–102. Springer, Heidelberg (1998)
Kostochka, A.: On upper bounds on the chromatic numbers of graphs. Transactions of the Institute of Mathematics (Siberian Branch of the Academy of Sciences in USSR) 10, 204–226 (1988)
Kostochka, A., Kratochvil, J.: Covering and coloring polygon-circle graphs. Discrete Mathematics 163, 299–305 (1997)
Kubica, M., Rizzi, R., Vialette, S., Waleń, T.: Approximation of RNA multiple structural alignment. In: Lewenstein, M., Valiente, G. (eds.) CPM 2006. LNCS, vol. 4009, pp. 211–222. Springer, Heidelberg (2006)
Li, S.C., Li, M.: On the complexity of the crossing contact map pattern matching problem. In: Bücher, P., Moret, B.M.E. (eds.) WABI 2006. LNCS (LNBI), vol. 4175, pp. 231–241. Springer, Heidelberg (2006)
Maier, D.: The complexity of some problems on subsequences and supersequences. J. ACM 25(2), 322–336 (1978)
Masek, W.J., Paterson, M.S.: A faster algorithm computing string edit distances. J. Comp. and Syst. Sc. 20(1), 18–31 (1980)
Shasha, D., Zhang, K.: Simple fast algorithms for the editing distance between trees and related problems. SIAM J. Comp. 18(6), 1245–1262 (1989)
Tiskin, A.: Longest common subsequences in permutations and maximum cliques in circle graphs. In: Lewenstein, M., Valiente, G. (eds.) CPM 2006. LNCS, vol. 4009, pp. 270–281. Springer, Heidelberg (2006)
Vialette, S.: On the computational complexity of 2-interval pattern matching problems. Theoretical Computer Science 312(2-3), 223–249 (2004)
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fertin, G., Hermelin, D., Rizzi, R., Vialette, S. (2007). Common Structured Patterns in Linear Graphs: Approximation and Combinatorics. In: Ma, B., Zhang, K. (eds) Combinatorial Pattern Matching. CPM 2007. Lecture Notes in Computer Science, vol 4580. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73437-6_25
DOI: https://doi.org/10.1007/978-3-540-73437-6_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73436-9
Online ISBN: 978-3-540-73437-6
eBook Packages: Computer Science, Computer Science (R0)