A PTAS for Distinguishing (Sub)string Selection

Deng, Xiaotie; Li, Guojun; Li, Zimao; Ma, Bin; Wang, Lusheng

doi:10.1007/3-540-45465-9_63

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2380)

Included in the following conference series:

International Colloquium on Automata, Languages, and Programming

Abstract

Consider two sets of strings, \( \mathcal{B} \) (bad genes) and \( \mathcal{G} \) (good genes), as well as two integers \( d_b \) and \( d_g \) (\( d_b \le d_g \)). A frequently occurring problem in computational biology (and other fields) is to find a (distinguishing) substring \( s \) of length \( L \) that distinguishes the bad strings from the good strings, i.e., for each string \( s_i \in \mathcal{B} \) there exists a length-\( L \) substring \( t_i \) of \( s_i \) with \( d(s, t_i) \le d_b \) (close to bad strings), and for every substring \( u_i \) of length \( L \) of every string \( g_i \in \mathcal{G} \), \( d(s, u_i) \ge d_g \) (far from good strings). We present a polynomial time approximation scheme to settle the problem, i.e., for any constant \( \epsilon > 0 \), the algorithm finds a string \( s \) of length \( L \) such that for every \( s_i \in \mathcal{B} \) there is a length-\( L \) substring \( t_i \) of \( s_i \) with \( d(t_i, s) \le (1 + \epsilon) d_b \), and for every substring \( u_i \) of length \( L \) of every \( g_i \in \mathcal{G} \), \( d(u_i, s) \ge (1 - \epsilon) d_g \), if a solution to the original pair (\( d_b \le d_g \)) exists.
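
To make the two conditions in the abstract concrete, the following is a minimal sketch of a checker that tests whether a given candidate string satisfies them. It is not the approximation scheme itself, only a verifier for its output. The sketch assumes that d is the Hamming distance, which the abstract does not state explicitly; the function names `hamming`, `windows`, and `is_distinguishing` are hypothetical and chosen for illustration. Setting `eps=0` checks the exact conditions, and a positive `eps` checks the relaxed bounds (1 + ε)d_b and (1 - ε)d_g.

```python
from typing import List, Sequence


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings differ.

    Assumption: d in the abstract is taken to be the Hamming distance.
    """
    if len(a) != len(b):
        raise ValueError("strings must have equal length")
    return sum(x != y for x, y in zip(a, b))


def windows(t: str, length: int) -> List[str]:
    """All substrings of t with the given length (empty if t is too short)."""
    return [t[i:i + length] for i in range(len(t) - length + 1)]


def is_distinguishing(
    s: str,
    bad: Sequence[str],
    good: Sequence[str],
    d_b: int,
    d_g: int,
    eps: float = 0.0,
) -> bool:
    """Check the (optionally relaxed) distinguishing-substring conditions.

    Every bad string must contain some length-L window within
    (1 + eps) * d_b of s, and every length-L window of every good
    string must be at distance at least (1 - eps) * d_g from s.
    """
    length = len(s)
    close_to_bad = all(
        any(hamming(s, t) <= (1 + eps) * d_b for t in windows(b, length))
        for b in bad
    )
    far_from_good = all(
        hamming(s, u) >= (1 - eps) * d_g
        for g in good
        for u in windows(g, length)
    )
    return close_to_bad and far_from_good
```

For example, with bad strings `"ACGTAC"` and `"TTCGTA"`, good string `"GGGGGG"`, and candidate `"CGT"`, the exact conditions hold for d_b = 0 and d_g = 2, since `"CGT"` occurs in both bad strings and differs from `"GGG"` in two positions. Raising d_g to 3 makes the exact check fail, while the relaxed check with ε = 0.5 still passes.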

Fully supported by a grant from the Natural Science Foundation of China and Research Grants Council of the HKSAR Joint Research Scheme [Project No: NCityU 102/01].

Fully supported by a grant from the Research Grants Council of the Hong Kong SAR, China [Project No: CityU 1130/99E].

Author information

Authors and Affiliations

Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong
Xiaotie Deng, Zimao Li & Lusheng Wang
Department of Mathematics, Shandong University, Jinan, 250100, P. R. China
Guojun Li
Department of Computer Science, University of Western Ontario, London, Ont, N6A 5B7, Canada
Bin Ma

Editor information

Editors and Affiliations

Institute of Theoretical Computer Science, ETH Zentrum, ETH Zürich, 8092, Zürich, Switzerland
Peter Widmayer & Stephan Eidenbenz
Department of Languages and Sciences of the Computation E.T.S. de Ingeniería Informática, University of Málaga, Campus de Teatinos, 29071, Málaga, Spain
Francisco Triguero, Rafael Morales & Ricardo Conejo
School of Cognitive and Computing Sciences, University of Sussex, Falmer, Brighton, BN1 9QN, UK
Matthew Hennessy

About this paper

Cite this paper

Deng, X., Li, G., Li, Z., Ma, B., Wang, L. (2002). A PTAS for Distinguishing (Sub)string Selection. In: Widmayer, P., Eidenbenz, S., Triguero, F., Morales, R., Conejo, R., Hennessy, M. (eds) Automata, Languages and Programming. ICALP 2002. Lecture Notes in Computer Science, vol 2380. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45465-9_63

DOI: https://doi.org/10.1007/3-540-45465-9_63
Published: 25 June 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43864-9
Online ISBN: 978-3-540-45465-6
eBook Packages: Springer Book Archive
