Skip to main content

Advertisement

Log in

Recognition of overlap graphs

  • Published:
Journal of Combinatorial Optimization Aims and scope Submit manuscript

    We’re sorry, something doesn't seem to be working properly.

    Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Abstract

Overlap graphs occur in computational biology and computer science, and have applications in genome sequencing, string compression, and machine scheduling. Given two strings \(s_{i}\) and \(s_{j}\), their overlap string is defined as the longest string \(v\) such that \(s_{i} = uv\) and \(s_{j} = vw\), for some non empty strings \(u,w\), and its length is called the overlap between these two strings. A weighted directed graph is an overlap graph if there exists a set of strings with one-to-one correspondence to the vertices of the graph, such that each arc weight in the graph equals the overlap between the corresponding strings. In this paper, we characterize the class of overlap graphs, and we present a polynomial time recognition algorithm as a direct consequence. Given a weighted directed graph \(G\), the algorithm constructs a set of strings that has \(G\) as its overlap graph, or decides that this is not possible.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Explore related subjects

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

References

  • Alon N, Cosares S, Hochbaum DS, Shamir R (1989) An algorithm for the detection and construction of monge sequences. Linear Algebra Appl 114115:669–680 ISSN: 0024-3795. Special issue dedicated to Alan J. Hoffman

    Article  MathSciNet  Google Scholar 

  • Barnes ER, Hoffman AJ (1985) On transportation problems with upper bounds on leading rectangles. SIAM J Algebr Discret Methods 6(3):487–496

    Article  MATH  MathSciNet  Google Scholar 

  • Blum A, Jiang T, Li M, Tromp J, Yannakakis M (1994) Linear approximation of shortest superstrings. J ACM 41:630–647 ISSN: 0004-5411

    Article  MATH  MathSciNet  Google Scholar 

  • Braga MDV, Meidanis J (2002) An algorithm that builds a set of strings given its overlap graph. In Proceedings of the 5th Latin American symposium on theoretical informatics, LATIN ’02, pp 52–63, London, UK, Springer. ISBN: 3-540-43400-3

  • Czumaj A, Ga̧sieniec L, Piotrów M, Rytter W (1997) Sequential and parallel approximation of shortest superstrings. J Algorithms 23:74–100 ISSN: 0196-6774

    Article  MATH  MathSciNet  Google Scholar 

  • Gallant J, Maier D, Storer JA (1980) On finding minimal length superstrings. J Comput Syst Sci 20(1):50–58 ISSN: 0022-0000

    Article  MATH  MathSciNet  Google Scholar 

  • Gevezes T, Pitsoulis L (2013) A greedy randomized adaptive search procedure with path relinking for the shortest superstring problem. J Comb Optim ISSN: 1382-6905

  • Gingeras TR, Milazzo PJ, Sciaky D, Roberts RJ (1979) Computer programs for the assembly of DNA sequences. Nucl Acids Res 7(2):529–543

    Article  Google Scholar 

  • Gusfield D (1994) Faster implementation of a shortest superstring approximation. Inf Proces Lett 51(5):271–274 ISSN: 0020-0190

    Article  MATH  MathSciNet  Google Scholar 

  • Gusfield D, Landau GM, Schieber B (1992) An efficient algorithm for the all pairs suffix–prefix problem. Inf Process Lett 41:181–185 ISSN: 0020-0190

    Article  MATH  MathSciNet  Google Scholar 

  • Ilie L, Popescu C (2006) The shortest common superstring problem and viral genome compression. Fundam Inf 73:153–164 ISSN: 0169-2968

    MATH  MathSciNet  Google Scholar 

  • Ilie L, Tinta L, Popescu C, Hill KA (2006) Viral genome compression. In: Mao C, Yokomori T (eds) DNA computing, vol 4287 of lecture notes in computer science, pp Springer, Berlin/Heidelberg, pp 111–126

  • Jenkyns TA (1979) The greedy travelling salesman’s problem. Networks 9(4):363–373 ISSN: 1097-0037

    Article  MATH  MathSciNet  Google Scholar 

  • Jiang T, Li M (1994) Approximating shortest superstrings with constraints. Theor Comput Sci 134(2):473–491 ISSN: 0304-3975

    Article  MATH  Google Scholar 

  • Middendorf M (1998) Shortest common superstrings and scheduling with coordinated starting times. Theor Comput Sci 191(1–2):205–214 ISSN: 0304-3975

    Article  MATH  MathSciNet  Google Scholar 

  • Shapiro MB (1967) An algorithm for reconstructing protein and RNA sequences. J ACM 14:720–731 ISSN: 0004-5411

    Article  MATH  Google Scholar 

  • Staden Roger (1982) Automation of the computer handling of gel reading data produced by the shotgun method of DNA sequencing. Nucl Acids Res 10(15):4731–4751

    Article  Google Scholar 

  • Storer JA, Szymanski TG (1982) Data compression via textual substitution. J ACM 29:928–951 ISSN 0004-5411

    Article  MATH  MathSciNet  Google Scholar 

  • Tarhio J, Ukkonen E (1988) A greedy approximation algorithm for constructing shortest common superstrings. Theor Comput Sci 57(1):131–145 ISSN: 0304–3975

    Article  MATH  MathSciNet  Google Scholar 

Download references

Acknowledgments

This research has been co-financed by the European Union (European Social Fund—ESF) and Greek national funds through the Operational Program “Education and Lifelong Learning” of the National Strategic Reference Framework (NSRF)—Research Funding Program: Heracleitus II. Investing in knowledge society through the European Social Fund.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Theodoros P. Gevezes.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gevezes, T.P., Pitsoulis, L.S. Recognition of overlap graphs. J Comb Optim 28, 25–37 (2014). https://doi.org/10.1007/s10878-013-9663-3

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10878-013-9663-3

Keywords