Skip to main content

Approximate Regular Expression Matching

  • Reference work entry
  • First Online:
Encyclopedia of Algorithms
  • 236 Accesses

Years and Authors of Summarized Original Work

  • 1995; Wu, Manber, Myers

Problem Definition

Given a text string T = t1t2… t n and a regular expression R of length m denoting language, \(\mathcal{L}(R)\) over an alphabet \(\Sigma \) of size \(\sigma\), and given a distance function among strings d and a threshold k, the approximate regular expression matching (AREM) problem is to find all the text positions that finish a so-called approximate occurrence of R in T, that is, compute the set \(\{j,\,\exists i,\,1,\,\leq \, i\, \leq \, j,\,\exists P\, \in \,\mathcal{L}(R),\,d(P,\,t_{i},\,.\,.\,.\,,\,t_{j})\, \leq \, k\}\ T,R\), and k are given together, whereas the algorithm can be tailored for a specific d.

This entry focuses on the so-called weighted edit distance, which is the minimum sum of weights of a sequence of operations converting one string into the other. The operations are insertions, deletions, and substitutions of characters. The weights are positive real values associated to...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Recommended Reading

  1. Gusfield D (1997) Algorithms on strings, trees and sequences. Cambridge University Press, Cambridge

    Book  MATH  Google Scholar 

  2. Myers EW (1996) Approximate matching of network expressions with spacers. J Comput Biol 3(1):33–51

    Article  Google Scholar 

  3. Myers EW, Miller W (1989) Approximate matching of regular expressions. Bull Math Biol 51:7–37

    Article  MathSciNet  MATH  Google Scholar 

  4. Navarro G (2001) Nr-grep: a fast and flexible pattern matching tool. Softw Pract Exp 31:1265–1312

    Article  MATH  Google Scholar 

  5. Navarro G (2004) Approximate regular expression searching with arbitrary integer weights. Nord J Comput 11(4):356–373

    MathSciNet  MATH  Google Scholar 

  6. Navarro G, Raffinot M (2002) Flexible pattern matching in strings – practical on-line search algorithms for texts and biological sequences. Cambridge University Press, Cambridge

    Book  MATH  Google Scholar 

  7. Navarro G, Raffinot M (2003) Fast and simple character classes and bounded gaps pattern matching, with applications to protein searching. J Comput Biol 10(6):903–923

    Article  Google Scholar 

  8. Thompson K (1968) Regular expression search algorithm. Commun ACM 11(6):419–422

    Article  MATH  Google Scholar 

  9. Wu S, Manber U (1992) Fast text searching allowing errors. Commun ACM 35(10):83–91

    Article  Google Scholar 

  10. Wu S, Manber U, Myers EW (1995) A subquadratic algorithm for approximate regular expression matching. J Algorithms 19(3):346–360

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gonzalo Navarro .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer Science+Business Media New York

About this entry

Cite this entry

Navarro, G. (2016). Approximate Regular Expression Matching. In: Kao, MY. (eds) Encyclopedia of Algorithms. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-2864-4_22

Download citation

Publish with us

Policies and ethics