Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5518))

Included in the following conference series:

  • 2742 Accesses

Abstract

This paper proposes a novel algorithm for complete exact pattern-matching focusing the specificities of protein sequences (alphabet of 20 symbols) but, also highly efficient considering larger alphabets. The searching strategy uses large search windows allowing multiple alignments per iteration. A new filtering heuristic, named compatibility rule, contributed decisively to the efficiency improvement. The new algorithm’s performance is, on average, superior in comparison with its best-rated competitors.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Knuth, D.E., Morris, J.H., Pratt, V.R.: Fast pattern matching in strings. SIAM J. Comput. 6(2), 323–350 (1977)

    Article  MathSciNet  MATH  Google Scholar 

  2. Boyer, R.S., Moore, J.S.: A fast string searching algorithm. Commun. Assoc. Comput. Mach. 20(10), 762–772 (1977)

    MATH  Google Scholar 

  3. Horspool, R.N.: Practical fast searching in strings. Software - Practice & Experience 10(6), 501–506 (1980)

    Article  Google Scholar 

  4. Sunday, D.M.: A very fast substring search algorithm. Commun. Assoc. Comput. Mach. 33(8), 132–142 (1990)

    Google Scholar 

  5. Kim, S.: A new string-pattern matching algorithm using partitioning and hashing efficiently. Journal of Experimental Algorithmics (JEA) 4(2) (1999)

    Google Scholar 

  6. Crochemore, M., Czumaj, A., Gasieniec, L., Jarominek, S., Lecroq, T., Plandowski, W., Rytter, W.: Speeding up Two String Matching Algorithms. Algorithmica 12(4/5), 247–267 (1994)

    Article  MathSciNet  MATH  Google Scholar 

  7. Baeza-Yates, R.A., Gonnet, G.H.: A new approach to text searching. Commun. ACM 35(10), 74–82 (1992)

    Article  Google Scholar 

  8. Karp, R.M., Rabin, M.O.: Efficient randomized pattern-matching algorithms. IBM J. Res. Dev. 31(2), 249–260 (1987)

    Article  MathSciNet  MATH  Google Scholar 

  9. Franek, F., Jennings, C.G., Smyth, W.F.: A Simple Fast Hybrid Pattern-Matching Algorithm. In: Apostolico, A., Crochemore, M., Park, K. (eds.) CPM 2005. LNCS, vol. 3537, pp. 288–297. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  10. Navarro, G., Raffinot, M.: Fast and Flexible String Matching by Combining Bitparallelism and Suffix automata. ACM Journal of Experimental Algorithms 5(4), 1–36 (2000)

    MathSciNet  MATH  Google Scholar 

  11. Crochemore, M., Rytter, W.: Text algorithms. Oxford University Press, Oxford (1994)

    MATH  Google Scholar 

  12. Peltola, H., Tarhio, J.: Alternative Algorithms for Bit-Parallel String Matching. In: Nascimento, M.A., de Moura, E.S., Oliveira, A.L. (eds.) SPIRE 2003. LNCS, vol. 2857, pp. 80–93. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  13. Lecroq, T.: Fast exact string matching algorithms. Information Processing Letters 102, 229–235 (2007)

    Article  MathSciNet  MATH  Google Scholar 

  14. Wu, S., Manber, U.: A fast algorithm for multi-pattern searching. TR-94-17, Department of Computer Science, University of Arizona, Tucson (1994)

    Google Scholar 

  15. Michailidis, P.D., Maragaritis, K.G.: On-line String Matching Algorithms: Survey and Experimental Results. International Journal of Computer Mathematics 76(4), 411–434 (2001)

    Article  MathSciNet  MATH  Google Scholar 

  16. Lecroq, T.: Experimental Results on String Matching Algorithms. Software - Pratice and Experience 25(7), 727–765 (1995)

    Article  Google Scholar 

  17. Smyth, B.: Computing Patterns in Strings. Pearson Addison-Wesley, London (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Deusdado, S., Carvalho, P. (2009). Efficient Exact Pattern-Matching in Proteomic Sequences. In: Omatu, S., et al. Distributed Computing, Artificial Intelligence, Bioinformatics, Soft Computing, and Ambient Assisted Living. IWANN 2009. Lecture Notes in Computer Science, vol 5518. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02481-8_178

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-02481-8_178

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-02480-1

  • Online ISBN: 978-3-642-02481-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics