DOI: 10.1145/2401603.2401616
short-paper

A simple pattern matching algorithm for weighted sequences

Published: 23 October 2012

ABSTRACT

Weighted sequences are widely used in areas including information retrieval, bioinformatics, and music analysis. In this paper we show how to apply the Boyer-Moore algorithm [2] to weighted sequences. Because our algorithm is based on the Boyer-Moore algorithm, it works well when the alphabet is very large, which is the case in information retrieval. We also show how to handle small alphabets by considering two or more characters at once.

References

  1. R. A. Baeza-Yates and G. H. Gonnet. A new approach to text searching. CACM, 35(10):74--82, 1992.
  2. R. Boyer and S. Moore. A fast string search algorithm. CACM, 20(10):762--772, 1977.
  3. R. Clifford and B. Sach. Pattern matching in pseudo real-time. J. Discrete Algorithms, 9(1):67--81, 2011.
  4. V. Freschi and A. Bogliolo. Using sequence compression to speedup probabilistic profile matching. Bioinformatics, 21(10):2225--2229, 2005.
  5. K. Fredriksson. Faster string matching with Super-alphabets. In Proc. of SPIRE 2002, pages 207--214, 2002.
  6. C. Pizzi, P. Rastas, and E. Ukkonen. Fast search algorithms for position specific scoring matrices. In Proc. of BIRD 2007, pages 239--250, 2007.
  7. S. Rajasekaran, X. Jin, and J. L. Spouge. The efficient computation of position-specific match scores with the Fast Fourier Transform. J. Computational Biology, 9(1):23--33, 2002.
  8. D. M. Sunday. A very fast substring search algorithm. CACM, 33(8):132--142, 1990.

Published in

RACS '12: Proceedings of the 2012 ACM Research in Applied Computation Symposium
October 2012
488 pages
ISBN: 9781450314923
DOI: 10.1145/2401603

Copyright © 2012 Author

Publisher

Association for Computing Machinery
New York, NY, United States

Acceptance Rates

Overall Acceptance Rate: 393 of 1,581 submissions, 25%