skip to main content
10.1145/1570256.1570275acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
short-paper

A data-based coding of candidate strings in the closest string problem

Published:08 July 2009Publication History

ABSTRACT

Given a set of strings S of equal lengths over an alphabet σ, the closest string problem seeks a string over σ whose maximum Hamming distance to any of the given strings is as small as possible. A data-based coding of strings for evolutionary search represents candidate closest strings as sequences of indexes of the given strings. The string such a chromosome represents consists of the symbols in the corresponding positions of the indexed strings.

A genetic algorithm using this coding was compared with two GAs that encoded candidate strings directly as strings over σ. In trials on twenty-five instances of the closest string problem with alphabets ranging is size from 2 to 30, the algorithm that used the data-based representation of candidate strings consistently returned the best results, and its advantage increased with the sizes of the test instances' alphabets.

References

  1. M. Frances and A. Litman. On covering problems of codes. Theory of Computing Systems, 30(2):113--119, 1997.Google ScholarGoogle ScholarCross RefCross Ref
  2. L. Gasieniec, J. Jansson, and A. Lingas. Approximation algorithms for Hamming clustering problems. Journal of Discrete Algorithms, 2:289--301, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  3. F.C. Gomes, C.N. Meneses, P.M. Pardalos, and G.V.R. Viana. Parallel algorithm for the closest string problem. In R. Mondaini, editor, Proceedings of the Fourth Brazilian Symposium on Mathematical and Computational Biology, volume II, pages 326--332, 2005.Google ScholarGoogle Scholar
  4. J. Gramm, R. Niedermeier, and P. Rossmanith. Exact solutions for closest string and related problems. In P. Eades and T. Takaoka, editors, Proceedings of the 12th International Symposium on Algorithms and Computation, volume 2223 of LNCS, pages 441--453, Berlin/Heidelberg, 2001. Springer. Christchurch, New Zealand. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J.K. Lanctot, M. Li, B. Ma, S. Wang, and L. Zhang. Distinguishing string selection problems. Information and Computation, 185:41--55, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Li, B. Ma, and L. Wang. On the closest string and substring problems. Journal of the ACM, 49:157--171, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. X. Liu, H. He, and O. Sykora. Parallel genetic algorithm and parallel simulated annealing for the closest string problem. In X. Li, S. Wang, and Z. Y. Dong, editors, Proceedings of the First International Conference on Advanced Data Mining and Applications, volume 3584 of LNAI, pages 591--597, Berlin/Heidelberg, 2005. Springer. Wuhan, China. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. X. Liu, H. Mauch, Z. Hao, and G. Wu. A compounded genetic and simulated annealing algorithm for the closest string problem. In Proceedings of the 2nd International Conference on Bioinformatics and Biomedical Engineering, pages 702--705, 2008. Shanghai.Google ScholarGoogle ScholarCross RefCross Ref
  9. H. Mauch, M.J. Melzer, and J.S. Hu. Genetic algorithm approach for the closest string problem. In Proceedings of the 2003 IEEE Bioinformatics Conference, pages 560--561. IEEE Press, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. C.N. Meneses, Z. Lu, C.A.S. Oliveira, and P.M. Pardalos. Optimal solutions for the closest string problem via integer programming. INFORMS Journal on Computing, 16(4):419--429, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. S. Roman. Coding and Information Theory, volume 134 of Graduate Texts in Mathematics. Springer, Berlin/Heidelberg, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. G. Syswerda. Uniform crossover in genetic algorithms. In J.D. Schaffer, editor, Proceedings of the Third International Conference on Genetic Algorithms, pages 2--9, San Mateo, CA, 1989. Morgan Kaufmann Publishers. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A data-based coding of candidate strings in the closest string problem

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      GECCO '09: Proceedings of the 11th Annual Conference Companion on Genetic and Evolutionary Computation Conference: Late Breaking Papers
      July 2009
      1760 pages
      ISBN:9781605585055
      DOI:10.1145/1570256

      Copyright © 2009 Copyright is held by the author/owner(s)

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 8 July 2009

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • short-paper

      Acceptance Rates

      Overall Acceptance Rate1,669of4,410submissions,38%

      Upcoming Conference

      GECCO '24
      Genetic and Evolutionary Computation Conference
      July 14 - 18, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader