Abstract
In this paper we investigate the use of hyper-heuristic methodologies for predicting DNA sequences. In particular, we utilize Sequencing by Hybridization. We believe that this is the first time that hyper-heuristics have been investigated in this domain. A hyper-heuristic is provided with a set of low-level heuristics and the aim is to decide which heuristic to call at each decision point. We investigate three types of hyper-heuristics. Two of these (simulated annealing and tabu search) draw their inspiration from meta-heuristics. The choice function hyper-heuristic draws its inspiration from reinforcement learning. We utilize two independent sets of low-level heuristics. The first set is based on a previous tabu search method, with the second set being a significant extension to this basic set, including utilizing a different representation and introducing the definition of clusters. The datasets we use comprises two randomly generated datasets and also a publicly available biological dataset. In total, we carried out experiments using 70 different combinations of heuristics, using the three datasets mentioned above and investigating six different hyper-heuristic algorithms. Our results demonstrate the effectiveness of a hyper-heuristic approach to this problem domain. It is necessary to provide a good set of low-level heuristics, which are able to both intensify and diversify the search but this approach has demonstrated very encouraging results on this extremely difficult and important problem domain.
Similar content being viewed by others
References
Aarts, E., Korst, J., & Michiels, W. (2005). Simulated annealing. In E. K. Burke & G. Kendall (Eds.), Search methodologies: introductory tutorials in optimization and decision support techniques (pp. 187–210). Berlin: Springer. Chap. 7.
Ayob, M., & Kendall, G. (2003). A Monte Carlo hyper-heuristic to optimise component placement sequencing for multi head placement machine. In Placement machine, InTech’03 Thailand (pp. 132–141).
Bai, R., & Kendall, G. (2005). An investigation of automated planograms using a simulated annealing based hyper-heuristics. In T. Ibaraki, K. Nonobe, & M. Yagiura (Eds.), Metaheuristics: progress as real problem solvers operations research/computer science interfaces series (Vol. 32, pp. 87–108). Berlin: Springer.
Bai, R., Blazewicz, J., Burke, E. K., Kendall, G., & Mccollum, B. (2007). A simulated annealing hyper-heuristic methodology for flexible decision support (Tech. rep.). School of CSiT. University of Nottingham, UK.
Blazewicz, J., & Kasprzak, M. (2003). Complexity of DNA sequencing by hybridization. Theoretical Computer Science, 290(3), 1459–1473.
Blazewicz, J., Formanowicz, P., Kasprzak, M., Markiewicz, W., & Weglarz, J. (2000). Tabu search for DNA sequencing with false negative and false positives. European Journal of Operational Research, 125, 257–265.
Blazewicz, J., Formanowicz, P., Guinand, F., & Kasprzak, M. (2002a). A heuristic managing errors for DNA sequencing. Bioinformatics, 18, 652–660.
Blazewicz, J., Kasprzak, M., & Kuroczycki, W. (2002b). Hybrid genetic algorithm for DNA sequencing with errors. Journal of Heuristics, 8, 495–502.
Blazewicz, J., Glover, F., & Kasprzak, M. (2004). DNA sequencing—tabu and scatter search combined. INFORMS Journal on Computing, 16, 232–240.
Blazewicz, J., Glover, F., Swiercz, A., Kasprzak, M., Markiewicz, W., Oguz, C., & Rebholz-Schuhmann, D. (2006). Dealing with repetitions in sequencing by hybridization. Computational Biology and Chemistry, 30(5), 313–320.
Bui, T., & Youssef, W. (2004). An enhanced genetic algorithm for DNA sequencing by hybridization with positive and negative errors. Lecture Notes in Computer Science, 3103, 908–919.
Burke, E. K., & Kendall, G. (Eds.) (2005). Search methodologies: introductory tutorials in optimization and decision support techniques. Berlin: Springer.
Burke, E. K., & Soubeiga, E. (2003). Scheduling nurses using a tabu-search hyperheuristic. In Proceedings of the 1st multidisciplinary international conference on scheduling: theory and applications (MISTA 2003), 197–218.
Burke, E., Kendall, G., Newall, J., Hart, E., Ross, P., & Schulenburg, S. (2003a) Hyper-heuristics: An emerging direction in modern search technology. In Handbook of metaheuristics. Dordrecht: Kluwer Academic. Chap. 16.
Burke, E. K., Kendall, G., & Soubeiga, E. (2003b). A tabu-search hyperheuristic for timetabling and rostering. Journal of Heuristics, 9(6), 451–470.
Burke, E. K., McCollum, B., Meisels, A., Petrovic, S., & Qu, R. (2007). A graph-based hyper-heuristic for timetabling problems. European Journal of Operational Research, 176(1), 177–192.
Cowling, P., Kendall, G., & Soubeiga, E. (2001). A hyperheuristic approach to scheduling a sales summit. In PATAT ’00: Selected papers from the third international conference on practice and theory of automated timetabling III (pp. 176–190). London: Springer.
Cowling, P., Kendall, G., & Soubeiga, E. (2002). Hyperheuristics: A tool for rapid prototyping in scheduling and optimisation. In: Lecture notes in computer science, EvoWorkShops, pp. 1–10. Berlin: Springer.
Dowsland, K., Soubeiga, E. & Burke, E.K (2007). A simulated annealing hyper-heuristic for determining shipper sizes. European Journal of Operational Research 179(3), 759–774.
Dramanac, R., Labat, I., Brukner, I., & Crkvenjakov, R. (1989). Sequencing of megabase plus DNA by hybridization: Theory of the method. Genomics, 4(2), 114–128.
Gendreau, M., & Potvin, J. Y. (2005). Tabu search. In: E. K. Burke & G. Kendall (Eds.), Search methodologies: introductory tutorials in optimization and decision support techniques (pp. 165–186). Berlin: Springer. Chap. 6.
Glover, F., & Laguna, M. (1997). Tabu search. Boston: Kluwer Academic.
Han, L., & Kendall, G. (2003). Investigation of a tabu assisted hyper-heuristic genetic algorithm. In Proceedings of congress on evolutionary computation (CEC2003) (Vol. 3, pp. 2230–2237).
Kendall, G., & Hussin, M. (2005a). A tabu search hyper-heuristic approach to the examination timetabling problem at the MARA University of Technology. Lectures Notes in Computer Science, 3616, 270–293.
Kendall, G., & Hussin, N. M. (2005b). In G. Kendall, E. Burke, S. Petrovic, & M. Gendreau (Eds.), An investigation of a tabu-search-based hyper-heuristic for examination timetabling, multidisciplinary scheduling; theory and applications (pp. 309–328). Berlin: Springer.
Kendall, G., Soubeiga, E., & Cowling, P. (2002). Choice function and random hyperheuristics. In Proceedings of the 4th Asia-Pacific conference on simulated evolution and learning (SEAL’02) (pp. 667–671).
Kirkpatrick, S., Gelatt, C.D., & Vecchi, M.P. (1983). Optimization by simulated annealing. Science, 220, 671–680.
Lysov, I.P., Florent’ev, V.L., Khorlin, A.A., Khrapko, K.R., & Shik, V.V. (1988). Determination of the nucleotide sequence of DNA using hybridization with oligonucleotides. A new method. Doklady Akademii Nauk SSSR, 303, 1508–1511.
Mruczkiewicz, W. (2009). Hyper-heuristics for sequencing by hybridisation problem. Master Thesis, Poznan University of Technology, Poland.
Needleman, S. B., Wunsch, C.D. (1970). A general method applicable to the search for similarities of the amino acid sequence of two proteins. Journal of Molecular Biology 48, 443–453.
Pevzner, P. A. (1989). 1-tuple DNA sequencing: computer analysis. Journal of Biomolecular Structure and Dynamics, 7, 63–73.
Ross, P. (2005). Hyper-heuristics. In E. K. Burke & G. Kendall (Eds.), Search methodologies: introductory tutorials in optimization and decision support techniques (pp. 529–556). Berlin: Springer. Chap. 17.
Ross, P., Marin-Blázquez, J. G., Schulenburg, S., & Hart, E. (2003). Learning a procedure that can solve hard bin-packing problems: A new GA-based approach to hyper-heuristics. In Proceedings of the genetic and evolutionary computation conference (pp. 1295–1306). Berlin: Springer.
Southern, E. (1988). United Kingdom Patent Application GB8810400.
Zhang, J. H., LY, Wu, & Zhang, X. S. (2003). Reconstruction of DNA sequencing by hybridization. Bioinformatics, 19(1), 14–21.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Blazewicz, J., Burke, E.K., Kendall, G. et al. A hyper-heuristic approach to sequencing by hybridization of DNA sequences. Ann Oper Res 207, 27–41 (2013). https://doi.org/10.1007/s10479-011-0927-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10479-011-0927-y