A hybrid particle swarm optimization based memetic algorithm for DNA sequence compression

  • Methodologies and Application
Soft Computing

Abstract

Recent advances in high-throughput sequencing technologies have confronted biomedical research with ever-increasing quantities of data, making the storage and transmission of these data a major concern. We therefore present a novel hybrid particle swarm optimization based memetic algorithm (HPMA) for DNA sequence compression. Within the memetic algorithm framework, HPMA uses a dynamic comprehensive learning particle swarm optimization method for global search, while two adaptive local search operators, a center symmetry mutation differential evolution operator and an adaptive chaotic search operator, work cooperatively. HPMA searches for the globally optimal code book based on the extended approximate repeat vector, and this code book is then used to compress the DNA sequence. Experiments were conducted on 19 high-dimensional functions and 11 real DNA sequences. The results show that HPMA is more competitive in both performance and scalability, and that it achieves better compression than other representative DNA-specific algorithms on DNA sequence data.
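
The memetic structure summarized in the abstract, a swarm-based global search whose best solution is refined by a local search, can be illustrated with a minimal sketch. This is not the authors' HPMA. It assumes a simplified comprehensive learning velocity update (each dimension learns from the personal best of the particle itself or of a tournament winner), a logistic-map chaotic perturbation as the only local search operator, and a toy sphere objective in place of the code book fitness. The differential evolution operator is omitted, and every name and parameter value here (`memetic_pso`, `w`, `c`, `pc`, the radius schedule) is hypothetical.

```python
import random


def sphere(x):
    """Toy objective (an assumption; the paper optimizes a code book)."""
    return sum(v * v for v in x)


def memetic_pso(f, dim=5, swarm=20, iters=200, lo=-5.0, hi=5.0, seed=0):
    """Simplified comprehensive-learning PSO plus chaotic local search."""
    rng = random.Random(seed)
    w, c, pc = 0.7, 1.5, 0.3  # inertia, acceleration, learning probability

    def clip(v):
        return min(hi, max(lo, v))

    xs = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(swarm)]
    vs = [[0.0] * dim for _ in range(swarm)]
    pbest = [x[:] for x in xs]
    pcost = [f(x) for x in xs]
    z = 0.37  # state of the logistic map, kept in (0, 1)

    for t in range(iters):
        # Global search: each dimension follows one exemplar's personal best.
        for i in range(swarm):
            for d in range(dim):
                if rng.random() < pc:
                    a, b = rng.randrange(swarm), rng.randrange(swarm)
                    ex = a if pcost[a] < pcost[b] else b  # tournament winner
                else:
                    ex = i  # learn from the particle's own best
                vs[i][d] = w * vs[i][d] + c * rng.random() * (pbest[ex][d] - xs[i][d])
                xs[i][d] = clip(xs[i][d] + vs[i][d])
            cost = f(xs[i])
            if cost < pcost[i]:
                pbest[i], pcost[i] = xs[i][:], cost

        # Local search: chaotic perturbation of the best personal best,
        # with a search radius that shrinks linearly over the run.
        g = min(range(swarm), key=pcost.__getitem__)
        radius = 0.1 * (hi - lo) * (1.0 - t / iters)
        cand = []
        for d in range(dim):
            z = 4.0 * z * (1.0 - z)  # logistic map
            cand.append(clip(pbest[g][d] + radius * (2.0 * z - 1.0)))
        ccost = f(cand)
        if ccost < pcost[g]:
            # Write the refined solution back so the swarm can learn from it.
            pbest[g], pcost[g] = cand, ccost

    g = min(range(swarm), key=pcost.__getitem__)
    return pbest[g], pcost[g]


if __name__ == "__main__":
    best, cost = memetic_pso(sphere)
    print("best cost found:", cost)
```

Because personal bests are replaced only by strictly better candidates, the returned cost can never be worse than the best cost in the initial swarm. Writing the refined candidate back into the swarm is one common design choice for memetic algorithms; whether HPMA does the same is not stated in the abstract.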

Acknowledgments

This work is supported by Guangdong Natural Science Foundation of P. R. China (Grant: 91510641 01000037).

Author information

Corresponding author

Correspondence to Li Tan.

Additional information

Communicated by V. Loia.

About this article

Cite this article

Tan, L., Sun, J. & Tong, X. A hybrid particle swarm optimization based memetic algorithm for DNA sequence compression. Soft Comput 19, 1255–1268 (2015). https://doi.org/10.1007/s00500-014-1338-1

  • DOI: https://doi.org/10.1007/s00500-014-1338-1
