Skip to main content

Compact Representation of Biological Sequences Using Set Decision Diagrams

  • Conference paper

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 154))

Abstract

Nowadays, the exponential availability of biological sequences and the complexity of the computational methods that use them as input motivate the research of new compact representations. To this end, we propose an alternative method for storing sets of sequences based on set decision diagrams instead of classical compression techniques. The set decision diagrams are an extension of the reduced ordered binary decision diagrams, a graph data structure used as a symbolic compact representation of sets or relations between sets. Some experiments with genes of the mitochondrion DNA support the feasibility of our approach.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Benson, D.A., Karsch-Mizrachi, I., Lipman, D.J., Ostell, J., Sayers, E.W.: GenBank. Nucleic. Acids Res. 51, D46–D51 (2010)

    Google Scholar 

  2. Bollig, B., Wegener, I.: Improving the variable ordering of OBDDs Is NP-Complete. IEEE Trans. Comput. 45, 993–1002 (1996)

    Article  MATH  Google Scholar 

  3. Bryant, R.E.: Symbolic manipulation of boolean functions using a graphical representation. In: Proc. of the 22nd ACM/IEEE Design Automation Conference, pp. 688–694. ACM (1985)

    Google Scholar 

  4. Couvreur, J.-M., Encrenaz, E., Paviot-Adet, E., Poitrenaud, D., Wacrenier, P.-A.: Data Decision Diagrams for Petri Net Analysis. In: Esparza, J., Lakos, C.A. (eds.) ICATPN 2002. LNCS, vol. 2360, pp. 101–120. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  5. Couvreur, J.-M., Thierry-Mieg, Y.: Hierarchical decision diagrams to exploit model structure. In: Wang, F. (ed.) FORTE 2005. LNCS, vol. 3731, pp. 443–457. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  6. Denzumi, S., Arimura, H., Minato, S.: Implementation of sequence BDDs in Erlang. In: Proceedings of the 10th ACM SIGPLAN W. on Erlang, pp. 90–91. ACM (2011)

    Google Scholar 

  7. Denzumi, S., Yoshinaka, R., Arimura, H., Minato, S.: Notes on Sequence Binary Decision Diagrams: Relationship to Acyclic Automata and Complexities of Binary Set Operations. In: Holub, J., et al. (eds.) Proc. of the Prague Stringology Conference 2011, pp. 147–161, Prague Stringology Club (2011)

    Google Scholar 

  8. Giancarlo, R., Scaturro, D., Utro, F.: Textual data compression in computational biology: a synopsis. Bioinformatics 25, 1575–1586 (2009)

    Article  Google Scholar 

  9. LIP6/Move: the libDDD environment (2011), http://ddd.lip6.fr

  10. Loekito, E., Bailey, J., Pei, J.: A binary decision diagram based approach for mining frequent subsequences. Knowl. Inf. Syst. 24, 235–268 (2010)

    Article  Google Scholar 

  11. Meinel, C., Theobald, T.: Algorithms and data structures in VLSI design: OBDD - Foundations and applications. Springer, Heidelberg (1998)

    Book  Google Scholar 

  12. Miller, D.M.: Multiple-valued logic design tools. In: Proceedings of the 23rd Int. Symposium on Multiple-Valued Logic, pp. 2–11. IEEE (1993)

    Google Scholar 

  13. Nalbantoglu, Ö.U., Russell, D.J., Sayood, K.: Data compression concepts and algorithms and their applications to bioinformatics. Entropy 12, 34–52 (2009)

    Article  Google Scholar 

  14. Polanski, A., Kimmel, M.: Bioinformatics. Springer, Heidelberg (2007)

    MATH  Google Scholar 

  15. Requeno, J.I.: Análisis filogenético mediante lógica temporal y model checking. Master Thesis in Computer Science and Engineering, University of Zaragoza (2010)

    Google Scholar 

  16. Requeno, J.I., Blanco, R., de Miguel Casado, G., Colom, J.M.: Phylogenetic Analysis Using an SMV Tool. In: Rocha, M.P., Rodríguez, J.M.C., Fdez-Riverola, F., Valencia, A. (eds.) 5th International Conference on Practical Applications of Computational Biology & Bioinformatics (PACBB 2011). AISC, vol. 93, pp. 167–174. Springer, Heidelberg (2011)

    Google Scholar 

  17. Rudell, R.: Dynamic variable ordering for ordered binary decision diagrams. In: Proc. of the 1993 IEEE/ACM Int. Conf. on Computer-aided Design, pp. 42–47. IEEE (1993)

    Google Scholar 

  18. Strelets, V.B., Lim, H.A.: Compression of protein sequence databases. Comput. Appl. Biosci. 11, 557–561 (1995)

    Google Scholar 

  19. White, W.T., Hendy, M.: Compressing DNA sequence databases with coil. BMC Bioinformatics 9, 242–257 (2008)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to José Ignacio Requeno .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Requeno, J.I., Colom, J.M. (2012). Compact Representation of Biological Sequences Using Set Decision Diagrams. In: Rocha, M., Luscombe, N., Fdez-Riverola, F., Rodríguez, J. (eds) 6th International Conference on Practical Applications of Computational Biology & Bioinformatics. Advances in Intelligent and Soft Computing, vol 154. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28839-5_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-28839-5_27

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-28838-8

  • Online ISBN: 978-3-642-28839-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics