Skip to main content

UNION: An Efficient Mapping Tool Using UniMark with Non-overlapping Interval Indexing Strategy

  • Conference paper
Database Theory and Application, Bio-Science and Bio-Technology (BSBT 2011, DTA 2011)

Abstract

NGS has become a popular research field in biologists because it was able to produce inexpensive and accuracy short biology sequences very fast. NGS technique has been improved to produce long length sequences, more than 100bp, recently with the same quality, accuracy and speed. Thus, tools for short sequences may be not suitable for long length sequences. We propose a new tool called UNION for re-sequencing applications by mapping long length sequences to a reference genome. UNION uses the UniMarker with a non-overlapping interval indexing strategy and a tool, CORAL, to do sequence alignments. For the experiments we randomly cut ten thousands sequences with a length of 512bp from the genome of Trichomonas and also produce mutations/sequence errors for these sequences to simulate different similarities. UNION has been compared with GMAP in terms of speed and accuracy and achieves better performance than that of GMAP.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990)

    Article  Google Scholar 

  2. Chao, K.M., Zhang, J., Ostell, J., Miller, W.: A tool for aligning very similar DNA sequences. Comput. Appl. Biosci. 13, 75–80 (1997)

    Google Scholar 

  3. Zhang, J., Madden, T.L.: PowerBLAST: A new network BLAST application for interactive or automated sequence analysis and annotation. Genome Res. 7, 649–656 (1997)

    Google Scholar 

  4. Florea, L., Hartzell, G., Zhang, Z., Rubin, G.M., Miller, W.: A computer program for aligning a cDNA sequence with a genomic DNA sequence. Genome Res. 8, 967–974 (1998)

    Google Scholar 

  5. Delcher, A.L., Kasif, S., Fleischmann, R.D., Peterson, J., White, O., Salzberg, S.L.: Alignment of whole genomes. Nucleic Acids Res. 27, 2369–2376 (1999)

    Article  Google Scholar 

  6. Zhang, Z., Schwartz, S., Wagner, L., Miller, W.: A greedy algorithm for aligning DNA sequences. J. Comput. Biol. 7, 203–214 (2000)

    Article  Google Scholar 

  7. Chen, L.Y.Y., Lu, S.H., Shih, E.S.C., Hwang, M.J.: Single nucleotide polymorphism mapping using genome-wide unique sequences. Genome Res. 12, 1106–1111 (2002)

    Article  Google Scholar 

  8. Hsu, F.R., Chen, J.F.: Aligning ESTs to Genome Using Multi-Layer Unique Makers. In: Proceedings of the Computational Systems Bioinformatics, CSB 2003 (2003)

    Google Scholar 

  9. Wu, T.D., Watanabe, C.K.: GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics 21(9), 1859–1875 (2005)

    Article  Google Scholar 

  10. James Kent, W.: BLAT—The BLAST-Like Alignment Tool. Genome Research 12, 656–664 (2002)

    Article  Google Scholar 

  11. Hung, C.L., Lin, C.Y., Chung, Y.C., Hsieh, S.J., Tang, C.Y.: Comparative Exon Prediction based on Heuristic Coding Region Alignment. In: Proceedings of the International Symposium on Parallel Architectures, Algorithms, and Networks, pp. 14–19 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hung, CL., Lin, CY., Hu, YC. (2011). UNION: An Efficient Mapping Tool Using UniMark with Non-overlapping Interval Indexing Strategy. In: Kim, Th., et al. Database Theory and Application, Bio-Science and Bio-Technology. BSBT DTA 2011 2011. Communications in Computer and Information Science, vol 258. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27157-1_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-27157-1_21

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-27156-4

  • Online ISBN: 978-3-642-27157-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics