Skip to main content

Storing and Querying DICOM Data with HYTORMO

  • Conference paper
  • First Online:
Data Management and Analytics for Medicine and Healthcare (DMAH 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10186))

  • 546 Accesses

Abstract

In the health care industry, DICOM (Digital Imaging and Communication in Medicine) standard has become very popular for storage and transmission of digital medical images and reports. The ever-increasing size, high velocity and variety of the DICOM data collections make them more and more inefficient to be stored and queried them using a single data storage technique, e.g., a row store or a column store. In this study, we first highlight challenges in DICOM data management. We then describe HYTORMO, a new model to store and query the DICOM data. HYTORMO uses a hybrid data storage strategy that is aimed not only to leverage the advantage of both row and column stores, but also to attempt to keep a trade-off among reducing disk I/O cost, reducing tuple construction cost and reducing storage space. In addition, Bloom filters are applied to reduce network I/O cost during query processing. We prototyped our model on the top of Spark. Our preliminary experiments validate the proposed model in real DICOM datasets and show the effectiveness of our method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Pianykh, O.S.: Digital Imaging and Communications in Medicine (DICOM): A Practical Introduction and Survival Guide. Springer, Heidelberg (2008)

    Google Scholar 

  2. Merelli, I., et al.: Managing, analysing, and integrating big data in medical bioinformatics: open problems and future perspectives. BioMed. Res. Int. 1–13 (2014)

    Google Scholar 

  3. Power, D., Politou, E., Slaymaker, M., Harris, S., et al.: A relational approach to the capture of DICOM files for grid-enabled medical imaging databases. In: SAC, pp. 272–279 (2004)

    Google Scholar 

  4. Annamalai, M., Guo, D., Susan, M., Steiner, J.: An oracle white paper: oracle database 11 g DICOM medical image support (2009)

    Google Scholar 

  5. Savaris, A., Härder, T., von Wangenheim, A.: DCMDSM: a DICOM decomposed storage model. J. Am. Med. Inform. Assoc. 21, 917–924 (2014)

    Article  Google Scholar 

  6. Rascovsky, S.J., et al.: Informatics in radiology: use of CouchDB for document-based storage of DICOM objects. Radiographics 32, 913–927 (2012)

    Article  Google Scholar 

  7. Boncz, P., et al.: MonetDB/X100: hyper-pipelining query execution. In: CIDR (2005)

    Google Scholar 

  8. Stonebraker, M., et al.: C-store: a column-oriented DBMS. In: VLDB, pp. 553–564 (2005)

    Google Scholar 

  9. Ramamurthy, R., DeWitt, D.: A case for fractured mirrors. VLDB 12, 89–101 (2003)

    Article  Google Scholar 

  10. Grund, M., et al.: HYRISE: a main memory hybrid storage engine. VLDB 4, 105–116 (2010)

    Google Scholar 

  11. Bloom, B.H.: Space/time trade-offs in hash coding with allowable errors. Commun. ACM 13, 422–426 (1970)

    Article  MATH  Google Scholar 

  12. Phan, T.C., Orazio, L.D., Rigaux, P.: Toward intersection filter-based optimization for joins in MapReduce. In: Workshop Proceedings of the Cloud-I (2013)

    Google Scholar 

  13. OECD: Genetic Testing: A Survey of Quality Assurance and Proficiency Standards. OECD Publishing, Paris (2007)

    Google Scholar 

  14. Armbrust, M., et al.: Spark SQL: relational data processing in spark. In: SIGMOD (2015)

    Google Scholar 

  15. Steinbrunn, M., Moerkotte, G., Kemper, A.: Heuristic and randomized optimization for the join ordering problem. VLDB J. 6, 191–208 (1997)

    Article  Google Scholar 

  16. Broder, A., Mitzenmacher, M.: Network applications of bloom filters: a survey. Internet Math. 1(4), 485–509 (2004)

    Article  MathSciNet  MATH  Google Scholar 

  17. CT Colonography. https://idash.ucsd.edu. Accessed 11 Oct 2015

  18. David Clunie’s Medical Image Format Site. http://www.dclunie.com. Accessed Oct 2015

  19. Sample Data. http://idoimaging.com/wiki/. Accessed 12 Oct 2015

  20. Lung Cancer Datasets. http://giveascan.org. Accessed 11 Oct 2015

  21. MIDAS Datasets. http://www.insight-journal.org. Accessed 12 Oct 2015

  22. Open Source Clinical Image and Object Management. http://www.dcm4che.org

  23. White, T.: Hadoop: The Definitive Guide. 4th edn. O’Reilly Media, Inc., California (2015)

    Google Scholar 

  24. TPC-H specification 2.8.0. http://www.tpc.org/tpch/

  25. Möller, M., Mukherjee, S.: Context-driven ontological annotations in DICOM images: towards semantic PACS. In: Proceedings of HEALTHINF (2009)

    Google Scholar 

  26. Copeland, G., Khoshafian, S.: A decomposed storage model. In: SIGMOD (1985)

    Google Scholar 

  27. Harizopoulos, S., et al.: Performance tradeoffs in read-optimized databases. In: VLDB (2006)

    Google Scholar 

  28. Floratou, A., Minhas, U.F., Özcan, F.: SQL-on-Hadoop: full circle back to shared-nothing database architectures. VLDB 7, 1295–1306 (2014)

    Google Scholar 

  29. Popescu, A.D., Dash, D., Kantere, V., Ailamaki, A.: Adaptive query execution for data management in the cloud. In: CloudDB, pp. 17–24 (2010)

    Google Scholar 

  30. Rösch, P., Dannecker, L., Färber, F., Hackenbroich, G.: A storage advisor for hybrid-store databases. Proc. VLDB 5(12), 1748–1758 (2012)

    Article  Google Scholar 

  31. Szalay, A.S., et al.: The SDSS Skyserver: public access to the sloan digital sky server data. In: Proceedings of SIGMOD, pp. 570–581. ACM (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Danh Nguyen-Cong .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Nguyen-Cong, D., d’Orazio, L., Tran, N., Hacid, MS. (2017). Storing and Querying DICOM Data with HYTORMO. In: Wang, F., Yao, L., Luo, G. (eds) Data Management and Analytics for Medicine and Healthcare. DMAH 2016. Lecture Notes in Computer Science(), vol 10186. Springer, Cham. https://doi.org/10.1007/978-3-319-57741-8_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-57741-8_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-57740-1

  • Online ISBN: 978-3-319-57741-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics