Abstract
In the health care industry, DICOM (Digital Imaging and Communication in Medicine) standard has become very popular for storage and transmission of digital medical images and reports. The ever-increasing size, high velocity and variety of the DICOM data collections make them more and more inefficient to be stored and queried them using a single data storage technique, e.g., a row store or a column store. In this study, we first highlight challenges in DICOM data management. We then describe HYTORMO, a new model to store and query the DICOM data. HYTORMO uses a hybrid data storage strategy that is aimed not only to leverage the advantage of both row and column stores, but also to attempt to keep a trade-off among reducing disk I/O cost, reducing tuple construction cost and reducing storage space. In addition, Bloom filters are applied to reduce network I/O cost during query processing. We prototyped our model on the top of Spark. Our preliminary experiments validate the proposed model in real DICOM datasets and show the effectiveness of our method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Pianykh, O.S.: Digital Imaging and Communications in Medicine (DICOM): A Practical Introduction and Survival Guide. Springer, Heidelberg (2008)
Merelli, I., et al.: Managing, analysing, and integrating big data in medical bioinformatics: open problems and future perspectives. BioMed. Res. Int. 1–13 (2014)
Power, D., Politou, E., Slaymaker, M., Harris, S., et al.: A relational approach to the capture of DICOM files for grid-enabled medical imaging databases. In: SAC, pp. 272–279 (2004)
Annamalai, M., Guo, D., Susan, M., Steiner, J.: An oracle white paper: oracle database 11 g DICOM medical image support (2009)
Savaris, A., Härder, T., von Wangenheim, A.: DCMDSM: a DICOM decomposed storage model. J. Am. Med. Inform. Assoc. 21, 917–924 (2014)
Rascovsky, S.J., et al.: Informatics in radiology: use of CouchDB for document-based storage of DICOM objects. Radiographics 32, 913–927 (2012)
Boncz, P., et al.: MonetDB/X100: hyper-pipelining query execution. In: CIDR (2005)
Stonebraker, M., et al.: C-store: a column-oriented DBMS. In: VLDB, pp. 553–564 (2005)
Ramamurthy, R., DeWitt, D.: A case for fractured mirrors. VLDB 12, 89–101 (2003)
Grund, M., et al.: HYRISE: a main memory hybrid storage engine. VLDB 4, 105–116 (2010)
Bloom, B.H.: Space/time trade-offs in hash coding with allowable errors. Commun. ACM 13, 422–426 (1970)
Phan, T.C., Orazio, L.D., Rigaux, P.: Toward intersection filter-based optimization for joins in MapReduce. In: Workshop Proceedings of the Cloud-I (2013)
OECD: Genetic Testing: A Survey of Quality Assurance and Proficiency Standards. OECD Publishing, Paris (2007)
Armbrust, M., et al.: Spark SQL: relational data processing in spark. In: SIGMOD (2015)
Steinbrunn, M., Moerkotte, G., Kemper, A.: Heuristic and randomized optimization for the join ordering problem. VLDB J. 6, 191–208 (1997)
Broder, A., Mitzenmacher, M.: Network applications of bloom filters: a survey. Internet Math. 1(4), 485–509 (2004)
CT Colonography. https://idash.ucsd.edu. Accessed 11 Oct 2015
David Clunie’s Medical Image Format Site. http://www.dclunie.com. Accessed Oct 2015
Sample Data. http://idoimaging.com/wiki/. Accessed 12 Oct 2015
Lung Cancer Datasets. http://giveascan.org. Accessed 11 Oct 2015
MIDAS Datasets. http://www.insight-journal.org. Accessed 12 Oct 2015
Open Source Clinical Image and Object Management. http://www.dcm4che.org
White, T.: Hadoop: The Definitive Guide. 4th edn. O’Reilly Media, Inc., California (2015)
TPC-H specification 2.8.0. http://www.tpc.org/tpch/
Möller, M., Mukherjee, S.: Context-driven ontological annotations in DICOM images: towards semantic PACS. In: Proceedings of HEALTHINF (2009)
Copeland, G., Khoshafian, S.: A decomposed storage model. In: SIGMOD (1985)
Harizopoulos, S., et al.: Performance tradeoffs in read-optimized databases. In: VLDB (2006)
Floratou, A., Minhas, U.F., Özcan, F.: SQL-on-Hadoop: full circle back to shared-nothing database architectures. VLDB 7, 1295–1306 (2014)
Popescu, A.D., Dash, D., Kantere, V., Ailamaki, A.: Adaptive query execution for data management in the cloud. In: CloudDB, pp. 17–24 (2010)
Rösch, P., Dannecker, L., Färber, F., Hackenbroich, G.: A storage advisor for hybrid-store databases. Proc. VLDB 5(12), 1748–1758 (2012)
Szalay, A.S., et al.: The SDSS Skyserver: public access to the sloan digital sky server data. In: Proceedings of SIGMOD, pp. 570–581. ACM (2002)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Nguyen-Cong, D., d’Orazio, L., Tran, N., Hacid, MS. (2017). Storing and Querying DICOM Data with HYTORMO. In: Wang, F., Yao, L., Luo, G. (eds) Data Management and Analytics for Medicine and Healthcare. DMAH 2016. Lecture Notes in Computer Science(), vol 10186. Springer, Cham. https://doi.org/10.1007/978-3-319-57741-8_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-57741-8_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-57740-1
Online ISBN: 978-3-319-57741-8
eBook Packages: Computer ScienceComputer Science (R0)