Abstract
Medical data have a number of unique characteristics like data sparseness, high dimensionality and rapidly changing set of attributes. Entity Attribute Value (EAV) is the widely used solution to handle the above challenges of medical data, but EAV is neither storage efficient nor search efficient. In this paper, we have proposed a storage & search efficient data model: Optimized Column-Oriented Model (OCOM) for physical representation of high dimensional and sparse data as an alternative of widely used EAV. We have implemented both EAV and OCOM models in a medical data warehousing environment and performed different relational and warehouse queries on both the models. The experimental results show that OCOM is dramatically search efficient and occupy less storage space compared to EAV.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Torben, P.B., Christian, J.S.: Research Issues in Clinical Data Warehousing. In: Proceedings of the 10th International Conference on Scientific and Statistical Database Management, Capri, pp. 43–52 (1998)
Stead, W.W., Hammond, E.W., Straube, J.M.: A chartless record—Is it adequate? Journal of Medical Systems 7(2), 103–109 (1983)
Li, J., Li, M., Deng, H., Duffy, P., Deng, H.: PhD: a web database application for phenotype data management. Oxford Bioinformatics 21(16), 3443–3444 (2005)
Anhøj, J.: Generic design of Web-based clinical databases. Journal of Medical Internet Research 5(4), 27 (2003)
Brandt, C., Deshpande, A., Lu, C.: TrialDB: A Web-based Clinical Study Data Management System. In: Proceedings of the American Medical Informatics Association Annual Symposium, Washington, p. 794 (2003)
Nadkarni, P.M., Brandt, C., Frawley, S., et al.: Managing Attribute—Value Clinical Trials Data Using the ACT/DB Client—Server Database System. The Journal of the American Medical Informatics Association 5(2), 139–151 (1998)
Chen, P.P.: The entity-relationship model—toward a unified view of data. ACM Transactions on Database Systems 1(1), 9–36 (1976)
Hoque, A.S.M.L.: Storage and Querying of High Dimensional Sparsely Populated Data in Compressed Representation. In: Shafazand, H., Tjoa, A.M. (eds.) EurAsia-ICT 2002. LNCS, vol. 2510, pp. 418–425. Springer, Heidelberg (2002)
Agarwal, S., Agrawal, R., Deshpande, P., et al.: On the Computation of Multidimensional Aggregates. In: Proceedings of the 22th International Conference on Very Large Data Bases, pp. 506–521 (1996)
Jim, G., Surajit, C., Adam, B.: Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab and Sub-Totals. Data Mining and Knowledge Discovery 1(1), 29–53 (1997)
Copeland, G.P.: A decomposition storage model. In: International Conference on Management of Data, Austin, Texas, United States (1985)
Stonebraker, M., Abadi, D.J., Batkin, A., et al.: C-Store: A column-oriented DBMS. In: Proceedings of the 31st VLDB Conference, Trondheim, Norway (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Paul, R., Latiful Hoque, A.S.M. (2010). Optimized Column-Oriented Model: A Storage and Search Efficient Representation of Medical Data. In: Khuri, S., Lhotská, L., Pisanti, N. (eds) Information Technology in Bio- and Medical Informatics, ITBAM 2010. ITBAM 2010. Lecture Notes in Computer Science, vol 6266. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15020-3_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-15020-3_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15019-7
Online ISBN: 978-3-642-15020-3
eBook Packages: Computer ScienceComputer Science (R0)