Skip to main content

Optimized Column-Oriented Model: A Storage and Search Efficient Representation of Medical Data

  • Conference paper
Information Technology in Bio- and Medical Informatics, ITBAM 2010 (ITBAM 2010)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6266))

Abstract

Medical data have a number of unique characteristics like data sparseness, high dimensionality and rapidly changing set of attributes. Entity Attribute Value (EAV) is the widely used solution to handle the above challenges of medical data, but EAV is neither storage efficient nor search efficient. In this paper, we have proposed a storage & search efficient data model: Optimized Column-Oriented Model (OCOM) for physical representation of high dimensional and sparse data as an alternative of widely used EAV. We have implemented both EAV and OCOM models in a medical data warehousing environment and performed different relational and warehouse queries on both the models. The experimental results show that OCOM is dramatically search efficient and occupy less storage space compared to EAV.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Torben, P.B., Christian, J.S.: Research Issues in Clinical Data Warehousing. In: Proceedings of the 10th International Conference on Scientific and Statistical Database Management, Capri, pp. 43–52 (1998)

    Google Scholar 

  2. Stead, W.W., Hammond, E.W., Straube, J.M.: A chartless record—Is it adequate? Journal of Medical Systems 7(2), 103–109 (1983)

    Article  Google Scholar 

  3. Li, J., Li, M., Deng, H., Duffy, P., Deng, H.: PhD: a web database application for phenotype data management. Oxford Bioinformatics 21(16), 3443–3444 (2005)

    Article  Google Scholar 

  4. Anhøj, J.: Generic design of Web-based clinical databases. Journal of Medical Internet Research 5(4), 27 (2003)

    Article  Google Scholar 

  5. Brandt, C., Deshpande, A., Lu, C.: TrialDB: A Web-based Clinical Study Data Management System. In: Proceedings of the American Medical Informatics Association Annual Symposium, Washington, p. 794 (2003)

    Google Scholar 

  6. Nadkarni, P.M., Brandt, C., Frawley, S., et al.: Managing Attribute—Value Clinical Trials Data Using the ACT/DB Client—Server Database System. The Journal of the American Medical Informatics Association 5(2), 139–151 (1998)

    Article  Google Scholar 

  7. Chen, P.P.: The entity-relationship model—toward a unified view of data. ACM Transactions on Database Systems 1(1), 9–36 (1976)

    Article  Google Scholar 

  8. Hoque, A.S.M.L.: Storage and Querying of High Dimensional Sparsely Populated Data in Compressed Representation. In: Shafazand, H., Tjoa, A.M. (eds.) EurAsia-ICT 2002. LNCS, vol. 2510, pp. 418–425. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  9. Agarwal, S., Agrawal, R., Deshpande, P., et al.: On the Computation of Multidimensional Aggregates. In: Proceedings of the 22th International Conference on Very Large Data Bases, pp. 506–521 (1996)

    Google Scholar 

  10. Jim, G., Surajit, C., Adam, B.: Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab and Sub-Totals. Data Mining and Knowledge Discovery 1(1), 29–53 (1997)

    Article  Google Scholar 

  11. Copeland, G.P.: A decomposition storage model. In: International Conference on Management of Data, Austin, Texas, United States (1985)

    Google Scholar 

  12. Stonebraker, M., Abadi, D.J., Batkin, A., et al.: C-Store: A column-oriented DBMS. In: Proceedings of the 31st VLDB Conference, Trondheim, Norway (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Paul, R., Latiful Hoque, A.S.M. (2010). Optimized Column-Oriented Model: A Storage and Search Efficient Representation of Medical Data. In: Khuri, S., Lhotská, L., Pisanti, N. (eds) Information Technology in Bio- and Medical Informatics, ITBAM 2010. ITBAM 2010. Lecture Notes in Computer Science, vol 6266. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15020-3_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15020-3_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15019-7

  • Online ISBN: 978-3-642-15020-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics