skip to main content
10.1145/1739041.1739125acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article

Xbase: cloud-enabled information appliance for healthcare

Published: 22 March 2010 Publication History

Abstract

XML is a more desirable format for modeling and storing clinical data in EMR (Electronic medical record) applications for its extendibility; however, existing EMR systems either are built on top of RDBMS or file systems or lack of support for complex and large scale healthcare applications, such as treatment effectiveness analysis and procedure optimization. SAP Technology Lab, China is developing a clouds-enabled information appliance, Xbase, built on top of Hadoop, which is the first XML-based information appliance designed specifically for large scale and complex healthcare applications. XML presents a different set of challenges for query processing, indexing, parallelism, and distributed computing using existing Hadoop's APIs as well as its HDFS storage infrastructure and MapReduce framework. In this paper, we describe system architecture and internal designs of Xbase as well as how the indexing is mapped to RDBMS and Hadoop. We also discuss why we select Hadoop over other candidates, such as Hbase, Google's Bigtable, and Hive.

References

[1]
B. H. Bloom. Space/time trade-offs in hash coding with allowable errors. Communications of the ACM, 13(7):422--426, 1970.
[2]
J. Dean and S. Ghemawat. Mapreduce: Simplified data processing on large clusters. In Proceedings of the Sixth Symposium on Operating System Design and Implementation, San Fransisco, CA, USA, 2004.
[3]
A. Gates, O. Natkovich, S. Chopra, P. Kamath, S. Narayanam, C. Olston, B. Reed, S. Srinivasan, and U. Srivastava. Building a highlevel dataflow system on top of mapreduce: The pig experience. PVLDB, 2(2):1414--1425, 2009.
[4]
R. Goldman and J. Widom. Dataguides: Enabling query formulation and optimization in semistructured databases. In VLDB, pages 436--445, 1997.
[5]
G. Gou and R. Chirkova. Efficiently querying large xml data repositories: A survey. IEEE Trans. Knowl. Data Eng., 19(10):1381--1403, 2007.
[6]
R. Kaushik, P. Bohannon, J. F. Naughton, and H. F. Korth. Covering indexes for branching path queries. In SIGMOD Conference, pages 133--144, 2002.
[7]
R. Kaushik, P. Shenoy, P. Bohannon, and E. Gudes. Exploiting local similarity for indexing paths in graph-structured data. In ICDE, pages 129--140, 2002.
[8]
T. Milo and D. Suciu. Index structures for path expressions. In ICDT, pages 277--295, 1999.
[9]
C. Olston, B. Reed, U. Srivastava, R. Kumar, and A. Tomkins. Pig latin: a not-so-foreign language for data processing. In J. T.-L. Wang, editor, SIGMOD Conference, pages 1099--1110. ACM, 2008.
[10]
The Apache Software Foundation. Information available at http://hadoop.apache.org.
[11]
The Apache Software Foundation. Information available at http://hadoop.apache.org/hbase.
[12]
The Apache Software Foundation. Information available at http://hadoop.apache.org/hdfs.
[13]
The Apache Software Foundation. Information available at http://hadoop.apache.org/hive.
[14]
A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, S. Anthony, H. Liu, P. Wyckoff, and R. Murthy. Hive - a warehousing solution over a map-reduce framework. PVLDB, 2(2):1626--1629, 2009.
[15]
W. Wang, H. Wang, H. Lu, H. Jiang, X. Lin, and J. Li. Efficient processing of xml path queries using the disk-based f&b index. In VLDB, pages 145--156, 2005.

Cited By

View all
  • (2016)A Novel Storage Architecture for Facilitating Efficient Analytics of Health Informatics Big Data in Cloud2016 IEEE International Conference on Computer and Information Technology (CIT)10.1109/CIT.2016.86(578-585)Online publication date: Dec-2016
  • (2016)An effective model for store and retrieve big health data in cloud computingComputer Methods and Programs in Biomedicine10.1016/j.cmpb.2016.04.016132:C(75-82)Online publication date: 1-Aug-2016
  • (2015)Sharing Medical Information by Means of Using Intelligent Agents and Cloud ComputingCloud Technology10.4018/978-1-4666-6539-2.ch042(889-919)Online publication date: 2015
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
EDBT '10: Proceedings of the 13th International Conference on Extending Database Technology
March 2010
741 pages
ISBN:9781605589459
DOI:10.1145/1739041
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 March 2010

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Conference

EDBT/ICDT '10
EDBT/ICDT '10: EDBT/ICDT '10 joint conference
March 22 - 26, 2010
Lausanne, Switzerland

Acceptance Rates

Overall Acceptance Rate 7 of 10 submissions, 70%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2016)A Novel Storage Architecture for Facilitating Efficient Analytics of Health Informatics Big Data in Cloud2016 IEEE International Conference on Computer and Information Technology (CIT)10.1109/CIT.2016.86(578-585)Online publication date: Dec-2016
  • (2016)An effective model for store and retrieve big health data in cloud computingComputer Methods and Programs in Biomedicine10.1016/j.cmpb.2016.04.016132:C(75-82)Online publication date: 1-Aug-2016
  • (2015)Sharing Medical Information by Means of Using Intelligent Agents and Cloud ComputingCloud Technology10.4018/978-1-4666-6539-2.ch042(889-919)Online publication date: 2015
  • (2015)Cloud-Based Data and Knowledge Management for Multi-Centre Biomedical StudiesProceedings of the 8th International Conference on Knowledge Capture10.1145/2815833.2816949(1-4)Online publication date: 7-Oct-2015
  • (2015)Cloud-enabled real-time platform for adaptive planning and control in auction logistics centerComputers and Industrial Engineering10.1016/j.cie.2014.11.00584:C(79-90)Online publication date: 1-Jun-2015
  • (2014)Sharing Medical Information by Means of Using Intelligent Agents and Cloud ComputingCloud Computing Applications for Quality Health Care Delivery10.4018/978-1-4666-6118-9.ch008(140-170)Online publication date: 2014
  • (2013)Large-Scale Clinical Data Management and Analysis System Based on Cloud ComputingFrontier and Future Development of Information Technology in Medicine and Education10.1007/978-94-007-7618-0_177(1575-1583)Online publication date: 6-Dec-2013
  • (2012)Cloud Computing Based PHR Architecture Using Multi Layers ModelJournal of Software Engineering and Applications10.4236/jsea.2012.53110505:11(903-911)Online publication date: 2012
  • (2012)Towards a hybrid row-column database for a cloud-based medical data management systemProceedings of the 1st International Workshop on Cloud Intelligence10.1145/2347673.2347675(1-4)Online publication date: 31-Aug-2012
  • (2012)Cloud technology and EHR data management2012 IEEE 6th International Conference on Information and Automation for Sustainability10.1109/ICIAFS.2012.6419891(112-117)Online publication date: Sep-2012

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media