skip to main content
10.1145/2618243.2618273acmotherconferencesArticle/Chapter ViewAbstractPublication PagesssdbmConference Proceedingsconference-collections
research-article

Data movement in hybrid analytic systems: a case for automation

Published:30 June 2014Publication History

ABSTRACT

Hybrid data analysis systems integrate an analytic tool and a data management tool. While hybrid systems have benefits, in order to be effective data movement between the two hybrid components must be minimized. Through experimental results we demonstrate that under workloads whose inputs vary in size, shape, and location, automation is the only practical way to manage data movement in hybrid systems.

References

  1. Park, J., Bikshandi, G., Vaidyanathan, K., Tang, P., Dubey, P., and Kim, D. 2013. Tera-Scale 1D FFT with Low-Communication Algorithm and Intel Xeon Phi Coprocessors. Proceedings of SC13. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Tiwari, D., Vazhkudai, S., Kim, Y., Ma, X., Boboila, S. and Desnoyers, P. 2012. Reducing data movement costs using energy-efficient, active computation on SSD. USENIX 2012, 1--5. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Ihaka, R., and Gentelman, R. 1996. R: A language for data analysis and graphics. Journal of Computational and Graphical Statistics, 299--314.Google ScholarGoogle Scholar
  4. Stonebraker, M., Brown, P., Poliakov, A., Raman, S. 2011. The Architecture of SciDB. Proceedings of SSDBM 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Leyshock, P., Maier, D., and Tufte, K. 2013. Agrios: A hybrid approach to big array analytics. IEEE International Conference on Big Data, 85--93.Google ScholarGoogle Scholar
  6. Yi, Z., Herodotou, H., and Yang, J. 2009. RIOT: I/O-efficient numerical computing without SQL. CIDR 2009, 1--11.Google ScholarGoogle Scholar
  7. Grosse, P., Lehner, W., Weichert, T., Farber, F., and Li, W. S. 2011. Bridging two worlds with RICE. Proceedings of the VLDB Endowment, 1307--1317.Google ScholarGoogle Scholar
  8. Das, S., Simanis, Y., Beyer, K. S., Gemulla, R., Haas, P. J., and McPherson, J. 2011. Ricardo: Integrating R and Hadoop. Proceedings of the 2010 International Conference on Management of Data, 987--998 Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Data movement in hybrid analytic systems: a case for automation

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      SSDBM '14: Proceedings of the 26th International Conference on Scientific and Statistical Database Management
      June 2014
      417 pages
      ISBN:9781450327220
      DOI:10.1145/2618243

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 30 June 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      SSDBM '14 Paper Acceptance Rate26of71submissions,37%Overall Acceptance Rate56of146submissions,38%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader