skip to main content
10.1145/2926534.2926541acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
short-paper

Toward elastic memory management for cloud data analytics

Published:26 June 2016Publication History

ABSTRACT

We present several key elements towards elastic memory management in modern big data systems. The goal of our approach is to avoid out-of-memory failures without over-provisioning but also to avoid garbage-collection overheads when possible.

References

  1. Tungsten: Memory management and binary processing on spark. https://databricks.com/blog/2015/04/28/project-tungsten-bringing-spark-closer-to-bare-metal.html.Google ScholarGoogle Scholar
  2. Memory management in the Java HotSpot™virtual machine. http://www.oracle.com/technetwork/java/javase/memorymanagement-whitepaper-150215.pdf, 2006.Google ScholarGoogle Scholar
  3. N. Anciaux et al. Memory requirements for query execution in highly constrained devices. In VLDB, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. K. P. Brown et al. Managing memory to meet multiclass workload response time goals. In VLDB, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. C. Chen et al. Adaptive database buffer allocation using query feedback. In VLDB, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. J. E. Cook et al. Semi-automatic, self-adaptive control of garbage collection rates in object databases. In SIGMOD, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. D. L. Davison et al. Dynamic resource brokering for multi-user query execution. In SIGMOD, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. C. Faloutsos et al. Predictive load control for flexible buffer allocation. In VLDB, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. N. Garofalakis et al. Parallel query scheduling and optimization with time- and space-shared resources. In VLDB, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. Hall et al. The weka data mining software: An update. 2009.Google ScholarGoogle Scholar
  11. D. Halperin et al. Demo of the Myria big data management service. In SIGMOD, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. H. Herodotou et al. No one (cluster) size fits all: automatic cluster sizing for data-intensive analytics. In SoCC, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. H. Herodotou et al. Starfish: A self-tuning system for big data analytics. In CIDR, 2011.Google ScholarGoogle Scholar
  14. B. Hindman et al. Mesos: A platform for fine-grained resource sharing in the data center. In NSDI, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. Kornacker et al. Impala: A modern, open-source SQL engine for hadoop. In CIDR, 2015.Google ScholarGoogle Scholar
  16. W. Lang et al. Towards multi-tenant performance slos. IEEE Trans. Knowl. Data Eng., 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. Li et al. Resource bricolage for parallel database systems. Proc. of the VLDB Endow., 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Y. Low et al. Distributed GraphLab: a framework for machine learning and data mining in the cloud. In VLDB, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. D. G. Murray et al. Naiad: A timely dataflow system. In SOSP, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. V. R. Narasayya et al. Sharing buffer pool memory in multi-tenant relational database-as-a-service. Proc. of the VLDB Endow., 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. R. T. Ng et al. Flexible buffer allocation based on marginal gains. In SIGMOD, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. H. Pang et al. Managing memory for real-time queries. In SIGMOD, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. T. A. Project. Apache Giraph, http://giraph.apache.org/.Google ScholarGoogle Scholar
  24. J. Schaffner et al. Predicting in-memory database performance for automating cluster management tasks. In ICDE, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. A. J. Storm et al. Adaptive self-tuning memory in DB2. In VLDB, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. P. Tembey et al. Merlin: Application- and platform-aware resource allocation in consolidated server systems. In SoCC, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. V. K. Vavilapalli et al. Apache hadoop YARN: yet another resource negotiator. In SoCC, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. M. Weimer et al. REEF: retainable evaluator execution framework. In SIGMOD, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. T. White. Hadoop: The Definitive Guide. 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. M. Zaharia et al. Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing. In NSDI, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  1. Toward elastic memory management for cloud data analytics

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      BeyondMR '16: Proceedings of the 3rd ACM SIGMOD Workshop on Algorithms and Systems for MapReduce and Beyond
      June 2016
      70 pages
      ISBN:9781450343114
      DOI:10.1145/2926534

      Copyright © 2016 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 26 June 2016

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • short-paper

      Acceptance Rates

      BeyondMR '16 Paper Acceptance Rate10of19submissions,53%Overall Acceptance Rate19of36submissions,53%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader