DOI: 10.1145/3211922.3211923
short-paper

Boosting scalable data analytics with modern programmable networks

Published: 11 June 2018

Abstract

Data center networks lie at the core of distributed data analytics frameworks running in large-scale environments. Recent research seeks to improve system performance by optimizing end-host network usage, e.g., by making optimal use of RDMA [2] or zero-copy I/O frameworks [5]. Such approaches allow these systems to leverage the high network bandwidth available at end-hosts; however, they leave the network itself untouched and therefore do not solve contention and scalability issues.

References

[1]
A. Alim, R. G. Clegg, L. Mai, L. Rupprecht, E. Seckler, P. Costa, P. Pietzuch, A. L. Wolf, N. Sultana, J. Crowcroft, et al. Flick: developing and running application-specific network services. In 2016 USENIX Annual Technical Conference (USENIX ATC 16), pages 1--14. USENIX Association, 2016.
[2]
C. Binnig, A. Crotty, A. Galakatos, T. Kraska, and E. Zamanian. The end of slow networks: It's time for a redesign. PVLDB, 9(7):528--539, 2016.
[3]
P. Costa, A. Donnelly, A. Rowstron, and G. O'Shea. Camdoop: exploiting in-network aggregation for big data applications. In Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation, pages 3--3. USENIX Association, 2012.
[4]
R. L. Graham, D. Bureddy, P. Lui, H. Rosenstock, G. Shainer, G. Bloch, D. Goldenberg, M. Dubman, S. Kotchubievsky, V. Koushnir, et al. Scalable hierarchical aggregation protocol (sharp): a hardware architecture for efficient data reduction. In Communication Optimizations in HPC (COMHPC), International Workshop on, pages 1--10. IEEE, 2016.
[5]
J. Hwang, K. K. Ramakrishnan, and T. Wood. Netvm: High performance and flexible networking using virtualization on commodity platforms. IEEE Trans. Network and Service Management, 12(1):34--47, 2015.
[6]
X. Jin, X. Li, H. Zhang, R. Soulé, J. Lee, N. Foster, C. Kim, and I. Stoica. Netcache: Balancing key-value stores with fast in-network caching. In Proceedings of the 26th Symposium on Operating Systems Principles, pages 121--136. ACM, 2017.
[7]
E. Kohler, R. Morris, B. Chen, J. Jannotti, and M. F. Kaashoek. The click modular router. ACM Transactions on Computer Systems (TOCS), 18(3):263--297, 2000.
[8]
M. Liu, L. Luo, J. Nelson, L. Ceze, A. Krishnamurthy, and K. Atreya. Incbricks: Toward in-network computation with an in-network cache. In Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, pages 795--809. ACM, 2017.
[9]
L. Mai, L. Rupprecht, A. Alim, P. Costa, M. Migliavacca, P. Pietzuch, and A. L. Wolf. Netagg: Using middleboxes for application-specific on-path aggregation in data centres. In Proceedings of the 10th ACM International on Conference on emerging Networking Experiments and Technologies, pages 249--262. ACM, 2014.
[10]
A. Sapio, I. Abdelaziz, A. Aldilaijan, M. Canini, and P. Kalnis. In-network computation is a dumb idea whose time has come. In ACM Workshop on Hot Topics in Networks, 2017.
[11]
R. Stoenescu, V. Olteanu, M. Popovici, M. Ahmed, J. Martins, R. Bifulco, F. Manco, F. Huici, G. Smaragdakis, M. Handley, et al. In-net: in-network processing for the masses. In Proceedings of the Tenth European Conference on Computer Systems, page 23. ACM, 2015.

        Published In

        DAMON '18: Proceedings of the 14th International Workshop on Data Management on New Hardware
        June 2018
        75 pages
        ISBN:9781450358538
        DOI:10.1145/3211922
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Author Tags

        1. data center networks
        2. deeply programmable networks
        3. in-network processing
        4. scalable data analytics

        Qualifiers

        • Short-paper

        Funding Sources

        • NSF CSR grant 1421910
        • German Research Foundation (DFG) as part of project B2 within the Collaborative Research Center (CRC) 1053 MAKI Multi-Mechanisms Adaptation for the Future Internet
        • ERC grant 617805 LiveSoft
        • German Research Foundation (DFG) as part of grant BI2011/1 of SPP 2037
        • NSF CSR grant 1618923

        Conference

        SIGMOD/PODS '18

        Acceptance Rates

        Overall Acceptance Rate 94 of 127 submissions, 74%

        Cited By

        • (2025) Scalable Data Management on Next-Generation Data Center Networks. Scalable Data Management for Future Hardware, pages 199-221. DOI: 10.1007/978-3-031-74097-8_8. Online publication date: 24-Jan-2025.
        • (2023) Databases on Modern Networks: A Decade of Research That Now Comes into Practice. Proceedings of the VLDB Endowment, 16:12, pages 3894-3897. DOI: 10.14778/3611540.3611579. Online publication date: 12-Sep-2023.
        • (2022) P4DB - The Case for In-Network OLTP. Proceedings of the 2022 International Conference on Management of Data, pages 1375-1389. DOI: 10.1145/3514221.3517825. Online publication date: 10-Jun-2022.
        • (2021) In-network support for transaction triaging. Proceedings of the VLDB Endowment, 14:9, pages 1626-1639. DOI: 10.14778/3461535.3461551. Online publication date: 22-Oct-2021.
        • (2021) Scalable and Flexible High-Performance In-Network Processing of Hash Joins in Distributed Databases. 2021 International Conference on Field-Programmable Technology (ICFPT), pages 1-9. DOI: 10.1109/ICFPT52863.2021.9609804. Online publication date: 6-Dec-2021.
        • (2021) Exploiting 3D Memory for Accelerated In-Network Processing of Hash Joins in Distributed Databases. Applied Reconfigurable Computing. Architectures, Tools, and Applications, pages 18-32. DOI: 10.1007/978-3-030-79025-7_2. Online publication date: 23-Jun-2021.
