skip to main content
10.1145/3629526.3649131acmconferencesArticle/Chapter ViewAbstractPublication PagesicpeConference Proceedingsconference-collections
keynote
Free Access

Optimizing Edge AI: Performance Engineering in Resource-Constrained Environments

Published:07 May 2024Publication History

ABSTRACT

Recent years have witnessed the growth of Edge AI, a transformative paradigm that integrates neural networks with edge computing, bringing computational intelligence closer to end users. However, this innovation is not without its challenges, especially in environments with limited computing, network, and memory constraints, where resource-hungry AI models often need to be partitioned for distributed execution. This issue becomes even more acute in scenarios where post-deployment updates are infeasible or costly, posing a need to accurately reason about the interplay between resource constraints and Quality-of-Service (QoS) in Edge AI systems, so as to optimally design and operate them.

In this keynote talk, I will focus on these challenges, discussing QoS management and deployment problems arising in Edge AI systems. I will review mechanisms such as early exits and DNN partitioning that are distinctive of this problem space, explaining how they could be accounted for and leveraged in system performance and reliability tuning. I will then illustrate how design decisions and the definition of novel runtime control algorithms can be guided by approaches based on both traditional analytical models and emerging data-driven methods based on machine learning models.

Index Terms

  1. Optimizing Edge AI: Performance Engineering in Resource-Constrained Environments

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ICPE '24: Proceedings of the 15th ACM/SPEC International Conference on Performance Engineering
          May 2024
          310 pages
          ISBN:9798400704444
          DOI:10.1145/3629526

          Copyright © 2024 Owner/Author

          Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 7 May 2024

          Check for updates

          Qualifiers

          • keynote

          Acceptance Rates

          Overall Acceptance Rate252of851submissions,30%
        • Article Metrics

          • Downloads (Last 12 months)23
          • Downloads (Last 6 weeks)23

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader