skip to main content
research-article

SplitPlace: Intelligent Placement of Split Neural Nets in Mobile Edge Environments

Published:20 January 2022Publication History
Skip Abstract Section

Abstract

In recent years, deep learning models have become ubiquitous in industry and academia alike. Modern deep neural networks can solve one of the most complex problems today, but coming with the price of massive compute and storage requirements. This makes deploying such massive neural networks challenging in the mobile edge computing paradigm, where edge nodes are resource-constrained, hence limiting the input analysis power of such frameworks. Semantic and layer-wise splitting of neural networks for distributed processing show some hope in this direction. However, there are no intelligent algorithms that place such modular splits to edge nodes for optimal performance. This work proposes a novel placement policy, SplitPlace, for the placement of such neural network split fragments on mobile edge hosts for efficient and scalable computing.

References

  1. Nasir Abbas et al. "Mobile edge computing: A survey". In: IEEE Internet of Things Journal 5.1 (2017), pp. 450--465.Google ScholarGoogle ScholarCross RefCross Ref
  2. Ejaz Ahmed and Mubashir Husain Rehmani. "Mobile Edge Computing: Opportunities, solutions, and challenges". In: Future Generation Computer Systems 70 (2017), pp. 59--63. issn: 0167--739X.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Md Zahangir Alom et al. "The history began from alexnet: A comprehensive survey on deep learning approaches". In: arXiv preprint arXiv:1803.01164 (2018).Google ScholarGoogle Scholar
  4. Alessandro Capotondi et al. "CMix-NN: Mixed lowprecision CNN library for memory-constrained edge devices". In: IEEE Transactions on Circuits and Systems II: Express Briefs 67.5 (2020), pp. 871--875.Google ScholarGoogle ScholarCross RefCross Ref
  5. Jiasi Chen and Xukan Ran. "Deep LearningWith Edge Computing: A Review." In: Proceedings of the IEEE 107.8 (2019), pp. 1655--1674.Google ScholarGoogle ScholarCross RefCross Ref
  6. Amir Erfan Eshratifar, Amirhossein Esmaili, and Massoud Pedram. "Bottlenet: A deep learning architecture for intelligent mobile cloud computing services". In: 2019 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED). IEEE. 2019, pp. 1--6.Google ScholarGoogle ScholarCross RefCross Ref
  7. Sukhpal Singh Gill et al. "Transformative effects of IoT, Blockchain and Artificial Intelligence on cloud computing: Evolution, vision, trends and open challenges". In: Internet of Things (2019), pp. 100--118.Google ScholarGoogle Scholar
  8. V Scott Gordon and Jeb Crouson. "Self-splitting modular neural network-domain partitioning at boundaries of trained regions". In: International Joint Conference on Neural Networks. IEEE. 2008, pp. 1085--1091.Google ScholarGoogle Scholar
  9. Jashwant Raj Gunasekaran et al. "Implications of Public Cloud Resource Heterogeneity for Inference Serving". In: Proceedings of the 2020 Sixth International Workshop on Serverless Computing. 2020, pp. 7--12.Google ScholarGoogle Scholar
  10. Juyong Kim et al. "SplitNet: Learning to semantically split deep networks for parameter reduction and model parallelization". In: International Conference on Machine Learning (ICML). JMLR. 2017, pp. 1866--1874.Google ScholarGoogle Scholar
  11. Quan Le et al. "An overview of deep learning in industry". In: Data Analytics and AI (2020), pp. 65-- 98.Google ScholarGoogle Scholar
  12. Wei Yang Bryan Lim et al. "Federated learning in mobile edge networks: A comprehensive survey". In: IEEE Communications Surveys & Tutorials 22.3 (2020), pp. 2031--2063.Google ScholarGoogle ScholarCross RefCross Ref
  13. Yuyi Mao, Jun Zhang, and Khaled B Letaief. "Dynamic computation offloading for mobile-edge computing with energy harvesting devices". In: IEEE Journal on Selected Areas in Communications 34.12 (2016), pp. 3590--3605.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Yoshitomo Matsubara et al. "Distilled Split Deep Neural Networks for Edge-Assisted Real-Time Systems". In: Workshop on Hot Topics in Video Analytics and Intelligent Edges. 2019, pp. 21--26.Google ScholarGoogle Scholar
  15. Yuanming Shi et al. "Communication-efficient edge AI: Algorithms and systems". In: IEEE Communications Surveys & Tutorials 22.4 (2020), pp. 2167--2191.Google ScholarGoogle ScholarCross RefCross Ref
  16. Surat Teerapittayanon, BradleyMcDanel, and Hsiang- Tsung Kung. "Distributed deep neural networks over the cloud, the edge and end devices". In: 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS). IEEE. 2017, pp. 328--339.Google ScholarGoogle ScholarCross RefCross Ref
  17. Shreshth Tuli et al. "COSCO: Container Orchestration using Co-Simulation and Gradient Based Optimization for Fog Computing Environments". In: IEEE Transactions on Parallel and Distributed Systems (2021).Google ScholarGoogle Scholar
  18. Shreshth Tuli et al. "Dynamic Scheduling for Stochastic Edge-Cloud Computing Environments using A3C learning and Residual Recurrent Neural Networks". In: IEEE Transactions on Mobile Computing (2020).Google ScholarGoogle Scholar
  19. Shreshth Tuli et al. "Fogbus: A blockchain-based lightweight framework for edge and fog computing". In: Journal of Systems and Software (2019).Google ScholarGoogle Scholar
  20. Shreshth Tuli et al. "HealthFog: An ensemble deep learning based smart healthcare system for automatic diagnosis of heart diseases in integrated IoT and fog computing environments". In: Future Generation Computer Systems 104 (2020), pp. 187--200.Google ScholarGoogle ScholarCross RefCross Ref
  21. Shreshth Tuli et al. "iThermoFog: IoT-Fog based automatic thermal profile creation for cloud data centers using artificial intelligence techniques". In: Internet Technology Letters 3.5 (2020), e198.Google ScholarGoogle Scholar
  22. Yury A Ushakov et al. "Split Neural Networks for Mobile Devices". In: Telecommunications Forum (TELFOR). IEEE. 2018, pp. 420--425.Google ScholarGoogle Scholar
  23. Minchen Yu et al. "Gillis: Serving Large Neural Networks in Serverless Functions with Automatic Model Partitioning". In: 41st IEEE International Conference on Distributed Computing Systems. 2021.Google ScholarGoogle Scholar
  24. Hongyu Zhu et al. "TBD: Benchmarking and analyzing deep neural network training". In: arXiv preprint arXiv:1803.06905 (2018).Google ScholarGoogle Scholar

Index Terms

  1. SplitPlace: Intelligent Placement of Split Neural Nets in Mobile Edge Environments
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM SIGMETRICS Performance Evaluation Review
      ACM SIGMETRICS Performance Evaluation Review  Volume 49, Issue 2
      September 2021
      73 pages
      ISSN:0163-5999
      DOI:10.1145/3512798
      Issue’s Table of Contents

      Copyright © 2022 Copyright is held by the owner/author(s)

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 20 January 2022

      Check for updates

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader