DOI: 10.1145/3220192.3220459

Are Existing Knowledge Transfer Techniques Effective For Deep Learning on Edge Devices?

Published: 11 June 2018

Abstract

With the emergence of the edge computing paradigm, many applications such as image recognition and augmented reality need to perform machine learning (ML) and artificial intelligence (AI) tasks on edge devices. Most AI and ML models are large and computationally heavy, whereas edge devices are usually equipped with limited computational and storage resources. Such models can be compressed and reduced for deployment on edge devices, but compression may cost them their capability and degrade their performance. Recent works have used knowledge transfer (KT) techniques to transfer information from a large network (termed the teacher) to a small one (termed the student) in order to improve the performance of the latter. This approach seems promising for learning on edge devices, but a thorough investigation of its effectiveness is lacking. This paper provides an extensive study of the performance (in both accuracy and convergence speed) of knowledge transfer, considering different student architectures and different techniques for transferring knowledge from teacher to student. The results show that the performance of KT does vary by architecture and transfer technique. A good performance improvement is obtained by transferring knowledge from both the intermediate layers and the last layer of the teacher to a shallower student. But other architectures and transfer techniques do not fare so well, and some of them even lead to a negative performance impact.
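The last-layer transfer technique the abstract refers to is Hinton-style knowledge distillation: the student is trained against a weighted sum of the usual hard-label cross-entropy and a divergence between temperature-softened teacher and student output distributions. A minimal sketch in plain Python follows; the function names and default hyperparameters (T, alpha) are illustrative, not taken from the paper.

```python
import math

def softmax(logits, T=1.0):
    """Temperature-scaled softmax over a list of logits."""
    exps = [math.exp(z / T) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, true_label, T=4.0, alpha=0.5):
    """Hinton-style knowledge-distillation loss:
    alpha * cross-entropy(hard label) +
    (1 - alpha) * T^2 * KL(softened teacher || softened student)."""
    # Hard-label term: cross-entropy with the ground-truth class.
    p_student = softmax(student_logits)
    hard_loss = -math.log(p_student[true_label])

    # Soft-label term: KL divergence between temperature-softened
    # teacher and student distributions.
    ps_T = softmax(student_logits, T)
    pt_T = softmax(teacher_logits, T)
    soft_loss = sum(pt * math.log(pt / ps) for pt, ps in zip(pt_T, ps_T))

    # T^2 rescales soft-target gradients so both terms stay comparable.
    return alpha * hard_loss + (1 - alpha) * (T ** 2) * soft_loss
```

Transferring from intermediate layers, by contrast, adds a FitNets-style "hint" loss (e.g. an L2 distance between teacher and student feature maps); the sketch above covers only the last-layer term.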

Published In

HPDC '18: Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing
June 2018
25 pages
ISBN:9781450358996
DOI:10.1145/3220192
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. Deep learning
  2. Edge computing
  3. Knowledge transfer
  4. Neural networks

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

HPDC '18

Acceptance Rates

Overall Acceptance Rate 61 of 390 submissions, 16%

Bibliometrics & Citations

Article Metrics

  • Downloads (Last 12 months)23
  • Downloads (Last 6 weeks)3
Reflects downloads up to 07 Mar 2025

Cited By

  • (2024) Using Cloning-GAN Architecture to Unlock the Secrets of Smart Manufacturing. Procedia Computer Science 232:C, 890–902. DOI: 10.1016/j.procs.2024.01.089. Online publication date: 1-Jan-2024.
  • (2024) Monitoring the growth of insect larvae using a regression convolutional neural network and knowledge transfer. Engineering Applications of Artificial Intelligence 127, 107358. DOI: 10.1016/j.engappai.2023.107358. Online publication date: Jan-2024.
  • (2021) Machine Learning at the Network Edge: A Survey. ACM Computing Surveys 54:8, 1–37. DOI: 10.1145/3469029. Online publication date: 4-Oct-2021.
  • (2020) Convergence of Edge Computing and Deep Learning: A Comprehensive Survey. IEEE Communications Surveys & Tutorials 22:2, 869–904. DOI: 10.1109/COMST.2020.2970550. Online publication date: Oct-2021.
  • (2018) An Experimental Implementation of an Edge-based AI Engine with Edge-Cloud Coordination. 2018 18th International Symposium on Communications and Information Technologies (ISCIT), 442–446. DOI: 10.1109/ISCIT.2018.8587931. Online publication date: Sep-2018.
