Abstract
As a combination of artificial intelligence (AI) and edge computing, edge intelligence has made great contributions in pushing AI applications to the edge of the network, especially in reducing delay, saving energy and improving privacy. However, most of researchers only considered the computation approach of end device to edge server and ignored the scheduling of multi-task. In this paper, we study DNN model partition and online task scheduling over cloud, edge and devices for deadline-aware DNN inference tasks. We first establish our mathematical model and find the model can not be solved directly because the solution space is too large. Therefore, we propose the partition point filtering algorithm to reduce the solution space. Then by jointly considering management of the networking bandwidth and computing resources, we propose our online scheduling algorithm to meet the maximum number of deadlines. Experiments and simulations show that our online algorithm reduces deadline miss ratio by up to \(51\%\) compared with other four typical computation approaches.
Z. Xu—Supported by the National Natural Science Foundation of China (Grant No. 61806067), the Anhui Provincial Key R&D Program of China (202004a05020040).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Li, H.: Deep learning for natural language processing: advantages and challenges. Nat. Sci. Rev. 5, 24–26 (2018)
Parkhi, O.M., Vedaldi, A., Zisserman, A.: Deep face recognition, pp. 41.1–41.12. BMVA Press (2015)
Nassif, A.B., Shahin, I., Attili, I.B., Azzeh, M., Shaalan, K.: Speech recognition using deep neural networks: a systematic review. IEEE Access 7, 19143–19165 (2019)
Li, S., Xu, L., Zhao, S.: The internet of things: a survey. Inf. Syst. Front. 17, 243–259 (2015)
Chettri, L., Bera, R.: A comprehensive survey on internet of things (iot) toward 5G wireless systems. IEEE Internet Things J. 7, 16–32 (2020)
Cai, Z., Zheng, X.: A private and efficient mechanism for data uploading in smart cyber-physical systems. IEEE Trans. Netw. Sci. Eng. 7, 766–775 (2020)
Shi, W., Cao, J., Zhang, Q., Li, Y., Xu, L.: Edge computing: vision and challenges. IEEE Internet Things J. 3, 637–646 (2016)
Cai, Z., Shi, T.: Distributed query processing in the edge assisted IoT data monitoring system. IEEE Internet Things J. 8, 12679–12693 (2020)
Zhu, T., Shi, T., Li, J., Cai, Z., Zhou, X.: Task scheduling in deadline-aware mobile edge computing systems. IEEE Internet Things J. 6, 4854–4866 (2019)
Duan, Z., Li, W., Cai, Z.: Distributed auctions for task assignment and scheduling in mobile crowdsensing systems. In: 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS), pp. 635–644 (2017)
Han, Z., Tan, H., Li, X., Jiang, S.H., Li, Y., Lau, F.C.M.: Ondisc: online latency-sensitive job dispatching and scheduling in heterogeneous edge-clouds. IEEE/ACM Trans. Netw. 27, 2472–2485 (2019)
Khan, L.U., Yaqoob, I., Tran, N.H., Kazmi, S.M.A., Tri, N.D., Hong, C.: Edge-computing-enabled smart cities: a comprehensive survey. IEEE Internet Things J. 7, 10200–10232 (2020)
Alam, M.R., Reaz, M., Ali, M.A.: A review of smart homes: past, present, and future. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 42, 1190–1203 (2012)
Zhou, Z., Chen, X., Li, E., Zeng, L., Luo, K., Zhang, J.: Edge intelligence: paving the last mile of artificial intelligence with edge computing. Proc. IEEE 107, 1738–1762 (2019)
Xu, M., Qian, F., Zhu, M., Huang, F., Pushp, S., Liu, X.: Deepwear: adaptive local offloading for on-wearable deep learning. IEEE Trans. Mobile Comput. 19, 314–330 (2020)
Li, E., Zeng, L., Zhou, Z., Chen, X.: Edge AI: on-demand accelerating deep neural network inference via edge computing. IEEE Trans. Wirel. Commun. 19, 447–457 (2020)
Shi, C., Chen, L., Shen, C., Song, L., Xu, J.: Privacy-aware edge computing based on adaptive DNN partitioning. In: 2019 IEEE Global Communications Conference (GLOBECOM), pp. 1–6 (2019)
Mohammed, T., Joe-Wong, C., Babbar, R., Francesco, M.D.: Distributed inference acceleration with adaptive DNN partitioning and offloading. In: IEEE INFOCOM 2020 - IEEE Conference on Computer Communications, pp. 854–863 (2020)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhong, L., Shi, J., Shi, L., Xu, J., Fan, Y., Xu, Z. (2021). Online Task Scheduling for DNN-Based Applications over Cloud, Edge and End Devices. In: Liu, Z., Wu, F., Das, S.K. (eds) Wireless Algorithms, Systems, and Applications. WASA 2021. Lecture Notes in Computer Science(), vol 12939. Springer, Cham. https://doi.org/10.1007/978-3-030-86137-7_20
Download citation
DOI: https://doi.org/10.1007/978-3-030-86137-7_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86136-0
Online ISBN: 978-3-030-86137-7
eBook Packages: Computer ScienceComputer Science (R0)