Skip to main content

Inference Acceleration Model of Branched Neural Network Based on Distributed Deployment in Fog Computing

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12432))

Abstract

Research based on deep neural networks (DNN) is becoming more common. In order to solve the problem that DNN needs to consume a lot of performance during the use prediction process and generate unacceptable delays for users, a distributed neural network deployment model based on fog computing is proposed. The distributed deployment of deep neural networks in fog computing scenarios is analyzed. A deployment algorithm based on Solution Space Tree Pruning (SSTP) is designed, and a suitable fog computing node deployment model is selected to reduce the delay of prediction tasks. An algorithm for Maximizing Accuracy based on Guaranteed Latency (MAL) is designed and implemented, and suitable fog computing nodes are selected for different tasks to exit the prediction task. Simulation experiment results show that compared with the method of deploying neural network models in the cloud, the model prediction delay of the distributed neural network model based on fog computing is reduced by an average of 44.79%. Reduced the average computing acceleration framework of similar algorithms by 28.75%.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Wang, T., Wen, C.-K., Wang, H., Gao, F., Jiang, T., Jin, S.: Deep learning for wireless physical layer: opportunities and challenges. China Commun. 14(11), 92–111 (2017). https://doi.org/10.1109/cc.2017.8233654

    Article  Google Scholar 

  2. Mahmud, R., Kotagiri, R., Buyya, R.: Fog computing: a taxonomy, survey and future directions. In: Di Martino, B., Li, K.-C., Yang, L.T., Esposito, A. (eds.) Internet of Everything. IT, pp. 103–130. Springer, Singapore (2018). https://doi.org/10.1007/978-981-10-5861-5_5

    Chapter  Google Scholar 

  3. Teerapittayanon, S., McDanel, B., Kung, H.-T.: Distributed deep neural networks over the cloud, the edge and end devices. In: 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS), pp. 328–339. IEEE (2017). https://doi.org/10.1109/icdcs.2017.226

  4. Jiang, W., YuSheng, X., Hong, G., Zhang, L.: Dynamic trust calculation model and credit management mechanism of online trading. SCIENTIA SINICA Informationis 44(9), 1084–1101 (2014). https://doi.org/10.1360/N112013-00202

    Article  Google Scholar 

  5. Liu, W., Wang, Z., Liu, X., Zeng, N., Liu, Y., Alsaadi, F.E.: A survey of deep neural network architectures and their applications. Neurocomputing 234, 11–26 (2017). https://doi.org/10.1016/j.neucom.2016.12.038

    Article  Google Scholar 

  6. Zhang, Y., Li, S., Guo, H.: A type of biased consensus-based distributed neural network for path planning. Nonlinear Dyn. 89(3), 1803–1815 (2017). https://doi.org/10.1007/s11071-017-3553-7

    Article  MathSciNet  MATH  Google Scholar 

  7. Kanev, A., et al.: Anomaly detection in wireless sensor network of the “smart home” system. In: 2017 20th Conference of Open Innovations Association (FRUCT), pp. 118–124. IEEE (2017). https://doi.org/10.23919/fruct.2017.8071301

  8. Li, E., Zeng, L., Zhou, Z., Chen, X.: Edge AI: on-demand accelerating deep neural network inference via edge computing. IEEE Trans. Wirel. Commun. 19(1), 447–457 (2019). https://doi.org/10.1109/twc.2019.2946140

    Article  Google Scholar 

  9. Fan, Y., Shi, Y., Kang, K., Xing, Q.: An inflection point based clustering method for sequence data. In: Ni, W., Wang, X., Song, W., Li, Y. (eds.) WISA 2019. LNCS, vol. 11817, pp. 201–212. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30952-7_22

    Chapter  Google Scholar 

Download references

Acknowledgement(s)

National Natural Science Foundation of China (61772196; 61472136); Hunan Provincial Focus Natural Science Fund (2020JJ4249); Hunan Provincial Focus Social Science Fund (2016ZDB006); Key Project of Hunan Provincial Social Science Achievement Review Committee (XSP 19ZD1005).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sijian Lv .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Jiang, W., Lv, S. (2020). Inference Acceleration Model of Branched Neural Network Based on Distributed Deployment in Fog Computing. In: Wang, G., Lin, X., Hendler, J., Song, W., Xu, Z., Liu, G. (eds) Web Information Systems and Applications. WISA 2020. Lecture Notes in Computer Science(), vol 12432. Springer, Cham. https://doi.org/10.1007/978-3-030-60029-7_45

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-60029-7_45

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-60028-0

  • Online ISBN: 978-3-030-60029-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics