skip to main content
10.1145/3576842.3582385acmconferencesArticle/Chapter ViewAbstractPublication PagesiotdiConference Proceedingsconference-collections
research-article

REACT: Streaming Video Analytics On The Edge With Asynchronous Cloud Support

Published:09 May 2023Publication History

ABSTRACT

Emerging Internet of Things (IoT) and mobile computing applications are expected to support latency-sensitive deep neural network (DNN) workloads. To realize this vision, the Internet is evolving towards an edge-computing architecture, where computing infrastructure is located closer to the end device to help achieve low latency. However, edge computing may have limited resources compared to cloud environments and thus, cannot run large DNN models that often have high accuracy. In this work, we develop REACT, a framework that leverages cloud resources to execute large DNN models with higher accuracy to improve the accuracy of models running on edge devices. To do so, we propose a novel edge-cloud fusion algorithm that fuses edge and cloud predictions, achieving low latency and high accuracy. We extensively evaluate our approach and show that our approach can significantly improve the accuracy compared to baseline approaches. We focus specifically on object detection in videos (applicable in many video analytics scenarios) and show that the fused edge-cloud predictions can outperform the accuracy of edge-only and cloud-only scenarios by as much as 50%. REACT shows that for Edge AI, the choice between offloading and on-device inference is not binary — redundant execution at cloud and edge locations complement each other when carefully employed.

References

  1. Ganesh Ananthanarayanan, Victor Bahl, Landon Cox, Alex Crown, Shadi Nogbahi, and Yuanchao Shu. 2019. Demo: Video Analytics-Killer App for Edge Computing. In Proc. ACM MobiSys.Google ScholarGoogle Scholar
  2. Kittipat Apicharttrisorn, Xukan Ran, Jiasi Chen, Srikanth V Krishnamurthy, and Amit K Roy-Chowdhury. 2019. Frugal following: Power thrifty object detection and tracking for mobile augmented reality. In Proc. SenSys. 96–109.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Ashwin Ashok, Peter Steenkiste, and Fan Bai. 2015. Enabling vehicular applications using cloud services through adaptive computation offloading. In Proceedings of the 6th International Workshop on Mobile Cloud Computing and Services. 1–7.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Mohammad Farhadi Bajestani and Yezhou Yang. 2020. TKD: Temporal Knowledge Distillation for Active Perception. In Proc. WACV. 953–962.Google ScholarGoogle Scholar
  5. Ravi Bhandari, Akshay Uttama Nambi, Venkata N Padmanabhan, and Bhaskaran Raman. 2018. DeepLane: camera-assisted GPS for driving lane detection. In Proc. BuildSys. 73–82.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Erik Bochinski, Volker Eiselein, and Thomas Sikora. 2017. High-speed tracking-by-detection without using image information. In 2017 14th IEEE international conference on advanced video and signal based surveillance (AVSS). IEEE, 1–6.Google ScholarGoogle ScholarCross RefCross Ref
  7. Daniel Bolya, Sean Foley, James Hays, and Judy Hoffman. 2020. Tide: A general toolbox for identifying object detection errors. In Proc. ECCV.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Rainer E Burkard and Ulrich Derigs. 1980. The linear sum assignment problem. In Assignment and Matching Problems: Solution Methods with FORTRAN-Programs. Springer, 1–15.Google ScholarGoogle Scholar
  9. Zhengping Che, Guangyu Li, Tracy Li, Bo Jiang, Xuefeng Shi, Xinsheng Zhang, Ying Lu, Guobin Wu, Yan Liu, and Jieping Ye. 2019. D2-City: A Large-Scale Dashcam Video Dataset of Diverse Traffic Scenarios. arXiv preprint arXiv:1904.01975 (2019).Google ScholarGoogle Scholar
  10. Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu, Zheng Zhang, Dazhi Cheng, Chenchen Zhu, Tianheng Cheng, Qijie Zhao, Buyu Li, Xin Lu, Rui Zhu, Yue Wu, Jifeng Dai, Jingdong Wang, Jianping Shi, Wanli Ouyang, Chen Change Loy, and Dahua Lin. 2019. MMDetection: Open MMLab Detection Toolbox and Benchmark. arXiv preprint arXiv:1906.07155 (2019).Google ScholarGoogle Scholar
  11. Tiffany Yu-Han Chen, Lenin Ravindranath, Shuo Deng, Paramvir Bahl, and Hari Balakrishnan. 2015. Glimpse: Continuous, real-time object recognition on mobile devices. In Proc. SenSys. 155–168.Google ScholarGoogle Scholar
  12. Byung-Gon Chun, Sunghwan Ihm, Petros Maniatis, Mayur Naik, and Ashwin Patti. 2011. Clonecloud: elastic execution between mobile device and cloud. In Proceedings of the sixth conference on Computer systems. 301–314.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Mark Everingham, Luc Van Gool, Christopher KI Williams, John Winn, and Andrew Zisserman. 2010. The pascal visual object classes (voc) challenge. IJCV 88, 2 (2010).Google ScholarGoogle Scholar
  14. Anurag Ghosh, Akshay Nambi, Aditya Singh, Harish YVS, and Tanuja Ganu. 2021. Adaptive streaming perception using deep reinforcement learning. arXiv preprint arXiv:2106.05665 (2021).Google ScholarGoogle Scholar
  15. Google. 2020. Google Coral USB Accelerator. https://coral.ai/products/accelerator.Google ScholarGoogle Scholar
  16. Song Han, Huizi Mao, and William J Dally. 2015. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149 (2015).Google ScholarGoogle Scholar
  17. Jonatan Heyman, Carl Byström, Joakim Hamrén, and Hugo Heyman. 2020. Locust: An Open Source Load Testing Tool. https://locust.io/Google ScholarGoogle Scholar
  18. Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, 2017. Speed/accuracy trade-offs for modern convolutional object detectors. In Proc. CVPR.Google ScholarGoogle ScholarCross RefCross Ref
  19. Itay Hubara, Matthieu Courbariaux, Daniel Soudry, Ran El-Yaniv, and Yoshua Bengio. 2017. Quantized neural networks: Training neural networks with low precision weights and activations. The Journal of Machine Learning Research 18, 1 (2017), 6869–6898.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Intel. 2020. Intel Neural Compute Stick 2. https://software.intel.com/en-us/neural-compute-stick.Google ScholarGoogle Scholar
  21. Srinivasan Iyengar, Ravi Raj Saxena, Joydeep Pal, Bhawana Chhaglani, Anurag Ghosh, Venkata N Padmanabhan, and Prabhakar T Venkata. 2021. Holistic energy awareness for intelligent drones. In Proc. BuildSys.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Junchen Jiang, Ganesh Ananthanarayanan, Peter Bodik, Siddhartha Sen, and Ion Stoica. 2018. Chameleon: scalable adaptation of video analytics. In Proc. SIGCOMM. 253–266.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Yiping Kang, Johann Hauswald, Cao Gao, Austin Rovinski, Trevor Mudge, Jason Mars, and Lingjia Tang. 2017. Neurosurgeon: Collaborative intelligence between the cloud and mobile edge. ACM SIGARCH Computer Architecture News 45, 1 (2017), 615–629.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Harold W Kuhn. 1955. The Hungarian method for the assignment problem. Naval research logistics quarterly 2, 1-2 (1955), 83–97.Google ScholarGoogle Scholar
  25. Mengtian Li, Yu-Xiong Wang, and Deva Ramanan. 2020. Towards Streaming Image Understanding. arXiv preprint arXiv:2005.10420 (2020).Google ScholarGoogle Scholar
  26. Yuanqi Li, Arthi Padmanabhan, Pengzhan Zhao, Yufei Wang, Guoqing Harry Xu, and Ravi Netravali. 2020. Reducto: On-Camera Filtering for Resource-Efficient Real-Time Video Analytics. In Proc. SIGCOMM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Robert LiKamWa, Yunhui Hou, Julian Gao, Mia Polansky, and Lin Zhong. 2016. RedEye: analog ConvNet image sensor architecture for continuous mobile vision. ACM SIGARCH Computer Architecture News 44, 3 (2016).Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. Focal loss for dense object detection. In Proc. ICCV.Google ScholarGoogle ScholarCross RefCross Ref
  29. Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In Proc. ECCV.Google ScholarGoogle ScholarCross RefCross Ref
  30. Luyang Liu, Hongyu Li, and Marco Gruteser. 2019. Edge assisted real-time object detection for mobile augmented reality. In Proc. MobiCom. 1–16.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Alan Lukezic, Tomas Vojir, Luka Cehovin Zajc, Jiri Matas, and Matej Kristan. 2017. Discriminative correlation filter with channel and spatial reliability. In Proc. CVPR.Google ScholarGoogle ScholarCross RefCross Ref
  32. Ravi Netravali, Anirudh Sivaraman, Somak Das, Ameesh Goyal, Keith Winstein, James Mickens, and Hari Balakrishnan. 2015. Mahimahi: Accurate record-and-replay for { HTTP}. In USENIX ATC. 417–429.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Nvidia. 2020. Meet Jetson, the Platform for AI at the Edge.https://developer.nvidia.com/embedded-computing.Google ScholarGoogle Scholar
  34. Xukan Ran, Haolianz Chen, Xiaodan Zhu, Zhenming Liu, and Jiasi Chen. 2018. Deepdecision: A mobile deep learning framework for edge video analytics. In Proc. INFOCOM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You only look once: Unified, real-time object detection. In Proc. CVPR. 779–788.Google ScholarGoogle ScholarCross RefCross Ref
  36. Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In Proc. NeurIPS. 91–99.Google ScholarGoogle Scholar
  37. Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, and Liang-Chieh Chen. 2018. Mobilenetv2: Inverted residuals and linear bottlenecks. In Proc. CVPR.Google ScholarGoogle ScholarCross RefCross Ref
  38. Mahadev Satyanarayanan, Paramvir Bahl, Ramón Caceres, and Nigel Davies. 2009. The case for vm-based cloudlets in mobile computing. IEEE pervasive Computing 8, 4 (2009), 14–23.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Xiaofan Zhang, Haoming Lu, Cong Hao, Jiachen Li, Bowen Cheng, Yuhong Li, Kyle Rupnow, Jinjun Xiong, Thomas Huang, Honghui Shi, 2020. Skynet: a hardware-efficient method for object detection and tracking on embedded systems. In Proc. MLSys.Google ScholarGoogle Scholar
  40. Huajun Zhou, Zechao Li, Chengcheng Ning, and Jinhui Tang. 2017. Cad: Scale invariant framework for real-time object detection. In Proc. ICCV Workshops.Google ScholarGoogle ScholarCross RefCross Ref
  41. Xingyi Zhou, Dequan Wang, and Philipp Krähenbühl. 2019. Objects as points. arXiv preprint arXiv:1904.07850 (2019).Google ScholarGoogle Scholar
  42. Pengfei Zhu, Longyin Wen, Dawei Du, Xiao Bian, Qinghua Hu, and Haibin Ling. 2020. Vision Meets Drones: Past, Present and Future. arXiv preprint arXiv:2001.06303 (2020).Google ScholarGoogle Scholar
  43. Pengfei Zhu, Longyin Wen, Dawei Du, Xiao Bian, Haibin Ling, Qinghua Hu, Haotian Wu, Qinqin Nie, Hao Cheng, Chenfeng Liu, 2018. Visdrone-vdt2018: The vision meets drone video detection and tracking challenge results. In Proc. ECCV Workshops.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    IoTDI '23: Proceedings of the 8th ACM/IEEE Conference on Internet of Things Design and Implementation
    May 2023
    514 pages
    ISBN:9798400700378
    DOI:10.1145/3576842

    Copyright © 2023 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 9 May 2023

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Upcoming Conference

    IoTDI '24
  • Article Metrics

    • Downloads (Last 12 months)140
    • Downloads (Last 6 weeks)7

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format