skip to main content
research-article

Scientific Workflows in IoT Environments: A Data Placement Strategy Based on Heterogeneous Edge-Cloud Computing

Published: 10 August 2022 Publication History

Abstract

In Industry 4.0 and Internet of Things (IoT) environments, the heterogeneous edge-cloud computing paradigm can provide a more proper solution to deploy scientific workflows compared to cloud computing or other traditional distributed computing. Owing to the different sizes of scientific datasets and the privacy issue concerning some of these datasets, it is essential to find a data placement strategy that can minimize data transmission time. Some state-of-the-art data placement strategies combine edge computing and cloud computing to distribute scientific datasets. However, the dynamic distribution of newly generated datasets to appropriate datacenters and exiting the spent datasets are still a challenge during workflows execution. To address this challenge, this study not only constructs a data placement model that includes shared datasets within the individual and among multiple workflows across various geographical regions, but also proposes a data placement strategy (DYM-RL-DPS) based on algorithms of two stages. First, during the build-time stage of workflows, we use the discrete particle swarm optimization algorithm with differential evolution to pre-allocate initial datasets to proper datacenters. Then, we reformulate the dynamic datasets distribution problem as a Markov decision process and provide a reinforcement learning–based approach to learn the data placement strategy in the runtime stage of scientific workflows. Through using the heterogeneous edge-cloud computing architecture to simulate IoT environments, we designed comprehensive experiments to demonstrate the superiority of DYM-RL-DPS. The results of our strategy can effectively reduce the data transmission time as compared to other strategies.

References

[1]
Andrey Kashlev, Shiyong Lu, and Artem Chebotko. 2015. Typetheoretic approach to the shimming problem in scientific workflows. IEEE Transactions on Services Computing 8, 5 (2015), 795–809.
[2]
Haibo Li, Keith C. C. Chan, Mengxia Liang, and Xiangyu Luo. 2016. Composition of resource-service chain for cloud manufacturing. IEEE Transactions on Industrial Informatics 12, 1 (2016), 211–219.
[3]
Thang Le Duc, Rafael García Leiva, Paolo Casari, and Per-Olov Östberg. 2019. Machine learning methods for reliable resource provisioning in edge-cloud computing: A survey. ACM Computing Surveys (CSUR) 52, 5 (2019), 94.
[4]
Ziyan Wu, Zhihui Lu, Patrick C. K. Hung, Shih-Chia Huang, Yu Tong, and Zhenfang Wang. 2019. QaMeC: A QoS-driven IoVs application optimizing deployment scheme in multimedia edge clouds. Future Generation Computer Systems 92 (2019), 17–28.
[5]
Zhihui Lu, Nini Wang, Jie Wu, and Meikang Qiu. 2018. IoTDeM: An IoT big data-oriented mapreduce performance prediction extended model in multiple edge clouds. Journal of Parallel and Distributed Computing 118 (2018), 316–327.
[6]
Qingchen Zhang, Laurence T. Yang, Zheng Yan, Zhikui Chen, and Peng Li. 2018. An efficient deep learning model to predict cloud workload for industry informatics. IEEE Transactions on Industrial Informatics 14, 7 (2018), 3170–3178.
[7]
Hongyue Wu, Shuiguang Deng, Wei Li, Jianwei Yin, Xiaohong Li, Zhiyong Feng, and Albert Y. Zomaya. 2019. Mobility-aware service selection in mobile edge computing systems. In 2019 IEEE International Conference on Web Services (ICWS’19). IEEE, 201–208.
[8]
Weisong Shi, Hui Sun, Jie Cao, Quan Zhang, and Wei Liu. 2017. Edge computing—An emerging computing model for the internet of everything era. Journal of Computer Research and Development 54, 5 (2017), 907–924.
[9]
Ai Xiao, Zhihui Lu, Xin Du, Jie Wu, and Patrick C. K. Hung. 2020. ORHRC: Optimized recommendations of heterogeneous resource configurations in cloud-fog orchestrated computing environments. In 2020 IEEE International Conference on Web Services (ICWS’20). IEEE, 404–412.
[10]
X. Du, S. Tang, Z. Lu, J. Wet, K. Gai, and P. C. K. Hung. 2020. A novel data placement strategy for data-sharing scientific workflows in heterogeneous edge-cloud computing environments. In 2020 IEEE International Conference on Web Services (ICWS’20). 498–507. DOI:
[11]
Wei Du, Tao Lei, Qiang Het, Wei Liu, Qiwang Lei, Hailiang Zhao, and Wei Wang. 2019. Service capacity enhanced task offloading and resource allocation in multi-server edge computing environment. In 2019 IEEE International Conference on Web Services (ICWS’19). IEEE, 83–90.
[12]
Yanling Shao, Chunlin Li, and Hengliang Tang. 2019. A data replica placement strategy for IoT workflows in collaborative edge and cloud environments. Computer Networks 148 (2019), 46–59.
[13]
Xiaowei Chen, Songtao Tang, Zhihui Lu, Jie Wu, Yucong Duan, Shih-Chia Huang, and Qifeng Tang. 2019. iDiSC: A new approach to IoT-data-intensive service components deployment in edge-cloud-hybrid system. IEEE Access 7 (2019), 59172–59184.
[14]
Xin Du, Jianlong Xu, Weihong Cai, Changsheng Zhu, and Yindong Chen. 2019. OPRC: An online personalized reputation calculation model in service-oriented computing environments. IEEE Access 7 (2019), 87760–87768.
[15]
Xuejun Li, Lei Zhang, Yang Wu, Xiao Liu, Erzhou Zhu, Huikang Yi, Futian Wang, Cheng Zhang, and Yun Yang. 2019. A novel workflow-level data placement strategy for data-sharing scientific cloud workflows. IEEE Transactions on Services Computing 12, 3 (2019), 370–383.
[16]
Bing Lin, Fangning Zhu, Jianshan Zhang, Jiaqing Chen, Xing Chen, Naixue Xiong, and Jaime Lloret. 2019. A time-driven data placement strategy for a scientific workflow combining edge computing and cloud computing. IEEE Transactions on Industrial Informatics 15, 7 (2019), 4254–4265.
[17]
James Kennedy and Russell Eberhart. 1995. Particle swarm optimization (PSO). In Proceedings of the IEEE International Conference on Neural Networks. 1942–1948.
[18]
A. Kai Qin, Vicky Ling Huang, and Ponnuthurai N. Suganthan. 2008. Differential evolution algorithm with strategy adaptation for global numerical optimization. IEEE transactions on Evolutionary Computation 13, 2 (2008), 398–417.
[19]
Y. C. Liu and C. Y. Huang. 2021. DDPG-based adaptive robust tracking control for aerial manipulators with decoupling approach. IEEE Transactions on Cybernetics (2021), 1–14. DOI:
[20]
L. Chen, Y. Xu, Z. Lu, J. Wu, K. Gai, P. C. K. Hung, and M. Qiu. 2020. IoT microservice deployment in edge-cloud hybrid environment using reinforcement learning. IEEE Internet of Things Journal (2020), 1–1. DOI:
[21]
Shishir Bharathi, Ann Chervenak, Ewa Deelman, Gaurang Mehta, Mei-Hui Su, and Karan Vahi. 2008. Characterization of scientific workflows. In 2008 3rd Workshop on Workflows in Support of Large-Scale Science. IEEE, 1–10.
[22]
Mingjun Wang, Jinghui Zhang, Fang Dong, and Junzhou Luo. 2014. Data placement and task scheduling optimization for data intensive scientific workflow in multiple data centers environment. In 2014 2nd International Conference on Advanced Cloud and Big Data. IEEE, 77–84.
[23]
Dharma Nukarapu, Bin Tang, Liqiang Wang, and Shiyong Lu. 2011. Data replication in data intensive scientific applications with performance guarantee. IEEE Transactions on Parallel and Distributed Systems 22, 8 (2011), 1299–1306.
[24]
Dong Yuan, Yun Yang, Xiao Liu, and Jinjun Chen. 2010. A data placement strategy in scientific cloud workflows. Future Generation Computer Systems 26, 8 (2010), 1200–1214.
[25]
Najme Mansouri, Mohammad Masoud Javidi, and B. Mohammad Hasani Zade. 2021. A CSO-based approach for secure data replication in cloud computing environment. The Journal of Supercomputing 77, 6 (2021), 5882–5933.
[26]
Gang Sun, Yayu Li, Yao Li, Dan Liao, and Victor Chang. 2018. Low-latency orchestration for workflow-oriented service function chain in edge computing. Future Generation Computer Systems 85 (2018), 116–128.
[27]
Xinchen Cai, Hongyu Kuang, Hao Hu, Wei Song, and Jian Lü. 2018. Response time aware operator placement for complex event processing in edge computing. In International Conference on Service-Oriented Computing. Springer, 264–278.
[28]
Vajiheh Farhadi, Fidan Mehmeti, Ting He, Thomas F. La Porta, Hana Khamfroush, Shiqiang Wang, Kevin S. Chan, and Konstantinos Poularakis. 2021. Service placement and request scheduling for data-intensive applications in edge clouds. IEEE/ACM Transactions on Networking 29, 2 (2021), 779–792.
[29]
Adrian-Cristian Nicolaescu, Spyridon Mastorakis, and Ioannis Psaras. 2021. Store edge networked data (SEND): A data and performance driven edge storage framework. In IEEE INFOCOM 2021-IEEE Conference on Computer Communications. IEEE, 1–10.
[30]
Xiaokang Zhou, Xiang Yang, Jianhua Ma, I. Kevin, and Kai Wang. 2021. Energy efficient smart routing based on link correlation mining for wireless edge computing in IoT. IEEE Internet of Things Journal (2021).
[31]
Hong Li, Duo Yang, Wenzhe Su, Jinhu Lü, and Xinghuo Yu. 2019. An overall distribution particle swarm optimization MPPT algorithm for photovoltaic system under partial shading. IEEE Transactions on Industrial Electronics 66, 1 (2019), 265–275.
[32]
Bing Lin, Yinhao Huang, Jianshan Zhang, Junqin Hu, Xing Chen, and Jun Li. 2020. Cost-driven off-loading for DNN-based applications over cloud, edge, and end devices. IEEE Transactions on Industrial Informatics 16, 8 (2020), 5456–5466. DOI:
[33]
Xing Chen, Jianshan Zhang, Bing Lin, Zheyi Chen, Katinka Wolter, and Geyong Min. 2022. Energy-efficient offloading for DNN-based smart IoT systems in cloud-edge environments. IEEE Transactions on Parallel and Distributed Systems 33, 3 (2022), 683–697. DOI:

Cited By

View all
  • (2025)Cost-Effective and Low-Latency Data Placement in Edge Environment Based on PageRank-Inspired Regional ValueIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2024.350662536:2(185-196)Online publication date: Feb-2025
  • (2025)Edge–Cloud Framework for Vehicle–Road Cooperative Traffic Signal Control in Augmented Internet of ThingsIEEE Internet of Things Journal10.1109/JIOT.2024.348753812:5(5488-5499)Online publication date: 1-Mar-2025
  • (2025)Homomorphic Encryption Applications for IoT and Light-Weighted Environments: A ReviewIEEE Internet of Things Journal10.1109/JIOT.2024.347202912:2(1222-1246)Online publication date: 15-Jan-2025
  • Show More Cited By

Index Terms

  1. Scientific Workflows in IoT Environments: A Data Placement Strategy Based on Heterogeneous Edge-Cloud Computing

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Management Information Systems
    ACM Transactions on Management Information Systems  Volume 13, Issue 4
    December 2022
    255 pages
    ISSN:2158-656X
    EISSN:2158-6578
    DOI:10.1145/3555789
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 August 2022
    Online AM: 17 May 2022
    Accepted: 01 April 2022
    Revised: 01 February 2022
    Received: 01 September 2021
    Published in TMIS Volume 13, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Heterogeneous edge-cloud computing
    2. data-sharing
    3. scientific workflows
    4. IoT environments

    Qualifiers

    • Research-article
    • Refereed

    Funding Sources

    • National Key Research and Development Program of China
    • National Natural Science Foundation of China
    • Shanghai Science and Technology Innovation Action Plan Project

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)170
    • Downloads (Last 6 weeks)22
    Reflects downloads up to 27 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)Cost-Effective and Low-Latency Data Placement in Edge Environment Based on PageRank-Inspired Regional ValueIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2024.350662536:2(185-196)Online publication date: Feb-2025
    • (2025)Edge–Cloud Framework for Vehicle–Road Cooperative Traffic Signal Control in Augmented Internet of ThingsIEEE Internet of Things Journal10.1109/JIOT.2024.348753812:5(5488-5499)Online publication date: 1-Mar-2025
    • (2025)Homomorphic Encryption Applications for IoT and Light-Weighted Environments: A ReviewIEEE Internet of Things Journal10.1109/JIOT.2024.347202912:2(1222-1246)Online publication date: 15-Jan-2025
    • (2024)Edge Computing and Cloud Computing for Internet of Things: A ReviewInformatics10.3390/informatics1104007111:4(71)Online publication date: 30-Sep-2024
    • (2024)Towards Energy-Efficient and Thermal-Aware Data Placement for Storage ClustersIEEE Transactions on Sustainable Computing10.1109/TSUSC.2024.33516849:4(631-647)Online publication date: Jul-2024
    • (2024)Intelligent Defect Detection System Based on Cloud-edge Synergy Technology2024 9th IEEE International Conference on Smart Cloud (SmartCloud)10.1109/SmartCloud62736.2024.00026(103-108)Online publication date: 10-May-2024
    • (2024)Energy-Efficient Resource Allocation for Heterogeneous Edge–Cloud ComputingIEEE Internet of Things Journal10.1109/JIOT.2023.329316411:2(2808-2818)Online publication date: 15-Jan-2024
    • (2024)Evaluation of Distribution Network Asset Effectiveness and Investment Decision-Making: A Study Based on the Multidimensional Lean Management System2024 10th IEEE International Conference on High Performance and Smart Computing (HPSC)10.1109/HPSC62738.2024.00017(54-59)Online publication date: 10-May-2024
    • (2024)An Innovative Approach for Manipulating Tidal-Based Computing Power2024 IEEE 10th International Conference on Edge Computing and Scalable Cloud (EdgeCom)10.1109/EdgeCom62867.2024.00022(94-99)Online publication date: 28-Jun-2024
    • (2024)A power fusion data cleaning method based on exponential moving average and cosine similarity algorithms2024 IEEE 10th International Conference on Edge Computing and Scalable Cloud (EdgeCom)10.1109/EdgeCom62867.2024.00012(25-30)Online publication date: 28-Jun-2024
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media