Abstract
In recent times, the exponential boom of industrial data in cloud is witnessed due to dramatic outcome of digitization and smart environment within the industries. Globalization, easy-to-use and availability of data also plays a key role in driving data production in the cloud environment. However, many industries and organizations use several techniques to collect and process massive amount of data which is obtained through various data acquisition channels. Processing of such huge data with redundancy rate has an impact on time series analysis and cloud storage as well. Hence, an integrated technique to perform data de-duplication and time series analysis is required. Furthermore, optimal location to place the data also become an essential for efficient access of data in the cloud environment. To address the aforementioned issues, the proposed system presents CTS-IIoT: Computation of Time Series data during Index Based De-duplication of Industrial IoT (IIoT) data in Cloud Environment to compute time series data during de-duplication using Merkle Hash Tree (MHT). Finally, the proposed system concludes with the determination of optimal location with minimal transportation cost to reach the storage nodes in the cloud environment using Modified Distribution (MODI) method. The experimental results reveal that the proposed model is efficient since it facilitates less memory and less computation overhead. The proposed technique achieves space reduction by 43%, reduces the computation overhead by 32% and increases the efficacy of data retrieval by 18.5%.








Similar content being viewed by others
Data Availability
Enquiries about data availability should be directed to the authors.
References
Cai, H., Boyi, Xu., Jiang, L., & Vasilakos, A. V. (2017). IoT-based big data storage systems in cloud computing: Perspectives and challenges. IEEE Internet of Things Journal, 4(1), 75–87.
Industrial IoT Market–Global Opportunity Analysis and Industry Forecast (2020–2027) Report, 2019. https://www.meticulousresearch.com/product/industrial-iot-market Accessed Jan, 2021.
Krishnamurthi, R., Kumar, A., Gopinathan, D., Nayyar, A., & Qureshi, B. (2020). An overview of IoT sensor data processing, fusion and analysis techniques. Sensors, MDPI, 20(21), 6076.
Chen, L., Zhou, P., Gao, L., & Jie, Xu. (2018). Adaptive fog configuration for the industrial internet of things. IEEE Transactions on Industrial Informatics, 14(10), 4656–4664.
Rathee, G., Garg, S., Kaddoum, G., & Choi, B. J. (2021). Decision-making model for securing IoT devices in smart industries. IEEE Transactions on Industrial Informatics, 17(6), 4270–4278.
Borujeni, E. M., Rahbari, D., & Nickray, M. (2018). Fog-based energy-efficient routing protocol for wireless sensor networks. Journal of Supercomputing, 74(12), 6831–6858.
Peralta, G., Garrido, P., Bilbao, J., Aguero, R., & Crespo, P. M. (2019). On the combination of multi-cloud and network coding for cost-efficient storage in industrial applications. Sensors, MDPI, 19(7), 1–19.
Prajapati, P., & Shah, P. (2020). A review on secure data deduplication: Cloud storage security issue. Journal of King Saud University—Computer and Information Sciences, 34(7), 3996–4007.
Akhila, K., Ganesh, A., & Sunitha, C. (2016). A study on de-duplication techniques over encrypted data, Fourth international conference on recent trends in computer science & engineering, Procedia Computer Science, Thrissur, Kerala, pp. 38–43.
Zheng, X., Zhou, Y., Yalan, Y., & Fagen, L. (2020). A cloud data de-duplication scheme based on certificateless proxy re-encryption. Journal of Systems Architecture., 102, 101666.
Xia, W., Feng, D., Jiang, H., Zhang, Y., Chang, V., & Zou, X. (2019). Accelerating content defined-chunking based data de-duplication by exploiting parallelism. Future Generation Computer Systems, 98, 406–418.
Yinjin, Fu., Xiao, N., Jiang, H., Guyu, Hu., & Chen, W. (2019). Application-aware big data deduplication in cloud environment. IEEE Transactions on Cloud Computing, 7(4), 921–934.
Xia, W., Jiang, H., Feng, D., & Tian, L. (2016). DARE: A deduplication-aware resemblance detection and elimination scheme for data reduction with low overheads. IEEE Transactions on Computers, 65(6), 1692–1705.
Yan, Z., Ding, W., Xixun, Yu., Zhu, H., & Deng, R. H. (2016). Deduplication on encrypted big data in cloud. IEEE Transaction on Big Data, 2(2), 138–150.
Sharma, S., & Saini, H. (2020). Fog assisted task allocation and secure de-duplication using 2FBO2 and MoWo in cluster-based Industrial IoT (Industrial IoT). Computer Communications, 152, 187–199.
Jun-Song, Fu., Liu, Y., Chao, H.-C., Bhargava, B. K., & Zhang, Z.-J. (2018). Secure data storage and searching for industrial IoT by integrating fog computing and cloud computing. IEEE Transactions on Industrial Informatics, 14(10), 4519–4528.
Yu, C. M., Gochhayat, S. P., Conti, M., & Lu, C. S. (2020). Privacy aware data deduplication for side channel in cloud storage. IEEE Transactions on Cloud Computing., 8(2), 597–609.
Ni, J., Zhang, K., Yu, Y., Lin, X., & Shen, X. S. (2018). Providing task allocation and secure de-duplication for mobile crowdsensing via fog computing. IEEE Transaction on Dependable Secure Computing, 17(3), 581–594.
Tian, G., Ma, H., Xie, Y., & Liu, Z. (2020). Randomized de-duplication with ownership management and data sharing in cloud storage. Journal of Information Security and Applications., 51, 102432.
Jiang, S., Jiang, T., & Wang, L. (2020). Secure and efficient cloud data deduplication with ownership management. IEEE Transactions on Services Computing, 13(6), 1152–1165.
Gao, Y., Xian, H., & Yu, A. (2020). Secure data deduplication for Internet-of-things sensor networks based on threshold dynamic adjustment. International Journal of Distributed Sensor Networks, 16(3), 155014772091100.
Ellapan, M., & Abirami, S. (2021). Dynamic prime chunking algorithm for data deduplication in cloud storage. KSII Transactions on Internet and Information Systems, 15(4), 1342–1359.
Veerachamy, R., & Ravi Kumar, V. (2011). Operational research. Delhi: I K International Publishing.
Li, C., Cai, Q., & Lou, Y. (2021). Optimal data placement strategy considering capacity limitation and load balancing in the geographically distributed cloud. Future Generation Computer Systems, 127, 142–159.
Hu, Z., Li, B., & Luo, J. (2017). Time-and cost-efficient task scheduling across geo-distributed data centers. IEEE Transactions on Parallel and Distributed Systems, 29(3), 705–718.
Wu, Y., Zhang, Z., Wu, C., Guo, C., Li, Z., & Lau, F. C. M. (2017). Orchestrating bulk data transfers across geo-distributed datacenters. IEEE Transactions on Cloud Computing, 41(99), 112–125.
Atrey, A., Van Seghbroeck, G., Mora, H., De Turc, F., & Volckaert, B. (2019). SpeCH: A scalable framework for data placement of data-intensive services in-distributed clouds. Journal of Network and Computer Applications, 142(1), 14.
Yu, B., & Pan, J. (2016). Sketch-based data placement among geo-distributed data center for cloud storages, IEEE Conference on computer communications, San Francisco: IEEE Computer Society Press, pp. 1–9.
Funding
The authors have not disclosed any funding.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Muthunagai, S.U., Anitha, R. CTS-IIoT: Computation of Time Series Data During Index Based De-duplication of Industrial IoT (IIoT) Data in Cloud Environment. Wireless Pers Commun 129, 433–453 (2023). https://doi.org/10.1007/s11277-022-10105-5
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11277-022-10105-5