Abstract
With the development of IoT, edge computing has been attracting attention in recent years. In edge computing, simple data processing, such as aggregation and filtering, can be performed at network edges to reduce the amount of data communication and distribute the processing load. In edge computing applications, it is important to guarantee low latency, high reliability, and fault tolerance. We are working on the solution of this problem in the context of environmental sensing applications. In this paper, we outline our approach. In the proposed method, the aggregate value of each device is calculated approximately and the fault tolerance is also guaranteed approximately even when the input data is missing due to sensor device failure or communication failure. In addition, the proposed method reduces the delay by outputting the processing result when the error guarantee satisfies the user’s requirement.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Apache Flink: Stateful Computations over Data Streams. https://flink.apache.org/
Spark Streaming \(|\) Apache Spark. http://spark.apache.org/streaming/
Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, Boston (2006). https://doi.org/10.1007/978-1-4615-7566-5
Deshpande, A., Guestrin, C., Madden, S.R., Hellerstein, J.M., Hong, W.: Model-based approximate querying in sensor networks. VLDB J. 14(4), 417–443 (2005)
Duffield, N., Xu, Y., Xia, L., Ahmed, N.K., Yu, M.: Stream aggregation through order sampling. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (CIKM). pp. 909–918 (2017)
Enders, C.K.: Applied Missing Data Analysis. Guilford Press, New York (2010)
Huang, Q., Lee, P.P.C.: Toward high-performance distributed stream processing via approximate fault tolerance. Proc. VLDB (PVLDB) 10(3), 73–84 (2016)
Jiang, N., Gruenwald, L.: Estimating missing data in data streams. In: Proceedings of International Conference on Database Systems for Advanced Applications (DASFAA). pp. 981–987 (2007)
Johnson, T., Muthukrishnan, S., Rozenbaum, I.: Sampling algorithms in a stream operator. In: Proceedings of ACM SIGMOD. pp. 1–12 (2005)
Raymond, M.R., Roberts, D.M.: A comparison of methods for treating incomplete data in selection research. Educ. Psychol. Measur. 47(1), 13–26 (1987)
Ren, X., Sug, H., Lee, H.: A new estimation model for wireless sensor networks based on the spatial-temporal correlation analysis. J. Inf. Commun. Converg. Eng. 13(2), 105–112 (2015)
Takao, D., Sugiura, K., Ishikawa, Y.: Approximate streaming aggregation with low-latency and high-reliability for edge computing. IEICE Trans. Inf. Syst. J104-D(5), 463–475 (2021). (in Japanese)
Yi, X., Zheng, Y., Zhang, J., Li, T.: ST-MVL: Filling missing values in geo-sensory time series data. In: Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI), pp. 2704–2710 (2016)
Acknowledgement
This study is based on results obtained from a project, JPNP16007, commissioned by the New Energy and Industrial Technology Development Organization (NEDO). In addition, this work was supported partly by KAKENHI (16H01722, 20K19804, and 21H03555).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Takao, D., Sugiura, K., Ishikawa, Y. (2022). Approximate Fault-Tolerant Data Stream Aggregation for Edge Computing. In: Sachdeva, S., Watanobe, Y., Bhalla, S. (eds) Big-Data-Analytics in Astronomy, Science, and Engineering. BDA 2021. Lecture Notes in Computer Science(), vol 13167. Springer, Cham. https://doi.org/10.1007/978-3-030-96600-3_17
Download citation
DOI: https://doi.org/10.1007/978-3-030-96600-3_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-96599-0
Online ISBN: 978-3-030-96600-3
eBook Packages: Computer ScienceComputer Science (R0)