Abstract
There are a large number of applications that find the most similar pairs of time sequences in a given time-series database. However, similarity join operation in vast amounts of data is a big challenge in a single machine. For such data-intensive computing, distributed parallel processing framework such as MapReduce is getting a lot of attention. In this paper, we investigate how to operate subsequence similarity joins using MapReduce framework. We first show a sequential subsequence similarity join algorithm. Next, we propose two efficient algorithms to minimize the subsequence similarity join computation. We finally perform the experiments with synthetic data sets. The performance shows that the effectiveness of our MapReduce algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Agrawal, R., Faloutsos, C., Swami, A.N.: Efficient similarity search in sequence databases. In: FODO 1993, pp. 69–84 (1993)
Faloutsos, C., Ranganathan, M., Manolopoulos, Y.: Fast subsequence matching in time-series databases. In: SIGMOD Conference 1994, pp. 419–429 (1994)
Moon, Y.-S., Whang, K.-Y., Loh, W.-K.: Duality-based subsequence matching in time-series databases. In: ICDE 2001, pp. 263–272 (2001)
Lu, W., Shen, Y., Chen, S., Ooi, B.C.: Efficient processing of k nearest neighbor joins using MapReduce. PVLDB 5(10), 1016–1027 (2012)
Ji, C., Dong, T., Li, Y., Shen, Y., Li, K., Qiu, W., Qu, W., Guo, M.: Inverted grid-based kNN query processing with MapReduce. In: ChinaGrid 2012, pp. 25–32 (2012)
Acknowledgments
This work was supported by Institute for Information & communications Technology Promotion (IITP) grant funded by the Korea government (MSIT) (No. 2017-00253, Development of an Advanced Open Data Distribution Platform based on International Standards).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Park, K., Won, H.S., Ryu, K.H. (2018). A Study on Subsequence Similarity Join in Time Series Data Using MapReduce. In: Park, J., Loia, V., Yi, G., Sung, Y. (eds) Advances in Computer Science and Ubiquitous Computing. CUTE CSA 2017 2017. Lecture Notes in Electrical Engineering, vol 474. Springer, Singapore. https://doi.org/10.1007/978-981-10-7605-3_135
Download citation
DOI: https://doi.org/10.1007/978-981-10-7605-3_135
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-7604-6
Online ISBN: 978-981-10-7605-3
eBook Packages: EngineeringEngineering (R0)