A Study on Subsequence Similarity Join in Time Series Data Using MapReduce

Park, Kyounghyun; Won, Hee Sun; Ryu, Keun Ho

doi:10.1007/978-981-10-7605-3_135

Kyounghyun Park³⁶,
Hee Sun Won³⁶ &
Keun Ho Ryu³⁷

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 474))

Included in the following conference series:

478 Accesses
1 Citations

Abstract

There are a large number of applications that find the most similar pairs of time sequences in a given time-series database. However, similarity join operation in vast amounts of data is a big challenge in a single machine. For such data-intensive computing, distributed parallel processing framework such as MapReduce is getting a lot of attention. In this paper, we investigate how to operate subsequence similarity joins using MapReduce framework. We first show a sequential subsequence similarity join algorithm. Next, we propose two efficient algorithms to minimize the subsequence similarity join computation. We finally perform the experiments with synthetic data sets. The performance shows that the effectiveness of our MapReduce algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Hardcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A Framework for Similarity Search in Streaming Time Series based on Spark Streaming

Article 11 June 2022

An Efficient Method for Time Series Join on Subsequence Correlation Using Longest Common Substring Algorithm

Efficient Subsequence Join Over Time Series Under Dynamic Time Warping

References

Agrawal, R., Faloutsos, C., Swami, A.N.: Efficient similarity search in sequence databases. In: FODO 1993, pp. 69–84 (1993)
Google Scholar
Faloutsos, C., Ranganathan, M., Manolopoulos, Y.: Fast subsequence matching in time-series databases. In: SIGMOD Conference 1994, pp. 419–429 (1994)
Google Scholar
Moon, Y.-S., Whang, K.-Y., Loh, W.-K.: Duality-based subsequence matching in time-series databases. In: ICDE 2001, pp. 263–272 (2001)
Google Scholar
Lu, W., Shen, Y., Chen, S., Ooi, B.C.: Efficient processing of k nearest neighbor joins using MapReduce. PVLDB 5(10), 1016–1027 (2012)
Google Scholar
Ji, C., Dong, T., Li, Y., Shen, Y., Li, K., Qiu, W., Qu, W., Guo, M.: Inverted grid-based kNN query processing with MapReduce. In: ChinaGrid 2012, pp. 25–32 (2012)
Google Scholar

Download references

Acknowledgments

This work was supported by Institute for Information & communications Technology Promotion (IITP) grant funded by the Korea government (MSIT) (No. 2017-00253, Development of an Advanced Open Data Distribution Platform based on International Standards).

Author information

Authors and Affiliations

Smart Data Research Group, ETRI, Daejeon, Korea
Kyounghyun Park & Hee Sun Won
Database/Bioinformatics Laboratory, Chungbuk National University, Cheongju, Korea
Keun Ho Ryu

Authors

Kyounghyun Park
View author publications
You can also search for this author in PubMed Google Scholar
Hee Sun Won
View author publications
You can also search for this author in PubMed Google Scholar
Keun Ho Ryu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kyounghyun Park .

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Seoul University of Science and Technology, Seoul, Korea (Republic of)
James J. Park
Department of Business Science, University of Salerno, Salerno, Italy
Vincenzo Loia
Department of Multimedia Engineering, Dongguk University, Seoul, Soul-t’ukpyolsi, Korea (Republic of)
Gangman Yi
Department of Multimedia Engineering, Dongguk University, Seoul, Soul-t’ukpyolsi, Korea (Republic of)
Yunsick Sung

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Park, K., Won, H.S., Ryu, K.H. (2018). A Study on Subsequence Similarity Join in Time Series Data Using MapReduce. In: Park, J., Loia, V., Yi, G., Sung, Y. (eds) Advances in Computer Science and Ubiquitous Computing. CUTE CSA 2017 2017. Lecture Notes in Electrical Engineering, vol 474. Springer, Singapore. https://doi.org/10.1007/978-981-10-7605-3_135

Download citation

DOI: https://doi.org/10.1007/978-981-10-7605-3_135
Published: 20 December 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-7604-6
Online ISBN: 978-981-10-7605-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

A Study on Subsequence Similarity Join in Time Series Data Using MapReduce

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Framework for Similarity Search in Streaming Time Series based on Spark Streaming

An Efficient Method for Time Series Join on Subsequence Correlation Using Longest Common Substring Algorithm

Efficient Subsequence Join Over Time Series Under Dynamic Time Warping

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

A Study on Subsequence Similarity Join in Time Series Data Using MapReduce

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Framework for Similarity Search in Streaming Time Series based on Spark Streaming

An Efficient Method for Time Series Join on Subsequence Correlation Using Longest Common Substring Algorithm

Efficient Subsequence Join Over Time Series Under Dynamic Time Warping

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation