Skip to main content
Log in

On retrieving patterns in environmental sensor data

  • Research Article
  • Published:
Earth Science Informatics Aims and scope Submit manuscript

Abstract

As many sensor networks are currently being deployed for environmental monitoring, there is a growing need to develop systems and applications for managing, processing and retrieving massive amounts of data generated from those networks. In this research, a query answering system with pattern mining techniques is investigated specifically for marine sensor data. We consider three applications of pattern mining: similar pattern search, predictive query and query by clustering. In pattern mining for query answering, we adopt the dynamic time warping (DTW) method for similarity measurement. We also propose the use of a query relaxation approach that recommends users change parameters of a given query to get an answer. Finally, we show implementation results of pattern query answering in a marine sensor network deployed in the South East of Tasmania, Australia. Pattern query answering system benefits in accessing and discovering knowledge from sensor data for decision making purposes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22

Similar content being viewed by others

Notes

  1. http://earth.google.com/

References

  • Adhikari PR, Hollmén J (2010) Patterns from multiresolution 0-1 data. In: UP ’10 Proceedings of the ACM SIGKDD workshop on useful patterns (UP), pp 8–16

  • Assent I, Kremer H, Gunnemann S, Seidl T (2010) Pattern detector: fast detection of suspicious stream patterns for immediate reaction. EDBT, pp 709–712

  • Assent I, Witchterich M, Krieger R, Kremer H, Seidl T (2009) Anticipatory DTW for efficient similarity search in time series databases. VLDB, pp 826–837

  • Bulut A, Singh AK (2005) A unified framework for monitoring data streams in real time. ICDE, pp 44–55

  • Buono P, Plaisant C, Simione A, Aris A, Shneiderman B, Shmueli G, Jank W (2007) Similarity-based forecasting with simultaneous previews: a river plot interface for time series forecasting. International Conference Information Visualization (IV’07), pp 191–196

  • Cao H, Qi Y, Candan S, Sapino ML (2010) Feedback-driven result ranking and query refinement for exploring semi-structured data collections. EDBT, pp 3–14

  • Chan FKP, Fu AWC, Yu C (2003) Haar wavelets for efficient similarity search of time-series: with and without time warping. IEEE TKDE 15(3):686–705

    Google Scholar 

  • Chen H, Cheng C-C (2011) A distortion-aware intelligent context-aggregation agent for smart environments. IEEE Intelligent Systems, pp 42–49

  • Chen Y, Nascimento MA, Ooi BC, Tung AKH (2007) SpADE: on shape-based pattern detection in streaming time series. ICDE, pp 786–795

  • Cheng H, Tan P-N, Gao J, Scripps J (2009) Multistep-ahead time series prediction. PAKDD, pp 765–774

  • Ciglan M, Habela O, Tran V, Hluchy L, Kremler M, Gera M (2010) Application of ADMIRE data mining nd integration technologies in environmental scenarios. PPAM, pp 165–173

  • Diao Y, Ganesan D, Mathur G, Shenoy P (2007) Rethinking data management for storage-centric sensor networks. CIDR, pp 22–31

  • Ding H, Trajcevski G, Scheuermann P, Wang X, Keogh E (2008) Querying and mining of time series data: experimental comparison of representations and distance measures. VLDB, pp 1542–1552

  • Fu AWC, Keogh E, Lau LYH, Ratanamahatana CA (2005) Scaling and time warping in time series querying. VLDB, pp 649–660

  • Giberta K, Sanchez-Marrea M (2011) Outcomes from the iEMSs data mining in the environmental sciences workshop series. Environ Model Softw 26(7):983–985

    Article  Google Scholar 

  • Herzfeld M, Andrewartha J, Sakov P (2010) Modelling the physical oceanography of the d’entrecasteaux channel and the Huon estuary, south-eastern Tasmania. Marine and Freshwater Research vol 61, CSIRO publishing, pp 568– 586

  • Hluchy L, Habela O, Tran V, Ciglan M (2009) Hydro-meteorological scenarios using advanced data mining and integration. International Conference on Fuzzy Systems and Knowledge Discovery, pp 260–264

  • Hugo D, Howell B, D’este C, Timms G, Sharman C, de Souza P, Allen S (2011) Low-cost marine monitoring: from sensors to information delivery. IEEE Oceans, pp 1–7

  • Huh SY, Moon KH, Lee H (2000) A data abstraction approach for query relaxation. Inf Softw Technol 42:407–418

    Article  Google Scholar 

  • Keogh E, Kassety S, (2002) On the need for time series data mining benchmarks: a survey and impirical demonstration. SIGKDD, pp 102–111

  • Kidron A, Klein ST (2007) An information retrieval approach to predicting meteorological data. Int J Model Simul 27(3):218–225.

    Google Scholar 

  • Koopman A, Knobbe A, Meeng M (2010) Pattern selection problems in multivariate time-series using equation discovery. In: UP ’10 Proceedings of the ACM SIGKDD workshop on useful patterns (UP), pp 74–81 Pattern selection problems in multivariate time-series using equation discovery, Useful Pattern (UP)

  • Lian X, Chen L (2008) Efficient similarity search over future stream time series. IEEE TKDE 20(1):40–54

    Google Scholar 

  • Lian X, Chen L, Yu JX (2009) Multiscale representations for fast pattern matching in stream time series. IEEE TDKE 21(4):568–581

    Google Scholar 

  • Liao TW (2005) Clustering of time series data—a survey. Pattern Recogn 38:1857–1874

    Article  Google Scholar 

  • Liu C, Li J, Yu JX, Zhou R (2010) Adaptive relaxation for querying heterogeneous XML data sources. Inf Syst 35:688–707

    Article  Google Scholar 

  • Mamoulis N, Cao H, Kollios G, Hadjieleftheriou M, Tao, Y, Cheung DW (2004) Mining, indexing, and querying historical spatiotemporal data. KDD, pp 236–245

  • Mirzadeh N, Ricci F, Bansal M (2004) Supporting user query relaxation in a recommender system. EC-Web, LNCS, vol 3182, pp 31–40

    Google Scholar 

  • Morealle P, Callegari J, Valle G, Kendall F (2011) Sensor integration and analysis for visual identification of environmental patterns. IEEE SysCon., pp 7–12

  • Pan L, Luo J, Li J (2008) Probing queries in wireless sensor networks. IEEE International Conference on Distributed Computing Systems, pp 546–553

  • Ricci F, Mirzadeh N, Venturini A (2002) Intelligent query management in a mediator architecture. IEEE International Symposium on Intelligent Systems, pp 221–226

  • Sakurai Y, Faloutsos C, Yamamura M (2007) Stream monitoring under the time warping distance. ICDE, pp 1046–1055

  • Sakurai Y, Yoshikawa M, Faloutsos C (2005) FTW: fast similarity search under the time warping distance. PODS, pp 326– 337

  • SANY-an open service architecture for sensor networks. SANY Consortium, p 161 ISBN: 9783000285714 (2009) http://www.frisia-it.de/assets/images/SANY_Book.pdf

  • Shahriar MS, de Souza P, Timms G (2011) Smart query answering for marine sensor data. Sensors 11:2885–2897. doi:10.3390/s110302885

    Article  Google Scholar 

  • Shan J, Shen D, Nie T, Kou Y, Yu G (2010) An effective and high-quality query relaxation solution on the deep web. APWeb, pp 68–74

  • Tran V, Hluchy L, Habela O (2010) Data mining and integration for environmental scenarios. SoICT, pp 55–58

  • Timms GP, de Souza PA, Reznik L (2010) Automated assessment of data quality in marine sensor networks. IEEE Oceans, pp 1–5

  • Timms GP, McCulloch JW, McCarthy P, Howell B, de Souza PA, Dunbabin MD, Hartmann K (2009) The Tasmanian Marine Analysis Network (TasMAN). IEEE Oceans, pp 1–6

  • Wu J, Zhou Y, Aberer K, Tan KL (2009) Towards integrated and efficient scientific sensor data processing: a database approach. EDBT, pp 922–933

  • Yang K, Shahabi C (2004) A PCA-based similarity measure for multivariate time series, 2004. MMDB, pp 65–74

  • Yuelong Z, Dingsheng W, Xiaohua Z, (2008) A novel approach to the similarity analysis of multivariate time series and its application in hydrological data mining. International Conference on Computer Science and Software Engineering, pp 730–734

  • Zhang X, Liu J, Du Y, Lv T (2011) A novel clustering method on time series data. Expert Syst Appl 38(9):11891–11900

    Google Scholar 

  • Zhang X, Wu J, Yang X (2009) A novel pattern extraction method for time series classification. Optimization Engineering 10:253–271

    Article  Google Scholar 

  • Zhou X, Gaugaz J, Balke TW, Nejdl W (2007) Query relaxation using malleable schemas. SIGMOD, pp 545–556

Download references

Acknowledgements

The Tasmanian ICT Centre is jointly funded by the Australian Government through the Intelligent Island Program and CSIRO. The Intelligent Island Program is administered by the Tasmanian Department of Economic Development, Tourism and the Arts. This research was conducted as part of the CSIRO Wealth from Oceans National Research Flagship and the Sensors and Sensor Networks Transformational Capability Platform(SSN-TCP). We thank Aidan O’Mara for providing improved prediction using clustering.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Md. Sumon Shahriar.

Additional information

Communicated by: H. A. Babaie

Electronic Supplementary Material

Below is the link to the electronic supplementary material.

(ZIP 76.7 KB)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shahriar, M.S., Souza, P.d. & Timms, G. On retrieving patterns in environmental sensor data. Earth Sci Inform 5, 43–59 (2012). https://doi.org/10.1007/s12145-012-0095-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12145-012-0095-x

Keywords

Navigation