Skip to main content

Advertisement

Log in

Identification of extreme wave heights with an evolutionary algorithm in combination with a likelihood-based segmentation

  • Regular Paper
  • Published:
Progress in Artificial Intelligence Aims and scope Submit manuscript

Abstract

This paper presents four configurations of a genetic algorithm (GA) combined with a local search (LS) method for time series segmentation with the purpose of correctly recognising extreme values. The LS method is based on likelihood maximisation of a beta distribution. The proposal is tested on three real ocean wave height time series, where extreme values are frequently found. Concretely, the time series are taken from two oceanographic buoys in the Gulf of Alaska, and another one from Puerto Rico. The results show that the different combinations of LS improve the results of the GA. Furthermore, the algorithm provides segmentations where extreme values are grouped in a well-defined cluster, which allows the study of the characteristics of this type of events.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. Caliński, T., Harabasz, J.: A dendrite method for cluster analysis. Commun. Stat. 3(1), 1–27 (1974)

    MathSciNet  MATH  Google Scholar 

  2. Chung, F.L., Fu, T.C., Ng, V., Luk, R.W.: An evolutionary approach to pattern-based time series segmentation. IEEE Trans. Evol. Comput. 8(5), 471–489 (2004)

    Article  Google Scholar 

  3. Das, G., Lin, K., Mannila, H., Renganathan, G., Smyth, P.: Rule Discovery from Time Series, pp. 16–22. AAAI Press, USA (1998)

  4. Ding, Y., Yang, X., Kavs, A., Li, J.: A novel piecewise linear segmentation for time series. In: Computer and Automation Engineering (ICCAE), 2010 The 2nd International Conference on, vol. 4, pp. 52–55 (2010)

  5. El-Sagheer, R.: Inferences for the generalized logistic distribution based on record statistics. Intell. Inf. Manag. 6, 171–182 (2014)

    Google Scholar 

  6. Friedman, M.: A comparison of alternative tests of significance for the problem of m rankings. Ann. Math. Stat. 11(1), 86–92 (1940)

    Article  MathSciNet  MATH  Google Scholar 

  7. Guralnik, V., Srivastava, J.: Event detection from time series data. In: Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD ’99, pp. 3–42. ACM, New York, NY, USA (1999)

  8. Gutiérrez, P.A., García, S.: Current prospects on ordinal and monotonic classification. Prog. Artif. Intell. 5(3), 171–179 (2016)

    Article  Google Scholar 

  9. Himberg, J., Korpiaho, K., Mannila, H., Tikanmaki, J., Toivonen, H.: Time series segmentation for context recognition in mobile devices. In: Proceedings IEEE International Conference on Data Mining, 2001. ICDM 2001, pp. 203–210 (2001)

  10. Hochberg, Y., Tamhane, A.: Multiple Comparison Procedures. Wiley, New York (1987)

    Book  MATH  Google Scholar 

  11. National buoy data center. National Oceanic and Atmospheric Administration of the USA (NOAA) (2015). http://www.ndbc.noaa.gov/. Accessed 22 Oct 2015

  12. Keogh, E.J., Chu, S., Hart, D., Pazzani, M.: Segmenting time series: a survey and novel approach. In: Last, M., Kandel, A., Bunke, H. (eds.) Data Mining In Time Series Databases, Series In Machine Perception And Artificial Intelligence, vol. 1, pp. 1–22. World Scientific Publishing Company, Cleveland (2004)

    Chapter  Google Scholar 

  13. Lin, W., Orgun, M., Williams, G.: An overview of temporal data mining. In: Proceedings of the 1st Australian Data Mining Workshop, pp. 83–90 (2002)

  14. Menendez, M.: Shannon’s entropy in exponential families: statistical applications. Appl. Math. Lett. 13(1), 37–42 (2000)

    Article  MathSciNet  MATH  Google Scholar 

  15. Nikolaou, A., Gutiérrez, P., Durán, A., Dicaire, I., Fernández-Navarro, F., Hervás-Martínez, C.: Detection of early warning signals in paleoclimate data using a genetic time series segmentation algorithm. Clim. Dyn. 44, 1919–1933 (2015)

    Article  Google Scholar 

  16. Oliver, J., Forbes, C.: Bayesian approaches to segmenting a simple time series. Tech. Rep. 14/97, Monash University, Department of Econometrics and Business Statistics (1997)

  17. Oliver, J.J., Baxter, R.A., Wallace, C.S.: Minimum message length segmentation. In: Wu, X., Kotagiri, R., Korb, K. (eds.) Research And Development In Knowledge Discovery And Data Mining. Lecture Notes In Computer Science, pp. 222–233. Springer, Berlin (1998)

    Chapter  Google Scholar 

  18. Rani, S., Sikka, G.: Recent techniques of clustering of time series data: a survey. Int. J. Comput. Appl. 52(15), 1–9 (2012)

    Google Scholar 

  19. Sato, A.H.: A comprehensive analysis of time series segmentation on Japanese stock prices. Procedia Computer Science 24(0), 307 – 314. In: 17th Asia Pacific Symposium on Intelligent and Evolutionary Systems, IES2013 (2013)

  20. Tseng, V.S., Chen, C.H., Huang, P.C., Hong, T.P.: Cluster-based genetic segmentation of time series with DWT. Pattern Recognit. Lett. 30(13), 1190–1197 (2009)

    Article  Google Scholar 

  21. Wang, X., Smith, K.A., Hyndman, R.J.: Dimension reduction for clustering time series using global characteristics. In: Proc. of the 5th International Conference on Computational Science (ICCS), pp. 792–795. Springer, Atlanta, GA, USA (2005)

  22. Wilks, S.S.: Mathematical Statistics. Wiley, New York (1963)

    MATH  Google Scholar 

  23. Xiao-Ye, W., Zheng-Ou, W.: A structure-adaptive piece-wise linear segments representation for time series. In: Information Reuse and Integration, 2004. IRI 2004. In: Proceedings of the 2004 IEEE International Conference on, pp. 433–437 (2004)

  24. Yang, O., Jia, W., Zhou, P., Meng, X.: A new approach to transforming time series into symbolic sequences. In: Proc. of the 1st Joint Conference between the Biomedical Engineering Society and Engineers in Medicine and Biology, p. 974 (1999)

Download references

Acknowledgements

This work has been subsidised by the Project TIN2014-54583-C2-1-R of the Spanish Ministerial Commission of Science and Technology (MICYT), FEDER funds and the P11-TIC-7508 Project of the Junta de Andalucía (Spain). Antonio M. Durán-Rosal’s research has been subsidised by the FPU Predoctoral Program (Spanish Ministry of Education and Science), Grant reference FPU14/03039. Manuel Dorado-Moreno’s research has been subsidised by the FPU Predoctoral Program (Spanish Ministry of Education and Science), Grant reference FPU15/00647.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Antonio M. Durán-Rosal.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Durán-Rosal, A.M., Dorado-Moreno, M., Gutiérrez, P. . et al. Identification of extreme wave heights with an evolutionary algorithm in combination with a likelihood-based segmentation. Prog Artif Intell 6, 59–66 (2017). https://doi.org/10.1007/s13748-016-0105-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13748-016-0105-1

Keywords

Navigation