Abstract
This paper presents four configurations of a genetic algorithm (GA) combined with a local search (LS) method for time series segmentation, with the aim of correctly recognising extreme values. The LS method is based on likelihood maximisation of a beta distribution. The proposal is tested on three real ocean wave height time series, in which extreme values are frequent: two are taken from oceanographic buoys in the Gulf of Alaska and one from a buoy in Puerto Rico. The results show that the combinations with LS improve on the results of the GA alone. Furthermore, the algorithm provides segmentations in which extreme values are grouped in a well-defined cluster, which allows the characteristics of this type of event to be studied.
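As an illustration of how a likelihood-based local search over a segmentation might look, the following minimal sketch scores a set of cut points by the sum of per-segment beta log-likelihoods and then shifts single cut points while that score improves. It is not the procedure of the paper, whose details are not given in the abstract. The function names, the `min_len` parameter, the hill-climbing move, and the use of method-of-moments estimates in place of a full maximum likelihood fit are all assumptions made for this example, and the series is assumed to have been rescaled to the open interval (0, 1).

```python
import math

def beta_loglik(values, a, b):
    """Log-likelihood of values in (0, 1) under Beta(a, b)."""
    log_norm = math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
    return sum(log_norm + (a - 1) * math.log(x) + (b - 1) * math.log(1 - x)
               for x in values)

def moment_estimates(values):
    """Method-of-moments estimates of (a, b); a stand-in for a full ML fit."""
    n = len(values)
    mean = sum(values) / n
    var = sum((x - mean) ** 2 for x in values) / n
    var = max(var, 1e-9)                 # guard against a constant segment
    common = max(mean * (1 - mean) / var - 1, 1e-3)  # keep a and b positive
    return mean * common, (1 - mean) * common

def segmentation_loglik(series, cuts):
    """Sum of per-segment beta log-likelihoods for the given cut points."""
    bounds = [0] + list(cuts) + [len(series)]
    total = 0.0
    for start, end in zip(bounds, bounds[1:]):
        segment = series[start:end]
        a, b = moment_estimates(segment)
        total += beta_loglik(segment, a, b)
    return total

def local_search(series, cuts, min_len=2):
    """Hill climbing (assumed move): shift one cut by +/-1 while the score improves."""
    cuts = list(cuts)
    best = segmentation_loglik(series, cuts)
    improved = True
    while improved:
        improved = False
        for i in range(len(cuts)):
            for step in (-1, 1):
                trial = cuts.copy()
                trial[i] += step
                bounds = [0] + trial + [len(series)]
                # Reject moves that leave a segment shorter than min_len.
                if any(e - s < min_len for s, e in zip(bounds, bounds[1:])):
                    continue
                score = segmentation_loglik(series, trial)
                if score > best + 1e-12:
                    cuts, best, improved = trial, score, True
    return cuts, best
```

In a hybrid scheme of the kind described above, a routine such as `local_search` could be applied to individuals produced by the GA, for example `local_search(series, [3])` for a series with a single initial cut at index 3. When and how often the LS is applied is exactly what the four configurations vary, and the sketch makes no claim about that.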



Acknowledgements
This work has been subsidised by the Project TIN2014-54583-C2-1-R of the Spanish Ministerial Commission of Science and Technology (MICYT), FEDER funds and the P11-TIC-7508 Project of the Junta de Andalucía (Spain). Antonio M. Durán-Rosal’s research has been subsidised by the FPU Predoctoral Program (Spanish Ministry of Education and Science), Grant reference FPU14/03039. Manuel Dorado-Moreno’s research has been subsidised by the FPU Predoctoral Program (Spanish Ministry of Education and Science), Grant reference FPU15/00647.
Cite this article
Durán-Rosal, A.M., Dorado-Moreno, M., Gutiérrez, P. et al.: Identification of extreme wave heights with an evolutionary algorithm in combination with a likelihood-based segmentation. Prog. Artif. Intell. 6, 59–66 (2017). https://doi.org/10.1007/s13748-016-0105-1