Long-term prediction of time series using fuzzy cognitive maps

doi:10.1016/j.engappai.2021.104274

Engineering Applications of Artificial Intelligence

Volume 102, June 2021, 104274

https://doi.org/10.1016/j.engappai.2021.104274

Abstract

As a powerful and widely recognized knowledge modeling tool, fuzzy cognitive maps (FCMs) have been investigated for time series modeling and forecasting. FCMs perform well in one-step-ahead or short-term prediction but poorly in long-term prediction because of the potentially complex interactions between successive steps. In this article, a conceptually sound method is proposed for long-term time series prediction that melds FCMs, time series segmentation and fuzzy clustering. A time series is first divided into suitable, internally homogeneous segments, and dynamic time warping is introduced to evaluate the distance between segments. Subsequently, a modified fuzzy c-means algorithm based on dynamic time warping is used to fuzzify these segments, transforming them into fuzzy time series and semantic vectors. A convex-optimization-based method is used to learn the FCMs rapidly and robustly, so that the weights of the FCMs are obtained from the fuzzy time series. Finally, the forecast time segment is inferred from the learned FCMs and the semantic vectors. In addition, the semantic vectors intuitively reflect the main characteristics and change tendencies of the time series. To demonstrate the long-term prediction ability of our method, we test it on both synthetic and real-life datasets against other representative and up-to-date forecasting methods; the results show superior performance in forecasting future values.

Introduction

Time series forecasting is the endeavor of predicting the future by understanding the past (Makridakis, 1994), and it has been widely applied in areas such as economics, climatology and industry. Its significance is that it allows decision-making in these areas to be conducted efficiently and effectively. During the past decades, different time series models have been proposed, including traditional and fuzzy methods. The traditional time series models, such as autoregressive integrated moving average (ARIMA) (Box et al., 2015, Lee and Tong, 2011), artificial neural network models and hybrid ARIMA models (Zhang, 2003, Kihoro et al., 2004, Khashei et al., 2012), and support vector machines (Cao and Tay, 2003, Chaâbane, 2014), have failed to address the prediction problem under uncertain circumstances in which historical data are incomplete, imprecise or ambiguous. To resolve this problem, the fuzzy time series model provides a feasible alternative that guarantees the robustness of the forecasting models (Song and Chissom, 1994, Singh, 2017, Bose and Mali, 2019, Singh, 2016).

Most currently available time series forecasting models concentrate on one-step-ahead prediction. As a new challenge in time series modeling, long-term prediction (multistep-ahead prediction) is more pervasive in many cases (Sorjamaa et al., 2007, Helmi et al., 2018). Sorjamaa et al. (2008) proposed a combination of methodologies based on extreme learning machine, partial least squares and nonparametric noise estimation. The combination projects the high-dimensional input regressor into a low-dimensional latent space, maximizing the prediction ability of any nonlinear approximator. Cortez et al. (2019) explored a large set of neural network methods that directly optimize prediction intervals for multistep time series forecasting. Galicia et al. (2019) explored the suitability of combining three methods (decision tree, gradient boosted trees and random forest) into ensembles to forecast big data time series. The ensemble method, which computes the weight of each ensemble member using a least squares method, assigns higher weights to the members that were more accurate in the past. Koesdwiady et al. (2018) proposed two methods to improve multistep time series prediction: time-step-augmented and conditional generative adversarial network based models. The first method augments the information about the current time step and follows a training process similar to that of the meta algorithm. The second model is developed using a multioutput strategy and exploits the ability of the generative adversarial network to mimic a dataset distribution. Alameer et al. (2020) proposed an end-to-end deep learning architecture, combining long short-term memory and a deep neural network, for accurately forecasting monthly coal price fluctuations at different horizons. Liu et al. (2021) introduced a two-layer extreme learning machine with a new recurrent algorithm for multistep time series prediction. This recurrent technique not only removes the restriction on the prediction horizon but also uses the mean squared error of the current step to update the output weights for the next step.

Taieb and Atiya (2016) presented a review of the available literature and a comprehensive investigation into the bias and variance behavior of long-term forecasting strategies. In this survey, multistep-ahead forecasting strategies are classified into three major categories: recursive, direct and joint. In the recursive strategy, a prediction model iteratively forecasts one step at a time, with previous predictions used as model inputs. In the direct strategy, a separate model is trained for each step ahead. In the joint strategy, a single model with multiple outputs predicts the whole prediction horizon in a single attempt. The joint methods exhibit better performance with respect to both forecasting bias and variance (Taieb and Atiya, 2016).
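The three strategies can be illustrated with a minimal Python sketch. The function names, the `lags` parameter and the toy linear-extrapolation models below are illustrative assumptions; they are not taken from Taieb and Atiya (2016) or from the method proposed in this paper.

```python
from typing import Callable, List, Sequence

Window = List[float]


def recursive_forecast(one_step: Callable[[Window], float],
                       history: Sequence[float], horizon: int, lags: int) -> List[float]:
    """Recursive strategy: one one-step model, fed back its own predictions."""
    window = list(history[-lags:])
    preds: List[float] = []
    for _ in range(horizon):
        y = one_step(window)
        preds.append(y)
        window = window[1:] + [y]  # slide the window over the new prediction
    return preds


def direct_forecast(models: Sequence[Callable[[Window], float]],
                    history: Sequence[float], lags: int) -> List[float]:
    """Direct strategy: a separate model for each step ahead."""
    window = list(history[-lags:])
    return [model(window) for model in models]


def joint_forecast(multi_output: Callable[[Window], List[float]],
                   history: Sequence[float], lags: int) -> List[float]:
    """Joint strategy: one multi-output model predicts the whole horizon at once."""
    return list(multi_output(list(history[-lags:])))


# Hypothetical stand-in for trained models: linear extrapolation of the window.
def extrapolate(window: Window, steps: int = 1) -> float:
    return window[-1] + steps * (window[-1] - window[-2])


history = [1.0, 2.0, 3.0]
horizon = 3
rec = recursive_forecast(extrapolate, history, horizon, lags=2)
dir_ = direct_forecast(
    [lambda w, h=h: extrapolate(w, h) for h in range(1, horizon + 1)],
    history, lags=2)
jnt = joint_forecast(
    lambda w: [extrapolate(w, h) for h in range(1, horizon + 1)],
    history, lags=2)
```

On this noise-free linear toy series all three strategies agree. With learned models they generally differ, because the recursive strategy accumulates its own prediction errors while the direct and joint strategies do not feed predictions back as inputs.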

Compared with one-step-ahead prediction, the difficulties of long-term prediction are uncertainty and potentially complex interactions between the different ensuing steps. To overcome this difficulty, one viable solution is to concentrate on high level representations of time series rather than the specific individual values. Li et al. (2010) utilized a vector quantization technique to forecast long-term vector values in one step. This approach, called deterministic vector long-term forecasting, took advantage of the sliding window method to extract features of interest in a time series and fuzzy c-means clustering to fuzzify interval partitioning. In view of the levels of vagueness and uncertainty of medical problems, Wang et al. (2015) used fuzzy information granulation (Zadeh, 1979, Pedrycz and Vukovich, 2001) to segment time series and extract abstract features of the subsequences, and then a multiple fuzzy rules interpolation scheme (Chang et al., 2008) was applied for long-term prediction of time series. Analogously, Yang et al. (2017) structured a linear fuzzy information granulation to reflect the time-dependent trend of change, and Gaussian type fuzzy sets were selected to construct fuzzy information granulation. Subsequently, the constructed granular time series were utilized for training a fuzzy inference system that predicted long-term values of the original time series. To improve the interpretability of long-term prediction, Guo et al. (2018) exploited hidden Markov models to derive the relations existing in the granular time series, which were nonoverlapping segments of the time series based on the principle of justifiable granularity. Similarly to the above approaches, Luo et al. (2020) achieved the prediction of long-term fluctuation of time series in the light of the short-term fluctuation patterns by means of polar coordinate fuzzy information granules.

The above research works reveal that long-term forecast modeling based on fuzzy time series has attracted significant attention in recent years. As a soft computing tool, fuzzy cognitive maps (FCMs), introduced by Kosko (1986), can be used to describe and model complex systems (Papageorgiou and Salmeron, 2013). An FCM consists of concept nodes and directed weights. The nodes represent different aspects of the behavior of the system and the weights reflect the causality between concepts. The review article by Felix et al. (2019) gives an up-to-date and comprehensive presentation of the theory and applications of FCMs. Through knowledge-based representation and inference, FCMs are able to capture the behavior of a given dynamic system. When an FCM is designed to model time series, the design can be broken down into four major tasks: input fuzzification, FCM learning, modeling, and output defuzzification (Stach et al., 2008). The weight matrix must be determined before an FCM can be used. The learning problem of FCMs therefore concentrates on acquiring the weight matrix from expert intervention, available historical data or both. The data-based strategy is more suitable when multiple training sequences are available and a priori knowledge is lacking, and it can also increase the robustness and generalization of FCM models (Chen et al., 2015, Papageorgiou and Salmeron, 2013). In our previous work (Lu et al., 2019), a fast and effective learning method for FCMs based on convex optimization was proposed.

Following the data-based strategy, Stach et al. (2008) exploited FCMs learned by real-coded genetic algorithms to perform prediction at both the numerical and linguistic levels. To further improve the prediction accuracy of the FCM model, Lu et al. (2014) designed a high-order fuzzy cognitive map (HFCM) to model and predict time series, in which the fuzzy c-means clustering algorithm was used to construct the framework of the FCM and a genetic algorithm was applied to learn the weights. Each concept of the HFCM depends not only on the most recent states of all concepts but also on their states at multiple earlier time steps, which enhances the approximation ability of FCMs. Pedrycz et al. (2015) introduced a framework for describing a numeric time series with information granules constructed in the space of amplitude and change in amplitude of the time series. Each information granule, formed with the help of the fuzzy c-means clustering algorithm, was mapped onto a concept of the FCM. The first and the last steps of the learning part of a time series should influence the resulting FCM differently. Based on this observation, Salmeron and Froelich (2016) investigated a dynamic optimization of FCMs for time series forecasting with the goal of increasing forecasting accuracy. In this approach, the weights of the FCM and the length of the learning period were dynamically adjusted according to the local characteristics of the time series. For multivariate time series prediction, Papageorgiou and Poczeta (2017) developed a two-stage model that combined evolutionary FCMs and artificial neural network predictors in a cascade form. To deal with nonstationary time series, Yang and Liu (2018) used the wavelet transform to decompose the original nonstationary time series into a multivariate time series, and then applied a high-order FCM to model and predict that multivariate time series. Papageorgiou and Froelich (2012) proposed a long-term predictive model based on FCMs to forecast the state of pneumonia, which was learned by a multistep enhancement of the evolutionary algorithm.
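To make the roles of concept nodes and directed weights concrete, the sketch below iterates a first-order FCM with a sigmoid activation. It assumes the commonly used update rule in which each concept's next state is the activation of the weighted sum of the current states; the exact update equation of this paper is not shown in this excerpt, so the rule, the indexing convention `weights[i][j]` (influence of concept j on concept i) and the function names are assumptions.

```python
import math
from typing import List


def sigmoid(x: float, lam: float = 5.0) -> float:
    """Sigmoid activation with shape parameter lam; maps any real x into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-lam * x))


def fcm_step(weights: List[List[float]], state: List[float], lam: float = 5.0) -> List[float]:
    """One inference step: A_i(t+1) = f(sum_j weights[i][j] * A_j(t))."""
    return [
        sigmoid(sum(w_ij * a_j for w_ij, a_j in zip(row, state)), lam)
        for row in weights
    ]


def fcm_rollout(weights: List[List[float]], state: List[float],
                steps: int, lam: float = 5.0) -> List[List[float]]:
    """Iterate the map and return the trajectory, including the initial state."""
    trajectory = [list(state)]
    for _ in range(steps):
        trajectory.append(fcm_step(weights, trajectory[-1], lam))
    return trajectory


# Hypothetical 2-concept map: concept 0 excites concept 1, concept 1 inhibits concept 0.
W = [[0.0, -0.8],
     [0.6, 0.0]]
traj = fcm_rollout(W, [0.2, 0.9], steps=3)
```

A high-order FCM in the sense of Lu et al. (2014) would add further weight matrices applied to earlier states inside the same sum; that extension is omitted here.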

Together, the above-mentioned studies indicate that FCMs and their extensions have been successfully applied to time series forecasting. However, most research on forecasting models has addressed one-step-ahead prediction. To date, few studies have investigated FCM-based long-term predictive models for time series.

In view of the observations made above, the objective of this study is to propose a novel method based on FCMs that improves the prediction accuracy of long-term time series forecasting. Initially, the numerical time series is divided into sequential segments by means of piecewise linear representation. Afterwards, the segments are equalized to the length of the prediction horizon based on dynamic time warping (DTW). Then, the extended fuzzy c-means clustering algorithm is used to cluster all of these segments. As a result, the segments are transformed into multivariate fuzzy time series. Following this, the forecasting model based on FCMs is constructed from these fuzzy time series, with the FCMs learned by a convex optimization method. The key contributions of this study are summarized as follows:

• FCMs are applied in a novel way to construct a long-term time series prediction model. In the FCM model, each node represents one latent variation modality of the time series with regard to the prediction horizon, and the weights depict the causal relationships among these modalities. The proposed method handles the time series in a way that facilitates human understanding and cognition.
• The unequal-length segments are equalized based on the DTW distance, and the modified fuzzy c-means clustering is applied to fuzzify these time series segments.
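The fuzzification step can be illustrated by the standard fuzzy c-means membership formula, which turns the distances from one segment to each cluster prototype into membership degrees that sum to one. This is a sketch under assumptions: it uses the textbook formula of Bezdek et al. (1984) with fuzzifier `m`, takes the distances as given (in the proposed method they would be DTW distances), and does not reproduce the prototype update of the modified algorithm, which is not shown in this excerpt.

```python
from typing import List, Sequence


def fuzzy_memberships(distances: Sequence[float], m: float = 2.0) -> List[float]:
    """Membership of one segment in each cluster, from its distances to the prototypes.

    Implements u_i = d_i^(-2/(m-1)) / sum_j d_j^(-2/(m-1)), with m > 1.
    """
    zeros = [i for i, d in enumerate(distances) if d == 0.0]
    if zeros:
        # The segment coincides with one or more prototypes: share full membership.
        return [1.0 / len(zeros) if i in zeros else 0.0 for i in range(len(distances))]
    exponent = 2.0 / (m - 1.0)
    inverse = [d ** (-exponent) for d in distances]
    total = sum(inverse)
    return [v / total for v in inverse]


# Hypothetical distances from one segment to three prototypes.
u = fuzzy_memberships([1.0, 2.0, 4.0])
```

Applying this to every segment in chronological order yields one membership vector per segment, which is the kind of multivariate fuzzy time series from which the FCM weights are then learned.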

The remainder of this paper is organized as follows. Section 2 introduces the learning problem of FCMs and time series clustering based on DTW. Section 3 presents the complete framework of the proposed long-term prediction model. Section 4 reports experiments on publicly available datasets that demonstrate the advantages of the proposed model. Section 5 concludes the paper.

Section snippets

Prerequisites

To acquire a comprehensive understanding of the proposed method, fuzzy cognitive maps representation and dynamic time warping are briefly reviewed in this section.
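For reference, the classic dynamic-programming form of the DTW distance between two sequences of possibly different lengths is sketched below. The absolute difference as local cost and the absence of a warping-window constraint are assumptions; the paper's own formulation is not included in this excerpt.

```python
from typing import List, Sequence


def dtw_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Dynamic time warping distance with |x - y| as the local cost."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j]: minimal accumulated cost of aligning a[:i] with b[:j]
    cost: List[List[float]] = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(cost[i - 1][j],      # a[i-1] repeats a match with b[j-1]
                                     cost[i][j - 1],      # b[j-1] repeats a match with a[i-1]
                                     cost[i - 1][j - 1])  # both sequences advance
    return cost[n][m]


# Two segments of different lengths with the same shape align at zero cost.
d_same_shape = dtw_distance([1.0, 2.0, 3.0], [1.0, 2.0, 2.0, 3.0])
d_offset = dtw_distance([0.0, 0.0], [1.0, 1.0])
```

Because the alignment may stretch or compress either sequence, DTW can compare the unequal-length segments produced by segmentation, which a pointwise Euclidean distance cannot do directly.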

The proposed long-term forecasting model

In this section, we elaborate on the long-term forecasting model based on FCMs. At first, the time series is divided into a set of nonoverlapping segments in chronological order. Then, the modified fuzzy c-means based on DTW is adopted to convert these segments into a fuzzy time series. Next, the long-term prediction model with FCMs is constructed from the fuzzy time series. Finally, the well-learned FCMs model is utilized to produce predictive outputs and then defuzzify and compute the

Experimental studies

The aim of the experiments is to demonstrate the work process and validate the performance of the constructed model. Both synthetic and real world datasets are employed to test and evaluate the performance of the proposed method. In each time series, the first 80% of data are used for model training, with the remaining 20% data utilized for testing. In all experiments, the shape parameter of activation function $λ$ is specified as 5 and the regularization parameter $α$ used in (16) is arbitrarily

Conclusions

This study has proposed a novel method of time series long-term forecasting based on FCMs. The method considers representing time series in high level representations. Furthermore, FCMs and fuzzy clustering are utilized to construct the forecasting model based on the successive segments. In that case, the forecast output of the model is produced in the form of time segments as well, namely, long-term prediction. A series of comparative experiments fully support the high capability of the

CRediT authorship contribution statement

Guoliang Feng: Conceptualization, Methodology, Software, Data curation, Visualization, Investigation, Writing - original draft. Liyong Zhang: Conceptualization, Methodology. Jianhua Yang: Conceptualization, Methodology, Writing - review & editing, Supervision. Wei Lu: Conceptualization, Methodology, Writing - review & editing, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

The research was partially supported by the National Natural Science Foundation of China under Grant Nos. 62073056, 61876029 and 62076050, the National Key R&D Program of China under Grant No. 2019YFB1705103, and the Fundamental Research Funds for the Chinese Central Universities, China under Grant DUT20LAB129.

References (47)

Alameer, Z. et al. (2020). Multistep-ahead forecasting of coal prices using a hybrid deep learning model. Resour. Policy.
Bezdek, J.C. et al. (1984). FCM: The fuzzy c-means clustering algorithm. Comput. Geosci.
Bose, M. et al. (2019). Designing fuzzy time series forecasting models: A survey. Internat. J. Approx. Reason.
Chen, Y. et al. (2015). Inferring causal networks using fuzzy cognitive maps and evolutionary algorithms with application to gene regulatory network reconstruction. Appl. Soft Comput.
Galicia, A. et al. (2019). Multi-step forecasting for big data time series based on ensemble learning. Knowl.-Based Syst.
Helmi, A. et al. (2018). Multi-step ahead time series forecasting via sparse coding and dictionary based techniques. Appl. Soft Comput.
Izakian, H. et al. (2015). Fuzzy clustering of time series data using dynamic time warping distance. Eng. Appl. Artif. Intell.
Khashei, M. et al. (2012). Hybridization of autoregressive integrated moving average (ARIMA) with probabilistic neural networks (PNNs). Comput. Ind. Eng.
Kosko, B. (1986). Fuzzy cognitive maps. Int. J. Man-Mach. Stud.
Lee, Y.-S. et al. (2011). Forecasting time series using a methodology based on autoregressive integrated moving average and genetic programming. Knowl.-Based Syst.
Li, S.-T. et al. (2010). Deterministic vector long-term forecasting for fuzzy time series. Fuzzy Sets and Systems.
Liu, Z. et al. (2021). A novel error-output recurrent two-layer extreme learning machine for multi-step time series prediction. Sustainable Cities Soc.
Lu, W. et al. (2019). Fast and effective learning for fuzzy cognitive maps: A method based on solving constrained convex optimization problems. IEEE Trans. Fuzzy Syst.
Lu, W. et al. (2014). The modeling and prediction of time series based on synergy of high-order fuzzy cognitive map and fuzzy c-means clustering. Knowl.-Based Syst.
Luo, C. et al. (2020). A novel forecasting model for the long-term fluctuation of time series based on polar fuzzy information granules. Inform. Sci.
Makridakis, S. (1994). Time series prediction: Forecasting the future and understanding the past. Int. J. Forecast.
Papageorgiou, E.I. et al. (2012). Multi-step prediction of pulmonary infection with the use of evolutionary fuzzy cognitive maps. Neurocomputing.
Papageorgiou, E.I. et al. (2017). A two-stage model for time series prediction based on fuzzy cognitive maps and neural networks. Neurocomputing.
Salmeron, J.L. et al. (2016). Dynamic optimization of fuzzy cognitive maps for time series forecasting. Knowl.-Based Syst.
Song, Q. et al. (1994). Forecasting enrollments with fuzzy time series, part II. Fuzzy Sets and Systems.
Sorjamaa, A. et al. (2007). Methodology for long-term prediction of time series. Neurocomputing.
Wang, W. et al. (2015). Time series long-term forecasting model based on information granules and fuzzy clustering. Eng. Appl. Artif. Intell.
Yang, X. et al. (2017). Long-term forecasting of time series based on linear fuzzy information granules and fuzzy inference system. Internat. J. Approx. Reason.
