Abstract
Although time series prediction research among engineering and technology has made breakthrough progress in performance, challenges remain in modeling complex dynamic interactions between variables and interpretability. To address these two problems, a novel two-stage strategy framework called DCFA-iTimeNet is introduced. In the first stage, this paper innovatively proposes a dynamic cross-fusion attention mechanism (DCFA) . This module facilitates the model to exchange information between different patches of the time series, thereby capturing the complex interactions between variables across time. In the second stage, we exploit a decomposition-based linear explainable Bidirectional Gated Recurrent Unit (DeLEBiGRU), which consists mainly of standard BiGRU and tensorized BiGRU. It is proposed to analyze each variable’s historical long-term, instantaneous, and future impacts. Such design is crucial for understanding how each variable impacts the overall prediction over time. Extensive experimental results demonstrate that the proposed model can effectively model and interpret complex dynamic relationships of multivariate time series and understand the model’s decision-making process. Moreover, the performance outperforms the state-of-the-art methods.
Graphical abstract
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Changxia G, Ning Z, Youru L, Yan L, Huaiyu W (2023) Multi-scale adaptive attention-based time-variant neural networks for multi-step time series forecasting. Appl Intell 53(23):28974–28993
Karim AA, Nower N (2024) Probabilistic spatio-temporal graph convolutional network for traffic forecasting. Appl Intell 1–16
Salinas D, Flunkert V, Gasthaus J, Januschowski T (2020) Deepar: Probabilistic forecasting with autoregressive recurrent networks. Int J Forecast 36(3):1181–1191
Zheng YB, Zhu GH (2019) Application of twitter time series model in influenza prediction. Chinese Preventive Medicine 20(9):793–798
Karim AA, Nower N (2024) Probabilistic spatio-temporal graph convolutional network for traffic forecasting. Applied Intelligence 1–16
Li X, Shang W, Wang S (2019) Text-based crude oil price forecasting: A deep learning approach. Int J Forecast 35(4):1548–1560
Hong S, Ko SJ, Woo SI et al (2024) Time-series forecasting of consolidation settlement using lstm network. Appl Intell 54:1386–1404. https://doi.org/10.1007/s10489-023-05219-7
Ma S, Zhang T, Zhao YB et al (2023) Tcln: A transformer-based conv-lstm network for multivariate time series forecasting. Appl Intell 53:28401–28417
Keogh E (2023) Time series data mining: A unifying view. Proceedings of the VLDB Endowment 16(12):3861–3863
Wu Y, Zhao X, Li Y, Guo L, Zhu X, Fournier-Viger P, Wu X (2023) Opr-miner: Order-preserving rule mining for time series. IEEE Trans Knowl Data Eng 35(11):11722–11735
Wu Y, Wang Z, Li Y, Guo Y, Jiang H, Zhu X, Wu X (2024) Co-occurrence order-preserving pattern mining with keypoint alignment for time series. ACM Trans Manag Inf Syst 15(2):1–27
Yeh CCM, Kavantzas N, Keogh E (2017) Matrix profile iv: using weakly labeled time series to predict outcomes. Proceedings of the VLDB Endowment 10(12):1802–1812
Galicia A, Talavera-Llames R, Troncoso A, Koprinska I, Martínez-Álvarez F (2019) Multi-step forecasting for big data time series based on ensemble learning. Knowl-Based Syst 163:830–841
Zhang S, Chen Y, Zhang W, Feng R (2021) A novel ensemble deep learning model with dynamic error correction and multi-objective ensemble pruning for time series forecasting. Inf Sci 544:427–445
Salinas D, Bohlke-Schneider M, Callot L, Gasthaus J (2019) High-dimensional multivariate forecasting with low-rank gaussian copula processes. In: Advances in Neural Information Processing Systems
Rasul K, Seward C, Schuster I, Vollgraf R (2021) Autoregressive denoising diffusion models for multivariate probabilistic time series forecasting. In: International Conference on Machine Learning, pp. 8857–8868
Tonekaboni S, Joshi S, Campbell K, Duvenaud DK, Goldenberg A (2020) What went wrong and when? instance-wise feature importance for time-series black-box models. Adv Neural Inf Process Syst 33:799–809
Hsieh TY, Wang S, Sun Y, Honavar V (2021) Explainable multivariate time series classification: a deep neural network which learns to attend to important variables as well as time intervals. In: Proceedings of the 14th ACM International Conference on Web Search and Data Mining, pp. 607–615
Turbé H, Bjelogrlic M, Lovis C, Mengaldo G (2023) Evaluation of post-hoc interpretability methods in time-series classification. Nature Machine Intelligence 5(3):250–260
Kierdorf J, Roscher R (2023) Reliability scores from saliency map clusters for improved image-based harvest-readiness prediction in cauliflower. IEEE Geosci Remote Sens Lett 20:1–5
Vu MN, Thai MT (2023) Limitations of perturbation-based explanation methods for temporal graph neural networks, in: 2023 IEEE International Conference on Data Mining (ICDM), pp. 618–627
Sivill T, Flach P (2022) Limesegment: Meaningful, realistic time series explanations, in: International Conference on Artificial Intelligence and Statistics, pp. 3418–3433
Bello M, Nápoles G, Vanhoof K, García MM, Bello R (2022) Explanation of multi-label neural networks with layer-wise relevance propagation. In: 2022 International Joint Conference on Neural Networks (IJCNN). IEEE, pp. 01–10
Ivanovs M, Kadikis R, Ozols K (2021) Perturbation-based methods for explaining deep neural networks: A survey. Pattern Recogn Lett 150:228–234
Tonekaboni S, Joshi S, Campbell K, Duvenaud DK, Goldenberg A (2020) What went wrong and when? instance-wise feature importance for time-series black-box models. Adv Neural Inf Process Syst 33:799–809
Noh K, Kim D, Byun J (2023) Explainable deep learning for supervised seismic facies classification using intrinsic method. IEEE Trans Geosci Remote Sens 61:1–11
Duvnjak M, Merćep A, Kostanjčar Z (2024) Intrinsically interpretable models for credit risk assessment, in: 2024 47th MIPRO ICT and Electronics Convention (MIPRO), pp. 31–36
Liu S, Zhao L, Zhao J, Li B, Wang SH (2022) Attention deficit/hyperactivity disorder classification based on deep spatio-temporal features of functional magnetic resonance imaging. Biomed Signal Process Control 71:103239
Liu M, Guo C, Xu L (2024) An interpretable automated feature engineering framework for improving logistic regression. Applied Soft Computing
Acharak T (2023) Interpretable decision tree ensemble learning with abstract argumentation for binary classification, Vol. 1792, pp. 85–96
Mihaljević B, Bielza C, Larrañaga P (2021) Bayesian networks for interpretable machine learning and optimization. Neurocomputing 456:648–665
Mariotti E, Moral JMA, Gatt A (2023) Exploring the balance between interpretability and performance with carefully designed constrainable neural additive models. Information Fusion 99:101882
Yang Z, Zhang A, Sudjianto A (2021) Gami-net: An explainable neural network based on generalized additive models with structured interactions. Pattern Recogn 120:108192
Boniol P, Meftah M, Remy E, Palpanas T (2022) dcam: dimension-wise class activation map for explaining multivariate data series classification. In: Proceedings of the 2022 International Conference on Management of Data, pp. 1175–1189
Guo T, Lin T, Antulov-Fantulin N (2019) Exploring interpretable lstm neural networks over multi-variable data. In: International Conference on Machine Learning , pp. 2494–2504
Jain S, Wallace BC (2019) Attention is not explanation. In: Proceedings of NAACL-HLT, pp. 3543–3556
Chaoqun W, Yijun L, Xiangqian S, Qi W, Wang D, Huang Z (2023) Delelstm: Decomposition-based linear explainable lstm to capture instantaneous and long-term effects in time series. In: 32th International Joint Conference on Artificial Intelligence (IJCAI 2023)
Bai B, Liang J, Zhang G, Li H, Bai K, Wang F (2021) Why attentions may not be interpretable?. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pp. 25–34
Gao P, Yang X, Huang K, Zhang R, Goulermas JY (2022) Explainable tensorized neural ordinary differential equations for arbitrary-step time series prediction. IEEE Transactions on Knowledge and Data Engineering
Lai G, Chang WC, Yang Y, Liu H (2018) Modeling long-and short-term temporal patterns with deep neural networks, in: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 95–104
Liang X, Zou T, Guo B, Li S, Zhang H, Zhang S, Huang H, Chen SX (2015) Assessing beijing’s pm2.5 pollution: Severity, weather impact, apec and winter heating, Proceedings of the Royal Society A: Mathematical. Physical and Engineering Sciences 471 (2182):20150257
Zhou H, Zhang S, Peng J, Zhang S, Li J, Xiong H, Zhang W (2021) Informer: Beyond efficient transformer for long sequence time-series forecasting. In: The Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2021, Virtual Conference, Vol. 35, pp. 11106–11115
Guo T, Lin T, Antulov-Fantulin N (2019) Exploring interpretable lstm neural networks over multi-variable data. In: International Conference on Machine Learning, pp. 2494–2504
Pantiskas L, Verstoep K, Bal H (2020) Interpretable multivariate time series forecasting with temporal attention convolutional neural networks. In: 2020 IEEE Symposium Series on Computational Intelligence (SSCI), pp. 1687–1694
Oreshkin BN, Carpov D, Chapados N, Bengio Y (2020) N-beats: Neural basis expansion analysis for interpretable time series forecasting. In: International Conference on Learning Representations
Peng C, Li Y, Yu Y, Zhou Y, Du S (2018) Multi-step-ahead host load prediction with gru based encoder-decoder in cloud computing. In: 2018 10th International Conference on Knowledge and Smart Technology (KST), pp. 186–191
Dorado Rueda F, Duran Suárez J, del Real Torres A (2021) Short-term load forecasting using encoder-decoder wavenet: Application to the french grid. Energies 14(9):2524
Deng J, Chen X, Jiang R, Yin D, Yang Y, Song X, Tsang IW (2024) Disentangling structured components: Towards adaptive, interpretable and scalable time series forecasting. IEEE Transactions on Knowledge and Data Engineering
Liu M, Zeng A, Chen M, Xu Z, Lai Q, Ma L, Xu Q (2022) Scinet: Time series modeling and forecasting with sample convolution and interaction. Adv Neural Inf Process Syst 35:5816–5828
Choi J, Choi H, Hwang J, Park N (2022) Graph neural controlled differential equations for traffic forecasting. Proceedings of the AAAI conference on artificial intelligence 36:6367–6374
Shang C, Chen J, Bi J (2021) Discrete graph structure learning for forecasting multiple time series. In: Proceedings of the International Conference on Learning Representations
Deng J, Chen X, Jiang R, Song X, Tsang IW (2021) St-norm: Spatial and temporal normalization for multi-variate time series forecasting. In: Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining, pp. 269–278
Deng J, Chen X, Jiang R, Song X, Tsang IW (2023) A multi-view multi-task learning framework for multi-variate time series forecasting. IEEE Trans Knowl Data Eng 35(8):7665–7680
Funding
The work is supported by Chongqing Master’s Graduate Research and Innovation Project (No. CYS240186).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Yuan, J., Wu, F., Zhao, L. et al. DCFA-iTimeNet: Dynamic cross-fusion attention network for interpretable time series prediction. Appl Intell 55, 86 (2025). https://doi.org/10.1007/s10489-024-05973-2
Accepted:
Published:
DOI: https://doi.org/10.1007/s10489-024-05973-2