A novel approach for missing data prediction in coevolving time series

Song, Xiaoxiang; Guo, Yan; Li, Ning; Qian, Peng

doi:10.1007/s00607-018-0668-8

A novel approach for missing data prediction in coevolving time series

Published: 26 September 2018

Volume 101, pages 1565–1584, (2019)
Cite this article

Computing Aims and scope Submit manuscript

Xiaoxiang Song¹,
Yan Guo¹,
Ning Li¹ &
…
Peng Qian¹

314 Accesses
2 Citations
Explore all metrics

Abstract

Although various innovative sensing technologies have been widely employed, data missing in collections of time series occurs frequently, which turns out to be a major menace to precise data analysis. However, many existing missing data prediction approaches either might be infeasible or could be inefficient to predict missing data from multiple time series. To solve this problem, we proposed a novel approach based on the compressive sensing theory and sparse Bayesian learning theory for missing data prediction in coevolving time series. First, we model the problem by designing the corresponding sparse representation basis and measurement matrix. Then, the missing data prediction problem is formulated as the multiple sparse vectors recovery problem. Many simultaneous sparse estimation approaches focus on joint estimation of multiple sparse vectors with a common support from given linear observations, which is however too strict in some real applications. In this paper, largely utilizing the interior patterns of coevolving time series, we design a tuning parameter-free algorithm based on the sparse Bayesian learning, which can simultaneously solve multiple sparse estimation takes without the requirement of auxiliary information. Simulation results demonstrate that our approach can recover the entire time series efficiently using only those data that are not missing, even if, a high ratio of collected data are missing.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Scalable recovery of missing blocks in time series with high and low cross-correlations

Article 15 November 2019

Mourad Khayati, Philippe Cudré-Mauroux & Michael H. Böhlen

Using Lowly Correlated Time Series to Recover Missing Values in Time Series: A Comparison Between SVD and CD

Structure-aware decoupled imputation network for multivariate time series

Article Open access 08 December 2023

Nourhan Ahmed & Lars Schmidt-Thieme

References

Wu X, Zhu X, Wu GQ, Ding W (2014) Data mining with big data. IEEE Trans Knowl Data Eng 26(1):97–107
Article Google Scholar
Vlahogianni EI, Golias JC (2004) Short-term traffic forecasting: overview of objectives and methods. Transp Res Rev 24(5):533–557
Article Google Scholar
Ruby-Figueroa Ren, Saavedra Jorge, Bahamonde Natalia, Cassano Alfredo (2017) Permeate flux prediction in the ultrafiltration of fruit juices by ARIMA models. J Membr Sci 524:108–116
Article Google Scholar
Lippi M, Bertini M, Frasconi P (2013) Short-term traffic flow forecasting: an experimental comparison of time-series analysis and supervised learning. IEEE Trans Intell Transp Syst 14(2):871–882
Article Google Scholar
Strauman AS, Bianchi FM, Mikalsen KØ (2018) Classification of postoperative surgical site infections from blood measurements with missing data using recurrent neural networks. In: IEEE EMBS international conference on biomedical & health informatics (BHI), pp 307–310. https://doi.org/10.1109/BHI.2018.8333430
Zhong M, Sharma S, Lingras P (2004) Genetically designed models for accurate imputations of missing traffic counts. Transp Res Rec 1879:71–79
Article Google Scholar
Kumar L, Kumar M, Rath SK (2016) Maintainability prediction of web service using support vector machine with various kernel methods. Int J Syst Assur Eng Manag 2:1–18
Google Scholar
Baharaeen S, Masud AS (1986) A computer program for time series forecasting using single and double exponential smoothing techniques. Comput Ind Eng 11:151–155
Article Google Scholar
Holt CC (2004) Forecasting seasonals and trends by exponentially weighted moving averages. Int J Forecast 20:5–10
Article Google Scholar
Chen C, Kwon J, Rice J, Skabardonis A, Varaiya P (2003) Detectingerrors and imputing missing data for single-loop surveillance systems. Transp Res Rec J Board 1855:160–167
Article Google Scholar
Al Deek HM, Chandra CVSR (2004) New algorithms for filtering and imputation of real-time and archived dual-loop detector data in I-4 data warehouse. Transp Res Rec J Transp Res Board 1867:116–126
Article Google Scholar
Boyles S (2011) A comparison of interpolation methods for missing traffic volume data. In: Proceedings of the 90th annual meeting of the transportation research board, pp 23–27
Qu L, Li L, Zhang Y, Hu J (2009) PPCA-based missing data imputation for traffic flow volume: a systematical approach. IEEE Trans Intell Transp Syst 10(3):512–522
Article Google Scholar
Li Y, Li Z, Li L, Zhang Y (2013) Comparison on PPCA, KPPCA and MPPCA based missing data imputing for traffic flow. In: Proceedings of IEEE conference on intelligent transportation system, pp 1535–1540
Shi W, Zhu Y, Yu PS (2017) Temporal dynamic matrix factorization for missing data prediction in large scale coevolving time series. IEEE Access 4(99):6719–6732
Google Scholar
Cai Y, Tong H, Fan W, Ji P (2015) Fast mining of a network of coevolving time series. In: Proceedings of SIAM international conference data mining, pp 298–306
Si Z, Yu H, Ma Z (2016) Learning deep features for DNA methylation data analysis. IEEE Access 4:2732–2737
Article Google Scholar
Cands E, Romberg J, Tao T (2006) Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information. IEEE Trans Inf Theory 52(2):489–509
Article MathSciNet Google Scholar
Tipping ME (2001) Sparse Bayesian learning and the relevance vector machine. JMLR.org
Babacan SD, Molina R, Katsaggelos AK (2010) Bayesian compressive sensing using laplace priors. IEEE Trans Image Process 19(1):53–63
Article MathSciNet Google Scholar
Andrews DF, Mallows CL (1974) Scale mixtures of normal distributions. J R Stat Soc Ser B (Methodol) 36:99–102
MathSciNet MATH Google Scholar
Wipf D, Rao B (2007) An empirical Bayesian strategy for solving the simultaneous sparse approximation problem. IEEE Trans Signal Process 55(7):3704–3716
Article MathSciNet Google Scholar
Tropp JA, Gilbert AC, Strauss MJ (2006) Algorithms for simultaneous sparse approximation. Part I: greedy pursuit. Signal Process 86:572–588
Article Google Scholar
Cotter SF, Rao BD, Engan K, Kreutz-Delgado K (2005) Sparse solutions to linear inverse problems with multiple measurement vectors. IEEE Trans Signal Process 53:2477–2488
Article MathSciNet Google Scholar
Tropp JA, Gilbert AC, Strauss MJ (2006) Algorithms for simultaneous sparse approximation. Part II: convex relaxation. Signal Process 86:589–602
Article Google Scholar
Wipf DP, Rao BD (2007) An empirical Bayesian strategy for solving the simultaneous sparse approximation problem. IEEE Trans Signal Process 55:3704–3716
Article MathSciNet Google Scholar
Zhang Z, Rao BD (2011) Sparse signal recovery with temporally correlated source vectors using sparse Bayesian learning. IEEE J Sel Top Signal Process 5:912–926
Article Google Scholar
Zhang Z, Rao BD (2010) Sparse signal recovery in the presence of correlated multiple measurement vectors. In: Proceedings of ICASSP, Dallas, TX, USA, pp 3986–3989
Prasad R, Murphy CR, Rao BD (2014) Joint approximately sparse channel estimation and data detection in OFDM systems using sparse Bayesian learning. IEEE Trans Signal Process 62(14):3591–3603
Article MathSciNet Google Scholar
Chen Wei (2017) Simultaneous sparse Bayesian learning with partially shared support. IEEE Signal Process Lett 24(10):1641–1645
Article Google Scholar
Tipping ME (2001) Sparse Bayesian learning and the relevance vector machine. J Mach Learn Res 1:211–244
MathSciNet MATH Google Scholar
Rhee I, Shin M, Hong S (2009) Mobility traces. http://carwdad.org/ncsu/mobilitymodels/
Samuel M. Intel lab data. http://db.csail.mit.edu
Fonollosa J, Sheik S, Huerta R, Marco S (2015) Reservoir computing compensates slow response of chemosensor arrays exposed to fast varying gas concentrations in continuous monitoring. Sens Actuators B Chem 215:618–629
Article Google Scholar
Wu X, Liu M (2012) In-situ soil moisture sensing: Measurement scheduling and estimation using compressive sensing. In: Proceedings of the 11th ACM international conference on information processing in sensor networks, pp 1–12
Bishop CM (2006) Pattern recognition and machine learning. Springer, Berlin
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Communication Engineering, Army Engineering University, Nanjing, China
Xiaoxiang Song, Yan Guo, Ning Li & Peng Qian

Authors

Xiaoxiang Song
View author publications
You can also search for this author in PubMed Google Scholar
Yan Guo
View author publications
You can also search for this author in PubMed Google Scholar
Ning Li
View author publications
You can also search for this author in PubMed Google Scholar
Peng Qian
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yan Guo.

Additional information

This work is supported by the National Natural Science Foundation of China (61871400 and 61571463) and the Jiangsu Province Natural Science Foundation (BK20171401).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Song, X., Guo, Y., Li, N. et al. A novel approach for missing data prediction in coevolving time series. Computing 101, 1565–1584 (2019). https://doi.org/10.1007/s00607-018-0668-8

Download citation

Received: 12 March 2018
Accepted: 21 September 2018
Published: 26 September 2018
Issue Date: November 2019
DOI: https://doi.org/10.1007/s00607-018-0668-8

Keywords

Mathematics Subject Classification

035CC

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A novel approach for missing data prediction in coevolving time series

Abstract

Access this article

Similar content being viewed by others

Scalable recovery of missing blocks in time series with high and low cross-correlations

Using Lowly Correlated Time Series to Recover Missing Values in Time Series: A Comparison Between SVD and CD

Structure-aware decoupled imputation network for multivariate time series

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

A novel approach for missing data prediction in coevolving time series

Abstract

Access this article

Similar content being viewed by others

Scalable recovery of missing blocks in time series with high and low cross-correlations

Using Lowly Correlated Time Series to Recover Missing Values in Time Series: A Comparison Between SVD and CD

Structure-aware decoupled imputation network for multivariate time series

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation