skip to main content
10.1145/3144457.3144474acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmobiquitousConference Proceedingsconference-collections
research-article

Interpolating the Missing Values for Multi-Dimensional Spatial-Temporal Sensor Data: A Tensor SVD Approach

Published: 07 November 2017 Publication History

Abstract

With the booming of the Internet of Things, enormous number of smart devices/sensors have been deployed in the physical world to monitor our surroundings. Usually those devices generate high-dimensional geo-tagged time-series data. However, these sensor readings are easily missing due to the hardware malfunction, connection errors or data corruption, which severely compromise the back-end data analysis. To solve this problem, in this paper we exploit tensor-based Singular Value Decomposition method to recover the missing sensor readings. The main novelty of this paper lies in that, i) our tensor-based recovery method can well capture the multi-dimensional spatial and temporal features by transforming the irregularly deployed sensors into a sensor-array and folding the periodic temporal patterns into multiple time dimensions, ii) it only requires to tune one key parameter in an unsupervised manner, and iii) Tensor Singular Value Decomposition structure is more efficient on representation of high-dimension sensor data than other tensor recovery methods based on tensor's vectorization or flattening. The experimental results in several real-world one-year air quality and meteorology datasets demonstrate the effectiveness and accuracy of our approach.

References

[1]
Evrim Acar, Daniel M Dunlavy, Tamara G Kolda, and Morten Mørup. 2011. Scalable tensor factorizations for incomplete data. Chemometrics and Intelligent Laboratory Systems 106, 1 (2011), 41--56.
[2]
Stephen Boyd, Neal Parikh, Eric Chu, Borja Peleato, and Jonathan Eckstein. 2011. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends® in Machine Learning 3, 1 (2011), 1--122.
[3]
Jian-Feng Cai, Emmanuel J Candès, and Zuowei Shen. 2010. A singular value thresholding algorithm for matrix completion. SIAM Journal on Optimization 20, 4 (2010), 1956--1982.
[4]
Emmanuel J Candès and Benjamin Recht. 2009. Exact matrix completion via convex optimization. Foundations of Computational mathematics 9, 6 (2009), 717--772.
[5]
Yudong Chen, Ali Jalali, Sujay Sanghavi, and Constantine Caramanis. 2013. Low-rank matrix recovery from errors and erasures. Information Theory, IEEE Transactions on 59, 7 (2013), 4324--4337.
[6]
Yudong Chen, Huan Xu, Constantine Caramanis, and Sujay Sanghavi. 2011. Robust matrix completion with corrupted columns. arXiv preprint arXiv:1102.2254 (2011).
[7]
Zhe Chen and Andrzej Cichocki. 2005. Nonnegative matrix factorization with temporal smoothness and/or spatial decorrelation constraints. Laboratory for Advanced Brain Signal Processing, RIKEN, Tech. Rep 68 (2005).
[8]
Andrzej Cichocki, Rafal Zdunek, Anh Huy Phan, and Shun-ichi Amari. 2009. Nonnegative matrix and tensor factorizations: applications to exploratory multiway data analysis and blind source separation. John Wiley & Sons.
[9]
Li Da Xu, Wu He, and Shancang Li. 2014. Internet of things in industries: a survey. Industrial Informatics, IEEE Transactions on 10, 4 (2014), 2233--2243.
[10]
Gregory Ely, Shuchin Aeron, Ning Hao, and Misha E Kilmer. 2015. 5D seismic data completion and denoising using a novel class of tensor decompositions. Geophysics 80, 4 (2015), V83--V95.
[11]
Silvia Gandy, Benjamin Recht, and Isao Yamada. 2011. Tensor completion and low-n-rank tensor recovery via convex optimization. Inverse Problems 27, 2 (2011), 025010.
[12]
Mohinder S Grewal. 2011. Kalman filtering. Springer.
[13]
Le Gruenwald, Md Shiblee Sadik, Rahul Shukla, and Hanqing Yang. 2010. DEMS: a data mining based technique to handle missing data in mobile sensor network applications. In Proceedings of the Seventh International Workshop on Data Management for Sensor Networks. ACM, 26--32.
[14]
Tamir Hazan, Simon Polak, and Amnon Shashua. 2005. Sparse image coding using a 3D non-negative tensor factorization. In Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on, Vol. 1. IEEE, 50--57.
[15]
Christopher J Hillar and Lek-Heng Lim. 2013. Most tensor problems are NP-hard. Journal of the ACM (JACM) 60, 6 (2013), 45.
[16]
Misha E Kilmer, Karen Braman, Ning Hao, and Randy C Hoover. 2013. Third-order tensors as operators on matrices: A theoretical and computational framework with applications in imaging. SIAM J. Matrix Anal. Appl. 34, 1 (2013), 148--172.
[17]
Misha E Kilmer and Carla D Martin. 2011. Factorization strategies for third-order tensors. Linear Algebra Appl. 435, 3 (2011), 641--658.
[18]
Olga Klopp, Karim Lounici, and Alexandre B Tsybakov. 2014. Robust matrix completion. arXiv preprint arXiv:1412.8132 (2014).
[19]
Tamara G Kolda and Brett W Bader. 2009. Tensor decompositions and applications. SIAM review 51, 3 (2009), 455--500.
[20]
Nadia Kreimer, Aaron Stanton, and Mauricio D Sacchi. 2013. Tensor completion based on nuclear norm minimization for 5D seismic data reconstruction. Geophysics 78, 6 (2013), V273--V284.
[21]
Daniel Kressner, Michael Steinlechner, and Bart Vandereycken. 2014. Low-rank tensor completion by Riemannian optimization. BIT Numerical Mathematics 54, 2 (2014), 447--468.
[22]
Adrian S Lewis. 1995. The convex analysis of unitarily invariant matrix functions. Journal of Convex Analysis 2, 1 (1995), 173--183.
[23]
Ji Liu, Przemyslaw Musialski, Peter Wonka, and Jieping Ye. 2009. Tensor completion for estimating missing values in visual data. In Computer Vision, 2009 IEEE 12th Intl. Conference on. 2114--2121.
[24]
Ji Liu, Przemyslaw Musialski, Peter Wonka, and Jieping Ye. 2013. Tensor completion for estimating missing values in visual data. Pattern Analysis and Machine Intelligence, IEEE Transactions on 35, 1 (2013), 208--220.
[25]
Jie Lu, Dianshuang Wu, Mingsong Mao, Wei Wang, and Guangquan Zhang. 2015. Recommender system application developments: a survey. Decision Support Systems 74 (2015), 12--32.
[26]
Hao Ma, Irwin King, and Michael R Lyu. 2007. Effective missing data prediction for collaborative filtering. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 39--46.
[27]
Benjamin M Marlin, Richard S Zemel, Sam T Roweis, and Malcolm Slaney. 2011. Recommender Systems, Missing Data and Statistical Model Estimation. In IJCAI proceedings-international joint conference on artificial intelligence, Vol. 22. 2686.
[28]
Carla D Martin, Richard Shafer, and Betsy LaRue. 2013. An order-p tensor factorization with applications in imaging. SIAM Journal on Scientific Computing 35, 1 (2013), A474--A490.
[29]
Mohamad J Noori, Hussein H Hassan, and Yaseen T Mustafa. 2014. Spatial Estimation of Rainfall Distribution and Its Classification in Duhok Governorate Using GIS. Journal of Water Resource and Protection 6, 02 (2014), 75.
[30]
Anthony M Norcia, Maureen Clarke, and Christopher W Tyler. 1985. Digital filtering and robust regression techniques for estimating sensory thresholds from the evoked potential. IEEE Engineering in Medicine and Biology Magazine 4, 4 (1985), 26--32.
[31]
Liqiang Pan and Jianzhong Li. 2010. K-nearest neighbor based missing data estimation algorithm in wireless sensor networks. Wireless Sensor Network 2, 02 (2010), 115.
[32]
Wenjie Ruan, Peipei Xu, Quan Z Sheng, Nickolas JG Falkner, Xue Li, and Wei Emma Zhang. 2017. Recovering Missing Values from Corrupted Spatio-Temporal Sensory Data via Robust Low-Rank Tensor Completion. In International Conference on Database Systems for Advanced Applications. Springer, 607--622.
[33]
Wenjie Ruan, Peipei Xu, Quan Z Sheng, Nguyen Khoi Tran, Nickolas JG Falkner, Xue Li, and Wei Emma Zhang. 2016. When sensor meets tensor: filling missing sensor values through a tensor approach. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. ACM, 2025--2028.
[34]
Marco Signoretto, Lieven De Lathauwer, and Johan AK Suykens. 2010. Nuclear norms for tensors and their use for convex multilinear estimation. Linear Algebra and Its Applications 43 (2010).
[35]
Marco Signoretto, Raf Van de Plas, Bart De Moor, and Johan AK Suykens. 2011. Tensor versus matrix completion: a comparison with application to spectral data. Signal Processing Letters, IEEE 18, 7 (2011), 403--406.
[36]
Laurent Sorber, Marc Van Barel, and Lieven De Lathauwer. 2013. Optimization-based algorithms for tensor decompositions: Canonical polyadic decomposition, decomposition in rank-(L_r, L_r,1) terms, and a new generalization. SIAM Journal on Optimization 23, 2 (2013), 695--720.
[37]
Jimeng Sun, Dacheng Tao, and Christos Faloutsos. 2006. Beyond streams and graphs: dynamic tensor analysis. In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 374--383.
[38]
Yufei Tao, Jimeng Sun, and Dimitris Papadias. 2003. Analysis of predictive spatio-temporal queries. ACM Transactions on Database Systems (TODS) 28, 4 (2003), 295--336.
[39]
Kim-Chuan Toh and Sangwoon Yun. 2010. An accelerated proximal gradient algorithm for nuclear norm regularized linear least squares problems. Pacific Journal of Optimization 6, 615-640 (2010), 15.
[40]
Jun Wang, Arjen P De Vries, and Marcel JT Reinders. 2006. Unifying user-based and item-based collaborative filtering approaches by similarity fusion. In Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 501--508.
[41]
Tingting Wu and Yingru Li. 2013. Spatial interpolation of temperature in the United States using residual kriging. Applied Geography 44 (2013), 112--120.
[42]
Zhe Yang, Bing Wu, Kan Zheng, Xianbin Wang, and Lei Lei. 2016. A Survey of Collaborative Filtering-Based Recommender Systems for Mobile Internet Applications. IEEE Access 4 (2016), 3273--3287.
[43]
Xiuwen Yi, Yu Zheng, Junbo Zhang, and Tianrui Li. 2016. ST-MVL: Filling Missing Values in Geo-sensory Time Series Data. In IJCAI 2016.
[44]
Tatsuya Yokota, Qibin Zhao, Chao Li, and Andrzej Cichocki. 2015. Smooth PARAFAC Decomposition for Tensor Completion. arXiv preprint arXiv:1505.06611 (2015).
[45]
Ceylan Yozgatligil, Sipan Aslan, Cem Iyigun, and Inci Batmaz. 2013. Comparison of missing value imputation methods in time series: the case of Turkish meteorological data. Theoretical and applied climatology 112, 1-2 (2013), 143--167.
[46]
Zemin Zhang and Shuchin Aeron. 2015. Exact tensor completion using t-SVD. arXiv preprint arXiv:1502.04689 (2015).
[47]
Zemin Zhang, Gregory Ely, Shuchin Aeron, Ning Hao, and Misha Kilmer. 2014. Novel methods for multilinear data completion and de-noising based on tensor-SVD. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3842--3849.

Cited By

View all
  • (2022)A Robust Latent Factor Analysis Model for Incomplete Data Recovery in Wireless Sensor Networks2022 IEEE International Conference on Edge Computing and Communications (EDGE)10.1109/EDGE55608.2022.00033(178-183)Online publication date: Jul-2022
  • (2021)Predicting Human Behavior with Transformer Considering the Mutual Relationship between Categories and Regions2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)10.1109/MIPR51284.2021.00029(144-150)Online publication date: Sep-2021
  • (2020)Automated Activity Identification for Construction Equipment Using Motion Data From Articulated MembersFrontiers in Built Environment10.3389/fbuil.2019.001445Online publication date: 9-Jan-2020
  • Show More Cited By

Index Terms

  1. Interpolating the Missing Values for Multi-Dimensional Spatial-Temporal Sensor Data: A Tensor SVD Approach

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      MobiQuitous 2017: Proceedings of the 14th EAI International Conference on Mobile and Ubiquitous Systems: Computing, Networking and Services
      November 2017
      555 pages
      ISBN:9781450353687
      DOI:10.1145/3144457
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      In-Cooperation

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 07 November 2017

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. ADMM
      2. Sensor Data Recovery
      3. Tensor Completion
      4. t-SVD

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Conference

      MobiQuitous 2017
      MobiQuitous 2017: Computing, Networking and Services
      November 7 - 10, 2017
      VIC, Melbourne, Australia

      Acceptance Rates

      Overall Acceptance Rate 26 of 87 submissions, 30%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)24
      • Downloads (Last 6 weeks)4
      Reflects downloads up to 01 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)A Robust Latent Factor Analysis Model for Incomplete Data Recovery in Wireless Sensor Networks2022 IEEE International Conference on Edge Computing and Communications (EDGE)10.1109/EDGE55608.2022.00033(178-183)Online publication date: Jul-2022
      • (2021)Predicting Human Behavior with Transformer Considering the Mutual Relationship between Categories and Regions2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)10.1109/MIPR51284.2021.00029(144-150)Online publication date: Sep-2021
      • (2020)Automated Activity Identification for Construction Equipment Using Motion Data From Articulated MembersFrontiers in Built Environment10.3389/fbuil.2019.001445Online publication date: 9-Jan-2020
      • (2020)Sensor data quality: a systematic reviewJournal of Big Data10.1186/s40537-020-0285-17:1Online publication date: 11-Feb-2020
      • (2020)Anomaly detection frameworks for outlier and pattern anomaly of time series in wireless sensor networks2020 International Conference on Networking and Network Applications (NaNA)10.1109/NaNA51271.2020.00046(229-232)Online publication date: Dec-2020

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media