Abstract
Sparse coding, often called dictionary learning, has received significant attention in the fields of statistical machine learning and signal processing. However, most approaches assume iid data setup, which can be easily violated when the data retains certain statistical structures such as sequences where data samples are temporally correlated. In this paper we formulate a novel dynamic sparse coding problem, and propose an efficient algorithm that enforces smooth dynamics for the latent state vectors (codes) within a linear dynamic model while imposing sparseness of the state vectors. We overcome the added computational overhead originating from smooth dynamic constraints by adopting the recent first-order smooth optimization technique, adjusted for our problem instance. We demonstrate the improved prediction performance of our approach over the conventional sparse coding on several interesting real-world problems including financial asset return data forecasting and human motion estimation from silhouette videos.
Similar content being viewed by others
Notes
It should not incur any theoretical or practical concern because one can pre-scale the entries of B in accordance with the chosen values of c.
They are publicly available from: http://www.mysmu.edu.sg/faculty/chhoi/olps/datasets.html, and the detailed description can be found in [21].
References
Bar-Shalom Y, Li XR (1993) Estimation and tracking: principles, techniques, and software. Artech House, Boston
Beck A, Teboulle M (2009) A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J Imaging Sci 2(1):183–202
Bengio Y (2009) Learning deep architectures for ai. Found Trends Mach Learn 2(1):1–127
Boyd S, Parikh N, Chu E, Peleato B, Eckstein J (2011) Distributed optimization and statistical learning via the alternating direction method of multipliers. Found Trends Optim 3(1):1–122
Chen SS, Donoho DL, Saunders MA (1998) Atomic decomposition by basis pursuit. SIAM J Sci Comput 20(1):33–61
Cho K, Merrienboer BV, Gulcehre C, Bougares F, Schwenk H, Bengio Y (2014) Learning phrase representations using rnn encoder- decoder for statistical machine translation. arXiv:1406.1078
Chung J, Gulcehre C, Cho K, Bengio Y (2015) Gated feedback recurrent neural networks. International Conference on Machine Learning (ICML)
d’Aspremont A, Banerjee O, Ghaoui LE (2008) First-order methods for sparse covariance selection. SIAM J Matrix Anal Appl 30(56):367–385
Deb K, Padhye N (2014) Enhancing performance of particle swarm optimization through an algorithmic link with genetic algorithms. Comput Optim Appl 57(3):761–794
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc 39:1–38
Donoho DL (2006) Compressed sensing. IEEE Trans Inf Theory 52(4):1289–1306
Efron B, Hastie T, Johnstone I, Tibshirani R (2004) Least angle regression. Ann Statist 32(2):407–499
Fu W, Wang J, Lu H, Ma S (2013) Dynamic scene understanding by improved sparse topical coding. Pattern Recogn 46(7):1841–1850
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
Hsaio WH, Liu CL, Wu WL (2017) Locality-constrained max-margin sparse coding. Pattern Recogn 65:285–295
Huang Z, Liu Y, Li X, Li J (2015) An adaptive bimodal recognition framework using sparse coding for face and ear. Pattern Recogn Lett 53(1):69–76
Kim SJ, Koh K, Lustig M, Boyd S, Gorinevsky D (2007) An interior-point method for large-scale l1-regularized least squares. IEEE J Select Topics Signal Process 1(4):606–617
Lee H, Battle A, Raina R, Ng AY (2007) Efficient sparse coding algorithms. Advances in Neural Information Processing Systems (NIPS) 19
Lemaréchal C, Sagastizábal C (1997) Practical aspects of the Moreau–Yosida regularization: theoretical preliminaries. SIAM J Optim 7(2):367–385
Lewicki MS, Sejnowski TJ (2000) Learning overcomplete representations. Neural Comput 12(2):337–365
Li B, Hoi SC (2012) On-line portfolio selection: a survey. Tech. rep., Nanyang Technological University
Li Y, Ngom A (2013) Sparse representation approaches for the classification of high-dimensional biological data. BMC Syst Biol 6:1–14
Liu S, Liu M, Yang Z (2017) Sparse coding based orientation estimation for latent fingerprints. Pattern Recogn 67:164–176
Lu Z, Wang L (2015) Noise-robust semi-supervised learning via fast sparse coding. Pattern Recogn 48 (2):605–612
Mairal J, Bach F, Ponce J, Sapiro G (2009) Online dictionary learning for sparse coding. In: Proceedings of the 26th annual international conference on machine learning
Nesterov Y (2005) Smooth minimization of non-smooth functions. Math Program 103(1):127–152
Nesterov Y, Nemirovsky A (1994) Interior-point polynomial methods in convex programming. Studies in applied mathematics. SIAM, Philadelphia, p 13
Olshausen B, Field D (2004) Sparse coding of sensory inputs. Curr Opin Neurobiol 14(4):481–487
Olshausen BA, Field DJ (1996) Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature 381:607–609
Padhye N, Bhardawaj P, Deb K (2013) Improving differential evolution through a unified approach. J Glob Optim 55(4):771–799
Parikh N, Boyd S (2014) Proximal algorithms. Found Trends Optim 1(3):123–231
Paul R, Wohlberg B (2009) A generalized vector-valued total variation algorithm. In: Proceedings of the 16th IEEE international conference on image processing
Perkins S, Theiler J (2003) Online feature selection using grafting. In: International conference on machine learning (ICML)
Polyak BT (1987) Introduction to optimization. Optimization Software, New York
Srinivas S, Subramanya A, Babu RV (2017) Training sparse neural networks. In: IEEE International conference on computer vision and pattern recognition workshops
Tian TP, Li R, Sclaroff S (2005) Articulated pose estimation in a learned smooth space of feasible solutions. In: Proceedings of IEEE workshop in CVPR
Tibshirani R (1994) Regression shrinkage and selection via the lasso. J R Statist Soc Series B 58:267–288
Wahlberg B, Boyd S, Annergren M, Wang Y (2012) An ADMM algorithm for a class of total variation regularized estimation problems. In: Proceedings of the 16th IFAC symposium on system identification
Wright S (1997) Primal-dual interior-point methods. Society for industrial and applied mathematics. SIAM, Philadelphia
Yang AY, Zhou Z, Ganesh A, Sastry SS, Ma Y (2013) Fast l 1-minimization algorithms for robust face recognition. IEEE Trans Image Process 22(8):3234–3246
Ye F, Zhang L, Zhang D, Fujita H, Gong Z (2016) A novel forecasting method based on multi-order fuzzy time series and technical analysis. Inf Sci 367–368:41–57
Ye Y (1997) Interior point algorithms: theory and analysis. Wiley, New York
Yu YF, Dai DQ, Ren CX, Huang KK (2017) Discriminative multi-scale sparse coding for single-sample face recognition with occlusion. Pattern Recogn 66:302–312
Yuan Z, Lu T, Tan CL (2017) Learning discriminated and correlated patches for multi-view object detection using sparse coding. Pattern Recogn 69:26–38
Zhu J, Xing EP (2011) Sparse topical coding. In: Proceedings of the conference on uncertainty in artificial intelligence
Funding
This work is supported by National Research Foundation of Korea (NRF-2016R1A1A1A05921948).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interests
The authors have no conflict of interest.
Ethical Approval
This research does not involve human participants nor animals.
Consent for Publication
Consent to submit this manuscript has been received tacitly from the authors’ institution, Seoul National University of Science & Technology.
Rights and permissions
About this article
Cite this article
Kim, M. Dynamic sparse coding for sparse time-series modeling via first-order smooth optimization. Appl Intell 48, 3889–3901 (2018). https://doi.org/10.1007/s10489-018-1189-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-018-1189-z