Abstract
Image prediction is a very important step in image and video coding. LLE (locally linear embedding) is a famous algorithm of NLDR (nonlinear dimensionality reduction), and it is capable of projecting high dimensional image blocks into a low dimensional space of embedding. This paper is concerned with the image prediction by 1D representation of LLE algorithm. Two LLE algorithms have been studied. One is the general LLE algorithm, the other is the proposed distance-keeping based LLE algorithm, which has the merit of preserving the distance property in low dimensional space. 1D representation of LLE algorithms can hugely improve the CR (compression ratio). The training input and output of LLE algorithms are employed as training pair for ERA (embedding and reconstruction algorithm) of testing samples, and the training pair is as large as possible to overcome the inherent disadvantage of classical algorithms for image prediction, which only utilize the adjacent image blocks. Three stable ERAs have been proposed. The first is general ERA, the second is nearest neighbor based ERA, and the third is machine learning based ERA. The nearest neighbor based ERA has the best performance if the training samples are sufficient, while the machine learning based ERA has the best performance if the training samples are insufficient. Three DLAs (dictionary learning algorithms) for selecting training samples are presented. The first is exemplars based DLA, the second is K-means clustering based DLA, and the third is sparse representation based DLA. The K-means clustering based DLA has the best performance. A unified framework for intra-frame, inter-frame, multi-view, 3D and multi-view 3D image prediction, has been built according to the proposed algorithms. The performance of proposed algorithms for image prediction has been evaluated by simulation experimentations. The results of simulation experiments indicate that proposed algorithms are able to gain very high PSNR (peak signal to noise ratio). The results of simulation experiments also reveal that 1D representation of distance-keeping based LLE algorithm, machine learning based ERA, and K-means clustering based DLA are very effective and efficient for image prediction.
Similar content being viewed by others
References
Achanta R, Shaji A, Smith K, Lucchi A, Fua P, Susstrunk S (2012) SLIC Superpixels compared to state-of-the-art superpixel methods. IEEE Trans Pattern Anal Mach Intell 34(11):2274–2282
Ankur A, Bill T (2008) Multilevel image coding with hyperfeatures. Int J Comput Vis 78(1):15–27
Blasi SG, Mrak M, Izquierdo E (2015) Frequency-domain intra prediction analysis and processing for high-quality video coding. IEEE Trans Circ Syst Video Technol 25(5):798–811
Changshui Z (2004) Reconstruction and analysis of multi-pose face images based on nonlinear dimensionality reduction. Pattern Recogn 37:325–336
Chen J, Zengwei J, Cao H, Ma B (2013) Accelerated implementation of adaptive directional lifting-based discrete wavelet transform on GPU. Elsevier, Signal Processing: Image Communication 28:1202–1211
Cherigui S, Guillemot C, Thoreau D, Guillotel P, Perez P (2013) Correspondence map-aided neighbor embedding for image intra prediction. IEEE Trans Image Process 22(3):1161–1174
Chuohao Y, Parvez A, Kannan R (2011) Coding of image feature descriptors for distributed rate-efficient visual correspondences. Int J Comput Vis 94(3):267–281
De Abreu A, Frossard P, Pereira F (2015) Optimizing multiview video plus depth prediction structures for interactive multiview video streaming. IEEE J Sel Top Signal Process 9(3):487–500
Dey B, Kundu M (2015) Efficient foreground extraction from HEVC compressed video for application to real-time analysis of surveillance ‘big’ data. IEEE Trans Image Process 24(11):3574–3585
Farid MS, Lucenteforte M, Grangetto M (2015) Panorama view with spatiotemporal occlusion compensation for 3D video coding. IEEE Trans Image Process 24(1):205–219
Genaro D-S, German C-D, Principe JC (2012) Locally linear embedding based on correntropy measure for visualization and classification. Neurocomputing 80:19–30
Goulermas JY, Liatsis P, Zeng X-J, Cook P (2007) Density-driven generalized regression neural networks (DD-GRNN) for function approximation. IEEE Trans Neural Netw 18(6):1683–1696
Gudivada VN, Baeza-Yates R, Raghavan VV (2015) Big data: promises and problems. Computer 48(3):20–23
Guillemot C, Cherigui S, Thoreau D (2013) K-NN search using local learning based on regression for neighbor embedding-based image prediction. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp 2006–2010. Vancouver, Canada
Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504–507
Hu W, Cheung G, Ortega A (2015) Intra-prediction and generalized graph Fourier transform for image coding. IEEE Sig Process Lett 22(11):1913–1917
Jiang C, Nooshabadi S (2015) A scalable massively parallel motion and disparity estimation scheme for multiview video coding. IEEE Trans Circuits Syst Video Technol PP(99):1–1.
Kamisli F (2015) Block-based spatial prediction and transforms based on 2D Markov processes for image and Video compression. IEEE Trans Image Proc 24(4):1247–1260
Kouropteva O, Okun O, Pietikainen M (2005) Incremental locally linear embedding. Pattern Recogn 38(10):1764–1767
Martin A, Fuchs J-J, Guillemot C, Thoreau D (2007) Sparse representation for image prediction. In: Proceedings of European Signal Processing Conference, pp 1255–1259. Pozna, Poland
Merkle P, Muller K, Marpe D, Wiegand T (2015) Depth intra coding for 3D video based on geometric primitives. IEEE Trans Circuits Syst Video Technol PP(99):1–1
Mikhail Belkin, Partha Niyogi (2001) Laplacian eigenmaps and spectral techniques for embedding and clustering. In: Advances in neural information processing systems 14:586–691. MIT Press, Massachusetts
Monniga ND, Fornberga B, Meyerb FG (2014) Inverting nonlinear dimensionality reduction with scale-free radial basis function interpolation. Appl Comput Harmon Anal 37(1):162–170
Nichols JM, Bucholtz F, Nousain B (2011) Automated, rapid classification of signals using locally linear embedding. Expert Syst Appl 38(10):13472–13474
Philips P (2002) The gait identification challenge problem: data sets and baseline algorithm. In: Proceedings of international conference on pattern recognition, vol 1, pp 385–388. Quebec City, QC, Canada
Purica A, Mora E, Pesquet-Popescu B, Cagnazzo M, Ionescu B. (2015) Multiview plus depth video coding with temporal prediction view synthesis. IEEE Trans Circuits Syst Video Technol PP(99):1–1
Roweis ST (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290(5500):2323–2326
Ruifeng S, Wensheng C, Xueguang S (2014) Variable selection based on locally linear embedding mapping for near-infrared spectral analysis. Chemom Intell Lab Syst 131:31–36
Saul L, Roweis S (2003) Think globally, fit locally: unsupervised learning of nonlinear manifolds. J Mach Learn Res 4:119–155
Song X, Peng X, Xu J, Shi G, Wu F (2015) Cloud-based distributed image coding. IEEE Trans Circuits Syst Video Technol PP(99):1–1
Tan TK, Boon CS, Suzuki Y (2006) Intra prediction by template matching. In: Proceedings of IEEE International Conference on Image Processing, pp 1693–1696. Atlanta, GA
Tang J, Li Z, Wang M, Zhao R (2015) Neighborhood discriminant Hashing for large-scale image retrieval. IEEE Trans Image Process 24(9):2827–2840
Tao H, Huang TS (2002) Visual estimation and compression of facial motion parameters—elements of a 3D model-based video coding system. Int J Comput Vis 50(2):111–125
Tenenbaum JB, de Silva V, Langford JC (2000) A global geometric framework for nonlinear dimensionality reduction. Science 290(5500):2319–2323
Tiirkan M, Guillemot C (2012) Image prediction based on neighbor-embedding methods. IEEE Trans Image Process 21(4):1885–1898
Timo D, Falko S, Wolfgang F (2011) Coding images with local features. Int J Comput Vis 94(2):154–174
Trocan M, Tramel EW, Fowler JE, Pesquet B (2014) Compressed-sensing recovery of multiview image and video sequences using signal prediction. Multimedia Tools Appl 72:95–121
Xiaoming Z, Shiqing Z (2012) Facial expression recognition using local binary patterns and discriminant kernel locally linear embedding. EURASIP J Adv Signal Process 2012:1–9
Yan C, Zhang Y, Xu J, Dai F, Liang L, Dai Q, Wu F (2014) A highly parallel framework for HEVC coding unit partitioning tree decision on many-core processors. IEEE Sig Process Lett 21(5):573–576
Yan C, Zhang Y, Xu J, Dai F, Zhang J, Dai Q, Wu F (2014) Efficient parallel framework for HEVC motion estimation on many-core processors. IEEE Trans Circuits Syst Video Technol 24(12):2077–2089
Yeh C-H, Tseng T-Y, Lee C-W, Lin C-Y (2015) Predictive texture synthesis-based intra coding scheme for advanced video coding. IEEE Trans Multimedia 17(9):1508–1514
Yin Z, Barner KE (2013) Locality constrained dictionary learning for nonlinear dimensionality reduction. IEEE Sig Process Lett 20(4):335–338
Yong R (2014) Big data and image search. IEEE MultiMed 21(3):2–3
Zhang Y, Kwong S, Xu W, Yuan H, Pan Z, Xu L (2015) Machine learning-based coding unit depth decisions for flexible complexity allocation in high efficiency video coding. IEEE Trans Image Process 24(7):2225–2238
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Li, H. 1D representation of locally linear embedding for image prediction. Multimed Tools Appl 76, 8651–8676 (2017). https://doi.org/10.1007/s11042-016-3491-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-016-3491-2