Abstract
This paper mainly focuses on the principle component analysis (PCA) and its applications on vision based computing. The underlying mechanism of PCA given and several significant factors, involved with subspace training are discussed theoretically in detail including principle components energy, residuals assessment, and decomposition computation. The typical extensions, including probabilistic PCA (PPCA), kernel PCA (KPCA), multi-dimensional PCA and robust PCA (RPCA), have been presented with critical analysis on both mechanisms and computations. Combining with the studies on, such as, image compression, visual tracking, image recognition and super-resolution image reconstruction, PCA and its extensions applied to computer vision are critically reviewed and evaluated on the targeted issues in each application as well as the role they played at specific tasks to the characteristics, cost and suitable situations of each PCA based vision processing technique.
Similar content being viewed by others
References
Abdi H, Williams LJ (2010) Principal component analysis. Wiley Interdiscip Rev Comput Stat 2(4):433–459
Alex M, Vasilescu O, Demetri T (2002) Multilinear analysis of image ensembles: Tensorfaces, Computer Vision-ECCV 2002. Springer, Berlin, pp 447–460
An L, Kafai M, Bhanu B (2013) Dynamic bayesian network for unconstrained face recognition in surveillance camera networks. IEEE Journal on Emerging and Selected Topics in Circuits and Systems 3(2):155–164
Anthony D, Hines E, Barham J, Taylor D (1990) A comparison of image compression by neural networks and principal component analysis. International Joint Conference on Neural Networks 1:339–344
Arias-Estrada M, Rodrguez-Palacios E (2002) An FPGA co-processor for real-time visual tracking, Field-Programmable Logic and Applications: Reconfigurable Computing Is Going Mainstream. Springer, Berlin, pp 710–719
Atallah MJ (1983) A linear time algorithm for the Hausdorff distance between convex polygons. Inf Process Lett 17(4):207–209
Bae C, Yoo J, Kang K, Choe Y, Lee J (2003) Home server for home digital service environments. IEEE Trans Consum Electron 49(4):1129–1135
Bahl P, Han RY, Li LE, Satyanarayanan M (2012) Advancing the state of mobile cloud computing. In: Proceedings of the third ACM workshop on Mobile cloud computing and services. ACM, pp 21–28
Belimpasakis P, Walsh R (2011) A combined mixed reality and networked home approach to improving user interaction with consumer elec-tronics. IEEE Trans Consum Electron 57(1):139–144
Bravo I, Mazo M, Lzaro JL et al (2010) An intelligent architecture based on field programmable gate arrays designed to detect moving objects by using principal component analysis. Sensors 10(10):9232–9251
Butterworth S (1930) On the Theory of Filter Amplifiers. Experimental Wireless and the Wireless Engineer 7:536–541
Cai JF, Cands EJ, Shen ZA (2010) A singular value thresholding algorithm for matrix completion. SIAM J Optim 20(4):1956–1982
Cands EJ, Li X, Ma Y, Wright J (2011) Robust principal component analysis? J ACM (JACM) 58(3):1–39
Capel D, Zisserman A (2001) Super-resolution from multiple views using learnt image models. In: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol 2. IEEE, pp II–627
Castaneda B, Yuriy L, Cockburn JC (2004) Implementation of a Modular Real-Time Feature-Based Architecture Applied to Visual Face Tracking. Int Conf Pattern Recog 4:167–170
Chakrabarti A, Rajagopalan AN, Chellappa R (2007) Super-resolution of face images using kernel PCA-based prior. IEEE Trans Multimed 9(4):888–892
Chakrabarti A, Rajagopalan AN, Chellappa R (2007) Super-resolution of face images using kernel PCA-based prior. IEEE Trans Multimed 9(4):888–892
Chakrabarti A, Rajagopalan AN, Chellappa R (1999) Superresolution restoration of an image sequence: adaptive filtering approach. IEEE Trans Image Process 8(3):387–395
Chen CH, Pau LF, Wang PSP (eds) (2010) Handbook of pattern recognition and computer vision. Imperial College Press
Cheng KT, Wang Y-C (2011) Using mobile GPU for general-purpose computing-a case study of face recognition on smartphones. In: International Symposium on VLSI Design, Automation and Test. IEEE, pp 1–4
Cho Y, Lim SO, Yang HS (2010) Collaborative occupancy reasoning in visual sensor network for scalable smart video surveillance. IEEE Trans Consum Electron 56 (3):1997–2003
Cho J, Mirzaei S, Oberg J, Kastner R (2009) Fpga-based face detection system using haar classifiers. In: Proceedings of the ACM/SIGDA international symposium on Field programmable gate arrays. ACM, pp 103–112
Clausen C, Wechsler H (2000) Color image compression using PCA and backpropagation learning. Pattern Recog 33(9):1555–1560
Cruz-Mota J, Bierlaire M, Thiran J-P (2013) Sample and pixel weighting strategies for robust incremental visual tracking. IEEE Transaction on Circuit and System for Video Technology 23(5):898–911
Dai F, Park MW, Sandidge M, Brilakis I (2015) A vision-based method for on-road truck height measurement in proactive prevention of collision with overpasses and tunnels. Autom Constr 50:29–39
Davis M, Smith M, Canny J, Good N, King S, Janakiraman R (2005) Towards context-aware face recognition. In: Proceedings of the 13th annual ACM international conference on Multimedia. ACM, pp 483–486
De Cristforis P, Nitschea M, Krajnk T, Pirea T, Mejail M (2015) Hybrid vision-based navigation for mobile robots in mixed indoor/outdoor environments. Pattern Recogn Lett 53:118–128
De la Torre F, Black MJ (2001) Robust principal component analysis for computer vision. Eighth IEEE International Conference on Computer Vision, Proceedings, IEEE 1:362–369
De la Torre F, Black MJ (2001) Robust principal component analysis for computer vision. Proceedings of International Conference on Computer Vision (Vancouver, BC.) 1:362–369
Du Q, James FE (2007) Hyperspectral image compression using JPEG2000 and principal component analysis. IEEE Geosci Remote Sens Lett 4(2):201–205
Du Q, Starkville MS, Ly N, Fowler JE (2013) An Operational Approach to PCA+JPEG2000 Compression of Hyperspectral Imagery. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 7(6):2237–2245
Eerenberg O, Aarts RM, de With PHN (2014) PVR system design of advanced video navigation reinforced with audible sound. IEEE Trans Consum Electron 60 (4):681–689
Elad M, Feuer A (1999) Super-resolution reconstruction of image sequences. IEEE Transactions on Pattern Analysis and Machine Intelligence 21(9):817–834
Farsiu S, Robinson MD, Elad M, Milanfar P (2004) Fast and robust multiframe super resolution. IEEE Trans Image Process 13(10):1327–1344
Gai J, Stevenson RL (2011) Studentized dynamical system for robust object tracking. IEEE Transaction on Image Processing 20(1):186–199
Gaidhane V, Singh V, Kumar M (2010) Image compression using PCA and improved technique with MLP neural network. In: International Conference on Advances in Recent Technologies in Communication and Computing, pp 978–0
GlÖsmann P (2005) Nonlinear system analysis with Karhunen-Love transform. Nonlinear Dyn 41(1–3):111–128
Gorur P, Amrutur B (2014) Skip Decision and Reference Frame Selection for Low Complexity H. 264/AVC Surveillance Video Coding. IEEE Transaction on Circuits and Systems for Video Technology 24(7):1156–1169
Han D, Rao YN, Principe JC, Gugel K (2004) Real-time PCA (principal component analysis) implementation on DSP. In: Proceedings of IEEE International Joint Conference on Neural Networks, vol 3. IEEE, pp 2159–2162
Hong H, Yang X, You Z, Cheng F (2014) Visual quality detection of aquatic products using machine vision. Aquac Eng 63:62–71
Hossain MS (2014) QoS-aware service composition for distributed video surveillance. Multimedia Tools and Applications 73(1):169–188
Hotelling H (1933) Analysis of a complex of statistical variables into principal components. Journal of educational psychology 24(6):417
Huang Y-S, Chieu B-C (2011) Architecture for video coding on a processor with an ARM and DSP cores. Multimedia Tools and Applications 54(2):527–543
Hung KW, Siu WC (2012) Single image super-resolution using iterative Wiener filter. In: Processing IEEE International Conference on Acoustics, Speech and Signal, pp 1269–1272
Hyvärinen L (1970) Principal component analysis, Mathematical Modeling for Industrial Processes. Springer, Berlin, pp 82–104
Indelman V, Gurfil P, Rivlin E, Rotstein H (2012) Distributed vision-aided cooperative localization and navigation based on three-view geometry. Robot Auton Syst 60(6):822–840
Jalal A, Uddin MZ, Kim TS (2012) Depth video-based human activity recognition system using translation and scaling invariant features for life logging at smart home. IEEE Trans Consum Electron 58(3):863–871
Jolliffe I (2002) Principal component analysis. Wiley
Karsch K, Liu C, Kang SB (2012) Depth extraction from video using non-parametric sampling. European Conference on Computer Vision:775–788
Kasai H (2013) Development and benchmarking of HTTP-based multivision video server using fast stream joiner. IEEE Trans Consum Electron 59(2):343–351
Khan AA, Yao Z, Thomas HK (2015) Context Aware Indoor Route Planning Using Semantic 3D Building Models with Cloud Computing, 3D Geoinformation Science. Springer, pp 175–192
Kim KI, Franz MO, Schölkopf B (2004) Kernel hebbian algorithm for single-frame super-resolution. Statistical Learning in Computer Vision, ECCV 2004 Workshop, May:135–149
Kim J, Jun H (2008) Vision-based location positioning using augmented reality for indoor navigation. IEEE Trans Consum Electron 54(3):954–962
Kim KI, Keechul J, Kim HJ (2002) Face recognition using kernel principal component analysis. IEEE Signal Processing Letters 9(2):40–42
Kim Y, Kim Y, Kang N (2010) Multimedia push-to-talk service in home networks. IEEE Trans Consum Electron 56(3):1480–1486
Kim JO, Kim JS, Chung CH (2005) Face recognition by the LDA-Based algorithm for a video surveillance system on DSP. In: Proceeding of Computational Science and Its Applications. Springer, Berlin, pp 638–646
Kim KI, Matthias OF, Bernhard S (2005) Iterative kernel principal component analysis for image modeling. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(9):1351–1366
Kwon Y(J), Hong J (2014) Integrated remote control of the process capability and the accuracy of vision calibration. Robot Comput Integr Manuf 30:451–459
Kwon J, Lee KM (2010) Visual tracking decomposition. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 1269–1276
Lalonde M, Byrns D, Gagnon L, Teasdale N, Laurendeau D (2007) Real-time eye blink detection with GPU-based SIFT tracking. In: Fourth Canadian Conference on Computer and Robot Vision, pp 481–487
Lathauwer LD, Moor BD, Vandewalle J (2000) A multilinear singular value decomposition. SIAM journal on Matrix Analysis and Applications 21(4):1253–1278
Lee SH, Kim DJ, Cho JH (2012) Illumination-robust face recognition system based on differential components. IEEE Trans Consum Electron 58(3):963–970
Lei L, Kim SW, Park WJ, Kim DH, Ko SJ (2014) Eigen directional bit-planes for robust face recognition. IEEE Trans Consum Electron 60(4):702–709
Lim ST, Yap DFW, Manap NA (2014) A GUI system for region based image compression using Principal Component Analysis. IEEE International Conference on Computational Science and Technology (ICCST):1–4
Liu Y, Yu F (2014) Automatic inspection system of surface defects on optical IR-CUT filter based on machine vision. Opt Lasers Eng 55:243–257
Lopez S, Callico GM, Tobajas F, Lopez JF, Sarmiento R (2009) A novel real-time DSP-based video super-resolution system. IEEE Trans Consum Electron 55 (4):2264–2270
Lu H, Konstantinos NP, Anastasios NV (2008) MPCA: Multilinear principal component analysis of tensor objects. IEEE Transactions on Neural Networks 19 (1):18–39
Lu H, Plataniotis KN, Venetsanopoulos AN (2007) Boosting LDA with regularization on MPCA features for gait recognition. IEEE Biometrics Symposium:1–6
Marcellin MW (2002) JPEG2000 Image Compression Fundamentals, Standards and Practice: Image Compression Fundamentals, Standards, and Practice, vol 1. Springer Science & Business Media
Maugey T, Daribo I, Cheung G, Frossard P (2013) Navigation domain representation for interactive multiview imaging. IEEE Trans Image Process 22(9):3459–3472
Mehta SS, Burks TF (2014) Vision-based control of robotic manipulator for citrus harvesting. Comput Electron Agric 102:146–158
Mei X, Ling H (2009) Robust visual tracking using ℓ 1 minimization. In: 2009 IEEE 12th International Conference on Computer Vision, pp 1436–1443
Mohammed AA, Minhas R, Wu QM J, Sid-Ahmed MA (2011) Human face recognition based on multidimensional PCA and extreme learning machine. Pattern Recog 44(10):2588–2597
Moghaddam B, Pentland A (1995) Probabilistic visual learning for object detection. In: Proceedings of Fifth International Conference on Computer Vision. IEEE, pp 786–793
Moghaddam B, Pentland A (1997) Probabilistic visual learning for object representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 19 (7):696–710
Moore B (1981) Principal component analysis in linear systems: Controllability, observability, and model reduction. IEEE Trans Autom Control 26(1):17–32
Najim M (2008) Karhunen Loeve Transform. Modeling, Estimation and Optimal Filtering in Signal Processing:335–340
Neuvo Y, Dong C-Y, Sanjit KM (1984) Interpolated finite impulse response filters. IEEE Transactions on Speech and Signal Processing Acoustics 32(3):563–570
Nguyen N, Milanfar P (2000) A wavelet-based interpolation-restoration method for superresolution (wavelet superresolution). Circuits, Systems and Signal Processing 19(4):321–338
Nickels K, DiCicco M, Bajracharya M, Backes P (2010) Vision guided manipulation for planetary robotics - position control. Robot Auton Syst 58:121–129
Ohmer J, Maire F, Brown R (2005) Implementation of kernel methods on the GPU. In: Proceedings of Digital Image Computing: Techniques and Applications. IEEE, pp 78–78
Park SC, Park MK, Kang MG (2003) Super-resolution image reconstruction: a technical overview. IEEE Signal Proc Mag 20(3):21–36
Pearson KL III (1901) On lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science 2(11):559–572
Rahman M, Kehtarnavaz N (2008) Real-time face-priority auto focus for digital and cell-phone cameras. IEEE Trans Consum Electron 54(4):1506–1513
Roelofs G, Koman R (1999) PNG: the definitive guide. O’Reilly & Associates, Inc.
Ross D, Lim J, Lin R-S, Yang M-H (2008) Incremental learning for robust visual tracking. Int J Comput Vision 77(1):125–141
Ross D, Lim J, Yang M-H (2004) Adaptive probabilistic visual tracking with incremental subspace update, Computer Vision-ECCV. Springer, Berlin, pp 470–482
Rote G (1991) Computing the minimum Hausdorff distance between two point sets on a line under translation. Inf Process Lett 38(3):123–127
Saleh SAM, Suandi SA, Ibrahim H (2015) Recent survey on crowd density estimation and counting for visual surveillance. Eng Appl Artif Intell 41:103–114
Sazdovski V, Kitanov A, Petrovic I (2015) Implicit observation model for vision aided inertial navigation of aerial vehicles using single camera vector observations. Aerosp Sci Technol 40:33–46
Schölkopf B, Smola A, Mller KR (1997) Kernel principal component analysis, International Conference on Artificial Neural Networks. Springer, Berlin, pp 583–588
Schölkopf B, Smola A, Mller KR (1998) Nonlinear component analysis as a kernel eigenvalue problem. Neural Comput 10(5):1299–1319
Shen Y, Guturu P, Damarla T, Buckles BP, Namuduri K (2009) Video stabilization using principal component analysis and scale invariant feature transform in particle filter framework. IEEE Trans Consum Electron 55(3):1714–1721
Sodemann AA, Ross MP, Borghetti BJ (2012) A review of anomaly detection in automated surveillance. IEEE Trans Syst Man Cybern Part C Appl Rev 42(6):1257–1272
Soyata T, Muraleedharan R, Funai C, Kwon M, Heinzelman W (2012) Cloud-Vision: Real-time face recognition using a mobile-cloudlet-cloud acceleration architecture. In: IEEE Symposium on Computers and Communications. IEEE, pp 59–66
Stewart GW (1993) On the early history of the singular value decomposition. SIAM Rev 35(4):551–566
Storn R (1996) Differential evolution design of an IIR-filter. In: Proceedings of IEEE International Conference on Evolutionary Computation. IEEE, pp 268–273
Su X, Peng J, Feng X, Wu J (2015) Labeling faces with names based on the name semantic network. Multimedia Tools and Applications:1–18
Szydzik T, Callico GM, Nunez A (2011) Efficient FPGA implementation of a high-quality super-resolution algorithm with real-time perform-ance. IEEE Trans Consum Electron 57(2):664–672
Takyar U, Maugey T, Frossard P (2014) Extended layered depth image representation in multiview navigation. IEEE Signal Processing Letters 21(1):22–25
Tipping ME, Bishop CM (1999) Probabilistic principal component analysis. J R Stat Soc Ser B Stat Methodol 61(3):611–622
Tipping MiE, Bishop CM (1999) Mixtures of probabilistic principal component analyzers. Neural Comput 11(2):443–482
Tsuzuki G, Ye S, Berkowitz S (2002) Ultra-selective 22-pole 10-transmission zero superconducting bandpass filter surpasses 50-pole Chebyshev filter. IEEE Transactions on Microwave Theory and Techniques 50(12):2924–2929
Turk MA, Alex PP (1991) Face recognition using eigenfaces. In: Proceedings In Computer Society Conference on Computer Vision and Pattern Recognition, CVPR’91. IEEE, pp 586–591
Uddin MZ, Lee JJ, Kim TS (2009) An enhanced independent component-based human facial expression recognition from video. IEEE Trans Consum Electron 55 (4):2216–2224
Van Loan CF (1976) Generalizing the singular value decomposition. SIAM J Numer Anal 13(1):76–83
Vasilescu M, Terzopoulos D (2003) Multilinear Subspace Analysis of Image Ensembles. In: Proceedings of IEEE Computer Vision and Pattern Recognition, pp 93–99
Verstockt S, Hoecke SV, De Potter P, Lambert P, Hollemeersch C, Sette B, Merci B, Van de Walle R (2014) Multi-modal time-of-flight based fire detection. Multimedia Tools and Applications 69(2):313–338
Visakhasart S, Chitsobhuk O (2009) Multi-Pipeline Architecture for face recognition on FPGA. In: Processing of International Conference on Digital Image. IEEE, pp 152–156
Vo DT, Lertrattanapanich S, Kim YT (2011) Low line memory visually lossless compression for color images using non-uniform quantizers. IEEE Trans Consum Electron 57(1):187–195
Vongbunyong S, Kara S, Pagnucco M (2015) Learning and revision in cognitive robotics disassembly automation. Robot Comput Integr Manuf 34:79–94
Wallace GK (1991) The JPEG still picture compression standard. Commun ACM 34(4):30–44
Wang D, Li J, Memik G (2010) User identification based on finger-vein patterns for consumer electronics devices. IEEE Trans Consum Electron 56(2):799–804
Wiggins RH et al (2001) Image File Formats: Past, Present, and Future 1. Radiographics 21(3):789–798
Wold S, Esbensen K, Geladi P (1987) Principal component analysis. Chemometr Intell Lab Syst 2(1):37–52
Wright J, Ganesh A, Rao S, Peng Y, Ma Y (2009) Robust principal component analysis: Exact recovery of corrupted low-rank matrices via convex optimization. In: In Advances in neural information processing systems, pp 2080–2088
Xu L, Yuille AL (1995) Robust principal component analysis by self-organizing rules based on statistical physics approach. IEEE Transactions on Neural Networks 6 (1):131–143
Yamada K, Kojima M, Shimizu T, Sato F, Mizuno T (2002) A new RISC processor architecture for MPEG-2 decoding. IEEE Trans Consum Electron 48 (1):143–150
Yan T, Lau RW, Xu Y, Huang L (2013) Depth mapping for stereoscopic videos. Int J Comput Vis 102(1–3):293–307
Yang J et al (2004) Two-dimensional PCA: a new approach to appearance-based face representation and recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 26(1):131–137
Yang L, Wang J, Ando T, Kubota A, Yamashita H, Sakuma I, Chiba T, Kobayashi E (2015) Vision-based endoscope tracking for 3D ultrasound image-guided surgical navigation. Comput Med Imaging Graph 40:205–216
Yun WH, Kim D, Yoon HS (2007) Fast group verification system for intelligent robot service. IEEE Trans Consum Electron 53(4):1731–1735
Yuan Y, Emmanuel S, Fang Y, Lin W (2014) Visual object tracking based on backward model validation. IEEE Transaction on Circuit and System for Video Technology 24(11):1898–1901
Zdenek K, Matas J, Mikolajczyk K (2010) Pn learning: Bootstrapping binary classifiers by structural constraints. In: Proceedings of 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 49–56
Zhang X, Huang T, Tian Y, Gao W (2014) Background-modeling-based adaptive prediction for surveillance video coding. IEEE Trans Image Process 23 (2):769–784
Zhang C, Platt JC, Viola PA (2005) Multiple instance boosting for object detection. In: Advances in neural information processing systems, pp 1417–1424
Zhang D, Zhou Z-H, Chen S (2006) Diagonal principal component analysis for face recognition. Pattern Recogn 39(1):140–142
Zhang H, Zhang L, Shen H (2012) A super-resolution reconstruction algorithm for hyperspectral images. Signal Process 92(9):2082–2096
Zhao W, Krishnaswamy A, Chellappa R, Swets DL, Weng J (1998) Discriminant analysis of principal components for face recognition. In Face Recognition. Springer, Berlin, pp 73–85
Zhong W, Lu H, Yang MH (2012) Robust object tracking via sparsity-based collaborative model. IEEE Conf Comput Vis Pattern Recognit (CVPR):1838–1845
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Liu, C.Z., Kavakli, M. Extensions of principle component analysis with applications on vision based computing. Multimed Tools Appl 75, 10113–10151 (2016). https://doi.org/10.1007/s11042-015-3025-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-015-3025-3