research-article

A Sparse Projection and Low-Rank Recovery Framework for Handwriting Representation and Salient Stroke Feature Extraction

Authors:

Ming-Bo ZhaoAuthors Info & Claims

ACM Transactions on Intelligent Systems and Technology (TIST), Volume 6, Issue 1

Article No.: 9, Pages 1 - 26

https://doi.org/10.1145/2601408

Published: 11 March 2015 Publication History

Abstract

In this article, we consider the problem of simultaneous low-rank recovery and sparse projection. More specifically, a new Robust Principal Component Analysis (RPCA)-based framework called Sparse Projection and Low-Rank Recovery (SPLRR) is proposed for handwriting representation and salient stroke feature extraction. In addition to achieving a low-rank component encoding principal features and identify errors or missing values from a given data matrix as RPCA, SPLRR also learns a similarity-preserving sparse projection for extracting salient stroke features and embedding new inputs for classification. These properties make SPLRR applicable for handwriting recognition and stroke correction and enable online computation. A cosine-similarity-style regularization term is incorporated into the SPLRR formulation for encoding the similarities of local handwriting features. The sparse projection and low-rank recovery are calculated from a convex minimization problem that can be efficiently solved in polynomial time. Besides, the supervised extension of SPLRR is also elaborated. The effectiveness of our SPLRR is examined by extensive handwritten digital repairing, stroke correction, and recognition based on benchmark problems. Compared with other related techniques, SPLRR delivers strong generalization capability and state-of-the-art performance for handwriting representation and recognition.

References

[1]

M. Aharon, M. Elad, and A. Bruckstein. 2006. K-svd: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE Transactions on Signal Processing 54, 11, 4311--4322.

Digital Library

[2]

B. K. Bao, G. C. Liu, C. S. Xu, and S. C. Yan. 2012. Inductive robust principal component analysis. IEEE Transactions on Image Processing (TIP) 21, 8, 3794--3800.

Digital Library

[3]

A. Bar-Hillel, T. Hertz, N. Shental, and D. Weinshall. 2005. Learning a mahalanobis metric from equivalence constraints. Journal of Machine Learning Research 6, 937--965.

Digital Library

[4]

D. P. Bertsekas. 2004. Nonlinear Programming. Athena Scientific.

[5]

A. P. Bradley. 1997. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition 30, 7, 1145--1159.

Digital Library

[6]

M. Bulacu and L. Schomaker. 2007. Text-independent writer identification and verification using textural and allographic features. IEEE Transactions on Pattern Analysis and Machine Intelligence 29, 4, 707--717.

Digital Library

[7]

J. F. Cai, E. J. Candés, and Z. Shen. 2010. A singular value thresholding algorithm for matrix completion. SIAM Journal on Optimization 20, 4, 1956--1982.

[8]

E. Candes, X. D. Li, Y. Ma, and J. Wright. 2011. Robust principal component analysis&quest; Journal of the ACM 58, 3, 1--37.

Digital Library

[9]

E. Candes and P. Yaniv. 2009. Matrix completion with noise. Proceedings of the IEEE 98, 6, 925--936.

[10]

Y. L. Cun, L. Bottou, Y. Bengio, and P. Haffner. 1998. Gradient-based learning applied to document recognition. Proceedings of the IEEE 86, 11, 2278--2324.

[11]

E. Elhamifar and R. Vidal. 2009. Sparse subspace clustering. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2790--2797.

[12]

T. Fawcett. 2006. An introduction to ROC analysis. Pattern Recognition Letters 27, 861--874.

Digital Library

[13]

S. Gao, I. Tsang, and L. Chia. 2013. Laplacian sparse coding, hypergraph Laplacian sparse coding, and applications. IEEE Transactions on Pattern Analysis and Machine Intelligence 35, 1, 92--104.

Digital Library

[14]

X. F. He, D. Cai, S. C. Yan, and H. J. Zhang. 2005. Neighborhood preserving embedding. In Proceedings of the International Conference on Computer Vision (ICCV’05).

Digital Library

[15]

X. F. He, S. C. Yan, Y. X. Hu, P. Niyogi, and H. J. Zhang. 2005. Face recognition using Laplacian faces. IEEE Transactions on Pattern Analysis and Machine Intelligence 27, 3, 328--340.

Digital Library

[16]

C. W. Hsu, C. C. Chang, and C. J. Lin. 2010. A Practical Guide to Support Vector Classification. Retrieved from http://www.csie.ntu.edu.tw/&sim;cjlin/papers/guide/guide.pdf.

[17]

J. Hull. 1994. A database for handwritten text recognition research. IEEE Transactions on Pattern Analysis and Machine Intelligence 16, 5, 550--554.

Digital Library

[18]

I. Jolliffe. 1986. Principal Component Analysis. Springer-Verlag.

[19]

E. Kokiopoulou and Y. Saad. 2007. Orthogonal neighborhood preserving projections: A projection-based dimensionality reduction technique. IEEE Transactions on Pattern Analysis and Machine Intelligence 29, 12, 2143--2156.

Digital Library

[20]

Z. Lin, M. Chen, L. Wu, and Y. Ma. 2009. The Augmented Lagrange Multiplier Method for Exact Recovery of Corrupted Low-Rank Matrices. University of Illinois at Urbana-Champaign (UIUC) Technical Report, UILU-ENG-09-2215.

[21]

Z. C. Lin, A. Ganesh, J. Wright, L. Q. Wu, M. M. Chen, and Y Ma. 2009. Fast Convex Optimization Algorithms for Exact Recovery of a Corrupted Low-Rank Matrix. UIUC Technical Report UILU-ENG-09- 2214.

[22]

G. C. Liu, Z. C. Lin, S. C. Yan, J. Sun, Y. Yu, and Y. Ma. 2013. Robust recovery of subspace structures by low-rank representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 35, 1, 171--184.

Digital Library

[23]

G. C. Liu and S. C. Yan. 2011. Latent low-rank representation for subspace segmentation and feature extraction. In Proceedings of the International Conference on Computer Vision (ICCV).

Digital Library

[24]

C. L. Liu, F. Yin, D. H. Wang, and Q. F. Wang. 2013. Online and offline handwritten Chinese character recognition: Benchmarking on new databases. Pattern Recognition 46, 1, 155--162.

Digital Library

[25]

K. Min, Z. Zhang, J. Wright, and Y. Ma. 2010. Decomposing background topics from keywords by principal component pursuit. In Proceedings of the 19th International Conference on Information Knowledge and Management. 269--278.

Digital Library

[26]

U. Y. Nahm, M. Bilenko, and R. J. Mooney. 2002. Two approaches to handling noisy variation in text mining. In Proceedings of the ICML-2002 Workshop on Text Learning.

[27]

Y. G. Peng, A. Ganesh, J. Wright, W. L. Xu, and Y. Ma. 2012. RASL: Robust Alignment by Sparse and Low-rank decomposition for linearly correlated images. IEEE Transactions on Pattern Analysis and Machine Intelligence 34, 11, 2233--2246.

Digital Library

[28]

F. Provost and T. Fawcett. 2001. Robust classification for imprecise environments. Machine Learning 42, 203--231.

Digital Library

[29]

L. S. Qiao, S. C. Chen, and X. Y. Tan. 2010. Sparsity preserving projections with applications to face recognition. Pattern Recognition 43, 1, 331--341.

Digital Library

[30]

S. N. Srihari, S. H. Cha, H. Arona, and S. Lee. 2001. Establishing handwriting individuality using patten recognition techniques. In Proceedings of the International Conference on Document Analysis and Recognition (ICDAR’01). 1195--1204.

Digital Library

[31]

W. T. Tan, G. Cheung, and Y. Ma. 2011. Face recovery in conference video streaming using robust principal component analysis. In Proceedings of IEEE International Conference on Image Processing (ICIP’11).

[32]

J. Wright, A. Ganesh, S. Rao, Y. Peng, and Y. Ma. 2009. Robust principal component analysis: Exact recovery of corrupted low-rank matrices via convex optimization. In Proceedings of Neural Information Processing Systems. 1--9.

[33]

J. Wright, A. Yang, S. Sastry, and Y. Ma. 2009. Robust face recognition via sparse representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 31, 2, 210--227.

Digital Library

[34]

J. Yang, D. Zhang, J. Y. Yang, and B. Niu. 2007. Globally maximizing, locally minimizing: Unsupervised discriminant projection with applications to face and palm biometrics. IEEE Transactions on Pattern Analysis and Machine Intelligence 29, 4, 650--664.

Digital Library

[35]

Z. R. Yang and E. Oja. 2010. Linear and nonlinear projective nonnegative matrix factorization. IEEE Transactions on Neural Networks 21, 5, 734--749.

Digital Library

[36]

F. Yin and C. L. Liu. 2009. Handwritten Chinese text line segmentation by clustering with distance metric learning. Pattern Recognition 42, 3146--3157.

Digital Library

[37]

S. T. Yuan and J. Sun. 2005. Ontology-based structured cosine similarity in document summarization: With applications to mobile audio-based knowledge management. IEEE Transactions on Systems, Man, and Cybernetics, Part B 35, 5, 1028--1040.

Digital Library

[38]

X. Yuan and J. Yang. 2009. Sparse and low-rank matrix decomposition via alternating direction methods. Pacific Journal of Optimization 9, 1 (2013), 167--180.

[39]

Y. Zhang. 2010. Recent advances in alternating direction methods: Practice and theory. Tutorial.

[40]

Z. Zhang, T. W. S, Chow, and M. B. Zhao. 2005. M-Isomap: Orthogonal constrained marginal isomap for nonlinear dimensionality reduction. IEEE Transactions on Systems, Man and Cybernetics Part B: Cybernetics 43, 1, 180--192.

[41]

Z. Zhang, X. Liang, A. Ganesh, and Y. Ma. 2010. TILT: Transform invariant low-rank textures. In Proceedings of the Asian Conference on Computational Vision. 314--328.

Digital Library

[42]

Z. Zhang, C. L. Liu, and M. B. Zhao. 2013. Handwriting representation and recognition through a sparse projection and low-rank recovery framework. In Proceedings of the IEEE International Joint Conference on Neural Networks (IJCNN’13).

[43]

Z. Zhang, S. C. Yan, and M. B. Zhao. 2013. Pairwise sparsity preserving embedding for unsupervised subspace learning and classification. IEEE Transactions on Image Processing 22, 12, 4640--4651.

Digital Library

[44]

Z. Zhang, S. C. Yan, and M. B. Zhao. 2013. Robust image representation and decomposition by Laplacian regularized latent low-rank representation. In Proceedings of the IEEE International Joint Conference on Neural Networks (IJCNN’13).

[45]

Z. Zhang, M. B. Zhao, and T. W. S. Chow. July 2012. Constrained large margin local projection algorithms and extensions for multimodal dimensionality reduction. Pattern Recognition 46, 12, 4466--4493.

Digital Library

[46]

Z. Zhang, M. B. Zhao, and T. W. S. Chow. October 2013. Binary- and multi-class group sparse canonical correlation analysis for feature extraction and classification. IEEE Transactions on Knowledge and Data Engineering 25, 10, 2192--2205.

Digital Library

[47]

M. Zheng, J. J. Bu, C. Chen, C. Wang, L. J. Zhang, G. Qiu, and D. Cai. 2011. Graph regularized sparse coding for image representation. IEEE Transactions on Image Processing 20, 5, 1327--1336.

Digital Library

[48]

T. Y. Zhou and D. C. Tao. 2011. GoDec: Randomized lowrank & sparse matrix decomposition in noisy case. In Proceedings of the International Conference on Machine Learning (ICML’’11). 33--40.

[49]

G. Zhu, S. Yan, and Y. Ma. 2010. Image tag refinement toward low-rank, content-tag prior and error sparsity. In Proceedings of the International Conference on Multimedia, 461--470.

Digital Library

[50]

C. L. Zitnick. 2013. Handwriting beautification using token means. ACM Transactions on Graphics (TOG) 32, 4, Article 53.

Digital Library

Cited By

Zhang ZZhang YXu MZhang LYang YYan S(2021)A Survey on Concept Factorization: From Shallow to Deep Representation LearningInformation Processing & Management10.1016/j.ipm.2021.10253458:3(102534)Online publication date: May-2021
https://doi.org/10.1016/j.ipm.2021.102534
Zhang HZhang ZZhao MYe QZhang MWang M(2020)Robust Triple-Matrix-Recovery-Based Auto-Weighted Label Propagation for ClassificationIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2019.295601531:11(4538-4552)Online publication date: Nov-2020
https://doi.org/10.1109/TNNLS.2019.2956015
Zhang ZZhang YLiu GTang JYan SWang M(2020)Joint Label Prediction Based Semi-Supervised Adaptive Concept Factorization for Robust Data RepresentationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2019.289395632:5(952-970)Online publication date: 1-May-2020
https://doi.org/10.1109/TKDE.2019.2893956
Show More Cited By

Index Terms

A Sparse Projection and Low-Rank Recovery Framework for Handwriting Representation and Salient Stroke Feature Extraction
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision representations
        Image representations
  2. Machine learning

Recommendations

Sparse Representation of a Polytope and Recovery of Sparse Signals and Low-Rank Matrices

This paper considers compressed sensing and affine rank minimization in both noiseless and noisy cases and establishes sharp restricted isometry conditions for sparse signal and low-rank matrix recovery. The analysis relies on a key technical tool, ...
Low-rank representation based discriminative projection for robust feature extraction

The low-rank representation (LRR) was presented recently and showed effective and robust for subspace segmentation. This paper presents a LRR-based discriminative projection method (LRR-DP) for robust feature extraction, by virtue of the underlying low-...
On identity testing of tensors, low-rank recovery and compressed sensing
STOC '12: Proceedings of the forty-fourth annual ACM symposium on Theory of computing

We study the problem of obtaining efficient, deterministic, black-box polynomial identity testing algorithms for depth-3 set-multilinear circuits (over arbitrary fields). This class of circuits has an efficient, deterministic, white-box polynomial ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Intelligent Systems and Technology

ACM Transactions on Intelligent Systems and Technology Volume 6, Issue 1

April 2015

255 pages

ISSN:2157-6904

EISSN:2157-6912

DOI:10.1145/2745393

Issue’s Table of Contents

Copyright © 2015 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 March 2015

Accepted: 01 February 2014

Revised: 01 February 2014

Received: 01 May 2013

Published in TIST Volume 6, Issue 1

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed

Funding Sources

National Natural Science Foundation of China
Major Program of National Natural Science Foundation of China
Natural Science Foundation of Jiangsu Province of China

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

18
Total Citations
View Citations
292
Total Downloads

Downloads (Last 12 months)9
Downloads (Last 6 weeks)1

Reflects downloads up to 14 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Zhang ZZhang YXu MZhang LYang YYan S(2021)A Survey on Concept Factorization: From Shallow to Deep Representation LearningInformation Processing & Management10.1016/j.ipm.2021.10253458:3(102534)Online publication date: May-2021
https://doi.org/10.1016/j.ipm.2021.102534
Zhang HZhang ZZhao MYe QZhang MWang M(2020)Robust Triple-Matrix-Recovery-Based Auto-Weighted Label Propagation for ClassificationIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2019.295601531:11(4538-4552)Online publication date: Nov-2020
https://doi.org/10.1109/TNNLS.2019.2956015
Zhang ZZhang YLiu GTang JYan SWang M(2020)Joint Label Prediction Based Semi-Supervised Adaptive Concept Factorization for Robust Data RepresentationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2019.289395632:5(952-970)Online publication date: 1-May-2020
https://doi.org/10.1109/TKDE.2019.2893956
Ren JZhang ZLi SWang YLiu GYan SWang M(2020)Learning Hybrid Representation by Robust Dictionary Learning in Factorized Compressed SpaceIEEE Transactions on Image Processing10.1109/TIP.2020.296528929(3941-3956)Online publication date: 2020
https://doi.org/10.1109/TIP.2020.2965289
Zhang ZZhang YLi SLiu GZeng DYan SWang M(2019)Flexible Auto-weighted Local-coordinate Concept Factorization: A Robust Framework for Unsupervised ClusteringIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2019.2940576(1-1)Online publication date: 2019
https://doi.org/10.1109/TKDE.2019.2940576
Zhang ZJia LZhao MLiu GWang MYan S(2019)Kernel-Induced Label Propagation by Mapping for Semi-Supervised ClassificationIEEE Transactions on Big Data10.1109/TBDATA.2018.27979775:2(148-165)Online publication date: 1-Jun-2019
https://doi.org/10.1109/TBDATA.2018.2797977
Zhang ZZhang YLi SLiu GWang MYan S(2019)Robust Unsupervised Flexible Auto-weighted Local-coordinate Concept Factorization for Image ClusteringICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP.2019.8683263(2092-2096)Online publication date: May-2019
https://doi.org/10.1109/ICASSP.2019.8683263
Isaza CAnaya KFuentes-Silva CPaz JRizzo AGarcia-Moreno A(2019)Dynamic set point model for driver alert state using digital image processingMultimedia Tools and Applications10.1007/s11042-019-7218-z78:14(19543-19563)Online publication date: 2-Aug-2019
https://dl.acm.org/doi/10.1007/s11042-019-7218-z
Ren JZhang ZLi SLiu GWang MYan S(2018)Robust Projective Low-Rank and Sparse Representation by Robust Dictionary Learning2018 24th International Conference on Pattern Recognition (ICPR)10.1109/ICPR.2018.8546056(1851-1856)Online publication date: Aug-2018
https://doi.org/10.1109/ICPR.2018.8546056
Yu SYiquan W(2018)Subspace clustering based on latent low rank representation with Frobenius norm minimizationNeurocomputing10.1016/j.neucom.2017.11.021275:C(2479-2489)Online publication date: 31-Jan-2018
https://dl.acm.org/doi/10.1016/j.neucom.2017.11.021
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Issue’s Table of Contents