Abstract
Ordinal regression predicts categories on an ordinal scale and has wide applications in domains where human evaluation plays a major role. Several algorithms have been proposed to tackle ordinal regression from a machine learning perspective. However, most of them seek only a single direction along which the projected samples are well ranked. A common shortcoming of these algorithms is therefore that only one dimension of the sample space is used, discarding useful information in its orthogonal complement. In this paper, we propose a novel two-stage ordinal regression strategy: first, orthogonal projection vectors are extracted, and then these vectors are combined to learn an ordinal regression rule. Compared with previous ordinal regression methods, the proposed strategy extracts multiple features from the original data space, so performance can improve because more of the data's information is used. Experimental results on both benchmark and real-world datasets demonstrate the effectiveness of the proposed method.
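To make the two-stage strategy concrete, below is a minimal NumPy sketch, not the authors' algorithm: the abstract does not specify the projection criterion or the combination rule, so the hypothetical `ordered_mean_direction` (a least-squares ranking direction), the Gram–Schmidt-style deflation used to enforce orthogonality, and the naive averaged projection with midpoint thresholds are all illustrative assumptions.

```python
import numpy as np

def ordered_mean_direction(X, y):
    """Hypothetical ranking criterion: regress the rank labels onto the
    centered features (a crude stand-in for the paper's projection objective)."""
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    w, *_ = np.linalg.lstsq(Xc, yc, rcond=None)  # min-norm least-squares direction
    return w / np.linalg.norm(w)

def extract_orthogonal_projections(X, y, k):
    """Stage 1: extract k mutually orthogonal projection vectors by deflating
    the data against each direction already found."""
    directions = []
    Xd = X.copy()
    for _ in range(k):
        w = ordered_mean_direction(Xd, y)
        directions.append(w)
        Xd = Xd - np.outer(Xd @ w, w)  # remove the component along w
    return np.column_stack(directions)  # shape (d, k)

def fit_thresholds(scores, y):
    """Stage 2 (simplified): place a threshold between consecutive ranks at
    the midpoint of adjacent class score means."""
    ranks = np.unique(y)
    means = np.array([scores[y == r].mean() for r in ranks])
    return (means[:-1] + means[1:]) / 2.0

# Toy usage: three ordered classes in a 5-dimensional space.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(loc=r, size=(30, 5)) for r in range(3)])
y = np.repeat(np.arange(3), 30)

W = extract_orthogonal_projections(X, y, k=2)
scores = (X @ W).mean(axis=1)          # naive combination: average the projections
thresholds = fit_thresholds(scores, y)
pred = np.searchsorted(thresholds, scores)
print("training accuracy:", (pred == y).mean())
```

Because `np.linalg.lstsq` returns the minimum-norm solution, each new direction lies in the row space of the deflated data and is therefore orthogonal to all previously extracted directions, which is the property the two-stage strategy relies on.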










Acknowledgments
The authors sincerely thank the anonymous reviewers for their constructive comments. This work was supported by the Natural Science Foundation of China (Nos. 41101516 and 61203373), the Guangdong Natural Science Foundation (No. S2011010006120), and the Shenzhen Science and Technology R&D Funding Basic Research Program (No. JC201105190821A).
Cite this article
Sun, BY., Wang, HL., Li, WB. et al. Constructing and Combining Orthogonal Projection Vectors for Ordinal Regression. Neural Process Lett 41, 139–155 (2015). https://doi.org/10.1007/s11063-014-9340-2