skip to main content
10.1145/3219819.3219951acmotherconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Calibrated Multi-Task Learning

Published: 19 July 2018 Publication History

Abstract

This paper proposes a novel algorithm, named Non-Convex Calibrated Multi-Task Learning (NC-CMTL), for learning multiple related regression tasks jointly. Instead of utilizing the nuclear norm, NC-CMTL adopts a non-convex low rank regularizer to explore the shared information among different tasks. In addition, considering that the regularization parameter for each regression task desponds on its noise level, we replace the least squares loss function by square-root loss function. Computationally, as proposed model has a nonsmooth loss function and a non-convex regularization term, we construct an efcient re-weighted method to optimize it. Theoretically, we frst present the convergence analysis of constructed method, and then prove that the derived solution is a stationary point of original problem. Particularly, the regularizer and optimization method used in this paper are also suitable for other rank minimization problems. Numerical experiments on both synthetic and real data illustrate the advantages of NC-CMTL over several state-of-the-art methods.

Supplementary Material

MP4 File (hu_calibrated_learning.mp4)

References

[1]
Qi An, Chunping Wang, Ivo Shterev, Eric Wang, Lawrence Carin, and David B Dunson . 2008. Hierarchical kernel stick-breaking process for multi-task image analysis Proceedings of the 25th international conference on Machine learning. ACM, 17--24.
[2]
Rie Kubota Ando and Tong Zhang . 2005. A framework for learning predictive structures from multiple tasks and unlabeled data. Journal of Machine Learning Research Vol. 6, Nov (2005), 1817--1853.
[3]
Andreas Argyriou, Theodoros Evgeniou, and Massimiliano Pontil . 2008. Convex multi-task feature learning. Machine Learning Vol. 73, 3 (2008), 243--272.
[4]
Jing Bai, Ke Zhou, Guirong Xue, Hongyuan Zha, Gordon Sun, Belle Tseng, Zhaohui Zheng, and Yi Chang . 2009. Multi-task learning for learning to rank in web search Proceedings of the 18th ACM conference on Information and knowledge management. ACM, 1549--1552.
[5]
Emmanuel J Candès and Terence Tao . 2010. The power of convex relaxation: Near-optimal matrix completion. IEEE Transactions on Information Theory Vol. 56, 5 (2010), 2053--2080.
[6]
Emmanuel J Candes, Michael B Wakin, and Stephen P Boyd . 2008. Enhancing sparsity by reweighted 1 minimization. Journal of Fourier analysis and applications Vol. 14, 5 (2008), 877--905.
[7]
Olivier Chapelle, Pannagadatta Shivaswamy, Srinivas Vadrevu, Kilian Weinberger, Ya Zhang, and Belle Tseng . 2010. Multi-task learning for boosting with application to web search ranking Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 1189--1198.
[8]
Jianhui Chen, Ji Liu, and Jieping Ye . 2012. Learning incoherent sparse and low-rank patterns from multiple tasks. ACM Transactions on Knowledge Discovery from Data (TKDD) Vol. 5, 4 (2012), 22.
[9]
Jianhui Chen, Lei Tang, Jun Liu, and Jieping Ye . 2009. A convex formulation for learning shared structures from multiple tasks Proceedings of the 26th Annual International Conference on Machine Learning. ACM, 137--144.
[10]
Jianhui Chen, Jiayu Zhou, and Jieping Ye . 2011. Integrating low-rank and group-sparse structures for robust multi-task learning Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 42--50.
[11]
Ingrid Daubechies, Ronald DeVore, Massimo Fornasier, and C Sinan Güntürk . 2010. Iteratively reweighted least squares minimization for sparse recovery. Communications on Pure and Applied Mathematics Vol. 63, 1 (2010), 1--38.
[12]
Pinghua Gong, Jiayu Zhou, Wei Fan, and Jieping Ye . 2014. Efficient multi-task feature learning with calibration Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 761--770.
[13]
Lei Han and Yu Zhang . 2016. Multi-Stage Multi-Task Learning with Reduced Rank. AAAI. 1638--1644.
[14]
Ali Jalali, Sujay Sanghavi, Chao Ruan, and Pradeep K Ravikumar . 2010. A dirty model for multi-task learning. In Advances in Neural Information Processing Systems. 964--972.
[15]
Pratik Jawanpuria and J. Saketha Nath . 2012. A Convex Feature Learning Formulation for Latent Task Structure Discovery Proceedings of the 29th International Conference on Machine Learning, ICML 2012, Edinburgh, Scotland, UK, June 26 - July 1, 2012.
[16]
Abhishek Kumar and Hal Daume III . 2012. Learning task grouping and overlap in multi-task learning. arXiv preprint arXiv:1206.6417 (2012).
[17]
Giwoong Lee, Eunho Yang, and Sung Hwang . 2016. Asymmetric multi-task learning based on task relatedness and loss International Conference on Machine Learning. 230--238.
[18]
Han Liu, Lie Wang, and Tuo Zhao . 2014. Multivariate regression with calibration. In Advances in neural information processing systems. 127--135.
[19]
Albert W Marshall, Ingram Olkin, and Barry C Arnold . 1979. Inequalities: theory of majorization and its applications. Vol. Vol. 143. Springer.
[20]
Karthik Mohan and Maryam Fazel . 2012. Iterative reweighted algorithms for matrix rank minimization. Journal of Machine Learning Research Vol. 13, Nov (2012), 3441--3473.
[21]
Feiping Nie, Heng Huang, Xiao Cai, and Chris H Ding . 2010. Efficient and robust feature selection via joint $ell_2,1$-norms minimization Advances in neural information processing systems. 1813--1821.
[22]
Feiping Nie, Jianjun Yuan, and Heng Huang . 2014. Optimal mean robust principal component analysis. In International Conference on Machine Learning. 1062--1070.
[23]
Chong Peng, Zhao Kang, Huiqing Li, and Qiang Cheng. 2015. Subspace clustering using log-determinant rank approximation. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 925-934.
[24]
Kaare Brandt Petersen, Michael Syskind Pedersen, et almbox. . 2008. The matrix cookbook. Technical University of Denmark Vol. 7, 15 (2008), 510.
[25]
Ting Kei Pong, Paul Tseng, Shuiwang Ji, and Jieping Ye . 2010. Trace norm regularization: Reformulations, algorithms, and multi-task learning. SIAM Journal on Optimization Vol. 20, 6 (2010), 3465--3489.
[26]
Robert Tibshirani . 1996. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological) (1996), 267--288.
[27]
Hua Wang, Feiping Nie, Heng Huang, Shannon Risacher, Chris Ding, Andrew J Saykin, Li Shen, et almbox. . 2011. Sparse multi-task regression and feature selection to identify brain imaging predictors for memory performance. In Computer Vision (ICCV), 2011 IEEE International Conference on. IEEE, 557--562.
[28]
Debing Zhang, Yao Hu, Jieping Ye, Xuelong Li, and Xiaofei He . 2012. Matrix completion by truncated nuclear norm regularization Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on. IEEE, 2192--2199.
[29]
Jian Zhang, Zoubin Ghahramani, and Yiming Yang . 2006. Learning multiple related tasks using latent independent component analysis Advances in neural information processing systems. 1585--1592.
[30]
Jian Zhang, Zoubin Ghahramani, and Yiming Yang . 2008. Flexible latent variable models for multi-task learning. Machine Learning Vol. 73, 3 (2008), 221--242.
[31]
Kai Zhang, Joe W Gray, and Bahram Parvin . 2010. Sparse multitask regression for identifying common mechanism of response to therapeutic targets. Bioinformatics Vol. 26, 12 (2010), i97--i105.
[32]
Yu Zhang and Qiang Yang . 2017. A survey on multi-task learning. arXiv preprint arXiv:1707.08114 (2017).
[33]
Yu Zhang and Dit-Yan Yeung . 2010. Multi-task warped gaussian process for personalized age estimation Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. IEEE, 2622--2629.
[34]
Jiayu Zhou, Jianhui Chen, and Jieping Ye . 2011 a. Malsar: Multi-task learning via structural regularization. Arizona State University Vol. 21 (2011).
[35]
Jiayu Zhou, Lei Yuan, Jun Liu, and Jieping Ye . 2011 b. A multi-task learning formulation for predicting disease progression Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 814--822.

Cited By

View all
  • (2024)Pivotal-Aware Principal Component AnalysisIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.325260235:9(12201-12210)Online publication date: Sep-2024
  • (2024)Integrating Visualised Automatic Temporal Relation Graph into Multi-Task Learning for Alzheimer's Disease Progression PredictionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.338571236:10(5206-5220)Online publication date: Oct-2024
  • (2024)Tensor Completion for Alzheimer's Disease Prediction From Diffusion Tensor ImagingIEEE Transactions on Biomedical Engineering10.1109/TBME.2024.336513171:7(2211-2223)Online publication date: Jul-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
July 2018
2925 pages
ISBN:9781450355520
DOI:10.1145/3219819
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 July 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. clustering
  2. dimensionality reduction
  3. multi-task learning

Qualifiers

  • Research-article

Funding Sources

  • National Natural Science Foundation of China

Conference

KDD '18
Sponsor:

Acceptance Rates

KDD '18 Paper Acceptance Rate 107 of 983 submissions, 11%;
Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)55
  • Downloads (Last 6 weeks)14
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Pivotal-Aware Principal Component AnalysisIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.325260235:9(12201-12210)Online publication date: Sep-2024
  • (2024)Integrating Visualised Automatic Temporal Relation Graph into Multi-Task Learning for Alzheimer's Disease Progression PredictionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.338571236:10(5206-5220)Online publication date: Oct-2024
  • (2024)Tensor Completion for Alzheimer's Disease Prediction From Diffusion Tensor ImagingIEEE Transactions on Biomedical Engineering10.1109/TBME.2024.336513171:7(2211-2223)Online publication date: Jul-2024
  • (2024)Scalable and parameter-free fusion graph learning for multi-view clusteringNeurocomputing10.1016/j.neucom.2024.128037597(128037)Online publication date: Sep-2024
  • (2024)Informative relationship multi-task learning: Exploring pairwise contribution across tasks’ sharing knowledgeKnowledge-Based Systems10.1016/j.knosys.2024.112187301(112187)Online publication date: Oct-2024
  • (2024)Multi-task label noise learning for classificationEngineering Applications of Artificial Intelligence10.1016/j.engappai.2023.107714130(107714)Online publication date: Apr-2024
  • (2023)Boosting Multitask Learning on Graphs through Higher-Order Task AffinitiesProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599265(1213-1222)Online publication date: 6-Aug-2023
  • (2023)Multitask Learning for Classification Problem via New Tight Relaxation of Rank MinimizationIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2021.313291834:9(6055-6068)Online publication date: Sep-2023
  • (2023)Explainable Tensor Multi-Task Ensemble Learning Based on Brain Structure Variation for Alzheimer’s Disease Dynamic PredictionIEEE Journal of Translational Engineering in Health and Medicine10.1109/JTEHM.2022.321977511(1-12)Online publication date: 2023
  • (2023)Calibrated multi-task subspace learning via binary group structure constraintInformation Sciences10.1016/j.ins.2023.02.036Online publication date: Feb-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media