research-article

Calibrated Multi-Task Learning

Authors:

Xuelong LiAuthors Info & Claims

KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pages 2012 - 2021

https://doi.org/10.1145/3219819.3219951

Published: 19 July 2018 Publication History

Abstract

This paper proposes a novel algorithm, named Non-Convex Calibrated Multi-Task Learning (NC-CMTL), for learning multiple related regression tasks jointly. Instead of utilizing the nuclear norm, NC-CMTL adopts a non-convex low rank regularizer to explore the shared information among different tasks. In addition, considering that the regularization parameter for each regression task desponds on its noise level, we replace the least squares loss function by square-root loss function. Computationally, as proposed model has a nonsmooth loss function and a non-convex regularization term, we construct an efcient re-weighted method to optimize it. Theoretically, we frst present the convergence analysis of constructed method, and then prove that the derived solution is a stationary point of original problem. Particularly, the regularizer and optimization method used in this paper are also suitable for other rank minimization problems. Numerical experiments on both synthetic and real data illustrate the advantages of NC-CMTL over several state-of-the-art methods.

Supplementary Material

MP4 File (hu_calibrated_learning.mp4)

Download
363.12 MB

References

[1]

Qi An, Chunping Wang, Ivo Shterev, Eric Wang, Lawrence Carin, and David B Dunson . 2008. Hierarchical kernel stick-breaking process for multi-task image analysis Proceedings of the 25th international conference on Machine learning. ACM, 17--24.

Digital Library

[2]

Rie Kubota Ando and Tong Zhang . 2005. A framework for learning predictive structures from multiple tasks and unlabeled data. Journal of Machine Learning Research Vol. 6, Nov (2005), 1817--1853.

Digital Library

[3]

Andreas Argyriou, Theodoros Evgeniou, and Massimiliano Pontil . 2008. Convex multi-task feature learning. Machine Learning Vol. 73, 3 (2008), 243--272.

Digital Library

[4]

Jing Bai, Ke Zhou, Guirong Xue, Hongyuan Zha, Gordon Sun, Belle Tseng, Zhaohui Zheng, and Yi Chang . 2009. Multi-task learning for learning to rank in web search Proceedings of the 18th ACM conference on Information and knowledge management. ACM, 1549--1552.

Digital Library

[5]

Emmanuel J Candès and Terence Tao . 2010. The power of convex relaxation: Near-optimal matrix completion. IEEE Transactions on Information Theory Vol. 56, 5 (2010), 2053--2080.

Digital Library

[6]

Emmanuel J Candes, Michael B Wakin, and Stephen P Boyd . 2008. Enhancing sparsity by reweighted 1 minimization. Journal of Fourier analysis and applications Vol. 14, 5 (2008), 877--905.

[7]

Olivier Chapelle, Pannagadatta Shivaswamy, Srinivas Vadrevu, Kilian Weinberger, Ya Zhang, and Belle Tseng . 2010. Multi-task learning for boosting with application to web search ranking Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 1189--1198.

Digital Library

[8]

Jianhui Chen, Ji Liu, and Jieping Ye . 2012. Learning incoherent sparse and low-rank patterns from multiple tasks. ACM Transactions on Knowledge Discovery from Data (TKDD) Vol. 5, 4 (2012), 22.

Digital Library

[9]

Jianhui Chen, Lei Tang, Jun Liu, and Jieping Ye . 2009. A convex formulation for learning shared structures from multiple tasks Proceedings of the 26th Annual International Conference on Machine Learning. ACM, 137--144.

Digital Library

[10]

Jianhui Chen, Jiayu Zhou, and Jieping Ye . 2011. Integrating low-rank and group-sparse structures for robust multi-task learning Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 42--50.

Digital Library

[11]

Ingrid Daubechies, Ronald DeVore, Massimo Fornasier, and C Sinan Güntürk . 2010. Iteratively reweighted least squares minimization for sparse recovery. Communications on Pure and Applied Mathematics Vol. 63, 1 (2010), 1--38.

[12]

Pinghua Gong, Jiayu Zhou, Wei Fan, and Jieping Ye . 2014. Efficient multi-task feature learning with calibration Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 761--770.

Digital Library

[13]

Lei Han and Yu Zhang . 2016. Multi-Stage Multi-Task Learning with Reduced Rank. AAAI. 1638--1644.

Digital Library

[14]

Ali Jalali, Sujay Sanghavi, Chao Ruan, and Pradeep K Ravikumar . 2010. A dirty model for multi-task learning. In Advances in Neural Information Processing Systems. 964--972.

Digital Library

[15]

Pratik Jawanpuria and J. Saketha Nath . 2012. A Convex Feature Learning Formulation for Latent Task Structure Discovery Proceedings of the 29th International Conference on Machine Learning, ICML 2012, Edinburgh, Scotland, UK, June 26 - July 1, 2012.

Digital Library

[16]

Abhishek Kumar and Hal Daume III . 2012. Learning task grouping and overlap in multi-task learning. arXiv preprint arXiv:1206.6417 (2012).

Digital Library

[17]

Giwoong Lee, Eunho Yang, and Sung Hwang . 2016. Asymmetric multi-task learning based on task relatedness and loss International Conference on Machine Learning. 230--238.

Digital Library

[18]

Han Liu, Lie Wang, and Tuo Zhao . 2014. Multivariate regression with calibration. In Advances in neural information processing systems. 127--135.

Digital Library

[19]

Albert W Marshall, Ingram Olkin, and Barry C Arnold . 1979. Inequalities: theory of majorization and its applications. Vol. Vol. 143. Springer.

[20]

Karthik Mohan and Maryam Fazel . 2012. Iterative reweighted algorithms for matrix rank minimization. Journal of Machine Learning Research Vol. 13, Nov (2012), 3441--3473.

Digital Library

[21]

Feiping Nie, Heng Huang, Xiao Cai, and Chris H Ding . 2010. Efficient and robust feature selection via joint $ell_2,1$-norms minimization Advances in neural information processing systems. 1813--1821.

Digital Library

[22]

Feiping Nie, Jianjun Yuan, and Heng Huang . 2014. Optimal mean robust principal component analysis. In International Conference on Machine Learning. 1062--1070.

Digital Library

[23]

Chong Peng, Zhao Kang, Huiqing Li, and Qiang Cheng. 2015. Subspace clustering using log-determinant rank approximation. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 925-934.

Digital Library

[24]

Kaare Brandt Petersen, Michael Syskind Pedersen, et almbox. . 2008. The matrix cookbook. Technical University of Denmark Vol. 7, 15 (2008), 510.

[25]

Ting Kei Pong, Paul Tseng, Shuiwang Ji, and Jieping Ye . 2010. Trace norm regularization: Reformulations, algorithms, and multi-task learning. SIAM Journal on Optimization Vol. 20, 6 (2010), 3465--3489.

Digital Library

[26]

Robert Tibshirani . 1996. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological) (1996), 267--288.

[27]

Hua Wang, Feiping Nie, Heng Huang, Shannon Risacher, Chris Ding, Andrew J Saykin, Li Shen, et almbox. . 2011. Sparse multi-task regression and feature selection to identify brain imaging predictors for memory performance. In Computer Vision (ICCV), 2011 IEEE International Conference on. IEEE, 557--562.

Digital Library

[28]

Debing Zhang, Yao Hu, Jieping Ye, Xuelong Li, and Xiaofei He . 2012. Matrix completion by truncated nuclear norm regularization Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on. IEEE, 2192--2199.

Digital Library

[29]

Jian Zhang, Zoubin Ghahramani, and Yiming Yang . 2006. Learning multiple related tasks using latent independent component analysis Advances in neural information processing systems. 1585--1592.

Digital Library

[30]

Jian Zhang, Zoubin Ghahramani, and Yiming Yang . 2008. Flexible latent variable models for multi-task learning. Machine Learning Vol. 73, 3 (2008), 221--242.

Digital Library

[31]

Kai Zhang, Joe W Gray, and Bahram Parvin . 2010. Sparse multitask regression for identifying common mechanism of response to therapeutic targets. Bioinformatics Vol. 26, 12 (2010), i97--i105.

Digital Library

[32]

Yu Zhang and Qiang Yang . 2017. A survey on multi-task learning. arXiv preprint arXiv:1707.08114 (2017).

[33]

Yu Zhang and Dit-Yan Yeung . 2010. Multi-task warped gaussian process for personalized age estimation Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. IEEE, 2622--2629.

[34]

Jiayu Zhou, Jianhui Chen, and Jieping Ye . 2011 a. Malsar: Multi-task learning via structural regularization. Arizona State University Vol. 21 (2011).

[35]

Jiayu Zhou, Lei Yuan, Jun Liu, and Jieping Ye . 2011 b. A multi-task learning formulation for predicting disease progression Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 814--822.

Digital Library

Cited By

Li XLi PZhang HZhu KZhang R(2024)Pivotal-Aware Principal Component AnalysisIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.325260235:9(12201-12210)Online publication date: Sep-2024
https://doi.org/10.1109/TNNLS.2023.3252602
Zhou MWang XLiu TYang YYang P(2024)Integrating Visualised Automatic Temporal Relation Graph into Multi-Task Learning for Alzheimer's Disease Progression PredictionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.338571236:10(5206-5220)Online publication date: Oct-2024
https://doi.org/10.1109/TKDE.2024.3385712
Gou YLiu YHe FHunyadi BZhu C(2024)Tensor Completion for Alzheimer's Disease Prediction From Diffusion Tensor ImagingIEEE Transactions on Biomedical Engineering10.1109/TBME.2024.336513171:7(2211-2223)Online publication date: Jul-2024
https://doi.org/10.1109/TBME.2024.3365131
Show More Cited By

Index Terms

Calibrated Multi-Task Learning
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
      1. Multi-task learning
2. Mathematics of computing
  1. Probability and statistics
    1. Statistical paradigms
      1. Regression analysis

Recommendations

Integrating low-rank and group-sparse structures for robust multi-task learning
KDD '11: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining

Multi-task learning (MTL) aims at improving the generalization performance by utilizing the intrinsic relationships among multiple related tasks. A key assumption in most MTL algorithms is that all tasks are related, which, however, may not be the case ...
Variable Selection and Task Grouping for Multi-Task Learning
KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

We consider multi-task learning, which simultaneously learns related prediction tasks, to improve generalization performance. We factorize a coefficient matrix as the product of two matrices based on a low-rank assumption. These matrices have sparsities ...
Multi-stage multi-task feature learning

Multi-task sparse feature learning aims to improve the generalization performance by exploiting the shared features among tasks. It has been successfully applied to many applications including computer vision and biomedical informatics. Most of the ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

July 2018

2925 pages

ISBN:9781450355520

DOI:10.1145/3219819

General Chairs:
Yike Guo
Imperial College London
,
Faisal Farooq
IBM

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 July 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Natural Science Foundation of China

Conference

KDD '18

Sponsor:

KDD '18: The 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 19 - 23, 2018

London, United Kingdom

Acceptance Rates

KDD '18 Paper Acceptance Rate 107 of 983 submissions, 11%;

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

25
Total Citations
View Citations
1,948
Total Downloads

Downloads (Last 12 months)55
Downloads (Last 6 weeks)14

Reflects downloads up to 28 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Li XLi PZhang HZhu KZhang R(2024)Pivotal-Aware Principal Component AnalysisIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.325260235:9(12201-12210)Online publication date: Sep-2024
https://doi.org/10.1109/TNNLS.2023.3252602
Zhou MWang XLiu TYang YYang P(2024)Integrating Visualised Automatic Temporal Relation Graph into Multi-Task Learning for Alzheimer's Disease Progression PredictionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.338571236:10(5206-5220)Online publication date: Oct-2024
https://doi.org/10.1109/TKDE.2024.3385712
Gou YLiu YHe FHunyadi BZhu C(2024)Tensor Completion for Alzheimer's Disease Prediction From Diffusion Tensor ImagingIEEE Transactions on Biomedical Engineering10.1109/TBME.2024.336513171:7(2211-2223)Online publication date: Jul-2024
https://doi.org/10.1109/TBME.2024.3365131
Duan YWu DWang RLi XNie F(2024)Scalable and parameter-free fusion graph learning for multi-view clusteringNeurocomputing10.1016/j.neucom.2024.128037597(128037)Online publication date: Sep-2024
https://doi.org/10.1016/j.neucom.2024.128037
Chang XZhou MWang XYang YYang P(2024)Informative relationship multi-task learning: Exploring pairwise contribution across tasks’ sharing knowledgeKnowledge-Based Systems10.1016/j.knosys.2024.112187301(112187)Online publication date: Oct-2024
https://doi.org/10.1016/j.knosys.2024.112187
Liu ZWang ZWang TXu Y(2024)Multi-task label noise learning for classificationEngineering Applications of Artificial Intelligence10.1016/j.engappai.2023.107714130(107714)Online publication date: Apr-2024
https://doi.org/10.1016/j.engappai.2023.107714
Li DJu HSharma AZhang HSingh ASun YAkoglu LGunopulos DYan XKumar ROzcan FYe J(2023)Boosting Multitask Learning on Graphs through Higher-Order Task AffinitiesProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599265(1213-1222)Online publication date: 6-Aug-2023
https://dl.acm.org/doi/10.1145/3580305.3599265
Chang WNie FZhi YWang RLi X(2023)Multitask Learning for Classification Problem via New Tight Relaxation of Rank MinimizationIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2021.313291834:9(6055-6068)Online publication date: Sep-2023
https://doi.org/10.1109/TNNLS.2021.3132918
Zhang YLiu TLanfranchi VYang P(2023)Explainable Tensor Multi-Task Ensemble Learning Based on Brain Structure Variation for Alzheimer’s Disease Dynamic PredictionIEEE Journal of Translational Engineering in Health and Medicine10.1109/JTEHM.2022.321977511(1-12)Online publication date: 2023
https://doi.org/10.1109/JTEHM.2022.3219775
Chang WNie FWang RLi X(2023)Calibrated multi-task subspace learning via binary group structure constraintInformation Sciences10.1016/j.ins.2023.02.036Online publication date: Feb-2023
https://doi.org/10.1016/j.ins.2023.02.036
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten