
Ensemble of Anchor Adapters for Transfer Learning

Authors:

Sinno Jialin Pan,

Qing He

CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management

Pages 2335 - 2340

https://doi.org/10.1145/2983323.2983690

Published: 24 October 2016

Abstract

In the past decade, a large number of transfer learning algorithms have been proposed for various real-world applications. However, most of them are vulnerable to negative transfer, in which their performance is even worse than that of traditional supervised models. Aiming at more robust transfer learning models, we propose an ENsemble framework of anCHOR adapters (ENCHOR for short), in which an anchor adapter adapts the features of instances based on their similarities to a specific anchor (i.e., a selected instance). Specifically, the more similar an instance is to the anchor, the more of its original features remain unchanged in the adapted representation, and vice versa. This adapted representation expresses the local structure around the corresponding anchor, and any transfer learning method can then be applied to it to obtain a prediction model that focuses more on the neighborhood of the anchor. Next, based on multiple anchors, multiple anchor adapters can be built and combined into an ensemble for the final output. Additionally, we develop an effective measure to select the anchors for ensemble building to achieve further performance improvement. Extensive experiments on hundreds of text classification tasks are conducted to demonstrate the effectiveness of ENCHOR. The results show that when traditional supervised models perform poorly, ENCHOR (based on only 8 selected anchors) achieves a 6%-13% increase in average accuracy compared with the state-of-the-art methods, and it greatly alleviates negative transfer.
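
The anchor-adapter idea described in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: the abstract does not give the adaptation formula, the base learner, or the combination rule, so the code below assumes cosine similarity as the similarity function, a simple scaling of each instance by its similarity to the anchor as the adaptation, a nearest-centroid classifier standing in for "any transfer learning method", and a majority vote as the ensemble. All function names (`adapt`, `train_centroids`, `ensemble_predict`) are hypothetical, and the anchor selection measure is not modeled.

```python
import math
from collections import Counter
from typing import Dict, Hashable, List, Sequence

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def adapt(x: Vector, anchor: Vector) -> List[float]:
    """Assumed anchor adapter: scale x by its similarity to the anchor.

    Similarity 1 leaves x unchanged; similarity 0 maps it to the zero vector.
    """
    weight = max(0.0, cosine_similarity(x, anchor))
    return [weight * v for v in x]


def train_centroids(X: Sequence[Vector], y: Sequence[Hashable]) -> Dict[Hashable, List[float]]:
    """Stand-in base learner: one mean vector per class."""
    sums: Dict[Hashable, List[float]] = {}
    counts: Counter = Counter()
    for x, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[label] += 1
    return {label: [v / counts[label] for v in acc] for label, acc in sums.items()}


def ensemble_predict(
    X_src: Sequence[Vector],
    y_src: Sequence[Hashable],
    anchors: Sequence[Vector],
    x: Vector,
) -> Hashable:
    """Train one model per anchor on the adapted data, then majority-vote."""
    votes: Counter = Counter()
    for anchor in anchors:
        adapted_src = [adapt(s, anchor) for s in X_src]
        centroids = train_centroids(adapted_src, y_src)
        adapted_x = adapt(x, anchor)
        # Predict the class whose centroid is most similar to the adapted instance.
        label = max(centroids, key=lambda c: cosine_similarity(adapted_x, centroids[c]))
        votes[label] += 1
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    # Toy data, invented purely for illustration.
    X_src = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
    y_src = ["a", "a", "b", "b"]
    anchors = [[1.0, 0.0], [0.0, 1.0]]
    print(ensemble_predict(X_src, y_src, anchors, [0.8, 0.2]))
```

In this sketch, source instances far from an anchor contribute little to that anchor's centroids, so each per-anchor model concentrates on the anchor's neighborhood, which is the intuition the abstract describes.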

Index Terms

Ensemble of Anchor Adapters for Transfer Learning
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
      1. Multi-task learning
        Transfer learning

Published In

CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management

October 2016

2566 pages

ISBN:9781450340731

DOI:10.1145/2983323

General Chairs:
Snehasis Mukhopadhyay (Indiana University Purdue University Indianapolis, USA)
ChengXiang Zhai (University of Illinois at Urbana-Champaign, USA)

Program Chairs:
Elisa Bertino (Purdue University)
Fabio Crestani (University of Lugano)
Javed Mostafa (University of North Carolina)
Jie Tang (Tsinghua University)
Luo Si (Alibaba Group Inc & Purdue University)
Xiaofang Zhou (University of Queensland)
Yi Chang (Yahoo Research)
Yunyao Li (IBM Research - Almaden)
Parikshit Sondhi (WalmartLabs)

Copyright © 2016 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Funding Sources

Guangdong provincial science and technology plan projects
National Natural Science Foundation of China

Conference

CIKM'16

CIKM'16: ACM Conference on Information and Knowledge Management

October 24 - 28, 2016

Indianapolis, Indiana, USA

Acceptance Rates

CIKM '16 Paper Acceptance Rate 160 of 701 submissions, 23%;

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Bibliometrics & Citations

Bibliometrics

Article Metrics

Total Citations: 4
Total Downloads: 260
Downloads (Last 12 months): 10
Downloads (Last 6 weeks): 0

Reflects downloads up to 28 Feb 2025

Cited By

Wang Q, Xu Y, Yang S, Chang J, Zhang J, Kong X (2024). A domain adaptation method for bearing fault diagnosis using multiple incomplete source data. Journal of Intelligent Manufacturing 35:2 (777-791). Online publication date: 1-Feb-2024.
https://dl.acm.org/doi/10.1007/s10845-023-02075-7

Xi J, Ersoy O, Fang J, Wu T, Wei X, Zhao C (2023). Parallel Multistage Wide Neural Network. IEEE Transactions on Neural Networks and Learning Systems 34:8 (4019-4032). Online publication date: Aug-2023.
https://doi.org/10.1109/TNNLS.2021.3120331

Zhuang F, Qi Z, Duan K, Xi D, Zhu Y, Zhu H, Xiong H, He Q (2021). A Comprehensive Survey on Transfer Learning. Proceedings of the IEEE 109:1 (43-76). Online publication date: Jan-2021.
https://doi.org/10.1109/JPROC.2020.3004555

Yang S, Kong X, Wang Q, Li Z, Cheng H, Yu L (2021). A multi-source ensemble domain adaptation method for rotary machine fault diagnosis. Measurement 186 (110213). Online publication date: Dec-2021.
https://doi.org/10.1016/j.measurement.2021.110213
