skip to main content
10.1145/2983323.2983690acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Ensemble of Anchor Adapters for Transfer Learning

Published: 24 October 2016 Publication History

Abstract

In the past decade, there have been a large number of transfer learning algorithms proposed for various real-world applications. However, most of them are vulnerable to negative transfer since their performance is even worse than traditional supervised models. Aiming at more robust transfer learning models, we propose an ENsemble framework of anCHOR adapters (ENCHOR for short), in which an anchor adapter adapts the features of instances based on their similarities to a specific anchor (i.e., a selected instance). Specifically, the more similar to the anchor instance, the higher degree of the original feature of an instance remains unchanged in the adapted representation, and vice versa. This adapted representation for the data actually expresses the local structure around the corresponding anchor, and then any transfer learning method can be applied to this adapted representation for a prediction model, which focuses more on the neighborhood of the anchor. Next, based on multiple anchors, multiple anchor adapters can be built and combined into an ensemble for final output. Additionally, we develop an effective measure to select the anchors for ensemble building to achieve further performance improvement. Extensive experiments on hundreds of text classification tasks are conducted to demonstrate the effectiveness of ENCHOR. The results show that: when traditional supervised models perform poorly, ENCHOR (based on only 8 selected anchors) achieves $6%-13%$ increase in terms of average accuracy compared with the state-of-the-art methods, and it greatly alleviates negative transfer.

References

[1]
Sinno Jialin Pan and Qiang Yang. A survey on transfer learning. IEEE TKDE, pages 1345--1359, 2010.
[2]
Xuejun Liao, Ya Xue, and Lawrence Carin. Logistic regression with an auxiliary data source. In Proceedings of the 22nd ICML, pages 505--512, 2005.
[3]
Wenyuan Dai, Qiang Yang, Gui-Rong Xue, and Yong Yu. Boosting for transfer learning. In Proceedings of the 24th ICML, pages 193--200, 2007.
[4]
Minmin Chen, Zhixiang Eddie Xu, Kilian Q. Weinberger, and Fei Sha. Marginalized denoising autoencoders for domain adaptation. In Proceedings of the 29th ICML, 2012.
[5]
Ping Luo, Fuzhen Zhuang, Hui Xiong, Yuhong Xiong, and Qing He. Transfer learning from multiple source domains via consensus regularization. In Proceedings of the 17th ACM CIKM, pages 103--112, 2008.
[6]
Lixin Duan, Dong Xu, and Shih-Fu Chang. Exploiting web images for event recognition in consumer videos: A multiple source domain adaptation approach. In IEEE CVPR, pages 1338--1345, 2012.
[7]
Jing Gao, Wei Fan, Jing Jiang, and Jiawei Han. Knowledge transfer via multiple model local structure mapping. In Proceedings of the 14th ACM SIGKDD, 2008.
[8]
Mark Dredze, Alex Kulesza, and Koby Crammer. Multi-domain learning by confidence-weighted parameter combination. Machine Learning, pages 123--149, 2010.
[9]
Fuzhen Zhuang, Ping Luo, Hui Xiong, Qing He, Yuhong Xiong, and Zhongzhi Shi. Exploiting associations between word clusters and document classes for cross-domain text categorization. Statistical Analysis and Data Mining, pages 100--114, 2011.
[10]
Solomon Kullback. Letter to the editor: the kullback-leibler distance. AMERICAN STATISTICIAN, 1987.
[11]
Karsten M Borgwardt, Arthur Gretton, Malte J Rasch, Hans-Peter Kriegel, Bernhard Schölkopf, and Alex J Smola. Integrating structured biological data by kernel maximum mean discrepancy. Bioinformatics, pages e49--e57, 2006.
[12]
Wenyuan Dai, Gui-Rong Xue, Qiang Yang, and Yong Yu. Co-clustering based classification for out-of-domain documents. In Proceedings of the 13th ACM SIGKDD, pages 210--219, 2007.
[13]
Sinno Jialin Pan, James T Kwok, and Qiang Yang. Transfer learning via dimensionality reduction. In AAAI, pages 677--682, 2008.
[14]
Fuzhen Zhuang, Ping Luo, Hui Xiong, Yuhong Xiong, Qing He, and Zhongzhi Shi. Cross-domain learning from multiple sources: a consensus regularization perspective. IEEE TKDE, pages 1664--1678, 2010.
[15]
David Hosmer and Stanley Lemeshow. Applied Logistic Regression. Wiley, New York, 2000.
[16]
Thomas Hofmann. Unsupervised learning by probabilistic latent semantic analysis. Machine learning, pages 177--196, 2001.
[17]
Wenyuan Dai, Gui-Rong Xue, Qiang Yang, and Yong Yu. Transferring naive bayes classifiers for text classification. In AAAI, pages 540--545, 2007.
[18]
Si Si, Dacheng Tao, and Bo Geng. Bregman divergence-based regularization for transfer subspace learning. IEEE TKDE, pages 929--942, 2010.
[19]
Sinno Jialin Pan, Ivor W Tsang, James T Kwok, Qiang Yang, et al. Domain adaptation via transfer component analysis. IEEE TNN, pages 199--210, 2011.
[20]
Lixin Duan, Ivor W Tsang, Dong Xu, and Tat-Seng Chua. Domain adaptation from multiple sources via auxiliary classifiers. In Proceedings of the 26th ICML, pages 289--296, 2009.
[21]
Yishay Mansour, Mehryar Mohri, and Afshin Rostamizadeh. Domain adaptation with multiple sources. In NIPS, pages 1041--1048, 2009.
[22]
Liang Ge, Jing Gao, Hung Ngo, Kang Li, and Aidong Zhang. On handling negative transfer and imbalanced distributions in multiple source transfer learning. Statistical Analysis and Data Mining: The ASA Data Science Journal, pages 254--271, 2014.

Cited By

View all
  • (2024)A domain adaptation method for bearing fault diagnosis using multiple incomplete source dataJournal of Intelligent Manufacturing10.1007/s10845-023-02075-735:2(777-791)Online publication date: 1-Feb-2024
  • (2023)Parallel Multistage Wide Neural NetworkIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2021.312033134:8(4019-4032)Online publication date: Aug-2023
  • (2021)A Comprehensive Survey on Transfer LearningProceedings of the IEEE10.1109/JPROC.2020.3004555109:1(43-76)Online publication date: Jan-2021
  • Show More Cited By

Index Terms

  1. Ensemble of Anchor Adapters for Transfer Learning

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
    October 2016
    2566 pages
    ISBN:9781450340731
    DOI:10.1145/2983323
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 24 October 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. classifcation
    2. transfer learning

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    CIKM'16
    Sponsor:
    CIKM'16: ACM Conference on Information and Knowledge Management
    October 24 - 28, 2016
    Indiana, Indianapolis, USA

    Acceptance Rates

    CIKM '16 Paper Acceptance Rate 160 of 701 submissions, 23%;
    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)10
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 28 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A domain adaptation method for bearing fault diagnosis using multiple incomplete source dataJournal of Intelligent Manufacturing10.1007/s10845-023-02075-735:2(777-791)Online publication date: 1-Feb-2024
    • (2023)Parallel Multistage Wide Neural NetworkIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2021.312033134:8(4019-4032)Online publication date: Aug-2023
    • (2021)A Comprehensive Survey on Transfer LearningProceedings of the IEEE10.1109/JPROC.2020.3004555109:1(43-76)Online publication date: Jan-2021
    • (2021)A multi-source ensemble domain adaptation method for rotary machine fault diagnosisMeasurement10.1016/j.measurement.2021.110213186(110213)Online publication date: Dec-2021

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media