Transfer active learning by querying committee

Shao, Hao; Tao, Feng; Xu, Rui

doi:10.1631/jzus.C1300167

Transfer active learning by querying committee

Published: 07 February 2014

Volume 15, pages 107–118, (2014)
Cite this article

Journal of Zhejiang University SCIENCE C Aims and scope Submit manuscript

Hao Shao¹,
Feng Tao² &
Rui Xu³

158 Accesses
Explore all metrics

Abstract

In real applications of inductive learning for classification, labeled instances are often deficient, and labeling them by an oracle is often expensive and time-consuming. Active learning on a single task aims to select only informative unlabeled instances for querying to improve the classification accuracy while decreasing the querying cost. However, an inevitable problem in active learning is that the informative measures for selecting queries are commonly based on the initial hypotheses sampled from only a few labeled instances. In such a circumstance, the initial hypotheses are not reliable and may deviate from the true distribution underlying the target task. Consequently, the informative measures will possibly select irrelevant instances. A promising way to compensate this problem is to borrow useful knowledge from other sources with abundant labeled information, which is called transfer learning. However, a significant challenge in transfer learning is how to measure the similarity between the source and the target tasks. One needs to be aware of different distributions or label assignments from unrelated source tasks; otherwise, they will lead to degenerated performance while transferring. Also, how to design an effective strategy to avoid selecting irrelevant samples to query is still an open question. To tackle these issues, we propose a hybrid algorithm for active learning with the help of transfer learning by adopting a divergence measure to alleviate the negative transfer caused by distribution differences. To avoid querying irrelevant instances, we also present an adaptive strategy which could eliminate unnecessary instances in the input space and models in the model space. Extensive experiments on both the synthetic and the real data sets show that the proposed algorithm is able to query fewer instances with a higher accuracy and that it converges faster than the state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Query by diverse committee in transfer active learning

Article 11 April 2019

Active Learning Query Strategies for Classification, Regression, and Clustering: A Survey

Article 27 July 2020

Multi-label active learning: key issues and a novel query strategy

Article 30 August 2017

References

Argyriou, A., Maurer, A., Pontil, M., 2008. An algorithm for transfer learning in a heterogeneous environment. Proc. European Conf. on Machine Learning and Knowledge Discovery in Databases, p.71–85. [doi:10.1007/978-3-540-87479-9_23]
Chapter Google Scholar
Balcan, M.F., Beygelzimer, A., Langford, J., 2006. Agnostic active learning. Proc. 23rd Int. Conf. on Machine Learning, p.65–72. [doi:10.1145/1143844.1143853]
Google Scholar
Cao, B., Pan, S.J., Zhang, Y., et al., 2010. Adaptive transfer learning. Proc. 24th AAAI Conf. on Artificial Intelligence, p.407–412.
Google Scholar
Caruana, R., 1997. Multitask learning. Mach. Learn., 28(1):41–75. [doi:10.1023/A:1007379606734]
Article MathSciNet Google Scholar
Chang, C.C., Lin, C.J., 2001. LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol., 2(3):27. [doi:10.1145/1961189.1961199]
Google Scholar
Chattopadhyay, R., Fan, W., Davidson, I., et al., 2013. Joint transfer and batch-mode active learning. Proc. 30th Int. Conf. on Machine Learning, p.253–261.
Google Scholar
Church, K.W., Gale, W.A., 1991. A comparison of the enhanced Good-Turing and deleted estimation methods for estimating probabilities of English bigrams. Comput. Speech Lang., 5(1):19–54. [doi:10.1016/0885-2308(91)90016-J]
Article Google Scholar
Cohn, D., Atlas, L., Ladner, R., 1994. Improving generalization with active learning. Mach. Learn., 15(2):201–221. [doi:10.1007/BF00993277]
Google Scholar
Dagan, I., Engelson, S.P., 1995. Committee-based sampling for training probabilistic classifiers. Proc. 12th Int. Conf. on Machine Learning, p.150–157.
Google Scholar
Dai, W., Yang, Q., Xue, G., et al., 2007. Boosting for transfer learning. Proc. 24th Int. Conf. on Machine Learning, p.193–200. [doi:10.1145/1273496.1273521]
Google Scholar
Freund, Y., Schapire, R.E., 1997. A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci., 55(1):119–139. [doi:10.1006/jcss.1997.1504]
Article MATH MathSciNet Google Scholar
Harpale, A., Yang, Y., 2010. Active learning for multi-task adaptive filtering. Proc. 27th Int. Conf. on Machine Learning, p.431–438.
Google Scholar
Krause, A., Guestrin, C., 2009. Optimal value of information in graphical models. J. Artif. Intell., 35:557–591.
MATH MathSciNet Google Scholar
Lewis, D.D., Gale, W.A., 1994. A sequential algorithm for training text classifiers. Proc. 17th Annual Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, p.3–12.
Google Scholar
Li, H., Shi, Y., Chen, M.Y., et al., 2010. Hybrid active learning for cross-domain video concept detection. Proc. Int. Conf. on Multimedia, p.1003–1006. [doi:10.1145/1873951.1874135]
Chapter Google Scholar
Li, L., Jin, X., Pan, S., et al., 2012. Multi-domain active learning for text classification. Proc. 18th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, p.1086–1094. [doi:10.1145/2339530.2339701]
Chapter Google Scholar
Lin, J.H., 1991. Divergence measures based on the Shannon entropy. IEEE Trans. Inform. Theory, 37(1):145–151. [doi:10.1109/18.61115]
Article MATH MathSciNet Google Scholar
Luo, C.Y., Ji, Y.S., Dai, X.Y., et al., 2012. Active learning with transfer learning. Proc. ACL Student Research Workshop, p.13–18.
Google Scholar
McCallum, A.K., Nigam, K., 1998. Employing EM and pool-based active learning for text classification. Proc. 15th Int. Conf. on Machine Learning, p.350–358.
Google Scholar
Muslea, I., Minton, S., Knoblock, C.A., 2002. Active+semi-supervised learning = robust multi-view learning. Proc. 19th Int. Conf. on Machine Learning, p.435–442.
Google Scholar
Pereira, F., Tishby, N., Lee, L., 1993. Distributional clustering of English words. Proc. 31st Annual Meeting of Association for Computational Linguistics, p.183–190. [doi:10.3115/981574.981598]
Chapter Google Scholar
Rajan, S., Ghosh, J., Crawford, M.M., 2006. An active learning approach to knowledge transfer for hyperspectral data analysis. Proc. IEEE Int. Conf. on Geoscience and Remote Sensing Symp., p.541–544. [doi:10.1109/IGARSS.2006.143]
Google Scholar
Reichart, R., Tomanek, K., Hahn, U., et al., 2008. Multi-task active learning for linguistic annotations. Proc. Annual Meeting of Association for Computational Linguistics, p.861–869.
Google Scholar
Rosenstein, M.T., Marx, Z., Kaelbling, L.P., et al., 2005. To transfer or not to transfer. Proc. NIPS Workshop on Inductive Transfer: 10 Years Later.
Google Scholar
Roy, N., McCallum, A., 2001. Toward optimal active learning hrough sampling estimation of error reduction. Proc. 18th Int. Conf. on Machine Learning, p.441–448.
Google Scholar
Settles, B., 2010. Active Learning Literature Survey. Technical Report No. 1648, University of Wisconsin, Madison.
Google Scholar
Seung, H.S., Opper, M., Sompolinsky, H., 1992. Query by committee. Proc. 5th Annual Workshop on Computational Learning Theory, p.287–294. [doi:10.1145/130385.130417]
Chapter Google Scholar
Shao, H., Suzuki, E., 2011. Feature-based inductive transfer learning through minimum encoding. Proc. SIAM Int. Conf. on Data Mining, p.259–270.
Google Scholar
Shao, H., Tong, B., Suzuki, E., 2011. Compact coding for hyperplane classifiers in heterogeneous environment. Proc. European Conf. on Machine Learning and Knowledge Discovery in Databases, p.207–222. [doi:10.1007/978-3-642-23808-6_14]
Chapter Google Scholar
Shi, X.X., Fan, W., Ren, J.T., 2008. Actively transfer domain knowledge. Proc. European Conf. on Machine Learning and Knowledge Discovery in Databases, p.342–357. [doi:10.1007/978-3-540-87481-2_23]
Chapter Google Scholar
Shi, Y., Lan, Z.Z., Liu, W., et al., 2009. Extending semi-supervised learning methods for inductive transfer learning. Proc. 9th IEEE Int. Conf. on Data Mining, p.483–492. [doi:10.1109/ICDM.2009.75]
Google Scholar
Yang, L., Hanneke, S., Carbonell, J., 2013. A theory of transfer learning with applications to active learning. Mach. Learn., 90(2):161–189. [doi:10.1007/s10994-012-5310-y]
Article MATH MathSciNet Google Scholar
Zhang, Y., 2010. Multi-task active learning with output constraints. Proc. 24th AAAI Conf. on Artificial Intelligence, p.667–672.
Google Scholar
Zhu, Z., Zhu, X., Ye, Y., et al., 2011. Transfer active learning. Proc. 20th ACM Int. Conf. on Information and Knowledge Management, p.2169–2172. [doi:10.1145/2063576.2063918]
Google Scholar
Zhuang, F., Luo, P., Shen, Z., et al., 2010. Collaborative Dual-PLSA: mining distinction and commonality across multiple domains for text classification. Proc. 19th ACM Int. Conf. on Information and Knowledge Management, p.359–368. [doi:10.1145/1871437.1871486]
Google Scholar

Download references

Author information

Authors and Affiliations

School of WTO Research & Education, Shanghai University of International Business and Economics, Shanghai, 200336, China
Hao Shao
School of Business, East China University of Science and Technology, Shanghai, 200237, China
Feng Tao
School of Computer Science and Technology, University of Science and Technology of China, Hefei, 230026, China
Rui Xu

Authors

Hao Shao
View author publications
You can also search for this author in PubMed Google Scholar
Feng Tao
View author publications
You can also search for this author in PubMed Google Scholar
Rui Xu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hao Shao.

Additional information

Project supported by the Humanity and Social Science Youth Foundation of Ministry of Education of China (No. 13YJC630126), the 085 Foundation of SUIBE (Nos. Z085YYJ13014 and 085LXPT13020), the Fundamental Research Funds for the Central Universities (No. WK0110000032), the National Natural Science Foundation of China (Nos. 71171184, 71201059, 71201151, 71090401, and 71090400), and the Funds for the Creative Research Group of China (No. 70821001)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shao, H., Tao, F. & Xu, R. Transfer active learning by querying committee. J. Zhejiang Univ. - Sci. C 15, 107–118 (2014). https://doi.org/10.1631/jzus.C1300167

Download citation

Received: 20 June 2013
Accepted: 09 November 2013
Published: 07 February 2014
Issue Date: February 2014
DOI: https://doi.org/10.1631/jzus.C1300167

Key words

CLC number

TP3

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Transfer active learning by querying committee

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Query by diverse committee in transfer active learning

Active Learning Query Strategies for Classification, Regression, and Clustering: A Survey

Multi-label active learning: key issues and a novel query strategy

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Key words

CLC number

Subscribe and save

Buy Now

Navigation

Transfer active learning by querying committee

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Query by diverse committee in transfer active learning

Active Learning Query Strategies for Classification, Regression, and Clustering: A Survey

Multi-label active learning: key issues and a novel query strategy

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Key words

CLC number

Subscribe and save

Buy Now

Search

Navigation