Batch Mode Active Learning for Networked Data with Optimal Subset Selection

Xu, Haihui; Zhao, Pengpeng; Sheng, Victor S.; Liu, Guanfeng; Zhao, Lei; Wu, Jian; Cui, Zhiming

doi:10.1007/978-3-319-21042-1_8

Haihui Xu¹⁷,
Pengpeng Zhao¹⁷,
Victor S. Sheng¹⁸,
Guanfeng Liu¹⁷,
Lei Zhao¹⁷,
Jian Wu¹⁷ &
…
Zhiming Cui¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9098))

Included in the following conference series:

International Conference on Web-Age Information Management

2854 Accesses

Abstract

Active learning has increasingly become an important paradigm for classification of networked data, where instances are connected with a set of links to form a network. In this paper, we propose a novel batch mode active learning method for networked data (BMALNeT). Our novel active learning method selects the best subset of instances from the unlabeled set based on the correlation matrix that we construct from the dedicated informativeness evaluation of each unlabeled instance. To evaluate the informativeness of each unlabeled instance accurately, we simultaneously exploit content information and the network structure to capture the uncertainty and representativeness of each instance and the disparity between any two instances. Compared with state-of-the-art methods, our experimental results on three real-world datasets demonstrate the effectiveness of our proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Semi-supervised batch active learning based on mutual information

Article 10 December 2024

An Active Learning Method via Expected Model Loss Reduction

Batch mode active learning via adaptive criteria weights

Article 14 November 2020

References

Baldridge, J., Osborne, M.: Active learning and the total cost of annotation. In: EMNLP 2004, A meeting of SIGDAT, pp. 9–16 (2004)
Google Scholar
Cohn, D.A., Ghahramani, Z., Jordan, M.I.: Active learning with statistical models. J. Artif. Intell. Res. (JAIR) 4, 129–145 (1996)
MATH Google Scholar
Macskassy, S.A.: Using graph-based metrics with empirical risk minimization to speed up active learning on networked data. In: KDDM 2009, pp. 597–606. ACM (2009)
Google Scholar
Shi, L., Zhao, Y., Tang, J.: Batch mode active learning for networked data. ACM Transactions on Intelligent Systems and Technology (TIST) 3(2), 33 (2012)
Google Scholar
Yang, Z., Tang, J., Xu, B., Xing, C.: Active learning for networked data based on non-progressive diffusion model. In: Proceedings of the 7th ACM International Conference on Web Search and Data Mining, pp. 363–372. ACM (2014)
Google Scholar
Joshi, A.J., Porikli, F., Papanikolopoulos, N.: Multi-class active learning for image classification. In: CVPR 2009, pp. 2372–2379. IEEE (2009)
Google Scholar
Melville, P., Mooney, R.J.: Diverse ensembles for active learning. In: Proceedings of the Twenty-First International Conference on Machine Learning, p. 74. ACM (2004)
Google Scholar
Jensen, D., Neville, J., Gallagher, B.: Why collective inference improves relational classification. In: KDDM 2004, pp. 593–598. ACM (2004)
Google Scholar
Hu, X., Tang, J., Gao, H., Liu, H.: Actnet: Active learning for networked texts in microblogging. In: SDM, pp. 306–314. SIAM (2013)
Google Scholar
Cesa-Bianchi, N., Gentile, C., Vitale, F., Zappella, G.: Active learning on trees and graphs. arXiv preprint arXiv:1301.5112 (2013)
Fang, M., Yin, J., Zhang, C., Zhu, X., Fang, M., Yin, J., Zhu, X., Zhang, C.: Active class discovery and learning for networked data. In: SDM, pp. 315–323. SIAM (2013)
Google Scholar
Bilgic, M., Mihalkova, L., Getoor, L.: Active learning for networked data. In: ICML 2010, pp. 79–86 (2010)
Google Scholar
Zhuang, H., Tang, J., Tang, W., Lou, T., Chin, A., Wang, X.: Actively learning to infer social ties. Data Mining and Knowledge Discovery 25(2), 270–297 (2012)
Article MATH MathSciNet Google Scholar
Newman, M.: Networks: an introduction. Oxford University Press (2010)
Google Scholar
Freeman, L.C.: A set of measures of centrality based on betweenness. Sociometry, 35–41 (1977)
Google Scholar
Brandes, U.: On variants of shortest-path betweenness centrality and their generic computation. Social Networks 30(2), 136–145 (2008)
Article MathSciNet Google Scholar
Fu, Y., Zhu, X., Elmagarmid, A.K.: Active learning with optimal instance subset selection. IEEE Transactions on Cybernetics 43(2), 464–475 (2013)
Article Google Scholar
Goemans, M.X., Williamson, D.P.: Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming. Journal of the ACM (JACM) 42(6), 1115–1145 (1995)
Article MATH MathSciNet Google Scholar
Fujisawa, K., Kojima, M., Nakata, K.: Sdpa (semidefinite programming algorithm) user manual-version 4.10. Department of Mathematical and Computing Science, Tokyo Institute of Technology, Research Report, Tokyo (1998)
Google Scholar
Sen, P., Namata, G.M., Bilgic, M., Getoor, L., Gallagher, B., Eliassi-Rad, T.: Collective classification in network data. AI Magazine 29(3), 93–106 (2008)
Google Scholar
Chang, C.C., Lin, C.J.: Libsvm: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST) 2(3), 27 (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science and Technology, Soochow University, Suzhou, 215006, People’s Republic of China
Haihui Xu, Pengpeng Zhao, Guanfeng Liu, Lei Zhao, Jian Wu & Zhiming Cui
Computer Science Department, University of Central Arkansas, Conway, USA
Victor S. Sheng

Authors

Haihui Xu
View author publications
You can also search for this author in PubMed Google Scholar
Pengpeng Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Victor S. Sheng
View author publications
You can also search for this author in PubMed Google Scholar
Guanfeng Liu
View author publications
You can also search for this author in PubMed Google Scholar
Lei Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Jian Wu
View author publications
You can also search for this author in PubMed Google Scholar
Zhiming Cui
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pengpeng Zhao .

Editor information

Editors and Affiliations

Google, CA, USA
Xin Luna Dong
Postdoc Apartments (Hong Lou) 4-1-4, Shandong University, Li Cheng, Jinan, China
Xiaohui Yu
Tsinghua University, Beijing, China
Jian Li
Northeastern University, BOSTON, USA
Yizhou Sun

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xu, H. et al. (2015). Batch Mode Active Learning for Networked Data with Optimal Subset Selection. In: Dong, X., Yu, X., Li, J., Sun, Y. (eds) Web-Age Information Management. WAIM 2015. Lecture Notes in Computer Science(), vol 9098. Springer, Cham. https://doi.org/10.1007/978-3-319-21042-1_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-21042-1_8
Published: 06 June 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-21041-4
Online ISBN: 978-3-319-21042-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics