Feature-Correlation Based Multi-view Detection

Zhang, Kuo; Tang, Jie; Li, JuanZi; Wang, KeHong

doi:10.1007/11424925_127

Kuo Zhang²⁴,
Jie Tang²⁴,
JuanZi Li²⁴ &
…
KeHong Wang²⁴

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3483))

Included in the following conference series:

International Conference on Computational Science and Its Applications

1694 Accesses

Abstract

A view validation algorithm has been shown to predict whether or not the views are sufficiently compatible for solving a particular learning task. But it only works when a natural split of features exists. If the split does not exist, it will fail to manufacture a feature split to build the best views. In this paper, we present a general algorithm CCFP (Correlation and Compatibility based Feature Partitioner) to automate multi-view detection. CCFP first labels the large amount of unlabeled examples using single view algorithm, then calculates the conditional SU (Symmetric Uncertainty) between every pair of features and the IG (Information Gain) of each feature given the examples labeled previously by single view algorithm with high-confidence predictions. According to the estimated values of SU and IG, all the features will be partitioned into two views that are low correlated, compatible and sufficient enough. The experiment results show that multi-view learner with views generated by CCFP outperforms learner with views generated by other means clearly.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Mining Best Strategy for Multi-view Classification

Ensemble multi-view feature set partitioning method for effective multi-view learning

Article 27 May 2024

Co-clustering based classification of multi-view data

Article 15 January 2022

References

Yarowsky, D.: Unsupervised word sense disambiguation rivaling supervised methods. In: Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, pp. 189–196 (1995)
Google Scholar
Brefeld, U., Scheffer, T.: Co-EM support vector learning. In: Proceedings of the 21st international conference on Machine learning (2004)
Google Scholar
Nigam, K., Ghani, R.: Analyzing the effectiveness and applicability of co-training. In: Proceedings of Information and Knowledge Management (2000)
Google Scholar
Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society B 39 (1977)
Google Scholar
Ion, M., Minton, S., Knoblock, C.: Adaptive view validation: A first step towards automatic view detection. In: The 19th International Conference on Machine Learning (ICML 2002), pp. 443–450 (2002)
Google Scholar
Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the Conference on Computational Learning Theory, pp. 92–100 (1998)
Google Scholar
Rayid, G.: Combining labeled and unlabeled data for text classification with a large number of categories. In: Proceedings of IEEE Conference on Data Mining (2001)
Google Scholar
Rayid, G.: Combining labeled and unlabeled data for multiclass text classification. In: Proceedings of the 19th International Conference on Machine Learning (ICML 2002), pp. 187–194 (2002)
Google Scholar
Yu, L., Liu, H.: Feature Selection for High-Dimensional Data: A Fast Correlation- Based Filter Solution. In: Proceedings of the 19th International Conference on Machine Learning (ICML 2003), pp. 856–863 (2003)
Google Scholar
Quinlan, J.: C4.5: Programs for machine learning. Morgan Kaufmann, San Francisco (1993)
Google Scholar
Press, W.H., Flannery, B.P., Teukolsky, S.A., Vetterling, W.T.: Numerical recipes in C. Cambridge University Press, Cambridge (1988)
MATH Google Scholar
Joachims, T.: A probabilistic analysis of the Rocchio algorithm with TFIDF for text categorization. In: Proceedings of ICML 1997 (1997)
Google Scholar
Miller, G.: WordNet: An online lexical database. International Journal of Lexicography (1990)
Google Scholar
Ion, M., Minton, S., Knoblock, C.: Active + Semi-supervised Learning = Robust Multi-view Learning. In: The 19th International Conference on Machine Learning (ICML 2002), pp. 435–442 (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

Knowledge Engineering Lab, Department of Computer Science, Tsinghua University, Beijing, 100084, P.R.China
Kuo Zhang, Jie Tang, JuanZi Li & KeHong Wang

Authors

Kuo Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jie Tang
View author publications
You can also search for this author in PubMed Google Scholar
JuanZi Li
View author publications
You can also search for this author in PubMed Google Scholar
KeHong Wang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mathematics and Computer Science, University of Perugia, via Vanvitelli, 1, I-06123, Perugia, Italy
Osvaldo Gervasi
Department of Computer Science, University of Calgary, 2500 University Drive N.W., T2N 1N4, Calgary, AB, Canada
Marina L. Gavrilova
William Norris Professor, Head of the Computer Science and Engineering Department, University of Minnesota, USA
Vipin Kumar
Department of Chemistry, University of Perugia, Via Elce di Sotto, 8, I-06123, Perugia, Italy
Antonio Laganá
Institute of High Performance Computing, IHCP, 1 Science Park Road, 01-01 The Capricorn, Singapore Science Park II, 117528, Singapore
Heow Pueh Lee
School of Computing, Soongsil University, Seoul, Korea
Youngsong Mun
Clayton School of IT, Monash University, 3800, Clayton, Australia
David Taniar
OptimaNumerics Ltd, Belfast, United Kingdom
Chih Jeng Kenneth Tan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, K., Tang, J., Li, J., Wang, K. (2005). Feature-Correlation Based Multi-view Detection. In: Gervasi, O., et al. Computational Science and Its Applications – ICCSA 2005. ICCSA 2005. Lecture Notes in Computer Science, vol 3483. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11424925_127

Download citation

DOI: https://doi.org/10.1007/11424925_127
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25863-6
Online ISBN: 978-3-540-32309-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Feature-Correlation Based Multi-view Detection

Abstract

Access this chapter

Preview

Similar content being viewed by others

Mining Best Strategy for Multi-view Classification

Ensemble multi-view feature set partitioning method for effective multi-view learning

Co-clustering based classification of multi-view data

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Feature-Correlation Based Multi-view Detection

Abstract

Access this chapter

Preview

Similar content being viewed by others

Mining Best Strategy for Multi-view Classification

Ensemble multi-view feature set partitioning method for effective multi-view learning

Co-clustering based classification of multi-view data

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation