Using Consensus Ensembles to Identify Suspect Data

Clark, David

doi:10.1007/978-3-540-30133-2_63

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3214)

Included in the following conference series:

International Conference on Knowledge-Based and Intelligent Information and Engineering Systems

Abstract

In a consensus ensemble, all members must agree before the ensemble classifies a data point. Even when they all agree, however, some data is still misclassified. In this paper we look closely at consistently misclassified data to investigate whether some of it may be outliers or may have been mislabelled.
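
The consensus rule described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the function names `consensus_label` and `suspect_indices`, the toy predictions, and the use of `None` to mean "no classification" are all assumptions made for illustration. The sketch only shows the unanimity rule and one plausible way to flag points where a unanimous ensemble disagrees with the recorded label.

```python
from typing import Hashable, List, Optional, Sequence


def consensus_label(votes: Sequence[Hashable]) -> Optional[Hashable]:
    """Return the shared label if every member agrees, otherwise None.

    Assumption: None is never a real class label, so it can mean "abstain".
    """
    if not votes:
        return None
    first = votes[0]
    return first if all(v == first for v in votes) else None


def suspect_indices(
    member_predictions: Sequence[Sequence[Hashable]],
    given_labels: Sequence[Hashable],
) -> List[int]:
    """Indices where the ensemble is unanimous but contradicts the given label.

    member_predictions[m][i] is member m's prediction for data point i.
    """
    suspects = []
    for i, label in enumerate(given_labels):
        votes = [preds[i] for preds in member_predictions]
        agreed = consensus_label(votes)
        # Flag only unanimous decisions that disagree with the recorded label.
        if agreed is not None and agreed != label:
            suspects.append(i)
    return suspects


# Hypothetical toy data: three ensemble members, four data points.
member_predictions = [
    ["a", "b", "a", "b"],
    ["a", "b", "b", "b"],
    ["a", "b", "a", "b"],
]
given_labels = ["a", "a", "a", "b"]

print(suspect_indices(member_predictions, given_labels))  # [1]
```

In this toy run, point 1 is flagged because all three members predict "b" while the recorded label is "a"; point 2 is left unclassified because the members disagree. Whether a flagged point is an outlier or a mislabelled example is exactly the question the paper investigates, and the sketch does not attempt to answer it.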

Author information

Authors and Affiliations

School of Information Sciences and Engineering, University of Canberra, ACT, 2601, Australia
David Clark

Editor information

Editors and Affiliations

KES International, 2nd Floor, 145-157 St John Street, EC1V 4PY, London, United Kingdom
Mircea Gh. Negoita
Centre for SMART systems Engineering Research Centre, University of Brighton, BN2 4GJ, Moulsecoomb, Brighton, UK
Robert J. Howlett
School of Electrical and Information Engineering, Knowledge Based Intelligent Engineering Systems Centre, University of South Australia, Mawson Lakes, 5095, Mawson Lakes, SA, Australia
Lakhmi C. Jain

About this paper

Cite this paper

Clark, D. (2004). Using Consensus Ensembles to Identify Suspect Data. In: Negoita, M.G., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2004. Lecture Notes in Computer Science, vol 3214. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30133-2_63

DOI: https://doi.org/10.1007/978-3-540-30133-2_63
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23206-3
Online ISBN: 978-3-540-30133-2
eBook Packages: Springer Book Archive
