Skip to main content

Using Consensus Ensembles to Identify Suspect Data

  • Conference paper
Knowledge-Based Intelligent Information and Engineering Systems (KES 2004)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3214))

Abstract

In a consensus ensemble all members must agree before they classify a data point. But even when they all agree some data is still misclassified. In this paper we look closely at consistently misclassified data to investigate whether some of it may be outliers or may have been mislabelled.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alpaydin, E.: Multiple networks for function learning. In: Proceedings of the 1993 IEEE International conference on Neural Networks, vol. I, pp. 27–32 (1993)

    Google Scholar 

  2. Barnett, V., Lewis, T.: Outliers in Statistical Data, 3rd edn. Wiley, Chichester, England (1994)

    MATH  Google Scholar 

  3. Brieman, L.: Stacked regressions. Machine Learning 24(1), 49–64 (1996)

    Google Scholar 

  4. Brieman, L.: Bagging predictors. Machine Learning 24(2), 123–140 (1996)

    Google Scholar 

  5. Clemen, R.: Combining forecasts: A review and annotated bibliography. Journal of forecasting 5, 559–583 (1989)

    Article  Google Scholar 

  6. Cox, R., Clark, D., Richardson, A.: An investigation into the effect of ensemble size and voting threshold on the accuracy of neural network ensembles. In: The 12th Australian Joint Conference on Artificial Intelligence (AI 1999), Sydney, December 1999, pp. 268–277 (1999)

    Google Scholar 

  7. Freund, Y., Schapire, R.: Experiments with a new boosting algorithm. In: Proceedings of the Thirteenth International Conference on Machine Learning, pp. 148–156. Morag Kaufmann, San Francisco (1996)

    Google Scholar 

  8. Hampel, F.R.: Robust Estimation: A condensed partial survey. Z. Wahrsch. Verw. Geb. 27, 87–104

    Google Scholar 

  9. Huber, P.J.: Robust Statistical Procedures, 2nd edn. SIAM, Philadelphia (1996)

    Google Scholar 

  10. Knowledge Engineering Laboratory, Department of Information Science, University of Otago at, http://divcom.otago.ac.nz/infosci/kel/

  11. Nash, W.J., Sellers, T.L., Talbot, S.R., Cawthorn, A.J., Ford, W.B.: The Population Biology of Abalone (Haliotis species) in Tasmania. I. Blacklip Abalone (H. rubra) from the North Coast and Islands of the Bass Strait, SFD, Tasmania. Technical Report # 48 (1994)

    Google Scholar 

  12. UCI machine learning repository at, http://www.ics.uci.edu/~mlearn/MLRepository.html

  13. Wolpert, D.: Stacked generalization. Neural Networks 5, 241–259 (1992)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Clark, D. (2004). Using Consensus Ensembles to Identify Suspect Data. In: Negoita, M.G., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2004. Lecture Notes in Computer Science(), vol 3214. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30133-2_63

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30133-2_63

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-23206-3

  • Online ISBN: 978-3-540-30133-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics