Skip to main content

Test-Cost Sensitive Classification Using Greedy Algorithm on Training Data

  • Conference paper
Advances in Computation and Intelligence (ISICA 2010)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6382))

Included in the following conference series:

  • 1643 Accesses

Abstract

Much work has been done to deal with the test-cost sensitive learning on data with missing values. There is a confliction of efficiency and accuracy among previous strategies. Sequential test strategies have high accuracy but low efficiency because of their sequential property. Some batch strategies have high efficiency but lead to poor performance since they make all decisions at one time using initial information. In this paper, we propose a new test strategy, GTD algorithm, to address this problem. Our algorithm uses training data to judge the benefits brought by an unknown attribute and chooses the most useful unknown attribute each time until there is no rewarding unknown attributes. It is more reasonable to judge the utility of an unknown attribute from the real performance on training data other than from the estimation. Our strategy is meaningful since it has high efficiency(We only use training data so GTD is not sequential) and lower total costs than the previous strategies at the same time. The experiments also prove that our algorithm significantly outperforms previous algorithms especially when there is a high missing rate and large fluctuations of test costs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Mitchell, T.M.: Machine Learning. McGraw Hill, New York (1997)

    MATH  Google Scholar 

  2. Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Francisco (1993)

    Google Scholar 

  3. Juang, B.-H., Katagiri, S.: Discriminative learning for minimum error classification. IEEE Transactions on Signal Processing 40(12) (1992)

    Google Scholar 

  4. Cortes, C., Vapnik, V.: Support-vector networks. Machine Learning 20(3) (1995)

    Google Scholar 

  5. Turney, P.D.: Types of cost in inductive concept learning. In: Workshop Cost-Sensitive Learning at the 17th Int’l. Conf. Machine Learning (2000)

    Google Scholar 

  6. Elkan, C.: The foundations of cost-sensitive learning. In: 7th Int’l. Joint Conf. Artificial Intelligence, pp. 973–978 (2001)

    Google Scholar 

  7. Domingos, P., Pazzani, M.: On the optimality of the simple bayesian classifier under zero-one loss. Machine Learning, 103–130 (1997)

    Google Scholar 

  8. Kai, M.T.: Inducing cost-sensitive trees via instance weighting. In: Żytkow, J.M. (ed.) PKDD 1998. LNCS, vol. 1510, pp. 139–147. Springer, Heidelberg (1998)

    Google Scholar 

  9. Nunez, M.: The use of background knowledge in decision tree induction. Machine Learning, 231–250 (1991)

    Google Scholar 

  10. Tan, M.: Cost-sensitive learning of classification knowledge and its applications in robotics. Machine Learning J., 7–33 (1993)

    Google Scholar 

  11. Yang, Q., Ling, C., Chai, X., Pan, R.: Test-cost sensitive classification on data with missing values. IEEE Transactions on Knowledge and Data Engineering (5) (2006)

    Google Scholar 

  12. Cebe, M., Gunduz-Demir, C.: Test-cost sensitive classification based on conditioned loss functions. Machine Learning (2007)

    Google Scholar 

  13. Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley and Sons, Inc., Chichester (2001)

    MATH  Google Scholar 

  14. Blake, C.L., Merz, C.J.: Uci repository of machine learning databases (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wan, C. (2010). Test-Cost Sensitive Classification Using Greedy Algorithm on Training Data. In: Cai, Z., Hu, C., Kang, Z., Liu, Y. (eds) Advances in Computation and Intelligence. ISICA 2010. Lecture Notes in Computer Science, vol 6382. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16493-4_46

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-16493-4_46

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-16492-7

  • Online ISBN: 978-3-642-16493-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics