Abstract
Much work has been done on test-cost-sensitive learning for data with missing values. Previous strategies face a conflict between efficiency and accuracy. Sequential test strategies achieve high accuracy but low efficiency, because tests must be performed one at a time. Batch strategies are efficient but perform poorly, since they make all decisions at once using only the initial information. In this paper, we propose a new test strategy, the GTD algorithm, to address this problem. The algorithm uses the training data to judge the benefit brought by each unknown attribute and greedily chooses the most useful unknown attribute each time, until no rewarding unknown attribute remains. Judging the utility of an unknown attribute from its real performance on the training data is more reasonable than relying on an estimate. Our strategy is valuable because it is both efficient (it uses only the training data, so it is not sequential) and achieves lower total costs than previous strategies. Experiments also show that our algorithm significantly outperforms previous algorithms, especially when the missing rate is high and test costs fluctuate widely.
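The greedy loop sketched in the abstract (repeatedly acquire the unknown attribute with the largest net benefit, stopping when none is rewarding) can be illustrated roughly as follows. This is a minimal sketch under stated assumptions: the `utility` function, the cost dictionary, and the stopping rule are illustrative stand-ins, not the paper's actual definitions of training-data benefit.

```python
def greedy_test_selection(unknown_attrs, test_costs, utility):
    """Hedged sketch of a GTD-style greedy loop: repeatedly acquire the
    unknown attribute whose utility (assumed here to be measured on the
    training data) minus its test cost is largest and still positive.

    `utility(attr, chosen)` is a hypothetical callable supplied by the
    caller; it is not the paper's concrete benefit measure.
    """
    chosen = []
    remaining = set(unknown_attrs)
    while remaining:
        # Net benefit of acquiring each still-unknown attribute.
        gains = {a: utility(a, chosen) - test_costs[a] for a in remaining}
        best = max(gains, key=gains.get)
        if gains[best] <= 0:  # no rewarding unknown attribute left: stop
            break
        chosen.append(best)
        remaining.remove(best)
    return chosen


# Toy illustration with fixed, made-up utilities and costs:
# attribute "a" has net gain 3-1=2 (acquired); "b" (4-5) and "c" (1-2)
# are never rewarding, so the loop stops after one acquisition.
toy_utility = lambda a, chosen: {"a": 3, "b": 4, "c": 1}[a]
selected = greedy_test_selection(["a", "b", "c"],
                                 {"a": 1, "b": 5, "c": 2},
                                 toy_utility)
```

In this toy run, `selected` is `["a"]`: only attribute "a" has a positive net benefit, matching the abstract's stopping criterion of halting when no unknown attribute is rewarding.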
© 2010 Springer-Verlag Berlin Heidelberg
Wan, C. (2010). Test-Cost Sensitive Classification Using Greedy Algorithm on Training Data. In: Cai, Z., Hu, C., Kang, Z., Liu, Y. (eds) Advances in Computation and Intelligence. ISICA 2010. Lecture Notes in Computer Science, vol 6382. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16493-4_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16492-7
Online ISBN: 978-3-642-16493-4
eBook Packages: Computer Science (R0)