Skip to main content

Using Weighted Hybrid Discretization Method to Analyze Climate Changes

  • Conference paper
Computer Applications for Graphics, Grid Computing, and Industrial Environment (CGAG 2012, GDC 2012, IESH 2012)

Abstract

Data mining is the process of posing queries to large quantities of data and extracting information, often previously unknown, using mathematical, statistical and machine learning techniques. However some of the data mining techniques like classification and clustering cannot deal with numeric attributes though most real dataset contains some numeric attributes. Continuous attributes should be divided into a small distinct range of nominal attributes in order to apply data mining techniques. Correct discretization makes the dataset succinct and contributes to the high performance of classification algorithms. Meanwhile, several methods are presented and applied, but it is often dependent on the area. In this paper, we propose a weighted hybrid discretization technique based on entropy and contingency coefficient. Also we analyze performance evaluation with well-known techniques of discretization such as Equal-width binning, 1R, MDLP and ChiMerge.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Tan, P.-N., Steinbach, M., Kumar, V.: Introduction to Data Mining. Pearson Addison Wesley (2006)

    Google Scholar 

  2. Witten, I.H., Frank, E.: Data Mining: Practical Machine learning Tools and Techniques, 3rd edn. Morgan Kaufmann, San Francisco (2011)

    Google Scholar 

  3. Holte, R.C.: Very Simple Classification Rules Perform Well on Most Commonly Used Datasets. Machine Learning 11, 63–91 (1993)

    Article  MATH  Google Scholar 

  4. Fayyad, U.M., Irani, K.B.: Multi-interval discretization of continuous-valued attributes for classification learning. Artificial Intelligence 13, 1022–1027 (1993)

    Google Scholar 

  5. Barron, A., Rissanen, J., Yu, B.: The Minimum Description Length Principle in Coding and Modeling. IEEE Transactions on Information Theory 44(6), 2743–2760 (1998)

    Article  MATH  MathSciNet  Google Scholar 

  6. Kerber, R.: ChiMerge: Discretization of numeric attribute. In: Proc. AAAI 1991, 10th International Conference on Artificial Intelligence, pp. 123–127 (1992)

    Google Scholar 

  7. Perner, P., Trautzsch, S.: Multi-interval discretization methods for decision tree learning. Pattern Recognition 1451, 475–482 (1998)

    Google Scholar 

  8. Liu, H., Hussain, H.F., Tan, C.L., Dash, M.: Discretization: An enabling technique. Data Mining and Knowledge Discovery 6, 393–423 (2002)

    Article  MathSciNet  Google Scholar 

  9. Fayyad, U.M., Irani, K.B.: Multi-interval discretization of continuous-valued attributes for classification learning. Artificial Intelligence 13, 1022–1027 (1993)

    Google Scholar 

  10. Han, J., Kamber, M.: Data Mining Conceptsand Techniques. Morgan Kaufmann (2001)

    Google Scholar 

  11. Liu, H., Setiono, R.: Feature selection via discretization. IEEE Transactions on Knowledge and Data Engineering 9, 642–645

    Google Scholar 

  12. Kohavi, M.S.: Error-Based and Entropy-Based Discretization of Continuous Features. In: The 2nd International Conference on Knowledge Discovery and Data Mining, pp. 114–119 (1996)

    Google Scholar 

  13. Zhu, Q., Lin, L., Shyu, M.L., Chen, S.C.: Effective Supervised Discretization for Classification based on Correlation Maximization. IEEE Transactions on Information Feuse and Integration, 390–295 (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Jung, YG., Kim, K.M., Kwon, Y.M. (2012). Using Weighted Hybrid Discretization Method to Analyze Climate Changes. In: Kim, Th., Cho, Hs., Gervasi, O., Yau, S.S. (eds) Computer Applications for Graphics, Grid Computing, and Industrial Environment. CGAG GDC IESH 2012 2012 2012. Communications in Computer and Information Science, vol 351. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35600-1_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-35600-1_28

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-35599-8

  • Online ISBN: 978-3-642-35600-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics