A Supervised and Multivariate Discretization Algorithm for Rough Sets

  • Conference paper
Rough Set and Knowledge Technology (RSKT 2010)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6401)

Abstract

Rough set theory has become an important mathematical tool for dealing with imprecise, incomplete, and inconsistent data. Rough set methods generally work better on discretized or binarized data, yet most real-life data sets contain continuous attributes as well as discrete ones. In this paper, we propose SMD, a supervised and multivariate discretization algorithm for rough sets. SMD uses both class information and relations between attributes to determine the discretization scheme. To evaluate SMD, we ran it on real-life data sets obtained from the UCI Machine Learning Repository. The experimental results show that the algorithm is effective, and its time complexity is relatively low compared with existing multivariate discretization algorithms.
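
The abstract does not describe how SMD works, so the following is not the authors' algorithm. It is a minimal sketch of the general idea behind the abstract's terms: a discretization scheme is a set of cut points per continuous attribute, and a common rough-set requirement is that objects with different classes stay distinguishable after discretization. The function names, the toy data, and the cut values are all assumptions made for illustration.

```python
from bisect import bisect_right


def discretize(value, cuts):
    """Return the index of the interval that `value` falls into.

    `cuts` is a sorted list of cut points; an empty list maps every
    value to interval 0.
    """
    return bisect_right(cuts, value)


def is_consistent(rows, labels, cuts_per_attr):
    """Check that no two objects share all discretized values but differ in class."""
    seen = {}
    for row, label in zip(rows, labels):
        key = tuple(discretize(v, cuts) for v, cuts in zip(row, cuts_per_attr))
        # setdefault stores the first label seen for this key and returns it
        if seen.setdefault(key, label) != label:
            return False
    return True


# Hypothetical toy table: the class depends on both attributes jointly.
rows = [(1.0, 5.0), (1.2, 7.0), (3.0, 5.5), (3.1, 7.2)]
labels = ["a", "b", "b", "a"]

only_first = is_consistent(rows, labels, [[2.0], []])
only_second = is_consistent(rows, labels, [[], [6.0]])
both = is_consistent(rows, labels, [[2.0], [6.0]])
```

In this toy table, cutting either attribute alone leaves objects of different classes indistinguishable, while cutting both keeps the table consistent. This is one way to see why a scheme chosen one attribute at a time can miss structure that a multivariate method takes into account.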

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Jiang, F., Zhao, Z., Ge, Y. (2010). A Supervised and Multivariate Discretization Algorithm for Rough Sets. In: Yu, J., Greco, S., Lingras, P., Wang, G., Skowron, A. (eds) Rough Set and Knowledge Technology. RSKT 2010. Lecture Notes in Computer Science, vol 6401. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16248-0_81

  • DOI: https://doi.org/10.1007/978-3-642-16248-0_81

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-16247-3

  • Online ISBN: 978-3-642-16248-0

  • eBook Packages: Computer Science, Computer Science (R0)
