PRE and Variable Precision Models in Rough Set Data Analysis

Düntsch, Ivo; Gediga, Günther

doi:10.1007/978-3-662-47815-8_2

Ivo Düntsch²⁰ &
Günther Gediga²¹

Part of the book series: Lecture Notes in Computer Science ((TRS,volume 8988))

469 Accesses
2 Citations

Abstract

We present a parameter free and monotonic alternative to the parametric variable precision model of rough set data analysis. The proposed model is based on the well known PRE index \(\lambda \) of Goodman and Kruskal. Using a weighted \(\lambda \) model it is possible to define a two dimensional space based on (Rough) sensitivity and (Rough) specificity, for which the monotonicity of sensitivity in a chain of sets is a nice feature of the model. As specificity is often monotone as well, the results of a rough set analysis can be displayed like a receiver operation curve (ROC) in statistics. Another aspect deals with the precision of the prediction of categories – normally measured by an index \(\alpha \) in classical rough set data analysis. We offer a statistical theory for \(\alpha \) and a modification of \(\alpha \) which fits the needs of our proposed model. Furthermore, we show how expert knowledge can be integrated without losing the monotonic property of the index. Based on a weighted \(\lambda \), we present a polynomial algorithm to determine an approximately optimal set of predicting attributes. Finally, we exhibit a connection to Bayesian analysis. We present several simulation studies for the presented concepts. The current paper is an extended version of [1].

Ordering of authors is alphabetical, and equal authorship is implied

Ivo Düntsch– gratefully acknowledges support by the Natural Sciences and Engineering Research Council of Canada.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 16.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
For other views of Bayes’ Theorem and its connection to rough sets see e.g. [18–20].

References

Düntsch, I., Gediga, G.: Weighted \(\lambda \) precision models in rough set data analysis. In: Proceedings of the Federated Conference on Computer Science and Information Systems, pp. 309–316. IEEE, Wrocław, Poland (2012)
Google Scholar
Pawlak, Z.: Rough sets. Int. J. Comput. Inform. Sci. 11, 341–356 (1982)
Article MathSciNet MATH Google Scholar
Ziarko, W.: Variable precision rough set model. J. Comput. Syst. Sci. 46, 39–59 (1993)
Article MathSciNet MATH Google Scholar
Gediga, G., Düntsch, I.: Rough approximation quality revisited. Artif. Intell. 132, 219–234 (2001)
Article MATH Google Scholar
Beynon, M.: Reducts within the variable precision rough sets model: a further investigation. Eur. J. Oper. Res. 134, 592–605 (2001)
Article MATH Google Scholar
Zytkow, J.M.: Granularity refined by knowledge: contingency tables and rough sets as tools of discovery. In: Dasarathy, B. (ed.) Proceedings of SPIE 4057, Data Mining and Knowledge Discovery: Theory, Tools, and Technology II., pp. 82–91 (2000)
Google Scholar
Hildebrand, D., Laing, J., Rosenthal, H.: Prediction logic and quasi-independence in empirical evaluation of formal theory. J. Math. Sociol. 3, 197–209 (1974)
Article MATH Google Scholar
Hildebrand, D., Laing, J., Rosenthal, H.: Prediction Analysis of Cross Classification. Wiley, New York (1977)
Google Scholar
Goodman, L.A., Kruskal, W.H.: Measures of association for cross classification. J. Am. Stat. Assoc. 49, 732–764 (1954)
MATH Google Scholar
Holte, R.C.: Very simple classification rules perform well on most commonly used datasets. Mach. Learn. 11, 63–90 (1993)
Article MATH Google Scholar
Wu, S., Flach, P.A.: Feature selection with labelled and unlabelled data. In: Bohanec, M., Kasek, B., Lavrac, N., Mladenic, D. (eds.) ECML/PKDD 2002 workshop on Integration and Collaboration Aspects of Data Mining, pp. 156–167. University of Helsinki (August, Decision Support and Meta-Learning (2002)
Google Scholar
Nevill-Manning, C.G., Holmes, G., Witten, I.H.: The development of Holte’s 1R classifier. In: Proceedings of the 2nd New Zealand Two-Stream International Conference on Artificial Neural Networks and Expert Systems. ANNES 1995, pp. 239–246. IEEE Computer Society, Washington, DC, USA(1995)
Google Scholar
Düntsch, I., Gediga, G.: Simple data filtering in rough set systems. Int. J. Approx. Reason. 18(1–2), 93–106 (1998)
Article MATH Google Scholar
Youden, W.: Index for rating diagnostic tests. Cancer 3, 32–35 (1950)
Article Google Scholar
Böhning, D., Böhning, W., Holling, H.: Revisiting youden’s index as a useful measure of the misclassification error in meta-analysis of diagnostic studies. Stat. Methods Med. Res. 17, 543–554 (2008)
Article MathSciNet Google Scholar
Oehlert, G.: A note on the Delta method. Am. Stat. 46, 27–29 (1992)
MathSciNet Google Scholar
Chen, C.B., Wang, L.Y.: Rough set based clustering with refinement using Shannon’s entropy theory. Comput. Math. Appl. 52, 1563–1576 (2006)
Article MathSciNet Google Scholar
Pawlak, Z.: A rough set view on Bayes’ theorem. Int. J. Intell. Syst. 18, 487–498 (2003)
Article Google Scholar
Ślȩzak, D.: Rough sets and Bayes factor. In: Peters, J.F., Skowron, A. (eds.) Transactions on Rough Sets III. LNCS, vol. 3400, pp. 202–229. Springer, Heidelberg (2005)
Chapter Google Scholar
Yao, Y.: Probabilistic rough set approximations. Int. J. Approx. Reason. 49(2), 255–271 (2008)
Article MATH Google Scholar
Wille, R.: Restructuring lattice theory: an approach based on hierarchies of concepts. In: Rival, I. (ed.) Ordered Sets. NATO Advanced Studies Institute, vol. 83, pp. 445–470. Springer, Reidel, Dordrecht (1982)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Brock University, St. Catharines, Ontario, L2S 3A1, Canada
Ivo Düntsch
Department of Psychology, Institut IV, Universität Münster, Fliednerstr. 21, Münster, Germany
Günther Gediga

Authors

Ivo Düntsch
View author publications
You can also search for this author in PubMed Google Scholar
Günther Gediga
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ivo Düntsch .

Editor information

Editors and Affiliations

Electrical and Computer Engineering, University of Manitoba, Winnipeg, Manitoba, Canada
James F. Peters
University of Warsaw, Warsaw, Poland
Andrzej Skowron
University of Warsaw, Warsaw, Poland
Dominik Ślȩzak
University of Warsaw, Warsaw, Poland
Hung Son Nguyen
University of Rzeszów, Rzeszów, Poland
Jan G. Bazan

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Düntsch, I., Gediga, G. (2015). PRE and Variable Precision Models in Rough Set Data Analysis. In: Peters, J., Skowron, A., Ślȩzak, D., Nguyen, H., Bazan, J. (eds) Transactions on Rough Sets XIX. Lecture Notes in Computer Science(), vol 8988. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-47815-8_2

Download citation

DOI: https://doi.org/10.1007/978-3-662-47815-8_2
Published: 05 July 2015
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-47814-1
Online ISBN: 978-3-662-47815-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics