Rough Sets and Functional Dependencies in Data: Foundations of Association Reducts

Ślęzak, Dominik

doi:10.1007/978-3-642-02097-1_10

Rough Sets and Functional Dependencies in Data: Foundations of Association Reducts

Dominik Ślęzak²⁰

Chapter

494 Accesses
18 Citations

Part of the book series: Lecture Notes in Computer Science ((TCOMPUTATSCIE,volume 5540))

Abstract

We investigate the notion of an association reduct. Association reducts represent data-based functional dependencies between the sets of attributes, where it is preferred that possibly smallest sets determine possibly largest sets. We compare the notions of an association reduct to other types of reducts previously studied within the theory of rough sets. We focus particularly on modeling inexactness of dependencies, which is crucial for many real-life data applications. We also study the optimization problems and algorithms that aim at searching for the most interesting approximate association reducts in data.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agrawal, R., Imieliński, T., Swami, A.N.: Mining Association Rules between Sets of Items in Large Databases. In: Proc. of SIGMOD 1993, Washington, DC, May 26–28, pp. 207–216 (1993)
Google Scholar
Armstrong, W.W.: Dependency Structures of Database Relationships. Inform. Process. 74, 580–583 (1974)
Google Scholar
Bazan, J.G., Nguyen, H.S., Nguyen, S.H., Synak, P., Wróblewski, J.: Rough Set Algorithms in Classification Problem. In: Rough Set Methods and Applications. New Developments in Knowledge Discovery in Information Systems. Studies in Fuzziness and Soft Computing, vol. 56, pp. 49–88. Physica Verlag (2000)
Google Scholar
Bertet, K., Monjardet, B.: The multiple facets of the canonical direct unit implicational basis. Theoretical Computer Science (2009) (to appear)
Google Scholar
Brown, E.M.: Boolean reasoning. Kluwer, Dordrecht (1990)
Book MATH Google Scholar
Burris, S.N., Sankappanavar, H.P.: A Course in Universal Algebra. Springer, Heidelberg (1981)
Book MATH Google Scholar
Ceglar, A., Roddick, J.F.: Association mining. ACM Comput. Surv. 38(2) (2006)
Google Scholar
Davis, L. (ed.): Handbook of Genetic Algorithms. Van Nostrand Reinhold (1991)
Google Scholar
Duentsch, I., Gediga, G.: Uncertainty Measures of Rough Set Prediction. Artif. Intell. 106(1), 109–137 (1998)
Article MathSciNet MATH Google Scholar
Gallager, R.G.: Information Theory and Reliable Communication. Wiley, Chichester (1968)
MATH Google Scholar
Ganter, B., Wille, R.: Formal Concept Analysis: Mathematical Foundations. Springer, Heidelberg (1998)
MATH Google Scholar
Garey, M.R., Johnson, D.S.: Computers and Intractability: A Guide to The Theory of NP-Completeness. Freeman and Company, New York (1979)
MATH Google Scholar
Grużdź, A., Ihnatowicz, A., Ślęzak, D.: Interactive Gene Clustering – A Case Study of Breast Cancer Microarray Data. Information Systems Frontiers 8(1), 21–27 (2006)
Article Google Scholar
Hajek, P., Havranek, T.: Mechanizing Hypothesis Formation: Mathematical Foundations for a General Theory. Springer, Heidelberg (1978)
Book MATH Google Scholar
Kloesgen, W., Żytkow, J.M. (eds.): Handbook of Data Mining and Knowledge Discovery. Oxford University Press, Oxford (2002)
Google Scholar
Kuncheva, L.I.: Combining Pattern Classifiers: Methods and Algorithms. Wiley, Chichester (2004)
Book MATH Google Scholar
Liu, H., Motoda, H. (eds.): Computational Methods of Feature Selection. Chapman & Hall, Boca Raton (2008)
Google Scholar
McKinney, B.A., Reif, D.M., Ritchie, M.D., Moore, J.H.: Machine Learning for Detecting Gene-Gene Interactions: A Review. Applied Bioinformatics 5(2), 77–88 (2006)
Article Google Scholar
Moshkov, M., Piliszczuk, M., Zielosko, B.: On Construction of Partial Reducts and Irreducible Partial Decision Rules. Fundam. Inform. 75(1–4), 357–374 (2007)
MathSciNet MATH Google Scholar
Nguyen, H.S.: Approximate Boolean Reasoning: Foundations and Applications in Data Mining. In: Peters, J.F., Skowron, A. (eds.) Transactions on Rough Sets V. LNCS, vol. 4100, pp. 334–506. Springer, Heidelberg (2006)
Chapter Google Scholar
Nguyen, H.S., Nguyen, S.H.: Rough Sets and Association Rule Generation. Fundamenta Informaticae 40(4), 310–318 (1999)
MathSciNet MATH Google Scholar
Pawlak, Z.: Information systems theoretical foundations. Inf. Syst. 6(3), 205–218 (1981)
Article MATH Google Scholar
Pawlak, Z.: Rough sets – Theoretical aspects of reasoning about data. Kluwer, Dordrecht (1991)
MATH Google Scholar
Pawlak, Z.: Rough set elements. In: Rough Sets in Knowledge Discovery 1 – Methodology and Applications. Studies in Fuzziness and Soft Computing, vol. 18, pp. 10–30. Physica Verlag (1998)
Google Scholar
Pawlak, Z., Skowron, A.: Rudiments of rough sets. Information Sciences 177(1), 3–27 (2007)
Article MathSciNet MATH Google Scholar
Skowron, A., Rauszer, C.: The discernibility matrices and functions in information systems. In: Intelligent Decision Support. Handbook of Applications and Advances of the Rough Set Theory, pp. 311–362. Kluwer, Dordrecht (1992)
Google Scholar
Ślęzak, D.: Approximate reducts in decision tables. In: Proc. of IPMU 1996, Granada, Spain, July 1–5, vol. 3, pp. 1159–1164 (1996)
Google Scholar
Ślęzak, D.: Various Approaches to Reasoning with Frequency Based Decision Reducts. In: Rough Set Methods and Applications. New Developments in Knowledge Discovery in Information Systems. Studies in Fuzziness and Soft Computing, vol. 56, pp. 235–288. Physica Verlag (2000)
Google Scholar
Ślęzak, D.: Approximate Entropy Reducts. Fundamenta Informaticae 53(3–4), 365–390 (2002)
MathSciNet MATH Google Scholar
Ślęzak, D.: Association Reducts: A Framework for Mining Multi-Attribute Dependencies. In: Hacid, M.-S., Murray, N.V., Raś, Z.W., Tsumoto, S. (eds.) ISMIS 2005. LNCS, vol. 3488, pp. 354–363. Springer, Heidelberg (2005)
Chapter Google Scholar
Ślęzak, D.: Association Reducts: Boolean Representation. In: Wang, G.-Y., Peters, J.F., Skowron, A., Yao, Y. (eds.) RSKT 2006. LNCS, vol. 4062, pp. 305–312. Springer, Heidelberg (2006)
Chapter Google Scholar
Ślęzak, D.: Association Reducts: Complexity and Heuristics. In: Greco, S., Hata, Y., Hirano, S., Inuiguchi, M., Miyamoto, S., Nguyen, H.S., Słowiński, R. (eds.) RSCTC 2006. LNCS, vol. 4259, pp. 157–164. Springer, Heidelberg (2006)
Chapter Google Scholar
Ślęzak, D.: Rough Sets and Few-Objects-Many-Attributes Problem – The Case Study of Analysis of Gene Expression Data Sets. In: Proc. of FBIT 2007, Jeju, Korea, October 11–13, pp. 437–440 (2007)
Google Scholar
Ślęzak, D.: Degrees of conditional (in)dependence: A framework for approximate Bayesian networks and examples related to the rough set-based feature selection. Information Sciences 179(3), 197–209 (2009)
Article MathSciNet MATH Google Scholar
Ślęzak, D., Wróblewski, J., Eastwood, V., Synak, P.: Brighthouse: an analytic data warehouse for ad-hoc queries. PVLDB 1(2), 1337–1345 (2008)
Google Scholar
Suraj, Z.: Discovery of Concurrent Data Models from Experimental Tables: A Rough Set Approach. Fundam. Inform. 28(3–4), 353–376, 379–490 (1996)
MathSciNet MATH Google Scholar
Suraj, Z.: Rough Set Method for Synthesis and Analysis of Concurrent Processes. In: Rough Set Methods and Applications. New Developments in Knowledge Discovery in Information Systems. Studies in Fuzziness and Soft Computing, vol. 56. Physica Verlag (2000)
Google Scholar
Świniarski, R.W., Skowron, A.: Rough set methods in feature selection and recognition. Pattern Recognition Letters 24(6), 833–849 (2003)
Article MATH Google Scholar
Ullman, J.D., Garcia-Molina, H., Widom, J.: Database Systems: The Complete Book. Prentice Hall, Englewood Cliffs (2001)
Google Scholar
Wróblewski, J.: Theoretical Foundations of Order-Based Genetic Algorithms. Fundamenta Informaticae 28(3–4), 423–430 (1996)
MathSciNet MATH Google Scholar
Wróblewski, J.: Ensembles of classifiers based on approximate reducts. Fundamenta Informaticae 47(3–4), 351–360 (2001)
MathSciNet MATH Google Scholar
Yao, Y.Y., Zhao, Y., Wang, J.: On Reduct Construction Algorithms. Transactions on Computational Science 2, 100–117 (2008)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Infobright Inc., Canada/Poland, Poland
Dominik Ślęzak

Authors

Dominik Ślęzak
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Calgary, Department of Computer Science, 2500 University Drive N.W, T2N 1N4, Calgary, AB, Canada
Marina L. Gavrilova
OptimaNumerics Ltd., Cathedral House, 23-31 Waring Street, BT1 2DX, Belfast, UK
C. J. Kenneth Tan
Theoretical and Empirical Software Engineering Research Centre (TESERC), University of Calgary, 2500 University Drive, NW, T2N 1N4, Calgary, Alberta, Canada
Yingxu Wang
Department of Computing, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong
Keith C. C. Chan

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Ślęzak, D. (2009). Rough Sets and Functional Dependencies in Data: Foundations of Association Reducts. In: Gavrilova, M.L., Tan, C.J.K., Wang, Y., Chan, K.C.C. (eds) Transactions on Computational Science V. Lecture Notes in Computer Science, vol 5540. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02097-1_10

Download citation

DOI: https://doi.org/10.1007/978-3-642-02097-1_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02096-4
Online ISBN: 978-3-642-02097-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics