Abstract
We investigate the notion of an association reduct. Association reducts represent data-based functional dependencies between the sets of attributes, where it is preferred that possibly smallest sets determine possibly largest sets. We compare the notions of an association reduct to other types of reducts previously studied within the theory of rough sets. We focus particularly on modeling inexactness of dependencies, which is crucial for many real-life data applications. We also study the optimization problems and algorithms that aim at searching for the most interesting approximate association reducts in data.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Imieliński, T., Swami, A.N.: Mining Association Rules between Sets of Items in Large Databases. In: Proc. of SIGMOD 1993, Washington, DC, May 26–28, pp. 207–216 (1993)
Armstrong, W.W.: Dependency Structures of Database Relationships. Inform. Process. 74, 580–583 (1974)
Bazan, J.G., Nguyen, H.S., Nguyen, S.H., Synak, P., Wróblewski, J.: Rough Set Algorithms in Classification Problem. In: Rough Set Methods and Applications. New Developments in Knowledge Discovery in Information Systems. Studies in Fuzziness and Soft Computing, vol. 56, pp. 49–88. Physica Verlag (2000)
Bertet, K., Monjardet, B.: The multiple facets of the canonical direct unit implicational basis. Theoretical Computer Science (2009) (to appear)
Brown, E.M.: Boolean reasoning. Kluwer, Dordrecht (1990)
Burris, S.N., Sankappanavar, H.P.: A Course in Universal Algebra. Springer, Heidelberg (1981)
Ceglar, A., Roddick, J.F.: Association mining. ACM Comput. Surv. 38(2) (2006)
Davis, L. (ed.): Handbook of Genetic Algorithms. Van Nostrand Reinhold (1991)
Duentsch, I., Gediga, G.: Uncertainty Measures of Rough Set Prediction. Artif. Intell. 106(1), 109–137 (1998)
Gallager, R.G.: Information Theory and Reliable Communication. Wiley, Chichester (1968)
Ganter, B., Wille, R.: Formal Concept Analysis: Mathematical Foundations. Springer, Heidelberg (1998)
Garey, M.R., Johnson, D.S.: Computers and Intractability: A Guide to The Theory of NP-Completeness. Freeman and Company, New York (1979)
Grużdź, A., Ihnatowicz, A., Ślęzak, D.: Interactive Gene Clustering – A Case Study of Breast Cancer Microarray Data. Information Systems Frontiers 8(1), 21–27 (2006)
Hajek, P., Havranek, T.: Mechanizing Hypothesis Formation: Mathematical Foundations for a General Theory. Springer, Heidelberg (1978)
Kloesgen, W., Żytkow, J.M. (eds.): Handbook of Data Mining and Knowledge Discovery. Oxford University Press, Oxford (2002)
Kuncheva, L.I.: Combining Pattern Classifiers: Methods and Algorithms. Wiley, Chichester (2004)
Liu, H., Motoda, H. (eds.): Computational Methods of Feature Selection. Chapman & Hall, Boca Raton (2008)
McKinney, B.A., Reif, D.M., Ritchie, M.D., Moore, J.H.: Machine Learning for Detecting Gene-Gene Interactions: A Review. Applied Bioinformatics 5(2), 77–88 (2006)
Moshkov, M., Piliszczuk, M., Zielosko, B.: On Construction of Partial Reducts and Irreducible Partial Decision Rules. Fundam. Inform. 75(1–4), 357–374 (2007)
Nguyen, H.S.: Approximate Boolean Reasoning: Foundations and Applications in Data Mining. In: Peters, J.F., Skowron, A. (eds.) Transactions on Rough Sets V. LNCS, vol. 4100, pp. 334–506. Springer, Heidelberg (2006)
Nguyen, H.S., Nguyen, S.H.: Rough Sets and Association Rule Generation. Fundamenta Informaticae 40(4), 310–318 (1999)
Pawlak, Z.: Information systems theoretical foundations. Inf. Syst. 6(3), 205–218 (1981)
Pawlak, Z.: Rough sets – Theoretical aspects of reasoning about data. Kluwer, Dordrecht (1991)
Pawlak, Z.: Rough set elements. In: Rough Sets in Knowledge Discovery 1 – Methodology and Applications. Studies in Fuzziness and Soft Computing, vol. 18, pp. 10–30. Physica Verlag (1998)
Pawlak, Z., Skowron, A.: Rudiments of rough sets. Information Sciences 177(1), 3–27 (2007)
Skowron, A., Rauszer, C.: The discernibility matrices and functions in information systems. In: Intelligent Decision Support. Handbook of Applications and Advances of the Rough Set Theory, pp. 311–362. Kluwer, Dordrecht (1992)
Ślęzak, D.: Approximate reducts in decision tables. In: Proc. of IPMU 1996, Granada, Spain, July 1–5, vol. 3, pp. 1159–1164 (1996)
Ślęzak, D.: Various Approaches to Reasoning with Frequency Based Decision Reducts. In: Rough Set Methods and Applications. New Developments in Knowledge Discovery in Information Systems. Studies in Fuzziness and Soft Computing, vol. 56, pp. 235–288. Physica Verlag (2000)
Ślęzak, D.: Approximate Entropy Reducts. Fundamenta Informaticae 53(3–4), 365–390 (2002)
Ślęzak, D.: Association Reducts: A Framework for Mining Multi-Attribute Dependencies. In: Hacid, M.-S., Murray, N.V., Raś, Z.W., Tsumoto, S. (eds.) ISMIS 2005. LNCS, vol. 3488, pp. 354–363. Springer, Heidelberg (2005)
Ślęzak, D.: Association Reducts: Boolean Representation. In: Wang, G.-Y., Peters, J.F., Skowron, A., Yao, Y. (eds.) RSKT 2006. LNCS, vol. 4062, pp. 305–312. Springer, Heidelberg (2006)
Ślęzak, D.: Association Reducts: Complexity and Heuristics. In: Greco, S., Hata, Y., Hirano, S., Inuiguchi, M., Miyamoto, S., Nguyen, H.S., Słowiński, R. (eds.) RSCTC 2006. LNCS, vol. 4259, pp. 157–164. Springer, Heidelberg (2006)
Ślęzak, D.: Rough Sets and Few-Objects-Many-Attributes Problem – The Case Study of Analysis of Gene Expression Data Sets. In: Proc. of FBIT 2007, Jeju, Korea, October 11–13, pp. 437–440 (2007)
Ślęzak, D.: Degrees of conditional (in)dependence: A framework for approximate Bayesian networks and examples related to the rough set-based feature selection. Information Sciences 179(3), 197–209 (2009)
Ślęzak, D., Wróblewski, J., Eastwood, V., Synak, P.: Brighthouse: an analytic data warehouse for ad-hoc queries. PVLDB 1(2), 1337–1345 (2008)
Suraj, Z.: Discovery of Concurrent Data Models from Experimental Tables: A Rough Set Approach. Fundam. Inform. 28(3–4), 353–376, 379–490 (1996)
Suraj, Z.: Rough Set Method for Synthesis and Analysis of Concurrent Processes. In: Rough Set Methods and Applications. New Developments in Knowledge Discovery in Information Systems. Studies in Fuzziness and Soft Computing, vol. 56. Physica Verlag (2000)
Świniarski, R.W., Skowron, A.: Rough set methods in feature selection and recognition. Pattern Recognition Letters 24(6), 833–849 (2003)
Ullman, J.D., Garcia-Molina, H., Widom, J.: Database Systems: The Complete Book. Prentice Hall, Englewood Cliffs (2001)
Wróblewski, J.: Theoretical Foundations of Order-Based Genetic Algorithms. Fundamenta Informaticae 28(3–4), 423–430 (1996)
Wróblewski, J.: Ensembles of classifiers based on approximate reducts. Fundamenta Informaticae 47(3–4), 351–360 (2001)
Yao, Y.Y., Zhao, Y., Wang, J.: On Reduct Construction Algorithms. Transactions on Computational Science 2, 100–117 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Ślęzak, D. (2009). Rough Sets and Functional Dependencies in Data: Foundations of Association Reducts. In: Gavrilova, M.L., Tan, C.J.K., Wang, Y., Chan, K.C.C. (eds) Transactions on Computational Science V. Lecture Notes in Computer Science, vol 5540. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02097-1_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-02097-1_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02096-4
Online ISBN: 978-3-642-02097-1
eBook Packages: Computer ScienceComputer Science (R0)