Abstract.
We introduce purity dependencies as generalizations of functional dependencies in relational databases starting from the notion of impurity measure. The impurity measure of a subset of a set relative to a partition of that set and the relative impurity of two partitions allow us to define the relative impurity of two attribute sets of a table of a relational database and to introduce purity dependencies. We discuss properties of these dependencies that generalize similar properties of functional dependencies and we highlight their relevance for approximate classifications. Finally, an algorithm that mines datasets for these dependencies is presented.
Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.Author information
Authors and Affiliations
Additional information
Received: 4 July 2000 / 16 November 2001
Rights and permissions
About this article
Cite this article
Simovici, D., Cristofor, D. & Cristofor, L. Impurity measures in databases. Acta Informatica 38, 307–324 (2002). https://doi.org/10.1007/s002360100078
Issue Date:
DOI: https://doi.org/10.1007/s002360100078