Abstract
The problem of discovering functional dependencies (FDs) from an existing relation instance has received considerable attention in the database research community. To date, even the most efficient solutions have exponential complexity in the number of attributes of the instance. We develop an algorithm, FastFDs, for solving this problem based on a depth-first, heuristic-driven (DFHD) search for finding minimal covers of hypergraphs. The technique of reducing the FD discovery problem to the problem of finding minimal covers of hypergraphs was applied previously by Lopes et al. in the algorithm Dep-Miner. Dep-Miner employs a levelwise search for minimal covers, whereas FastFDs uses DFHD search. We report several tests on distinct benchmark relation instances involving Dep-Miner, FastFDs, and Tane. Our experimental results indicate that DFHD search is more efficient than Dep-Miner’s levelwise search or Tane’s partitioning approach for many of these benchmark instances.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Agrawal, Rakesh; Mannila, Heikki; Srikant, Ramakrishnan; Toivonen, Hannu and Verkamo, A.I. “Fast Discovery of Association Rules.” Advances in KDD, AAA, Press, Menlo Park, CA, pg. 307–328, 1996.
Demetrovics, J; Katona, G; Miklos, D; Seleznjev, O. and Thalheim, B. “The Average Length of Keys and Functional Dependencies in (Random) Databases.” Lecture Notes in Computer Science, vol. 893, 1995.
Eiter, Thomas and Gottlob, Goerg. “Identifying the Minimal Traversals of a Hypergraph and Related Problems.” SIAM Journal of Computing, vol. 24,no. 6, pg. 1278–1304, 1995.
Flach, Peter and Savnik, Iztok. “Database Dependency Discovery: a Machine Learning Approach.” AI Comm. vol. 12,no. 3, pg 139–160.
Gunopulos, Dimitrios; Khardon, Roni; Mannila, Heikki; and Toivonen, Hannu. “Data Mining, Hypergraph Traversals, and Machine Learning (extended abstract)”, PODS, 1997, pg 209–216.
Huhtala, Ykä; Kärkkäinen, Juha; Porkka, Pasi and Toivonen, Hannu. “TANE: An Efficient Algorithm for Discovering Functional and Approximate Dependencies.” The Computer Journal, vol. 42,no. 2, 1999.
Kantola, Martti; Mannila, Heikki; Räihä, Kari-Jouko and Siirtola, Harri. “Discovering Functional and Inclusion Dependencies in Relational Databases.” Journal of Intelligent Systems, vol. 7, pg. 591–607, 1992.
Lopes, Stephane; Petit, Jean-Marc and Lakhal, Lotfi. “Efficient Discovery of Functional Dependencies and Armstrong Relations.” Proceedings of ECDT 2000. Lecture Notes in Computer Science, vol 1777.
Mannila, Heikki and Räihä, Kari-Jouko. “Dependency Inference (Extended Abstract)”, Proceedings of the Very Large Databases Conference (VLDB), Brighton, pg. 155–158, 1987.
Mannila, Heikki and Räihä, Kari-Jouko. “Algorithms for Inferring Functional Dependencies from Relations”, Data & Knowledge Engineering, 12, pg. 83–99, 1994.
Merz, C.J. and Murphy, P.M. UCI Machine Learning databases (1996). http://www.ics.uci.edu/~mlearn/MLRepository.html. Irvine, CA: University of California, Department of Information and Comp. Sci.
The Tane and Tane/mem source code is available on the web at http://http://www.cs.helsinki.fi/research/fdk/datamining/tane
Wyss, C; Giannella, C; and Robertson E. “FastFDs: A Heuristic-Driven, Depth-First Algorithm for Mining Functional Dependencies from Relation Instances”, Technical Report, Dept. of Comp. Sci, Indiana University, May 2001.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wyss, C., Giannella, C., Robertson, E. (2001). FastFDs: A Heuristic-Driven, Depth-First Algorithm for Mining Functional Dependencies from Relation Instances Extended Abstract. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2001. Lecture Notes in Computer Science, vol 2114. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44801-2_11
Download citation
DOI: https://doi.org/10.1007/3-540-44801-2_11
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42553-3
Online ISBN: 978-3-540-44801-3
eBook Packages: Springer Book Archive