Abstract
This paper proposes an efficient method for data mining of generalized association rules on the basis of partial-match retrieval. A generalized association rule is derived from regularities of data patterns, which are found in the database under a given data hierarchy with enough frequencies. The pattern search is a central part of data mining of this type and occupies most of the running time. In this paper, we regard a data pattern as a partial-match query in partial-match retrieval then the pattern search becomes a problem to find partial-match queries of which answers include sufficient number of database records. The proposed method consists of a selective enumeration of candidate queries and an efficient partial-match retrieval using signatures. A signature, which is a bit sequence of fixed length, is associated with data, a record and a query. The answer for a query is fast computed by bit operations among the signatures. The proposed data mining method is realized based on an extended signature method that can deal with a data hierarchy. We also discuss design issues and mathematical properties of the method.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of items in large databases. In Proceedings of the 1993 International Conference on Management of Data. ACM SIGMOD, 1993.
R. Agrawal and R. Srikant. Fast algorithms for mining association rules. In Proceedings of the Twenty-first International Conference on Very Large Data Base, 1994.
R. F. Feldman, Y. Aumann, A. Amir, et al. Maximal association rules: a new tool for mining for keywords occurrences in document collections. In Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, 1997.
C. Faloutsos. Access methods for text. ACM Computing Surveys, 17(1), 1985.
W. B. Frakes and R. Baeza-Yates, editors. Information Retrieval, Data Structures & Algorithms. Prentice Hall, 1992.
R. F. Feldman and H. Hirsh. Mining associations text in the presence of background knowledge. In Proceedings of the Second International Conference on Knowledge Discovery, 1996.
J. Han and Y. Fu. Discovery of multiple-level association rules from large databases. In Proceedings of the Twenty-first International Conference on Very Large Data Base, 1995.
H. Mannila, H. Toivonen, and A. Verkamo. Efficient algorithms for discovering association rules. In Proceedings of the AAAI Workshop on Knowledge Discovery in Databases, 1994.
R. Srikant and R. Agrawal. Mining generalized association rules. In Proceedings of the Twenty-first International Conference on Very Large Data Base, 1994.
S. Sawaragi, S. Thomas, and R. Agrawal. Integrating association rule mining with relational database systems: Alternatives and implications. In Proceedings of the 1998 International Conference on Management of Data. ACM SIGMOD, 1998.
D. Tsur, J. D. Ullman, S. Abiteboul, et al. Query flocks: A generalization of association-rule mining. In Proceedings of the 1998 International Conference on Management of Data. ACM SIGMOD, 1998.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Matsumoto, K., Hayase, T., Ikeda, N. (1999). Data Mining of Generalized Association Rules Using a Method of Partial-Match Retrieval. In: Arikawa, S., Furukawa, K. (eds) Discovery Science. DS 1999. Lecture Notes in Computer Science(), vol 1721. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46846-3_15
Download citation
DOI: https://doi.org/10.1007/3-540-46846-3_15
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66713-1
Online ISBN: 978-3-540-46846-2
eBook Packages: Springer Book Archive