ABSTRACT
Dealing with very large databases is one of the defining challenges in data mining research and development. When a data base is not a static repository of data, or if the data come from different data sources and putting all data together might amass a huge database for centralized processing, knowledge discovery in such data environments cannot be a one-time process. Existing techniques include data sampling, windowing, bagging, boosting, batch learning, hierarchical meta-learning, and parallel and distributed data mining. This talk will provide a review on these techniques, and present our own recent research efforts on multi-layer induction and synthesizing association rules from different data sources.
- Knowledge discovery in very large databases
Recommendations
Mining concept associations for knowledge discovery in large textual databases
SAC '05: Proceedings of the 2005 ACM symposium on Applied computingIn this paper, we describe a new approach for mining concept associations from large text collections. The concepts are short sequences of words that occur frequently together across the text collections. It is these concepts that convey most of the ...
Discovery of Direct and Indirect Association Patterns in Large Transaction Databases
CIS '07: Proceedings of the 2007 International Conference on Computational Intelligence and SecurityAssociation rules mining is one of the important tasks in data mining research. While most of the existing discovery algorithms are dedicated to efficiently mining of frequent patterns, it has been noted recently that some of the infrequent patterns can ...
Comments